Query lcl|NC_021324.1_cdsid_YP_008058909.1 [gene=M185_gp70] [protein=portal protein] [protein_id=YP_008058909.1] [location=complement(42694..44136)] Match_columns 480 No_of_seqs 130 out of 503 Neff 9.7 Searched_HMMs 1612 Date Thu Nov 7 17:18:12 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_83 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_83_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78537 Length: 480 100.0 1E-117 6E-121 662.0 54.5 480 1-480 1-480 (480) 2 protein:vir:78227 Length: 480 100.0 1E-117 8E-121 661.4 54.3 480 1-480 1-480 (480) 3 protein:vir:4223 Length: 486 # 100.0 1E-103 8E-107 584.7 53.8 470 1-476 11-486 (486) 4 protein:vir:2427 Length: 485 # 100.0 6E-103 4E-106 581.0 52.8 470 1-477 11-485 (485) 5 protein:vir:104082 Length: 485 100.0 4E-102 2E-105 576.7 53.6 467 1-477 11-485 (485) 6 protein:vir:7768 Length: 484 # 100.0 1E-101 8E-105 573.8 52.8 470 1-479 10-484 (484) 7 protein:vir:2341 Length: 488 # 100.0 4E-100 3E-103 565.5 52.6 463 1-474 7-488 (488) 8 protein:vir:99916 Length: 504 100.0 1.2E-96 7E-100 546.5 51.2 465 1-480 18-499 (504) 9 protein:vir:80680 Length: 441 100.0 3.8E-95 2.4E-98 538.2 49.2 434 1-455 1-441 (441) 10 protein:vir:8184 Length: 474 # 100.0 3.9E-95 2.4E-98 538.2 47.7 445 1-455 12-474 (474) 11 protein:vir:2500 Length: 501 # 100.0 3.3E-92 2E-95 522.1 50.2 455 1-475 23-501 (501) 12 protein:vir:9568 Length: 410 # 100.0 6.6E-90 4.1E-93 509.5 41.2 399 15-438 1-410 (410) 13 protein:vir:99072 Length: 479 100.0 1.3E-87 8.2E-91 496.9 50.0 460 1-479 9-479 (479) 14 protein:vir:94742 Length: 409 100.0 6.6E-89 4.1E-92 504.0 42.0 399 1-425 1-409 (409) 15 protein:vir:1634 Length: 409 # 100.0 2.2E-88 1.4E-91 501.2 41.6 399 1-425 1-409 (409) 16 protein:vir:102602 Length: 456 100.0 2E-85 1.3E-88 484.9 48.2 432 1-462 1-456 (456) 17 protein:vir:105819 Length: 456 100.0 2E-85 1.3E-88 484.9 48.2 432 1-462 1-456 (456) 18 protein:vir:9751 Length: 422 # 100.0 2.6E-86 1.6E-89 489.8 43.1 412 1-439 1-422 (422) 19 protein:vir:7987 Length: 456 # 100.0 7.3E-85 4.5E-88 481.8 48.7 433 1-462 1-456 (456) 20 protein:vir:106571 Length: 499 100.0 2.6E-82 1.6E-85 467.9 49.7 460 1-480 13-497 (499) 21 protein:vir:96494 Length: 501 100.0 1.4E-82 8.7E-86 469.3 45.9 443 1-472 37-501 (501) 22 protein:vir:2732 Length: 501 # 100.0 5.3E-82 3.3E-85 466.2 46.6 439 1-468 37-501 (501) 23 protein:vir:98444 Length: 434 100.0 1.1E-81 6.6E-85 464.5 46.6 416 37-473 1-434 (434) 24 protein:vir:97171 Length: 512 100.0 2.3E-81 1.4E-84 462.6 48.2 447 1-468 31-512 (512) 25 protein:vir:99781 Length: 511 100.0 9E-81 5.6E-84 459.4 49.8 449 1-470 39-511 (511) 26 protein:vir:9306 Length: 511 # 100.0 9.9E-81 6.2E-84 459.2 48.7 447 1-468 31-511 (511) 27 protein:vir:4898 Length: 502 # 100.0 4.1E-81 2.6E-84 461.3 46.5 440 1-480 38-499 (502) 28 protein:vir:78805 Length: 511 100.0 1.3E-80 8.3E-84 458.5 49.0 447 1-468 39-511 (511) 29 protein:vir:96366 Length: 511 100.0 1.3E-80 8.3E-84 458.5 49.0 447 1-468 39-511 (511) 30 protein:vir:3964 Length: 453 # 100.0 5.1E-81 3.2E-84 460.8 46.6 428 1-466 11-453 (453) 31 protein:vir:97336 Length: 492 100.0 4.1E-81 2.6E-84 461.3 45.7 435 1-467 42-492 (492) 32 protein:vir:94805 Length: 492 100.0 4.9E-81 3E-84 460.9 45.4 435 1-470 42-492 (492) 33 protein:vir:103951 Length: 511 100.0 1.8E-80 1.1E-83 457.8 48.4 446 1-468 39-511 (511) 34 protein:vir:1236 Length: 483 # 100.0 2.9E-80 1.8E-83 456.7 46.3 435 1-470 33-483 (483) 35 protein:vir:96240 Length: 511 100.0 2.5E-79 1.5E-82 451.5 48.9 447 1-468 31-511 (511) 36 protein:vir:93747 Length: 472 100.0 6E-80 3.7E-83 454.9 45.5 435 1-470 22-472 (472) 37 protein:vir:733 Length: 453 # 100.0 9.8E-80 6.1E-83 453.7 45.9 429 1-463 11-453 (453) 38 protein:vir:3609 Length: 452 # 100.0 1.7E-79 1E-82 452.5 46.5 428 1-466 17-452 (452) 39 protein:vir:96266 Length: 474 100.0 2.1E-79 1.3E-82 451.9 47.0 434 1-478 25-474 (474) 40 protein:vir:95899 Length: 474 100.0 2.1E-79 1.3E-82 451.9 47.0 434 1-478 25-474 (474) 41 protein:vir:5961 Length: 503 # 100.0 3.6E-79 2.2E-82 450.6 47.7 451 1-479 27-503 (503) 42 protein:vir:9871 Length: 429 # 100.0 3.4E-79 2.1E-82 450.7 45.9 420 1-462 1-429 (429) 43 protein:vir:99522 Length: 470 100.0 1.6E-78 1E-81 447.0 49.1 438 1-468 19-470 (470) 44 protein:vir:97447 Length: 474 100.0 5.9E-79 3.6E-82 449.5 46.5 434 1-467 25-474 (474) 45 protein:vir:94498 Length: 474 100.0 5.9E-79 3.6E-82 449.5 46.5 434 1-467 25-474 (474) 46 protein:vir:105292 Length: 478 100.0 1.5E-78 9.3E-82 447.3 47.2 433 1-466 1-478 (478) 47 protein:vir:102330 Length: 451 100.0 2.5E-78 1.5E-81 446.0 46.2 427 1-453 1-451 (451) 48 protein:vir:107112 Length: 478 100.0 4.9E-78 3E-81 444.4 47.6 433 1-466 24-478 (478) 49 protein:vir:106639 Length: 481 100.0 1.5E-77 9.5E-81 441.7 47.9 437 1-465 29-481 (481) 50 protein:vir:105461 Length: 470 100.0 8.6E-78 5.3E-81 443.1 45.8 431 1-467 3-470 (470) 51 protein:vir:102950 Length: 471 100.0 9.3E-78 5.8E-81 442.9 45.8 428 1-461 1-471 (471) 52 protein:vir:95113 Length: 474 100.0 1.8E-77 1.1E-80 441.4 46.3 436 1-472 25-474 (474) 53 protein:vir:95806 Length: 440 100.0 1.4E-77 8.7E-81 441.9 44.7 423 10-459 1-440 (440) 54 protein:vir:9922 Length: 489 # 100.0 1.2E-76 7.3E-80 436.9 48.6 445 1-471 13-489 (489) 55 protein:vir:96839 Length: 474 100.0 4.8E-77 3E-80 439.0 46.5 430 1-468 24-474 (474) 56 protein:vir:78083 Length: 537 100.0 4.5E-77 2.8E-80 439.2 44.0 449 1-479 1-537 (537) 57 protein:vir:79043 Length: 479 100.0 1.9E-76 1.2E-79 435.7 46.7 431 1-463 18-479 (479) 58 protein:vir:96179 Length: 468 100.0 3.1E-76 1.9E-79 434.6 46.3 423 1-472 24-468 (468) 59 protein:vir:94546 Length: 506 100.0 1.3E-75 8.1E-79 431.1 46.4 443 1-469 16-506 (506) 60 protein:vir:105889 Length: 474 100.0 8.4E-75 5.2E-78 426.7 45.8 429 1-462 14-474 (474) 61 protein:vir:94101 Length: 474 100.0 8.4E-75 5.2E-78 426.7 45.8 429 1-462 14-474 (474) 62 protein:vir:38 Length: 496 # N 100.0 7.2E-55 4.5E-58 317.4 42.4 438 1-468 1-496 (496) 63 protein:vir:80959 Length: 499 100.0 1.1E-51 6.6E-55 300.0 43.3 443 1-466 1-499 (499) 64 protein:vir:79703 Length: 505 100.0 7E-48 4.3E-51 279.1 44.1 440 1-462 4-505 (505) 65 protein:vir:1587 Length: 508 # 100.0 1.5E-47 9.5E-51 277.2 43.3 440 1-470 4-508 (508) 66 protein:vir:101494 Length: 527 100.0 2.3E-48 1.4E-51 281.8 35.5 455 1-472 21-527 (527) 67 protein:vir:102239 Length: 527 100.0 2.7E-48 1.7E-51 281.4 35.5 455 1-472 21-527 (527) 68 protein:vir:3028 Length: 500 # 100.0 5.2E-46 3.2E-49 268.8 45.7 442 1-466 4-500 (500) 69 protein:vir:9815 Length: 500 # 100.0 5.2E-46 3.2E-49 268.8 45.7 442 1-466 4-500 (500) 70 protein:vir:4782 Length: 522 # 100.0 1.7E-42 1E-45 249.6 46.2 451 1-477 4-522 (522) 71 protein:vir:7430 Length: 563 # 100.0 4.3E-42 2.7E-45 247.4 35.0 466 1-480 9-558 (563) 72 protein:vir:98883 Length: 517 100.0 6.8E-40 4.2E-43 235.3 44.7 445 1-470 4-517 (517) 73 protein:vir:78907 Length: 518 100.0 7E-39 4.4E-42 229.8 41.5 440 1-466 4-518 (518) 74 protein:vir:97265 Length: 513 100.0 9.6E-28 5.9E-31 168.7 38.1 457 1-477 1-513 (513) 75 protein:vir:94956 Length: 452 99.9 1.3E-25 8.3E-29 157.0 37.3 423 1-477 1-452 (452) 76 protein:vir:80453 Length: 535 99.9 8.1E-25 5E-28 152.7 37.7 452 1-476 32-535 (535) 77 protein:vir:95149 Length: 501 99.9 2.3E-22 1.4E-25 139.2 38.7 440 1-476 1-501 (501) 78 protein:vir:78393 Length: 489 99.9 4.4E-21 2.7E-24 132.2 36.0 439 1-476 1-489 (489) 79 protein:vir:95014 Length: 491 99.9 7.5E-20 4.7E-23 125.4 37.4 434 1-466 11-491 (491) 80 protein:vir:96783 Length: 488 99.8 1.7E-18 1.1E-21 118.0 37.2 423 1-440 16-488 (488) 81 protein:vir:93630 Length: 776 99.8 4E-20 2.5E-23 126.9 25.2 465 1-480 38-688 (776) 82 protein:vir:108295 Length: 711 99.8 2.8E-17 1.7E-20 111.3 33.5 465 1-480 25-672 (711) 83 protein:vir:80040 Length: 461 99.7 3.8E-17 2.4E-20 110.6 28.7 435 1-473 1-461 (461) 84 protein:vir:79538 Length: 502 99.7 4.9E-15 3E-18 99.0 38.9 443 1-479 1-502 (502) 85 protein:vir:105619 Length: 772 99.7 3.1E-16 2E-19 105.6 28.6 464 1-480 14-675 (772) 86 protein:vir:9950 Length: 714 # 99.7 2.3E-15 1.4E-18 100.8 32.1 452 1-480 8-651 (714) 87 protein:vir:3296 Length: 714 # 99.7 2.3E-15 1.4E-18 100.8 32.1 452 1-480 8-651 (714) 88 protein:vir:817 Length: 714 # 99.7 2.3E-15 1.4E-18 100.8 32.1 452 1-480 8-651 (714) 89 protein:vir:10117 Length: 714 99.7 2.3E-15 1.4E-18 100.8 32.1 452 1-480 8-651 (714) 90 protein:vir:2764 Length: 714 # 99.7 2.3E-15 1.4E-18 100.8 32.1 452 1-480 8-651 (714) 91 protein:vir:77597 Length: 725 99.6 5.5E-16 3.4E-19 104.2 25.9 470 1-480 1-649 (725) 92 protein:vir:104437 Length: 714 99.6 2.8E-15 1.7E-18 100.4 27.5 452 1-480 10-651 (714) 93 protein:vir:8846 Length: 705 # 99.6 3.6E-15 2.2E-18 99.8 27.8 449 1-480 10-633 (705) 94 protein:vir:100920 Length: 725 99.6 5.5E-15 3.4E-18 98.8 28.3 472 1-480 1-644 (725) 95 protein:vir:96738 Length: 505 99.6 4.3E-13 2.7E-16 88.4 39.4 431 1-475 14-505 (505) 96 protein:vir:80165 Length: 651 99.6 2.5E-13 1.5E-16 89.7 35.4 458 1-480 15-630 (651) 97 protein:vir:9263 Length: 725 # 99.6 1.7E-14 1E-17 96.1 27.8 472 1-480 1-640 (725) 98 protein:vir:172 Length: 708 # 99.6 1.4E-14 8.7E-18 96.5 25.8 471 1-480 1-678 (708) 99 protein:vir:95449 Length: 584 99.6 2.3E-13 1.4E-16 89.9 32.4 435 1-456 1-584 (584) 100 protein:vir:79647 Length: 435 99.6 2.3E-14 1.4E-17 95.3 26.8 406 1-476 5-435 (435) 101 protein:vir:94049 Length: 532 99.5 2E-14 1.2E-17 95.7 24.7 429 1-480 27-527 (532) 102 protein:vir:3420 Length: 533 # 99.5 8.3E-13 5.2E-16 86.8 33.2 445 1-478 1-533 (533) 103 protein:vir:6382 Length: 553 # 99.5 4.3E-13 2.6E-16 88.4 31.6 456 1-480 1-552 (553) 104 protein:vir:105429 Length: 708 99.5 7.3E-14 4.5E-17 92.6 25.8 471 1-480 1-678 (708) 105 protein:vir:107662 Length: 427 99.5 1.4E-13 9E-17 91.0 27.2 397 12-475 1-427 (427) 106 protein:vir:95542 Length: 548 99.5 5.1E-12 3.1E-15 82.5 36.6 448 1-480 1-544 (548) 107 protein:vir:107742 Length: 537 99.5 5E-14 3.1E-17 93.5 24.2 426 1-480 70-536 (537) 108 protein:vir:5249 Length: 437 # 99.5 1.6E-12 1E-15 85.2 32.2 398 1-478 1-437 (437) 109 protein:vir:389 Length: 530 # 99.5 6.1E-12 3.8E-15 82.1 37.2 440 1-477 1-530 (530) 110 protein:vir:104338 Length: 422 99.5 1.3E-13 8E-17 91.2 25.4 401 1-474 1-422 (422) 111 protein:vir:105520 Length: 706 99.5 8.8E-14 5.5E-17 92.2 23.3 468 1-480 1-651 (706) 112 protein:vir:3139 Length: 599 # 99.4 3.8E-12 2.3E-15 83.2 30.0 442 1-468 15-599 (599) 113 protein:vir:3520 Length: 720 # 99.4 2.1E-12 1.3E-15 84.6 25.5 467 1-480 1-659 (720) 114 protein:vir:10321 Length: 495 99.3 7.4E-11 4.6E-14 76.1 37.7 439 1-476 3-495 (495) 115 protein:vir:99563 Length: 862 99.3 1.2E-12 7.7E-16 85.9 21.2 424 1-480 100-587 (862) 116 protein:vir:95821 Length: 763 99.3 3.8E-12 2.4E-15 83.2 23.6 452 1-480 29-683 (763) 117 protein:vir:96068 Length: 765 99.3 5.8E-12 3.6E-15 82.2 22.8 429 1-480 71-557 (765) 118 protein:vir:80644 Length: 551 99.2 9E-10 5.6E-13 70.2 29.1 401 1-480 54-542 (551) 119 protein:vir:63755 Length: 547 99.1 7.5E-10 4.7E-13 70.6 27.3 400 1-480 50-527 (547) 120 protein:vir:95315 Length: 559 99.0 9.4E-09 5.8E-12 64.6 37.4 453 1-480 1-559 (559) 121 protein:vir:101648 Length: 518 98.9 7.1E-09 4.4E-12 65.3 25.7 411 1-480 9-451 (518) 122 protein:vir:3153 Length: 467 # 98.9 1.1E-08 7E-12 64.2 30.1 390 46-479 1-467 (467) 123 protein:vir:1326 Length: 457 # 98.9 1.1E-08 7E-12 64.1 26.4 420 5-480 1-450 (457) 124 protein:vir:3843 Length: 397 # 98.9 1.1E-08 6.7E-12 64.3 26.2 385 5-476 1-397 (397) 125 protein:vir:94599 Length: 641 98.9 1.5E-08 9.1E-12 63.5 31.7 460 1-479 20-641 (641) 126 protein:vir:102668 Length: 547 98.9 1.6E-08 9.8E-12 63.4 38.0 433 1-468 1-547 (547) 127 protein:vir:6240 Length: 457 # 98.9 1.8E-08 1.1E-11 63.1 27.2 418 5-480 1-454 (457) 128 protein:vir:102080 Length: 429 98.8 1.7E-08 1.1E-11 63.2 24.4 397 5-475 1-429 (429) 129 protein:vir:79772 Length: 648 98.8 3.4E-08 2.1E-11 61.5 30.6 423 1-480 51-511 (648) 130 protein:vir:107404 Length: 555 98.8 5.5E-08 3.4E-11 60.4 35.6 453 1-479 1-555 (555) 131 protein:vir:107822 Length: 555 98.8 5.5E-08 3.4E-11 60.4 35.6 453 1-479 1-555 (555) 132 protein:vir:98506 Length: 555 98.8 5.5E-08 3.4E-11 60.4 35.6 453 1-479 1-555 (555) 133 protein:vir:7853 Length: 518 # 98.8 5.5E-08 3.4E-11 60.4 28.3 409 1-480 9-455 (518) 134 protein:vir:1538 Length: 535 # 98.8 6.3E-08 3.9E-11 60.1 41.0 439 1-473 1-535 (535) 135 protein:vir:2198 Length: 536 # 98.8 6.4E-08 4E-11 60.0 38.5 439 1-470 1-536 (536) 136 protein:vir:7321 Length: 556 # 98.7 6.7E-08 4.2E-11 59.9 36.8 450 1-473 1-556 (556) 137 protein:vir:104500 Length: 537 98.7 8.2E-08 5.1E-11 59.4 23.8 421 1-476 50-537 (537) 138 protein:vir:10447 Length: 536 98.7 1.1E-07 6.8E-11 58.8 39.4 439 1-470 1-536 (536) 139 protein:vir:103177 Length: 533 98.7 2.3E-08 1.5E-11 62.4 19.7 435 1-480 27-532 (533) 140 protein:vir:1785 Length: 555 # 98.6 1.6E-07 9.7E-11 57.9 35.3 429 4-477 1-555 (555) 141 protein:vir:100039 Length: 522 98.6 1.8E-07 1.1E-10 57.6 36.7 433 1-470 1-522 (522) 142 protein:vir:7407 Length: 392 # 98.6 1.9E-07 1.2E-10 57.4 27.1 373 7-471 1-392 (392) 143 protein:vir:94709 Length: 522 98.6 2.1E-07 1.3E-10 57.2 40.4 428 1-477 1-522 (522) 144 protein:vir:93610 Length: 454 98.6 2.3E-07 1.4E-10 57.0 27.2 402 11-480 1-449 (454) 145 protein:vir:99452 Length: 651 98.6 3.2E-08 2E-11 61.7 18.6 441 1-480 1-549 (651) 146 protein:vir:103765 Length: 549 98.6 2.4E-07 1.5E-10 56.9 33.3 432 1-467 1-549 (549) 147 protein:vir:106999 Length: 564 98.6 8.4E-08 5.2E-11 59.4 20.7 448 1-480 25-563 (564) 148 protein:vir:102855 Length: 432 98.6 2.8E-07 1.8E-10 56.5 27.6 397 5-475 1-432 (432) 149 protein:vir:105002 Length: 432 98.6 2.8E-07 1.8E-10 56.5 27.6 397 5-475 1-432 (432) 150 protein:vir:107605 Length: 432 98.6 2.8E-07 1.8E-10 56.5 27.6 397 5-475 1-432 (432) 151 protein:vir:80796 Length: 574 98.5 3.1E-07 1.9E-10 56.3 28.0 419 1-480 13-529 (574) 152 protein:vir:99312 Length: 563 98.5 3.2E-07 2E-10 56.2 24.9 420 1-480 26-544 (563) 153 protein:vir:95599 Length: 563 98.5 3.2E-07 2E-10 56.2 24.9 420 1-480 26-544 (563) 154 protein:vir:4156 Length: 542 # 98.5 3.4E-07 2.1E-10 56.1 23.1 421 1-480 11-478 (542) 155 protein:vir:8418 Length: 409 # 98.5 3.8E-07 2.3E-10 55.8 27.0 385 5-476 1-409 (409) 156 protein:vir:3361 Length: 535 # 98.5 3.8E-07 2.4E-10 55.8 40.6 441 1-477 1-535 (535) 157 protein:vir:1380 Length: 422 # 98.4 6.1E-07 3.8E-10 54.6 24.2 388 5-478 1-422 (422) 158 protein:vir:101647 Length: 460 98.4 6.2E-07 3.9E-10 54.6 25.7 410 1-478 1-460 (460) 159 protein:vir:4952 Length: 386 # 98.4 6.3E-07 3.9E-10 54.6 27.5 372 5-467 1-386 (386) 160 protein:vir:1266 Length: 416 # 98.4 6.5E-07 4E-10 54.5 28.2 390 6-473 1-416 (416) 161 protein:vir:102727 Length: 945 98.4 8.5E-07 5.3E-10 53.9 28.3 416 1-480 40-537 (945) 162 protein:vir:78696 Length: 542 98.4 9.2E-07 5.7E-10 53.7 38.8 437 1-477 1-542 (542) 163 protein:vir:100691 Length: 535 98.3 1.1E-06 7.1E-10 53.2 28.7 393 1-479 53-535 (535) 164 protein:vir:81152 Length: 411 98.3 1.2E-06 7.7E-10 53.0 27.3 381 5-463 1-411 (411) 165 protein:vir:104892 Length: 558 98.3 1.3E-06 8.3E-10 52.8 23.5 449 1-480 43-551 (558) 166 protein:vir:3989 Length: 392 # 98.3 1.4E-06 8.5E-10 52.7 27.5 370 7-471 1-392 (392) 167 protein:vir:1023 Length: 392 # 98.3 1.4E-06 8.5E-10 52.7 27.5 370 7-471 1-392 (392) 168 protein:vir:9702 Length: 406 # 98.3 1.4E-06 8.8E-10 52.6 21.2 382 12-479 1-406 (406) 169 protein:vir:100882 Length: 383 98.3 1.8E-06 1.1E-09 52.0 23.4 368 5-476 1-383 (383) 170 protein:vir:96579 Length: 576 98.2 2.1E-06 1.3E-09 51.7 28.4 396 1-480 54-540 (576) 171 protein:vir:4828 Length: 382 # 98.2 2.3E-06 1.4E-09 51.5 24.9 369 5-467 1-382 (382) 172 protein:vir:81095 Length: 416 98.2 2.4E-06 1.5E-09 51.4 26.7 382 12-476 1-416 (416) 173 protein:vir:4598 Length: 416 # 98.2 2.4E-06 1.5E-09 51.4 26.7 382 12-476 1-416 (416) 174 protein:vir:100598 Length: 516 98.2 2.4E-06 1.5E-09 51.4 20.2 398 1-462 64-516 (516) 175 protein:vir:8883 Length: 543 # 98.2 2.9E-06 1.8E-09 50.9 37.6 441 1-480 1-542 (543) 176 protein:vir:94572 Length: 535 98.2 2.9E-06 1.8E-09 50.9 37.5 433 1-472 1-535 (535) 177 protein:vir:99672 Length: 532 98.2 3.4E-06 2.1E-09 50.6 37.8 432 1-470 1-532 (532) 178 protein:vir:103330 Length: 517 98.1 4.2E-06 2.6E-09 50.0 36.7 419 1-468 1-517 (517) 179 protein:vir:960 Length: 413 # 98.1 4.5E-06 2.8E-09 49.9 23.0 379 1-477 1-413 (413) 180 protein:vir:105782 Length: 449 98.1 4.6E-06 2.9E-09 49.8 27.7 400 1-478 1-449 (449) 181 protein:vir:94426 Length: 409 98.1 5.5E-06 3.4E-09 49.4 29.8 386 1-477 1-409 (409) 182 protein:vir:4454 Length: 414 # 98.1 5.6E-06 3.5E-09 49.4 29.6 387 5-479 1-414 (414) 183 protein:vir:100150 Length: 437 98.1 5.6E-06 3.5E-09 49.4 28.9 400 1-480 1-437 (437) 184 protein:vir:81017 Length: 521 98.0 6E-06 3.7E-09 49.2 22.4 415 1-459 46-521 (521) 185 protein:vir:98265 Length: 524 98.0 5.6E-06 3.5E-09 49.4 19.3 404 1-459 71-524 (524) 186 protein:vir:3648 Length: 695 # 98.0 6.4E-06 4E-09 49.1 27.3 427 1-480 79-573 (695) 187 protein:vir:4194 Length: 540 # 98.0 7.3E-06 4.6E-09 48.7 25.7 411 11-480 1-461 (540) 188 protein:vir:106716 Length: 698 98.0 7.6E-06 4.7E-09 48.6 25.0 427 1-480 79-574 (698) 189 protein:vir:78589 Length: 695 98.0 8.1E-06 5E-09 48.5 27.5 429 1-480 79-573 (695) 190 protein:vir:9408 Length: 441 # 98.0 9.1E-06 5.6E-09 48.2 26.7 393 5-478 1-441 (441) 191 protein:vir:79984 Length: 441 98.0 9.1E-06 5.6E-09 48.2 26.7 393 5-478 1-441 (441) 192 protein:vir:9359 Length: 348 # 98.0 9.3E-06 5.7E-09 48.2 28.6 331 35-467 1-348 (348) 193 protein:vir:101541 Length: 694 97.9 1E-05 6.3E-09 48.0 27.4 429 1-480 78-572 (694) 194 protein:vir:6596 Length: 521 # 97.9 1.1E-05 6.5E-09 47.9 22.6 416 1-459 46-521 (521) 195 protein:vir:99853 Length: 488 97.9 1.1E-05 6.7E-09 47.8 28.9 390 8-480 1-414 (488) 196 protein:vir:106282 Length: 521 97.9 1.2E-05 7.5E-09 47.6 23.2 407 1-459 47-521 (521) 197 protein:vir:107880 Length: 491 97.9 1.3E-05 8E-09 47.4 28.1 393 1-480 15-430 (491) 198 protein:vir:100187 Length: 385 97.8 1.5E-05 9.3E-09 47.0 25.9 366 1-475 1-385 (385) 199 protein:vir:1884 Length: 424 # 97.8 1.5E-05 9.5E-09 47.0 23.5 382 1-477 1-424 (424) 200 protein:vir:4995 Length: 384 # 97.8 1.6E-05 9.6E-09 46.9 23.6 367 5-458 1-384 (384) 201 protein:vir:6322 Length: 510 # 97.8 1.7E-05 1.1E-08 46.7 37.7 425 4-461 1-510 (510) 202 protein:vir:189 Length: 424 # 97.8 1.9E-05 1.2E-08 46.4 23.9 379 1-477 1-424 (424) 203 protein:vir:94666 Length: 723 97.8 2.2E-05 1.4E-08 46.1 29.9 400 1-480 1-446 (723) 204 protein:vir:104259 Length: 403 97.7 2.5E-05 1.6E-08 45.8 26.9 369 1-470 1-403 (403) 205 protein:vir:4854 Length: 386 # 97.7 3E-05 1.8E-08 45.4 28.9 373 5-467 1-386 (386) 206 protein:vir:7017 Length: 515 # 97.7 3.1E-05 1.9E-08 45.3 34.2 420 1-467 1-515 (515) 207 protein:vir:93943 Length: 409 97.7 3.1E-05 1.9E-08 45.3 30.9 388 1-477 1-409 (409) 208 protein:vir:9507 Length: 395 # 97.7 3.2E-05 2E-08 45.3 20.9 372 5-474 1-395 (395) 209 protein:vir:100650 Length: 395 97.7 3.2E-05 2E-08 45.3 20.9 372 5-474 1-395 (395) 210 protein:vir:101289 Length: 395 97.7 3.2E-05 2E-08 45.3 20.9 372 5-474 1-395 (395) 211 protein:vir:6896 Length: 523 # 97.7 3.2E-05 2E-08 45.3 18.3 404 1-459 36-523 (523) 212 protein:vir:103458 Length: 524 97.6 3.3E-05 2.1E-08 45.1 19.5 398 1-459 69-524 (524) 213 protein:vir:7208 Length: 524 # 97.6 3.5E-05 2.2E-08 45.0 19.5 398 1-459 69-524 (524) 214 protein:vir:102118 Length: 409 97.6 3.6E-05 2.2E-08 45.0 26.6 380 10-476 1-409 (409) 215 protein:vir:98396 Length: 441 97.6 3.8E-05 2.4E-08 44.8 30.0 396 5-478 1-441 (441) 216 protein:vir:95378 Length: 406 97.6 4.1E-05 2.6E-08 44.6 25.8 382 5-480 1-406 (406) 217 protein:vir:101189 Length: 516 97.6 4.3E-05 2.7E-08 44.5 22.8 400 1-462 64-516 (516) 218 protein:vir:101806 Length: 516 97.6 4.3E-05 2.7E-08 44.5 22.8 400 1-462 64-516 (516) 219 protein:vir:95254 Length: 488 97.5 5.1E-05 3.2E-08 44.1 28.7 428 1-480 1-484 (488) 220 protein:vir:5665 Length: 511 # 97.5 5.2E-05 3.2E-08 44.1 18.0 400 1-462 59-511 (511) 221 protein:vir:2683 Length: 412 # 97.5 5.8E-05 3.6E-08 43.8 31.3 386 1-477 1-412 (412) 222 protein:vir:3868 Length: 417 # 97.5 5.8E-05 3.6E-08 43.8 25.7 374 26-478 1-417 (417) 223 protein:vir:78942 Length: 510 97.5 6E-05 3.7E-08 43.7 38.6 423 4-461 1-510 (510) 224 protein:vir:108049 Length: 524 97.4 7.8E-05 4.9E-08 43.1 20.9 409 1-459 45-524 (524) 225 protein:vir:99232 Length: 526 97.4 8.3E-05 5.2E-08 43.0 32.4 410 1-480 1-456 (526) 226 protein:vir:103860 Length: 528 97.4 8.6E-05 5.4E-08 42.9 31.9 412 1-480 1-461 (528) 227 protein:vir:96980 Length: 409 97.3 9.6E-05 6E-08 42.6 30.0 389 1-477 1-409 (409) 228 protein:vir:483 Length: 413 # 97.3 9.8E-05 6.1E-08 42.6 31.9 389 6-479 1-413 (413) 229 protein:vir:79233 Length: 526 97.3 0.0001 6.3E-08 42.5 30.7 392 1-480 25-455 (526) 230 protein:vir:81072 Length: 432 97.3 0.0001 6.4E-08 42.4 28.7 400 1-478 1-432 (432) 231 protein:vir:4337 Length: 434 # 97.3 0.00011 7E-08 42.2 28.3 393 1-473 1-434 (434) 232 protein:vir:8317 Length: 409 # 97.2 0.00012 7.7E-08 42.0 19.9 352 12-460 1-409 (409) 233 protein:vir:105641 Length: 516 97.2 0.00014 8.6E-08 41.7 34.9 423 1-467 1-516 (516) 234 protein:vir:96988 Length: 516 97.2 0.00015 9E-08 41.6 36.7 423 1-467 5-516 (516) 235 protein:vir:79063 Length: 491 97.1 0.00015 9.4E-08 41.5 27.2 394 1-480 1-430 (491) 236 protein:vir:98816 Length: 446 97.1 0.00019 1.2E-07 41.0 22.1 406 1-428 1-446 (446) 237 protein:vir:108215 Length: 469 97.0 0.00022 1.3E-07 40.7 27.8 427 1-480 1-468 (469) 238 protein:vir:1082 Length: 359 # 97.0 0.00024 1.5E-07 40.5 24.0 344 5-426 1-359 (359) 239 protein:vir:6210 Length: 394 # 96.9 0.00024 1.5E-07 40.4 25.7 371 5-475 1-394 (394) 240 protein:vir:10362 Length: 432 96.9 0.00025 1.6E-07 40.3 27.6 402 1-478 1-432 (432) 241 protein:vir:4509 Length: 424 # 96.8 0.0003 1.9E-07 39.9 26.1 377 7-473 1-424 (424) 242 protein:vir:5737 Length: 419 # 96.7 0.00039 2.4E-07 39.3 30.0 372 23-480 1-418 (419) 243 protein:vir:78641 Length: 278 96.7 0.0004 2.5E-07 39.2 24.3 260 64-389 1-278 (278) 244 protein:vir:80134 Length: 403 96.6 0.00049 3E-07 38.7 25.2 372 1-479 1-403 (403) 245 protein:vir:1986 Length: 512 # 96.6 0.0005 3.1E-07 38.7 31.2 384 1-480 39-448 (512) 246 protein:vir:8100 Length: 466 # 96.5 0.00053 3.3E-07 38.5 27.2 402 5-475 1-466 (466) 247 protein:vir:97060 Length: 432 96.5 0.00055 3.4E-07 38.4 28.5 399 1-478 1-432 (432) 248 protein:vir:103219 Length: 201 96.5 0.0005 3.1E-07 38.7 14.2 196 218-476 1-201 (201) 249 protein:vir:81218 Length: 423 96.5 0.00059 3.6E-07 38.3 24.1 392 5-479 1-423 (423) 250 protein:vir:105064 Length: 421 96.3 0.00076 4.7E-07 37.7 28.8 383 10-480 1-420 (421) 251 protein:vir:5839 Length: 533 # 95.7 0.0017 1E-06 35.8 22.1 424 1-480 9-525 (533) 252 protein:vir:95965 Length: 385 95.3 0.0024 1.5E-06 35.0 23.6 356 1-466 1-385 (385) 253 protein:vir:1431 Length: 419 # 95.0 0.0029 1.8E-06 34.5 30.2 387 10-480 1-417 (419) 254 protein:vir:100249 Length: 431 94.8 0.0033 2.1E-06 34.2 31.0 385 1-479 1-431 (431) 255 protein:vir:78161 Length: 355 94.0 0.0056 3.5E-06 32.9 22.8 312 137-480 1-336 (355) 256 protein:vir:4089 Length: 395 # 93.6 0.0069 4.3E-06 32.4 27.9 374 5-476 1-395 (395) 257 protein:vir:78310 Length: 376 93.5 0.0072 4.5E-06 32.3 21.6 353 1-476 1-376 (376) 258 protein:vir:79511 Length: 448 93.5 0.0073 4.6E-06 32.3 27.0 401 1-480 1-440 (448) 259 protein:vir:80333 Length: 419 92.9 0.0095 5.9E-06 31.7 30.6 383 10-480 1-416 (419) 260 protein:vir:77981 Length: 448 91.8 0.014 8.9E-06 30.7 28.8 409 1-480 1-445 (448) 261 protein:vir:98853 Length: 219 91.0 0.018 1.1E-05 30.2 13.5 203 154-393 1-219 (219) 262 protein:vir:80211 Length: 514 90.9 0.018 1.1E-05 30.1 39.3 417 1-464 1-514 (514) 263 protein:vir:1661 Length: 378 # 90.6 0.02 1.2E-05 29.9 20.8 350 19-466 1-378 (378) 264 protein:vir:94869 Length: 378 89.9 0.024 1.5E-05 29.5 21.8 349 19-476 1-378 (378) 265 protein:vir:78191 Length: 351 89.8 0.024 1.5E-05 29.4 21.4 303 1-400 33-351 (351) 266 protein:vir:98643 Length: 395 89.6 0.025 1.6E-05 29.3 22.2 368 5-469 1-395 (395) 267 protein:vir:100328 Length: 346 88.8 0.03 1.9E-05 28.9 18.1 295 1-398 26-346 (346) 268 protein:vir:93867 Length: 378 86.5 0.045 2.8E-05 28.0 19.7 338 19-476 1-378 (378) 269 protein:vir:5691 Length: 344 # 85.6 0.052 3.2E-05 27.6 18.9 289 1-394 28-344 (344) 270 protein:vir:2013 Length: 344 # 85.4 0.053 3.3E-05 27.6 19.9 290 1-392 28-344 (344) 271 protein:vir:267 Length: 348 # 83.8 0.066 4.1E-05 27.1 22.0 309 1-399 1-348 (348) 272 protein:vir:6058 Length: 344 # 83.6 0.067 4.2E-05 27.0 20.0 289 1-394 1-344 (344) 273 protein:vir:79207 Length: 351 83.2 0.07 4.4E-05 26.9 21.0 300 1-402 33-351 (351) 274 protein:vir:1150 Length: 350 # 79.7 0.1 6.3E-05 26.0 21.9 299 12-392 1-350 (350) 275 protein:vir:103971 Length: 376 79.4 0.1 6.5E-05 26.0 22.8 291 1-400 58-376 (376) 276 protein:vir:94002 Length: 378 78.0 0.12 7.4E-05 25.6 21.0 336 19-476 1-378 (378) 277 protein:vir:345 Length: 663 # 74.5 0.16 9.8E-05 25.0 26.4 458 1-480 1-617 (663) 278 protein:vir:98567 Length: 340 73.2 0.17 0.00011 24.8 20.2 289 1-397 25-340 (340) 279 protein:vir:3780 Length: 345 # 70.5 0.21 0.00013 24.3 23.3 299 1-394 1-345 (345) 280 protein:vir:9641 Length: 395 # 67.0 0.26 0.00016 23.8 27.4 365 22-469 1-395 (395) 281 protein:vir:79150 Length: 368 64.0 0.31 0.00019 23.4 17.7 305 1-412 41-368 (368) 282 protein:vir:78749 Length: 337 61.2 0.36 0.00022 23.0 21.5 297 1-392 22-337 (337) 283 protein:vir:106639 Length: 481 54.9 0.49 0.00031 22.3 23.2 414 1-476 23-481 (481) 284 protein:vir:858 Length: 378 # 34.5 1.3 0.0008 20.0 24.5 338 19-476 1-378 (378) 285 protein:vir:101418 Length: 569 32.1 1.5 0.0009 19.7 16.1 433 1-476 63-569 (569) 286 protein:vir:105154 Length: 525 20.1 2.8 0.0018 18.1 16.9 442 1-480 11-523 (525) No 1 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=1e-117 Score=661.99 Aligned_cols=480 Identities=100% Similarity=1.438 Sum_probs=460.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) |+|+.++|++|+++|..+++|++++++||+|+|+|++.+..++++++++++++|||++|||++++||+++||++++|++. T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~~ 80 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHHHHHHhhhccCceecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999889 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.|+++|+.|+++.++.+++++++++|+||++||+++..+.|++|.++++++||.+++|+||+...+++.+++++|... T Consensus 81 ~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~~~~~ 160 (480) T protein:vir:78 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee Confidence 99999999999999999999999999999999999988888899999999999999999999999889999999999988 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHH Q lcl|NC_021324. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s 240 (480) ++.+.++++++|+++.+++|+..++...++++..++.+|+||+||||+|+|+++..++||+|+|+++|++|+|+||+++| T Consensus 161 d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s 240 (480) T protein:vir:78 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) T ss_pred cCCcceEEEEEEeCCeEEEEEecCCCcccccccccccccCCCCcceEEeecccccCCccCccchhHHHHHHHHHHHHHHH Confidence 88888899999999999999999888888888889999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCCh Q lcl|NC_021324. 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) Q Consensus 241 ~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~ 320 (480) ++++++++|++|+++++|.+.+.+.++.+..++....++++.++|++++|++++.+++++|+++++.++++|++++++|+ T Consensus 241 ~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~ 320 (480) T protein:vir:78 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) T ss_pred HHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCH Confidence 99999999999999999999888877777788888999999999999999999999999999999999999999999999 Q ss_pred HHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH Q lcl|NC_021324. 321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) Q Consensus 321 ~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~ 400 (480) +.||+++.|++||+||++++..|+.||.++++.|+.+|++++++++++++.....++..++++|+++.++|+++.+|+++ T Consensus 321 ~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~ 400 (480) T protein:vir:78 321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) T ss_pred HHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999877778889999999999999999999999 Q ss_pred HHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 401 kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) |++++|.+++|++|+++++||+++++++|+++++++.+++.+++.+...+.+++.+++.+++...+++.+|++-+++.+| T Consensus 401 kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCccCCCcccCCCcCCC Confidence 99999999999999999999999999999999999999999999999888888999999999999999999999999999 No 2 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=1.3e-117 Score=661.42 Aligned_cols=480 Identities=100% Similarity=1.437 Sum_probs=459.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) |+|+.++|++|+++|..+++|+.++.+||+|+|+|++.+..++++++++++++|||++|||++++||+++||++++|++. T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~d~~~ 80 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHHHHHHHhhhccCceecCCCchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999889 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.|+++|+.|+++.++.+++++++++|+||++||+++..+.|++|.++++++||.+++|+||+...+++.+++++|... T Consensus 81 ~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~~~~~ 160 (480) T protein:vir:78 81 LEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEEEEee Confidence 99999999999999999999999999999999999998888899999999999999999999999889999999999988 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHH Q lcl|NC_021324. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s 240 (480) ++.+.++++++|+++.+++|+..++...+++...++.+|+||+||||+|+|+++.+++||+|+|+++|++|+|+||+++| T Consensus 161 ~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s 240 (480) T protein:vir:78 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) T ss_pred cCCCceEEEEEEeCCeEEEEEecCCCccccccccccccCCCCCcceEEeecccccCCccCcccchhhHHHHHHHHHHHHH Confidence 88888999999999999999998888888888889999999999999999999999999999999899999999999999 Q ss_pred HHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCCh Q lcl|NC_021324. 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) Q Consensus 241 ~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~ 320 (480) ++++++++|++|+++++|.+.+++.++.++.+++...++++.++|++++|++++.+++++|+++++.++++|++.+++|+ T Consensus 241 ~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~ 320 (480) T protein:vir:78 241 NLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPP 320 (480) T ss_pred HHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhccCCCCCceEEecCccCHHHHHHHHHHHHHHHhcccCCCh Confidence 99999999999999999999888887777888888999999999999999999999999999999999999999999999 Q ss_pred HHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH Q lcl|NC_021324. 321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) Q Consensus 321 ~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~ 400 (480) ++||+++.|++||+||++++..|+.||+++++.|+++|++++++++++.|.....++..++++|+++.++|+++.+|+++ T Consensus 321 ~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~ 400 (480) T protein:vir:78 321 QYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) T ss_pred HHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999877778889999999999999999999999 Q ss_pred HHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 401 kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) |++++|.+++|++|+++++||+++++++|+++++++.+++++++.+...+.++..+++.+++..++++.+|+|.++++|| T Consensus 401 kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:78 401 KLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) T ss_pred HHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCccccccCCCCcccCC Confidence 99999999999999999999999999999999999999999999988888888888888889999999999999999999 No 3 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=1.3e-103 Score=584.65 Aligned_cols=470 Identities=44% Similarity=0.737 Sum_probs=401.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) -.++.+++++|+.+|..+++|++++.+||+|+|+|++++..++++++++++++|||++|||++++||+++||+++++++. T Consensus 11 ~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~~~v~n~~~~iVd~~~~~l~~~g~~~~~~~~~ 90 (486) T protein:vir:42 11 IEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQLLAHVGYPRLYVDSVAERQAVEGFRLGDADEA 90 (486) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhhhhccchHHHHHHHHHhhhcccceecCCCchh Confidence 34456789999999999999999999999999999999999999988999999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc--CCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES--GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~--~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) .+.++++|+.|+|+.++.++++++++||+||++||.++.+. .+.++.+++++++|.+++++||+..+ ++.+++++|. T Consensus 91 ~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~-~~~~~~~~~~ 169 (486) T protein:vir:42 91 DEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRIN-RVSKAIRVAY 169 (486) T ss_pred HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCCC-CeEEEEEEEE Confidence 88999999999999999999999999999999999876443 35678899999999999999998754 6888888776 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) . ++.+.++++++|+++.+++|...++. + ...+..+|+||+||||+|+|+++.+++||+|+|+++|++|||+||++ T Consensus 170 ~-~~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~ 244 (486) T protein:vir:42 170 D-KEGNEIQAATLYTPMETIGWFRADGE---W-AEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARI 244 (486) T ss_pred e-cCCCeEEEEEEEcCCcEEEEEecCCc---E-EeecceecCCCCceEEEeccccccCCCCCcccchhhHHHHHHHHHHH Confidence 4 45666788999999999999876543 2 23456789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021324. 239 LMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~p~l~i~G~~~~~~~~~--~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|++++++++|++|+++++|.+++.+..+ .+..+++...+++|.+++++++|++++.+++++|+++|+.++++|+.++ T Consensus 245 ~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~ 324 (486) T protein:vir:42 245 LMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYT 324 (486) T ss_pred HHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhcccCCCCceEEeecccCHHHHHHHHHHHHHHHhccc Confidence 99999999999999999999987765433 4445678888999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021324. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~~~~~~~~~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++..|++||+++++.|+++|++++++++++.+. ....+..+++++|+++.|+|+++. T Consensus 325 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ 404 (486) T protein:vir:42 325 GLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAK 404 (486) T ss_pred CCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHH Confidence 9999999999999999999999999999999999999999999999999998764 345677889999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC-CCCccCCCCCCCC Q lcl|NC_021324. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-ETKTETQTSPSGF 474 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~~~~~~ 474 (480) ||+++|+++++.+++|+||+++++||+++++++|+++++++.+.....+.......+....++..+ ++.++....++|+ T Consensus 405 ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 484 (486) T protein:vir:42 405 ADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTAPPKPQPAIESSGG 484 (486) T ss_pred HHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCCCCCCCcccCCCCC Confidence 999999999999999999999999999999999988777665544333322222221111112212 2222233334444 Q ss_pred Cc Q lcl|NC_021324. 475 NR 476 (480) Q Consensus 475 ~~ 476 (480) ++ T Consensus 485 ~~ 486 (486) T protein:vir:42 485 DA 486 (486) T ss_pred CC Confidence 44 No 4 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=5.9e-103 Score=581.04 Aligned_cols=470 Identities=44% Similarity=0.728 Sum_probs=401.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) -.+...++..|+++|..+++|+.++++||+|+|+|++.+..++++++++++++|||++|||++++||+++||+++++++. T Consensus 11 ~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~ 90 (485) T protein:vir:24 11 IADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVGYPRLYVDSIAERQAVEGFRLGDADEA 90 (485) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccchHHHHHHHHhhhhccCceecCCCchh Confidence 34446688899999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc--CCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES--GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~--~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ++.++++|++|+|+.++.++++++++||+||++||.++... .+.++.++++.+||.+++++||+... ++.+++++|. T Consensus 91 ~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~~~~-~~~~~~~~~~ 169 (485) T protein:vir:24 91 DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDPRIG-RPAKAIRVAY 169 (485) T ss_pred HHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeCCcC-ceeEEEEEEE Confidence 89999999999999999999999999999999999876432 34568899999999999999998865 5666666654 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) . ++.+..+++++|+++.+++|...++. + ...+..+|+||.||||+|+|+++..++||+|+|+++|++|||+||++ T Consensus 170 ~-~~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~ 244 (485) T protein:vir:24 170 D-AEGNEIQAATLYTPNETFGWFRAEGE---W-VEWFSDPHGLGAVPVVPLPNRTRLSDLYGTSEITPELRSMTDAAARI 244 (485) T ss_pred e-ecCCeEEEEEEEcCCcEEEEEecCCc---e-EeecccccCCCcccEEEeccCcccCCcCCcccchhhHHHHHHHHHHH Confidence 4 34556788999999999999876643 2 23456789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCcccccc--ccccchhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021324. 239 LMNLQSASQILGTPLRVISGVTTDELTN--DGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~p~l~i~G~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|++++++++|++|+++++|.+++.+.. .++..+++...+++|.+++++++|++++.+++++|+++|+.++++|++++ T Consensus 245 ~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~ 324 (485) T protein:vir:24 245 LMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYT 324 (485) T ss_pred HHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccCCCCceEEeecccchHHHHHHHHHHHHHHhccc Confidence 9999999999999999999998876543 33456778889999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021324. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-REVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-~~~~~~~~~~~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++.+|++||+++++.|+++|++++++++.+.+ .....+...++++|+++.++|+++. T Consensus 325 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ 404 (485) T protein:vir:24 325 GLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAK 404 (485) T ss_pred CCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHH Confidence 999999999999999999999999999999999999999999999999998865 4456677899999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) +|+++|++++|.+++|+||+++++||+++++++|+++.+++.+.....+.+.....+....++..++ ..+.+..+++++ T Consensus 405 ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e-~~~~~~~~~~~~ 483 (485) T protein:vir:24 405 ADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTP-APKPQPAIEGGD 483 (485) T ss_pred HHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCC-CCCCccCCCCCC Confidence 9999999999999999999999999999999998877777655443333222222221112222222 222333445666 Q ss_pred cc Q lcl|NC_021324. 476 RT 477 (480) Q Consensus 476 ~~ 477 (480) ++ T Consensus 484 ~a 485 (485) T protein:vir:24 484 SA 485 (485) T ss_pred CC Confidence 66 No 5 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=3.6e-102 Score=576.71 Aligned_cols=467 Identities=42% Similarity=0.716 Sum_probs=399.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) .+|+..++++|+.+|..+++|++++.+||+|+|+|+++++..++.++++++++|||++|||++++||+++||+++++++. T Consensus 11 ~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~ 90 (485) T protein:vir:10 11 IEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPRLYVDSIAERQAVEGFRFGDADEA 90 (485) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHHHHHHHHHhhhcccceecCCCchh Confidence 66778899999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccC--CCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG--DPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~--d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) .+.++++|++|+|+.++.++++++++||+||++||+++.... +.++.++|++++|.+++++||+..++ +.+++++|. T Consensus 91 ~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~-~~~~~~~~~ 169 (485) T protein:vir:10 91 DEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGR-VSKAIRVAY 169 (485) T ss_pred HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCCCc-eeEEEEEEE Confidence 899999999999999999999999999999999998865432 35688999999999999999987654 556666655 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) . ++.+..+++++|+++.+++|...++. | ...+..+|+||+||||+|+|+++..++||+|+|+++|++|||+||++ T Consensus 170 ~-~~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~ 244 (485) T protein:vir:10 170 D-AEGNEIQAATLYTPNDIFGWYRVENE---W-QEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEITPELRSMTDAAARI 244 (485) T ss_pred e-eCCCeEEEEEEEeCCeEEEEEEcCCc---e-EEeccccCCCCcccEEEeccccccCCCCCccchhHHHHHHHHHHHHH Confidence 3 34455778899999999999876543 2 23456789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCcccccc--ccccchhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021324. 239 LMNLQSASQILGTPLRVISGVTTDELTN--DGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~p~l~i~G~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|++++++++|++|+++++|.+.+.+.. ..+..+++...+++|.+++++++|++++.+++++|+++|+.++++|++++ T Consensus 245 ~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~d~k~~q~~~~~~~~~~~~l~~~i~~~~~~~ 324 (485) T protein:vir:10 245 LMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFEDAEGKIQQFSAAELANFTNALDQIAKQVAAYT 324 (485) T ss_pred HHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhcccceeccCCCCceEEeecccchHHHHHHHHHHHHHHhccc Confidence 9999999999999999999998776543 33455678888999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021324. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~~~~~~~~~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++..|++||+++++.|+.+|++++++++.+.+. ....+...++|+|+++.|+|+++. T Consensus 325 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ 404 (485) T protein:vir:10 325 GLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAK 404 (485) T ss_pred CCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHH Confidence 9999999999999999999999999999999999999999999999999988754 345567789999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHH---HHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCC Q lcl|NC_021324. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETE---DMIDTLYSTTKAQADATPKPTVTETKTETQTSPS 472 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 472 (480) +|+++|++++|.+++|+|++++++||+++++++|+++.+++.+ ...+++.+...+.+++.... +..+.++..+ T Consensus 405 ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~ 480 (485) T protein:vir:10 405 ADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPA----PAPKPAALES 480 (485) T ss_pred HHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCcc----ccccCcCCCC Confidence 9999999999999999999999999999999888766555543 34455554433332222111 2222222233 Q ss_pred CCCcc Q lcl|NC_021324. 473 GFNRT 477 (480) Q Consensus 473 ~~~~~ 477 (480) +++++ T Consensus 481 ~~~~~ 485 (485) T protein:vir:10 481 GGDAA 485 (485) T ss_pred CCCCC Confidence 44444 No 6 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=1.2e-101 Score=573.79 Aligned_cols=470 Identities=44% Similarity=0.767 Sum_probs=399.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) =.|+++++++|++.|..+.+|++++.+||+|+|++++++...+++++++++++|||++|||++++||+++||+++++++. T Consensus 10 ~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~~~~~ 89 (484) T protein:vir:77 10 NVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVGYPRLYIDAIAARQELEGFRLGGADKA 89 (484) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhHHhhhhhcCcHHHHHHHHHhhhccCceecCCcchh Confidence 23457789999999999999999999999999999999999999999989999999999999999999999999988888 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCC--CCceeEEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD--PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d--~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) .+.++++|++|+|+.++.++++++++||+||++||.++.+... ..+.++|+++||.+++++||+.. +++.+++++|. T Consensus 90 ~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~-~~~~~a~~~~~ 168 (484) T protein:vir:77 90 DEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPRT-RQVMRAIRAIE 168 (484) T ss_pred HHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCCC-CceEEEEEEEE Confidence 8999999999999999999999999999999999987654332 34568899999999999999874 57888888876 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) ..+ .+.++++++|+++.+++|...++. | ...+..+|+||+||||+|+|++..+++||+|+|+++|++|+|+||++ T Consensus 169 ~~~-~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~ 243 (484) T protein:vir:77 169 DEE-GNEVIGATLYLPNNTVIWNREDGQ---W-VQVANVAHNLEMVPVIPIPNRTRLSDLYGTTEITPELRSVTDAAART 243 (484) T ss_pred eec-CCcEEEEEEEecCeEEEEEecCCc---e-EeeccccCCCCCcceEEeccccccCccCCcccchHHHHHHHHHHHHH Confidence 544 455678899999999998876542 2 23455789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhcchhhhhcCCCcccccc--ccccchhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhc Q lcl|NC_021324. 239 LMNLQSASQILGTPLRVISGVTTDELTN--DGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASIT 316 (480) Q Consensus 239 ~s~~~~~~~~~~~p~l~i~G~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 316 (480) +|+++++++++++|+++++|.+++.+.. ..+...++...+++|.+++++++|++++.+++++|+++|+.++++|++++ T Consensus 244 ~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~ 323 (484) T protein:vir:77 244 LMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAFEDHESKAQQFSAAELRNFVDALDALDRKAAAYT 323 (484) T ss_pred HHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhhcccCCCCceeEeecCCChHHHHHHHHHHHHHHhccc Confidence 9999999999999999999998776543 33455677888999999999999999999999999999999999999999 Q ss_pred CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCCHHHH Q lcl|NC_021324. 317 GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTVAAK 395 (480) Q Consensus 317 ~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~~~~~~~~~v~f~~~~~~~~~~~ 395 (480) ++|+++||+++.|++||+||++++.+|++||+++++.|+++|++++++++.+.+. ....+..+++++|+++.++|+++. T Consensus 324 ~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ 403 (484) T protein:vir:77 324 GLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAK 403 (484) T ss_pred CCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHH Confidence 9999999999999999999999999999999999999999999999999988764 345566789999999999999999 Q ss_pred HHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 396 ADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 396 ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) +|+++|++++|.+++|++++++++||+++++++|+++++++.......+.......++.+..+ +..+..++.|+... T Consensus 404 ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~ 480 (484) T protein:vir:77 404 ADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNP---DNPETPEPQPNPAE 480 (484) T ss_pred HHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCC---CCCCcccccCCCcc Confidence 999999999999999999999999999999999887777665544443333322222222222 22222223343433 Q ss_pred cccC Q lcl|NC_021324. 476 RTKT 479 (480) Q Consensus 476 ~~~~ 479 (480) +++. T Consensus 481 ~~~~ 484 (484) T protein:vir:77 481 EAAA 484 (484) T ss_pred ccCC Confidence 3333 No 7 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=4e-100 Score=565.50 Aligned_cols=463 Identities=44% Similarity=0.760 Sum_probs=392.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC----- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS----- 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~----- 75 (480) ++ +.+||++|+.+|..+++|++++++||+|+|+|++.+..+++.++++++++|||++|||++++||+++||+++ T Consensus 7 ~d-~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~ivd~~a~~l~~~Gf~~~~~~~~ 85 (488) T protein:vir:23 7 ID-PEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYVDAIAERQELEGFRIPSANGE 85 (488) T ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHHHHHHHhhhccceeccCCccc Confidence 65 478999999999999999999999999999999999999999999999999999999999999999999764 Q ss_pred -----CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc--CCCCceeEEEEEcccceEEEEeCCCCC Q lcl|NC_021324. 76 -----EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES--GDPAGIPLIRVESPLYMYAELDPRNTR 148 (480) Q Consensus 76 -----~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~--~d~~g~~~~~~~~p~~~~~v~d~~~~~ 148 (480) +|++..+.++++|++|+|+.++.++++++++||+||++|++++... .++++.+++++++|.+++++||+..+ T Consensus 86 ~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~d~~~~- 164 (488) T protein:vir:23 86 EPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAEVDPRTR- 164 (488) T ss_pred ccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeEEEEecCCC- Confidence 3556778899999999999999999999999999999999876432 36788999999999999999998754 Q ss_pred ceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHH Q lcl|NC_021324. 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) ++.+++++|... +.+..+++++|+++.+++|...++. | ...+..+|+||+||||+|+|+++..++||+|+|+++| T Consensus 165 ~~~~~~~~~~~~-~~~~~~~~~~y~~~~~~~~~~~~~~---~-~~~~~~~h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v 239 (488) T protein:vir:23 165 KVLYAIRAIYGA-DGNEIVSATLYLPDTTMTWLRAEGE---W-EAPTSTPHGLEMVPVIPISNRTRLSDLYGTSEISPEL 239 (488) T ss_pred ceEEEEEEEEec-CCCcEEEEEEEecCcEEEEEecCCc---e-EeccccccCCCCcceEEeccccccCCcCCccchhhhH Confidence 677778777554 4455678899999999999876543 2 2345678999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhcchheecC-CCCceEEEeccccHHHHHHHH Q lcl|NC_021324. 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 229 ~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~--~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l 305 (480) ++|+|+||+++|++++.+++|++|+++++|++++.+... ....+++...++++.++ |++++|++++.+++++|+++| T Consensus 240 ~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~v~~~~~g~~~~~~q~~~~~~~~~~~~l 319 (488) T protein:vir:23 240 RSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAYMARILAFEGGEGAHAEQFSAAELRNFVDAL 319 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhhhhhhccCCCCCCceeEecCCCChHHHHHHH Confidence 999999999999999999999999999999987766433 34456788888888875 567899999999999999999 Q ss_pred HHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEe Q lcl|NC_021324. 306 EVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVW 384 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~v~f 384 (480) +.++++|++.+++|+++||+++.|++||+||++++..|++||+++++.|+.+|++++++++++.+.. ...++.+++++| T Consensus 320 ~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~f 399 (488) T protein:vir:23 320 DALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGGDIPTEYYRMETVW 399 (488) T ss_pred HHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcchhhccceEEe Confidence 9999999999999999999999999999999999999999999999999999999999999987643 345667899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHH---hhhcccCCCcCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTL---YSTTKAQADATPKPTVT 461 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 461 (480) +++.++|+++.+|+++|++++|.+++|+||+++++||+++++++|+++++++.+....++ .+.......++..+ T Consensus 400 ~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 476 (488) T protein:vir:23 400 RDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAP--- 476 (488) T ss_pred cCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCC--- Confidence 999999999999999999999999999999999999999999988876665554443333 33222222222221 Q ss_pred CCCccCCCCCCCC Q lcl|NC_021324. 462 ETKTETQTSPSGF 474 (480) Q Consensus 462 d~~~~~~~~~~~~ 474 (480) ..+..+.+|..+ T Consensus 477 -~~~~~~~e~~~a 488 (488) T protein:vir:23 477 -VGEPPAPEPDAA 488 (488) T ss_pred -CCCCCCCCCCCC Confidence 112222222222 No 8 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=1.2e-96 Score=546.47 Aligned_cols=465 Identities=22% Similarity=0.294 Sum_probs=386.9 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |++. .++|++|+.+|..+++|+.++.+||+|+|++++++..++++++++++++||+++|||+++++|.++||+++++++ T Consensus 18 l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~d~~~ 97 (504) T protein:vir:99 18 LNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRTATVLGWSAKAVDTLARRCNLESFVWPDGDY 97 (504) T ss_pred CCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHHhhccCcHHHHHHHHHhhhccceeeCCCCCh Confidence 6655 568999999999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ....++++|+.|+|+.++.++++++++|||||++||.++ +.++.++|+++||.+++++||+..+ ++.+++++|. T Consensus 98 ~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~----d~~~~~~I~~~sP~~~~~iyD~~~~-~~~~a~~~~~- 171 (504) T protein:vir:99 98 GSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGG----AGEPDSLIHVKSAMQATGEWNSRRN-AMDSLLSITS- 171 (504) T ss_pred hhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCC----CCCceeEEEEeccceeEEEEeCCCC-ceeEEEEEEE- Confidence 888999999999999999999999999999999999764 3345678999999999999998754 5666666554 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) .+..+..+.+++|+++.+++|...++. .+..+..+|++| ||||+|+|+++..+++|+|+|++.|++|+|+||+++ T Consensus 172 ~d~~g~~~~~~~y~~~~~~~~~~~~~~----~~~~~~~~~~~g-vPvV~~~n~~~~~~~~G~sei~~~v~~l~Da~~~~~ 246 (504) T protein:vir:99 172 RDAEGHPTGIALYEDGVTVTADMDDDG----DWHADVRTHKLG-VPVEVLPYKPREDRPLGSSRITRPVMSLQQRALKGC 246 (504) T ss_pred ecCCCeEEEEEEEcCCcEEEEEEcCCc----eeeeccccCCCC-cceEEecccccCccccCcccchhhHHHHHHHHHHHH Confidence 355667788999999999999876543 334567789998 999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccc--cchhhhhcchheecCC---------CCceEEEeccccHHHHHHHHHHH Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDGE--NTTLDIYYGRILTLAS---------EAAKISEFKAAELRNFAEEMEVF 308 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~--~~~~~~~~~~~~~~~g---------~~~~~~~~~~~~~~~~~~~l~~~ 308 (480) +++++..+||++|+++++|.+++++.+..+ ...++...+++|.+++ .+++|++++++++++|+++|+.+ T Consensus 247 ~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~ 326 (504) T protein:vir:99 247 IRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQFPASSPQPHIEMLEQI 326 (504) T ss_pred HHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeeecCCCChHHHHHHHHHH Confidence 999999999999999999998876644433 3467788888887653 35889999999999999999999 Q ss_pred HHHHHhhcCCChHHhcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEec Q lcl|NC_021324. 309 RKEAASITGLPPQYLSSSS-ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWR 385 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~~-~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~~~~~~~~~v~f~ 385 (480) +++|++++++|+++||..+ .|++||+||++++.+|..||+++++.|+.+|++++++++.+.+. ....++.+++++|+ T Consensus 327 i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~~~~~v~w~ 406 (504) T protein:vir:99 327 AMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEWKTIDSKFR 406 (504) T ss_pred HHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccceeEec Confidence 9999999999999999754 57899999999999999999999999999999999999988764 34566788999999 Q ss_pred CCCCCCHHHHHHHHHHHHhcccc-CCCHHHHHHhCCCChhHHHHHHHH-HHHHHHHHHHHHhhhcccCCCcCCCCCCCCC Q lcl|NC_021324. 386 DPSTPTVAAKADAVSKLYANGQG-PIPKEQARIDLGYTATQREQMRDW-DKQETEDMIDTLYSTTKAQADATPKPTVTET 463 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~-~~s~e~~~~~~~~~~d~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) ++.++|+++.||+++|++++|.. +.+.+++++++|++++++++++++ ++++....++++.+.......+...+ +++ T Consensus 407 d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~ 484 (504) T protein:vir:99 407 SPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQ--DQG 484 (504) T ss_pred CCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCC--CcC Confidence 99999999999999999998864 446789999999999999887644 44555566777766543222221111 111 Q ss_pred CccCCCCCCCCCcccCC Q lcl|NC_021324. 464 KTETQTSPSGFNRTKTR 480 (480) Q Consensus 464 ~~~~~~~~~~~~~~~~~ 480 (480) .++..++ ..+.+..| T Consensus 485 ~~e~a~~--~~~~~~~~ 499 (504) T protein:vir:99 485 AGEPPAN--EPPAALGR 499 (504) T ss_pred CCCCCCC--CCCccCCC Confidence 1111111 11122223 No 9 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=3.8e-95 Score=538.23 Aligned_cols=434 Identities=28% Similarity=0.411 Sum_probs=374.3 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |++. .+||++|+++|..+++|++++.+||+|+|++++.+...++.++++|+++|||++|||++++|+.++||+.+++ T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~g~~~~d~-- 78 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAVDALEERLDWLGWTNGDG-- 78 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHHHHHHhhhccccccCCCh-- Confidence 7665 6799999999999999999999999999999999999999999999999999999999999999999987653 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) +.++++|+.|+|+.++.+++++++++|+||++||. |++|.|++++++|.+++++||+...+...++++++ . T Consensus 79 --~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~------d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~-~ 149 (441) T protein:vir:80 79 --YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIP------HGDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQ-T 149 (441) T ss_pred --HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEe------CCCCceEEEEEccceEEEEEeCCCCceeEEEEEEE-E Confidence 56899999999999999999999999999999996 45799999999999999999987765555544444 3 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) .++ ...++++|+++.+++|...+... ....+..+|+||+||||+|+|+++..++||+|+|+++|++|||+||+++ T Consensus 150 ~~~--~~~~~~vy~~~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~liDa~~~~~ 224 (441) T protein:vir:80 150 CDP--EVVEAELLLPDVIVQVERRGSRE---WVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYTDEAVRTL 224 (441) T ss_pred ecC--ceEEEEEEecCeEEEEEEcCCcc---eeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHHHHHHHHH Confidence 332 24578999999999988766532 3456778999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC----CCCceEEEeccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) |++++++++|++|+++++|++++++..+. ++...++++.++ ++.+++++++.+++++|+++|+.++++|++. T Consensus 225 s~~~~~~~~~~~~~~~i~G~~~~~~~~~~----~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~ 300 (441) T protein:vir:80 225 LGQSVNRDFYAYPQRWVTGVSADEFSQPG----WVLSMASVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGE 300 (441) T ss_pred HHHHHHHHhhcCceeeeecCCccccccch----hhhcccccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcc Confidence 99999999999999999998876654432 455667777654 3357889999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC--cccceeeeEEecCCCCCCHH Q lcl|NC_021324. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV--TEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 316 ~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~--~~~~~~~~v~f~~~~~~~~~ 393 (480) +++|++.||+.+.|++||+||++++.+|..||+++++.|+++|++++++++.+.+... ...+.+++++|+++.|+|++ T Consensus 301 ~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~~f~~~~~~~~~ 380 (441) T protein:vir:80 301 AAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGLRWRDASTPTRA 380 (441) T ss_pred cCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeEEeCCCCCcCHH Confidence 9999999999888888999999999999999999999999999999999999977433 33457899999999999999 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcC Q lcl|NC_021324. 394 AKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT 455 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (480) +.+|+++|++++|.+++|++++++.+|+++++++++++++++ +++.+.++.+....++++- T Consensus 381 e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~~~~~~e~~e-~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 381 ATADAVTKLVGAGILPADSRTVLEMLGLDDVQVEAVMRHRAE-SSDPLAVLAGAISRQTNEV 441 (441) T ss_pred HHHHHHHHHHhcCcccccHHHHHHhCCCCHHHHHHHHHHHHH-HHHHHHHHhhhhhcccccC Confidence 999999999999988899999999999999999988765444 4445555555444333333 No 10 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=3.9e-95 Score=538.17 Aligned_cols=445 Identities=22% Similarity=0.266 Sum_probs=388.7 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |++. .+++.+|+++|..+++|+.++.+||+|+|++++++..+|++++++++++|||+++||+++++|.++||++++++. T Consensus 12 l~~~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d~~~ 91 (474) T protein:vir:81 12 LSNDENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNLGLVLGWTGKAVDALARRCNLEGFVWPDGDL 91 (474) T ss_pred CChhHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHHHhhcChHHHHHHHHHhhhcccceECCCCCc Confidence 6655 668999999999999999999999999999999999999999999999999999999999999999999988777 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ....++++|+.|+|+....++++++++|||||++|++++ |.++.++|+++||.+++++||+... ++.+++..+. T Consensus 92 ~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~----d~~~~~~i~~~sp~~~~~~~D~~~~-~~~~al~~~~- 165 (474) T protein:vir:81 92 DSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGE----DDEPEALIHVKDASEATGEWNRRRR-GLNNLLSIID- 165 (474) T ss_pred cchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCC----CCCceeEEEEeccceEEEEEeCCCC-cceeeeEEEE- Confidence 778899999999999999999999999999999999865 4466789999999999999999765 5566665553 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) .+..+..+.+++|+++.+++|...++. +.+..+..+|++| ||||+|+|+++..+++|+|+|++.|++|+|++|+++ T Consensus 166 ~~~~g~~~~~~ly~~~~~~~~~~~~~~---~~w~~~~~~~~~g-vPvV~~~n~~~~~~~~G~s~i~e~v~~l~da~~r~~ 241 (474) T protein:vir:81 166 KDKEGKVLSLALYLDNETVTAQRDKAT---LKWQVDRDEHVYG-VPAQVLPYKPAPKRPFGQSRITKPMMGLQDAGVREL 241 (474) T ss_pred EcCCCcEEEEEEEeCCcEEEEEEcCcc---ceeeeccCCCCCC-cceEEecccccccCcCCccccchhHHHHHHHHHHHH Confidence 445566778899999999998876542 2334577889998 899999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCcccccccc--ccchhhhhcchheecCCC---------CceEEEeccccHHHHHHHHHHH Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDG--ENTTLDIYYGRILTLASE---------AAKISEFKAAELRNFAEEMEVF 308 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~--~~~~~~~~~~~~~~~~g~---------~~~~~~~~~~~~~~~~~~l~~~ 308 (480) +++.+..+||++|+++++|.+++.+.+.. ....|+...+++|.++++ .++|+|++++++++|++.|+.+ T Consensus 242 ~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~~a~l~~~~~~l~~~ 321 (474) T protein:vir:81 242 ARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADIPQLARADVKQFPAASPDAHWSDINGL 321 (474) T ss_pred HHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccccccccccccccCCCChhHHHHHHHHH Confidence 99999999999999999999987765433 345678888888877543 3679999999999999999999 Q ss_pred HHHHHhhcCCChHHhccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccceeeeEE Q lcl|NC_021324. 309 RKEAASITGLPPQYLSSS-SENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEYTRLETV 383 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~-~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----~~~~~~~~~~v~ 383 (480) +++|++.+++|+++||.. +.|++||+||++++..|..|++++++.|+.+|++++++++.+.|+ ....++..+++. T Consensus 322 ~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~ 401 (474) T protein:vir:81 322 AKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAK 401 (474) T ss_pred HHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeE Confidence 999999999999999964 588999999999999999999999999999999999999999874 334557789999 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHH-HHHHHHHHHHHHHHhhhcccCCCcC Q lcl|NC_021324. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMR-DWDKQETEDMIDTLYSTTKAQADAT 455 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (480) |+++.++++++.||+++|++++|.++++++++++++|++++++++++ ++++++..+.++++.+....++.+. T Consensus 402 W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 402 WRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred ecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 99999999999999999999999999999999999999999999876 5556667778887776544332222 No 11 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=3.3e-92 Score=522.11 Aligned_cols=455 Identities=23% Similarity=0.311 Sum_probs=373.6 Q ss_pred CCCH--HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhc--ceeccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MTTY--HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~--~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |+.. .+++.+|+.+|..+++|++++++||+|+|+++.+++..++.++. .++++|||++|||++++||+++||+.++ T Consensus 23 ~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~d 102 (501) T protein:vir:25 23 MSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRNAL 102 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhcccceecCC Confidence 4433 55788999999999999999999999999999999888887764 3578899999999999999999999876 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEE-eCCCCCceEEEEE Q lcl|NC_021324. 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL-DPRNTRRVTRAVR 155 (480) Q Consensus 77 d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~-d~~~~~~~~~~~~ 155 (480) ++ ..+.++++|+.|+|+.++.++++++++||+||++||.++ ++ ++++++||.+++++| |+..++.+.++++ T Consensus 103 ~~-~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de------~~-~~i~~~sp~~~~~iy~D~~~~~~~~~ai~ 174 (501) T protein:vir:25 103 AK-ENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTD------EG-PVFRTRSPRQILAVYADPSVDAWPQYALE 174 (501) T ss_pred cc-chHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCC------CC-CeEEEeccccEEEEEecCCCCcceeEEEE Confidence 55 467899999999999999999999999999999999753 34 578999999999999 5567778999999 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCCc-------c----------ccccccccccccccccceEEeecccccCcc Q lcl|NC_021324. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLN-------D----------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~----------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~ 218 (480) +|....+.+...++++|+++.+++|....... . .........+|+||.||||+|+|+++. .. T Consensus 175 ~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~-~~ 253 (501) T protein:vir:25 175 TWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDA-DD 253 (501) T ss_pred EEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCcccc-Cc Confidence 99888777777889999998888776442100 0 001122456899999999999999876 45 Q ss_pred CCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCCCceEEEeccccH Q lcl|NC_021324. 219 YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 219 ~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) +|+|+|++ |++|+|+||+++|++++.++++++|+++++|++.+.+. .+++..+++|.++|++++|++++.+++ T Consensus 254 ~g~sdie~-v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~------~~~~~~~~i~~~~~~~~~~~q~~~~~~ 326 (501) T protein:vir:25 254 MIVGEVAP-LILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAE------VLKASALRVWTFEDPEVKAQAFPPASV 326 (501) T ss_pred cccchhhh-hHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccc------hhhhcccceeccCCCCceEEEecccCh Confidence 79999974 99999999999999999999999999999998765432 467888999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021324. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 378 (480) ++|+++++.++++|++.+++|+++||+.+.| +||+||++++.+|.+||.++++.|+++|++++++++.+.|.....+.. T Consensus 327 ~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N-~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~ 405 (501) T protein:vir:25 327 EPYNLILEEMLQHVAMVAQISPAQVTGKMIN-VSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADS 405 (501) T ss_pred HHHHHHHHHHHHHHHhhcCCChhhhccccCC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccce Confidence 9999999999999999999999999987666 699999999999999999999999999999999999999877666678 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHHHHHHHHHH-HHHHHHhhhcccCCCcCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMRDWDKQETE-DMIDTLYSTTKAQADATP 456 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~-~~~~d~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 456 (480) ++++.|+++.|+|+++.||+++|++++| +|.++++.++ |++++++++++++++++.. ..++++.+....+..+.+ T Consensus 406 ~i~v~w~~~~~~s~~~~ada~~kl~~~g---is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 482 (501) T protein:vir:25 406 GAEVLWRDTEARSFGAVVDGITKLASAG---IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPPP 482 (501) T ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhcC---CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCCC Confidence 8999999999999999999999999875 6889988875 6777777877766555443 345555443332322333 Q ss_pred CCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 457 KPTVTETKTETQTSPSGFN 475 (480) Q Consensus 457 ~~~~~d~~~~~~~~~~~~~ 475 (480) .....+++.++...|+++. T Consensus 483 ~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 483 PQAAAQALNEGGVNGNGGA 501 (501) T ss_pred CCCCccccccccCCCCCCC Confidence 3333333333333333333 No 12 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=6.6e-90 Score=509.51 Aligned_cols=399 Identities=19% Similarity=0.218 Sum_probs=350.6 Q ss_pred HHHHHHHHHHHHHHhhccCcchhcCcccchhhh-cceeccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHHHhcCH Q lcl|NC_021324. 15 LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQANDL 93 (480) Q Consensus 15 ~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~~~~~n~~ 93 (480) +..+.+|+.++.+||+|+|++++++..+|++++ ++++++|||+++||+++++|.++||+.++ ..++++|+.|+| T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~Gf~~~d-----~~l~~i~~~N~l 75 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRAFANDD-----FNVTEIFDRNNP 75 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHHHhhcchhHHHHHHhHhhhccccccCCC-----chHHHHHhhcCh Confidence 667789999999999999999999999999887 67899999999999999999999998654 248999999999 Q ss_pred HHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEe Q lcl|NC_021324. 94 DEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYL 173 (480) Q Consensus 94 ~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 173 (480) +..+.++++++++|||||++|++++ +|.|+|+++||.+++++||+.. +++.++++++.. ++.+..+.+.+|+ T Consensus 76 d~~~~~~~~~al~~G~sf~~v~~~~------d~~~~i~~~sP~~~~~i~Dp~~-~~~~~al~~~~~-~~~~~~~~~~~~~ 147 (410) T protein:vir:95 76 DIFFDSAILSALIGSCSFVYISKGE------DDEVRLQVIESSNATGVIDPIT-GLLVEGYAVLAR-DDYNRPTLEAYFE 147 (410) T ss_pred HHHHHHHHHHHHHhCceeEEEecCC------CCceEEEEEcccceEEEEeCCC-CceEEEEEEEEe-cCCCeEEEEEEEe Confidence 9999999999999999999999753 6789999999999999999854 678888877643 4556778899999 Q ss_pred CCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchh Q lcl|NC_021324. 174 PDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPL 253 (480) Q Consensus 174 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~ 253 (480) ++.+++|...++. ...+|++|+||||+|+|++++.++||+|+|++.|++|+|++|++++++++..+||++|+ T Consensus 148 ~~~~~~~~~~~~~--------~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~e~~a~pq 219 (410) T protein:vir:95 148 PNATHFIPKDGEP--------YSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITAEFYSWPQ 219 (410) T ss_pred CCcEEEEeeCCcc--------ccccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchh Confidence 9999998875432 24689999999999999999999999999998999999999999999999999999999 Q ss_pred hhhcCCCccccccccccchhhhhcchheecCC----CCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccccc Q lcl|NC_021324. 254 RVISGVTTDELTNDGENTTLDIYYGRILTLAS----EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN 329 (480) Q Consensus 254 l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g----~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~ 329 (480) ++++|.+.+.+ ....++...+++|.++. +.++|+|++++++++|+++|+.++++|++++++|+++||+.+.| T Consensus 220 r~i~G~d~d~~----~~~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~N 295 (410) T protein:vir:95 220 KYILGLDPDAE----PMEKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSDN 295 (410) T ss_pred heeeccCCCCC----cCchhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccCc Confidence 99999865433 23357788888888743 35899999999999999999999999999999999999999999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEec---CCCCCCHHHHHHHHHHHHh Q lcl|NC_021324. 330 PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWR---DPSTPTVAAKADAVSKLYA 404 (480) Q Consensus 330 ~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~v~f~---~~~~~~~~~~ad~~~kl~~ 404 (480) ++||+||++++.+|..|++++++.|+.+|++++++++.+.+.. ...++.+++|.|+ ++..+++++.||+++||++ T Consensus 296 psSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~ 375 (410) T protein:vir:95 296 PSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANTMTMIGDGVVKLNQ 375 (410) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhhHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999987643 3456678999998 7888899999999999999 Q ss_pred ccccCCCHHHHHHhCCCChhHHHHHH-HHHHHHHH Q lcl|NC_021324. 405 NGQGPIPKEQARIDLGYTATQREQMR-DWDKQETE 438 (480) Q Consensus 405 ~~~~~~s~e~~~~~~~~~~d~~~~~~-~~~~~~~~ 438 (480) +++++++++++++.+||+++++.++. ++++++.+ T Consensus 376 a~~g~~~~~~~~~~lg~~~~~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 376 ALPGYINAETIRDLTGIAGDMSAKPVVSEGGSNGE 410 (410) T ss_pred hccCCccHHHHHHhcCCChHHHHHHHHHHHHhCCC Confidence 99999999999999999988765543 22222111 No 13 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=1.3e-87 Score=496.90 Aligned_cols=460 Identities=16% Similarity=0.233 Sum_probs=359.6 Q ss_pred CCCHH--H-HHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchh---hhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYH--E-HVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPE---LAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~--~-~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~---~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ||++. + ++.+|+.+|..+++|++++++||+|+|+|++.+...++. .-.+++++|||++|||++++|++++||+. T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~ 88 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRK 88 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccC Confidence 77653 3 345789999999999999999999999998776544332 12345689999999999999999999997 Q ss_pred CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 75 SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 75 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ++++ ..+.++++|+.|+|+.++.++++++++||+||++||++... .|++|.+++++++|.+++++||+....... + T Consensus 89 ~d~~-~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~-~d~~g~~~i~~~~p~~~~~iydd~~~~~~~--~ 164 (479) T protein:vir:99 89 TGTN-ENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISP-LDGTTVARIKCIDPRDAFAIWEDPYWDEWP--K 164 (479) T ss_pred CCch-hhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-cCCCCceEEEEechhheEEEecCCccccee--e Confidence 7554 56789999999999999999999999999999999976432 467899999999999999999765543322 2 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHH Q lcl|NC_021324. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~ 234 (480) ++...+.. ..+.+|+.+.++.|...++. ....+..+|+||+||||+|+|+++. +++|+|+|+ .+++|||+ T Consensus 165 -~~~~~~~~---~~~~~~~~~~~~~~~~~~~~----~~~~~~~~h~~g~vPvv~f~n~~~~-~~~g~sd~e-~v~~liDa 234 (479) T protein:vir:99 165 -YLLERQPN---GQYWWWTEEDYSIFEFKQGK----FIYRETVSHDYGHIPFVRYVNVMDL-RGVCYGDVE-PLVTVAKA 234 (479) T ss_pred -EEEeecCc---eeEEEEecceEEEEEecCCc----eeeccccccCCCCcceEEeecCCCc-CcCCcchhH-HHHHHHHH Confidence 22222222 24567888888777765442 2334678999999999999999887 568999996 59999999 Q ss_pred HHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHh Q lcl|NC_021324. 235 ASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAAS 314 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~ 314 (480) ||+++|++++.+++|++|+++++|....+.... ....+++..++++..+|+++++++++.+++++|+++++.++++|++ T Consensus 235 ~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~ 313 (479) T protein:vir:99 235 IDKTGLDILLVQHHQSFQIRWATGLMLPEGANA-DQEKMRFAQESMLISQNEKASFGAIPAAPLDGLLNAYKESLLEFLA 313 (479) T ss_pred HHHHHHHHHHHHHHhhchhhhhcCCCccccccc-chhccccccccceeecCCCceEEEecccchHHHHHHHHHHHHHHhc Confidence 999999999999999999999999876544332 2334566778888888999999999999999999999999999999 Q ss_pred hcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021324. 315 ITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 315 ~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~ 394 (480) ++++|+++||.. + ++||+||++++.+|+.||+.+++.|+.+|++++++++.+.+.....+...++++|+++.++|+++ T Consensus 314 ~t~~p~~~~g~~-~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~~~~~~s~~~ 391 (479) T protein:vir:99 314 LAQLPPHIAGQI-V-NVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQDVTIQSLAQ 391 (479) T ss_pred cCCCCHHHcccc-c-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeeeEEecCCCCCCHHH Confidence 999999999853 2 37999999999999999999999999999999999999998776666778999999999999999 Q ss_pred HHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHHHHHHHHHH--HHHHHHhhhccc-CCCcCCCC-CCCCCCccCCC Q lcl|NC_021324. 395 KADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMRDWDKQETE--DMIDTLYSTTKA-QADATPKP-TVTETKTETQT 469 (480) Q Consensus 395 ~ad~~~kl~~~~~~~~s~e~~~~~~-~~~~d~~~~~~~~~~~~~~--~~~~~~~~~~~~-~~~~~~~~-~~~d~~~~~~~ 469 (480) .+|+++||+++| ++|.||+++++ |++++++++++++++++.+ ...+.+.+.... +...+..+ ...++.++.++ T Consensus 392 ~ad~~~kl~~ag--~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (479) T protein:vir:99 392 FADAWAKMVESL--KIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGPNGATNMQQANNKTG 469 (479) T ss_pred HHHHHHHHHhcC--CCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCCCCCCCCCCCCCCCc Confidence 999999999875 68999999998 5667667777655544433 233333322111 11111111 11122233344 Q ss_pred CCCCCCcccC Q lcl|NC_021324. 470 SPSGFNRTKT 479 (480) Q Consensus 470 ~~~~~~~~~~ 479 (480) +|.+.|++-. T Consensus 470 ~~~~~~~~~~ 479 (479) T protein:vir:99 470 EPASLNKSGA 479 (479) T ss_pred chhccCCCCC Confidence 5555555444 No 14 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=6.6e-89 Score=504.03 Aligned_cols=399 Identities=19% Similarity=0.211 Sum_probs=354.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhh-cceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |+ .++|.+|.+++..+++|+.++.+||+|+|++++++..+|++++ ++++++|||++|||++++++.++||+.++ T Consensus 1 ~~--~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d--- 75 (409) T protein:vir:94 1 MT--EKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDD--- 75 (409) T ss_pred CC--HHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHHhhhcchhHHHHHHhHhhcccCcccCCc--- Confidence 64 6899999999999999999999999999999999999998875 67899999999999999999999998542 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..++++|+.|+|+....++++++++|||||++|+++ ++|+|+|+++||.+++++||+..+ ++.++++++.. T Consensus 76 --~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~------~dg~~~i~~~sp~~~~~i~D~~~~-~~~~a~~~~~~ 146 (409) T protein:vir:94 76 --FTVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKG------ENDAVRLQVIEAVNATGIIDPITG-LLTEGYAVLER 146 (409) T ss_pred --hHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecC------CCCceEEEEeccceEEEEEecCCC-ceeeeEEEEEe Confidence 358999999999999999999999999999999975 378899999999999999998654 68888877643 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) +..+..+...+|+++.++++...++. ....+|++|+||||+|+|++++.++||+|+|++.|++|+|++|+++ T Consensus 147 -d~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da~~r~~ 218 (409) T protein:vir:94 147 -DENNNVVLEAHFLPDRTDYYYRDSRN-------NISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTL 218 (409) T ss_pred -cCCCceEEEEEEecCcEEEEEecCce-------eEeeeCCCCCcceEEeccccccccccCccccchhHHHHHHHHHHHH Confidence 44556677889999999998876543 1335799999999999999999999999999988999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC----CCCceEEEeccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) +++++..+||++|+++++|.+.+. .....++...+++|.++ |++++|++++++++++|+++|+.++++|+++ T Consensus 219 ~~~~~~~e~~a~pqr~i~G~d~d~----~~~~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~ 294 (409) T protein:vir:94 219 ERADVTAEFYSFPQKYVTGLSDDA----EPMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGE 294 (409) T ss_pred HHHHHHHHHhcChhheeEecCCCC----cccchhhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhh Confidence 999999999999999999986433 23346788888888763 4568999999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEecCC---CCC Q lcl|NC_021324. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWRDP---STP 390 (480) Q Consensus 316 ~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~v~f~~~---~~~ 390 (480) +++|+++||+.+.|++||+||++++.+|..|++++++.|+++|++++++++.+.+.. ...++.++++.|++. ..+ T Consensus 295 t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~~~~~~ 374 (409) T protein:vir:94 295 TGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKPKWEPLFEADAS 374 (409) T ss_pred cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceEEeccCCCcchH Confidence 999999999999999999999999999999999999999999999999999987753 345668899999954 455 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH Q lcl|NC_021324. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ 425 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~ 425 (480) ++++.||+++||+++|.++.+++++++++||++++ T Consensus 375 ~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 375 MLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred HHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 57888999999999999999999999999999887 No 15 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=2.2e-88 Score=501.17 Aligned_cols=399 Identities=20% Similarity=0.214 Sum_probs=354.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhh-cceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |+ .++|.+|.+++..+++|+.++.+||+|+|++++++...|++++ ++++++|||++|||++++++.++||+.++ T Consensus 1 ~~--~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~iVds~a~rl~~~Gf~~~d--- 75 (409) T protein:vir:16 1 MT--EKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQYRSILGWCAKGVDSLADRLVFREFENDD--- 75 (409) T ss_pred CC--HHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHHhhhcChhHHHHHHhHhhcccccccCcc--- Confidence 64 6899999999999999999999999999999999999999885 67899999999999999999999998532 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..++++|+.|+|+....++++++++|||||++|++++ +|+|+|+++||.+++++||+... ++.+++++|. T Consensus 76 --~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~------dg~~~i~~~sP~~~~~i~D~~~~-~~~~a~~~~~- 145 (409) T protein:vir:16 76 --FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGE------NDAVRLQVIEATNATGIIDPITG-LLTEGYAVLE- 145 (409) T ss_pred --hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCC------CCceEEEEEcccceEEEeecccc-cceeeeEEEE- Confidence 4589999999999999999999999999999999753 78899999999999999998654 6667776664 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) .+..+..+..++|+++.++++...++.. ...+|++|+||||+|+|++++.++||+|+|++.|++|+|++|+++ T Consensus 146 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da~~r~~ 218 (409) T protein:vir:16 146 RDENNNVVLEAHFLPDRTDYYYRDSRNN-------ISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSNAKRTL 218 (409) T ss_pred ecCCCceEEEEEEecCcEEEEEecCccc-------cceecCCCCcceEEecccccccccCCccccchhHHHHHHHHHHHH Confidence 3445556778999999998887765431 345799999999999999999999999999988999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC----CCCceEEEeccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) +++++..+||++|+++++|.+.+.. ....++...+++|.++ |++++|+|++++++++|+++++.++++|+++ T Consensus 219 ~~~~~~~e~~a~pqr~i~G~d~d~~----~~~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~ 294 (409) T protein:vir:16 219 ERADVTAEFYSFPQKYVTGLSDDAE----PMETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGE 294 (409) T ss_pred HHHHHHHHHhcChhheeEecCCCCC----ccchhhhhhhHhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhh Confidence 9999999999999999999865332 3345788888898763 4568999999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCC---CC Q lcl|NC_021324. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPS---TP 390 (480) Q Consensus 316 ~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~~~~~~~~~v~f~~~~---~~ 390 (480) +++|+++||+++.|++||+||++++.+|..|++++++.|+.+|++++++++.+.+. .....+.++++.|+++. .+ T Consensus 295 s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v~W~~~~~~~~~ 374 (409) T protein:vir:16 295 TGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKPKWEPLFEADAS 374 (409) T ss_pred cCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceEEecCCCCcchh Confidence 99999999999999999999999999999999999999999999999999999775 33455678999999655 56 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH Q lcl|NC_021324. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ 425 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~ 425 (480) ++++.||+++||+++|+++..++++++++||++++ T Consensus 375 s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 375 MLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred hHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 68999999999999999999999999999999887 No 16 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=2e-85 Score=484.88 Aligned_cols=432 Identities=21% Similarity=0.227 Sum_probs=347.1 Q ss_pred CC--CHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhc--ceeccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MT--TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~--~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) || |+.+++++|+.+|..+++|++++++||+|+|+|+++++..+++++. +|+++|||++|||+.++|++++||++++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 65 4578999999999999999999999999999999998888887764 6899999999999999999999998754 Q ss_pred --CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 77 --DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 77 --d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) |.+..+.++++|++|+++.++.++++++++||+||++||.+ ++|.+++++++|.+++++||+...+++.+++ T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d------~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i 154 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR------DDGTATITADSPETMVVSVDPLQPWRIRAAM 154 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC------CCCceEEEEEccceeEEEEcCCCCcceEEEE Confidence 45566789999999999999999999999999999999964 5789999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEec-------CC-----CccccccccccccccccccceEEeecccccCccCCCc Q lcl|NC_021324. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRN-------GG-----LNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~-----~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s 222 (480) ++|.+.++. ..+.++|.++.+.+|... .. ....+ ......+|++|.|||++|.| ++|.| T Consensus 155 ~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~pvv~~~N------~~g~g 225 (456) T protein:vir:10 155 RWWRDLDAE--SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSW-VPVGDAVVTGSPPPVVVYQN------PDGMG 225 (456) T ss_pred EEEEecCCc--eeEEEEEeccceeEEEEEEEEeecccceeeeecCCce-eeccccCCCCCceeEEEecC------CCCCc Confidence 998765443 345566666554443221 00 01111 12344689999999987754 46889 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhhhcchheecCCCCceEEEeccc Q lcl|NC_021324. 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDIYYGRILTLASEAAKISEFKAA 296 (480) Q Consensus 223 ~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~~~~~~~~~~ 296 (480) ++++ +++|||+||+++|++++.++++++|+++++|...+.+..+..+ ..++...+++|.++ ++++|++++.+ T Consensus 226 d~e~-vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~~~ 303 (456) T protein:vir:10 226 EVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQAN 303 (456) T ss_pred hhhh-hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCC-CCcceEEeccc Confidence 9965 9999999999999999999999999999999876554322222 23566777788765 56899999999 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021324. 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 297 ~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 376 (480) ++++|+++++.++++|++.+++|++.||+.+.| +||+||++++.+|++||..+++.|+++|++++++++++.|.. + T Consensus 304 ~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~ 379 (456) T protein:vir:10 304 DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---V 379 (456) T ss_pred ChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---c Confidence 999999999999999999999999999987766 699999999999999999999999999999999999887733 3 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCC Q lcl|NC_021324. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ...++++|+++.|+|+++.||+++|++++| ++|.+++++++|++++++++++..+..++.+ ++..... ..+ T Consensus 380 ~~~~~v~w~~~~~~~~~~~ada~~kl~~~g--i~~~~~~~~~lg~~~~~i~~~e~er~~~e~~---~~~~~~~----~~~ 450 (456) T protein:vir:10 380 EDTVDVSFESPDRVTLGEKYSAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQIT---LFAGNPV----QRP 450 (456) T ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHHcC--CChHHHHHhhCCCCHHHHHHHHHHHHHHHHH---HHhhhhh----hcC Confidence 457999999999999999999999998875 6899999999999998876543222222111 1111111 111 Q ss_pred CCCCCC Q lcl|NC_021324. 457 KPTVTE 462 (480) Q Consensus 457 ~~~~~d 462 (480) ++.++- T Consensus 451 ~~~~~~ 456 (456) T protein:vir:10 451 QEDGSR 456 (456) T ss_pred CCCCCC Confidence 111111 No 17 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=2e-85 Score=484.88 Aligned_cols=432 Identities=21% Similarity=0.227 Sum_probs=347.1 Q ss_pred CC--CHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhc--ceeccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MT--TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~--~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) || |+.+++++|+.+|..+++|++++++||+|+|+|+++++..+++++. +|+++|||++|||+.++|++++||++++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 65 4578999999999999999999999999999999998888887764 6899999999999999999999998754 Q ss_pred --CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 77 --DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 77 --d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) |.+..+.++++|++|+++.++.++++++++||+||++||.+ ++|.+++++++|.+++++||+...+++.+++ T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d------~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i 154 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR------DDGTATITADSPETMVVSVDPLQPWRIRAAM 154 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeC------CCCceEEEEEccceeEEEEcCCCCcceEEEE Confidence 45566789999999999999999999999999999999964 5789999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEec-------CC-----CccccccccccccccccccceEEeecccccCccCCCc Q lcl|NC_021324. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRN-------GG-----LNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~-----~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s 222 (480) ++|.+.++. ..+.++|.++.+.+|... .. ....+ ......+|++|.|||++|.| ++|.| T Consensus 155 ~~~~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~pvv~~~N------~~g~g 225 (456) T protein:vir:10 155 RWWRDLDAE--SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSW-VPVGDAVVTGSPPPVVVYQN------PDGMG 225 (456) T ss_pred EEEEecCCc--eeEEEEEeccceeEEEEEEEEeecccceeeeecCCce-eeccccCCCCCceeEEEecC------CCCCc Confidence 998765443 345566666554443221 00 01111 12344689999999987754 46889 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhhhcchheecCCCCceEEEeccc Q lcl|NC_021324. 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDIYYGRILTLASEAAKISEFKAA 296 (480) Q Consensus 223 ~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~~~~~~~~~~ 296 (480) ++++ +++|||+||+++|++++.++++++|+++++|...+.+..+..+ ..++...+++|.++ ++++|++++.+ T Consensus 226 d~e~-vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~q~~~~ 303 (456) T protein:vir:10 226 EVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQAN 303 (456) T ss_pred hhhh-hHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCC-CCcceEEeccc Confidence 9965 9999999999999999999999999999999876554322222 23566777788765 56899999999 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021324. 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 297 ~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 376 (480) ++++|+++++.++++|++.+++|++.||+.+.| +||+||++++.+|++||..+++.|+++|++++++++++.|.. + T Consensus 304 ~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N-~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~---~ 379 (456) T protein:vir:10 304 DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---V 379 (456) T ss_pred ChhHHHHHHHHHHHHHHhccCCChHHhcccccC-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---c Confidence 999999999999999999999999999987766 699999999999999999999999999999999999887733 3 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCC Q lcl|NC_021324. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ...++++|+++.|+|+++.||+++|++++| ++|.+++++++|++++++++++..+..++.+ ++..... ..+ T Consensus 380 ~~~~~v~w~~~~~~~~~~~ada~~kl~~~g--i~~~~~~~~~lg~~~~~i~~~e~er~~~e~~---~~~~~~~----~~~ 450 (456) T protein:vir:10 380 EDTVDVSFESPDRVTLGEKYSAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQIT---LFAGNPV----QRP 450 (456) T ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHHcC--CChHHHHHhhCCCCHHHHHHHHHHHHHHHHH---HHhhhhh----hcC Confidence 457999999999999999999999998875 6899999999999998876543222222111 1111111 111 Q ss_pred CCCCCC Q lcl|NC_021324. 457 KPTVTE 462 (480) Q Consensus 457 ~~~~~d 462 (480) ++.++- T Consensus 451 ~~~~~~ 456 (456) T protein:vir:10 451 QEDGSR 456 (456) T ss_pred CCCCCC Confidence 111111 No 18 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=2.6e-86 Score=489.79 Aligned_cols=412 Identities=18% Similarity=0.235 Sum_probs=347.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhh-cceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELA-YLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~-~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) |+ ...|.+|+.++..+++|++++.+||+|+|++++++..++++++ .+++++|||+++||++++++.++||+.++ T Consensus 1 m~--~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~Vd~~a~rl~~~Gf~~~d--- 75 (422) T protein:vir:97 1 MN--YMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKGVDSLADRIIFREFTNDD--- 75 (422) T ss_pred CC--hHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHHHHHHHhccccceeeCCc--- Confidence 54 4678999999999999999999999999999999999999886 44788899999999999999999998653 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..++++|+.|+|+..+.++++++++|||||++|++++ .+|.|+++++||.+++++||+... ++.+++++|. T Consensus 76 --~~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~-----~~~~p~i~~~sp~~~~~i~D~~~~-~~~~a~~~~~- 146 (422) T protein:vir:97 76 --FNAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGA-----EDGLPKMQVIEASKATGILDPTTF-LLTEGYAILE- 146 (422) T ss_pred --hhHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCC-----CCCeeEEEEechhhEEEEEeCCCC-cceeeEEEEE- Confidence 3589999999999999999999999999999999753 357899999999999999998765 5556665553 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) .+..+..+.+.+|++..++++... +. ....+|++|+||||+|+|+++..++||+|+|++.|++|+|+||+++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~-~~-------~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~l~da~~r~~ 218 (422) T protein:vir:97 147 SDSNGNPTLEAYFTDKDIWYYPKK-GK-------PYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMYHQKAAKRTL 218 (422) T ss_pred ecCCCcEEEEEEEcCceEEEEcCC-Cc-------cccccCCCCCcceEEecccCCCccccCccccchhHHHHHHHHHHHH Confidence 344455555555555555444432 21 1246899999999999999999999999999988999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC----CCCceEEEeccccHHHHHHHHHHHHHHHHhh Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFKAAELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~ 315 (480) +++.+..+||++|+++++|++.+... ...++...+++|.++ |+.++|++++++++++|+++|+.++++|+++ T Consensus 219 ~~~~~~~e~~a~pqr~i~G~d~d~~~----~~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~ 294 (422) T protein:vir:97 219 ERAEVTAEFYSFPQKYVLGMDPDAKP----MEKWRATVSTLLEISKDEDGDKPTVGQFTTASMAPFMEHLKMYASLFAGG 294 (422) T ss_pred HHHHHHHHHhcchhhhhcccCccccc----CchhhhhhhhhhccCCCCCCCcceeeecCCCChhHHHHHHHHHHHHHhcc Confidence 99999999999999999998754332 335777778888764 3458999999999999999999999999999 Q ss_pred cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEecCCCCCC-- Q lcl|NC_021324. 316 TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWRDPSTPT-- 391 (480) Q Consensus 316 ~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~v~f~~~~~~~-- 391 (480) +++|+++||+.+.|++||+||++++.+|..||+++++.|+.+|++++++++++.+.. ....+.++++.|++..+.+ T Consensus 295 s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~~~w~p~~~~~~~ 374 (422) T protein:vir:97 295 SGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFMDTVIKWEPLFEADAN 374 (422) T ss_pred cCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhccceEEEccCCCCChH Confidence 999999999999999999999999999999999999999999999999999998753 3445678999999665555 Q ss_pred -HHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021324. 392 -VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 392 -~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~ 439 (480) +++.||+++|+++++.++++++++++++||++.+.+ +.+..+++++- T Consensus 375 s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~-~~~~~~~~~d~ 422 (422) T protein:vir:97 375 MLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADKP-IPAITEVTTDG 422 (422) T ss_pred HHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhHH-HHHHHhhhccC Confidence 788999999999999999999999999999766443 22333332211 No 19 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=7.3e-85 Score=481.84 Aligned_cols=433 Identities=20% Similarity=0.201 Sum_probs=350.0 Q ss_pred CC--CHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhc--ceeccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MT--TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~--~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~--~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |+ |+.+++++|+.+|..+++|++++.+||+|+|+|+++++..+++++. +++++||+++|||++++|++++||+++. T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 80 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 54 5678999999999999999999999999999999998888887763 4688999999999999999999998753 Q ss_pred --CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 77 --DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 77 --d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) |.+..+.++++|++|+|+.++.+++++++++|+||+++|++ ++|.+++++++|.+++++||+...+.+.+++ T Consensus 81 ~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~------edg~~~i~~~~p~~~~~i~d~~~~~~~~~~~ 154 (456) T protein:vir:79 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR------DDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) T ss_pred CCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeC------CCCceEEEEeccceeEEEEcCCCCCceEEEE Confidence 44567789999999999999999999999999999999974 4788999999999999999999999999999 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCC-----------ccccccccccccccccccceEEeecccccCccCCCcc Q lcl|NC_021324. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGL-----------NDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~ 223 (480) ++|.+.++ ...+..+|+++.+++|...... ...........+|++|.||||+|.| ++|.|+ T Consensus 155 ~~~~~~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N------~~~~gd 226 (456) T protein:vir:79 155 RWWRDLDA--ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN------PDGMGE 226 (456) T ss_pred EEEEecCC--ceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecC------CCCCch Confidence 99875543 3456778888887766442110 0011122334689999999998854 568889 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhhhcchheecCCCCceEEEecccc Q lcl|NC_021324. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDIYYGRILTLASEAAKISEFKAAE 297 (480) Q Consensus 224 i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~~~~~~~~~~~ 297 (480) |++ +++|||+||+++|++++.++++++|+++++|.....+..+..+ ..++...+++|..+ +++++++++.++ T Consensus 227 ~e~-v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~-~~~~~~q~~~~~ 304 (456) T protein:vir:79 227 VEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQTND 304 (456) T ss_pred hhh-hHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCC-CCcceeeecccC Confidence 975 9999999999999999999999999999999876554333222 23555677777765 568999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 377 (480) +++|+++++.++++|++.+++|++.||+.+.| +||+||++++.+|++||+.+++.|+++|++++++++.+.|.. +. T Consensus 305 ~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~---~~ 380 (456) T protein:vir:79 305 FTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---VE 380 (456) T ss_pred hHHHHHHHHHHHHHHHhhcCCChhHhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---cc Confidence 99999999999999999999999999987665 699999999999999999999999999999999999888742 34 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) .+++|+|+++.++|+++.||+++|++++| ++|.+++++.+|++++++++++..+..++.+. +.....+ .++ T Consensus 381 ~~i~v~w~~~~~~s~~~~ada~~kl~~~G--~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~---~~~~~~~----~~~ 451 (456) T protein:vir:79 381 DTVDVSFESPDRVTLGEKYSAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITL---FAGNPVQ----RPQ 451 (456) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHHhcC--CChHHHHHhcCCCCHHHHHHHHHHHHHHHHHH---HhhhHhh----cCC Confidence 67999999999999999999999998875 68999999999999988765432222222111 1111111 111 Q ss_pred CCCCC Q lcl|NC_021324. 458 PTVTE 462 (480) Q Consensus 458 ~~~~d 462 (480) +.+.- T Consensus 452 ~~~~~ 456 (456) T protein:vir:79 452 EDGSR 456 (456) T ss_pred CCCCC Confidence 11111 No 20 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=2.6e-82 Score=467.88 Aligned_cols=460 Identities=13% Similarity=0.110 Sum_probs=350.6 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc-CCCc Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDS 78 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~ 78 (480) ++.. .++|.+++++|..+.+|++++.+||+|+|+|.+... ....++++|+++||+++||++.++||+++++++ ++++ T Consensus 13 ~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~~~ 91 (499) T protein:vir:10 13 VNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEF-DNATVEAANVMVNHAKYITDMNVGFMTGNPVKYVAEKG 91 (499) T ss_pred hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCc-CcCCCCcceeecchHHHHHHHHhhhhcccCceeecCCh Confidence 2222 567999999999999999999999999999976533 334456899999999999999999999999875 4566 Q ss_pred hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccC-----------CCCceeEEEEEcccceEEEEeCCCC Q lcl|NC_021324. 79 EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG-----------DPAGIPLIRVESPLYMYAELDPRNT 147 (480) Q Consensus 79 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~-----------d~~g~~~~~~~~p~~~~~v~d~~~~ 147 (480) +..+.|+++|+.|+|+.++.++++.++++|+||++||.++.+.. .....++++.++|.+++|+|++... T Consensus 92 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d~~~ 171 (499) T protein:vir:10 92 KNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDDTVE 171 (499) T ss_pred hHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEecCCCC Confidence 67889999999999999999999999999999999998653311 1233578999999999999999888 Q ss_pred CceEEEEEEEEEec--CCceEEEEEEEeCCcEEEEEecCCCcc-ccccccccccccccccceEEeecccccCccCCCccc Q lcl|NC_021324. 148 RRVTRAVRLYTTRD--DVAVPDRATLYLPDETVPLRRNGGLND-QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 148 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i 224 (480) +.+.+++++|...+ ..+.+.++++||++++++|...+.... .........+|+||.||||+|+|+ .+|.|++ T Consensus 172 ~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~~~~d~ 246 (499) T protein:vir:10 172 HDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYDGENLFGAVPIIEFRNN-----EERQGDF 246 (499) T ss_pred cceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceecccccCCCCccceEEecCC-----CCCCCch Confidence 89999999987654 344567899999999999987654321 222345667899999999999986 4588888 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec---CCCCceEEEec--cccHH Q lcl|NC_021324. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL---ASEAAKISEFK--AAELR 299 (480) Q Consensus 225 ~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~---~g~~~~~~~~~--~~~~~ 299 (480) + ++++|||+||.++|++++.++++++|+++++|+..++..+.. .....+.++.. ++.++++.+++ ....+ T Consensus 247 e-~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~----~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~ 321 (499) T protein:vir:10 247 E-QLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDI----QRLKRGAIEAPPREEGADIEWLTKSFDETQVN 321 (499) T ss_pred H-hHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchh----hhhhhcceeccCCCCCCcceEEeccCCHHHHH Confidence 6 499999999999999999999999999999998766533221 12233444433 23345565543 34567 Q ss_pred HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccce Q lcl|NC_021324. 300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYT 378 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~~~~~~ 378 (480) ++++.|+..|+.++.++++++..|+++ +||+||+++++.|.+||..+++.|+++|++++++++.+.+. +...++. T Consensus 322 ~~~~~l~~~I~~~s~~p~~~~~~~~gn----~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~~ 397 (499) T protein:vir:10 322 LLSQSIENDIHKISYVPNMNDEKFMGN----VSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGANDDAS 397 (499) T ss_pred HHHHHHHHHHHHHhCcccCCchhhccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc Confidence 778888888888888887777776543 69999999999999999999999999999999999988653 4455667 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) .++++|+++.|.|+++.+++++++. +++|.||+++++|+++|+.++++++++++++.............++..... T Consensus 398 ~i~i~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (499) T protein:vir:10 398 GCKISLVANIPSNLSDVVNNVKNAD----GIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELE 473 (499) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Confidence 8999999999999999999999983 368999999999999887777777766665543332222222222222111 Q ss_pred CC---CCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 459 TV---TETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 459 ~~---~d~~~~~~~~~~~~~~~~~~ 480 (480) .. ..+..+..++. ....+||| T Consensus 474 ~~~~~~~~~~~~~~~~-~~~~~~~~ 497 (499) T protein:vir:10 474 DKQDDSSENDKEAGSN-HNQSHRTR 497 (499) T ss_pred CCCcccCCCCCCCccc-cccCCCCC Confidence 11 11111222222 22235788 No 21 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=1.4e-82 Score=469.33 Aligned_cols=443 Identities=15% Similarity=0.126 Sum_probs=338.5 Q ss_pred CCCHHHHHHHHHHHHHHH-HHHHHHHHHHhhccC-cchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCC- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARD-LPNLLEAEAYRNGTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-~~~~~~~~~YY~G~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d- 77 (480) +.+..++|++++.+|..+ .+||+++.+||+|+| .|.......+...+++|+++||+++||++.++|++++|+++..+ T Consensus 37 ~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~~ 116 (501) T protein:vir:96 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 116 (501) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHHHHHHhhhhcccCeeEeeCC Confidence 566678899999999865 579999999999985 55444444445566889999999999999999999999876322 Q ss_pred ----chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 78 ----SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 78 ----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) +...+.|+++|+.|+|+.++.+++++++++|+||+++|. +++|.+++++++|.+++|+||+...+++.++ T Consensus 117 ~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~------dedg~~~i~~~~p~~~~~v~d~~~~~~~~~~ 190 (501) T protein:vir:96 117 NDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYR------SEYDETRIKRLSPLETFVIYDNSLEDNSIAA 190 (501) T ss_pred ccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEE------cCCCceEEEEEccceeEEEEcCCCCCceEEE Confidence 234567899999999999999999999999999999997 4578999999999999999999888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +++|...+..+...++++||++.+++|...++ ....+..+|+||.||||+|+|+ ++|+|+|++ |++||| T Consensus 191 v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~-----~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~-v~~liD 259 (501) T protein:vir:96 191 VRYYNRGTLQSAKDVVEIYTDEHIYTLDASDD-----FNEISVTTHAFGTVPITEYLNN-----IDGIGDYET-ELYLID 259 (501) T ss_pred EEEEEeecCCCcEEEEEEEcCCcEEEEeeCCC-----ceeccccccCCCccceEEecCC-----ccCCCchhh-hHHHHH Confidence 99998888777788999999999999976543 2345668899999999999987 468999975 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec----------CCCCceEEE--eccccHHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL----------ASEAAKISE--FKAAELRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~----------~g~~~~~~~--~~~~~~~~~ 301 (480) +||+++|++++.++++++|+++++|.......+.... +...+++.+ .+.++++.+ ++.++.+++ T Consensus 260 a~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 335 (501) T protein:vir:96 260 LYDSAESDTANHMSDMADAILAIYGDLALPKGMQASD----MKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAY 335 (501) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecccccCcccchhh----hhhcCeeeecccccccccccCcceeeEeccCCHHHHHHH Confidence 9999999999999999999999999876554322111 111222211 223445543 344567788 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC---CCCcccce Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG---REVTEEYT 378 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~---~~~~~~~~ 378 (480) +++|+..++.++.++++++..|++ ++||+||++++.+|.+||..+++.|+.+|++++++++++.+ .....++. T Consensus 336 ~~~l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~ 411 (501) T protein:vir:96 336 KTRLNRDIHIFTNTPDMSDTNFSG----NTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDES 411 (501) T ss_pred HHHHHHHHHHHhCCcccCcccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 888887777777777766665553 36999999999999999999999999999999999988764 33445567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) +++|+|+++.|.|.++.+++++|++ |++|+||+++++|+++|+.++++++++|+++..................+. T Consensus 412 ~i~i~f~~~~p~n~~e~ad~~~kl~----g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 487 (501) T protein:vir:96 412 LLKITFTPNLPKSLNEQVSILTGLG----GQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDE 487 (501) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcccccCCc Confidence 7999999999999999999999985 368999999999999998888888877765422111111111000000000 Q ss_pred CCCCCCccCCCCCC Q lcl|NC_021324. 459 TVTETKTETQTSPS 472 (480) Q Consensus 459 ~~~d~~~~~~~~~~ 472 (480) .....+++++..-. T Consensus 488 ~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 488 VKETHTDDFEREYE 501 (501) T ss_pred CCCCCCCccccccC Confidence 00000011110000 No 22 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=5.3e-82 Score=466.18 Aligned_cols=439 Identities=15% Similarity=0.135 Sum_probs=340.3 Q ss_pred CCCHHHHHHHHHHHHHHH-HHHHHHHHHHhhccC-cchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCC- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARD-LPNLLEAEAYRNGTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-~~~~~~~~~YY~G~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d- 77 (480) +....++|++++.+|..+ .+|++++.+||+|+| .|........+..+++|+++||+++||++.++|++++|+++..+ T Consensus 37 ~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d 116 (501) T protein:vir:27 37 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 116 (501) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEecCC Confidence 555567899999999755 689999999999985 45444444445567889999999999999999999999875432 Q ss_pred ----chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 78 ----SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 78 ----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) +...+.|+++|+.|+|+.++.+++++++++|+||++||. +++|++++++++|.+++|+||+...+++.++ T Consensus 117 ~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~------ded~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 190 (501) T protein:vir:27 117 NDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYR------NEYDETRIKRLNPLETFVIYDNSLEDNSIAA 190 (501) T ss_pred ccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEe------CCCCceEEEEEccceeEEEecCCCCCceEEE Confidence 234567899999999999999999999999999999997 4578999999999999999999888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +++|...+..+...++++||++.+++|...++ .+..+..+|+||.||||+|+|+ ++|.|++++ |++||| T Consensus 191 ir~~~~~~~~~~~~~~~vyt~~~v~~~~~~~~-----~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~-v~~liD 259 (501) T protein:vir:27 191 VRYYNRGTLQNAKDVVEIYTNEHIYTLDASDD-----FNEISVTTHAFGTVPITEFLNN-----VDGIGDYET-ELYLID 259 (501) T ss_pred EEEEEeeecCCcEEEEEEEeCCeEEEEEeCCc-----eeeccccccCCCcccEEEecCC-----CCCCCchhh-hHHHHH Confidence 99998877777788999999999999886643 3455678899999999999987 468999975 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec----------CCCCceEEEe--ccccHHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL----------ASEAAKISEF--KAAELRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~----------~g~~~~~~~~--~~~~~~~~ 301 (480) +||+++|++++.++++++|+++++|......++... .+...+++.+ ++.++++..+ +....+++ T Consensus 260 a~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~ 335 (501) T protein:vir:27 260 LYDSAESDTANHMSDMADAILAIYGDLALPKGMQAS----DMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAY 335 (501) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCccCCcccchh----hhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHHH Confidence 999999999999999999999999986654332211 1112222221 1223455433 34567788 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcccce Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---EVTEEYT 378 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~ 378 (480) +++|+..++.++.+|++++..|++ ++||+||++++.+|.+||..+++.|+.+|++++++++++.+. ....++. T Consensus 336 ~~~l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~ 411 (501) T protein:vir:27 336 KTRLNRDIHIFTNIPDMSDTNFSG----NTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDES 411 (501) T ss_pred HHHHHHHHHHHhCCcccCcccccc----CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 888888888888887777666654 369999999999999999999999999999999999887652 3345567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcc----cCCCc Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK----AQADA 454 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~ 454 (480) .++|.|++++|.|.++.+++++|++ |++|.||+++++|+++|+.++++++++|+++.......+... ...+. T Consensus 412 ~i~v~f~~~~p~n~~e~ad~~~kl~----g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~ 487 (501) T protein:vir:27 412 LLKITFTPNLPKSLNEQVSILTGLG----GQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDE 487 (501) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCccccccccccCC Confidence 8999999999999999999999874 368999999999999988888888877765533222222111 11111 Q ss_pred CCCCCCCCCCccCC Q lcl|NC_021324. 455 TPKPTVTETKTETQ 468 (480) Q Consensus 455 ~~~~~~~d~~~~~~ 468 (480) ..+..++|..+..+ T Consensus 488 ~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 488 VKETHTDDFERAYE 501 (501) T ss_pred CCCCccccccccCC Confidence 11111111111111 No 23 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=1.1e-81 Score=464.50 Aligned_cols=416 Identities=20% Similarity=0.256 Sum_probs=328.3 Q ss_pred hcCcccchhhhcc--eeccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEE Q lcl|NC_021324. 37 TIGIGAPPELAYL--DVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITV 114 (480) Q Consensus 37 ~~~~~~~~~~~~~--~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v 114 (480) ++++..+++++.. ++++|||++|||+++++|.++||+.+++ +.++.++++|++|+|+.++.+++++++++|+||++| T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~d~-~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v 79 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTGPDG-EPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYMLV 79 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceecCCC-chHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEE Confidence 6777787777643 4688999999999999999999997654 467889999999999999999999999999999999 Q ss_pred ecCcccc-CCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCC--ccc-- Q lcl|NC_021324. 115 SHPDVES-GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGL--NDQ-- 189 (480) Q Consensus 115 ~~~~~~~-~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~-- 189 (480) |.+.... .+.++.|+|+++||.+++++||+..+ ++.+++++|....+... +..+|+++..++|...... ... T Consensus 80 ~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~-~~~~ai~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (434) T protein:vir:98 80 GAHPTRTEDNGRPSPLITMEHPSECIVEYDPETG-EPLVGLKVWHNDIDGFG--YARVFFDDTSFPYRTRERTGARLPWG 156 (434) T ss_pred ecCCCcccccCCceeEEEEeccceeEEEEeCCCC-ceEEEEEEEEeccCCce--EEEEEEeCcEEEEEEeeccccccccc Confidence 9865332 24456789999999999999998765 58888888876554432 3455666655544332211 111 Q ss_pred ---cc---cccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccc Q lcl|NC_021324. 190 ---WV---VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDE 263 (480) Q Consensus 190 ---~~---~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~ 263 (480) ++ ......+|+||+||||+|+|++...+ +|.|+|+ .|++|||+||+++|++++.+++|++|+++++|.+++. T Consensus 157 ~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~-~g~sd~e-~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~ 234 (434) T protein:vir:98 157 PDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGE-DPEPEFA-GVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAK 234 (434) T ss_pred cccceecccccccccCCCCccceEEeccCCCcCc-CCcchhh-hHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccc Confidence 11 12345789999999999999998766 6999995 6999999999999999999999999999999998887 Q ss_pred cccccccc-----hhhhhcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH Q lcl|NC_021324. 264 LTNDGENT-----TLDIYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 264 ~~~~~~~~-----~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~ 338 (480) +.++..+. .+....+++|.+++++++|++++.+++++|+++|+.++++|++.+++|++.||+. .+++||+||++ T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~-~~n~Sg~Al~~ 313 (434) T protein:vir:98 235 RTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATD-LVNISADTIGA 313 (434) T ss_pred ccccccccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhccc-cCChHHHHHHH Confidence 66554432 3456678889889999999999999999999999999999999999999999974 45689999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh Q lcl|NC_021324. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID 418 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~ 418 (480) ++..|++||+++++.|+++|++++++++++.|. ..+..++++.|+++.|+|+++.||+++|++++| +|.++++++ T Consensus 314 ~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~--~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g---~~~e~~~~~ 388 (434) T protein:vir:98 314 LDILHVAKVREHIASFSEGLESVLALAAAQAGV--PEDYTEAEVRWANPAHVTMAVKADAATKLKSIG---YPLDVIAEE 388 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--ChhheeeeEEecCCCCCCHHHHHHHHHHHHhcC---CcHHHHHHh Confidence 999999999999999999999999999988764 445678999999999999999999999999876 689999999 Q ss_pred CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCC Q lcl|NC_021324. 419 LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSG 473 (480) Q Consensus 419 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 473 (480) +|++++++++++++++++.. ..+..+...+++.++. ++.++.-++ | T Consensus 389 lg~~~~e~~r~~~e~~~~~~--~~~~~~~~~~~~~~g~-----~~~~~~~~d--g 434 (434) T protein:vir:98 389 LDESPARVRRIVAGAASQAL--LAASLLPAPGAPSAGN-----VPDSGGAVD--G 434 (434) T ss_pred CCCCHHHHHHHHHHHHHHHH--HHHhhhccCCCCCCCC-----CCcccCCCC--C Confidence 99999988887655443322 2222222222211111 111111111 1 No 24 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=2.3e-81 Score=462.63 Aligned_cols=447 Identities=11% Similarity=0.087 Sum_probs=337.1 Q ss_pred CCCH-------HHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccc-hhhhcceeccchHHHHHHHHHhhhccCc Q lcl|NC_021324. 1 MTTY-------HEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~-------~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) |+.. .+.|.+++.+|.. +.+||+++.+||+|+|+|.+.....+ +...++|+++||+++||++.++|++++| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p 110 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccC Confidence 3332 3557788888864 56899999999999999876554443 3445889999999999999999999999 Q ss_pred ccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 72 FRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 72 ~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) +++ +++++..+.|+++|+.|+++.++.+++++++++|+||++||. +++|.++++.++|.+++|+||+...+++ T Consensus 111 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~------ded~~~~i~~~~p~~~~~iyd~~~~~~~ 184 (512) T protein:vir:97 111 IQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNS 184 (512) T ss_pred ceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCce Confidence 876 456667889999999999999999999999999999999996 4578999999999999999999888889 Q ss_pred EEEEEEEEEecC----CceEEEEEEEeCCcEEEEEecCCCccc-cccccccccccccccceEEeecccccCccCCCccch Q lcl|NC_021324. 151 TRAVRLYTTRDD----VAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 151 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .+++++|..... .+.+.++++|+++.+++|...++.... .....+..+|+||.||||+|+|+ .+|.|+++ T Consensus 185 ~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~~~gd~e 259 (512) T protein:vir:97 185 IAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNN-----ERRKGDYE 259 (512) T ss_pred EEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCC-----CCCCCchh Confidence 999999875432 234678899999999999877654322 22345678899999999999986 45788886 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhhcc----hh--e-ecCCCCceEEEec- Q lcl|NC_021324. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYG----RI--L-TLASEAAKISEFK- 294 (480) Q Consensus 226 ~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~---~~~~~~~~~~~----~~--~-~~~g~~~~~~~~~- 294 (480) .|++|||+||.++|++++.++++++|+++++|.......+. .....+..... .. . ..+|.++++.+++ T Consensus 260 -~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 338 (512) T protein:vir:97 260 -KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY 338 (512) T ss_pred -hhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecC Confidence 59999999999999999999999999999999754432211 11111111110 00 0 1233445565543 Q ss_pred -cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-- Q lcl|NC_021324. 295 -AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-- 371 (480) Q Consensus 295 -~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-- 371 (480) ....++++++|...|+.++.++++++..|++ ++||+||++++.+|.+||..+++.|+++|++++++|+++.+. T Consensus 339 ~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~g----n~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~ 414 (512) T protein:vir:97 339 DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTR 414 (512) T ss_pred CHHHHHHHHHHHHHHHHHHhCCcccCcccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 4457778888887777777777776666653 379999999999999999999999999999999999887542 Q ss_pred --CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_021324. 372 --EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 372 --~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) ....++..++++|+++.|.|+++.++++++++ |++|.||+++++|+++|+.++++++++|+.+......... . T Consensus 415 ~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~----giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-~ 489 (512) T protein:vir:97 415 SIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGI-Y 489 (512) T ss_pred CcccccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcc-c Confidence 23445678999999999999999999999884 4689999999999999888888877777665443332221 1 Q ss_pred cCCCcCCCC----CCCCCCccCC Q lcl|NC_021324. 450 AQADATPKP----TVTETKTETQ 468 (480) Q Consensus 450 ~~~~~~~~~----~~~d~~~~~~ 468 (480) ..+....+. +..+.+.+.+ T Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 490 KDPRDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred CCCCCCCCCCCCCCccccccccC Confidence 111111111 1111111111 No 25 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=9e-81 Score=459.41 Aligned_cols=449 Identities=11% Similarity=0.092 Sum_probs=338.8 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccc-hhhhcceeccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) +.+. +.|.+++..|.. +++||+++.+||+|+|+|.+.....+ ....++|+++||+++||++.++||+++++++. ++ T Consensus 39 ~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:99 39 LQNV-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hccH-HHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHHhhhcccCceeecCc Confidence 3333 347778888865 56899999999999999876554443 34458899999999999999999999998764 56 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEE Q lcl|NC_021324. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.|+++|+.|+++.++.++++.++++|+||++||. +++|++++++++|.+++|+||+....++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~------ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~ 191 (511) T protein:vir:99 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCceEEEEEEE Confidence 667889999999999999999999999999999999996 45789999999999999999998888899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCCccc-cccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) .... +.+.+.++++|+++.+++|...+..... ........+|+||.||||+|+|+ ++|.|+++ .|++|| T Consensus 192 ~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e-~v~~li 265 (511) T protein:vir:99 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNN-----ERRKGDYE-KVITLI 265 (511) T ss_pred EeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCC-----CCCCCchh-hhHHHH Confidence 7543 2345678999999999999877654322 23345678899999999999986 46889886 599999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhh------cchheecCCCCceEEEec--cccHHHH Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIY------YGRILTLASEAAKISEFK--AAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~---~~~~~~~~~------~~~~~~~~g~~~~~~~~~--~~~~~~~ 301 (480) |+||.++|++++.+++|++|+++++|......... .....+... .......++.++++.+++ ....+++ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:99 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHH Confidence 99999999999999999999999999654332221 111111111 111112234456665544 3456777 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----~~~~~~ 377 (480) +++|+..++.++.+|++++..|++ ++||+||+++++++.+||..+++.|+++|++++++|+++++. ....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~g----n~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:99 346 KDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccc Confidence 777777777777777766666653 379999999999999999999999999999999999887542 234456 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ..+++.|+++.|.|.++.+++++++. |++|+||+++++|+++|+.++++++++|+++.....+........+...+ T Consensus 422 ~~i~i~f~~~~p~n~~e~~~~~~kl~----GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:99 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDD 497 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh----ccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCC Confidence 78999999999999999999999874 46899999999999998888888888877765544443332221111111 Q ss_pred CCCC-CCCccCCCC Q lcl|NC_021324. 458 PTVT-ETKTETQTS 470 (480) Q Consensus 458 ~~~~-d~~~~~~~~ 470 (480) ...+ +..+..+++ T Consensus 498 ~~~~~~~~~~d~~e 511 (511) T protein:vir:99 498 EQDDSTKDSIDKKE 511 (511) T ss_pred CCCCCCcCcccccC Confidence 1111 111111111 No 26 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=9.9e-81 Score=459.19 Aligned_cols=447 Identities=11% Similarity=0.086 Sum_probs=337.1 Q ss_pred CCC-------HHHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccc-hhhhcceeccchHHHHHHHHHhhhccCc Q lcl|NC_021324. 1 MTT-------YHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~-------~~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) |+. ..+.|.+++..|.. +.+||+++.+||+|+|+|.+.....+ +...++|+++||+++||++.++||+++| T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p 110 (511) T protein:vir:93 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (511) T ss_pred ccchhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHhhhhcccC Confidence 221 13458888998864 56899999999999999876554443 3445889999999999999999999999 Q ss_pred ccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 72 FRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 72 ~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) +++ +++++..+.|+++|+.|+++.++.+++++++++|+||++||. +++|.+++++++|.+++|+||+....++ T Consensus 111 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~------de~~~~~i~~~~p~~~~~vydd~~~~~~ 184 (511) T protein:vir:93 111 IQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNS 184 (511) T ss_pred eeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCce Confidence 886 456667789999999999999999999999999999999996 4578999999999999999998888889 Q ss_pred EEEEEEEEEec----CCceEEEEEEEeCCcEEEEEecCCCcccc-ccccccccccccccceEEeecccccCccCCCccch Q lcl|NC_021324. 151 TRAVRLYTTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQW-VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 151 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .+++++|.... +.+.+.++++|+++.+++|...++..... .......+|+||.||||+|+|+. +|.|+++ T Consensus 185 ~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~gd~e 259 (511) T protein:vir:93 185 IAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE 259 (511) T ss_pred EEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCC-----CCCCchh Confidence 99999986543 23456789999999999998776543222 23455778999999999999864 6788886 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhhcc------hheecCCCCceEEEec-- Q lcl|NC_021324. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYG------RILTLASEAAKISEFK-- 294 (480) Q Consensus 226 ~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~---~~~~~~~~~~~------~~~~~~g~~~~~~~~~-- 294 (480) .|++|||+||.++|++++.+++|++|+++++|.......+. .....+..... .....+++++++.+++ T Consensus 260 -~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 338 (511) T protein:vir:93 260 -KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYD 338 (511) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCC Confidence 59999999999999999999999999999999654433221 11111111111 1112234556665543 Q ss_pred cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--- Q lcl|NC_021324. 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--- 371 (480) Q Consensus 295 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--- 371 (480) ....+.++++|...++.++.+|++++..|++ ++||+||+++++++.+||..+++.|+++|++++++++++.+. T Consensus 339 ~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~ 414 (511) T protein:vir:93 339 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWS 414 (511) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 4456777777777777777777776666653 369999999999999999999999999999999999887542 Q ss_pred -CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_021324. 372 -EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKA 450 (480) Q Consensus 372 -~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (480) ....++..+++.|+++.|.|.++.++++.++. |++|.||+++++|+++|+.++++++++|+.+........ ... T Consensus 415 ~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~-~~~ 489 (511) T protein:vir:93 415 IDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKG-IYK 489 (511) T ss_pred cccccccccceEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhh-ccc Confidence 22345678999999999999999999999873 368999999999999988888877777765543222221 111 Q ss_pred CCCcCCCC----CCCCCCccCC Q lcl|NC_021324. 451 QADATPKP----TVTETKTETQ 468 (480) Q Consensus 451 ~~~~~~~~----~~~d~~~~~~ 468 (480) .+....+. +..+...+.+ T Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 490 DPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCCCCcccccccccC Confidence 11111111 1111111111 No 27 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=4.1e-81 Score=461.27 Aligned_cols=440 Identities=15% Similarity=0.117 Sum_probs=333.1 Q ss_pred CCCHHHHHHHHHHHHHHH-HHHHHHHHHHhhccC-cchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCC-- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARD-LPNLLEAEAYRNGTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE-- 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~-~~~~~~~~~YY~G~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~-- 76 (480) ..+..++|++++.+|..+ .+|++++.+||+|+| .|.......+...+++|+++|||++||++.++||+++|+++.. T Consensus 38 ~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d 117 (502) T protein:vir:48 38 MVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDD 117 (502) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHHHHHHhhhhcccCeeEecCC Confidence 334457899999999765 589999999999975 5655554455556688999999999999999999999987532 Q ss_pred C---chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 77 D---SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 77 d---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) + +...+.|+++|+.|+|+.++.+++++++++|+||+++|. +++|++++++++|.+++|+||+...+++.++ T Consensus 118 ~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~------dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ 191 (502) T protein:vir:48 118 NEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYR------SEYDETRIKRLSPLETFVIYDNSLEDNSIAA 191 (502) T ss_pred ccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 2 234567899999999999999999999999999999996 4578999999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +++|......+...++++||++.+++|...++ ....+..+|+||.||||+|+|+ ++|.|++++ +++||| T Consensus 192 ir~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~-----~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~-v~~liD 260 (502) T protein:vir:48 192 VRYYNRGTLQNAKDVVEIYTNQHIYTLDASDS-----FNEISVTPHAFGTVPITEFLNN-----ADGIGDYET-ELYLID 260 (502) T ss_pred EEEEEEeecCCcEEEEEEEeCCeEEEEEeCCc-----eeeccceecCCCccceEEecCC-----CCCCCchhh-hHHHHH Confidence 99998777777788999999999999876543 3455678899999999999987 468899965 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec----------CCCCceEEEec--cccHHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL----------ASEAAKISEFK--AAELRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~----------~g~~~~~~~~~--~~~~~~~ 301 (480) +||+++|++++.+++|++|+++++|........... . +...+++.+ ++.++++.+++ ....+++ T Consensus 261 a~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~ 336 (502) T protein:vir:48 261 LYDSAESDTANHMSDMADAILAIYGDLALPQGMQAS--D--MKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEAY 336 (502) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCcccccccchh--h--hhhcceeeccccccccccccCcceeEeeecCCHHHHHHH Confidence 999999999999999999999999976543322211 1 111122211 22345554443 3445666 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCcccce Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---EVTEEYT 378 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~ 378 (480) ++.|...|+.++.+|++++..|++ ++||+||++++++|.+||..+++.|+.+|++++++++++.+. ....+.. T Consensus 337 ~~~L~~~I~~~s~~p~~~~~~~~~----n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~ 412 (502) T protein:vir:48 337 KTRLNKDIHVFTNTPDMSDNHFSG----NASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDES 412 (502) T ss_pred HHHHHHHHHHHhCCCCcCcccccc----CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 666666666666655555444433 379999999999999999999999999999999999887653 3345567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) +++++|+++.|+|.++.++++.|+. +++|+||+++++|+++|+.++++++++|+.+................ T Consensus 413 ~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~---- 484 (502) T protein:vir:48 413 RLKITFTPNLPKSLYEQVSILNDLG----GQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNVGK---- 484 (502) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccccc---- Confidence 8999999999999999999999873 36899999999999998888888887776543222211111111111 Q ss_pred CCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 459 TVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 459 ~~~d~~~~~~~~~~~~~~~~~~ 480 (480) +.+..+++..+-+.| T Consensus 485 -------~~d~~~e~~~~~~~~ 499 (502) T protein:vir:48 485 -------YTDEVKETHTDDFER 499 (502) T ss_pred -------cCCCccCCCCcCcCC Confidence 111111111112222 No 28 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=1.3e-80 Score=458.48 Aligned_cols=447 Identities=12% Similarity=0.107 Sum_probs=336.9 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccc-hhhhcceeccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) +.+. +.|.+++++|.. +++|++++.+||+|+|+|.+.....+ +...++|+++||+++||++.++||+++++++. ++ T Consensus 39 ~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:78 39 LQNV-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hcCH-HHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCc Confidence 3333 447778888764 56899999999999999876554433 34458899999999999999999999998764 55 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEE Q lcl|NC_021324. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.|+++|+.|+++.++.++++.++++|+||++||. |++|++++++++|.+++|+||+....++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~------d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~ 191 (511) T protein:vir:78 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 667899999999999999999999999999999999996 45789999999999999999988878899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCCcccc-ccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQW-VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) .... ..+.+.++++||++.+++|...++..... ....+..+|+||.||||+|+|+ ++|.|++++ |++|| T Consensus 192 ~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~gd~e~-v~~li 265 (511) T protein:vir:78 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNN-----ERRKGDYEK-VITLI 265 (511) T ss_pred EeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCC-----CCCCCchhh-hHHHH Confidence 7543 22456788999999999998776543222 2345678899999999999986 468888865 99999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCcccccc---ccccchhhhhcchhe------ecCCCCceEEEec--cccHHHH Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTN---DGENTTLDIYYGRIL------TLASEAAKISEFK--AAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~---~~~~~~~~~~~~~~~------~~~g~~~~~~~~~--~~~~~~~ 301 (480) |+||.++|++++.++++++|+++++|.......+ ......+........ ..++.++++.+++ .+..+++ T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:78 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHH Confidence 9999999999999999999999999965433221 111111211111111 1234455655443 4456777 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----~~~~~~ 377 (480) +++|...|+.++.+|++++..|++ ++||+||+++++.+.+||..+++.|+.+|++++++++++++. ....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~~----n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:78 346 KDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Confidence 777777777777777776666653 379999999999999999999999999999999999887542 234556 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcC-- Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT-- 455 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 455 (480) .+++++|+++.|.|.++.++++++++ |++|+||+++++|+++|+.++|+++++|+.+....+.........+.. T Consensus 422 ~~i~~~f~~~~p~n~~e~~d~~~kl~----G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:78 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCC Confidence 78999999999999999999999884 368999999999999988888887777766544333332221111111 Q ss_pred -CCCCCCCCCccCC Q lcl|NC_021324. 456 -PKPTVTETKTETQ 468 (480) Q Consensus 456 -~~~~~~d~~~~~~ 468 (480) +.++..+..++++ T Consensus 498 ~~~~~~~~~~~e~~ 511 (511) T protein:vir:78 498 EQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCccCcccccC Confidence 1111111111111 No 29 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=1.3e-80 Score=458.48 Aligned_cols=447 Identities=12% Similarity=0.107 Sum_probs=336.9 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccc-hhhhcceeccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) +.+. +.|.+++++|.. +++|++++.+||+|+|+|.+.....+ +...++|+++||+++||++.++||+++++++. ++ T Consensus 39 ~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:96 39 LQNV-NEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred hcCH-HHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCc Confidence 3333 447778888764 56899999999999999876554433 34458899999999999999999999998764 55 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEE Q lcl|NC_021324. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.|+++|+.|+++.++.++++.++++|+||++||. |++|++++++++|.+++|+||+....++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~------d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~ 191 (511) T protein:vir:96 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEE Confidence 667899999999999999999999999999999999996 45789999999999999999988878899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCCcccc-ccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQW-VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) .... ..+.+.++++||++.+++|...++..... ....+..+|+||.||||+|+|+ ++|.|++++ |++|| T Consensus 192 ~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~gd~e~-v~~li 265 (511) T protein:vir:96 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNN-----ERRKGDYEK-VITLI 265 (511) T ss_pred EeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCC-----CCCCCchhh-hHHHH Confidence 7543 22456788999999999998776543222 2345678899999999999986 468888865 99999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCcccccc---ccccchhhhhcchhe------ecCCCCceEEEec--cccHHHH Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTN---DGENTTLDIYYGRIL------TLASEAAKISEFK--AAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~---~~~~~~~~~~~~~~~------~~~g~~~~~~~~~--~~~~~~~ 301 (480) |+||.++|++++.++++++|+++++|.......+ ......+........ ..++.++++.+++ .+..+++ T Consensus 266 Da~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:96 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccccCCCCcceeEEeecCCHHHHHHH Confidence 9999999999999999999999999965433221 111111211111111 1234455655443 4456777 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----~~~~~~ 377 (480) +++|...|+.++.+|++++..|++ ++||+||+++++.+.+||..+++.|+.+|++++++++++++. ....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~~----n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:96 346 KDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc Confidence 777777777777777776666653 379999999999999999999999999999999999887542 234556 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcC-- Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT-- 455 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 455 (480) .+++++|+++.|.|.++.++++++++ |++|+||+++++|+++|+.++|+++++|+.+....+.........+.. T Consensus 422 ~~i~~~f~~~~p~n~~e~~d~~~kl~----G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:96 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPRDINDD 497 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCCCCCCC Confidence 78999999999999999999999884 368999999999999988888887777766544333332221111111 Q ss_pred -CCCCCCCCCccCC Q lcl|NC_021324. 456 -PKPTVTETKTETQ 468 (480) Q Consensus 456 -~~~~~~d~~~~~~ 468 (480) +.++..+..++++ T Consensus 498 ~~~~~~~~~~~e~~ 511 (511) T protein:vir:96 498 EQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCccCcccccC Confidence 1111111111111 No 30 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=5.1e-81 Score=460.78 Aligned_cols=428 Identities=13% Similarity=0.134 Sum_probs=333.5 Q ss_pred CCCH----HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC- Q lcl|NC_021324. 1 MTTY----HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS- 75 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~- 75 (480) |+.. .++|.+|+++|..+++|++++.+||+|+|+|.+.+.. .+...++|+++||+++||++.++||+++|+++. T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 89 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTK-DLWKPDNRLTVNFTKYIVDTFTGYFNGIPVKKSH 89 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCc-cccCccceeecchHHHHHHHHhhhhcccCceecc Confidence 3332 5689999999999999999999999999999776542 334468899999999999999999999998764 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEE Q lcl|NC_021324. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 76 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ++++..+.|+++|++|+|+.++.+++++++++|+||++||. +++|.+++++++|.+++|+||+...+.+.++++ T Consensus 90 ~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir 163 (453) T protein:vir:39 90 SDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQ------NEETQTNVIYNTPENMFMVYDDTIKQEPLFAVR 163 (453) T ss_pred CChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEEcccceEEEecCCCCCeEEEEEE Confidence 56667889999999999999999999999999999999997 457899999999999999999888888999998 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHH Q lcl|NC_021324. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~ 235 (480) +|.. .+...++++|+++++++|...++. ....+..+|+||.||||+|+|++ +|+|+|++ |++|||+| T Consensus 164 ~~~~---~~~~~~~~~yt~~~i~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~~-----~g~sd~e~-v~~liDa~ 230 (453) T protein:vir:39 164 YGYD---DDYKLYGEVYTKETTYALNGTMGF----YNMTEQAPNPFDDLPVVEFYFNE-----ERMSIFES-VISLVNAF 230 (453) T ss_pred EEEe---CCeEEEEEEEeCCeEEEEEecCCc----eeeecccccCCCceeEEEecCCC-----CCCcchhh-hHHHHHHH Confidence 8753 334678999999999999876542 23456779999999999999863 68999975 99999999 Q ss_pred HHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-------CCCCceEEEec--cccHHHHHHHHH Q lcl|NC_021324. 236 SRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-------ASEAAKISEFK--AAELRNFAEEME 306 (480) Q Consensus 236 ~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-------~g~~~~~~~~~--~~~~~~~~~~l~ 306 (480) |+++|++++.++++++|+++++|...+..... +...++++.. +++++++.+++ ....++++++|. T Consensus 231 ~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~ 304 (453) T protein:vir:39 231 NKAISEKANDVDYFSDQYLTFLGAAVEEEDLK------NIRSNRVINYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLT 304 (453) T ss_pred HHHHHHHHHHHHHhhCceeeeecCCCCchhhh------hhhhcceeeecCCCCCCCCCceeEEeecCCHHHHHHHHHHHH Confidence 99999999999999999999999866543221 1122233322 34456666554 334556666666 Q ss_pred HHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEec Q lcl|NC_021324. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWR 385 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~~~~~~~~~v~f~ 385 (480) ..|+.++.++++++..|| ++||+||++++++|..||.++++.|+.+|++++++++++.+. +...++.+++|.|+ T Consensus 305 ~~I~~~s~~p~~~~~~~g-----n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~ 379 (453) T protein:vir:39 305 KLIFQTTMVANISDESFG-----SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFT 379 (453) T ss_pred HHHHHHhCCccccccccc-----CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeC Confidence 666666666555544443 369999999999999999999999999999999999988753 34556778999999 Q ss_pred CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCc Q lcl|NC_021324. 386 DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKT 465 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) ++.|.|+++.|++++|+. +++|+||+++++|+++|+.++++++++|+.+......... .+....+...++... T Consensus 380 ~~~p~~~~~~a~~~~kl~----g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~---~~~~~~~~~~~~~~~ 452 (453) T protein:vir:39 380 RNEPKDIKEQAETANILM----GITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDKDKQ---PSEKGTDTVVPETNE 452 (453) T ss_pred CCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhcc---CCCCCCCCCCCCcCC Confidence 999999999999999873 3689999999999999888888887777665432211111 111111111111111 Q ss_pred c Q lcl|NC_021324. 466 E 466 (480) Q Consensus 466 ~ 466 (480) + T Consensus 453 e 453 (453) T protein:vir:39 453 E 453 (453) T ss_pred C Confidence 1 No 31 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=4.1e-81 Score=461.28 Aligned_cols=435 Identities=11% Similarity=0.084 Sum_probs=341.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) +....++|++|+.+|..+.+|++++.+||+|+|+|....... .+..+++|+++||+++|||+.++||+++|+++ T Consensus 42 ~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~ 121 (492) T protein:vir:97 42 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 121 (492) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhcccCcee Confidence 445577899999999999999999999999999986554321 23345789999999999999999999999876 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.|+++++ |+++..+.++++++++||+||+++|. +++|++++++++|.+++|+||+...+.+.++ T Consensus 122 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~~------d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~ 194 (492) T protein:vir:97 122 KHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 194 (492) T ss_pred ccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 55556677887764 79999999999999999999999996 4578999999999999999998878889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc------ccccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++.+++|...++... ...+..+..+|+||.||||+|+|++ +|.|++++ T Consensus 195 vr~~~~~~~----~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~g~sd~e~- 264 (492) T protein:vir:97 195 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM- 264 (492) T ss_pred EEEEeeccc----eeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCCC-----CCCCchHh- Confidence 999876443 3678999999999887654321 1223455678999999999999874 68899975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhee-cCCCCceEEEe--ccccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISEF--KAAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~~--~~~~~~~~~~~ 304 (480) |++|||+||+++|++++.++++++|+++++|.+.++..+... .+....++. .+++++++.++ +....++++++ T Consensus 265 v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 340 (492) T protein:vir:97 265 YKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKR----LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 340 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHH----HHhhccceecCCCCcceeEeccCCHHHHHHHHHH Confidence 999999999999999999999999999999987654332211 122233333 34556777544 45567888899 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.++.++++++..|+++ +||+||++++.+|..||..+++.|+++|++++++++.+.+. ..++.+++|+| T Consensus 341 L~~~I~~~s~~p~~~~~~~~~n----~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~--~~~~~~i~v~f 414 (492) T protein:vir:97 341 LYQKIMLFGQAVDFSSDKFGSA----PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHKDVDISF 414 (492) T ss_pred HHHHHHHHhCCCCCCccccccC----cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CcccceeeEEe Confidence 9988999988888888777653 79999999999999999999999999999999999998874 44577899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.+++++|+. |++|+||+++++|+++|+.++++++++|+++... .+.....++.+...+.+..++. T Consensus 415 ~~~~p~~~~e~a~~~~kl~----G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 489 (492) T protein:vir:97 415 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQTEYNK-QLPNLDDGGADSAQQQERSNNK 489 (492) T ss_pred cCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHH-hhhccccCCCCCCccccccccc Confidence 9999999999999999873 4689999999999999888888777777654332 2222222222222222111111 Q ss_pred ccC Q lcl|NC_021324. 465 TET 467 (480) Q Consensus 465 ~~~ 467 (480) .+. T Consensus 490 ~~e 492 (492) T protein:vir:97 490 ESE 492 (492) T ss_pred ccC Confidence 111 No 32 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=4.9e-81 Score=460.89 Aligned_cols=435 Identities=11% Similarity=0.083 Sum_probs=340.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .....++|++|+.+|..+.+|++++.+||+|+|+|....... .+...++|+++||+++|||+.++|++++|+++ T Consensus 42 ~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~ 121 (492) T protein:vir:94 42 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 121 (492) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhcccCcee Confidence 455677899999999999999999999999999986543321 22345789999999999999999999999876 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.|+++++ |+++..+.+++++++++|+||++||. |++|++++++++|.+++|+||+...+.+.++ T Consensus 122 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~~~v~~------d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ 194 (492) T protein:vir:94 122 KHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 194 (492) T ss_pred ccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 45556677887775 78999999999999999999999996 4578999999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc------ccccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++.+++|...++... ...+..+..+|+||.||||+|+|++ +|.|+++. T Consensus 195 ir~~~~~~~----~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~~~sd~e~- 264 (492) T protein:vir:94 195 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNND-----LEISDIFM- 264 (492) T ss_pred EEEEeeccc----eeEEEEecCeEEEEEEecCeeeeccccccccccccccccCCCccceEEecCCC-----CCCCchHH- Confidence 999876443 3679999999999876654311 1223455678999999999999874 68899965 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhee-cCCCCceEEE--eccccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-LASEAAKISE--FKAAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~~~~--~~~~~~~~~~~~ 304 (480) |++|||+||+++|++++.+++|++|+++++|.+.++..+... ......++. .+++++++.+ ++....++++++ T Consensus 265 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 340 (492) T protein:vir:94 265 YKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKR----LLRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 340 (492) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHH----HHhhccceecCCCCcceeEeccCCHHHHHHHHHH Confidence 999999999999999999999999999999987654333211 112223332 3455677754 445678899999 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.++.++++++..|+++ +||+||++++.+|..||.++++.|+.+|++++++++.+.+.. .++.+++|+| T Consensus 341 l~~~I~~~s~~p~~~~~~~~~n----~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~--~~~~~i~v~f 414 (492) T protein:vir:94 341 LYQKIMLFGQAVDFSSDKFGSA----PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK--GEHKDVDISF 414 (492) T ss_pred HHHHHHHHhCCcCCCccccccC----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--cccceeeEEe Confidence 9999999999999888887753 699999999999999999999999999999999999988743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.++++++++ |++|+||+++++|+++|+.++++++++|+++... ++......+++.+++.+ .. T Consensus 415 ~~~~p~~~~e~~~~~~kl~----giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~-~~~~~~~~~~~~~~~~~---~~ 486 (492) T protein:vir:94 415 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNK-QLPNLDDGGADSAQQQE---RS 486 (492) T ss_pred cCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHh-hccccccccCCCCcccc---CC Confidence 9999999999999999874 4789999999999999888888877777654433 22222222221111111 11 Q ss_pred ccCCCC Q lcl|NC_021324. 465 TETQTS 470 (480) Q Consensus 465 ~~~~~~ 470 (480) .+.+.+ T Consensus 487 ~~~e~e 492 (492) T protein:vir:94 487 NNKESE 492 (492) T ss_pred ccccCC Confidence 111111 No 33 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=1.8e-80 Score=457.77 Aligned_cols=446 Identities=11% Similarity=0.092 Sum_probs=336.7 Q ss_pred CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccc-hhhhcceeccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) +.+. +.|.+++.+|.. +++||+++.+||+|+|+|.+.....+ +...++|+++||+++||++.++|++++|+++. ++ T Consensus 39 ~~~~-~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d 117 (511) T protein:vir:10 39 LQNV-NEVSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDD 117 (511) T ss_pred ccCH-HHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHHHHHHhhhhcccCceeecCc Confidence 4443 457778888765 56999999999999999976654443 34458899999999999999999999998764 55 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEE Q lcl|NC_021324. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 78 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ++..+.|+++|+.|+++.++.+++++++++|+||++||. |++|++++++++|.+++|+||+....++.+++++| T Consensus 118 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~------dedg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~ 191 (511) T protein:vir:10 118 KDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIR------NQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYL 191 (511) T ss_pred hHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCceEEEEEEE Confidence 667889999999999999999999999999999999996 45789999999999999999998888899999998 Q ss_pred EEec----CCceEEEEEEEeCCcEEEEEecCCCccc-cccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 158 TTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 158 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) .... +.+.+.++++|+++.+++|...++.... ........+|+||.||||+|+|+. +|.|+++ .|++|| T Consensus 192 ~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~-----~g~gd~e-~v~~li 265 (511) T protein:vir:10 192 RTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE-KVITLI 265 (511) T ss_pred EeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCC-----CCCCchh-hhHHHH Confidence 7543 2345678899999999999877554322 223456778999999999999864 5788886 599999 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhhcch------heecCCCCceEEEe--ccccHHHH Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYYGR------ILTLASEAAKISEF--KAAELRNF 301 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~---~~~~~~~~~~~~------~~~~~g~~~~~~~~--~~~~~~~~ 301 (480) |+||.++|++++.++++++|+++++|.......+. .....+...... ....++.++++.++ +....+++ T Consensus 266 Da~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~ 345 (511) T protein:vir:10 266 DLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAY 345 (511) T ss_pred HHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceecccccccccccccCCCCcceeEEeecCCHHHHHHH Confidence 99999999999999999999999999654332211 111112111111 11123345555444 34456777 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccc Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR----EVTEEY 377 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~----~~~~~~ 377 (480) +++|...|+.++.+|++++..|++ ++||+||+++++++.+||..+++.|+++|++++++++++++. ....++ T Consensus 346 ~~~L~~~I~~~s~~P~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~ 421 (511) T protein:vir:10 346 KDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF 421 (511) T ss_pred HHHHHHHHHHHhCCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccc Confidence 777777777777777776666653 369999999999999999999999999999999999887542 234556 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ..+++.|+++.|+|.++.+++++|++ |++|+||+++++|+++|+.++++++++|+++....+..+... .+....+ T Consensus 422 ~~i~i~f~~~~p~d~~~~~~~~~kl~----G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~-~~~~~~~ 496 (511) T protein:vir:10 422 NTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK-DPRDIND 496 (511) T ss_pred ceeeEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhccc-CCCCCCC Confidence 78999999999999999999999984 368999999999999988888887777766544333222211 1111111 Q ss_pred C----CCCCCCccCC Q lcl|NC_021324. 458 P----TVTETKTETQ 468 (480) Q Consensus 458 ~----~~~d~~~~~~ 468 (480) . +..+...+.+ T Consensus 497 ~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 497 DEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCcccCcccccC Confidence 1 1111111111 No 34 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=2.9e-80 Score=456.67 Aligned_cols=435 Identities=11% Similarity=0.094 Sum_probs=337.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc------cchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) +....++|.+|+++|..+++|+.++.+||+|+|+|...... ..+..+++|+++||+++|||+.++||+++|+++ T Consensus 33 ~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~~ 112 (483) T protein:vir:12 33 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 112 (483) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCcee Confidence 44457789999999999999999999999999998654322 223345789999999999999999999999886 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.|+++++ |+++..+.+++++++++|+||++||. |++|+|++++++|.+++|+||+...+++.++ T Consensus 113 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~------d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 185 (483) T protein:vir:12 113 KHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 185 (483) T ss_pred ccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEE------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 55556677888775 78999999999999999999999996 4578999999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCc------cccccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN------DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++++++|...++.. ....+.....+|+||.||||+|+|+ ++|.|++++ T Consensus 186 ir~~~~~~~----~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~- 255 (483) T protein:vir:12 186 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFM- 255 (483) T ss_pred EEEEEeecc----eEEEEEecCeEEEEEEeCCeeeecccccccccccccccCCCCccceEEecCC-----CCCCCchhh- Confidence 999876443 357999999999987665431 1122344567899999999999986 468899975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhe-ecCCCCceEEEec--cccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||.++|++++.+++|++|+++++|...++..+... . ....+++ ..+++++++.+++ ....++++++ T Consensus 256 v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 331 (483) T protein:vir:12 256 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR--L--LRYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 331 (483) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHH--h--hhhccccccCCCCcceEEeecCCHHHHHHHHHH Confidence 999999999999999999999999999999987655433221 1 1222333 3355677776554 3456677777 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.++.++++++..|++ ++||+||++++.+|..||..+++.|+++|++++++++++.+. ..++.+++|+| T Consensus 332 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~--~~~~~~i~v~f 405 (483) T protein:vir:12 332 LYQKIMLFGQAVDFSSDKFGS----APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI--KGEHKDVDISF 405 (483) T ss_pred HHHHHHHHhCCCCCCcccccc----CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCccceeeEEe Confidence 777777777777776666654 379999999999999999999999999999999999998874 34577899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.+++++|+. |++|+||+++++|+++|+.++++++++|+++.... +......+++...+. +.. T Consensus 406 ~~~~p~~~~~~a~~~~kl~----GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~d~~~~~---~~~ 477 (483) T protein:vir:12 406 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQQ---ERS 477 (483) T ss_pred CCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh-cccccccccCCcccC---CCC Confidence 9999999999999999873 46899999999999999888888877776654322 222222222211111 111 Q ss_pred ccCCCC Q lcl|NC_021324. 465 TETQTS 470 (480) Q Consensus 465 ~~~~~~ 470 (480) .+.+++ T Consensus 478 ~~~e~e 483 (483) T protein:vir:12 478 NNKESE 483 (483) T ss_pred CcccCC Confidence 111111 No 35 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=2.5e-79 Score=451.54 Aligned_cols=447 Identities=11% Similarity=0.086 Sum_probs=336.0 Q ss_pred CCC-------HHHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccch-hhhcceeccchHHHHHHHHHhhhccCc Q lcl|NC_021324. 1 MTT-------YHEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAPP-ELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~-------~~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~~-~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) |+. ..+.|.+++.+|.. +.+||+++.+||+|+|+|.+.....+. ...++|+++||+++||++.++|+++++ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p 110 (511) T protein:vir:96 31 YDGTESDLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYISDFINGYFLGNP 110 (511) T ss_pred cchhhhhhhccHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHHHHHHHhhhccCC Confidence 221 12347788888865 568999999999999999766544443 345789999999999999999999999 Q ss_pred ccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 72 FRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 72 ~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) +++ +++++..+.|+++|+.|+|+.++.+++++++++|+||++||. |++|++++++++|.+++|+||+....++ T Consensus 111 ~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~------ded~~~~i~~~~p~~~~~vydd~~~~~~ 184 (511) T protein:vir:96 111 IQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIR------NQDDETRLYKSDAMSTFVIYDNTIERNS 184 (511) T ss_pred ceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEe------CCCCceEEEEEccceeEEEEcCCCCCce Confidence 876 456667899999999999999999999999999999999996 4578999999999999999998888889 Q ss_pred EEEEEEEEEec----CCceEEEEEEEeCCcEEEEEecCCCccc-cccccccccccccccceEEeecccccCccCCCccch Q lcl|NC_021324. 151 TRAVRLYTTRD----DVAVPDRATLYLPDETVPLRRNGGLNDQ-WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 151 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .+++++|.... ..+.+.++++|+++.+++|...++.... ........+|+||.||||+|+|+. +|.|+++ T Consensus 185 ~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~-----~g~gd~e 259 (511) T protein:vir:96 185 IAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNE-----RRKGDYE 259 (511) T ss_pred EEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCC-----CCCCchh Confidence 99999987543 2334668899999999999876554322 223456778999999999999864 6888886 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc---cccchhhhhc------chheecCCCCceEEEec-- Q lcl|NC_021324. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND---GENTTLDIYY------GRILTLASEAAKISEFK-- 294 (480) Q Consensus 226 ~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~---~~~~~~~~~~------~~~~~~~g~~~~~~~~~-- 294 (480) .+++|||+||.++|++++.++++++|+++++|.......+. .....+.... +.....++.++++.+++ T Consensus 260 -~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 338 (511) T protein:vir:96 260 -KVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYD 338 (511) T ss_pred -hhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccccccccccCCCCcceeEEeecCC Confidence 59999999999999999999999999999999654332211 1111111111 11112234445555443 Q ss_pred cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--- Q lcl|NC_021324. 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--- 371 (480) Q Consensus 295 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--- 371 (480) ....++++++|...++.++.+|++++..|++ ++||+||+++++.+.+||..+++.|+++|++++++|+++.+. T Consensus 339 ~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~ 414 (511) T protein:vir:96 339 VQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG----TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWS 414 (511) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 4457777777777777777777776666653 379999999999999999999999999999999999887542 Q ss_pred -CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_021324. 372 -EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKA 450 (480) Q Consensus 372 -~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (480) ....++..+++.|+++.|.|.++.+++++++. |++|.||+++++|+++|+.++++++++|+.+......... .. T Consensus 415 ~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~----G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~-~~ 489 (511) T protein:vir:96 415 IDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG----GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGI-YK 489 (511) T ss_pred cccccccccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcc-cc Confidence 23445678999999999999999999998873 4689999999999999888888877777655433322221 11 Q ss_pred CCCcCCCCC----CCCCCccCC Q lcl|NC_021324. 451 QADATPKPT----VTETKTETQ 468 (480) Q Consensus 451 ~~~~~~~~~----~~d~~~~~~ 468 (480) .+....+.+ ..+...+++ T Consensus 490 ~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 490 DPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCCCCcccccccccC Confidence 111111111 111111111 No 36 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=6e-80 Score=454.89 Aligned_cols=435 Identities=11% Similarity=0.096 Sum_probs=337.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc------cchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) +....++|.+|+++|..+++|++++.+||+|+|+|...... ..+...++|+++||+++|||+.++||+++|+++ T Consensus 22 ~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~~~~~ 101 (472) T protein:vir:93 22 PETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAF 101 (472) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhcccCeee Confidence 45567899999999999999999999999999998654322 123345789999999999999999999999886 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.|+++|+ |+++..+.+++++++++|+||++||. |++|++++++++|.+++|+||+...+++.++ T Consensus 102 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~------d~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~ 174 (472) T protein:vir:93 102 KHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYL------DEEGEFKLFRVPAEQGIPIWTDKEHEELEAF 174 (472) T ss_pred ccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEE------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 55556677777774 79999999999999999999999996 4578999999999999999998878889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCc------cccccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN------DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|...+. .++++|+++.+++|...++.. ....+.....+|+||.||||+|+|+ ++|+|+|++ T Consensus 175 ir~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~g~s~~e~- 244 (472) T protein:vir:93 175 IRMYKLENE----TKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHFSTGSWGKIPFIPFKNN-----DLEISDIFM- 244 (472) T ss_pred EEEEEeecc----eeEEEEecCeEEEEEEecCeeeecccccccccccccccCCCCCcceEEecCC-----CCCCCchhh- Confidence 999876543 357899999998887665421 1122344567899999999999986 479999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhe-ecCCCCceEEEec--cccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL-TLASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||+++|++++.+++|++|+++++|.+.++..+... .+ ....++ ..+++++++.+++ ....++++++ T Consensus 245 v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~--~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 320 (472) T protein:vir:93 245 YKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKR--LL--RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 320 (472) T ss_pred hHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHH--HH--hhccccccCCCCcceeEeecCCHHHHHHHHHH Confidence 999999999999999999999999999999987654332211 12 222333 3355667776544 4567788888 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.+++++++++..|+++ +||+||++++.+|..||+++++.|+++|++++++++.+.+.. .++.+++|+| T Consensus 321 l~~~i~~~s~~p~~~~~~~~~n----~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~i~v~f 394 (472) T protein:vir:93 321 LYQKIMLFGQAVDFSSDKFGSA----PSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIK--GEHKDVDISF 394 (472) T ss_pred HHHHHHHHhCCCCCCccccccC----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 8888888888888777666643 699999999999999999999999999999999999988743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) .++.|.|.++.+++++|++ |++|+||+++++|+++|+.++++++++|+.+.... +.+....+++.+++. +.. T Consensus 395 ~~~~p~~~~~~~~~~~k~~----giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~d~~~~~---~~~ 466 (472) T protein:vir:93 395 NYNKVANTELQVQTAQQSM----GIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQ-LPNLDDGGADGAQQQ---ERS 466 (472) T ss_pred CCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-ccCcCcccCCCCCCC---CCC Confidence 9999999999999999874 36899999999999998888887777776554322 222211111111111 111 Q ss_pred ccCCCC Q lcl|NC_021324. 465 TETQTS 470 (480) Q Consensus 465 ~~~~~~ 470 (480) .+.+.+ T Consensus 467 ~~~~~e 472 (472) T protein:vir:93 467 NNKESE 472 (472) T ss_pred CcccCC Confidence 111111 No 37 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=9.8e-80 Score=453.73 Aligned_cols=429 Identities=13% Similarity=0.104 Sum_probs=332.2 Q ss_pred CCCH----HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC- Q lcl|NC_021324. 1 MTTY----HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS- 75 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~- 75 (480) |+.. .+.|.+|+++|..+.+||+++.+||+|+|+|.+... ..+..+++|+++||+++|||+.++|++++|+++. T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 89 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKA-KDSWKPDNRLTNNFAKYIVDTFVGYFNGIPIKKTH 89 (453) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCC-CCccCccceeecchHHHHHHHhhhhhcccCceeec Confidence 2211 246888999999999999999999999999977543 3445568999999999999999999999998764 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEE Q lcl|NC_021324. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 76 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ++++..+.++++|+.|+|+.++.+++++++++|+||++||.+ ++|.+++++++|.+++|+||+..++.+.++++ T Consensus 90 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d------~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~ 163 (453) T protein:vir:73 90 DDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQN------ESTESEVIYCSPLNVFMVYDDSIKQKPLFAVY 163 (453) T ss_pred CChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeC------CCCceEEEEEcccceEEEEeCCCCceeEEEEE Confidence 555677889999999999999999999999999999999974 47899999999999999999988888888888 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHH Q lcl|NC_021324. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~ 235 (480) +|.+.+ + ..++++|+++++++|...++. ....+..+|+||.||||+|+|++ +|+|+|++ |++|||+| T Consensus 164 ~~~~~~--~-~~~~~vyt~~~i~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~~-----~g~s~~~~-v~~liDa~ 230 (453) T protein:vir:73 164 YGFDEE--G-NLSGTVYTLLETISITGKAGE----VKFGESTYNVYSDLPIVEYNFNE-----ERQSIFEP-VHSLINSY 230 (453) T ss_pred EEEecC--c-eEEEEEEeCCeEEEEEecCCc----eEEccceeccCCceeEEEecCCC-----CCCcchhh-HHHHHHHH Confidence 775432 2 467899999999998876432 34456788999999999999874 68899974 99999999 Q ss_pred HHHHHHHHHHHHHhcchhhhhcCCCccccccccc--cchhhh---hcchhe-ecCCCCceEEEec--cccHHHHHHHHHH Q lcl|NC_021324. 236 SRTLMNLQSASQILGTPLRVISGVTTDELTNDGE--NTTLDI---YYGRIL-TLASEAAKISEFK--AAELRNFAEEMEV 307 (480) Q Consensus 236 ~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~--~~~~~~---~~~~~~-~~~g~~~~~~~~~--~~~~~~~~~~l~~ 307 (480) |+++|++++.+++|++|+++++|+.++.+..... ...+.. ..+... ...+.++++.+++ ....+++++.|+. T Consensus 231 ~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~ 310 (453) T protein:vir:73 231 NKVTSEKANDVEYFSDQYLVFLGAEVDEEDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLER 310 (453) T ss_pred HHHHHHHHHHHHHhccceeeeecCCCCchhhhcccccccccccccccccccccccCceeEEeeecCCHHHHHHHHHHHHH Confidence 9999999999999999999999987664332221 111111 011111 1123345565554 3445666666676 Q ss_pred HHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCcccceeeeEEecC Q lcl|NC_021324. 308 FRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-REVTEEYTRLETVWRD 386 (480) Q Consensus 308 ~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-~~~~~~~~~~~v~f~~ 386 (480) .|+.++.++++++..| +++||+||++++.+|.+||.++++.|+.+|++++++++++.+ .+...++.+++|+|++ T Consensus 311 ~I~~~s~~p~~~~~~~-----gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~ 385 (453) T protein:vir:73 311 SIFQFTMAANISDENF-----GNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTR 385 (453) T ss_pred HHHHHhCCcccCcccc-----cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCC Confidence 6666666665554443 347999999999999999999999999999999999998865 3445567789999999 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCC Q lcl|NC_021324. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTET 463 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) +.|.|.++.+++++|++ +++|+||+++++|+++|+.++++++++|+++....+...... ++++.-++- T Consensus 386 ~~p~~~~~~a~~~~k~~----giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~-----~~~~~~~~~ 453 (453) T protein:vir:73 386 NEPKDIKEQAETANILK----GITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNLV-----RMKQMRGNL 453 (453) T ss_pred CCCCCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccCC-----cchhhhcCC Confidence 99999999999999885 368999999999999988888887777766544332222111 111211222 No 38 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=1.7e-79 Score=452.46 Aligned_cols=428 Identities=14% Similarity=0.129 Sum_probs=329.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc-CCCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~~ 79 (480) |+ .+.|.+|+++|..+++|+.++.+||+|+|+|.+.... .+.+.++|+++||+++||++.++||+++|+++ +++++ T Consensus 17 ~~--~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~d~~ 93 (452) T protein:vir:36 17 IT--VEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAK-DSWKPDNRLAVNFTKYIVDTFTGYFNGIPVKKSHSDKE 93 (452) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccc-cccCccceeecchHHHHHHHHhhhhcccCceeecCChh Confidence 53 3578889999999999999999999999999775432 33456889999999999999999999999875 45666 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..+.|+++|+.|+|+.++.+++++++++|+||+++|. |++|++++++++|.+++|+||+...+++.+++++|.. T Consensus 94 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~~ 167 (452) T protein:vir:36 94 ILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQ------DEDTQTNVVYNSPENMFMVYDDTVKQEPLFAVRYGVD 167 (452) T ss_pred HHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEe------cCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEEe Confidence 7889999999999999999999999999999999996 4578999999999999999999888889999998875 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) .+. ..++++|+++++++|...++. ....+..+|+||.||||+|+|++ +|+|+++ .|++|||+||+++ T Consensus 168 ~~~---~~~~~vyt~~~i~~~~~~~~~----~~~~~~~~~~~g~iPvv~~~n~~-----~g~sd~e-~v~~liDa~d~~~ 234 (452) T protein:vir:36 168 EDK---KLQGEVYTLLETIKISGENDE----ISFGEGTYNPYPDLPVVEFYFNE-----ERMSIFE-SVISLVNAFNKAI 234 (452) T ss_pred cCc---eEEEEEEecCeEEEEEEcCCc----eEEecceeccCCcccEEEecCCC-----CCCcchH-HHHHHHHHHHHHH Confidence 443 467899999999999876542 34456778999999999999874 5889996 4999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC------CCCceEEEeccccHHHHHHHHHHHHHHHH Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA------SEAAKISEFKAAELRNFAEEMEVFRKEAA 313 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~------g~~~~~~~~~~~~~~~~~~~l~~~~~~i~ 313 (480) |++++.++++++|+++++|+..+..... +...++++.++ +.++++.+++. +.+.+...++.+...|+ T Consensus 235 s~~~~~~~~~~~p~~~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~I~ 307 (452) T protein:vir:36 235 SEKANDVDYFSDQYLTFLGAAVEEEDLK------NIRSNRVINYYADGEGKNVDVKFLEKPD-SDSQTENLLDRLTKLIF 307 (452) T ss_pred HHHHHHHHHhcCceeEeecCCcCchhhh------hhhhcceEEecCCCCccCCcceeEeecC-CHHHHHHHHHHHHHHHH Confidence 9999999999999999999876532211 12223333332 12355655543 23444444444444444 Q ss_pred hhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CCcccceeeeEEecCCCCCCH Q lcl|NC_021324. 314 SITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTV 392 (480) Q Consensus 314 ~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~~~~~~~~~v~f~~~~~~~~ 392 (480) ..+++|...++. .+++||+||++++++|.+||..+++.|+.+|++++++++++.+. +...++.+++|.|+++.|.|+ T Consensus 308 ~~s~~p~~~~~~--~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~f~~~~p~d~ 385 (452) T protein:vir:36 308 QTTMVANISDES--FGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYTFTRNEPKDI 385 (452) T ss_pred HHhCccccCccc--ccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCcCH Confidence 555555433332 13479999999999999999999999999999999999988653 344566789999999999999 Q ss_pred HHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 393 AAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 393 ~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) ++.+++++|++ +++|.||+++++|+++|+.++++++++|+++... .......++...+...++...| T Consensus 386 ~~~a~~~~k~~----g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~---~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 386 KEQAETANILM----GITSQETALSVISVIPDVQAEMEKIKKEEASTAI---FDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred HHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH---HHhhccCCCCcccccCccccCC Confidence 99999999873 3689999999999998888888877777655322 1111112222222222222222 No 39 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=2.1e-79 Score=451.93 Aligned_cols=434 Identities=11% Similarity=0.102 Sum_probs=335.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+..++|++|+++|..+.+|+.++.+||+|+|+|.+..... .+..+++|+++||+++||++.++||+++|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~ 104 (474) T protein:vir:96 25 VETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY 104 (474) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCcee Confidence 667789999999999999999999999999999997653221 22345789999999999999999999999986 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.++++++ |+++..+.+++++++++|+||+++|. |++|++++++++|.+++|+||+.....+.++ T Consensus 105 ~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ 177 (474) T protein:vir:96 105 AHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYI------NEDGELKLFRVPAEQAIPIWTDKEREQLNAF 177 (474) T ss_pred ccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeee------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 5 44456677777764 78999999999999999999999996 4578999999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc------ccccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... ..++++|+++++++|...++... ......+..+|+||.||||+|+|+ ++|.|++++ T Consensus 178 ir~~~~~~----~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~~~~d~e~- 247 (474) T protein:vir:96 178 IRIFTFNG----ETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNN-----PEEVSDIWM- 247 (474) T ss_pred EEEEeecC----eeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCC-----CCCCCchHH- Confidence 99986432 35789999999999987755321 123345567899999999999987 457888865 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-CCCCceEEEec--cccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||.++|++++.+++|++|+++++|+..++..+.. ......+++.+ +++++++.+++ ....+.++++ T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~----~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:96 248 YKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFM----EGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchh----hhhhccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 99999999999999999999999999999998765433221 12233344433 45567776554 4456677777 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.++.++++.+..|++ ++||+||+++++++.+||..+++.|+++|++++++++++.|.. .+..+++++| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~--~d~~~i~i~f 397 (474) T protein:vir:96 324 MRAYIVEFGQGVDFQTDKFGS----ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK--LDAKEIEITF 397 (474) T ss_pred HHHHHHHHhCCcCcccccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777777666665553 3799999999999999999999999999999999999998743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.++++.+ + |++|+||+++++|+++|+.++++++++|+.+.. ..+.....+.++...++..++.. T Consensus 398 ~~~~p~~~~e~a~~~~~---~--giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:96 398 NFNVMVNDLEQSQIGAQ---S--QYLSKETLVRHHPWVDDPKAELERLDEEQLELN-KQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred cCCCccCHHHHHHHHHH---c--CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccccccCCCCCCcCCCCcc Confidence 99999999999987654 3 579999999999999998888877777765432 22222222222222221111111 Q ss_pred ccCCCCCCCCCccc Q lcl|NC_021324. 465 TETQTSPSGFNRTK 478 (480) Q Consensus 465 ~~~~~~~~~~~~~~ 478 (480) .+. T Consensus 472 -----------e~~ 474 (474) T protein:vir:96 472 -----------QSK 474 (474) T ss_pred -----------ccC Confidence 111 No 40 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=2.1e-79 Score=451.93 Aligned_cols=434 Identities=11% Similarity=0.102 Sum_probs=335.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+..++|++|+++|..+.+|+.++.+||+|+|+|.+..... .+..+++|+++||+++||++.++||+++|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~ 104 (474) T protein:vir:95 25 VETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTY 104 (474) T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCcee Confidence 667789999999999999999999999999999997653221 22345789999999999999999999999986 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.++++++ |+++..+.+++++++++|+||+++|. |++|++++++++|.+++|+||+.....+.++ T Consensus 105 ~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ 177 (474) T protein:vir:95 105 AHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYI------NEDGELKLFRVPAEQAIPIWTDKEREQLNAF 177 (474) T ss_pred ccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeee------CCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 5 44456677777764 78999999999999999999999996 4578999999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc------ccccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... ..++++|+++++++|...++... ......+..+|+||.||||+|+|+ ++|.|++++ T Consensus 178 ir~~~~~~----~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn-----~~~~~d~e~- 247 (474) T protein:vir:95 178 IRIFTFNG----ETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNN-----PEEVSDIWM- 247 (474) T ss_pred EEEEeecC----eeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCC-----CCCCCchHH- Confidence 99986432 35789999999999987755321 123345567899999999999987 457888865 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-CCCCceEEEec--cccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||.++|++++.+++|++|+++++|+..++..+.. ......+++.+ +++++++.+++ ....+.++++ T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~----~~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:95 248 YKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFM----EGLKYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchh----hhhhccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 99999999999999999999999999999998765433221 12233344433 45567776554 4456677777 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.++.++++.+..|++ ++||+||+++++++.+||..+++.|+++|++++++++++.|.. .+..+++++| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~--~d~~~i~i~f 397 (474) T protein:vir:95 324 MRAYIVEFGQGVDFQTDKFGS----ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIK--LDAKEIEITF 397 (474) T ss_pred HHHHHHHHhCCcCcccccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777777666665553 3799999999999999999999999999999999999998743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) +++.|.|+++.++++.+ + |++|+||+++++|+++|+.++++++++|+.+.. ..+.....+.++...++..++.. T Consensus 398 ~~~~p~~~~e~a~~~~~---~--giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:95 398 NFNVMVNDLEQSQIGAQ---S--QYLSKETLVRHHPWVDDPKAELERLDEEQLELN-KQLPNLDDGGADGAQQQQQSENN 471 (474) T ss_pred cCCCccCHHHHHHHHHH---c--CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccccccCCCCCCcCCCCcc Confidence 99999999999987654 3 579999999999999998888877777765432 22222222222222221111111 Q ss_pred ccCCCCCCCCCccc Q lcl|NC_021324. 465 TETQTSPSGFNRTK 478 (480) Q Consensus 465 ~~~~~~~~~~~~~~ 478 (480) .+. T Consensus 472 -----------e~~ 474 (474) T protein:vir:95 472 -----------QSK 474 (474) T ss_pred -----------ccC Confidence 111 No 41 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=3.6e-79 Score=450.64 Aligned_cols=451 Identities=12% Similarity=0.075 Sum_probs=340.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc---------chhhhcceeccchHHHHHHHHHhhhccCc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA---------PPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~---------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) +....++|++++++| +.+++.++.+||.|+|+|....... .....++|+++||+++||++.++|++++| T Consensus 27 ~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g~~ 104 (503) T protein:vir:59 27 AEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVGEP 104 (503) T ss_pred cchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhcCC Confidence 444467888888888 5689999999999999986654322 12234779999999999999999999999 Q ss_pred cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceE Q lcl|NC_021324. 72 FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 72 ~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~ 151 (480) +++..+++....+.+.|..|+++.++.+++++++++|+||++||. |++|++++++++|.+++|+||+....++. T Consensus 105 ~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~dg~~~i~~~~p~~~~~i~d~~~~~~~~ 178 (503) T protein:vir:59 105 VTFTSDNKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFV------DEEGEFDYVIFPAEEMIVVYKDNTRRDIL 178 (503) T ss_pred eeeccCcHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEee------cCCCceEEEEEccceeEEEEeCCCCCceE Confidence 887655444444445555689999999999999999999999986 45789999999999999999998888999 Q ss_pred EEEEEEEEecC-CceEEEEEEEeCCcEEEEEecCCCcccc----------ccccccccccccccceEEeecccccCccCC Q lcl|NC_021324. 152 RAVRLYTTRDD-VAVPDRATLYLPDETVPLRRNGGLNDQW----------VVDGDVIKHGLGVVPVVPLTNDPRLGNRYG 220 (480) Q Consensus 152 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G 220 (480) +++++|...+. .....++++|+++++++|...+...... ....+..+|+|++||+|+|+|++ +| T Consensus 179 ~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~-----~~ 253 (503) T protein:vir:59 179 FALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNE-----EM 253 (503) T ss_pred EEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEecCCC-----CC Confidence 99999876543 4457789999999999998765432111 11234578999999999999874 58 Q ss_pred CccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-CCCCceEEEe--cccc Q lcl|NC_021324. 221 RSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEF--KAAE 297 (480) Q Consensus 221 ~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~--~~~~ 297 (480) .|+++. +++|||+||+++|++++.++++++|+++++|.++++..+... .+...+++.. +++++++.++ +... T Consensus 254 ~sd~~~-~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~l~~~~~~~~ 328 (503) T protein:vir:59 254 VSDLKF-YKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTA----NLRYHSVIKVSGDGGVDTLRAEIPVDS 328 (503) T ss_pred Ccchhh-hHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhh----hhhcccceeccCCCcceeEeccCCHHH Confidence 888865 999999999999999999999999999999987665332221 2333444443 3445777554 4556 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---CCc Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---EVT 374 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~~~ 374 (480) .+.+++.|+..+++++.++++++..|+++ +||+||++++.+|.+||+++++.|+.+|++++++++.+.+. ... T Consensus 329 ~~~~~~~l~~~i~~~s~~p~~~~~~~~~~----~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~ 404 (503) T protein:vir:59 329 AAKELERIQDELYKSAQAVDNSPETIGGG----ATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDF 404 (503) T ss_pred HHHHHHHHHHHHHHHhcccCCCccccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Confidence 78888999999999999998887776643 79999999999999999999999999999999998877542 222 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCc Q lcl|NC_021324. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 375 ~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) ....++.|+|++++|.|+++.++++++++++| ++|+||+++++|+++|+.++++++++|+++... .......... T Consensus 405 ~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~G--iiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~-~~~~~~~~~~-- 479 (503) T protein:vir:59 405 NPDKELTMTFTRTRIQNDSEIVQSLVQGVTGG--IMSKETAVARNPFVQDPEEELARIEEEMNQYAE-MQGNLLDDEG-- 479 (503) T ss_pred ccccceeEEeCCCCCCCHHHHHHHHHHHHhCC--CCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHh-hhccccCccC-- Confidence 34467999999999999999999999999886 689999999999998887888777766554322 1111111111 Q ss_pred CCCCCCCCCCccCCCCCCCCCcccC Q lcl|NC_021324. 455 TPKPTVTETKTETQTSPSGFNRTKT 479 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~~~~~~~~~~ 479 (480) +... ..+..++...+.++.+|..| T Consensus 480 ~~~~-~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 480 GDDD-LEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred CCCC-CCcCCCCCCcccCCCCCCcC Confidence 1100 00111111111222233233 No 42 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=3.4e-79 Score=450.74 Aligned_cols=420 Identities=13% Similarity=0.098 Sum_probs=327.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~ 79 (480) || .++|.+|+++|..+.+||+++.+||+|+|+|..... ..+...++|+++||+++||++.++||+++|+++. +++. T Consensus 1 l~--~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~-~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~ 77 (429) T protein:vir:98 1 MT--KDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQ-KEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHENKQ 77 (429) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc-cccCCCcceeecchHHHHHHHHhhhhcccCceeecCChH Confidence 76 477999999999999999999999999999865433 3344568899999999999999999999998764 4555 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..+.++++|+.|+|+.++.+++++++++|+||++||. +++|+|++++++|.+++|+||+..+.++.+++++|.. T Consensus 78 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~ 151 (429) T protein:vir:98 78 VSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFN------DENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYN 151 (429) T ss_pred HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------cCCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEe Confidence 6788999999999999999999999999999999986 4579999999999999999998888889999998864 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) . +...+..+|+.+.+++|...++. ....+..+|+||+||||+|+|+ ++|+|+|++ +++|+|+||+++ T Consensus 152 ~---~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~-v~~liD~~d~~~ 218 (429) T protein:vir:98 152 K---GGVLEGSYSDASNITYFKDGEKG----IEIGESEPHPFDGVPMIEYVEN-----EERQSLLAS-VVTLINAFNKAI 218 (429) T ss_pred c---CceEEEEEEeCceEEEEEecCCc----eEecccccccCCccceEEecCC-----CCCCCcHHH-HHHHHHHHHHHH Confidence 3 23567788999888888765432 3445677999999999999986 468999975 999999999999 Q ss_pred HHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCC-----CceEEEecc--ccHHHHHHHHHHHHHHH Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASE-----AAKISEFKA--AELRNFAEEMEVFRKEA 312 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~-----~~~~~~~~~--~~~~~~~~~l~~~~~~i 312 (480) |++++.++++++|+++++|...+.... .+...++++.++++ ++++.+++. ...+++++.|...++.+ T Consensus 219 s~~~~~~~~~~~p~~~i~g~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~ 292 (429) T protein:vir:98 219 SEKANDVEYFADAYLKILGAELDDETL------KSLRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLENLIFRT 292 (429) T ss_pred HHHHHHHHHhcCceeeeecCCCCcchh------hhHhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHH Confidence 999999999999999999987653211 12333455554322 345555543 23444445555555555 Q ss_pred HhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCC Q lcl|NC_021324. 313 ASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPT 391 (480) Q Consensus 313 ~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~v~f~~~~~~~ 391 (480) +.++++++..+ +++||+||++++.+|.+|+.++++.|+.+|++++++++++.+.. ...++.+++|.|+++.|.| T Consensus 293 s~~p~~~~~~~-----gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~~ 367 (429) T protein:vir:98 293 AMVANISDESF-----GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPAN 367 (429) T ss_pred hCccccCcccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCcC Confidence 54444443333 34799999999999999999999999999999999999987633 3455678999999999999 Q ss_pred HHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 392 VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) +++.|++++|+. +++|+||+++++|+++|+.++++++++|+.+.... .+....++ ..+...| T Consensus 368 ~~~~a~~~~kl~----g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~-~~~~~~~~----~~~~~~~ 429 (429) T protein:vir:98 368 LLEESQIAGNLA----GIVSEETQVGVLSIVENPQKEIERKNSDKSTLISR-QAGGLNGQ----NTTTILE 429 (429) T ss_pred HHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH-HHhhhcCC----CCCCCCC Confidence 999999999873 36899999999999998888888777776654322 22211111 1111111 No 43 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=1.6e-78 Score=447.03 Aligned_cols=438 Identities=11% Similarity=0.083 Sum_probs=331.0 Q ss_pred CCCH----HHHHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc- Q lcl|NC_021324. 1 MTTY----HEHVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI- 74 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~- 74 (480) |+.. .+.|.+++++|.. +.+||+++.+||+|+|+|.+... ++..+++|+++||+++||++.++|++++|+++ T Consensus 19 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~--~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~ 96 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPE--KETGADNRIVVNSAKYVVDVYNGYFCGIEPKLA 96 (470) T ss_pred eCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcc--cccCCcceeecchHHHHHHHHhhhhccCCeeEe Confidence 3322 2346678888865 45899999999999999976533 34456889999999999999999999999764 Q ss_pred -CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 -SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 -~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) .+|.+..+.|+++|+.|+|+.++.+++++++++|+||++||. +++|++++++++|.+++|+||+...+.+.++ T Consensus 97 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ 170 (470) T protein:vir:99 97 LLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQ------GEDARPHLMYSSPNHAFIIYDDTVQRQPLAF 170 (470) T ss_pred eCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEe------CCCCeEEEEEEccceeEEEEcCCCCcceEEE Confidence 455666789999999999999999999999999999999996 3578999999999999999999888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +++|....+.....++++|+++.+++|...+... .....+..+|+||.||||+|+|++ +|+|++++ |++||| T Consensus 171 vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vPvv~~~n~~-----~g~sd~e~-v~~liD 242 (470) T protein:vir:99 171 VHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEE--DTNAAGYAINPYGLVPAVEFFENE-----ERQGIFDS-IKTLIN 242 (470) T ss_pred EEEEEEecCCeeEEEEEEEecCeEEEEEeccccc--ccccccccccCCCccceEeecCCC-----CCCcchHh-HHHHHH Confidence 9999887777777889999999999988765432 234456678999999999999864 68999975 999999 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC----CCCceEEEec-cccHHHHHHHHHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA----SEAAKISEFK-AAELRNFAEEMEVF 308 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~----g~~~~~~~~~-~~~~~~~~~~l~~~ 308 (480) +||+++|++++.++++++|+++++|+..+.... +.........+++.++ +.++++..+. +.+.+.+...++.+ T Consensus 243 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 320 (470) T protein:vir:99 243 ALDKVISQKANQVEYFDNAYMYMIGFKLPEDDE--GNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHL 320 (470) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCcccccc--cchhhhhhhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHH Confidence 999999999999999999999999987664322 2222333444444432 2334444332 22344555555555 Q ss_pred HHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecC Q lcl|NC_021324. 309 RKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRD 386 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~~~~~~~~~v~f~~ 386 (480) ...|+..+++|+..++..+ +++||+||++++.+|.+||..+++.|+.+|++++++++.+.+. ....+..+++++|++ T Consensus 321 ~~~i~~~s~~p~~~~~~~~-~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~~ 399 (470) T protein:vir:99 321 TDFIFMMAMVPNIQDKNFA-GNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFKFTR 399 (470) T ss_pred HHHHHHHhCCccccccccc-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEEeCC Confidence 5666666666654444332 3479999999999999999999999999999999999887653 334456789999999 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +.|.|.++.++++++++ +++|.||+++++|+++ +.++++++++|+.+.... ............. ++.++ T Consensus 400 ~~p~~~~e~a~~~~kl~----giis~et~l~~l~~vd-~~~E~eri~~E~~~~~~~-~~~~~~~~d~~~~-----d~~~e 468 (470) T protein:vir:99 400 NLPEDMASAIDNAKNAE----GIVSKKTQLGMIPDIE-PDAEMKQIAKEKADAIKQ-TQQLSMPIDILKR-----DNNAE 468 (470) T ss_pred CCCcCHHHHHHHHHHHh----ccCCHHHHHHhCCCCC-HHHHHHHHHHHHHHHHHH-HHhhcCCCCcCCC-----CCCcc Confidence 99999999999999875 3689999999999984 455666666665443222 2211111111111 11221 Q ss_pred CC Q lcl|NC_021324. 467 TQ 468 (480) Q Consensus 467 ~~ 468 (480) .+ T Consensus 469 e~ 470 (470) T protein:vir:99 469 EE 470 (470) T ss_pred CC Confidence 11 No 44 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=5.9e-79 Score=449.48 Aligned_cols=434 Identities=11% Similarity=0.097 Sum_probs=334.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc------cchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+-.++|++|+++|..+.+|++++.+||+|+|+|.+.... .+...+++|+++||+++||++.++|++++|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~ 104 (474) T protein:vir:97 25 FETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTY 104 (474) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCcee Confidence 44556899999999999999999999999999998654321 233456889999999999999999999999987 Q ss_pred CCCch-hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 SEDSE-GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~~d~~-~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ..+++ ..+.++.++ .|+++..+.++++.++++|+||+++|. |++|++++++++|.+++|+||+.....+.++ T Consensus 105 ~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 177 (474) T protein:vir:97 105 SCEDENVLKVIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYI------NENGEMKLFRVPAEQAIPIWVDKEREELKSF 177 (474) T ss_pred ccCcHHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEe------cCCCeeEEEEEcccceEEEEcCCCCCceEEE Confidence 55444 445566555 589999999999999999999999996 4578999999999999999998888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccc------cccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQ------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... ..++++|+++.+++|...++.... ........+|+||+||||+|+|+ ++|.|++++ T Consensus 178 ir~~~~~~----~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~- 247 (474) T protein:vir:97 178 IRYYKFNN----EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNN-----PEEVSDIWM- 247 (474) T ss_pred EEEEEecC----eEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCC-----cCCCCcHHH- Confidence 99986533 357899999999999887653221 11234467899999999999987 468999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-CCCCceEEEec--cccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||+++|++++.++++++|+++++|++++...+... .....+++.+ +++++++.+++ ....+++++. T Consensus 248 v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~----~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:97 248 YKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMR----GLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDL 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh----hhhccceeeccCCCceeEEeecCCHHHHHHHHHH Confidence 999999999999999999999999999999987654332211 2223344433 45567776654 3456667777 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.++.++++++..|++ ++||+||++++.++.+||.++++.|+++|++++++++++.+.. .++.+++|+| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~d~~~i~v~f 397 (474) T protein:vir:97 324 MRVYIMEFGQGVDFQTDKFGS----APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK--TDVKDIEISF 397 (474) T ss_pred HHHHHHHHhCccccCcccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777777766666654 3799999999999999999999999999999999999998743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) .++.|.|+++.|+++.+ + +++|+||++.++|+++|+.++++++++|+.+.. ..+........+.+++.+..+.. T Consensus 398 ~~~~p~~~~e~a~~~~~---~--g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:97 398 NFNRMMNDAEQSQIIAQ---S--QYLSRETLVKSSPLVDDYKAELERIEQEQMEYN-KQLPNLDDGGADGAQQQEGSNNK 471 (474) T ss_pred ccCcccCHHHHHHHHHH---c--CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccCCCCCCCcccCCCCccc Confidence 99999999999998765 3 468999999999999988888877777665432 22222222222222221111111 Q ss_pred ccC Q lcl|NC_021324. 465 TET 467 (480) Q Consensus 465 ~~~ 467 (480) .+. T Consensus 472 ~~e 474 (474) T protein:vir:97 472 ESE 474 (474) T ss_pred ccC Confidence 111 No 45 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=5.9e-79 Score=449.48 Aligned_cols=434 Identities=11% Similarity=0.097 Sum_probs=334.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc------cchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+-.++|++|+++|..+.+|++++.+||+|+|+|.+.... .+...+++|+++||+++||++.++|++++|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~p~~~ 104 (474) T protein:vir:94 25 FETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTY 104 (474) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcCCcee Confidence 44556899999999999999999999999999998654321 233456889999999999999999999999987 Q ss_pred CCCch-hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 SEDSE-GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~~d~~-~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ..+++ ..+.++.++ .|+++..+.++++.++++|+||+++|. |++|++++++++|.+++|+||+.....+.++ T Consensus 105 ~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~~~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 177 (474) T protein:vir:94 105 SCEDENVLKVIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYI------NENGEMKLFRVPAEQAIPIWVDKEREELKSF 177 (474) T ss_pred ccCcHHHHHHHHHHH-hccHHHHHHHHHHHHhhcCceEEEEEe------cCCCeeEEEEEcccceEEEEcCCCCCceEEE Confidence 55444 445566555 589999999999999999999999996 4578999999999999999998888899999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccc------cccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQ------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|.... ..++++|+++.+++|...++.... ........+|+||+||||+|+|+ ++|.|++++ T Consensus 178 ir~~~~~~----~~~~~~yt~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn-----~~g~sd~e~- 247 (474) T protein:vir:94 178 IRYYKFNN----EEKVEFWTDTTVTYYVLENGGLIPDYYYGANHVQSHFSNGNWGRVPFIAFKNN-----PEEVSDIWM- 247 (474) T ss_pred EEEEEecC----eEEEEEEeCCeEEEEEEcCCccccccccCcCcccccccccCCCccceEEecCC-----cCCCCcHHH- Confidence 99986533 357899999999999887653221 11234467899999999999987 468999975 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-CCCCceEEEec--cccHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFK--AAELRNFAEE 304 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~--~~~~~~~~~~ 304 (480) |++|||+||+++|++++.++++++|+++++|++++...+... .....+++.+ +++++++.+++ ....+++++. T Consensus 248 v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~----~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:94 248 YKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMR----GLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDL 323 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh----hhhccceeeccCCCceeEEeecCCHHHHHHHHHH Confidence 999999999999999999999999999999987654332211 2223344433 45567776654 3456667777 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) |+..++.++.++++++..|++ ++||+||++++.++.+||.++++.|+++|++++++++++.+.. .++.+++|+| T Consensus 324 l~~~I~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~--~d~~~i~v~f 397 (474) T protein:vir:94 324 MRVYIMEFGQGVDFQTDKFGS----APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK--TDVKDIEISF 397 (474) T ss_pred HHHHHHHHhCccccCcccccc----ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEe Confidence 777777777777766666654 3799999999999999999999999999999999999998743 4567899999 Q ss_pred cCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 385 RDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) .++.|.|+++.|+++.+ + +++|+||++.++|+++|+.++++++++|+.+.. ..+........+.+++.+..+.. T Consensus 398 ~~~~p~~~~e~a~~~~~---~--g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:94 398 NFNRMMNDAEQSQIIAQ---S--QYLSRETLVKSSPLVDDYKAELERIEQEQMEYN-KQLPNLDDGGADGAQQQEGSNNK 471 (474) T ss_pred ccCcccCHHHHHHHHHH---c--CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccCCCCCCCcccCCCCccc Confidence 99999999999998765 3 468999999999999988888877777665432 22222222222222221111111 Q ss_pred ccC Q lcl|NC_021324. 465 TET 467 (480) Q Consensus 465 ~~~ 467 (480) .+. T Consensus 472 ~~e 474 (474) T protein:vir:94 472 ESE 474 (474) T ss_pred ccC Confidence 111 No 46 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=1.5e-78 Score=447.26 Aligned_cols=433 Identities=11% Similarity=0.097 Sum_probs=333.9 Q ss_pred CC-----------------------CHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhccee Q lcl|NC_021324. 1 MT-----------------------TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDV 51 (480) Q Consensus 1 ~~-----------------------~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~ 51 (480) |. +..++|.+|+.+|..+.++++++.+||+|+|+|....... .+..+++|+ T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 80 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRM 80 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccccccccccccccee Confidence 22 3377999999999999999999999999999986543221 123457799 Q ss_pred ccchHHHHHHHHHhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEE Q lcl|NC_021324. 52 QPGWVATYLRTLSDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLI 130 (480) Q Consensus 52 ~~n~~~~iVd~~~~~l~~~~~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~ 130 (480) ++||+++||++.++|++++++++ +++++..+.|+++++ |+++.++.+++++++++|+||+++|. |++|++++ T Consensus 81 ~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~------d~~g~~~~ 153 (478) T protein:vir:10 81 YTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYV------DEEGEFKT 153 (478) T ss_pred ccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEe------cCCCeeEE Confidence 99999999999999999999875 445556778888875 78999999999999999999999996 45789999 Q ss_pred EEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc----------cccccccccccc Q lcl|NC_021324. 131 RVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND----------QWVVDGDVIKHG 200 (480) Q Consensus 131 ~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~ 200 (480) ++++|.+++|+||+...+.+.+++++|.... ..++++|+++++++|...++... .........+|+ T Consensus 154 ~~~~p~~~~~i~d~~~~~~~~~~v~~~~~~~----~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 229 (478) T protein:vir:10 154 FRVPAEQAVPIWTNKERDELQAFIRVYELDG----AERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMS 229 (478) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEecC----ceEEEEEeCCeEEEEEEcCCeeeccccccccccccceeccccccc Confidence 9999999999999887788999999986533 35789999999999887654321 111234567899 Q ss_pred ccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchh Q lcl|NC_021324. 201 LGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRI 280 (480) Q Consensus 201 ~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~ 280 (480) ||.||||+|+|+ ++|.|++++ |++|||+||.++|++++.+++|++|+++++|+++++..+.. ..+...++ T Consensus 230 ~~~vPvv~~~n~-----~~g~sd~~~-v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~----~~~~~~~~ 299 (478) T protein:vir:10 230 WGRVPFIPFKNN-----PQEVSDLFM-YKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFM----HNLKYYKA 299 (478) T ss_pred CCccceEEeccC-----CCCCCcHHH-HHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhh----hhhhhcce Confidence 999999999986 578999975 99999999999999999999999999999998776543322 22333444 Q ss_pred eec---CCCCceEEEec--cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 281 LTL---ASEAAKISEFK--AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 281 ~~~---~g~~~~~~~~~--~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) +.+ +|+++++.+++ ..+.+++++.|+..++.++.++++++..|++ ++||+||+++++.|.+||+.+++.|+ T Consensus 300 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~~~~~~~ 375 (478) T protein:vir:10 300 ISVAGESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGN----SPSGIALKFMYSNLDLKANKLKNKTL 375 (478) T ss_pred EEecCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccccc----ccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433 45567776654 3456667777777777777766666555543 47999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHH Q lcl|NC_021324. 356 GAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQ 435 (480) Q Consensus 356 ~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~ 435 (480) ++|++++++++++.|.. .+..+++|+|+++.|.|.++.|++++++ + |++|+||++.++|+++|+.++++++++| T Consensus 376 ~~l~~~~~li~~~~g~~--~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~--g~iS~et~~~~l~~v~D~~~E~~ri~~E 449 (478) T protein:vir:10 376 TALQELLQYIIDFYRLD--VKVQDIEITFNFNVMVNELENSQIAMNS--T--GLLSKETILSNHAWVEDPVAEMERIEQE 449 (478) T ss_pred HHHHHHHHHHHHHhCCC--cccccceEEecCCCCCCHHHHHHHHHHH--h--CCCChHHHHHhCCCCCCHHHHHHHHHHH Confidence 99999999999998744 4567899999999999999999999887 2 3689999999999998888888777766 Q ss_pred HHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 436 ETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +++.. +.+.....+..+ .++++..+.+.+ T Consensus 450 ~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~ 478 (478) T protein:vir:10 450 NIELN-QQLPDIEEGLNG-EQQRQSENNQPE 478 (478) T ss_pred HHHHH-hhccccccccCC-CCCCCCCCCCCC Confidence 54422 222222221111 111111111111 No 47 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=2.5e-78 Score=446.03 Aligned_cols=427 Identities=12% Similarity=0.103 Sum_probs=333.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) || .+.|.+|+++|..+.+|+.++.+||+|+|+|....... ....+++|+++||+++||++.++|++|+++++ T Consensus 1 l~--~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~ 78 (451) T protein:vir:10 1 ME--LEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLF 78 (451) T ss_pred CC--HHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheeccccee Confidence 44 46788899999999999999999999999997654322 12235779999999999999999999999765 Q ss_pred --CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccC--CCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 75 --SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG--DPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 75 --~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~--d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) .++++..+.+ +.|..|+++.++.++++.++++|+||+++|.++.... ...|+++++.++|.+++|+||+...+.+ T Consensus 79 ~~~~~~~~~~~~-~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~ 157 (451) T protein:vir:10 79 DIDNNKELNEKV-TDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIEREL 157 (451) T ss_pred ecCCcHHHHHHH-HHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCce Confidence 3444444444 4555699999999999999999999999998653221 2347899999999999999998888899 Q ss_pred EEEEEEEEEecCC------ceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccc Q lcl|NC_021324. 151 TRAVRLYTTRDDV------AVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 151 ~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i 224 (480) .+++|+|...++. ....++++||++.+++|...+..........+..+|+||.||||+|.|+. +|.|++ T Consensus 158 ~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-----~~~~d~ 232 (451) T protein:vir:10 158 EAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNI-----KKQSDL 232 (451) T ss_pred EEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCC-----CCCCch Confidence 9999999766543 23568899999999999887766555666777889999999999999875 467788 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec------CCCCceEEEec--cc Q lcl|NC_021324. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL------ASEAAKISEFK--AA 296 (480) Q Consensus 225 ~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~------~g~~~~~~~~~--~~ 296 (480) + .|++|||+||.++|++++.+++|++|+++++|+..++..+.. .++...+++.+ .++++++.+++ .. T Consensus 233 e-~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~ 307 (451) T protein:vir:10 233 S-KYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFL----KELKRYKTIKTETDSEGDSGGLKTMQIEIPTE 307 (451) T ss_pred h-hHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhH----HHHhhCCeEEecCcCCccCCcceEEeecCCHH Confidence 5 599999999999999999999999999999998765433221 12233333332 23456776554 34 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021324. 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 297 ~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 376 (480) ..+.++++|+..|+.++.++++++..|| ++||+||++++.+|.+||+.+++.|+++|++++++++++++.. + T Consensus 308 ~~~~~~~~l~~~I~~~s~~p~~~~~~~g-----n~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~---d 379 (451) T protein:vir:10 308 ARKIILEILKKQIYESGQGLQQDTENFG-----NASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLGVT---D 379 (451) T ss_pred HHHHHHHHHHHHHHHHhCcccccccccc-----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC---C Confidence 5666777777777777777666555543 3799999999999999999999999999999999999998743 4 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCC Q lcl|NC_021324. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 377 ~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) +.+++++|++++|.|+.+.++++++++ +++|+||++.++|+++|+.++++++.+++++.... .....+..++ T Consensus 380 ~~~i~i~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~-~~~~~~~~~~ 451 (451) T protein:vir:10 380 YKKIQQTYTRNMMSNDLEDADIATKSV----GIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASK-VSDDYNNFTE 451 (451) T ss_pred ccceeEEecCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH-HHhhcCCCCC Confidence 678999999999999999999999884 46999999999999998877766655555443332 2222222222 No 48 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=4.9e-78 Score=444.42 Aligned_cols=433 Identities=11% Similarity=0.096 Sum_probs=334.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) =-+..++|.+|+++|..+.+|++++.+||+|+|+|.+..... ....+++|+++||+++||++.++|++++|+++ T Consensus 24 ~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~p~~~ 103 (478) T protein:vir:10 24 YETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPVTF 103 (478) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhcccCcee Confidence 124477999999999999999999999999999986543221 23345789999999999999999999999875 Q ss_pred -CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 -SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 -~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) +++++..+.|+++++ |+++.++.++++.++++|+||++||. |++|++++++++|.+++|+||+.....+.++ T Consensus 104 ~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~~ 176 (478) T protein:vir:10 104 GVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYV------DEEGEFKTFRVPAEQAVPIWTNKERDELQAF 176 (478) T ss_pred ecCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 455667788888885 89999999999999999999999996 4578999999999999999998877889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc----------ccccccccccccccccceEEeecccccCccCCCcc Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~ 223 (480) +++|...+ ..++++|+++++++|...++... .........+|+||+||||+|+|+ ++|.|+ T Consensus 177 ir~~~~~~----~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd 247 (478) T protein:vir:10 177 IRVYELDG----AERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNN-----PQEVSD 247 (478) T ss_pred EEEEeeeC----ceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccC-----CCCCCc Confidence 99886533 24689999999999887654321 111224456899999999999997 468999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec---CCCCceEEEec--cccH Q lcl|NC_021324. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL---ASEAAKISEFK--AAEL 298 (480) Q Consensus 224 i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~---~g~~~~~~~~~--~~~~ 298 (480) +++ |++|||+||.++|++++.++++++|+++++|++.++..+.. ..+...+++.+ +|+++++.+++ .... T Consensus 248 ~e~-v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 322 (478) T protein:vir:10 248 LFM-YKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFM----HNLKYYKAISVAGESGSGVDTIKVEVPIDSV 322 (478) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchh----hhhhhCceeEecCCCCCcceEEeecCCHHHH Confidence 975 99999999999999999999999999999998776543321 12233334333 34567776654 3456 Q ss_pred HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021324. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 378 (480) ++++++|+..++.++.++++.+..|++ ++||+||++++.+|.+||..+++.|+++|++++++++++.+.. .+.. T Consensus 323 ~~~~~~l~~~I~~~s~~p~~~~~~~~~----n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~d~~ 396 (478) T protein:vir:10 323 KEYTKMLRDYIIEFGQGVDFQQDKFGN----SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLD--VRVQ 396 (478) T ss_pred HHHHHHHHHHHHHHhCCcCcCcccccc----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccc Confidence 666677776777776666666555543 3799999999999999999999999999999999999998754 4567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) +++|+|.++.|.|.++.+++++++. |++|+||+++++|+++|+.++++++++|+.+.... ......+..+ .+++ T Consensus 397 ~i~i~f~~~~p~~~~e~~~~~~~~~----g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~-~~~~~~~~~d-~~~~ 470 (478) T protein:vir:10 397 DIEITFNFNVMVNELENSQIAMNST----GLLSKETILGNHSWVQDPVAEMERIEQENIELNQQ-LPDIEEGLND-EQQR 470 (478) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHh----CCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh-ccccCCCCcc-cccc Confidence 8999999999999999999998762 36899999999999999888888887776653322 2221111111 1112 Q ss_pred CCCCCCcc Q lcl|NC_021324. 459 TVTETKTE 466 (480) Q Consensus 459 ~~~d~~~~ 466 (480) .+.+...| T Consensus 471 ~~~d~~~e 478 (478) T protein:vir:10 471 QSEDNQSE 478 (478) T ss_pred cCcCCCCC Confidence 22222222 No 49 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1.5e-77 Score=441.72 Aligned_cols=437 Identities=14% Similarity=0.148 Sum_probs=325.7 Q ss_pred CCCHHHHHHHHHHHHH-HHHHHHHHHHHHhhccCcchhcCccc---chhhhcceeccchHHHHHHHHHhhhccCcccc-C Q lcl|NC_021324. 1 MTTYHEHVERLQGLLA-RDLPNLLEAEAYRNGTRRLKTIGIGA---PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-S 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~-~~~~~~~~~~~YY~G~~~i~~~~~~~---~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~ 75 (480) |.+ .+.|++++.+|. .+.+|++++.+||+|+|++....... .....++|+++||+++||++.++|++++++++ + T Consensus 29 ~~~-~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 107 (481) T protein:vir:10 29 LLK-EENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITITH 107 (481) T ss_pred hcC-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhccCCceEec Confidence 344 456888999886 56699999999999998654332221 12344779999999999999999999999864 4 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEE Q lcl|NC_021324. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 76 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ++++..+.|+++|++|+|+.++.+++++++++|+||+++|. +++|++++++++|.+++|+||+....++.++++ T Consensus 108 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~------d~dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~ 181 (481) T protein:vir:10 108 QDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYR------DFEDRDTFKVLDPKSTFVVYDQTLDKKVVAGVR 181 (481) T ss_pred CChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEe------CCCCeEEEEEEcccceEEEEcCCCCCceEEEEE Confidence 56778889999999999999999999999999999999986 457899999999999999999888888999999 Q ss_pred EEEEecC-CceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHH Q lcl|NC_021324. 156 LYTTRDD-VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 156 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~ 234 (480) +|...+. .+.+.++++|+++.+++|...++. ....+..+|+||.||||+|+|+ .+|.|+|++ +++|||+ T Consensus 182 ~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~----~~~~~~~~~~~g~vPvv~~~n~-----~~g~~~~~~-v~~lida 251 (481) T protein:vir:10 182 YFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGT----YHRVEEVEHYYNDVPIIEYLND-----QFKQGDFEN-VIALIDL 251 (481) T ss_pred EEEEeeCCCceEEEEEEEecCeEEEEEecCCc----eeecccccccCCceeEEEeecC-----CCCCCchhh-HHHHHHH Confidence 9876554 445678999999999999876543 2345678999999999999986 468999974 9999999 Q ss_pred HHHHHHHHHHHHHHhcchhhhhcCCCccccccccc---cchhhhhcchheecC--CCCceEEEecc--ccHHHHHHHHHH Q lcl|NC_021324. 235 ASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE---NTTLDIYYGRILTLA--SEAAKISEFKA--AELRNFAEEMEV 307 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~---~~~~~~~~~~~~~~~--g~~~~~~~~~~--~~~~~~~~~l~~ 307 (480) ||+++|++++.++++++|+++++|....+...... ...+........... +.++++.+++. ...+++++.|+. T Consensus 252 ~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 331 (481) T protein:vir:10 252 YDSAQSDTANYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQN 331 (481) T ss_pred HHHHHHHHHHHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHH Confidence 99999999999999999999999864432211111 011111111111112 23445544443 344555555555 Q ss_pred HHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--CcccceeeeEEec Q lcl|NC_021324. 308 FRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE--VTEEYTRLETVWR 385 (480) Q Consensus 308 ~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--~~~~~~~~~v~f~ 385 (480) .++.+ +++|+.+++.. .+++||+||++++.+|.+||+++++.|+.+|++++++++++.+.. ...+..++++.|+ T Consensus 332 ~i~~~---s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f~ 407 (481) T protein:vir:10 332 DIHKY---TNTPDLNDEQF-SGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITFT 407 (481) T ss_pred HHHHH---hCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEeC Confidence 55544 55554444432 234799999999999999999999999999999999999886533 3345678999999 Q ss_pred CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCC-CCCC Q lcl|NC_021324. 386 DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV-TETK 464 (480) Q Consensus 386 ~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~ 464 (480) ++.|+|.++.+++++|++ +++|.||+++++|+++|+.++++++++|+++........ +.+++.++... +|+. T Consensus 408 ~~~~~~~~~~a~~~~kl~----g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~---~~~~~~~~~~~~dd~~ 480 (481) T protein:vir:10 408 PNLPKSMMESINAFNALS----GGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKR---GYGEAFENHLNVDDSN 480 (481) T ss_pred CCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhc---cCCccCCCCCCCCCCC Confidence 999999999999999874 468999999999999998888888777776543322211 11222222111 1111 Q ss_pred c Q lcl|NC_021324. 465 T 465 (480) Q Consensus 465 ~ 465 (480) + T Consensus 481 g 481 (481) T protein:vir:10 481 G 481 (481) T ss_pred C Confidence 1 No 50 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=8.6e-78 Score=443.08 Aligned_cols=431 Identities=13% Similarity=0.075 Sum_probs=332.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc----------cchhhhcceeccchHHHHHHHHHhhhccC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG----------APPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) ..+..++|++++..|..+.+||.++.+||+|+|+|.+.... .+...+++|+++||+++||++.++|++|+ T Consensus 3 ~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~G~ 82 (470) T protein:vir:10 3 LDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVASV 82 (470) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhheecc Confidence 66778899999999999999999999999999998654321 11234578999999999999999999999 Q ss_pred cccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCc Q lcl|NC_021324. 71 GFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 71 ~~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~ 149 (480) |+++ +++++..+.|+++++. +++..+.++++.++++|+||+++|. |++|.+++.+++|.+++|+||+...+. T Consensus 83 p~~~~~~d~~~~~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~------d~~~~~~~~~~~p~~~~~v~d~~~~~~ 155 (470) T protein:vir:10 83 FPDIDVGKDADNKKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWI------DEDGNFRYGIIQPDQITPIYATTLDNK 155 (470) T ss_pred ceeeecCchHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCc Confidence 9875 3566677889999975 6788888999999999999999996 557899999999999999999988889 Q ss_pred eEEEEEEEEEecCCc--eEEEEEEEeCCcEEEEEecCCCccc----------------cccccccccccccccceEEeec Q lcl|NC_021324. 150 VTRAVRLYTTRDDVA--VPDRATLYLPDETVPLRRNGGLNDQ----------------WVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 150 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~g~iPvv~~~n 211 (480) +.+++++|...+..+ ..+++++|+++.+++|...+..... .....+..+|+||+||||+|+| T Consensus 156 ~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 235 (470) T protein:vir:10 156 LLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSK 235 (470) T ss_pred eEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeec Confidence 999999997665433 4568899999999999866542211 1122346689999999999999 Q ss_pred ccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC------C Q lcl|NC_021324. 212 DPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA------S 285 (480) Q Consensus 212 ~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~------g 285 (480) ++ +|.|+++ .+++|||+||.++|++++.+++|++|+++++|+.+++..+... .+...+.+.++ + T Consensus 236 n~-----~g~sd~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~----~~~~~~~i~~~~~~~~~~ 305 (470) T protein:vir:10 236 NK-----YRLPELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMN----DLRKYKSIKINNTGNGDN 305 (470) T ss_pred CC-----CCCCchh-HHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhh----hhhhcCeEeccCCCCCcC Confidence 74 6899996 5999999999999999999999999999999987654333211 12222222221 2 Q ss_pred CCceEEEecc--ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 286 EAAKISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 286 ~~~~~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) .++++.+++. ...+.++++|+..++.++.++++++..| +++||+||++++++|++||+++++.|+++|+++++ T Consensus 306 ~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-----gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (470) T protein:vir:10 306 SGVDKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-----SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (470) T ss_pred ceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3456665543 3456666777777777766666655433 24799999999999999999999999999999999 Q ss_pred HHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 364 IAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDT 443 (480) Q Consensus 364 li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 443 (480) +|+++++.. ..++.+++++|++++|.|+.+.++.++++. +++|.||+++++|+++|+.++++++++|+.++... T Consensus 381 ~i~~~l~~~-~~d~~~i~i~f~~~~p~d~~e~~~~~~~~~----g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~~~- 454 (470) T protein:vir:10 381 AIMRYLNFS-DADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDKEENDPY- 454 (470) T ss_pred HHHHHhccc-CcccceeeEEeccCCCCCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHh- Confidence 999988753 345678999999999999999999998863 47899999999999999888888888776653221 Q ss_pred HhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 444 LYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 444 ~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ... .....+++.+++. T Consensus 455 ~~~--------~~~~~~~~~dde~ 470 (470) T protein:vir:10 455 SNQ--------ADELNGKGVNDEQ 470 (470) T ss_pred hcc--------ccccCCCCCCCCC Confidence 110 0001111111111 No 51 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=9.3e-78 Score=442.90 Aligned_cols=428 Identities=12% Similarity=0.093 Sum_probs=328.6 Q ss_pred CC--CHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc--------------chhhhcceeccchHHHHHHHHH Q lcl|NC_021324. 1 MT--TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA--------------PPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~--~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~--------------~~~~~~~~~~~n~~~~iVd~~~ 64 (480) |+ ...++|.+++.+|..+++++.++.+||+|+|+|.+..... .....++|+++||+++||++.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 54 4488999999999999999999999999999997543221 1123477999999999999999 Q ss_pred hhhccCccccCC-CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEe Q lcl|NC_021324. 65 DRLDIEGFRISE-DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELD 143 (480) Q Consensus 65 ~~l~~~~~~~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d 143 (480) +|++|+|+++.. +++..+.++.+++ |+++.++.++++.++++|+||+++|.+. ++|.+++.+++|.+++|+|| T Consensus 81 ~yl~G~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~-----~~g~~~~~~~~p~~~~~i~d 154 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDA-----SDNSFRYACVDSKEVIPIYS 154 (471) T ss_pred hhhcccCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeC-----CCCeeEEEEEcccceEEEEc Confidence 999999987654 4445666766665 8999999999999999999999999642 36899999999999999999 Q ss_pred CCCCCceEEEEEEEEEec--CCceEEEEEEEeCCcEEEEEecCCCcc----------------ccccccccccccccccc Q lcl|NC_021324. 144 PRNTRRVTRAVRLYTTRD--DVAVPDRATLYLPDETVPLRRNGGLND----------------QWVVDGDVIKHGLGVVP 205 (480) Q Consensus 144 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~g~iP 205 (480) +....++.+++++|...+ +....+++++|+++.+++|...++... ......+..+|+||+|| T Consensus 155 ~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 234 (471) T protein:vir:10 155 KSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVP 234 (471) T ss_pred CCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCcee Confidence 888888999999997654 445678999999999999987654321 12334566789999999 Q ss_pred eEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC- Q lcl|NC_021324. 206 VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA- 284 (480) Q Consensus 206 vv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~- 284 (480) ||+|+|+. .|.|+++ .+++|||+||.++|++++.+++|++|+++++|.+.+...+.. ..+..++++.+. T Consensus 235 vv~~~n~~-----~~~sd~e-~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~----~~~~~~~~i~~~~ 304 (471) T protein:vir:10 235 FIPFKNNE-----IETNDLK-PIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFL----EDLKRYKMIKMDN 304 (471) T ss_pred EEEeccCC-----CCCCchH-HHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhH----HHhhcCCeEEecC Confidence 99999974 5788886 599999999999999999999999999999998655433221 122233333332 Q ss_pred -----CCCceEEEecc--ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 285 -----SEAAKISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGA 357 (480) Q Consensus 285 -----g~~~~~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 357 (480) +.++++.+++. ...+.++++|+..++.++.++++.+..| +++||+||++++.+|.+||..+++.|+++ T Consensus 305 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~-----gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~ 379 (471) T protein:vir:10 305 DGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNPETDKL-----GNSSGVALKFLYSLLELKAGNMETQFRSG 379 (471) T ss_pred CCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc-----cCccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23567766553 3445555555555555555555444333 23799999999999999999999999999 Q ss_pred HHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHH Q lcl|NC_021324. 358 WERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQET 437 (480) Q Consensus 358 l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~ 437 (480) |++++++++++++.. +..+++|.|++++|.|+++.+++++++. |++|.||+++++|+++|+.++++++++|++ T Consensus 380 l~~~~~li~~~~~~~---d~~~i~i~f~~~~p~n~~e~~~~~~kl~----g~iS~et~~~~~p~v~D~~~E~eri~~E~~ 452 (471) T protein:vir:10 380 YATLVKMILKHLGLS---DKLKIKQTWTRNSINNDTEMAQVVSTLA----TITSRENVAKSNPIVEDWQDELRLQKAEQE 452 (471) T ss_pred HHHHHHHHHHHhccC---CCceeEEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHH Confidence 999999999988743 3567899999999999999999998873 369999999999999988888877777665 Q ss_pred HHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 438 EDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 438 ~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.. +..... .+..+++++. T Consensus 453 ~~~-~~~~~~----~~~~~~~e~~ 471 (471) T protein:vir:10 453 GRS-EKLYDM----EEVEHESEVE 471 (471) T ss_pred HHH-hccccc----CCCCCccccC Confidence 432 211221 1112222221 No 52 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=1.8e-77 Score=441.36 Aligned_cols=436 Identities=9% Similarity=0.064 Sum_probs=325.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc------cchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG------APPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~------~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) -.+-.++|++|+.+|..+.+|+.++.+||+|+|+|.+.... .+...+++|+++||+++||++.++||+++|+++ T Consensus 25 ~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~ 104 (474) T protein:vir:95 25 FETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVTY 104 (474) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCcee Confidence 33335689999999999999999999999999998654322 223345789999999999999999999999887 Q ss_pred CCC-chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 SED-SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~~d-~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ..+ ++..+.++.++ .|+++..+.++++.++++|+||++||. |++|++++++++|.+++|+||+.....+.++ T Consensus 105 ~~~d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 177 (474) T protein:vir:95 105 SCEDESVLKIIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYI------NENGEMKLFRVPAEQAIPIWVDKEREELKSF 177 (474) T ss_pred ccCchHHHHHHHHHH-hccHHHHHHHHHHHHhhcCcEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 544 44556666665 478999999999999999999999996 4578999999999999999998877889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccc------cccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQ------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) +++|..... .++++|+++++++|...++.... ........+|+||.||||+|+|+ ++|.|+|++ T Consensus 178 i~~~~~~~~----~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd~e~- 247 (474) T protein:vir:95 178 IRYYKFNNE----EKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSHFSNGNWGRVPFIAFKNN-----PEEVSDIWM- 247 (474) T ss_pred EEEEEEcCe----eEEEEEeCCeEEEEEEcCCccccccccCcccccccccccCCCccceEeecCC-----CCCCCcHHH- Confidence 998865432 46899999999999877643211 11234457899999999999986 468999965 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-CCCCceEEEeccccHHHHHHHHH Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEFKAAELRNFAEEME 306 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~l~ 306 (480) |++|||+||+++|++++.++++++|+++++|+++++..+... .....+++.+ +++++++.+++. +.+.+...++ T Consensus 248 v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~----~~~~~~~i~~~~~~~~~~l~~~~-~~~~~~~~~~ 322 (474) T protein:vir:95 248 YKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMR----GLKYYKAINVDGDGGVETIQVEV-PVSSTKEYID 322 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhh----hhhccceeeccCCCceeEEeecC-CHHHHHHHHH Confidence 999999999999999999999999999999987654332111 2223344433 455677776653 3444555555 Q ss_pred HHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecC Q lcl|NC_021324. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRD 386 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~ 386 (480) .+...|+..+++|..+++.. .+++||+||++++..+..||+.+++.|+++|++++++|+++.|.. .++.+++|+|.+ T Consensus 323 ~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~--~d~~~i~v~f~~ 399 (474) T protein:vir:95 323 LMRAYIMEFGQGVDFQTDKF-GSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLK--MDVKDIEISFNF 399 (474) T ss_pred HHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccceeeEEecc Confidence 55555555555554333322 234799999999999999999999999999999999999998754 456789999999 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +.|.|+++.|+++.+ + +++|+||++.++|+++|+.++++++++|+.+..... .......++..++.+ .. T Consensus 400 ~~p~d~~e~a~~~~~---~--g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~-~~~~~~~~d~~~~~~-----~~ 468 (474) T protein:vir:95 400 NRMMNDAEQSQIIAQ---S--QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQL-PNLDDGGADGAQQQE-----RS 468 (474) T ss_pred CCCcCHHHHHHHHHh---c--CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcc-cccccccCCCCcCCC-----CC Confidence 999999999998765 3 478999999999999988888887777765543322 111111111111111 11 Q ss_pred CCCCCC Q lcl|NC_021324. 467 TQTSPS 472 (480) Q Consensus 467 ~~~~~~ 472 (480) .+.+|. T Consensus 469 ~~~~~~ 474 (474) T protein:vir:95 469 NDKESE 474 (474) T ss_pred ccCCCC Confidence 111111 No 53 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=1.4e-77 Score=441.92 Aligned_cols=423 Identities=16% Similarity=0.126 Sum_probs=314.6 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccCcchhcCcc-cchhhhcceeccchHHHHHHHHHhhhccCccccC----CCchhHHHH Q lcl|NC_021324. 10 RLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG-APPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS----EDSEGLEEL 84 (480) Q Consensus 10 ~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~-~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~----~d~~~~~~l 84 (480) .|...+..+++||+++.+||+|+|++...... ..+...++|+++||+++|||+.++||+++++++. ++++..+.| T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~~l 80 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLSTI 80 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHHHH Confidence 44456667889999999999999998655433 3344568899999999999999999999997753 234456689 Q ss_pred HHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCc Q lcl|NC_021324. 85 WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVA 164 (480) Q Consensus 85 ~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~ 164 (480) +++|+.|+++.++.+++++++++|+||+++|. +++|+|+++.++|.+++|+||+...+++.+++++|...+. T Consensus 81 ~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~------d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~~-- 152 (440) T protein:vir:95 81 KDIEWQNDINALNSDLAFDASVYGRAYEYHFR------DKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYADK-- 152 (440) T ss_pred HHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecCc-- Confidence 99999999999999999999999999999996 4578999999999999999999888889999998865442 Q ss_pred eEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 165 VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQS 244 (480) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~ 244 (480) .++++|+++.+++|...+.... .....+..+|+||.||||+|+|+ ++|.|++++ +++|||+||+++|++++ T Consensus 153 --~~~~vyt~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~vPvv~~~n~-----~~g~sd~e~-v~~lida~~~~~s~~~~ 223 (440) T protein:vir:95 153 --VNMTVYTKDKVITYKPYSNNSV-RLVVDDVKKHSYNDVPVVEWWNN-----RFRMGDYES-EISLIDAYDAGQSDTAN 223 (440) T ss_pred --eEEEEEeCCeEEEEEEecCCcc-ceeecceeeccCceeeEEEeeCC-----CCCCCchhh-hHHHHHHHHHHHHHHHH Confidence 4678999999999987654433 33456778999999999999986 368899965 99999999999999999 Q ss_pred HHHHhcchhhhhcCCCccccccccccchhhhhcchhee----------cCCCCceEEEeccccHHHHHHHHHHHHHHHHh Q lcl|NC_021324. 245 ASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT----------LASEAAKISEFKAAELRNFAEEMEVFRKEAAS 314 (480) Q Consensus 245 ~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~----------~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~ 314 (480) .+++|++|+++++|.........+....++ ...++. .+++++++.+++. +.+.+...++.+...|+. T Consensus 224 ~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~lt~~~-~~~~~~~~~~~l~~~i~~ 300 (440) T protein:vir:95 224 YMSDLNDAMLLVKGDLDGIKLSPEDAAKMK--DANMLFLKTGISTTGQQTTADASYIYKQY-DVNGTEAYKNRLANDIHR 300 (440) T ss_pred HHHHhhcceeeeecccccCCCCccchhhhh--hccceecccccccccCCCCcceeEEeecC-CHHHHHHHHHHHHHHHHH Confidence 999999999999997554333332222221 111111 1234456665553 234444444444444444 Q ss_pred hcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCH Q lcl|NC_021324. 315 ITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTV 392 (480) Q Consensus 315 ~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~~~~~~~~~v~f~~~~~~~~ 392 (480) .+++|+.+++..+ +++||+||++++.+|.+||+++++.|+++|++++++++.+.+. ....+..++++.|+++.|+|+ T Consensus 301 ~s~~p~~~~~~~~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~ 379 (440) T protein:vir:95 301 FSRIPNLDDDRFN-STSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDV 379 (440) T ss_pred HhCCccccccccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCH Confidence 4555544443322 2479999999999999999999999999999999999887653 344556789999999999999 Q ss_pred HHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 393 AAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 393 ~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) ++.|++++|+. +++|.||+++++|+++++ ++++++++|+.+.... +........+...+++ T Consensus 380 ~~~ad~~~kl~----g~iS~et~~~~l~~~d~~-~E~~ri~~E~~~~~~~-~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 380 WTEIKAYIEAG----GEISQETLMENASFTDYK-TEHSRILKQGGSSDLE-IGQIVGDADVGQADTE 440 (440) T ss_pred HHHHHHHHHHh----ccCcHHHHHHhCCCCCcH-HHHHHHHHHHHHhhhh-HHhhccCCCCCCcCCC Confidence 99999999973 368999999999998654 4555555555443322 1111111100011111 No 54 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=1.2e-76 Score=436.86 Aligned_cols=445 Identities=14% Similarity=0.061 Sum_probs=332.1 Q ss_pred CCCHHHHHHHHHHHHH-HHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc-CCCc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLA-RDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~-~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~ 78 (480) +....+.+.+|+.+|. .+.+|++++.+||+|+|+|.+.+....+..+++|+++||+++||++.++|++++++++ ++++ T Consensus 13 ~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g~~~~~~~~d~ 92 (489) T protein:vir:99 13 SKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLGVPVEYKNENK 92 (489) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhccCCceeecCCh Confidence 4433445666777775 4669999999999999999887666656667889999999999999999999999875 4566 Q ss_pred hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 79 EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 79 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) +..+.|+++|+.|+|+.++.++++.++++|+||+++|... ..|.+|++++.+++|.+++|+||+.....+.+++++|. T Consensus 93 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~--~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~ 170 (489) T protein:vir:99 93 DLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEK--IDDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMAVHFYD 170 (489) T ss_pred hHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeecc--CcCCCcceEEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 6788999999999999999999999999999999998643 23568999999999999999999888788999999886 Q ss_pred EecC-CceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDD-VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASR 237 (480) Q Consensus 159 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~ 237 (480) ..+. .+...++++|+++.+++|+...... ......+..+|+||.||||+|+|+. +|.|++. .|++|||+||. T Consensus 171 ~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~-~~~~~~~~~~~~~g~vPvv~~~n~~-----~~~s~~~-~v~~liDa~d~ 243 (489) T protein:vir:99 171 IDYGSGKRKQIIKAYTSDTIYTYEDYNLET-KGMRLKDYEGHFFKGVPVNEYANNE-----ERTGAYE-SVLDNIDAYDL 243 (489) T ss_pred EecCCCceEEEEEEEeCCcEEEEEecCCCc-ccceecccccccCCceeEEEeecCC-----CCCCchh-hhHHHHHHHHH Confidence 5543 4566799999999999998764332 2334566789999999999999863 5788886 59999999999 Q ss_pred HHHHHHHHHHHhcchhhhhcCCCccccccccccchh------------hhhcchheecCC--------CCceEEEec--c Q lcl|NC_021324. 238 TLMNLQSASQILGTPLRVISGVTTDELTNDGENTTL------------DIYYGRILTLAS--------EAAKISEFK--A 295 (480) Q Consensus 238 ~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~------------~~~~~~~~~~~g--------~~~~~~~~~--~ 295 (480) ++|++++.++++++|+++++|............... ....++++..++ .++++.+++ . T Consensus 244 ~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 323 (489) T protein:vir:99 244 SQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDT 323 (489) T ss_pred HHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccceeeeeecCCh Confidence 999999999999999999999765543222111111 111222222211 133444433 3 Q ss_pred ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC---C Q lcl|NC_021324. 296 AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR---E 372 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~ 372 (480) ...+.++++|...++.++.++++++..|++ ++||+||+++++.+.+||..+++.|+.+|++++++++.+.+. . T Consensus 324 ~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~----n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~ 399 (489) T protein:vir:99 324 AGSEAYKNRLVADILRFTFTPDTQDMKFSG----VQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNE 399 (489) T ss_pred HHHHHHHHHHHHHHHHHhCCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCc Confidence 456666677776667666666666555543 379999999999999999999999999999999999887642 1 Q ss_pred --CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCh--hHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021324. 373 --VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTA--TQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 373 --~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~--d~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ....+.+++|.|++++|.|.++.+++++|++ +++|+|++++++|+++ +..++++++++|+++.. ... T Consensus 400 ~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~----giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~-~~~---- 470 (489) T protein:vir:99 400 ATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY----GIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQ-SLP---- 470 (489) T ss_pred cccccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHh-ccc---- Confidence 1223457899999999999999999999874 4689999999999864 44556666666554321 100 Q ss_pred ccCCCcCCCCCCCCCCccCCCCC Q lcl|NC_021324. 449 KAQADATPKPTVTETKTETQTSP 471 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~~ 471 (480) +....+...++....+..| T Consensus 471 ----~~~~~~~~~~~~~~~~~~p 489 (489) T protein:vir:99 471 ----EPRLVGDASGQEEPTAEKP 489 (489) T ss_pred ----cccccCCCCCCcCCCCCCC Confidence 1111111111222222333 No 55 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=4.8e-77 Score=438.97 Aligned_cols=430 Identities=11% Similarity=0.093 Sum_probs=326.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) ..+..++|++++++|..+.+|+.++.+||+|+|+|.+..... .+...++|+++||+++||++.++||+++|+++ T Consensus 24 ~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~ 103 (474) T protein:vir:96 24 YETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTF 103 (474) T ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhhcccCcee Confidence 566688999999999999999999999999999987654322 12245789999999999999999999999876 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.|+++++ |+++..+.++++.++++|+||+++|. |++|+++++.++|.+++|+||+.....+.++ T Consensus 104 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 176 (474) T protein:vir:96 104 SSDDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYI------DENGEFKTFRVPAEQAIPIWTNKERDTLKAF 176 (474) T ss_pred ecCchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEe------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 55667788888886 68999999999999999999999996 4578999999999999999998888889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccc----------ccccccccccccccceEEeecccccCccCCCcc Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQW----------VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~ 223 (480) +++|.... ..++++|+++++++|...++..... .......+|+||+||||+|+|+ ++|.|+ T Consensus 177 vr~~~~~~----~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~nn-----~~g~sd 247 (474) T protein:vir:96 177 IRYYRLDG----AERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQSHYYVGNKRVSWGRVPFIPFKNN-----PQEMSD 247 (474) T ss_pred EEEEeecC----ceEEEEEeCCeEEEEEecCCceeeccccccccccccccccccccCCCceeEEEeccC-----CCCCCc Confidence 99986533 2467999999999988765432111 1223457899999999999997 468999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCC--CceEEEecc--ccHH Q lcl|NC_021324. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASE--AAKISEFKA--AELR 299 (480) Q Consensus 224 i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~~~--~~~~ 299 (480) ++. +++|||+||.++|++++.++++++|+++++|..+++..+. ..++..++++.++++ ++++.+++. ...+ T Consensus 248 ~e~-v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~----~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~ 322 (474) T protein:vir:96 248 LFM-YKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEF----MRNLKYYKAINVDGDGSGVDTIQIEVPVQSSK 322 (474) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccch----hhhhhcCceEEecCCCCceeEEeecCChHHHH Confidence 965 9999999999999999999999999999999876543221 223445666766554 455555443 2344 Q ss_pred HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccee Q lcl|NC_021324. 300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTR 379 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 379 (480) .++++++..++.++.++++.+..|| +++||+||+++++++.+||..+++.|+++|++++++++.+.+.. .+..+ T Consensus 323 ~~~~~l~~~i~~~s~~p~~~~~~~~----~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~--~~~~~ 396 (474) T protein:vir:96 323 EYLDMLRDYVIEFGQGVDFQQDKFG----NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLN--IKVQD 396 (474) T ss_pred HHHHHHHHHHHHHhCCccccccccc----cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccce Confidence 5555555555555544444433333 34799999999999999999999999999999999999998744 45678 Q ss_pred eeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 380 LETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 380 ~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) ++|+|+++.|.|+++.++.+. ++ +++|+||++.++|+++|+.++++++++|+++.. +.+.+.... +... T Consensus 397 i~i~f~~~~p~~~~e~~~~~~---~a--g~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~-~~~~~~~~~-----~~~~ 465 (474) T protein:vir:96 397 VEITFNFNVMVNELEQSQIGV---QS--QYLSKETVVTNHPWVDDPVAELERIEQDNIDFN-KQLPPLEGD-----ANGR 465 (474) T ss_pred eeEEeccCCCcCHHHHHHHHH---hc--CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hcccccccc-----cccc Confidence 999999999999999988654 33 479999999999999888888877777765432 222111111 0111 Q ss_pred CCCCCccCC Q lcl|NC_021324. 460 VTETKTETQ 468 (480) Q Consensus 460 ~~d~~~~~~ 468 (480) ..|...+++ T Consensus 466 ~~d~~~e~~ 474 (474) T protein:vir:96 466 AQDNESETN 474 (474) T ss_pred cCCCcccCC Confidence 111111111 No 56 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=4.5e-77 Score=439.15 Aligned_cols=449 Identities=11% Similarity=0.048 Sum_probs=322.6 Q ss_pred CC---------CHHHHHHHHHHHHH--HHHHHHHHHHHHhhccCcchhcCcc---------cchhhhcceeccchHHHHH Q lcl|NC_021324. 1 MT---------TYHEHVERLQGLLA--RDLPNLLEAEAYRNGTRRLKTIGIG---------APPELAYLDVQPGWVATYL 60 (480) Q Consensus 1 ~~---------~~~~~i~~l~~~~~--~~~~~~~~~~~YY~G~~~i~~~~~~---------~~~~~~~~~~~~n~~~~iV 60 (480) || .+.+++...+..|. .+.+++.++.+||+|+|+|.+.+.. .....+++|+++||+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Iv 80 (537) T protein:vir:78 1 MTSPLLNKPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELV 80 (537) T ss_pred CCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHH Confidence 33 33456655555553 5678999999999999999654322 1222457899999999999 Q ss_pred HHHHhhhccCccccCCCch----hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEccc Q lcl|NC_021324. 61 RTLSDRLDIEGFRISEDSE----GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPL 136 (480) Q Consensus 61 d~~~~~l~~~~~~~~~d~~----~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~ 136 (480) ++.++||+|+|+++..+++ ..+.|++++ .|+++..+.+++++++++|+||+++|. |++|.++++.++|. T Consensus 81 d~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~-~~~~~~~~~el~~~~s~~G~ay~~~y~------de~~~~~~~~i~p~ 153 (537) T protein:vir:78 81 DQLAQYLLSNGVEVKVKDEDNTQLDEILQEYF-DEDFQATIDTLVTNASKKGFEGIFART------TSEGKLKFQTVDGL 153 (537) T ss_pred HHHhhhhcccCceeecCcchhHHHHHHHHHHh-hccHHHHHHHHHHHHhhcCeeEEEeee------cCCCceEEEEEccc Confidence 9999999999988754332 334455544 589999999999999999999999996 45789999999999 Q ss_pred ceEEEEeCCCCCceEEEEEEEEEec------CCceEEEEEEEeCCcEEEEEecCCCcccc-------------------- Q lcl|NC_021324. 137 YMYAELDPRNTRRVTRAVRLYTTRD------DVAVPDRATLYLPDETVPLRRNGGLNDQW-------------------- 190 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------- 190 (480) ++|||||+.. .+.+++++|.... +...++++++||++.+++|...++..... T Consensus 154 ~~~pv~d~~~--~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ 231 (537) T protein:vir:78 154 TLIPVFDDYG--VLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEE 231 (537) T ss_pred eeEEEEcCCC--CceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeeccc Confidence 9999998753 5667777775432 23456799999999999998776532111 Q ss_pred --------ccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcc Q lcl|NC_021324. 191 --------VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTD 262 (480) Q Consensus 191 --------~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~ 262 (480) .......+|+||+||||+|+||. +|.|++. +|++|||+||.++|++++.+++|++|+++++|+.++ T Consensus 232 ~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn~-----~~~sd~e-~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~ 305 (537) T protein:vir:78 232 STDADFEDTDGYQVLGRSYSKFPFQLLYNNK-----DGMSDVK-RVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGD 305 (537) T ss_pred cccccccccccccccccCCcceeEEEeccCc-----cCCCchh-hhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCc Confidence 12234567999999999999974 5788885 599999999999999999999999999999998765 Q ss_pred ccccccccchhhhhcchheecC--CCCceEEEec--cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH Q lcl|NC_021324. 263 ELTNDGENTTLDIYYGRILTLA--SEAAKISEFK--AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~--g~~~~~~~~~--~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~ 338 (480) ...+.. . .+...+++.++ ++++++.+++ ....+.++++|+..|+.++..+++++..+| ++||+||++ T Consensus 306 ~~~~~~--~--~l~~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~g-----n~SGvAlk~ 376 (537) T protein:vir:78 306 STDKLR--Q--NIKAKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDG-----NVTNVVIKS 376 (537) T ss_pred cchhHH--H--HHhhcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCcccccc-----CCcHHHHHH Confidence 432211 1 22233444443 4556776554 456788889999999988888876654322 369999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHH Q lcl|NC_021324. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQAR 416 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~ 416 (480) ++++|++||+.+++.|+++|++++++|+.+.+. ....++..++++|++++|.|+.+.|+.+++++++| ++|+||++ T Consensus 377 ~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~g--iiS~eT~l 454 (537) T protein:vir:78 377 RYTLLAMKARKMETSLRKVLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETE--ALKIGNIM 454 (537) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcC--cchHHHHH Confidence 999999999999999999999999999887642 23456778999999999999999999999887655 78999999 Q ss_pred HhCCCChhHHHHHHHHHHHHHH----HHHHHHhhhcccCCCc--------------------CCCCCCCCCCccCCCCCC Q lcl|NC_021324. 417 IDLGYTATQREQMRDWDKQETE----DMIDTLYSTTKAQADA--------------------TPKPTVTETKTETQTSPS 472 (480) Q Consensus 417 ~~~~~~~d~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~--------------------~~~~~~~d~~~~~~~~~~ 472 (480) +++|+++|+ ++ +++++++.+ ...+.+........+. .++...+||+..+.+.|+ T Consensus 455 ~~~p~vdd~-e~-ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 532 (537) T protein:vir:78 455 TVAPRIGDD-ET-LKLIAEELDLDYNELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPPTDPN 532 (537) T ss_pred HhCCCCCCH-HH-HHHHHHHHHhhhhhhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCCCCCc Confidence 999999885 22 222222211 1111111111111000 111122333334444443 Q ss_pred CCCcccC Q lcl|NC_021324. 473 GFNRTKT 479 (480) Q Consensus 473 ~~~~~~~ 479 (480) .+. +| T Consensus 533 ~~~--~~ 537 (537) T protein:vir:78 533 AVP--QT 537 (537) T ss_pred cCC--CC Confidence 333 23 No 57 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=1.9e-76 Score=435.67 Aligned_cols=431 Identities=12% Similarity=0.087 Sum_probs=323.4 Q ss_pred CCCHHHHHHHHHHHHH--HHHHHHHHHHHHhhccCcchhcCccc--------chhhhcceeccchHHHHHHHHHhhhccC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLA--RDLPNLLEAEAYRNGTRRLKTIGIGA--------PPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~--~~~~~~~~~~~YY~G~~~i~~~~~~~--------~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) +.+..+++ +++..|. .+.++++++.+||+|+|+|++..... +....++|+++||+++||++.++|++++ T Consensus 18 ~~~~~~~~-~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g~ 96 (479) T protein:vir:79 18 KESTINLV-KVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVGN 96 (479) T ss_pred cCChhHHH-HHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhcC Confidence 33333333 3344332 25688999999999999997654322 2234577999999999999999999999 Q ss_pred ccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 71 GFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 71 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) |+++..+++..+.+++.|..|+|+..+.++++.++++|+||++||. |++|++++++++|.+++|+||+....++ T Consensus 97 p~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~v~d~~~~~~~ 170 (479) T protein:vir:79 97 PIVFNADDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYI------NRKGEFKYVIIPAEEAIPIWDSKRQREL 170 (479) T ss_pred CceeccCCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEe------CCCCceEEEEEccceeEEEEeCCCCCce Confidence 9988766666666777788899999999999999999999999986 4578999999999999999998888889 Q ss_pred EEEEEEEEEec-CCceEEEEEEEeCCcEEEEEecCCCccc---------------cccccccccccccccceEEeecccc Q lcl|NC_021324. 151 TRAVRLYTTRD-DVAVPDRATLYLPDETVPLRRNGGLNDQ---------------WVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) Q Consensus 151 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~g~iPvv~~~n~~~ 214 (480) .+++++|...+ +.+..+++++|+++.+++|...++.... .....+..+|+||.||||+|+|++ T Consensus 171 ~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~- 249 (479) T protein:vir:79 171 VAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNE- 249 (479) T ss_pred EEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCCC- Confidence 99999987654 4446779999999999999876543221 112345678999999999999874 Q ss_pred cCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec-CCCCceEEEe Q lcl|NC_021324. 215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAKISEF 293 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~~~~~ 293 (480) +|.|+++ .|++|||+||.++|++++.++++++|+++++|.+.....+... ....++++.+ +++++++.++ T Consensus 250 ----~g~sd~~-~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~----~~~~~~~i~~~~~~~~~~l~~ 320 (479) T protein:vir:79 250 ----KCVSDLT-FYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFID----NIRYYKSIKVDGGGGVDKLEI 320 (479) T ss_pred ----CCCcchh-hhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchh----hhhhccceecCCCCcceEEec Confidence 6889996 5999999999999999999999999999999976544332211 2223344443 4556778776 Q ss_pred ccc--cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_021324. 294 KAA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR 371 (480) Q Consensus 294 ~~~--~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 371 (480) +.+ ..+++++.|+..++.++ ++|...+++. +++||+||++++++|.+||..+++.|+.+|++++++++.+.+. T Consensus 321 ~~~~~~~~~~~~~l~~~i~~~s---~~p~~~~~~~--gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 395 (479) T protein:vir:79 321 NIPVEAKKELLDRLEKNIIIFG---QGVNPESQNT--GDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKI 395 (479) T ss_pred cCCHHHHHHHHHHHHHHHHHHh---Cccccccccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 543 34455555555555555 4444334322 3479999999999999999999999999999999999887653 Q ss_pred --CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_021324. 372 --EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 372 --~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) ....+..+++|.|++++|.|+++.|+++++++ |++|.||+++++|+++|+.++++++++|+.++........ T Consensus 396 ~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl~----g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~-- 469 (479) T protein:vir:79 396 SGNKSYDYKTVQITFNHSMIINEAEKIDMAAKST----GIVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIP-- 469 (479) T ss_pred cCCCccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccC-- Confidence 34455678999999999999999999999874 4689999999999999888888877777654332111110 Q ss_pred cCCCcCCCCCCCCC Q lcl|NC_021324. 450 AQADATPKPTVTET 463 (480) Q Consensus 450 ~~~~~~~~~~~~d~ 463 (480) ...++..+|. T Consensus 470 ----~~~~~~~~e~ 479 (479) T protein:vir:79 470 ----NNQDGVIDET 479 (479) T ss_pred ----cccCCCcCcC Confidence 0111111111 No 58 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=3.1e-76 Score=434.56 Aligned_cols=423 Identities=12% Similarity=0.090 Sum_probs=325.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc------chhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------PPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-.-.++|.+|+.+|..+.+|+.++.+||+|+|+|.+..... ++..+++|+++||++.||++.++|++++++++ T Consensus 24 ~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~ 103 (468) T protein:vir:96 24 YETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPVTY 103 (468) T ss_pred ccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhccCCcee Confidence 223366889999999999999999999999999986653322 22335789999999999999999999999875 Q ss_pred C-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 75 S-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 75 ~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ++++..+.|+++|+ |+++..+.++++.++++|+||++||. |++|.+++..++|.+++|+|++.....+.++ T Consensus 104 ~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ 176 (468) T protein:vir:96 104 GTEDEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYV------DEQGEFKTFRVPAEQAIPIWTNKERDELKAF 176 (468) T ss_pred ccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEE------cCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 4 55667788999986 78999999999999999999999996 4578999999999999999998877889999 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc----------ccccccccccccccccceEEeecccccCccCCCcc Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE 223 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~ 223 (480) +++|..... .++++|+++++++|+..++... .........+|+||+||||+|+|+ ++|.|+ T Consensus 177 ir~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~-----~~g~sd 247 (468) T protein:vir:96 177 IRLYELDGG----ERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWNRVPFIPFKNN-----PQEVSD 247 (468) T ss_pred EEEEEecCc----eEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCCcccEEEecCC-----CCCCCc Confidence 998865332 4679999999999887654211 112234567899999999999987 458999 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecC---CCCceEEEecc--ccH Q lcl|NC_021324. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLA---SEAAKISEFKA--AEL 298 (480) Q Consensus 224 i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~---g~~~~~~~~~~--~~~ 298 (480) +++ +++|||+||.++|++++.++++++|+++++|+.+++.... ...+..++++.++ ++++++.+++. ... T Consensus 248 ~e~-v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~----~~~~~~~~~i~~~~d~~~~~~~l~~~~~~~~~ 322 (468) T protein:vir:96 248 LFM-YKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEF----MYNLKYYKAINVDGDGSGGVDTIQIDVPVQSA 322 (468) T ss_pred hHH-HHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchh----hhhhhcCceEEecCCCCCcceEEeecCChHHH Confidence 975 9999999999999999999999999999999876532221 2233445555443 33456665543 345 Q ss_pred HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021324. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 378 (480) +.+++.|+..++.++.++++.+..|+ +++||+||+++++++.+||+.+++.|+++|++++++++++.|.. .+.. T Consensus 323 ~~~~~~l~~~I~~~s~~p~~~~~~~~----~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~--~d~~ 396 (468) T protein:vir:96 323 KEYLDMLRDYVIEFGQGVDFQQDKFG----NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLS--IKVQ 396 (468) T ss_pred HHHHHHHHHHHHHHhCcccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--cccc Confidence 55666666666666655555544444 34799999999999999999999999999999999999998754 4567 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) ++.|+|.+++|.|..+.|+++.+ + |++|+||+++++|+++|+.++++++++|+++..... ....+ T Consensus 397 ~i~i~f~~~~p~d~~e~a~~~~~---~--g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~--~~~~~-------- 461 (468) T protein:vir:96 397 DVEITFNFNVMVNELEQSQIGVN---S--QYLSKETVVTNHPWVDDPVAEMERIDQEELALPSIE--EGLNG-------- 461 (468) T ss_pred eeeEEecCCCCcCHHHHHHHHHh---c--CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHh--hccCC-------- Confidence 89999999999999999987654 3 578999999999999988888887777765433211 11111 Q ss_pred CCCCCCccCCCCCC Q lcl|NC_021324. 459 TVTETKTETQTSPS 472 (480) Q Consensus 459 ~~~d~~~~~~~~~~ 472 (480) +.+.+|+ T Consensus 462 -------~~~~~~~ 468 (468) T protein:vir:96 462 -------KENNEPT 468 (468) T ss_pred -------CCCCCCC Confidence 1111111 No 59 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=1.3e-75 Score=431.13 Aligned_cols=443 Identities=13% Similarity=0.071 Sum_probs=323.0 Q ss_pred CCCHHH----HHHHHHHHHHH-HHHHHHHHHHHhhccCcchhcCc-c-cchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHE----HVERLQGLLAR-DLPNLLEAEAYRNGTRRLKTIGI-G-APPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~----~i~~l~~~~~~-~~~~~~~~~~YY~G~~~i~~~~~-~-~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) ++++++ .|.+|+++|.. +++|++++.+||+|+|++..... . .....+++|+++||+++||++.++||+|+|++ T Consensus 16 ~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~ 95 (506) T protein:vir:94 16 QESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPIN 95 (506) T ss_pred ccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcccCce Confidence 544433 35668888755 67899999999999997643222 1 22334588999999999999999999999987 Q ss_pred c-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 74 I-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 74 ~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) + +++++..+.|+++|+.|+++.++.++++.++++|+||++||. +++|++++.++||.+++|+||+.....+.+ T Consensus 96 ~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~------ded~~~~i~~~~p~~~~~v~dd~~~~~~~~ 169 (506) T protein:vir:94 96 VKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYR------GEDNEEHLAKLDPLDTFVIYSTDVDPKPIM 169 (506) T ss_pred eecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEe------cCCCeeEEEEEcccceEEEecCCCCCceEE Confidence 5 445667789999999999999999999999999999999996 457899999999999999999888788999 Q ss_pred EEEEEEEecCCc-----eEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHH Q lcl|NC_021324. 153 AVRLYTTRDDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPE 227 (480) Q Consensus 153 ~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~ 227 (480) ++++|...+..+ ...+.++|+++.+++|....... ......+|+||.||||+|+|++ .|.|++. . T Consensus 170 ~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~----~~~~~~~~~~g~vPvv~~~n~~-----~~~sd~e-~ 239 (506) T protein:vir:94 170 AVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMG----KMQVDTTKPITTFPVVEFKNSN-----FRLGDFE-N 239 (506) T ss_pred EEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCcc----ceeccccccCCccceEEecCCC-----CCCCchh-h Confidence 999987554322 34577889999998887654321 2344578999999999999975 4677775 5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc-----------cc---------chhhhhcchheecC--- Q lcl|NC_021324. 228 LRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG-----------EN---------TTLDIYYGRILTLA--- 284 (480) Q Consensus 228 v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~-----------~~---------~~~~~~~~~~~~~~--- 284 (480) +++|||+||.++|++++.++++++|+++++|.......... .. .......++++.++ T Consensus 240 ~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (506) T protein:vir:94 240 VLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDANMLLLKSGM 319 (506) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhcCeeeecccc Confidence 99999999999999999999999999999997543221100 00 00111222333322 Q ss_pred -------CCCceEEEec--cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 285 -------SEAAKISEFK--AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 285 -------g~~~~~~~~~--~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) +.++++.+++ ....++++++|...++.++.+|++++..|++ ++||+||++++.++.+||.++++.|+ T Consensus 320 ~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~----n~Sg~Aik~~~~~l~~k~~~k~~~~~ 395 (506) T protein:vir:94 320 TVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFAS----NSSGVAMQYKVLGTVELASTKRRMFE 395 (506) T ss_pred cccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccc----cchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1234444443 3445566666666666666666655555543 47999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhC---CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHH Q lcl|NC_021324. 356 GAWERAMRIAMQIMG---REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDW 432 (480) Q Consensus 356 ~~l~~~~~li~~~~~---~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~ 432 (480) ++|++++++++++.+ .....++.+++|.|+++.|.|+++.|++++|++ |++|+||+++++|+++|+.++++++ T Consensus 396 ~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~lp~v~d~~~E~~ri 471 (506) T protein:vir:94 396 RGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG----ATLPQKYLYQQLPGVTNPQDIVDMM 471 (506) T ss_pred HHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHH Confidence 999999999988754 334556778999999999999999999999873 4799999999999999988888877 Q ss_pred HHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCC Q lcl|NC_021324. 433 DKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQT 469 (480) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) ++|+.+.... .......+...+..+..+...+.=+ T Consensus 472 ~~E~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~e~~ 506 (506) T protein:vir:94 472 KEQSANGDYS--FDQNGVISNDGQTNTTATQTDEEVR 506 (506) T ss_pred HHHHHHHhhc--chhhcCCCcccCccccccccccCCC Confidence 7776543211 1111111111111111111111111 No 60 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=8.4e-75 Score=426.68 Aligned_cols=429 Identities=12% Similarity=0.028 Sum_probs=319.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc---hhcCc-------------ccchhhhcceeccchHHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRL---KTIGI-------------GAPPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i---~~~~~-------------~~~~~~~~~~~~~n~~~~iVd~~~ 64 (480) ..| .++|+++++.|...+.|+.++.+||+|.+.. ..... ...+..+++|+++||+++||++.+ T Consensus 14 ~~~-~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 92 (474) T protein:vir:10 14 GIL-PKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRV 92 (474) T ss_pred CCC-HHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHh Confidence 223 3679999999999999999999999996542 11110 111223467999999999999999 Q ss_pred hhhccCccccCC------CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccce Q lcl|NC_021324. 65 DRLDIEGFRISE------DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) Q Consensus 65 ~~l~~~~~~~~~------d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~ 138 (480) +|++++|+++.. +++..+.|+++|+.|+++.++.+++++++++|+||+++|. |++|++++++++|.++ T Consensus 93 ~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~------d~~~~~~~~~i~p~~~ 166 (474) T protein:vir:10 93 GYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYI------DTNGDIRIKNIDPYNV 166 (474) T ss_pred hheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------CCCCeeEEEEEcccce Confidence 999999987532 3445678999999999999999999999999999999986 4588999999999999 Q ss_pred EEEEeCCCCCceEEEEEEEEEecCC--ceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 139 YAELDPRNTRRVTRAVRLYTTRDDV--AVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) +|+||+.. .+.+++++|...++. ...+++++|++..++.|...+.. .....+..+|+||.||||+|+|++ T Consensus 167 ~~v~d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~---~~~~~~~~~~~~g~vPvv~~~n~~--- 238 (474) T protein:vir:10 167 IFVGDNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGID---ALQEVGRYEHLFDYNPLFGVPNNK--- 238 (474) T ss_pred EEEEcCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCC---cccccccccCCCCccceEEecCCC--- Confidence 99998644 467888888766533 34568899999999998866432 234556778999999999999874 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec--CCCCceEEEec Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL--ASEAAKISEFK 294 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~--~g~~~~~~~~~ 294 (480) +|.|++++ |++|||+||.++|++++.++++++|+++++|+..++.. .. +...++++.+ +++++++.+++ T Consensus 239 --~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~----~~--~~~~~~~i~~~~~~~~~~~l~~~ 309 (474) T protein:vir:10 239 --EMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEM----IQ--ETQKSGAFELFDKDMDVKYLTKD 309 (474) T ss_pred --CCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchh----hh--hhhhcceeEecCCCCceeEEecc Confidence 58899965 99999999999999999999999999999998665321 11 1222333333 34456665554 Q ss_pred c--ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Q lcl|NC_021324. 295 A--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR- 371 (480) Q Consensus 295 ~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~- 371 (480) . ...+.+++.|+..++.++.++++.+..|+ +++||+||++++++|.+||..+++.|+.+|++++++++++++. T Consensus 310 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~----~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~ 385 (474) T protein:vir:10 310 VNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN----GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRK 385 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccccc----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 3 23445555555555555554444433343 2479999999999999999999999999999999999887642 Q ss_pred ---CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021324. 372 ---EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 372 ---~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ....++.++++.|.++.|.|.++.|++++|+. |++|+||+++++|+++|+.++++++++|+++. .+.+.... T Consensus 386 ~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~-~~~~~~~~ 460 (474) T protein:vir:10 386 GYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK----GQVSERTRLGQSQLVDDVDYELDEMEKESLEF-NDKLPDID 460 (474) T ss_pred cCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-Hhhccccc Confidence 22345678999999999999999999999874 46899999999999999888888777766542 23222222 Q ss_pred ccCCCcCCCCCCCC Q lcl|NC_021324. 449 KAQADATPKPTVTE 462 (480) Q Consensus 449 ~~~~~~~~~~~~~d 462 (480) .+..+..+++...| T Consensus 461 ~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 461 EGDANDKSQNNQSE 474 (474) T ss_pred CCCcCCCCccccCC Confidence 21111111111111 No 61 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=8.4e-75 Score=426.68 Aligned_cols=429 Identities=12% Similarity=0.028 Sum_probs=319.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc---hhcCc-------------ccchhhhcceeccchHHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRL---KTIGI-------------GAPPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i---~~~~~-------------~~~~~~~~~~~~~n~~~~iVd~~~ 64 (480) ..| .++|+++++.|...+.|+.++.+||+|.+.. ..... ...+..+++|+++||+++||++.+ T Consensus 14 ~~~-~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~ 92 (474) T protein:vir:94 14 GIL-PKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDTRV 92 (474) T ss_pred CCC-HHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHhHh Confidence 223 3679999999999999999999999996542 11110 111223467999999999999999 Q ss_pred hhhccCccccCC------CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccce Q lcl|NC_021324. 65 DRLDIEGFRISE------DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) Q Consensus 65 ~~l~~~~~~~~~------d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~ 138 (480) +|++++|+++.. +++..+.|+++|+.|+++.++.+++++++++|+||+++|. |++|++++++++|.++ T Consensus 93 ~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~------d~~~~~~~~~i~p~~~ 166 (474) T protein:vir:94 93 GYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYI------DTNGDIRIKNIDPYNV 166 (474) T ss_pred hheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe------CCCCeeEEEEEcccce Confidence 999999987532 3445678999999999999999999999999999999986 4588999999999999 Q ss_pred EEEEeCCCCCceEEEEEEEEEecCC--ceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 139 YAELDPRNTRRVTRAVRLYTTRDDV--AVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) +|+||+.. .+.+++++|...++. ...+++++|++..++.|...+.. .....+..+|+||.||||+|+|++ T Consensus 167 ~~v~d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~---~~~~~~~~~~~~g~vPvv~~~n~~--- 238 (474) T protein:vir:94 167 IFVGDNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGID---ALQEVGRYEHLFDYNPLFGVPNNK--- 238 (474) T ss_pred EEEEcCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCC---cccccccccCCCCccceEEecCCC--- Confidence 99998644 467888888766533 34568899999999998866432 234556778999999999999874 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec--CCCCceEEEec Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL--ASEAAKISEFK 294 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~--~g~~~~~~~~~ 294 (480) +|.|++++ |++|||+||.++|++++.++++++|+++++|+..++.. .. +...++++.+ +++++++.+++ T Consensus 239 --~g~sd~e~-v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~~----~~--~~~~~~~i~~~~~~~~~~~l~~~ 309 (474) T protein:vir:94 239 --EMIGDAEK-VIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEEM----IQ--ETQKSGAFELFDKDMDVKYLTKD 309 (474) T ss_pred --CCCCchHH-HHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCchh----hh--hhhhcceeEecCCCCceeEEecc Confidence 58899965 99999999999999999999999999999998665321 11 1222333333 34456665554 Q ss_pred c--ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC- Q lcl|NC_021324. 295 A--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR- 371 (480) Q Consensus 295 ~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~- 371 (480) . ...+.+++.|+..++.++.++++.+..|+ +++||+||++++++|.+||..+++.|+.+|++++++++++++. T Consensus 310 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~----~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~ 385 (474) T protein:vir:94 310 VNDTMIENHLDRIEKNIMRFAKSVNFNSDEFN----GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRK 385 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCccccccccc----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 3 23445555555555555554444433343 2479999999999999999999999999999999999887642 Q ss_pred ---CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021324. 372 ---EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 372 ---~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ....++.++++.|.++.|.|.++.|++++|+. |++|+||+++++|+++|+.++++++++|+++. .+.+.... T Consensus 386 ~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~-~~~~~~~~ 460 (474) T protein:vir:94 386 GYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK----GQVSERTRLGQSQLVDDVDYELDEMEKESLEF-NDKLPDID 460 (474) T ss_pred cCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh----ccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-Hhhccccc Confidence 22345678999999999999999999999874 46899999999999999888888777766542 23222222 Q ss_pred ccCCCcCCCCCCCC Q lcl|NC_021324. 449 KAQADATPKPTVTE 462 (480) Q Consensus 449 ~~~~~~~~~~~~~d 462 (480) .+..+..+++...| T Consensus 461 ~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 461 EGDANDKSQNNQSE 474 (474) T ss_pred CCCcCCCCccccCC Confidence 21111111111111 No 62 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=7.2e-55 Score=317.42 Aligned_cols=438 Identities=11% Similarity=0.054 Sum_probs=291.1 Q ss_pred CCC-HHHHHHHHHHH------------------HHHHHHHHHHHHHHhhccCcchhcCcc--cchhhhcceeccchHHHH Q lcl|NC_021324. 1 MTT-YHEHVERLQGL------------------LARDLPNLLEAEAYRNGTRRLKTIGIG--APPELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~-~~~~i~~l~~~------------------~~~~~~~~~~~~~YY~G~~~i~~~~~~--~~~~~~~~~~~~n~~~~i 59 (480) |.+ +..+++..+++ +.....|+.+..+||+|+|++.+.... ..+...++++++|||+.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i 80 (496) T protein:vir:38 1 MINQIIAGVKGVMRRMGLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (496) T ss_pred ChhHHHHHHHHHHHHhccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHH Confidence 222 12222222222 223446788899999999987443221 123334667889999999 Q ss_pred HHHHHhhhccCccccC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccce Q lcl|NC_021324. 60 LRTLSDRLDIEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) Q Consensus 60 Vd~~~~~l~~~~~~~~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~ 138 (480) |+.+++|+++++.++. +++...+.|+++++.|+|...+.+++..++.+|.+|+++|. |.+|.+++.+++|+++ T Consensus 81 ~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~------D~~~~~~i~~v~~~~~ 154 (496) T protein:vir:38 81 AKYMSKLLFNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYH------DGNKNVKVSFATADCM 154 (496) T ss_pred HHHHhhhhhCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEE------cCCCcEEEEEEcccce Confidence 9999999999987654 55667888999999999999999999999999999999996 4578999999999999 Q ss_pred EEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCc-E--EE---EEecCCCcccc-c--------cccccccccccc Q lcl|NC_021324. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDE-T--VP---LRRNGGLNDQW-V--------VDGDVIKHGLGV 203 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~---~~~~~~~~~~~-~--------~~~~~~~~~~g~ 203 (480) +|+|++..+-...+++..| ..+....+.++.|+... . +. |+...+...+. + .......+++++ T Consensus 155 ~P~~~~~~~~~~~~f~~~~--~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~ 232 (496) T protein:vir:38 155 YPLSNDSENVDECVIANSF--HKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTR 232 (496) T ss_pred EEEEecCCcEEEEEEEEEE--EeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCc Confidence 9999775432222333333 23344455666665321 1 11 22222111110 1 012233467788 Q ss_pred cceEEee----cccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhh----hcCCCcccccccccc-chhh Q lcl|NC_021324. 204 VPVVPLT----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----ISGVTTDELTNDGEN-TTLD 274 (480) Q Consensus 204 iPvv~~~----n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~----i~G~~~~~~~~~~~~-~~~~ 274 (480) +|+++|+ |+.....++|.|++. ++++|+|+||.++|++++.++.....+.+ +... .. ..+.. ..+. T Consensus 233 ~~f~~~~~~~~N~~~~~~p~G~Sd~~-~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~-~~---~~g~~~~~~~ 307 (496) T protein:vir:38 233 PTFIYIKPNIANNKNLTSPLGISVYA-NALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTA-VN---LDGSTTQYFD 307 (496) T ss_pred ceEEEecCCcccccccCCcCCCchHh-hHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhcc-CC---CCCccccCCC Confidence 9998875 566778899999996 59999999999999999999864433333 1110 00 01111 1111 Q ss_pred hhcchhee----cCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 275 IYYGRILT----LASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAER 349 (480) Q Consensus 275 ~~~~~~~~----~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~ 349 (480) ........ ..++...+.+++.. ..++|++.++.++++++..+++|++.||.+..+.+||.+++++++.|.+++.. T Consensus 308 ~~~~~~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~ 387 (496) T protein:vir:38 308 STDEAFFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNS 387 (496) T ss_pred CccceEEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHH Confidence 11111111 12222346656543 57899999999999999999999999998877778999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCC-h Q lcl|NC_021324. 350 KGRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYT-A 423 (480) Q Consensus 350 ~~~~~~~~l~~~~~li~~~~~-----~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~-~ 423 (480) +++.|+.+|++++++++.+.. .+...+...+.+.|.+.+|.|..+.++++++++++| ++|.++++..++++ + T Consensus 388 ~~~~~~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--iiS~et~l~~~~~~~d 465 (496) T protein:vir:38 388 HSQLIEQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQG--MIPLKIALQRAWNITE 465 (496) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcC--CCCHHHHHHhcCCCCh Confidence 999999999999998875421 222334456899999999999999999999999876 78999999888655 4 Q ss_pred hHHHH-HHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCC Q lcl|NC_021324. 424 TQREQ-MRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQ 468 (480) Q Consensus 424 d~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) ++.++ ++++++|+.+.. +.+......++.+ T Consensus 466 ~ea~~el~ri~~E~~~~~---------------~~~d~~~~~~~~e 496 (496) T protein:vir:38 466 AEADEWAEMLAKEKQAEM---------------PNNDMNGIFGEEE 496 (496) T ss_pred HHHHHHHHHHHHhhhccC---------------ccccccCCCCCCC Confidence 44433 333333221110 0111011111111 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=1.1e-51 Score=300.02 Aligned_cols=443 Identities=11% Similarity=0.037 Sum_probs=291.7 Q ss_pred CCC-HHHHHHHHHHH------------------HHHHHHHHHHHHHHhhccCcchhcCc--ccchhhhcceeccchHHHH Q lcl|NC_021324. 1 MTT-YHEHVERLQGL------------------LARDLPNLLEAEAYRNGTRRLKTIGI--GAPPELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~-~~~~i~~l~~~------------------~~~~~~~~~~~~~YY~G~~~i~~~~~--~~~~~~~~~~~~~n~~~~i 59 (480) |-+ ...+|+.++++ +.....++.+..+||.|+|+..+... ...+...++++++|+++.| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~i 80 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKVT 80 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHHH Confidence 544 34444444433 22234678888999999987543221 1223344678899999999 Q ss_pred HHHHHhhhccCccccC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccce Q lcl|NC_021324. 60 LRTLSDRLDIEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) Q Consensus 60 Vd~~~~~l~~~~~~~~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~ 138 (480) |+.+++|+++++.++. +++..++.|+++++.|+|...+.+++..|+.+|.+|+++|. |.+|++++..++|.++ T Consensus 81 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~------D~~~~~~i~~v~a~~~ 154 (499) T protein:vir:80 81 AKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYH------DGNKNVKVSFATADCM 154 (499) T ss_pred HHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEE------CCCCcEEEEEEcCCce Confidence 9999999999976543 55667888999999999999999999999999999999996 4478999999999999 Q ss_pred EEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEe--CCcEEE-------EEecCCCcccc-cc--------cccccccc Q lcl|NC_021324. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYL--PDETVP-------LRRNGGLNDQW-VV--------DGDVIKHG 200 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-------~~~~~~~~~~~-~~--------~~~~~~~~ 200 (480) +|+|.+...-...+++..+. .+....+.++.|+ .+.... |+.......+. +. ......++ T Consensus 155 ~Pi~~d~~~~~~~~f~~~~~--~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~ 232 (499) T protein:vir:80 155 YPLSNDSENVDECLIANSFH--KNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPS 232 (499) T ss_pred EEEEecCCCeEEEEEEEEEe--ecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecC Confidence 99986553322222333332 2333444555443 222111 22111111110 10 01122246 Q ss_pred ccccceEEee----cccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc-chhhh Q lcl|NC_021324. 201 LGVVPVVPLT----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN-TTLDI 275 (480) Q Consensus 201 ~g~iPvv~~~----n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~-~~~~~ 275 (480) ++++|+++|+ |+.....|+|.|++.. +++|+|+||+++|++++.++....++.+-..+-.......+.. ..+.. T Consensus 233 ~~~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 311 (499) T protein:vir:80 233 LTRPTFIYIKPNIANNKNLTSPLGISVYAN-ALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYFDS 311 (499) T ss_pred CCccceEeecCCccccccCCCccCCchHhh-HHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcccCCCc Confidence 8899999885 4556678999999975 9999999999999999999876555544111000000000010 11111 Q ss_pred hcchh--e--ecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRI--L--TLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 276 ~~~~~--~--~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~ 350 (480) ..... + ..+++...+.+++. -..++|++.++.++++|...+|+++..||....+..||++++++++.+.+++..+ T Consensus 312 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~ 391 (499) T protein:vir:80 312 TDEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSH 391 (499) T ss_pred ccceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHH Confidence 11111 1 11222234666654 3578899999999999999999999999987777779999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCC-Chh Q lcl|NC_021324. 351 GRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGY-TAT 424 (480) Q Consensus 351 ~~~~~~~l~~~~~li~~~~~-----~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~-~~d 424 (480) ++.|+.+|++++++++.+.. .+...+...+.|.|.+.++.|..+.++.+.+++++| ++|.++++..+++ +++ T Consensus 392 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et~l~~~~~~~d~ 469 (499) T protein:vir:80 392 SQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQG--MIPLKIALQRAWNITEA 469 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHhhcCCCChH Confidence 99999999999998875421 122334567999999999999999999999999886 7899999988755 444 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 425 QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +.+++.+++++|... .. +.+++.+..+..+ T Consensus 470 ea~~el~~i~~E~~~----------~~--~~~d~~g~~ge~e 499 (499) T protein:vir:80 470 EADEWAEMLAKEKQA----------EI--PNNDMTGIFGEEE 499 (499) T ss_pred HHHHHHHHHHHHhhc----------CC--CCCCccccCCCCC Confidence 443333333322110 00 1111111111111 No 64 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=7e-48 Score=279.10 Aligned_cols=440 Identities=12% Similarity=0.040 Sum_probs=295.3 Q ss_pred CCCHHHHHHHHHHHH------------------HHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLL------------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~------------------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~ 62 (480) .+-++.++.+++.+. .....++++..+||.|+++..+......+...+.++.+|+++.||+. T Consensus 4 ~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i~~~ 83 (505) T protein:vir:79 4 WDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLASAK 83 (505) T ss_pred HHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHHHHH Confidence 455566666654321 12335667778999999876443222233334567778999999999 Q ss_pred HHhhhccCccccC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEE Q lcl|NC_021324. 63 LSDRLDIEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAE 141 (480) Q Consensus 63 ~~~~l~~~~~~~~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v 141 (480) +++++++++.++. +++..++.|+++++.|+|...+.+++..++..|.+++.+|.+ ++++++.+++|++++|+ T Consensus 84 ~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-------~~~~~i~~v~ad~~~P~ 156 (505) T protein:vir:79 84 LASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-------SGKIKLAWATADQVYPL 156 (505) T ss_pred HHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-------CCceEEEEEcCCeeEEE Confidence 9999999875543 456678899999999999999999999999999999999863 47799999999999999 Q ss_pred EeCCCCCceEEEEEEEEEecCCc--eEEEEEEEeCCc---EEE---EEecCCCcccc----------cc-cccccccccc Q lcl|NC_021324. 142 LDPRNTRRVTRAVRLYTTRDDVA--VPDRATLYLPDE---TVP---LRRNGGLNDQW----------VV-DGDVIKHGLG 202 (480) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~---~~~---~~~~~~~~~~~----------~~-~~~~~~~~~g 202 (480) +.+..+....+++..|+..++.+ .++.++.|+.+. .+. |+.......+. .. ..+...++++ T Consensus 157 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~ 236 (505) T protein:vir:79 157 QADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLK 236 (505) T ss_pred EEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCC Confidence 64444433333344444333322 345667776321 112 22222111111 10 1122235677 Q ss_pred ccceEEe----ecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhh----hcCC-Cccccccccccchh Q lcl|NC_021324. 203 VVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----ISGV-TTDELTNDGENTTL 273 (480) Q Consensus 203 ~iPvv~~----~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~----i~G~-~~~~~~~~~~~~~~ 273 (480) ++|+++| +|+.....|+|.|++.. +++++|++|.++|++++.++....++.+ +.-. .............+ T Consensus 237 ~p~f~~~~~~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~~f 315 (505) T protein:vir:79 237 HPLFAFYRNKGANNKNFTSPMGMSLIDN-SYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETHPPMF 315 (505) T ss_pred cceEEEecCCcccccccCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccccccccCC Confidence 8888777 46777788999999975 8999999999999999999875544433 2100 00000001111112 Q ss_pred hhhcchh--eecCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 274 DIYYGRI--LTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 274 ~~~~~~~--~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~ 350 (480) ......+ +..+++++.+..++.. ..+.|++.++.++++|+..+|+++..||....+..+|+++++.++.+.+++..+ T Consensus 316 d~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~ 395 (505) T protein:vir:79 316 DPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSY 395 (505) T ss_pred CccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHH Confidence 2211111 1234445567777653 578999999999999999999999999987777779999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCC-----------CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC Q lcl|NC_021324. 351 GRIFGGAWERAMRIAMQIMGR-----------EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL 419 (480) Q Consensus 351 ~~~~~~~l~~~~~li~~~~~~-----------~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~ 419 (480) ++.|..+|+++++.++.+... ........+.|.|.+.++.|..+.++...+++++| ++|.++++..+ T Consensus 396 ~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~G--i~s~e~~l~~~ 473 (505) T protein:vir:79 396 ITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQ--VMPKKQFLMRN 473 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHHhc Confidence 999999999999988765321 11222356889999999999999999999999987 68999987766 Q ss_pred -CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 420 -GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 420 -~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) |+++++.+++.++.++|... ..+.....++| T Consensus 474 ~~~~eeea~~el~ri~~E~~~------------~~p~~~~~gg~ 505 (505) T protein:vir:79 474 YGLDEEEADEWLAQIDAENST------------AEPEFNQFGGD 505 (505) T ss_pred CCCChHHHHHHHHHHHHhccc------------cCCCchhccCC Confidence 56666655444444333210 01111122222 No 65 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=1.5e-47 Score=277.23 Aligned_cols=440 Identities=10% Similarity=0.055 Sum_probs=294.2 Q ss_pred CCCHHHHHHHHHHHH------------------HHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLL------------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~------------------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~ 62 (480) ++.++.++.+.+.++ .....|+.+..+||+|+|+............+..++.+|+++.||+. T Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i~~~ 83 (508) T protein:vir:15 4 IQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTAARR 83 (508) T ss_pred HHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHHHHH Confidence 444556655544332 12346788889999999876433222233334557789999999999 Q ss_pred HHhhhccCc--cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEE Q lcl|NC_021324. 63 LSDRLDIEG--FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA 140 (480) Q Consensus 63 ~~~~l~~~~--~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~ 140 (480) +++++++++ +++++++..++.|+++++.|+|...+.+++..++..|.+++.+|.+ ++.++|.+++|++.+| T Consensus 84 ~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~i~~v~ad~~~P 156 (508) T protein:vir:15 84 IASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYID-------GNHIKIAWVRADQFYP 156 (508) T ss_pred HHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEe-------CCeeEEEEEcCCeeEE Confidence 999999886 5566666677889999999999999999999999999999999863 4678999999999999 Q ss_pred EEeCCCCCceEEEEEEEEEecC--CceEEEEEEEe--CC--cEEE---EEecCCCcccccc-----------cccccccc Q lcl|NC_021324. 141 ELDPRNTRRVTRAVRLYTTRDD--VAVPDRATLYL--PD--ETVP---LRRNGGLNDQWVV-----------DGDVIKHG 200 (480) Q Consensus 141 v~d~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~--~~--~~~~---~~~~~~~~~~~~~-----------~~~~~~~~ 200 (480) +..+..+..-.++++.+...++ ...++.++.|+ .+ ..+. |+..+....+... ......++ T Consensus 157 ~~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g 236 (508) T protein:vir:15 157 LQSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISG 236 (508) T ss_pred EEEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecC Confidence 7533333222223333333332 23445566665 21 2222 3222211111100 11223367 Q ss_pred ccccceEEe----ecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhh---hcCCCccccccccccchh Q lcl|NC_021324. 201 LGVVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV---ISGVTTDELTNDGENTTL 273 (480) Q Consensus 201 ~g~iPvv~~----~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~---i~G~~~~~~~~~~~~~~~ 273 (480) +.++|+++| +|+....+|+|.|+++. +++++|++|.++|++++.++.....+.+ ++..+ ..+...+ T Consensus 237 ~~~p~f~y~~~~~~N~~~~~splG~S~~~~-~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d------~~~~~~~ 309 (508) T protein:vir:15 237 LQRPLFAYFKTPGANNINIESPLGLGVVDN-AKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFD------DEHKPTF 309 (508) T ss_pred CCcceeEEecCCccccccCCCCcCCchHhh-hHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCC------CCCcccc Confidence 888888887 46677788999999975 8999999999999999999754444444 22211 1122223 Q ss_pred hhhcchheec---CCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 274 DIYYGRILTL---ASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAER 349 (480) Q Consensus 274 ~~~~~~~~~~---~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~ 349 (480) ......+..+ ++.+..+.+++.. ..+.|.+.++.+++.|...+|+++..||....+..||+++++.++.+.+++.. T Consensus 310 ~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~ 389 (508) T protein:vir:15 310 DTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSS 389 (508) T ss_pred CCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHH Confidence 2222112212 1223446666644 67889999999999999999999999998877778999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhC------CC-------CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHH Q lcl|NC_021324. 350 KGRIFGGAWERAMRIAMQIMG------RE-------VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQAR 416 (480) Q Consensus 350 ~~~~~~~~l~~~~~li~~~~~------~~-------~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~ 416 (480) +++.|+.+|++++++|+.+.. .+ ......++.|.|.+.+++|..+++...++++++| ++|+++++ T Consensus 390 ~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aG--i~s~e~~i 467 (508) T protein:vir:15 390 YLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIG--ALSKQTFL 467 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcC--CCCHHHHH Confidence 999999999999998876532 11 1122356889999999999999999999999887 68999987 Q ss_pred HhC-CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCC Q lcl|NC_021324. 417 IDL-GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTS 470 (480) Q Consensus 417 ~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) ..+ |+++++.+++.++++++... ....+... .+.++.+++ T Consensus 468 ~~~~g~~deea~~el~ri~~E~~~---------~~~~~~~~-----~~~~g~~ge 508 (508) T protein:vir:15 468 QRNYGMTDEQAAEELAKIQSEAPT---------DTFEGGRS-----AILNGGDGE 508 (508) T ss_pred HhcCCCChHHHHHHHHHHHHhccc---------cCcccccc-----ccCCCCCCC Confidence 665 67776665544444443210 00000001 111111111 No 66 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=2.3e-48 Score=281.77 Aligned_cols=455 Identities=12% Similarity=0.111 Sum_probs=311.0 Q ss_pred CCCHHHHHHHHHHHH-HHHHHHHHHHHHHhhccCc-chhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---C Q lcl|NC_021324. 1 MTTYHEHVERLQGLL-ARDLPNLLEAEAYRNGTRR-LKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---S 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-~~~~~~~~~~~~YY~G~~~-i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~ 75 (480) |... ...| ..|+.+|+.|.+||.+.+. +...-+ -.+......+.++..++||...- ++.+.|... . T Consensus 21 ~p~~-------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lr-g~~~~~~r~~~~ps~~~~~~~~~-~~~~~g~~~~~~~ 91 (527) T protein:vir:10 21 FPNA-------VTDFDKARLASYRLYEDMYLTNTSDYQVILR-GGDEGDQRPIYVPNGEKLIEAKM-RFLGQGLKWEFSK 91 (527) T ss_pred Cccc-------CCHHHHHHHHHHHHHHHHhcCchhheeeecC-CccccccceeeehhhHHhhCCcc-eeeccCccccccc Confidence 3221 3333 4677899999999999753 221111 11111245678899999999654 445666543 3 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE-- Q lcl|NC_021324. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA-- 153 (480) Q Consensus 76 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~-- 153 (480) .+++....|..++++|++..++.+..+++.+-|++-+++-.+.. ..+.++++++.+||.+.||+.|+...+.+..+ T Consensus 92 ~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~--k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~ 169 (527) T protein:vir:10 92 KDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDE--KDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYL 169 (527) T ss_pred hhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccC--CCcCCCceEeecCcceeeeeecCCCCCceeeEEE Confidence 45667778889999999999999999999999998555432210 12346799999999999999988776665554 Q ss_pred EEEEEEecCCce--E-------------------EEEEEEeCCcEE--EEEecCCC--cc------cccccccccccccc Q lcl|NC_021324. 154 VRLYTTRDDVAV--P-------------------DRATLYLPDETV--PLRRNGGL--ND------QWVVDGDVIKHGLG 202 (480) Q Consensus 154 ~~~~~~~~~~~~--~-------------------~~~~~~~~~~~~--~~~~~~~~--~~------~~~~~~~~~~~~~g 202 (480) +.-|...++.+. + ...++||...+. .|.-.... .. ....+.+..|++++ T Consensus 170 ~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~ 249 (527) T protein:vir:10 170 VDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQIT 249 (527) T ss_pred eeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCCC Confidence 222433333221 0 011222211110 00000000 00 11123456789999 Q ss_pred ccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhee Q lcl|NC_021324. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT 282 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~ 282 (480) +||||+|+|.+..++.||+|+|+ ++++++|++|+++|+.+.++.+.+.|+.++.|..+.+ ..+....+.+.++.+|. T Consensus 250 fiPvV~~~t~p~~~~~WG~S~La-~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd--~~G~~~~~~VgPG~iwe 326 (527) T protein:vir:10 250 TLPVFHFRGHPIMNAMFGRSGLA-GLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRD--SRGNMVPWTISPLGMVE 326 (527) T ss_pred ccceEeecCCCccccccChhhHh-HHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccccc--ccCCcCccccCCceeEe Confidence 99999999999999999999997 5999999999999999999999999999999987654 23444567889999998 Q ss_pred cCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhcc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 LASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 283 ~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) ++ ++++|..++. .+++.|.+++..+...|+.++++|...||. +..++.||.||++.+++|.+++.+++..++...++ T Consensus 327 L~-e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq 405 (527) T protein:vir:10 327 HG-QNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQ 405 (527) T ss_pred cC-CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 85 5688888875 578899999999999999999999999994 34667899999999999999999999988888876 Q ss_pred HHH-HH----HHHhCCCC--cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHH Q lcl|NC_021324. 361 AMR-IA----MQIMGREV--TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMR 430 (480) Q Consensus 361 ~~~-li----~~~~~~~~--~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~~~~~ 430 (480) ..+ ++ -++.+... ......+.|+|.++.|.|.++.++.+.+++++| ++|.++++++| |+..|+-.+++ T Consensus 406 ~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aG--i~S~~tAv~~L~~~~g~eD~E~E~~ 483 (527) T protein:vir:10 406 FFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAG--LIPAKKLTEELSKIMGFELTEEDFK 483 (527) T ss_pred hhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcC--chhHHHHHHHHHhccCCCChHHHHH Confidence 543 22 22333222 222346789999999999999999999999987 78999998887 56666655555 Q ss_pred HHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC--ccCCCCCC Q lcl|NC_021324. 431 DWDKQETEDMIDTLYSTTKAQADATPKPTVTETK--TETQTSPS 472 (480) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~--~~~~~~~~ 472 (480) ++..++..+.++...+...-++.++...+.++.. ....+.|. T Consensus 484 ~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 484 QATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 5554444433332222222222322222222222 22223333 No 67 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=2.7e-48 Score=281.40 Aligned_cols=455 Identities=12% Similarity=0.111 Sum_probs=311.0 Q ss_pred CCCHHHHHHHHHHHH-HHHHHHHHHHHHHhhccCc-chhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---C Q lcl|NC_021324. 1 MTTYHEHVERLQGLL-ARDLPNLLEAEAYRNGTRR-LKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---S 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-~~~~~~~~~~~~YY~G~~~-i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~ 75 (480) |... ...| ..|+.+|+.|.+||.+.+. +...-+ -.+......+.++..++||...- ++.+.|... . T Consensus 21 ~p~~-------v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lr-g~~~~~~r~~~~ps~~~~~~~~~-~~~~~g~~~~~~~ 91 (527) T protein:vir:10 21 FPNA-------VTDFDKARLASYRLYEDMYLTNTSDYQVILR-GGDEGDQRPIYVPNGEKLIEAKM-RFLGQGLKWEFSK 91 (527) T ss_pred Cccc-------CCHHHHHHHHHHHHHHHHhcCchhheeeecC-CccccccceeeehhhHHhhCCcc-eeeccCccccccc Confidence 3221 3333 4677899999999999753 221111 11111245678899999999654 445666543 3 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEE-- Q lcl|NC_021324. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRA-- 153 (480) Q Consensus 76 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~-- 153 (480) .+++....|..++++|++..++.+..+++.+-|++-+++-.+.. ..+.++++++.+||.+.||+.|+...+.+..+ T Consensus 92 ~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~--k~~~~R~~v~~~DP~~~f~~ed~d~~~~v~~v~~ 169 (527) T protein:vir:10 92 KDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDE--KDEGSRLSLHEVDPSTYFPYEDPRYPGQVLGVYL 169 (527) T ss_pred hhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccC--CCcCCCceEeecCcceeeeeecCCCCCceeeEEE Confidence 45667778889999999999999999999999998555432210 12346799999999999999988776665554 Q ss_pred EEEEEEecCCce--E-------------------EEEEEEeCCcEE--EEEecCCC--cc------cccccccccccccc Q lcl|NC_021324. 154 VRLYTTRDDVAV--P-------------------DRATLYLPDETV--PLRRNGGL--ND------QWVVDGDVIKHGLG 202 (480) Q Consensus 154 ~~~~~~~~~~~~--~-------------------~~~~~~~~~~~~--~~~~~~~~--~~------~~~~~~~~~~~~~g 202 (480) +.-|...++.+. + ...++||...+. .|.-.... .. ....+.+..|++++ T Consensus 170 ~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~~~~l~~lp~pi~ 249 (527) T protein:vir:10 170 VDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLSTLTEEEPLPEQIT 249 (527) T ss_pred eeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcCceeeecccCCCC Confidence 222433333221 0 011222211110 00000000 00 11123456789999 Q ss_pred ccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhee Q lcl|NC_021324. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT 282 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~ 282 (480) +||||+|+|.+..++.||+|+|+ ++++++|++|+++|+.+.++.+.+.|+.++.|..+.+ ..+....+.+.++.+|. T Consensus 250 fiPvV~~~t~p~~~~~WG~S~La-~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd--~~G~~~~~~VgPG~iwe 326 (527) T protein:vir:10 250 TLPVFHFRGHPIMNAMFGRSGLA-GLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRD--SRGNMVPWTISPLGMVE 326 (527) T ss_pred ccceEeecCCCccccccChhhHh-HHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccccc--ccCCcCccccCCceeEe Confidence 99999999999999999999997 5999999999999999999999999999999987654 23444567889999998 Q ss_pred cCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhcc-ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 LASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 283 ~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) ++ ++++|..++. .+++.|.++++.+...|+.++++|...||. +..++.||.||++.+++|.+++.+++..++...++ T Consensus 327 L~-e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq 405 (527) T protein:vir:10 327 HG-QNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQELELKSVLKQ 405 (527) T ss_pred cC-CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 85 5688888875 578899999999999999999999999994 34667899999999999999999999988888876 Q ss_pred HHH-HH----HHHhCCCC--cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHH Q lcl|NC_021324. 361 AMR-IA----MQIMGREV--TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMR 430 (480) Q Consensus 361 ~~~-li----~~~~~~~~--~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~~~~~ 430 (480) ..+ ++ -++.+... ......+.|+|.++.|.|.++.++.+.+++++| ++|.++++++| |+..|+-.+++ T Consensus 406 ~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aG--iiS~etAv~~L~~~~g~eD~E~E~~ 483 (527) T protein:vir:10 406 FFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAG--LIPAKKLTEELSKIMGFELTEEDFR 483 (527) T ss_pred hhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcC--chhHHHHHHHHHhccCCCchHHHHH Confidence 543 22 22333222 222346789999999999999999999999987 78999998887 56666655555 Q ss_pred HHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC--ccCCCCCC Q lcl|NC_021324. 431 DWDKQETEDMIDTLYSTTKAQADATPKPTVTETK--TETQTSPS 472 (480) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~--~~~~~~~~ 472 (480) ++..++..+.++...+...-++.++...+.++.. ....+.|. T Consensus 484 ~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~~~~~ 527 (527) T protein:vir:10 484 QATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALNGQPL 527 (527) T ss_pred HHHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccCCCCC Confidence 5554444433332222222222322222222222 22223333 No 68 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=5.2e-46 Score=268.85 Aligned_cols=442 Identities=9% Similarity=0.010 Sum_probs=293.7 Q ss_pred CCCHHHHHHHHHHHH-----------------HHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-----------------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) .+-++.|+.+...+. .....|+++..+||+|+++........++...+.+..+|+++.||+.+ T Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~ 83 (500) T protein:vir:30 4 IQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKI 83 (500) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHH Confidence 445566655533221 123467888899999997554322223344446677889999999999 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEE Q lcl|NC_021324. 64 SDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL 142 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~ 142 (480) ++++++++.++ .+++..++.|+++++.|+|...+.+++..++..|.+|+.+|.+ .++++|.+++|++++|+. T Consensus 84 A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~ 156 (500) T protein:vir:30 84 ASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQ 156 (500) T ss_pred hhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEE Confidence 99999887542 3456688899999999999999999999999999999999863 467899999999999986 Q ss_pred eCCCCCceEEEEEEE-EEecC-Cc-eEEEEEEEe--CCcE--EE---EEecCCCccc------ccc---ccccccccccc Q lcl|NC_021324. 143 DPRNTRRVTRAVRLY-TTRDD-VA-VPDRATLYL--PDET--VP---LRRNGGLNDQ------WVV---DGDVIKHGLGV 203 (480) Q Consensus 143 d~~~~~~~~~~~~~~-~~~~~-~~-~~~~~~~~~--~~~~--~~---~~~~~~~~~~------~~~---~~~~~~~~~g~ 203 (480) .+..+ ...+++.++ ....+ .+ .++.++.|+ .+.. +. |+.......+ .+. ..+...+++++ T Consensus 157 ~d~~~-~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 235 (500) T protein:vir:30 157 SNTQD-VSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTR 235 (500) T ss_pred EcCCC-eEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCC Confidence 55444 344444333 22222 22 344566654 3322 22 3322111110 000 11223356777 Q ss_pred cceEEe----ecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC---Cccc-cccccccchhhh Q lcl|NC_021324. 204 VPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV---TTDE-LTNDGENTTLDI 275 (480) Q Consensus 204 iPvv~~----~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~---~~~~-~~~~~~~~~~~~ 275 (480) +|+++| +|+....+|+|.|++.. +++++|++|..+|++++.++....++.+-..+ +... ..+......+.. T Consensus 236 p~f~~~~~~~~N~~~~~sp~G~S~~~~-~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~ 314 (500) T protein:vir:30 236 PIFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFES 314 (500) T ss_pred ccEEEecCCccccccCCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCC Confidence 777765 57778889999999975 89999999999999999998644433331111 1100 000011112221 Q ss_pred hcchheec---CCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTL---ASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKG 351 (480) Q Consensus 276 ~~~~~~~~---~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~ 351 (480) .......+ +++...+..++.. ..+.|.+.++.++++++..+++++..||....+..+|++++++++.+.+++..++ T Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~ 394 (500) T protein:vir:30 315 DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIV 394 (500) T ss_pred CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHH Confidence 11111112 2233456666543 5788999999999999999999999999887777899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh-CCCChhH Q lcl|NC_021324. 352 RIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID-LGYTATQ 425 (480) Q Consensus 352 ~~~~~~l~~~~~li~~~~~-----~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~-~~~~~d~ 425 (480) +.|+.+|++++++++.+.. ........++.|.|.+.++.|..+.++.+.+++++| ++|.++++.+ +|+++++ T Consensus 395 ~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aG--i~s~~~~i~~~~g~~eee 472 (500) T protein:vir:30 395 ALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAG--FGTREMAIQKVLNVTEEK 472 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHHhcCCCCHHH Confidence 9999999999998875532 122223356899999999999999999999999986 6899997755 5888887 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 426 REQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 426 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) .+++.+..+++.. ...+..++.. +.-++ T Consensus 473 a~~~l~~i~~E~~--------~~~~~~~~~~-----~~~g~ 500 (500) T protein:vir:30 473 AQEIAAEINTGIV--------DEINQQRTDT-----HLYGE 500 (500) T ss_pred HHHHHHHHHHhcc--------ccCCCCCccc-----cccCC Confidence 7665444444321 1111111111 11111 No 69 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=5.2e-46 Score=268.85 Aligned_cols=442 Identities=9% Similarity=0.010 Sum_probs=293.7 Q ss_pred CCCHHHHHHHHHHHH-----------------HHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-----------------~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) .+-++.|+.+...+. .....|+++..+||+|+++........++...+.+..+|+++.||+.+ T Consensus 4 ~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~ 83 (500) T protein:vir:98 4 IQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAAKKI 83 (500) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHHHHH Confidence 445566655533221 123467888899999997554322223344446677889999999999 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEE Q lcl|NC_021324. 64 SDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL 142 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~ 142 (480) ++++++++.++ .+++..++.|+++++.|+|...+.+++..++..|.+|+.+|.+ .++++|.+++|++++|+. T Consensus 84 A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~I~~v~ad~~~P~~ 156 (500) T protein:vir:98 84 ASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVD-------GDKVRVAFVQAPVFLPLQ 156 (500) T ss_pred hhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe-------CCceEEEEEcCCeeEEEE Confidence 99999887542 3456688899999999999999999999999999999999863 467899999999999986 Q ss_pred eCCCCCceEEEEEEE-EEecC-Cc-eEEEEEEEe--CCcE--EE---EEecCCCccc------ccc---ccccccccccc Q lcl|NC_021324. 143 DPRNTRRVTRAVRLY-TTRDD-VA-VPDRATLYL--PDET--VP---LRRNGGLNDQ------WVV---DGDVIKHGLGV 203 (480) Q Consensus 143 d~~~~~~~~~~~~~~-~~~~~-~~-~~~~~~~~~--~~~~--~~---~~~~~~~~~~------~~~---~~~~~~~~~g~ 203 (480) .+..+ ...+++.++ ....+ .+ .++.++.|+ .+.. +. |+.......+ .+. ..+...+++++ T Consensus 157 ~d~~~-~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 235 (500) T protein:vir:98 157 SNTQD-VSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTR 235 (500) T ss_pred EcCCC-eEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCC Confidence 55444 344444333 22222 22 344566654 3322 22 3322111110 000 11223356777 Q ss_pred cceEEe----ecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC---Cccc-cccccccchhhh Q lcl|NC_021324. 204 VPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV---TTDE-LTNDGENTTLDI 275 (480) Q Consensus 204 iPvv~~----~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~---~~~~-~~~~~~~~~~~~ 275 (480) +|+++| +|+....+|+|.|++.. +++++|++|..+|++++.++....++.+-..+ +... ..+......+.. T Consensus 236 p~f~~~~~~~~N~~~~~sp~G~S~~~~-~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~ 314 (500) T protein:vir:98 236 PIFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFES 314 (500) T ss_pred ccEEEecCCccccccCCCccCCchhhh-hHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCC Confidence 777765 57778889999999975 89999999999999999998644433331111 1100 000011112221 Q ss_pred hcchheec---CCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTL---ASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKG 351 (480) Q Consensus 276 ~~~~~~~~---~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~ 351 (480) .......+ +++...+..++.. ..+.|.+.++.++++++..+++++..||....+..+|++++++++.+.+++..++ T Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~ 394 (500) T protein:vir:98 315 DQNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIV 394 (500) T ss_pred CcceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHH Confidence 11111112 2233456666543 5788999999999999999999999999887777899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh-CCCChhH Q lcl|NC_021324. 352 RIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID-LGYTATQ 425 (480) Q Consensus 352 ~~~~~~l~~~~~li~~~~~-----~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~-~~~~~d~ 425 (480) +.|+.+|++++++++.+.. ........++.|.|.+.++.|..+.++.+.+++++| ++|.++++.+ +|+++++ T Consensus 395 ~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aG--i~s~~~~i~~~~g~~eee 472 (500) T protein:vir:98 395 ALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAG--FGTREMAIQKVLNVTEEK 472 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcC--CCCHHHHHHhcCCCCHHH Confidence 9999999999998875532 122223356899999999999999999999999986 6899997755 5888887 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 426 REQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 426 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) .+++.+..+++.. ...+..++.. +.-++ T Consensus 473 a~~~l~~i~~E~~--------~~~~~~~~~~-----~~~g~ 500 (500) T protein:vir:98 473 AQEIAAEINTGIV--------DEINQQRTDT-----HLYGE 500 (500) T ss_pred HHHHHHHHHHhcc--------ccCCCCCccc-----cccCC Confidence 7665444444321 1111111111 11111 No 70 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=1.7e-42 Score=249.62 Aligned_cols=451 Identities=9% Similarity=-0.009 Sum_probs=293.5 Q ss_pred CCCHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGL-----------------LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~-----------------~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) ++..+.+++++..+ +.....|+.+...||+|+++.........+..++.+...|+++.||+.+ T Consensus 4 ~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~~~~ 83 (522) T protein:vir:47 4 FQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTASKKI 83 (522) T ss_pred HHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHHHHH Confidence 66677788877644 2334577888889999987643322222233445678889999999999 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEE Q lcl|NC_021324. 64 SDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL 142 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~ 142 (480) ++++++++.++ .+|+..++.|+++++.|+|...+.+++..++..|.+++.+|.+ ++++++.+++|++.+|++ T Consensus 84 A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-------~~~~~i~~v~ad~~~P~~ 156 (522) T protein:vir:47 84 ASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-------GDKVRVAFIQAPVFFPLE 156 (522) T ss_pred hhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-------CCceEEEEEcCCceEEEE Confidence 99999987653 3456678899999999999999999999999999999998863 467999999999999985 Q ss_pred eCCCCCceEEEEEEEEEe--cCCce-EEEEEEEeC---------------CcEEE---EEecCCCccc----------cc Q lcl|NC_021324. 143 DPRNTRRVTRAVRLYTTR--DDVAV-PDRATLYLP---------------DETVP---LRRNGGLNDQ----------WV 191 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~~--~~~~~-~~~~~~~~~---------------~~~~~---~~~~~~~~~~----------~~ 191 (480) .+..+ ...+++...+.. ...+. ++.++.|+- ..++. |+.......+ +. T Consensus 157 ~~~~~-~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~ 235 (522) T protein:vir:47 157 SNTQD-VSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYK 235 (522) T ss_pred EcCCc-eEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCcccccccccccc Confidence 43332 333444332222 22221 223343320 11222 2222111111 11 Q ss_pred c-ccccccccccccceEEe----ecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhh----hcCCCcc Q lcl|NC_021324. 192 V-DGDVIKHGLGVVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----ISGVTTD 262 (480) Q Consensus 192 ~-~~~~~~~~~g~iPvv~~----~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~----i~G~~~~ 262 (480) . .....-+++.++++++| +|+.....|+|.|++.. .++++|++|.++|++++.++....++.+ +.-.... T Consensus 236 ~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~ 314 (522) T protein:vir:47 236 NLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDN-AKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQR 314 (522) T ss_pred CCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhh-hHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCC Confidence 1 11122356777778776 46677788999999975 8899999999999999999876655444 2110000 Q ss_pred ccccccccchhhhhcchheec---CCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH Q lcl|NC_021324. 263 ELTNDGENTTLDIYYGRILTL---ASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~---~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~ 338 (480) ..........+......+..+ .++...+..++.. ..+.|.+.++.+++.|...+++++..||.......+|++++. T Consensus 315 ~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s 394 (522) T protein:vir:47 315 PDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVS 394 (522) T ss_pred CCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHH Confidence 000000111122111111111 1233356555543 678899999999999999999999999988777788999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHH Q lcl|NC_021324. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKE 413 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-----~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e 413 (480) ..+.+.++++.+++.+..+|+++++.++.+.. .........+.|.|.+.++.|..+++....+++++| ++|.+ T Consensus 395 ~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG--~~s~e 472 (522) T protein:vir:47 395 ENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAG--FSTKK 472 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcC--CCCHH Confidence 99999999999999999999999998886542 122334467899999999999999999999999987 68999 Q ss_pred HHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 414 QARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 414 ~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 477 (480) +++.+ +|+++++.+++.++.+++... ... +..+-. +....+.++ |+..+ T Consensus 473 ~~i~~~~g~~eeea~~el~ri~~E~~~-------~~~----~~~~~~---~~~~~~~~~-~d~~~ 522 (522) T protein:vir:47 473 RAIGKTLNISGVEAEKELNAINSELLP-------MND----AELAIY---GMHDQNEEK-ADDKG 522 (522) T ss_pred HHHHhcCCCChHHHHHHHHHHHHhhcc-------CCC----CCCCCC---CCCCccccc-CCCCC Confidence 97655 488877765444444433210 000 000000 011111112 11111 No 71 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=4.3e-42 Score=247.37 Aligned_cols=466 Identities=12% Similarity=0.084 Sum_probs=290.8 Q ss_pred CCCH-------HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTY-------HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~-------~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |..- .-|+. .....|+.+|+.|.+||.|++.-...-..- . ...-+..++++++|++...+| +.|+. T Consensus 9 ~p~~~~fp~~~a~wV~---~~D~~RlaaY~ly~d~y~n~~~el~~il~G-~--dr~~~~~ps~r~~V~~~~~~L-g~~~~ 81 (563) T protein:vir:74 9 DPAKPFLRGGDDNIVD---ENDKNRVRAYDLYENIYLNSAETLKLVLRG-D--DSVPILMPSGRKIVEAVHRFL-GVGFD 81 (563) T ss_pred CCCcccccccccccCC---HHHHHHHHHHHHHHHhhcCchhhhhhhcCC-C--ceeeeccchHHHHHHHHHHhc-CCCcE Confidence 2221 12322 345668899999999999987442210000 1 122344568899999966554 88865 Q ss_pred c--CC---Cc----hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeC Q lcl|NC_021324. 74 I--SE---DS----EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDP 144 (480) Q Consensus 74 ~--~~---d~----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~ 144 (480) + +. |+ .....|.++++++++..++.+..+++.+-|++-++|-.+. .....++++++.++|.+.||+-|+ T Consensus 82 ~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp--~K~~g~R~rv~~vDP~~~fp~~dp 159 (563) T protein:vir:74 82 YLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADP--NKKAGERISVDEVDPRQIFLIEDG 159 (563) T ss_pred EecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeecc--ccccCCCceEeecCCceeeeccCC Confidence 4 32 22 1335678899999999999999999999999855543221 012346899999999999996555 Q ss_pred CCCCceEEEEE---EEEEecCCce-EE---------------------EEEEEeCCcEE-------EEEecCCC--cccc Q lcl|NC_021324. 145 RNTRRVTRAVR---LYTTRDDVAV-PD---------------------RATLYLPDETV-------PLRRNGGL--NDQW 190 (480) Q Consensus 145 ~~~~~~~~~~~---~~~~~~~~~~-~~---------------------~~~~~~~~~~~-------~~~~~~~~--~~~~ 190 (480) +..... +.++ -|...++... .. -++.|+.+.+- .+...... .... T Consensus 160 d~v~g~-~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~ 238 (563) T protein:vir:74 160 STVVGF-HMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSAQH 238 (563) T ss_pred CCcccc-eeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhhhh Confidence 443221 1111 2222221111 00 11111111000 00000000 0001 Q ss_pred ccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc Q lcl|NC_021324. 191 VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN 270 (480) Q Consensus 191 ~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~ 270 (480) ..+.+..|++++.||+|+|+|.+..++.||+|+|++ ++++++++|..+|+.+.++.+.++|+.++.|..+. ....+.. T Consensus 239 d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~-ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~-d~~~g~~ 316 (563) T protein:vir:74 239 DEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEG-METLAYALNQSLTDEDATIVFQGLGMYVTNASAPV-DPNTGEL 316 (563) T ss_pred hchhhhccccccCccEEEcCCCCCcccccchhhHHH-HHHHHHHHhhhhhHHHHHHHhcCCCeEEecccccc-ccccccc Confidence 113345689999999999999999999999999975 99999999999999999999999999888876533 2344455 Q ss_pred chhhhhcchheecCCC--CceEEEecc-ccHHHHHHHHHHHHH-HHHhhcCCChHHhcc-ccccchHHHHHHHHHHHHHH Q lcl|NC_021324. 271 TTLDIYYGRILTLASE--AAKISEFKA-AELRNFAEEMEVFRK-EAASITGLPPQYLSS-SSENPASAEAIIATDSRIVK 345 (480) Q Consensus 271 ~~~~~~~~~~~~~~g~--~~~~~~~~~-~~~~~~~~~l~~~~~-~i~~~~~~p~~~~g~-~~~~~~Sg~Al~~~~~~l~~ 345 (480) ..+.+++|.+|.+.+. ...+-.++. .+++++..+++.+.+ -++.++++|...||. +..+..||.||++.+.+|.+ T Consensus 317 ~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~a 396 (563) T protein:vir:74 317 TDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLLA 396 (563) T ss_pred cccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHHH Confidence 6688899999988644 235666775 467788888887776 557788999999994 34556899999999999999 Q ss_pred HHHHHHHHHHHHHHH----HHHHHH-HHhC-----C-----CCc--ccceeeeEEecCCCCCCHHHHHHHHHHHHhcccc Q lcl|NC_021324. 346 MAERKGRIFGGAWER----AMRIAM-QIMG-----R-----EVT--EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG 408 (480) Q Consensus 346 k~~~~~~~~~~~l~~----~~~li~-~~~~-----~-----~~~--~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~ 408 (480) ++.++++.+...+++ .+++++ .+.+ . +.. ..-..+.|+|.++.|.|.++....++.++++| T Consensus 397 ~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aG-- 474 (563) T protein:vir:74 397 ANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAH-- 474 (563) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcC-- Confidence 999999977777766 333333 1222 1 111 11234678999999999999999999999987 Q ss_pred CCCHHHHHHhC---CCChhHH-HHHHHHHHHHHHHHHHHHhhhc-ccCCCcCCCCCCCCCCccCCCC-------CCCCCc Q lcl|NC_021324. 409 PIPKEQARIDL---GYTATQR-EQMRDWDKQETEDMIDTLYSTT-KAQADATPKPTVTETKTETQTS-------PSGFNR 476 (480) Q Consensus 409 ~~s~e~~~~~~---~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~d~~~~~~~~-------~~~~~~ 476 (480) ++|.+++.++| ||...+. .+++.+...+..+++-+...+. .....+..+.+..|+....++. |-.... T Consensus 475 iiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p~~~~~~~~~~~~ 554 (563) T protein:vir:74 475 LILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQGNPIDQFGNPVEIPP 554 (563) T ss_pred chhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccCCchhHcCCcccCCc Confidence 78999998777 7754332 2333333333333222211111 1122222222222322222222 222222 Q ss_pred ccCC Q lcl|NC_021324. 477 TKTR 480 (480) Q Consensus 477 ~~~~ 480 (480) .-|+ T Consensus 555 ~~~~ 558 (563) T protein:vir:74 555 DVTQ 558 (563) T ss_pred cccc Confidence 2333 No 72 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=6.8e-40 Score=235.31 Aligned_cols=445 Identities=9% Similarity=-0.022 Sum_probs=290.3 Q ss_pred CCCHHHHHHHHHHHHH-----------------HHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLA-----------------RDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~-----------------~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) .+.++.|++++..++. ....|+.+..+||.|+++-.+.........+..+...|+++.|+..+ T Consensus 4 ~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~~~~ 83 (517) T protein:vir:98 4 IQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSADVL 83 (517) T ss_pred HHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHHHHh Confidence 5666777776644321 12457778889999998653221112233345577889999999999 Q ss_pred HhhhccCc--cccCCC----------chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEE Q lcl|NC_021324. 64 SDRLDIEG--FRISED----------SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 64 ~~~l~~~~--~~~~~d----------~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~ 131 (480) +++++.+. +++++. ...++.|+++++.|+|...+.+.+..++..|.+++.+|.+ +++++|. T Consensus 84 A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-------~~~~~I~ 156 (517) T protein:vir:98 84 SGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD-------NGEIEFS 156 (517) T ss_pred hhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe-------CCeeEEE Confidence 99998875 344321 2356789999999999999999999999999999998863 4678999 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEE-EecCCc--eEEEEEEEeCCcE------E--E---EEecCCCcccccc----- Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYT-TRDDVA--VPDRATLYLPDET------V--P---LRRNGGLNDQWVV----- 192 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~~~------~--~---~~~~~~~~~~~~~----- 192 (480) +++|++++|+-.+. ++...+++..+. ...+.+ .++.++.|+.+.+ + + |+.......+..+ T Consensus 157 ~v~ad~~~Pl~~~~-~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~ 235 (517) T protein:vir:98 157 WALANAFYPLRSNS-NGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEEL 235 (517) T ss_pred EEcCCeeEEEEecC-CCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccccccccc Confidence 99999999964333 344555543322 222222 3455677765542 1 1 2222111111100 Q ss_pred ----ccccccccccccceEEe----ecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc Q lcl|NC_021324. 193 ----DGDVIKHGLGVVPVVPL----TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL 264 (480) Q Consensus 193 ----~~~~~~~~~g~iPvv~~----~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~ 264 (480) .....-+++.++++++| +|+....+++|.|++.. +++++|++|..+|+++..++....++.+=..+- ... T Consensus 236 ~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~-a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l-~~~ 313 (517) T protein:vir:98 236 YEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDN-SVSTLKKINDTYDQFWWEIKMGQRTVFVSDVML-RTV 313 (517) T ss_pred ccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhh-hHHHHHHHHHHHHHHHHHHHhCCcceecChhhh-ccc Confidence 11223355666666665 46777788999999975 889999999999999999987555443311111 000 Q ss_pred cccc---ccchhhhhcchhe--ecCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH Q lcl|NC_021324. 265 TNDG---ENTTLDIYYGRIL--TLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 265 ~~~~---~~~~~~~~~~~~~--~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~ 338 (480) .+.. ....+......+. ..+.++..+..++.. ..+.|.+.++.+++.|...+|++...||....+..+|++++. T Consensus 314 ~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s 393 (517) T protein:vir:98 314 PDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVS 393 (517) T ss_pred cCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHH Confidence 0111 1111221111111 222233445555543 467899999999999999999999999988777778999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH Q lcl|NC_021324. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIM------GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPK 412 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~------~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~ 412 (480) ..+.+.+++.++++.+..+|++++++++.+. +... ....++.|.|.+.+++|..+.++...+++++| ++|. T Consensus 394 ~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~-~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG--~ms~ 470 (517) T protein:vir:98 394 ENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEI-PSAEHIGVDFDDGVFQDRSALLRFYGQAKTFG--FIPT 470 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC-CCCcceEEEcCCCCCCCHHHHHHHHHHHHhcC--CCCH Confidence 9999999999999999999999999886542 2222 23456889999999999999999999999987 5789 Q ss_pred HHHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCC Q lcl|NC_021324. 413 EQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTS 470 (480) Q Consensus 413 e~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) ++++.+ +|+++++.+++.++.+++.. ...+....++. ++...++++ T Consensus 471 ~~~i~~~~g~~eeeA~~e~~~i~~E~~----------~~~~~~~~~~~--~~~~~gd~e 517 (517) T protein:vir:98 471 VEAIQRIFKVPKKTAEQWLEEIRKDQI----------ELDPVTISQRA--QKRMFGDEE 517 (517) T ss_pred HHHHHHhCCCChHHHHHHHHHHHHhcc----------ccCCCCccccc--cCCCCCCCC Confidence 887655 59988877655444444321 00111111111 111111111 No 73 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=7e-39 Score=229.75 Aligned_cols=440 Identities=12% Similarity=0.097 Sum_probs=270.8 Q ss_pred CCCHHHHHHHHHHHH--HHHHHHHHHHHHHhhccCc----ch----hcCcccchhhhcceeccchHHHHHHHHHhhhccC Q lcl|NC_021324. 1 MTTYHEHVERLQGLL--ARDLPNLLEAEAYRNGTRR----LK----TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~--~~~~~~~~~~~~YY~G~~~----i~----~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) .++.++.|+...+-- .....+...+.++|.+... +. ..+...++...+.++..|+++.||+.++++++++ T Consensus 4 ~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ll~~e 83 (518) T protein:vir:78 4 WSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAEYISGK 83 (518) T ss_pred hhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCccccccccCChHHHHHHHHHHhhcCC Confidence 455555555443211 0122333333333433311 10 1122223444567888999999999999999988 Q ss_pred ccc--cC-----CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEe Q lcl|NC_021324. 71 GFR--IS-----EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELD 143 (480) Q Consensus 71 ~~~--~~-----~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d 143 (480) +.+ ++ +++.+++.|+++++.|+|...+.+.+..++..|.+++.++.. +|++++.+++|.+++|+|+ T Consensus 84 ~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-------~~~~~i~~v~ad~~~P~~~ 156 (518) T protein:vir:78 84 PLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-------NGRPSISVHSSSQFWIDFK 156 (518) T ss_pred CceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-------CCeeEEEEEcCCeeEEEee Confidence 654 33 355567889999999999999999999999999999988752 5789999999999999997 Q ss_pred CCCCCceEEEEEEEE-EecCCc-eEEEEEEEeCCc-----------EEE---EEecCCCc--cccc---------c---- Q lcl|NC_021324. 144 PRNTRRVTRAVRLYT-TRDDVA-VPDRATLYLPDE-----------TVP---LRRNGGLN--DQWV---------V---- 192 (480) Q Consensus 144 ~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~-----------~~~---~~~~~~~~--~~~~---------~---- 192 (480) +.. +..++.... ...+.+ .++.++.|..+. .+. |+...+.. .... . T Consensus 157 ~g~---~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~ 233 (518) T protein:vir:78 157 NNE---PFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTND 233 (518) T ss_pred cCc---EEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCccccccccccccccccccccccc Confidence 543 333333222 222222 233344443221 111 22111100 0000 0 Q ss_pred --ccccccccccccceEEe-e----cccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc Q lcl|NC_021324. 193 --DGDVIKHGLGVVPVVPL-T----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT 265 (480) Q Consensus 193 --~~~~~~~~~g~iPvv~~-~----n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~ 265 (480) ....++++. ..|++.| + |+...++++|.|++.. +++++|+||.++|++++.++....++.+-..+-..... T Consensus 234 ~~e~~~~~tg~-~~~~~~~~~n~~~N~~~~~splG~S~~~~-~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~ 311 (518) T protein:vir:78 234 IQLNHSVSIGL-KSMGAYLINNSPSNTRYPHLNLGESDLSQ-CTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKKVN 311 (518) T ss_pred CccceeeccCC-ccceEEeeccccccccccCCCcCcchHhh-hhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccCCC Confidence 011123343 4455554 3 5566678899999975 89999999999999999998744444332111100000 Q ss_pred c--ccccchhhhhcchheecC-----CCCc--eEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHH Q lcl|NC_021324. 266 N--DGENTTLDIYYGRILTLA-----SEAA--KISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEA 335 (480) Q Consensus 266 ~--~~~~~~~~~~~~~~~~~~-----g~~~--~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~A 335 (480) . ......+......+..+. |.++ .+.+++. -..+.|++.++.+++.+...+|+++..||.++. ..||++ T Consensus 312 ~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~-~~TATe 390 (518) T protein:vir:78 312 KSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNR-EVKATE 390 (518) T ss_pred CCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccc-cccHHH Confidence 0 011112222222222221 1111 2555543 357889999999999999999999999987543 479999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-------CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhcccc Q lcl|NC_021324. 336 IIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-------EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG 408 (480) Q Consensus 336 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-------~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~ 408 (480) ++...+.+.+++..++..+..+|+++++.++.+... ....+...+.|.|.+.++.|..+.++...+++++| T Consensus 391 i~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aG-- 468 (518) T protein:vir:78 391 IWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSAL-- 468 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcC-- Confidence 999999999999999999999999999988765431 11223457889999999999999999999999987 Q ss_pred CCCHHHHHHhC--CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC-CCCcc Q lcl|NC_021324. 409 PIPKEQARIDL--GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-ETKTE 466 (480) Q Consensus 409 ~~s~e~~~~~~--~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~ 466 (480) ++|.++++.++ ++++++.+++.++++++.. . ...+.+.+-++ +++++ T Consensus 469 imS~e~~i~~~~~~~~deea~~e~~ri~~E~~--------~---~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 469 AMSVEEKVKLIHPKWEDEEIQAEVKRIYLENA--------I---GEVPDPEAIGGMETKGG 518 (518) T ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhc--------c---cCCCCCccccCCCCCCC Confidence 68999988764 5776665543333333321 0 00011111111 11111 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.96 E-value=9.6e-28 Score=168.72 Aligned_cols=457 Identities=15% Similarity=0.114 Sum_probs=276.2 Q ss_pred CCCHH-HHHHHHHHHHHHHHHHHHHHHHHhhccCcchh-----cCc---ccchhhhcc---eeccchHHHHHHHHHhhhc Q lcl|NC_021324. 1 MTTYH-EHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-----IGI---GAPPELAYL---DVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~-~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~-----~~~---~~~~~~~~~---~~~~n~~~~iVd~~~~~l~ 68 (480) |+|-. +.+..--..|....++++++.+.|.|...++. +++ .....|+.. -+..|+++.+|+.++++++ T Consensus 1 m~~~~~~~v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~vf 80 (513) T protein:vir:97 1 MADKDPKSPATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKPF 80 (513) T ss_pred CCCCCCCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhhh Confidence 88863 34555556677888999999999988755442 222 222234322 2457999999999999998 Q ss_pred cCccccCCCchhHHHHHHH-HH-----hcCHHHHHHHHHHHHhhcCeeEEEEecCccccCC------------CCceeEE Q lcl|NC_021324. 69 IEGFRISEDSEGLEELWNW-WQ-----ANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD------------PAGIPLI 130 (480) Q Consensus 69 ~~~~~~~~d~~~~~~l~~~-~~-----~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d------------~~g~~~~ 130 (480) -+..+...+ ....+.+. ++ -++++.+++.++..++.+|+++++|..+.....+ ..-.|.+ T Consensus 81 ~k~p~~~~~--~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~ 158 (513) T protein:vir:97 81 SEPIKLNED--VPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRPYW 158 (513) T ss_pred hcCcccCcC--chHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCceE Confidence 887765422 23334432 22 2579999999999999999999999765433211 1124889 Q ss_pred EEEcccceEEEEeC--CCCC-ceEEEEEE--EEEecCCc--eEEEEEEEeCCcEEEEEecCCCccc--cccccccccccc Q lcl|NC_021324. 131 RVESPLYMYAELDP--RNTR-RVTRAVRL--YTTRDDVA--VPDRATLYLPDETVPLRRNGGLNDQ--WVVDGDVIKHGL 201 (480) Q Consensus 131 ~~~~p~~~~~v~d~--~~~~-~~~~~~~~--~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~ 201 (480) ..++|.+++- |+. .... .+..++.. +.+.|.-+ ....+.+|+++.+..|+...+.... .....+...|.+ T Consensus 159 ~~~~~e~Iin-W~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l 237 (513) T protein:vir:97 159 VMIKPECLLF-ARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGL 237 (513) T ss_pred EEecHhhhcC-cceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcC Confidence 9999998765 322 1222 23332222 22333222 3345567888887777665433221 112334456899 Q ss_pred cccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhe Q lcl|NC_021324. 202 GVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL 281 (480) Q Consensus 202 g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~ 281 (480) +.||||.|... .....-+.+.|. +|-.|..+.-+..|+...++.+.++|++++.|.+... ...+.++.+.+| T Consensus 238 ~~IP~v~~~~~-~~~~~~~~pPLl-~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~------~~~i~iG~~~~~ 309 (513) T protein:vir:97 238 NYVPLVTFYAD-RQGFMMGKPPLL-DLAHLNVAHWQSASDQRHILTVSRFPILACSGASGED------SDPVVVGPNKVL 309 (513) T ss_pred CceeEEEEecC-CCCCCCCccchH-HHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCC------CCceEeeccccc Confidence 99999988754 334455677776 4778888888999999999999999999999975432 112456777777 Q ss_pred ecCC--CCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 282 TLAS--EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 282 ~~~g--~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) .+++ .++.|.+.+.+.+....+.++.+..++..+.- ..+..++. +.||++.+.......+....+...+..+|. T Consensus 310 ~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga---~ll~~~~~-~~Ta~a~~~~~~~~~S~L~~~a~~le~al~ 385 (513) T protein:vir:97 310 YNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGA---EFLKRKTG-GQTATARALDSAEATSDLSAMTGLFEDALA 385 (513) T ss_pred cCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHH---HhhccCCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7764 45677788888888888888887777755432 11332333 368999999999999999999999999999 Q ss_pred HHHHHHHHHhCCCCcccceeeeEEecCCCCCCH-HHHHHHHHHHHhccccCCCHHHHHHhC---CCChhH--HHHHHHHH Q lcl|NC_021324. 360 RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTV-AAKADAVSKLYANGQGPIPKEQARIDL---GYTATQ--REQMRDWD 433 (480) Q Consensus 360 ~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~-~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~--~~~~~~~~ 433 (480) ++++++..++|.+.. ...+++. ++.....+ ++.++++.++.++| .+|.+++++.| +....+ .++..+.. T Consensus 386 ~~l~~~a~wlg~~~~--~~~v~in-~dF~~~~~~~~~~~al~~a~~~G--~is~~t~~~~L~r~gvl~~d~d~~~~~e~~ 460 (513) T protein:vir:97 386 QALDITADWLRLGPN--GGTVELV-KDYDLEEMDAPGLQALQVAREKR--DISRKTYLNGLRLRGVLPEDFDEDEDWEEL 460 (513) T ss_pred HHHHHHHHHhCCCCC--ccEEEec-cccCcccCCHHHHHHHHHHHhCC--CCCHHHHHHHHHhccCCCccCCHHHHHHHH Confidence 999999999885432 1223331 22223333 45678888888876 68999987665 454211 11112222 Q ss_pred HHHH-HHHHHHH--hhhcccCCCcC-C-----CCCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 434 KQET-EDMIDTL--YSTTKAQADAT-P-----KPTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 434 ~~~~-~~~~~~~--~~~~~~~~~~~-~-----~~~~~d~~~~~~~~~~~~~~~ 477 (480) +++. ++..+.- .+...+.+..+ . +..++|....+.+-.+.++.+ T Consensus 461 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 461 MEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred HHhhhhccCCCCccccccCCCCCCCCCCCCCCCCCCCCCCCccccCCCCCCCC Confidence 2221 1111100 00001111111 1 111111111111111111111 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.94 E-value=1.3e-25 Score=156.96 Aligned_cols=423 Identities=14% Similarity=0.116 Sum_probs=252.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchh-----cCccc---chhhhcc---eeccchHHHHHHHHHhhhcc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-----IGIGA---PPELAYL---DVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~-----~~~~~---~~~~~~~---~~~~n~~~~iVd~~~~~l~~ 69 (480) |+ |..--..|....++++++.+.|.|...++. +++.. ...|+.. -+..|+++.+|+.++++++. T Consensus 1 m~-----V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~vf~ 75 (452) T protein:vir:94 1 MP-----IETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMVLD 75 (452) T ss_pred CC-----CCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchhhc Confidence 43 222223456677788888888888765543 22221 1222222 23479999999999999988 Q ss_pred CccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCc Q lcl|NC_021324. 70 EGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 70 ~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~ 149 (480) +..+....+. ...+..=-+-++++.+...++..++.+|+++++|..+. ..+.|.+..++|.+++- |+...... T Consensus 76 k~p~~~~p~~-l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~-----~g~rPy~~~~~~~~Ii~-W~~~~~g~ 148 (452) T protein:vir:94 76 QPPVITHPDA-MSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPL-----TGGDPYISVYTTENILN-WEEDEDGR 148 (452) T ss_pred CCceecccHH-HHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeecc-----CCCceEEEEechhhhcC-ccccccCC Confidence 8876533222 22222112346899999999999999999999997643 34679999999999875 54333333 Q ss_pred eEEEEEEEE--EecC-----CceEEEEEEEe--CCcEEE--EEecCCCcccc--ccccccccccccccceEEeecccccC Q lcl|NC_021324. 150 VTRAVRLYT--TRDD-----VAVPDRATLYL--PDETVP--LRRNGGLNDQW--VVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 150 ~~~~~~~~~--~~~~-----~~~~~~~~~~~--~~~~~~--~~~~~~~~~~~--~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) +...+..+. ..+. .+....+.+++ ++.+.. |+...+..+.. ....+...|+++.||+|.+... ... T Consensus 149 l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~-~~~ 227 (452) T protein:vir:94 149 LLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPS-GLS 227 (452) T ss_pred eeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCC-CCC Confidence 333332222 1121 12333444554 554433 33222221111 1122344588999999977644 333 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCC--CCceEEEec Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLAS--EAAKISEFK 294 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g--~~~~~~~~~ 294 (480) ...+.+.|. ++-.|.-+..+..|+..+++...++|++++.|.+.. + .+.++.+..|.++. .++.+.+.+ T Consensus 228 ~~~~~pPLl-~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~------~--~i~iG~~~~~~lpe~~~~~~yie~~ 298 (452) T protein:vir:94 228 MTPAKPPMI-DIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQ------S--TMHIGSTKAWVIPEVAAKVGFLEFT 298 (452) T ss_pred CCCCccchH-HHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCC------C--ceEecccccccCCCCCCcceEEccC Confidence 345677776 488888888899999999999999999999986422 1 13456677777763 356777888 Q ss_pred cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021324. 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) Q Consensus 295 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 374 (480) ...++.+.+.++.+..++..+.. ..+-..+.+..|++|.+.....-.+....+...+..++.+++++++.+.|.... T Consensus 299 g~~i~~~~~~l~~le~~m~~~Ga---~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~~~~ 375 (452) T protein:vir:94 299 GQGLQSLEKALSEKQAQLASLSA---RLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDMESMGGT 375 (452) T ss_pred chhHHHHHHHHHHHHHHHHHHHH---HhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCc Confidence 87888888888877777655432 122223334467877766555555555556666677888888988888885421 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHHHHhhhcccC Q lcl|NC_021324. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRDWDKQETEDMIDTLYSTTKAQ 451 (480) Q Consensus 375 ~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (480) ..+++.-......-.++.++++.++..+| .+|.+|+++.| |..+.+.+ .+..+.|. .+ .+ T Consensus 376 ---~~v~~n~dF~~~~~~~~~~~al~~~~~~G--~is~~t~~~~L~~~gvl~~~~e-~~~i~~E~--------~~---~~ 438 (452) T protein:vir:94 376 ---LNIKLNSAFLDSKLTAAELKAWVEAYLSG--GISKEIYIHALKVGKVLPPPGE-SMGVIPDP--------PA---PE 438 (452) T ss_pred ---eEEEeccccccccCCHHHHHHHHHHHhcC--CCcHHHHHHHHHhCCCCCCccC-HHHHHHHh--------hc---cC Confidence 23433211122233457788888888776 68999998777 65543221 11111111 00 01 Q ss_pred CCcCCCCCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 452 ADATPKPTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 452 ~~~~~~~~~~d~~~~~~~~~~~~~~~ 477 (480) +.+...| |++++++ T Consensus 439 ~~~~~~~------------~~~~~~~ 452 (452) T protein:vir:94 439 PSPSNTP------------PNPSSKA 452 (452) T ss_pred cccCCCC------------CCCccCC Confidence 1111111 1111111 No 76 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.93 E-value=8.1e-25 Score=152.66 Aligned_cols=452 Identities=14% Similarity=0.093 Sum_probs=244.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchh-----cCccc--------chhhhcc---eeccchHHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-----IGIGA--------PPELAYL---DVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~-----~~~~~--------~~~~~~~---~~~~n~~~~iVd~~~ 64 (480) |.| |+.---.|....++++++.+.|.|...++. +++-. ...|+.+ -+..|+++.+|+.++ T Consensus 32 m~d----V~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~ 107 (535) T protein:vir:80 32 LPN----VGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDGMM 107 (535) T ss_pred CCC----CCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHHHh Confidence 554 332233466777888999999998765543 22211 0113322 345799999999999 Q ss_pred hhhccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeeEEEEecCccccC-------CCCceeEEEE Q lcl|NC_021324. 65 DRLDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG-------DPAGIPLIRV 132 (480) Q Consensus 65 ~~l~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~-------d~~g~~~~~~ 132 (480) ++++.+..... ....|..+++. ++++.+++.++..++.+|+++++|..+..+.. .....|.+.. T Consensus 108 G~vfrk~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rPy~~~ 183 (535) T protein:vir:80 108 GQVFSRDPIRQ----LPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRPTITL 183 (535) T ss_pred chhhcCCccee----ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCcEEEE Confidence 99887765442 23445555543 47999999999999999999999975432210 0123589999 Q ss_pred EcccceEEEEeCC-CC-C-ceEEEEEE--EEEecC---CceEEEEEEEeCC--c---EEEEEecCCC--cccc--ccccc Q lcl|NC_021324. 133 ESPLYMYAELDPR-NT-R-RVTRAVRL--YTTRDD---VAVPDRATLYLPD--E---TVPLRRNGGL--NDQW--VVDGD 195 (480) Q Consensus 133 ~~p~~~~~v~d~~-~~-~-~~~~~~~~--~~~~~~---~~~~~~~~~~~~~--~---~~~~~~~~~~--~~~~--~~~~~ 195 (480) ++|.+++- |+.. .+ . .+..++.. +...++ ......+.+++++ . +..|+..+.. ...+ ....+ T Consensus 184 y~ae~Iin-W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~ 262 (535) T protein:vir:80 184 VHPTSIIN-WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTD 262 (535) T ss_pred echhhccC-ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeeccc Confidence 99998765 3322 22 1 23333222 222222 1233345555553 2 2234333221 1111 12223 Q ss_pred cccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh Q lcl|NC_021324. 196 VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI 275 (480) Q Consensus 196 ~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~ 275 (480) ...|.++.||+|.|... ......+.+.|. ++-.|.-+.-+.-|+..+++.+.++|++++.|.+.....+..+...+.+ T Consensus 263 ~g~~~l~~IPfv~~~~~-~~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~~~i~i 340 (535) T protein:vir:80 263 GNGNPFKEIPFQFIGPL-DNNADIDHPPLL-DLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKDFKVHL 340 (535) T ss_pred CCCcccCeeEEEEeecC-CCCCCCCccchH-HHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCCcceEe Confidence 45589999999977533 334445666665 4666766667777889999999999999999976543222222223455 Q ss_pred hcchheecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 276 ~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) +....|.++ +.+.++.+++.+.+. .+.++.+..++..+... .+...+.+..++ +-+.......+....+...+ T Consensus 341 G~~~~~~lP~~~~~~~~e~~~~~~a--~~~l~~~e~qM~~lGa~---ll~~~~~~~Ta~-~a~~~~~~~~S~L~~~a~~l 414 (535) T protein:vir:80 341 GSRAIIPLPQGATAGILQITPNSVP--FEAMTHKESQMIAMGAN---LLVKSGGNRTFG-EAQQEEASEQSILSACTKNV 414 (535) T ss_pred cCcccccCCCCCCcceeeeccchhH--HHHHHHHHHHHHHHHHH---hhccCcccccHH-HHHHHHHHHhHHHHHHHHHH Confidence 666667665 445677777665443 34566555555443211 122122222222 22444444455566666777 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHH--HH Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQR--EQ 428 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~--~~ 428 (480) ..++.+++++++.+.|.....+...+.+. ++..... .++.++++.++.++| .+|.+|+++.| ++++.+. ++ T Consensus 415 e~al~~aL~~~A~w~G~~~~~~~~~i~~n-~dF~~~~ld~~~~~all~~~~~G--~Is~et~~~~L~r~gvl~~~~~~ee 491 (535) T protein:vir:80 415 SMAFRKALRWANQFQTGIVNDETVEYNLN-TDFPAARLTPNERAELILEWQQG--AITFKEMRAGLRRAGVASEDDAKAE 491 (535) T ss_pred HHHHHHHHHHHHHHcCCccCCCceEEEec-cccccccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCCCCcccchHH Confidence 88888889988888885433222223321 2222233 345677888888875 68999998766 6654332 22 Q ss_pred HHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 429 MRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 429 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) .....+.+.. + .....+...+.+.+.....+.+.+++-.+.+++ T Consensus 492 e~~ri~~E~~---~-~~~~~g~~~d~~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 492 TEGKATVEFI---A-KTAAAGKVGDAASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred HHHHHHhhhh---h-ccccCCCCCCCCCCCCCcCcccCCccccccCCC Confidence 2222222211 1 000000000001011101111111211122222 No 77 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.90 E-value=2.3e-22 Score=139.21 Aligned_cols=440 Identities=15% Similarity=0.132 Sum_probs=243.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcch-----hcCccc----c----hhhhcc---eeccchHHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK-----TIGIGA----P----PELAYL---DVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~-----~~~~~~----~----~~~~~~---~~~~n~~~~iVd~~~ 64 (480) |.|. +.---.|....++++++.+.+.|...++ ++++-. + ..|+.+ -+..|+++.+|+.++ T Consensus 1 m~~V----~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~l~ 76 (501) T protein:vir:95 1 MPNV----SFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFGLV 76 (501) T ss_pred CCCC----CCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHHHh Confidence 7652 2222346667788889999999976553 332210 1 123322 245799999999999 Q ss_pred hhhccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeeEEEEecCccccC-C--------CCceeEE Q lcl|NC_021324. 65 DRLDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG-D--------PAGIPLI 130 (480) Q Consensus 65 ~~l~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~-d--------~~g~~~~ 130 (480) ++++.+..++. ....|..+++. ++++.+++.++..++.+|+++++|..+..... . ..-.|.+ T Consensus 77 G~vf~k~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~rPy~ 152 (501) T protein:vir:95 77 GQVFMRDPVVK----VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRIRPTL 152 (501) T ss_pred hhhhcCCccee----CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccCCcEE Confidence 99988876663 23444555443 57999999999999999999999976532110 0 0124889 Q ss_pred EEEcccceEEEEeC-CCCC--ceEEEEEE--EEEecC---CceEEEEEEEeCC--c---EEEEEecCCCcc--------c Q lcl|NC_021324. 131 RVESPLYMYAELDP-RNTR--RVTRAVRL--YTTRDD---VAVPDRATLYLPD--E---TVPLRRNGGLND--------Q 189 (480) Q Consensus 131 ~~~~p~~~~~v~d~-~~~~--~~~~~~~~--~~~~~~---~~~~~~~~~~~~~--~---~~~~~~~~~~~~--------~ 189 (480) ..++|.+++- |+. ..+. .+..++.. +.+.++ ......+.+++.+ . +..|+....... . T Consensus 153 ~~~~~~~Iin-W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~ 231 (501) T protein:vir:95 153 YVYSPTEIIN-WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGN 231 (501) T ss_pred EEecHhhhcC-cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCc Confidence 9999998765 332 2221 23333222 222222 1122333444433 2 222433322110 0 Q ss_pred cc-----cccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc Q lcl|NC_021324. 190 WV-----VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL 264 (480) Q Consensus 190 ~~-----~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~ 264 (480) +. ...+...|.++.||+|.+.. ...+..-+.+.+. ++-.|.-+.-+.-|+...++.+.++|+++++|.+.+.. T Consensus 232 ~~~~~~~~~~~~g~~~l~~IPfv~~~~-~~~~~~~~~pPLl-~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~~ 309 (501) T protein:vir:95 232 YQQYVVYKPTDAQGKRLTEIPFMFIGS-ENNDSNPDNPNFY-DLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEWV 309 (501) T ss_pred ccccceeeeeccCCCcCCeeeEEEEec-CCCCCCCCccchH-HHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCccccc Confidence 00 11122348999999996633 2333233444454 34455444455667888889999999999999876533 Q ss_pred ccccccchhhhhcchheecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHH Q lcl|NC_021324. 265 TNDGENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRI 343 (480) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l 343 (480) ..... ..+.++....+.++ ++++++.+.+.+.+. .+.++.+..++..+.- ..+..++. +.||++.+...... T Consensus 310 ~~~~~-~~i~~G~~~~~~lP~~~~~~~ie~~~~~i~--~~~l~~l~~~m~~~Ga---~ll~~~~~-~~Ta~~~~~~~~~~ 382 (501) T protein:vir:95 310 TNVLK-GSVNFGSRGGIPLPVGADAKLLQASENTML--KEAMDTKERQMVALGA---KLVEQKEV-QRTATEAELEAASE 382 (501) T ss_pred ccCCC-CceeecccccccCCCCCceeEEecChhhHH--HHHHHHHHHHHHHHHH---hhccCCcc-chhHHHHHHHHHHH Confidence 22222 23455655666664 445667776655543 4556666665544421 11222222 35788888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCHHHHHHhC--- Q lcl|NC_021324. 344 VKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPKEQARIDL--- 419 (480) Q Consensus 344 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~e~~~~~~--- 419 (480) .+....+...+..+|.+++++++.+.|... +...+++. ++..... .++.++++.++..+| .+|.+|+++.| T Consensus 383 ~S~L~~~a~~le~al~~~l~~~a~w~g~~~--~~~~v~i~-~df~~~~~~~~~~~al~~~~~~G--~is~~t~~~~L~~~ 457 (501) T protein:vir:95 383 GSTLSSATKNVSAAFEWALKWAARWVGQAD--SGVKFELN-TDFDIARMTPDERRSLVEEWQKG--AITFEEMRTGLRKA 457 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCCC--CceEEEEe-cccccccCCHHHHHHHHHHHhCC--CCcHHHHHHHHHhC Confidence 888888888889999999999999988432 22234432 3333333 356688888888765 68999996555 Q ss_pred CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 420 GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 420 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) ++.+.+.+.+++.++.+.. . ....+.++..+. ++..+..-|.++ T Consensus 458 ~v~~~~~~~e~e~i~~~~~-~-------~~~~~~~~~~~~-----~~~gg~~~~~~~ 501 (501) T protein:vir:95 458 GVATEDDSKAKEKIAKDTA-E-------AMALATPANVPG-----DGSGGDNVGNSE 501 (501) T ss_pred CCCChhHHHHHHHHHhhhc-C-------cccccccCCCCC-----CCcccccccCCC Confidence 6665443322222222110 0 000011111110 000011011111 No 78 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.87 E-value=4.4e-21 Score=132.17 Aligned_cols=439 Identities=15% Similarity=0.081 Sum_probs=241.0 Q ss_pred CCCHHH---HHHHHHHHHHHHHHHHHHHHHHhhccCc--c-----hhcCccc-chhhhcc---eeccchHHHHHHHHHhh Q lcl|NC_021324. 1 MTTYHE---HVERLQGLLARDLPNLLEAEAYRNGTRR--L-----KTIGIGA-PPELAYL---DVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~~~~~---~i~~l~~~~~~~~~~~~~~~~YY~G~~~--i-----~~~~~~~-~~~~~~~---~~~~n~~~~iVd~~~~~ 66 (480) |-+..- =|+.--..|....++++++.+.|.|... . ++.++.. ...|... -+..|+++.+|++++++ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhch Confidence 222110 0111123455666777888888988542 1 1111111 1113222 24579999999999999 Q ss_pred hccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeeEEEEecCccccCC------CCceeEEEEEcc Q lcl|NC_021324. 67 LDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESGD------PAGIPLIRVESP 135 (480) Q Consensus 67 l~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d------~~g~~~~~~~~p 135 (480) ++.+..... ....|..+++. ++++.+++.++..++.+|+++++|..+...... ..-.|.+..++| T Consensus 81 vfrk~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~ 156 (489) T protein:vir:78 81 VMRKEPEIN----IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTT 156 (489) T ss_pred hhcCCccee----ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEech Confidence 988876652 23334444443 579999999999999999999999876432110 122699999999 Q ss_pred cceEEEEeCC-CCC-ceEEEEEEEE--Eec-----CCceEEEEEEEeCCc-----EEEEEecCCCcccc-cc--cccccc Q lcl|NC_021324. 136 LYMYAELDPR-NTR-RVTRAVRLYT--TRD-----DVAVPDRATLYLPDE-----TVPLRRNGGLNDQW-VV--DGDVIK 198 (480) Q Consensus 136 ~~~~~v~d~~-~~~-~~~~~~~~~~--~~~-----~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~-~~--~~~~~~ 198 (480) .+++---... ... .+...+.... ..+ .......+.+++.+. +..|+....+.... .+ ..+.-. T Consensus 157 ~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~ 236 (489) T protein:vir:78 157 ENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGE 236 (489) T ss_pred hhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCC Confidence 9976531111 222 3333332222 112 122345566777652 23344333222111 11 123345 Q ss_pred ccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc--cccccchhhhh Q lcl|NC_021324. 199 HGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT--NDGENTTLDIY 276 (480) Q Consensus 199 ~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~--~~~~~~~~~~~ 276 (480) |.++.||+|.+... .....-+.+.|. +|-.|.-+.-+.-|+...++...++|++++.|.+..... .......+.++ T Consensus 237 ~~l~~IPfv~~~~~-~~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~~~i~~g 314 (489) T protein:vir:78 237 SLRGVIPFTFIGAT-NNDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANPNGIKFG 314 (489) T ss_pred CccCeeeEEEEecC-CCCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCccceeeC Confidence 88999999987643 223334555564 366666666677788999999999999999986532211 11111123334 Q ss_pred cchheecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhh-cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 277 YGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASI-TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 277 ~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~-~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ....+.++ ++.+++.+.+..++ ..+.++.+..++..+ ..+. ..+. +-||++.+.....-.+....+...+ T Consensus 315 ~~~~~~lp~~~~~~~ie~~~~~~--~r~~l~~le~qm~~lGa~l~-----~~~~-~~Ta~~~~~~~~~~~S~L~~~a~~~ 386 (489) T protein:vir:78 315 SRRGHNLGYGGSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQLI-----TPTQ-QITAQSARIQRGADTSVMATIARNV 386 (489) T ss_pred CcccccCCCCCCcceeccCcchH--HHHHHHHHHHHHHHHhhhhc-----cCCc-chhHHHHHHHHHHhhHHHHHHHHHH Confidence 44555553 44566777765543 244555555555443 2221 1121 3577777777777777778888888 Q ss_pred HHHHHHHHHHHHHHhCCCCccc-ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHH Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEE-YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMR 430 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~-~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~~~~~ 430 (480) ..++.+++++++.++|.....+ ...+...|.. ..-.++.++++.++.++| .+|.+++++.| |+.+.+.++++ T Consensus 387 e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~--~~~d~~~~~al~~~~~~G--~is~~t~~~~L~~~gv~d~~~e~~~ 462 (489) T protein:vir:78 387 SQAYTDALRWVAVMLGKPEDTEVEFRLNMDFFL--EPMTAQDRAAWMADINAG--LLPATAYYAALRKAGVTDWTDADIK 462 (489) T ss_pred HHHHHHHHHHHHHHcCCCCCCceEEEeecccCc--ccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCCCCccHHHHH Confidence 9999999999999988543221 1122333421 122255677888888766 68999988765 55544333332 Q ss_pred HHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 431 DWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) ++.+++. ...+....++-|+ ++..... T Consensus 463 ~ei~~~~---------~~~~~~~~g~~~~----------~~q~~~~ 489 (489) T protein:vir:78 463 DAVADQP---------LPVATEVQGEIPQ----------SAQQQEK 489 (489) T ss_pred HHHhhcC---------CCcccCCcccCCC----------CcccccC Confidence 2222110 0000011111111 1111111 No 79 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.85 E-value=7.5e-20 Score=125.43 Aligned_cols=434 Identities=15% Similarity=0.077 Sum_probs=236.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCc-------chhcCccc-chhhhcc---eeccchHHHHHHHHHhhhcc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRR-------LKTIGIGA-PPELAYL---DVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~-------i~~~~~~~-~~~~~~~---~~~~n~~~~iVd~~~~~l~~ 69 (480) +++.. ..|....++++++.+.|.|... +++.++.. ...|+.+ -+..|+++.+|++++++++. T Consensus 11 V~~~h-------p~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~vfr 83 (491) T protein:vir:95 11 VKTKH-------REWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSVMR 83 (491) T ss_pred CCccC-------HHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhchhhc Confidence 44433 3455666778888888888431 11111111 1123322 24579999999999999988 Q ss_pred CccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeeEEEEecCccccCC------CCceeEEEEEcccce Q lcl|NC_021324. 70 EGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESGD------PAGIPLIRVESPLYM 138 (480) Q Consensus 70 ~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d------~~g~~~~~~~~p~~~ 138 (480) +..... ....|..+++. ++++.+++.++..++.+|+++++|..+...... ..-+|.+..++|.++ T Consensus 84 k~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~I 159 (491) T protein:vir:95 84 KEPEIN----IPKELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENI 159 (491) T ss_pred CCceee----ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhh Confidence 876652 22334444443 579999999999999999999999876432110 122699999999997 Q ss_pred EEEEeCCCC--CceEEEEEEEE--Eec-----CCceEEEEEEEeCC-----cEEEEEecCCCcccccc---ccccccccc Q lcl|NC_021324. 139 YAELDPRNT--RRVTRAVRLYT--TRD-----DVAVPDRATLYLPD-----ETVPLRRNGGLNDQWVV---DGDVIKHGL 201 (480) Q Consensus 139 ~~v~d~~~~--~~~~~~~~~~~--~~~-----~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~ 201 (480) +---....+ ..+...+.... ..+ .....+.+.+++.+ .+..|+....+...... ..+...|.+ T Consensus 160 inW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~l 239 (491) T protein:vir:95 160 VNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESLR 239 (491) T ss_pred cCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCCccc Confidence 652111111 23333332222 111 12233445555443 22234433222111111 112234789 Q ss_pred cccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc--cccccchhhhhcch Q lcl|NC_021324. 202 GVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT--NDGENTTLDIYYGR 279 (480) Q Consensus 202 g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~--~~~~~~~~~~~~~~ 279 (480) +.||+|.+... .....-+.+.|. +|-.|.-+.-+.-|+...++...++|++++.|.+..... .......+.++... T Consensus 240 ~~IPfv~~~~~-~~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~~~i~~g~~~ 317 (491) T protein:vir:95 240 GVIPFTFIGAT-NNDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANPNGIKFGSRC 317 (491) T ss_pred CeeEEEEEecC-CCCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCcceeEecCcC Confidence 99999987643 223334555554 365665566677788888899999999999986542211 11111123344444 Q ss_pred heecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhh-cCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 280 ILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASI-TGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGA 357 (480) Q Consensus 280 ~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~-~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 357 (480) .+.++ ++++.+.+.+..++ ..+.|+.+..++... +.+ ...+. +-||++.+.....-.+....+...+..+ T Consensus 318 ~~~lP~~~~~~~ie~~~~~~--~~~~l~~~e~qm~~~Ga~l-----~~~~~-~~Ta~~~~~~~~~~~S~L~~~a~~~e~a 389 (491) T protein:vir:95 318 GHNLGYGGSAQLIQAGENNL--ARQNMLDKEQQAIQIGAQL-----ITPSQ-QITAESARIQRGADTSVMATIARNVSQA 389 (491) T ss_pred CcCCCCCCccceeecCcchH--HHHHHHHHHHHHHHHHHHh-----ccCCc-chhHHHHHHHHHHhhHHHHHHHHHHHHH Confidence 45543 44566777665443 234444444433332 121 11222 3578888888777778888888888999 Q ss_pred HHHHHHHHHHHhCCCCccc-ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHHHH Q lcl|NC_021324. 358 WERAMRIAMQIMGREVTEE-YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRDWD 433 (480) Q Consensus 358 l~~~~~li~~~~~~~~~~~-~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~~~~~~~~ 433 (480) +.+++++++.+.|.....+ ...+...|.. ..-.++.++++.++.++| .+|.+++++.| ++.+...+++.++. T Consensus 390 l~~~l~~~a~w~G~~~~~~v~i~~n~dF~~--~~~~~~~~~all~~~~~G--~is~~t~~~~L~~~~vl~~~~e~~~~~i 465 (491) T protein:vir:95 390 YTDALRWVAMMLGKPEDSEVEFQLNMDFFL--QPMTAQDRAAWMADINAG--LLPATAYYAALRKAGVTDWTDEDILNAI 465 (491) T ss_pred HHHHHHHHHHHcCCCCCCceEEEeeccccc--ccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCCCCccHHHHHHHH Confidence 9999999999988543222 1122333421 122255678888888876 68999988765 45543333332222 Q ss_pred HHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 434 KQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 434 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +++. .+.....+.+.+-+++..+..+ T Consensus 466 e~~~-------~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 466 EDAP-------LPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HhcC-------CCCCccccccccchhhhhhccC Confidence 2211 0000000111111111111111 No 80 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.82 E-value=1.7e-18 Score=117.97 Aligned_cols=423 Identities=10% Similarity=0.006 Sum_probs=221.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc----------------chhhhcc---e-eccchHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA----------------PPELAYL---D-VQPGWVATYL 60 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~----------------~~~~~~~---~-~~~n~~~~iV 60 (480) .++.......+..+|..-.--...--+ ..|..-+|..+... ...+.++ + +..|+++..+ T Consensus 16 V~~~hp~y~a~~~~W~~~~d~g~~~~k-~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~tl 94 (488) T protein:vir:96 16 TPIYHPDYLVNAPQWLRNLDCVMDNIK-RKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNPTM 94 (488) T ss_pred ccccCHHHHHHhhhhhHhhhhhhHHHH-HhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHHHH Confidence 455555555555555321110100000 11221222211100 1111111 1 3469999999 Q ss_pred HHHHhhhccCccccCCCchhHHHHHHHHHh-----cCHHHHHHHHHHHHhhcCeeEEEEecCccccC--C---CCceeEE Q lcl|NC_021324. 61 RTLSDRLDIEGFRISEDSEGLEELWNWWQA-----NDLDEESVLGHDDSLTFGRAYITVSHPDVESG--D---PAGIPLI 130 (480) Q Consensus 61 d~~~~~l~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~--d---~~g~~~~ 130 (480) ++++++++-+......++ ...|..+++. ++++.+.+.++..++.+|+++++|..+..... | ..-.|.+ T Consensus 95 ~~l~G~vfrk~p~~~~~~--~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy~ 172 (488) T protein:vir:96 95 NAITGAVMRREPEFDTMD--NPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPTA 172 (488) T ss_pred HHhcchhhccCceeccCC--cHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcEE Confidence 999999988877654322 1224444443 57999999999999999999999987532110 0 1235899 Q ss_pred EEEcccceEEEEeCC-CCC-ceEEEE-EE-EEEecCC--c--eEEEEEEEeCCcEEEEEecCCCcccccccccccccccc Q lcl|NC_021324. 131 RVESPLYMYAELDPR-NTR-RVTRAV-RL-YTTRDDV--A--VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG 202 (480) Q Consensus 131 ~~~~p~~~~~v~d~~-~~~-~~~~~~-~~-~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 202 (480) ..++|.+++---... .++ .+..++ +- +.+.|.. . ....+..++++.+..++...+...........-.|.++ T Consensus 173 ~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e~~~~~~g~~~l~ 252 (488) T protein:vir:96 173 AFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDEWTPVLINSKQSD 252 (488) T ss_pred EEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccceEeecCCCcccC Confidence 999999977532112 222 233332 22 2222321 1 12222235666544444433322211122223457899 Q ss_pred ccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhee Q lcl|NC_021324. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT 282 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~ 282 (480) .||||+|... ..+..-+.+.+. ++-.|.-+.-+..|+...++...++|++++.+....... .......-+..+.-.. T Consensus 253 ~IP~v~~~~~-~~~~~~~~pPLl-dLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~~~~~~~-~~~~~~~g~~~~~~~~ 329 (488) T protein:vir:96 253 TIPFFLASSQ-SNEWCIDSTPLT-SLAEISLSIYVMNAYSNKAMILANEAKWMVDMGDMNKTM-ASEMNPLGFTLAGRMP 329 (488) T ss_pred eeEEEEEecC-CCCCCCCCCchH-HHHHHHHHHHhhhhHHHHHHHhcCCceeeeccCCCCccc-ccccccceeeeccccc Confidence 9999987543 333344555564 366666666677788888887778888876433222211 1111111122222222 Q ss_pred --cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhc-CCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 --LASEAAKISEFKAAELRNFAEEMEVFRKEAASIT-GLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 283 --~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~-~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) .+.++.++.+.+.+.+ ..+.++.+..++..+. .+. ..+.+ -||++.+.....-.+....+...+..++. T Consensus 330 ~~~~~g~~~~~e~~~~~l--~~~~l~~l~~qm~~~Ga~l~-----~~~~~-~Ta~~~~~~~~~~~S~L~~~a~~le~al~ 401 (488) T protein:vir:96 330 YYVKNGDVKVIQAQFSPE--TENKVEKLFEQAVKVGASLF-----TQQSN-ETATGAAIRSGSSTASMATLGNNVEDTVR 401 (488) T ss_pred ccccCCceeecCCchhHH--HHHHHHHHHHHHHHHhHhhc-----cCCCc-chHHHHHHHHHHhhHHHHHHHHHHHHHHH Confidence 2333344544443333 2455666666554432 121 12222 47888777777777777888888899999 Q ss_pred HHHHHHHHHhCCCCc---ccceeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCHHHHHHhC---CCChhH--HHHHH Q lcl|NC_021324. 360 RAMRIAMQIMGREVT---EEYTRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPKEQARIDL---GYTATQ--REQMR 430 (480) Q Consensus 360 ~~~~li~~~~~~~~~---~~~~~~~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~--~~~~~ 430 (480) +++++++.+.|.... .....+++. ++..... .++.++++.++..+| .+|.+|+++.| |++.++ .++++ T Consensus 402 ~~l~~~A~w~g~~~~~~~~~~~~~~in-~dF~~~~ld~~~~~al~~~~~~G--~Is~~t~~~~L~~~gvl~~d~~~e~~~ 478 (488) T protein:vir:96 402 NMLRFIMRYFEGTNLYVNPDELVFKLN-RDYFDVEVNPQMLQVAYAAMMEG--NLPQVSWFELLKRARVVRGDMSKEEFD 478 (488) T ss_pred HHHHHHHHHcCCCCCCcCccceEEEec-cCCCCccCCHHHHHHHHHHHhcC--CCCHHHHHHHHHhCCcCCccCCHHHHH Confidence 999999998874321 111223321 2222333 356788888888776 68999987665 555332 23332 Q ss_pred HHHHHHHHHH Q lcl|NC_021324. 431 DWDKQETEDM 440 (480) Q Consensus 431 ~~~~~~~~~~ 440 (480) ++.+++.-.+ T Consensus 479 ~~ie~~g~~~ 488 (488) T protein:vir:96 479 EHIAELGFGM 488 (488) T ss_pred HHHhhcCCCC Confidence 2222211101 No 81 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.80 E-value=4e-20 Score=126.90 Aligned_cols=465 Identities=12% Similarity=0.074 Sum_probs=219.7 Q ss_pred CCCHH--HHHHHHHHHHHHHHHH-------HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCc Q lcl|NC_021324. 1 MTTYH--EHVERLQGLLARDLPN-------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~~~~~-------~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) ++++. ++..+|+..+.....+ ..+-.+||+|+|--.... ..-.....-.+++|.++.+|+..+++..-+. T Consensus 38 ~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~i~~v~g~~~~nr 116 (776) T protein:vir:93 38 LDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWSQDEI-DELKERGQAPTVYNVISQSVNWIIGSEKRGR 116 (776) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCCHHHH-HHHHhcCCceEEecchHHHHHHHHHHHHhCC Confidence 55542 3555665554443221 234468999997432211 1111122345889999999999988765442 Q ss_pred cc------cCCCchhHH----HHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCce-eEEEEEcccceEE Q lcl|NC_021324. 72 FR------ISEDSEGLE----ELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYA 140 (480) Q Consensus 72 ~~------~~~d~~~~~----~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~-~~~~~~~p~~~~~ 140 (480) .. -.+|.+..+ .+..++..|+++.....++.+++++|.||+-|+.+. +.++. +++++++|.+++ T Consensus 117 ~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~----~~~~~~~~~~~~~p~~i~- 191 (776) T protein:vir:93 117 SDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQD----ENDGEPIYAGAESWRNIL- 191 (776) T ss_pred cceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeec----cCCCCceEeeccChhhee- Confidence 21 123444333 345567778999999999999999999999987643 22333 455678888765 Q ss_pred EEeCCCCC----ceEEEEE-EEE------------------------------------------------------Eec Q lcl|NC_021324. 141 ELDPRNTR----RVTRAVR-LYT------------------------------------------------------TRD 161 (480) Q Consensus 141 v~d~~~~~----~~~~~~~-~~~------------------------------------------------------~~~ 161 (480) ||+.... ...+.++ .|. ... T Consensus 192 -~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (776) T protein:vir:93 192 -WDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAY 270 (776) T ss_pred -eccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccccccccccc Confidence 5543211 0111110 000 000 Q ss_pred CCceEEEEEEEeCCcEEEEEecC--CC-cc------------------------------------cccccccccccccc Q lcl|NC_021324. 162 DVAVPDRATLYLPDETVPLRRNG--GL-ND------------------------------------QWVVDGDVIKHGLG 202 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~--~~-~~------------------------------------~~~~~~~~~~~~~g 202 (480) ....++.+++|+...+....... +. .. ..+......|...+ T Consensus 271 ~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p~~~~ 350 (776) T protein:vir:93 271 ARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWAGPSPYRHN 350 (776) T ss_pred CCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhccCCCCCCC Confidence 11233455666533221110000 00 00 00011122344568 Q ss_pred ccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchhee Q lcl|NC_021324. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT 282 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~ 282 (480) ++|+|+|+........+|.|.+ +.+++.++.+|..+|.+.+.+. +.++.+-.|.... . +.-... ...++.++. T Consensus 351 ~~Pfv~~~~~~~~~~~~~~G~v-~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav~~-~-d~~~~~--~~rp~~vi~ 423 (776) T protein:vir:93 351 RYPFTPIWGFRRARDGMPYGVI-RFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAVDD-I-DEFRRE--AARPDAVMT 423 (776) T ss_pred ccceEEecCceecccccccchH-HhhhHHHHHHHHHHHHHHHhhc--CCceeeccccccc-h-HHHHHh--cccCCceee Confidence 9999999887777777788766 5699999999999999988763 3444443343211 1 110000 123444444 Q ss_pred cC-CCCceEE-EeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 LA-SEAAKIS-EFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 283 ~~-g~~~~~~-~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) .. |..+.+. +.+..-...+...+......|-.++|+++..+|..+ |..||+|+..+...-........+.|..++++ T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~-n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~~~ 502 (776) T protein:vir:93 424 VKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTT-NAVSGVAIQARQEQGSVATNKLFDNLRLAFQQ 502 (776) T ss_pred eCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCc-chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43 3322221 112223456778888888888889999999999654 55799999988877777777777777777777 Q ss_pred HHHHHHHH----hCCC---------Cccccee--------------eeEEecC-CCCCC-HHHHHHHHHHHHhccccCCC Q lcl|NC_021324. 361 AMRIAMQI----MGRE---------VTEEYTR--------------LETVWRD-PSTPT-VAAKADAVSKLYANGQGPIP 411 (480) Q Consensus 361 ~~~li~~~----~~~~---------~~~~~~~--------------~~v~f~~-~~~~~-~~~~ad~~~kl~~~~~~~~s 411 (480) ++++++.+ .+.. ...++.. .+|.-.. +...+ ..+....++++.+....-+. T Consensus 503 ~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~~~ 582 (776) T protein:vir:93 503 HGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPEIA 582 (776) T ss_pred HHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChhhH Confidence 76655433 2211 1111111 1111111 11122 22334445555432211111 Q ss_pred H---HHHHHhCCC--ChhHHHHHHHHHH----------------HHHH----HHHHHHhhhcc--cCCCcC------CCC Q lcl|NC_021324. 412 K---EQARIDLGY--TATQREQMRDWDK----------------QETE----DMIDTLYSTTK--AQADAT------PKP 458 (480) Q Consensus 412 ~---e~~~~~~~~--~~d~~~~~~~~~~----------------~~~~----~~~~~~~~~~~--~~~~~~------~~~ 458 (480) . ..+++..++ .++-.+++.+... ++.+ ....+...... .++.+. ... T Consensus 583 ~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~~~aea~~~ 662 (776) T protein:vir:93 583 LTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKARKAAAEAQVA 662 (776) T ss_pred HHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHHHHHHHHHH Confidence 1 112222222 1111122211100 0000 00000000000 000000 000 Q ss_pred CCCCCCccCCCCCCCC----CcccCC Q lcl|NC_021324. 459 TVTETKTETQTSPSGF----NRTKTR 480 (480) Q Consensus 459 ~~~d~~~~~~~~~~~~----~~~~~~ 480 (480) ...-.....+...... ...++. T Consensus 663 ~aqa~~~~~~a~~~~~~a~q~a~qa~ 688 (776) T protein:vir:93 663 EAKAKHISRMAIREGVGAVKDATDAA 688 (776) T ss_pred hhhhhhhhhcchhhhhhhhhhhhhhh Confidence 0000000000000000 000000 No 82 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.76 E-value=2.8e-17 Score=111.35 Aligned_cols=465 Identities=11% Similarity=0.052 Sum_probs=218.8 Q ss_pred CCCHHHHHHHHHHHHHHHH-------HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDL-------PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~-------~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) =.+.++++.++...+.... ....+-.+||.|+|--.... ..-.....-.+++|.++.+|+..+++..-+... T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr~~ 103 (711) T protein:vir:10 25 NDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVR-TERELEQRPCLVNNVLPTFVDQVLGDQRQNRPA 103 (711) T ss_pred cchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHH-HHHHhcCCCcEEEcchHHHHHHHhhhHhhCCcc Confidence 1222445555555443322 22334478999987532211 011111233578999999999999877544322 Q ss_pred c---C-------------------------CCchhHHHH----HHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc Q lcl|NC_021324. 74 I---S-------------------------EDSEGLEEL----WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES 121 (480) Q Consensus 74 ~---~-------------------------~d~~~~~~l----~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~ 121 (480) + + +|.+..+.| ..+++.|+.+.....++.+++++|.||+-|+.+...+ T Consensus 104 ~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~~d~~~~ 183 (711) T protein:vir:10 104 IKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRSDYLAD 183 (711) T ss_pred eEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEEEecccCC Confidence 1 1 233344444 3455668899999999999999999999887543333 Q ss_pred CCCCceeEEEEE-cccceEEEEeCCCCC----ceEEE-EEEEEEecC--------------------------CceEEEE Q lcl|NC_021324. 122 GDPAGIPLIRVE-SPLYMYAELDPRNTR----RVTRA-VRLYTTRDD--------------------------VAVPDRA 169 (480) Q Consensus 122 ~d~~g~~~~~~~-~p~~~~~v~d~~~~~----~~~~~-~~~~~~~~~--------------------------~~~~~~~ 169 (480) ...+|.+++..+ +|.++ +||+.... ...++ .+.|.+.++ ....... T Consensus 184 d~~~~e~~i~~v~~p~~v--~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~~~vrv~ 261 (711) T protein:vir:10 184 DSFEQDLIIEAIQNQFSV--TIDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTEKSVRVS 261 (711) T ss_pred CCCCCCeEEeeecChhhe--eeCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCcccCcceeeEE Confidence 445688888877 68875 46664211 11122 222211110 1123344 Q ss_pred EEEeCCcEEE-EEe-cCCCc------------------------------------cccccccccccccccccceEEeec Q lcl|NC_021324. 170 TLYLPDETVP-LRR-NGGLN------------------------------------DQWVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 170 ~~~~~~~~~~-~~~-~~~~~------------------------------------~~~~~~~~~~~~~~g~iPvv~~~n 211 (480) ++|....... +.. ..+.. .+.....++.|.+.+++|+|+|.- T Consensus 262 E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~~~P~vp~~g 341 (711) T protein:vir:10 262 EYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTIPVIPVWG 341 (711) T ss_pred EEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCCcccEEEEee Confidence 4554322111 100 00000 000011233344457788887753 Q ss_pred c----cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccccccccccchhhhhcchheecC-C Q lcl|NC_021324. 212 D----PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDIYYGRILTLA-S 285 (480) Q Consensus 212 ~----~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~-G~~~~~~~~~~~~~~~~~~~~~~~~~~-g 285 (480) . ...+.+||. .+.+++.++.+|..+|.+...+...+.+..++. |. .++..+. .... ...++.++.+. + T Consensus 342 ~r~~~d~~~~~~G~---vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~ga-i~~~~~~-~~e~-~~~~~~vi~~~~~ 415 (711) T protein:vir:10 342 KSLIIKKKEIFRSI---IRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGN-VEGREDE-WEQA-NTKNFSLLTYIPQ 415 (711) T ss_pred eeeccccccccchh---hhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcc-cCChHHH-HHhc-cccCCCeeEeccc Confidence 2 334446663 366899999999999999999988777665543 33 2211110 1100 12344444332 2 Q ss_pred C--CceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 286 E--AAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) Q Consensus 286 ~--~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 362 (480) . ...+...+. .-...+...++.....|-.++|+++..+|..+ |..||+|+......-..........+..+.+++. T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~-n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g 494 (711) T protein:vir:10 416 YQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMG-NETSGRAIIARQRQGDRGSFAFIDNLTKSIRRVG 494 (711) T ss_pred ccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 123433332 23456777777777777788999999998765 4579999999888777766667777777777766 Q ss_pred HHHHHH----hC---------CCCcccceee-----------------------eEEe--cCCCCCCHHHHHHHHHHHHh Q lcl|NC_021324. 363 RIAMQI----MG---------REVTEEYTRL-----------------------ETVW--RDPSTPTVAAKADAVSKLYA 404 (480) Q Consensus 363 ~li~~~----~~---------~~~~~~~~~~-----------------------~v~f--~~~~~~~~~~~ad~~~kl~~ 404 (480) ++++.+ .+ .....++..+ +|.- .+..+....+.+..++++.+ T Consensus 495 ~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~~ 574 (711) T protein:vir:10 495 KILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAMIQFAQ 574 (711) T ss_pred HHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHHHHHHh Confidence 655432 22 1111111111 1111 11111112234445555544 Q ss_pred ccccCCC--HHHHHHhCCCC-hhHH-HHH-----------------HHHHHHHHHH---HHHHHhhhcc--cCCCcC-CC Q lcl|NC_021324. 405 NGQGPIP--KEQARIDLGYT-ATQR-EQM-----------------RDWDKQETED---MIDTLYSTTK--AQADAT-PK 457 (480) Q Consensus 405 ~~~~~~s--~e~~~~~~~~~-~d~~-~~~-----------------~~~~~~~~~~---~~~~~~~~~~--~~~~~~-~~ 457 (480) ..+...+ ...+++.+++. .+++ +++ .+...+.++. ..-++..... ...+.. .. T Consensus 575 ~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~ 654 (711) T protein:vir:10 575 AVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQ 654 (711) T ss_pred hcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3211110 11122332221 1111 111 0000000000 0000000000 000000 00 Q ss_pred CCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ......+... ..-.++++ T Consensus 655 Aqae~~qa~~-----e~~~~q~q 672 (711) T protein:vir:10 655 AQADMLKAQL-----ETEEAQKQ 672 (711) T ss_pred HHHHHHHHHH-----HHHHHHHH Confidence 0000000000 00011111 No 83 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.72 E-value=3.8e-17 Score=110.60 Aligned_cols=435 Identities=8% Similarity=0.020 Sum_probs=213.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-cchhcC-----cccc-hhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR-RLKTIG-----IGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~-~i~~~~-----~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |.+..+--...+...... +...+..+=.|.. +....+ ..+. .........+.++++|||..++.+.-+|+. T Consensus 1 ~~~~~~a~~~~~~~~a~~--~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~~l~r~iVd~~a~d~~r~g~~ 78 (461) T protein:vir:80 1 MYSIDKAKQAKIDSKIVN--RNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLYASNSIAMNIVDIISEDMVRAGWS 78 (461) T ss_pred Cccchhhhhhhhhhhhhh--hhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHHHhCCccchhhccchHHhhcCCee Confidence 777766544333332221 1111111111211 111101 1111 112233445678899999999999999987 Q ss_pred cCC-CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccc--eEEEEeCCCCCce Q lcl|NC_021324. 74 ISE-DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLY--MYAELDPRNTRRV 150 (480) Q Consensus 74 ~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~--~~~v~d~~~~~~~ 150 (480) +.+ +++..+.++..|++-++...+.++.+.+..||.|++++...+... ........+.|.. .+...++.....+ T Consensus 79 i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~---~~~~~~~pl~~~~~~~~~~l~~~~~~~i 155 (461) T protein:vir:80 79 LKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR---EQADLSTAIDPKTIKSIPYINTFNTQKV 155 (461) T ss_pred eecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCc---cccCccCCcccccccceeEEEecccccc Confidence 654 445667788999888899999999999999999998886432110 0000111111111 0000000000000 Q ss_pred EEEEEEEE--EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHH Q lcl|NC_021324. 151 TRAVRLYT--TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 151 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) . ...... ...+.+.+.++.+........+...+. .....-.+..-++++|.+.+.....+|+|.+.+ + T Consensus 156 ~-~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~--------~~~~~~~iH~SRii~~~~~~~~~~~~G~S~le~-~ 225 (461) T protein:vir:80 156 T-QLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGT--------TASTSEQIHRSRIIHEQGLRFEGETKGRSIFES-L 225 (461) T ss_pred c-hhhhcccCcCcccccceEEEEeccccccccccccc--------cCccceEEccccEEEecCCCCCccccCcchHHH-H Confidence 0 000000 001222232222222111111111100 000111234457888988888888899998865 8 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccc-ccchhhhh--cchheecCCCCceEEEeccccHHHHHHHH Q lcl|NC_021324. 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDG-ENTTLDIY--YGRILTLASEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 229 ~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~-~~~~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~~l 305 (480) ...+.++++++-.....+..+..+.+.+.|.......... -...++.. ...++.++ .+-++.+++. ++....+.+ T Consensus 226 ~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~~~~d-~~e~~e~~~~-~lsgl~~~l 303 (461) T protein:vir:80 226 YDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEALAIIK-GDEQLTKEST-NVSGMKDLL 303 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceEEEEc-CCcceEEEec-CcCCHHHHH Confidence 8888889988888777666666555444432211100000 01112211 11223333 3335555443 355666777 Q ss_pred HHHHHHHHhhcCCChHHhccc-cccchHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhCC---CCcccceee Q lcl|NC_021324. 306 EVFRKEAASITGLPPQYLSSS-SENPASAEAIIATDSRIVKMAERKG-RIFGGAWERAMRIAMQIMGR---EVTEEYTRL 380 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~-~~~~~Sg~Al~~~~~~l~~k~~~~~-~~~~~~l~~~~~li~~~~~~---~~~~~~~~~ 380 (480) +.....|+..++||..-|.+. ++..+||..=. ......++.++ ..+++.|++++++++.-.+. ..+++..++ T Consensus 304 ~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~---~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~~~~~ 380 (461) T protein:vir:80 304 DYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDV---MNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPDSFEW 380 (461) T ss_pred HHHHHHHhhhhcCCeeeeecccCCccccchHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCccccce Confidence 888899999999998766543 44456775433 33334444444 56788999999988764432 344555689 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC-CChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG-YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) ++.|.+...++..+.|+...+.+++. .+++. .| ++++++.+ ....+ +.. .....-+..+++ T Consensus 381 ~i~f~~L~~~s~kekAe~~~~~a~a~------~~~~~-~g~is~~e~r~---~l~~~-------~~~-~~~~~~~~~~~~ 442 (461) T protein:vir:80 381 AIEFNPLWNLDSKTDAEVRKLTAEAD------QIYIV-NGVLDPDEVKE---TRFGR-------FGL-ENSSKFSGDSAE 442 (461) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHH------HHHHh-cCCCCHHHHHH---HHHHh-------cCC-CCCccCCCCCch Confidence 99999999999999998766655432 12222 24 33343221 11000 000 000000000000 Q ss_pred CC-----CCCccCCCCCCC Q lcl|NC_021324. 460 VT-----ETKTETQTSPSG 473 (480) Q Consensus 460 ~~-----d~~~~~~~~~~~ 473 (480) .. +.....+..+.| T Consensus 443 ~~~~~~~~~~~~~~e~~~g 461 (461) T protein:vir:80 443 IDKLAKLVYDAYAKKNADG 461 (461) T ss_pred hhhhhhhccccccccCCCC Confidence 00 000111111111 No 84 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.71 E-value=4.9e-15 Score=99.04 Aligned_cols=443 Identities=14% Similarity=0.055 Sum_probs=228.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc----cch-----hh-------hcceeccchHHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG----APP-----EL-------AYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~----~~~-----~~-------~~~~~~~n~~~~iVd~~~ 64 (480) |.=....|..+.-+...++.+.....+-|+|-..-+..+.. .++ .. +++-....|++-+|+..+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 66666666655555554444444444557664322111100 000 00 112234568999999999 Q ss_pred hhhccC-ccccC-----C----CchhHHHHHHHHH---h-------cCHHHHHHHHHHHHhhcCeeEEEEecCccccC-C Q lcl|NC_021324. 65 DRLDIE-GFRIS-----E----DSEGLEELWNWWQ---A-------NDLDEESVLGHDDSLTFGRAYITVSHPDVESG-D 123 (480) Q Consensus 65 ~~l~~~-~~~~~-----~----d~~~~~~l~~~~~---~-------n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~-d 123 (480) ...+|. |+... . +++..+.|...|+ + .+|...+..+++.....|.+|+.+........ + T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 999997 55421 1 2334445555554 2 36888888899999999999998765432211 1 Q ss_pred CCc-eeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccccccc Q lcl|NC_021324. 124 PAG-IPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG 202 (480) Q Consensus 124 ~~g-~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 202 (480) ..+ -+++..++|+.+---++ ....+...| ..+..|.+..+-++..+- +. .....+. T Consensus 161 g~~~~l~lq~iepd~l~~~~~--~~~~i~~GV----e~d~~Gr~~aY~i~~~hP--------gd---------~~~~~~~ 217 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSD--ESNRLNQGV----FVDDWGRPEKYLVYKSRP--------VS---------GRQMETK 217 (502) T ss_pred CcccceEEEEecchhcCCCCC--CCCeeEeee----EECCCCceEEEEEeecCC--------CC---------Cccccee Confidence 111 24889999988732222 122333333 234455444333332211 10 0112234 Q ss_pred ccc---eEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc-----ccccccchhh Q lcl|NC_021324. 203 VVP---VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL-----TNDGENTTLD 274 (480) Q Consensus 203 ~iP---vv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~-----~~~~~~~~~~ 274 (480) +|| |+|+......++..|.|.+++-+ ..+..++....-........+.--.+|+....+.. .+........ T Consensus 218 rvpA~~vlH~f~~~r~gQ~RGis~lapvl-~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~ 296 (502) T protein:vir:79 218 EVDAERMLHLKFVRRLHQMRGTSLLSGVL-IRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELT 296 (502) T ss_pred EechhheEEeecccCCccccCCchHHHHH-HHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCCcccccc Confidence 666 78988888888999999998744 44443443332222222222211223332111111 1111222334 Q ss_pred hhcchhee-c-CCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 275 IYYGRILT-L-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGR 352 (480) Q Consensus 275 ~~~~~~~~-~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~ 352 (480) +.+|.++. + +|.+.++.+.+. ...+|...++..++.|+...|+|-+.|.++. + .|-.|+|+.+..........+. T Consensus 297 l~pG~i~~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~-s-~nySs~R~~~~e~~r~~~~~q~ 373 (502) T protein:vir:79 297 IQPGIIYDDLKPGEEIGMVKSDR-PNPNLETFRNGQLRAVAAGSRLSFSSTARNY-N-GTYSAQRQELVESTDGYLILQD 373 (502) T ss_pred ccCCccccccCCCceeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-c-chHHHHHHHHHHHHHHHHHHHH Confidence 56676543 4 445555544322 2346777788888999999999999998874 2 2555678888877777777777 Q ss_pred HHHHHHHH-HHHH--HHHHhCCCCcc-----cceeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCC Q lcl|NC_021324. 353 IFGGAWER-AMRI--AMQIMGREVTE-----EYTRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDLGYT 422 (480) Q Consensus 353 ~~~~~l~~-~~~l--i~~~~~~~~~~-----~~~~~~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~ 422 (480) .|...+-+ +++. -.+++.+.++. ...-+.+.|..|-. .|....+.+....+.+| +.|.+..+...|.. T Consensus 374 ~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~G--l~t~~~~~a~~G~D 451 (502) T protein:vir:79 374 WFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGG--AATESDWVRAGGRN 451 (502) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHcCCC Confidence 66654433 3332 12333332221 11234677855543 46666677777777776 67888888888987 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC-CCCccCCCCCCCCCcccC Q lcl|NC_021324. 423 ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT-ETKTETQTSPSGFNRTKT 479 (480) Q Consensus 423 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~~~~~~~~~~~ 479 (480) .+++-+ ++.++.+. .+.+. ..-..+++..+... .+.++.++++++++. +. T Consensus 452 ~~~v~~--q~a~e~~~--~~~~G--l~~~~~~~~~~~~~~~~~~~~e~~~~~~~~-e~ 502 (502) T protein:vir:79 452 PDDVKR--RRKAEIDE--NRKLD--LVFDTDPASDKGGSSAATKRQEPQHTDDQS-EE 502 (502) T ss_pred HHHHHH--HHHHHHHH--HHHcC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CC Confidence 765432 22212111 11111 11112222222111 122222222222222 22 No 85 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.68 E-value=3.1e-16 Score=105.57 Aligned_cols=464 Identities=9% Similarity=0.019 Sum_probs=205.3 Q ss_pred CCCH------HHHHHHHHHHHHH---HHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCc Q lcl|NC_021324. 1 MTTY------HEHVERLQGLLAR---DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEG 71 (480) Q Consensus 1 ~~~~------~~~i~~l~~~~~~---~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~ 71 (480) .++. .+.+..+...+.. .+..-.+-.+||+|+|--... ...-...+.-.+++|.++.+|+..+++..-+. T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~QW~~~~-~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~nr 92 (772) T protein:vir:10 14 LPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQLDTEL-LRRQQALGIPPAVEDLIGPALLSLQGYEAVTR 92 (772) T ss_pred CCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEEcchHHHHHHHHHHHHhcC Confidence 1111 1112222222221 112223456899998753221 01111223446789999999999998876543 Q ss_pred cc---cC----CCchhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEE Q lcl|NC_021324. 72 FR---IS----EDSEGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA 140 (480) Q Consensus 72 ~~---~~----~d~~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~ 140 (480) .. .+ +|.+..+. +..+++.|+.+.....++.+++++|.||+-++..+ ...++.++++.++|.+++ T Consensus 93 ~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~---d~~~~~i~i~~v~p~~v~- 168 (772) T protein:vir:10 93 TDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRES---DPFKFPYRCRPIRRDEIH- 168 (772) T ss_pred cceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEecccc---CCCCCCeEEEeeCcccce- Confidence 22 12 22333333 44456678999999999999999999999887643 223346789999998854 Q ss_pred EEeCCCCCc---eEEEE-EEEEE------------------------------------------------------e-- Q lcl|NC_021324. 141 ELDPRNTRR---VTRAV-RLYTT------------------------------------------------------R-- 160 (480) Q Consensus 141 v~d~~~~~~---~~~~~-~~~~~------------------------------------------------------~-- 160 (480) ||+..... ..+.+ ..|.+ . T Consensus 169 -~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (772) T protein:vir:10 169 -WDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQED 247 (772) T ss_pred -ecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhhccccccc Confidence 66643211 01111 00000 0 Q ss_pred ----cCCceEEEEEEEeCCcEEEEEecCC-Ccc-----------------------------------c-cccccccccc Q lcl|NC_021324. 161 ----DDVAVPDRATLYLPDETVPLRRNGG-LND-----------------------------------Q-WVVDGDVIKH 199 (480) Q Consensus 161 ----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-----------------------------------~-~~~~~~~~~~ 199 (480) .+.+.++.+++|.......+..... +.. + .+......|- T Consensus 248 ~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 327 (772) T protein:vir:10 248 HWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPY 327 (772) T ss_pred cccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecceeeccCCCCC Confidence 0012344555564432221111100 000 0 0011122333 Q ss_pred cccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhc Q lcl|NC_021324. 200 GLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYY 277 (480) Q Consensus 200 ~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~ 277 (480) +.+++|+|+|.-. ...+.+|| +.+.+++.++.+|...|.....+... +...=.|+-.+. +...... -..+ T Consensus 328 ~~~~fP~vP~~g~r~~~~g~~~G---~vr~~kd~Qr~~N~~~S~~~~~l~~~--~~~~~~gav~~~--d~~~~e~-~arp 399 (772) T protein:vir:10 328 THRHFPYVPFFGFREDATGIPYG---YVRGMKYAQDSLNSGVSKLRWGMSVA--RVERTKGAVAMT--DAQFRRQ-IARP 399 (772) T ss_pred CCCccceEEEeeeEeccCCcccc---hhhhhhhHHHHHHHHHHHHHHHHhcc--cccccCCCccch--hHHHHHh-ccCC Confidence 4456777776532 22344666 44679999999999999988876433 222222322111 0000000 1123 Q ss_pred chheecCCC-----CceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 278 GRILTLASE-----AAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKG 351 (480) Q Consensus 278 ~~~~~~~g~-----~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~ 351 (480) +.++.+..+ ...|...+.. -...+...++.....|-.++|+.+..+|.. .|..||+|+..+-..-.......- T Consensus 400 ~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~-~na~SGvAi~~rq~qg~~~l~~~~ 478 (772) T protein:vir:10 400 DADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRK-GTATSGIQEQQQIEQSNQSIGRIM 478 (772) T ss_pred CCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCC-cchhhHHHHHHHHHHHHHHHHHHH Confidence 333333211 2233222222 346788888888888889999999999864 456899999887666555555666 Q ss_pred HHHHHHHHHHHHHHH----HHhCCC----Cc-------ccce------------------ee-----eEEecCCCC--CC Q lcl|NC_021324. 352 RIFGGAWERAMRIAM----QIMGRE----VT-------EEYT------------------RL-----ETVWRDPST--PT 391 (480) Q Consensus 352 ~~~~~~l~~~~~li~----~~~~~~----~~-------~~~~------------------~~-----~v~f~~~~~--~~ 391 (480) ..+..+.+++.++++ .+.+.. +. ..+. ++ +|.- +..| .+ T Consensus 479 Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i-~~~p~~~t 557 (772) T protein:vir:10 479 DNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVAL-EDVPSTNS 557 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEe-eccccchH Confidence 666666666655444 333211 00 0000 00 1110 1111 12 Q ss_pred H-HHHHHHHHHHHhccccCCCHHH---HHHhCCCC--hhHHHHHHHHHHHH---------HHHHHHHHhhhc-------- Q lcl|NC_021324. 392 V-AAKADAVSKLYANGQGPIPKEQ---ARIDLGYT--ATQREQMRDWDKQE---------TEDMIDTLYSTT-------- 448 (480) Q Consensus 392 ~-~~~ad~~~kl~~~~~~~~s~e~---~~~~~~~~--~d~~~~~~~~~~~~---------~~~~~~~~~~~~-------- 448 (480) . .+.++.+.++...+...+.... +++.+.+. ++-.+++.+...++ .+.....+.... T Consensus 558 ~r~~~~~~m~ql~~~~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~qq~~~~~~~el~~~q~ 637 (772) T protein:vir:10 558 YRGQQLNAMSEAVKSMPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQDALAKAGNDIKLREL 637 (772) T ss_pred HHHHHHHHHHHHHhccChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 2345555555543222111111 12222221 11112222211000 000000000000 Q ss_pred c-----cCCCcCCC-CCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 449 K-----AQADATPK-PTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ~-----~~~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) . .++..... ..........+.+........+. T Consensus 638 ~a~~~~~~A~a~~~~aqa~~~~~~a~~~a~~aa~~~~q 675 (772) T protein:vir:10 638 EIKERKADSEISGLNAKAVQIGVQAAFSAMQAGAQIAQ 675 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHh Confidence 0 00000000 00000000000000000000111 No 86 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.66 E-value=2.3e-15 Score=100.80 Aligned_cols=452 Identities=12% Similarity=0.073 Sum_probs=210.4 Q ss_pred CCCH------HHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY------HEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~------~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.+. .++-.++...+..... ...+-.+||+|+|--.... ..-...+.-.+++|.++.+|+..+++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~v~~v~g~~ 86 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVL-QVLKDRGQPMTIHNLIAPTVDGVLGME 86 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHH-HHHHhcCCCcEEeccHHHHHHHHHhHH Confidence 3322 1233334443333221 2235568999987532211 111122244678999999999999887 Q ss_pred ccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcc Q lcl|NC_021324. 68 DIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESP 135 (480) Q Consensus 68 ~~~~~~---~~---~d~--~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p 135 (480) .-+... .| +++ +..+. +..++..|+.+.....++.+++++|.||+-++.+. ...++.+++..++| T Consensus 87 ~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~~v~p 163 (714) T protein:vir:99 87 AKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVSTVSR 163 (714) T ss_pred HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEEecch Confidence 544322 12 222 23333 45566778999999999999999999999887653 23456789999999 Q ss_pred cceEEEEeCCCCC----ceEEEE-EEEEEec------------------------------------------------- Q lcl|NC_021324. 136 LYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------------- 161 (480) Q Consensus 136 ~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------------- 161 (480) .+++ ||+.... ...+.+ +.|.+.+ T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (714) T protein:vir:99 164 NEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQ 241 (714) T ss_pred hhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccc Confidence 9965 5553211 111111 1111100 Q ss_pred -------CCceEEEEEEEeCCcEE------------EEEecCCCc--------------------ccc-----ccccccc Q lcl|NC_021324. 162 -------DVAVPDRATLYLPDETV------------PLRRNGGLN--------------------DQW-----VVDGDVI 197 (480) Q Consensus 162 -------~~~~~~~~~~~~~~~~~------------~~~~~~~~~--------------------~~~-----~~~~~~~ 197 (480) +......+++|...... .|....... ..+ +....+. T Consensus 242 ~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~ 321 (714) T protein:vir:99 242 QNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPC 321 (714) T ss_pred ccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCC Confidence 01123344555432221 111110000 000 0011122 Q ss_pred cccccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh Q lcl|NC_021324. 198 KHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI 275 (480) Q Consensus 198 ~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~ 275 (480) |-+.+++|+|+|.-. ...+.++| +.+.+++.++.+|...|.....+ .++..++..|...+. +..-...+ . T Consensus 322 p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~~~e~~-a 393 (714) T protein:vir:99 322 SAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DNDLMEQI-E 393 (714) T ss_pred CCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHHHHHhc-c Confidence 323345666665432 12244666 44678999999999999987765 344433333432211 10000001 2 Q ss_pred hcchheecCCC-------CceEEEec-cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTLASE-------AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA 347 (480) Q Consensus 276 ~~~~~~~~~g~-------~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~ 347 (480) .++.++....+ ...|.-.+ ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-.... T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:99 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVAISNLVEQGATTL 472 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHHHHHHHHHHHHHH Confidence 23333333211 12232222 223566778888888888888999999998754 5579999998877655555 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHhCCC----C-----ccc---c--------------------eeeeEEecCCCCCC Q lcl|NC_021324. 348 ERKGRIFGGAWERAMRIAM----QIMGRE----V-----TEE---Y--------------------TRLETVWRDPSTPT 391 (480) Q Consensus 348 ~~~~~~~~~~l~~~~~li~----~~~~~~----~-----~~~---~--------------------~~~~v~f~~~~~~~ 391 (480) ......+..+.+++.++++ .+.+.. + ..+ + .++.|.=.+..+.. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:99 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 5555666666666655443 333211 0 000 0 01111111111222 Q ss_pred HHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHH----------------HHHHHHHhh----hc Q lcl|NC_021324. 392 VAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQET----------------EDMIDTLYS----TT 448 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~---~s~e~~~~~~~~~~d~~~~~~~~~~~~~----------------~~~~~~~~~----~~ 448 (480) ..+.+..++++.+..... +.-..+++.+.+. . .+++.+..++.. +....++.. .. T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:99 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 345566677776543221 1223445555542 1 122222221100 000000000 00 Q ss_pred ccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 449 KAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+..+. -.+.+. ..-.++.+ T Consensus 631 ~~~~~a~------~~k~ea-----e~~~a~a~ 651 (714) T protein:vir:99 631 MREMAGR------VAKLEA-----DAARAHAA 651 (714) T ss_pred HHHHHHH------HHHHHH-----HHHHHHHH Confidence 0000000 000000 00000011 No 87 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.66 E-value=2.3e-15 Score=100.80 Aligned_cols=452 Identities=12% Similarity=0.073 Sum_probs=210.4 Q ss_pred CCCH------HHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY------HEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~------~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.+. .++-.++...+..... ...+-.+||+|+|--.... ..-...+.-.+++|.++.+|+..+++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~v~~v~g~~ 86 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVL-QVLKDRGQPMTIHNLIAPTVDGVLGME 86 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHH-HHHHhcCCCcEEeccHHHHHHHHHhHH Confidence 3322 1233334443333221 2235568999987532211 111122244678999999999999887 Q ss_pred ccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcc Q lcl|NC_021324. 68 DIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESP 135 (480) Q Consensus 68 ~~~~~~---~~---~d~--~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p 135 (480) .-+... .| +++ +..+. +..++..|+.+.....++.+++++|.||+-++.+. ...++.+++..++| T Consensus 87 ~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~~v~p 163 (714) T protein:vir:32 87 AKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVSTVSR 163 (714) T ss_pred HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEEecch Confidence 544322 12 222 23333 45566778999999999999999999999887653 23456789999999 Q ss_pred cceEEEEeCCCCC----ceEEEE-EEEEEec------------------------------------------------- Q lcl|NC_021324. 136 LYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------------- 161 (480) Q Consensus 136 ~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------------- 161 (480) .+++ ||+.... ...+.+ +.|.+.+ T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (714) T protein:vir:32 164 NEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQ 241 (714) T ss_pred hhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccc Confidence 9965 5553211 111111 1111100 Q ss_pred -------CCceEEEEEEEeCCcEE------------EEEecCCCc--------------------ccc-----ccccccc Q lcl|NC_021324. 162 -------DVAVPDRATLYLPDETV------------PLRRNGGLN--------------------DQW-----VVDGDVI 197 (480) Q Consensus 162 -------~~~~~~~~~~~~~~~~~------------~~~~~~~~~--------------------~~~-----~~~~~~~ 197 (480) +......+++|...... .|....... ..+ +....+. T Consensus 242 ~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~ 321 (714) T protein:vir:32 242 QNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPC 321 (714) T ss_pred ccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCC Confidence 01123344555432221 111110000 000 0011122 Q ss_pred cccccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh Q lcl|NC_021324. 198 KHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI 275 (480) Q Consensus 198 ~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~ 275 (480) |-+.+++|+|+|.-. ...+.++| +.+.+++.++.+|...|.....+ .++..++..|...+. +..-...+ . T Consensus 322 p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~~~e~~-a 393 (714) T protein:vir:32 322 SAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DNDLMEQI-E 393 (714) T ss_pred CCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHHHHHhc-c Confidence 323345666665432 12244666 44678999999999999987765 344433333432211 10000001 2 Q ss_pred hcchheecCCC-------CceEEEec-cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTLASE-------AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA 347 (480) Q Consensus 276 ~~~~~~~~~g~-------~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~ 347 (480) .++.++....+ ...|.-.+ ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-.... T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:32 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVAISNLVEQGATTL 472 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHHHHHHHHHHHHHH Confidence 23333333211 12232222 223566778888888888888999999998754 5579999998877655555 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHhCCC----C-----ccc---c--------------------eeeeEEecCCCCCC Q lcl|NC_021324. 348 ERKGRIFGGAWERAMRIAM----QIMGRE----V-----TEE---Y--------------------TRLETVWRDPSTPT 391 (480) Q Consensus 348 ~~~~~~~~~~l~~~~~li~----~~~~~~----~-----~~~---~--------------------~~~~v~f~~~~~~~ 391 (480) ......+..+.+++.++++ .+.+.. + ..+ + .++.|.=.+..+.. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:32 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 5555666666666655443 333211 0 000 0 01111111111222 Q ss_pred HHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHH----------------HHHHHHHhh----hc Q lcl|NC_021324. 392 VAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQET----------------EDMIDTLYS----TT 448 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~---~s~e~~~~~~~~~~d~~~~~~~~~~~~~----------------~~~~~~~~~----~~ 448 (480) ..+.+..++++.+..... +.-..+++.+.+. . .+++.+..++.. +....++.. .. T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:32 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 345566677776543221 1223445555542 1 122222221100 000000000 00 Q ss_pred ccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 449 KAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+..+. -.+.+. ..-.++.+ T Consensus 631 ~~~~~a~------~~k~ea-----e~~~a~a~ 651 (714) T protein:vir:32 631 MREMAGR------VAKLEA-----DAARAHAA 651 (714) T ss_pred HHHHHHH------HHHHHH-----HHHHHHHH Confidence 0000000 000000 00000011 No 88 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.66 E-value=2.3e-15 Score=100.80 Aligned_cols=452 Identities=12% Similarity=0.073 Sum_probs=210.4 Q ss_pred CCCH------HHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY------HEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~------~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.+. .++-.++...+..... ...+-.+||+|+|--.... ..-...+.-.+++|.++.+|+..+++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~v~~v~g~~ 86 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVL-QVLKDRGQPMTIHNLIAPTVDGVLGME 86 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHH-HHHHhcCCCcEEeccHHHHHHHHHhHH Confidence 3322 1233334443333221 2235568999987532211 111122244678999999999999887 Q ss_pred ccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcc Q lcl|NC_021324. 68 DIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESP 135 (480) Q Consensus 68 ~~~~~~---~~---~d~--~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p 135 (480) .-+... .| +++ +..+. +..++..|+.+.....++.+++++|.||+-++.+. ...++.+++..++| T Consensus 87 ~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~~v~p 163 (714) T protein:vir:81 87 AKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVSTVSR 163 (714) T ss_pred HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEEecch Confidence 544322 12 222 23333 45566778999999999999999999999887653 23456789999999 Q ss_pred cceEEEEeCCCCC----ceEEEE-EEEEEec------------------------------------------------- Q lcl|NC_021324. 136 LYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------------- 161 (480) Q Consensus 136 ~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------------- 161 (480) .+++ ||+.... ...+.+ +.|.+.+ T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (714) T protein:vir:81 164 NEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQ 241 (714) T ss_pred hhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccc Confidence 9965 5553211 111111 1111100 Q ss_pred -------CCceEEEEEEEeCCcEE------------EEEecCCCc--------------------ccc-----ccccccc Q lcl|NC_021324. 162 -------DVAVPDRATLYLPDETV------------PLRRNGGLN--------------------DQW-----VVDGDVI 197 (480) Q Consensus 162 -------~~~~~~~~~~~~~~~~~------------~~~~~~~~~--------------------~~~-----~~~~~~~ 197 (480) +......+++|...... .|....... ..+ +....+. T Consensus 242 ~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~ 321 (714) T protein:vir:81 242 QNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPC 321 (714) T ss_pred ccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCC Confidence 01123344555432221 111110000 000 0011122 Q ss_pred cccccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh Q lcl|NC_021324. 198 KHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI 275 (480) Q Consensus 198 ~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~ 275 (480) |-+.+++|+|+|.-. ...+.++| +.+.+++.++.+|...|.....+ .++..++..|...+. +..-...+ . T Consensus 322 p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~~~e~~-a 393 (714) T protein:vir:81 322 SAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DNDLMEQI-E 393 (714) T ss_pred CCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHHHHHhc-c Confidence 323345666665432 12244666 44678999999999999987765 344433333432211 10000001 2 Q ss_pred hcchheecCCC-------CceEEEec-cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTLASE-------AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA 347 (480) Q Consensus 276 ~~~~~~~~~g~-------~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~ 347 (480) .++.++....+ ...|.-.+ ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-.... T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:81 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVAISNLVEQGATTL 472 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHHHHHHHHHHHHHH Confidence 23333333211 12232222 223566778888888888888999999998754 5579999998877655555 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHhCCC----C-----ccc---c--------------------eeeeEEecCCCCCC Q lcl|NC_021324. 348 ERKGRIFGGAWERAMRIAM----QIMGRE----V-----TEE---Y--------------------TRLETVWRDPSTPT 391 (480) Q Consensus 348 ~~~~~~~~~~l~~~~~li~----~~~~~~----~-----~~~---~--------------------~~~~v~f~~~~~~~ 391 (480) ......+..+.+++.++++ .+.+.. + ..+ + .++.|.=.+..+.. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:81 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 5555666666666655443 333211 0 000 0 01111111111222 Q ss_pred HHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHH----------------HHHHHHHhh----hc Q lcl|NC_021324. 392 VAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQET----------------EDMIDTLYS----TT 448 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~---~s~e~~~~~~~~~~d~~~~~~~~~~~~~----------------~~~~~~~~~----~~ 448 (480) ..+.+..++++.+..... +.-..+++.+.+. . .+++.+..++.. +....++.. .. T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:81 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 345566677776543221 1223445555542 1 122222221100 000000000 00 Q ss_pred ccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 449 KAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+..+. -.+.+. ..-.++.+ T Consensus 631 ~~~~~a~------~~k~ea-----e~~~a~a~ 651 (714) T protein:vir:81 631 MREMAGR------VAKLEA-----DAARAHAA 651 (714) T ss_pred HHHHHHH------HHHHHH-----HHHHHHHH Confidence 0000000 000000 00000011 No 89 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.66 E-value=2.3e-15 Score=100.80 Aligned_cols=452 Identities=12% Similarity=0.073 Sum_probs=210.4 Q ss_pred CCCH------HHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY------HEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~------~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.+. .++-.++...+..... ...+-.+||+|+|--.... ..-...+.-.+++|.++.+|+..+++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~v~~v~g~~ 86 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVL-QVLKDRGQPMTIHNLIAPTVDGVLGME 86 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHH-HHHHhcCCCcEEeccHHHHHHHHHhHH Confidence 3322 1233334443333221 2235568999987532211 111122244678999999999999887 Q ss_pred ccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcc Q lcl|NC_021324. 68 DIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESP 135 (480) Q Consensus 68 ~~~~~~---~~---~d~--~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p 135 (480) .-+... .| +++ +..+. +..++..|+.+.....++.+++++|.||+-++.+. ...++.+++..++| T Consensus 87 ~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~~v~p 163 (714) T protein:vir:10 87 AKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVSTVSR 163 (714) T ss_pred HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEEecch Confidence 544322 12 222 23333 45566778999999999999999999999887653 23456789999999 Q ss_pred cceEEEEeCCCCC----ceEEEE-EEEEEec------------------------------------------------- Q lcl|NC_021324. 136 LYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------------- 161 (480) Q Consensus 136 ~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------------- 161 (480) .+++ ||+.... ...+.+ +.|.+.+ T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (714) T protein:vir:10 164 NEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQ 241 (714) T ss_pred hhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccc Confidence 9965 5553211 111111 1111100 Q ss_pred -------CCceEEEEEEEeCCcEE------------EEEecCCCc--------------------ccc-----ccccccc Q lcl|NC_021324. 162 -------DVAVPDRATLYLPDETV------------PLRRNGGLN--------------------DQW-----VVDGDVI 197 (480) Q Consensus 162 -------~~~~~~~~~~~~~~~~~------------~~~~~~~~~--------------------~~~-----~~~~~~~ 197 (480) +......+++|...... .|....... ..+ +....+. T Consensus 242 ~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~ 321 (714) T protein:vir:10 242 QNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPC 321 (714) T ss_pred ccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCC Confidence 01123344555432221 111110000 000 0011122 Q ss_pred cccccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh Q lcl|NC_021324. 198 KHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI 275 (480) Q Consensus 198 ~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~ 275 (480) |-+.+++|+|+|.-. ...+.++| +.+.+++.++.+|...|.....+ .++..++..|...+. +..-...+ . T Consensus 322 p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~~~e~~-a 393 (714) T protein:vir:10 322 SAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DNDLMEQI-E 393 (714) T ss_pred CCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHHHHHhc-c Confidence 323345666665432 12244666 44678999999999999987765 344433333432211 10000001 2 Q ss_pred hcchheecCCC-------CceEEEec-cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTLASE-------AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA 347 (480) Q Consensus 276 ~~~~~~~~~g~-------~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~ 347 (480) .++.++....+ ...|.-.+ ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-.... T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:10 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVAISNLVEQGATTL 472 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHHHHHHHHHHHHHH Confidence 23333333211 12232222 223566778888888888888999999998754 5579999998877655555 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHhCCC----C-----ccc---c--------------------eeeeEEecCCCCCC Q lcl|NC_021324. 348 ERKGRIFGGAWERAMRIAM----QIMGRE----V-----TEE---Y--------------------TRLETVWRDPSTPT 391 (480) Q Consensus 348 ~~~~~~~~~~l~~~~~li~----~~~~~~----~-----~~~---~--------------------~~~~v~f~~~~~~~ 391 (480) ......+..+.+++.++++ .+.+.. + ..+ + .++.|.=.+..+.. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:10 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 5555666666666655443 333211 0 000 0 01111111111222 Q ss_pred HHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHH----------------HHHHHHHhh----hc Q lcl|NC_021324. 392 VAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQET----------------EDMIDTLYS----TT 448 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~---~s~e~~~~~~~~~~d~~~~~~~~~~~~~----------------~~~~~~~~~----~~ 448 (480) ..+.+..++++.+..... +.-..+++.+.+. . .+++.+..++.. +....++.. .. T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:10 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 345566677776543221 1223445555542 1 122222221100 000000000 00 Q ss_pred ccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 449 KAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+..+. -.+.+. ..-.++.+ T Consensus 631 ~~~~~a~------~~k~ea-----e~~~a~a~ 651 (714) T protein:vir:10 631 MREMAGR------VAKLEA-----DAARAHAA 651 (714) T ss_pred HHHHHHH------HHHHHH-----HHHHHHHH Confidence 0000000 000000 00000011 No 90 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.66 E-value=2.3e-15 Score=100.80 Aligned_cols=452 Identities=12% Similarity=0.073 Sum_probs=210.4 Q ss_pred CCCH------HHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY------HEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~------~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.+. .++-.++...+..... ...+-.+||+|+|--.... ..-...+.-.+++|.++.+|+..+++. T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~-~~l~~~g~p~~~~N~i~~~v~~v~g~~ 86 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLPPEVL-QVLKDRGQPMTIHNLIAPTVDGVLGME 86 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHH-HHHHhcCCCcEEeccHHHHHHHHHhHH Confidence 3322 1233334443333221 2235568999987532211 111122244678999999999999887 Q ss_pred ccCccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcc Q lcl|NC_021324. 68 DIEGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESP 135 (480) Q Consensus 68 ~~~~~~---~~---~d~--~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p 135 (480) .-+... .| +++ +..+. +..++..|+.+.....++.+++++|.||+-++.+. ...++.+++..++| T Consensus 87 ~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~---d~~~~~i~i~~v~p 163 (714) T protein:vir:27 87 AKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---DPFGPEFKVSTVSR 163 (714) T ss_pred HhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEecccc---CCCCCCeEEEecch Confidence 544322 12 222 23333 45566778999999999999999999999887653 23456789999999 Q ss_pred cceEEEEeCCCCC----ceEEEE-EEEEEec------------------------------------------------- Q lcl|NC_021324. 136 LYMYAELDPRNTR----RVTRAV-RLYTTRD------------------------------------------------- 161 (480) Q Consensus 136 ~~~~~v~d~~~~~----~~~~~~-~~~~~~~------------------------------------------------- 161 (480) .+++ ||+.... ...+.+ +.|.+.+ T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 241 (714) T protein:vir:27 164 NEVF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQ 241 (714) T ss_pred hhee--eccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccc Confidence 9965 5553211 111111 1111100 Q ss_pred -------CCceEEEEEEEeCCcEE------------EEEecCCCc--------------------ccc-----ccccccc Q lcl|NC_021324. 162 -------DVAVPDRATLYLPDETV------------PLRRNGGLN--------------------DQW-----VVDGDVI 197 (480) Q Consensus 162 -------~~~~~~~~~~~~~~~~~------------~~~~~~~~~--------------------~~~-----~~~~~~~ 197 (480) +......+++|...... .|....... ..+ +....+. T Consensus 242 ~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~ 321 (714) T protein:vir:27 242 QNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPC 321 (714) T ss_pred ccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCC Confidence 01123344555432221 111110000 000 0011122 Q ss_pred cccccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhh Q lcl|NC_021324. 198 KHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDI 275 (480) Q Consensus 198 ~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~ 275 (480) |-+.+++|+|+|.-. ...+.++| +.+.+++.++.+|...|.....+ .++..++..|...+. +..-...+ . T Consensus 322 p~p~~~fp~vp~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~--d~~~~e~~-a 393 (714) T protein:vir:27 322 SAPQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLS--DNDLMEQI-E 393 (714) T ss_pred CCCCCceeEEEEeeeeeeccCceee---hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCccccc--HHHHHHhc-c Confidence 323345666665432 12244666 44678999999999999987765 344433333432211 10000001 2 Q ss_pred hcchheecCCC-------CceEEEec-cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTLASE-------AAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA 347 (480) Q Consensus 276 ~~~~~~~~~g~-------~~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~ 347 (480) .++.++....+ ...|.-.+ ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-.... T Consensus 394 rp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvAi~~rq~qg~~~l 472 (714) T protein:vir:27 394 RPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVAISNLVEQGATTL 472 (714) T ss_pred CCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCc-cchhHHHHHHHHHHHHHHH Confidence 23333333211 12232222 223566778888888888888999999998754 5579999998877655555 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHhCCC----C-----ccc---c--------------------eeeeEEecCCCCCC Q lcl|NC_021324. 348 ERKGRIFGGAWERAMRIAM----QIMGRE----V-----TEE---Y--------------------TRLETVWRDPSTPT 391 (480) Q Consensus 348 ~~~~~~~~~~l~~~~~li~----~~~~~~----~-----~~~---~--------------------~~~~v~f~~~~~~~ 391 (480) ......+..+.+++.++++ .+.+.. + ..+ + .++.|.=.+..+.. T Consensus 473 ~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~ 552 (714) T protein:vir:27 473 AEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAF 552 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHH Confidence 5555666666666655443 333211 0 000 0 01111111111222 Q ss_pred HHHHHHHHHHHHhccccC---CCHHHHHHhCCCChhHHHHHHHHHHHHH----------------HHHHHHHhh----hc Q lcl|NC_021324. 392 VAAKADAVSKLYANGQGP---IPKEQARIDLGYTATQREQMRDWDKQET----------------EDMIDTLYS----TT 448 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~---~s~e~~~~~~~~~~d~~~~~~~~~~~~~----------------~~~~~~~~~----~~ 448 (480) ..+.+..++++.+..... +.-..+++.+.+. . .+++.+..++.. +....++.. .. T Consensus 553 r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p-~-~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~lq 630 (714) T protein:vir:27 553 KAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAELQ 630 (714) T ss_pred HHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCC-C-HHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHHH Confidence 345566677776543221 1223445555542 1 122222221100 000000000 00 Q ss_pred ccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 449 KAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+..+. -.+.+. ..-.++.+ T Consensus 631 ~~~~~a~------~~k~ea-----e~~~a~a~ 651 (714) T protein:vir:27 631 MREMAGR------VAKLEA-----DAARAHAA 651 (714) T ss_pred HHHHHHH------HHHHHH-----HHHHHHHH Confidence 0000000 000000 00000011 No 91 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.64 E-value=5.5e-16 Score=104.25 Aligned_cols=470 Identities=10% Similarity=0.008 Sum_probs=208.9 Q ss_pred CCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |+|.+..+.++...+..... ....-.+||+|.|-....... .....+.++|.++.+|+..+++-.-+-.. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~---l~~q~rp~~N~i~~~i~~v~g~~~~nr~d 77 (725) T protein:vir:77 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQY---TTLQYRGQFDVVRPVVRKLVSEMRQNPID 77 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCCHHHHHH---HHhcCCCccccHHHHHHHHHhhHHhCCcc Confidence 99998888887766544222 123336899998854321110 01133456899999999988866433211 Q ss_pred ---c---CCCchhHHHHHH----HHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEE----cccceE Q lcl|NC_021324. 74 ---I---SEDSEGLEELWN----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE----SPLYMY 139 (480) Q Consensus 74 ---~---~~d~~~~~~l~~----~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~----~p~~~~ 139 (480) . ++|.+..+.|.. +...++.+.....++.+++++|.||+-|..+-......++..+|+.+ +|.++ T Consensus 78 ~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~~~v- 156 (725) T protein:vir:77 78 VLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHV- 156 (725) T ss_pred eEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccChhhc- Confidence 1 234444444444 45568899999999999999999998875432111222344444433 34333 Q ss_pred EEEeCCCCCc----eE-EEEEEEEEec------------------------------CCceEEEEEEEeCCcEE--EEEe Q lcl|NC_021324. 140 AELDPRNTRR----VT-RAVRLYTTRD------------------------------DVAVPDRATLYLPDETV--PLRR 182 (480) Q Consensus 140 ~v~d~~~~~~----~~-~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~~~--~~~~ 182 (480) +||+..... -. ++++.|.+.+ .......+++|+...+. .+.. T Consensus 157 -~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~~~~~~~ 235 (725) T protein:vir:77 157 -IWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIY 235 (725) T ss_pred -eeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEeeEEEEe Confidence 355532210 00 0011111100 11223445555432221 1111 Q ss_pred cC---CCc--------------------------------------cccccccccccccccccceEEeecc--cccCccC Q lcl|NC_021324. 183 NG---GLN--------------------------------------DQWVVDGDVIKHGLGVVPVVPLTND--PRLGNRY 219 (480) Q Consensus 183 ~~---~~~--------------------------------------~~~~~~~~~~~~~~g~iPvv~~~n~--~~~~~~~ 219 (480) .. +.. .+.....++.+.+-+++|+|+|.-. ...+.++ T Consensus 236 ~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~ 315 (725) T protein:vir:77 236 QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEV 315 (725) T ss_pred cCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeeccCCccc Confidence 00 000 0001112222333355677765422 1223344 Q ss_pred CCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhh-cCCCccccccccccch--hhhhcchheecCCCC---ceEEEe Q lcl|NC_021324. 220 GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-SGVTTDELTNDGENTT--LDIYYGRILTLASEA---AKISEF 293 (480) Q Consensus 220 G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i-~G~~~~~~~~~~~~~~--~~~~~~~~~~~~g~~---~~~~~~ 293 (480) +.|-| +.+++.++.+|..+|.....+........++ .|.- +.. ....... ........+...++. ..+... T Consensus 316 ~~G~v-r~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i-~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~ 392 (725) T protein:vir:77 316 YEGVV-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGF-EHMYDGNDDYPYYLLNRTDENSGDLPTQPLAYY 392 (725) T ss_pred ccchh-hhhhhHHHHHHHHHHHHHHHHHhccccccccchhhh-hHH-HHHHHhccCCceecccccccCCCcccccCcccc Confidence 33333 6799999999999999887775443322111 1110 000 0000000 000001111111221 123333 Q ss_pred ccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H Q lcl|NC_021324. 294 KAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ----I 368 (480) Q Consensus 294 ~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~----~ 368 (480) +.+ -...+...++.....|-.++|+.+..+|..+. ..||+|+...-..........-..+..+.+++.++++. + T Consensus 393 ~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n-~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~ 471 (725) T protein:vir:77 393 ENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGG-QVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDI 471 (725) T ss_pred CCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 24567778888888888899999999997654 47999999988877777777777777787777665543 3 Q ss_pred hC---------CCCcccc------------------------eeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCH-- Q lcl|NC_021324. 369 MG---------REVTEEY------------------------TRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPK-- 412 (480) Q Consensus 369 ~~---------~~~~~~~------------------------~~~~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~-- 412 (480) .+ .....++ .++.|.=. |...+ ..+.+..++++.++.....+. T Consensus 472 ~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~-p~~~s~r~~~~~~l~qll~~~~~~~~~~~ 550 (725) T protein:vir:77 472 YDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVG-PSFQSMKQQNRAEILELLGKTPQGTPEYQ 550 (725) T ss_pred cCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeec-cchHHHHHHHHHHHHHHHHhccccchhHH Confidence 22 1111111 11111111 11222 234566677777665443332 Q ss_pred HHHHHhCCCChhH-HHHHHHHHHHHHH--------------HHHHHHhhhc-ccCCCcCCCC------CCCCCCccCCC- Q lcl|NC_021324. 413 EQARIDLGYTATQ-REQMRDWDKQETE--------------DMIDTLYSTT-KAQADATPKP------TVTETKTETQT- 469 (480) Q Consensus 413 e~~~~~~~~~~d~-~~~~~~~~~~~~~--------------~~~~~~~~~~-~~~~~~~~~~------~~~d~~~~~~~- 469 (480) .++...+...+-+ ++++.+..+.... ....+..... +..+...... ...-.+.+.+. T Consensus 551 ~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~kaq~e~~ 630 (725) T protein:vir:77 551 LLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTL 630 (725) T ss_pred HHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222222222111 1222211111000 0000000000 0000000000 00000000000 Q ss_pred -CCCCCCc-------ccCC Q lcl|NC_021324. 470 -SPSGFNR-------TKTR 480 (480) Q Consensus 470 -~~~~~~~-------~~~~ 480 (480) .-...-. .+.+ T Consensus 631 k~q~~a~~~~~~a~~~aa~ 649 (725) T protein:vir:77 631 SLQIDAAKVEAQNQLNAAR 649 (725) T ss_pred HHHHHHHHHHHHHHHHHHH Confidence 0000000 0000 No 92 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.62 E-value=2.8e-15 Score=100.38 Aligned_cols=452 Identities=13% Similarity=0.063 Sum_probs=207.1 Q ss_pred CCCHH----HHHHHHHHHHHH---HHH----HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhcc Q lcl|NC_021324. 1 MTTYH----EHVERLQGLLAR---DLP----NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~~----~~i~~l~~~~~~---~~~----~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~ 69 (480) +..++ ++..+++..+.. ..+ ...+-.+||+|.|--... ...-...+.-.+++|.++.+|+..+++..- T Consensus 10 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~-~~~l~~~g~p~~~~N~i~~~v~~v~g~~~~ 88 (714) T protein:vir:10 10 MKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQLAPEV-IQVLKDRGQPMTIHNLIAPTVDGVLGMEAK 88 (714) T ss_pred CCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCCCCHHH-HHHHHhcCCCcEEeccHHHHHHHHHHHHHh Confidence 11111 111122222221 111 223556899998843221 011111223467899999999999988765 Q ss_pred Cccc---cC---CCc--hhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccc Q lcl|NC_021324. 70 EGFR---IS---EDS--EGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLY 137 (480) Q Consensus 70 ~~~~---~~---~d~--~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~ 137 (480) +... .+ +++ +..+. +..++..|+.+.....++.++.++|.||+-++.+. ...++.+++..++|.+ T Consensus 89 nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~---d~~~~~i~i~~v~p~~ 165 (714) T protein:vir:10 89 TRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS---EPFGPEFKVSTVSRNE 165 (714) T ss_pred CCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeecc---CCCCCCeEEEecChhh Confidence 4332 12 222 22333 45566778999999999999999999999887653 2346789999999988 Q ss_pred eEEEEeCCCCC----ceEEEE-EE--------------------------------------------------EEEe-- Q lcl|NC_021324. 138 MYAELDPRNTR----RVTRAV-RL--------------------------------------------------YTTR-- 160 (480) Q Consensus 138 ~~~v~d~~~~~----~~~~~~-~~--------------------------------------------------~~~~-- 160 (480) ++ ||+.... ...+.+ +. |... T Consensus 166 v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (714) T protein:vir:10 166 VF--WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQN 243 (714) T ss_pred ee--eccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhcccccccc Confidence 65 5543210 000110 00 0000 Q ss_pred ----cCCceEEEEEEEeCCcEEEEEecC--CCcc------------------------------c-----cccccccccc Q lcl|NC_021324. 161 ----DDVAVPDRATLYLPDETVPLRRNG--GLND------------------------------Q-----WVVDGDVIKH 199 (480) Q Consensus 161 ----~~~~~~~~~~~~~~~~~~~~~~~~--~~~~------------------------------~-----~~~~~~~~~~ 199 (480) .+...+..+++|.......+.... +... . .+....+.|- T Consensus 244 ~~~~~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 323 (714) T protein:vir:10 244 EWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSA 323 (714) T ss_pred cccccCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCC Confidence 011234456666543332221110 0000 0 0001122233 Q ss_pred cccccceEEeeccc--ccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhc Q lcl|NC_021324. 200 GLGVVPVVPLTNDP--RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYY 277 (480) Q Consensus 200 ~~g~iPvv~~~n~~--~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~ 277 (480) +.+++|+|+|+-.. ..+.+|| +.+.+++.++.+|...|.....+. ++...+-.|..... +..-...+ ..+ T Consensus 324 p~~~fp~vP~~g~~~~~~g~~~G---~vr~~~d~Qr~~N~~~s~~~~~l~--~~~~~~~~gav~~~--d~~~~e~~-~rp 395 (714) T protein:vir:10 324 PQGMFPLVPFWGYRKDKTGEPYG---LISRAIPAQDEVNFRRIKLTWLLQ--AKRVIMDEDATQLS--DNDLMEQL-ERP 395 (714) T ss_pred CCCceeeEEecceeeeccCccce---ehhhhhhHHHHHHHHHHHHHHHHh--CCceeecccccccc--HHHHHHhc-cCC Confidence 44457777665322 2344665 346789999999999999877652 33333333332211 10000001 122 Q ss_pred chheecC-----CC--CceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 278 GRILTLA-----SE--AAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAER 349 (480) Q Consensus 278 ~~~~~~~-----g~--~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~ 349 (480) +.++... |. ...+...+.+ -...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-...... T Consensus 396 ~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~-na~SGvAI~~r~~qg~~~l~~ 474 (714) T protein:vir:10 396 DGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDS-GATSGVAISNLVEQGATTLAE 474 (714) T ss_pred CCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCc-chhHHHHHHHHHHHHHHHHHH Confidence 3333221 11 1223332322 3556778888888888889999999998754 557999999887766666566 Q ss_pred HHHHHHHHHHHHHHHHHH----HhCCC----Cc-----cc-ceeeeEEec---------------------CCCCCC-HH Q lcl|NC_021324. 350 KGRIFGGAWERAMRIAMQ----IMGRE----VT-----EE-YTRLETVWR---------------------DPSTPT-VA 393 (480) Q Consensus 350 ~~~~~~~~l~~~~~li~~----~~~~~----~~-----~~-~~~~~v~f~---------------------~~~~~~-~~ 393 (480) ....+..+.+++.++++. +.+.. +. .. ...+.+.+. -|...+ .. T Consensus 475 ~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~ 554 (714) T protein:vir:10 475 INDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKA 554 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHH Confidence 666666666666655443 32211 00 00 000111110 011223 23 Q ss_pred HHHHHHHHHHhccccCCC---HHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCC-CCCCCCCCccCCC Q lcl|NC_021324. 394 AKADAVSKLYANGQGPIP---KEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP-KPTVTETKTETQT 469 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s---~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~d~~~~~~~ 469 (480) +.+..+.++.+.....+. ...+++.+.+. . .+++.+..++. .+....+.+ .|+........+. T Consensus 555 ~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p-~-~~ei~~~ir~~-----------~~~~~~~~~~~~e~q~~q~~~~~ 621 (714) T protein:vir:10 555 QLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVP-Q-KQEFVERIRAA-----------LGTPKSPDEMTPEEQEVAAQQQA 621 (714) T ss_pred HHHHHHHHHHhhcCchhhhhHHHHHHHhcCCc-C-HHHHHHHHHHH-----------cCCCCCccccCcchhHHHHHHHH Confidence 445566666654322111 12334444432 1 11222211111 000000000 0000000000000 Q ss_pred CC-------------------CCCCcccCC Q lcl|NC_021324. 470 SP-------------------SGFNRTKTR 480 (480) Q Consensus 470 ~~-------------------~~~~~~~~~ 480 (480) .. ...-.++.+ T Consensus 622 ~~~~q~~l~~~e~~a~~~k~eaea~~~~aq 651 (714) T protein:vir:10 622 LQQQQAELQMREMAGRVAKLEADAARAHAA 651 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 000000000 No 93 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.61 E-value=3.6e-15 Score=99.80 Aligned_cols=449 Identities=15% Similarity=0.099 Sum_probs=195.0 Q ss_pred CCCHHHHHHHHHHHHHH-------HHH-HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc---c Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR-------DLP-NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD---I 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-------~~~-~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~---~ 69 (480) |++ ++++..|...+.. .+. ...+...||.|+..-. .. + ..-+++.+.+...|+.....|. + T Consensus 10 ~~~-~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~----~~-~--~~s~~~~~~v~~~v~~~~~~l~~~~~ 81 (705) T protein:vir:88 10 MDD-EQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN----ER-P--GKSGIVSRDVQETVDWIMPSLMKVFT 81 (705) T ss_pred CCH-HHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc----cc-C--CCCccccHHHHHHHHHHHHHHHHhhc Confidence 555 4455544444322 222 2345568999975321 11 1 2346677778888887777652 2 Q ss_pred Cc---cc----cCCCchhHHHHHHH-----HHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccC--------------- Q lcl|NC_021324. 70 EG---FR----ISEDSEGLEELWNW-----WQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG--------------- 122 (480) Q Consensus 70 ~~---~~----~~~d~~~~~~l~~~-----~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~--------------- 122 (480) .+ |. ..+|.+..+.++.+ .+.|+....+..++++++++|.|++.||....... T Consensus 82 ~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~~l~~ 161 (705) T protein:vir:88 82 SGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSEDMVAD 161 (705) T ss_pred CCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChhhhhh Confidence 21 11 13454444444443 44566667788899999999999999876331100 Q ss_pred ---C------------------------CCceeEEEEEcccceEEEEeCCCCCceEEE-EEEEEEecC------------ Q lcl|NC_021324. 123 ---D------------------------PAGIPLIRVESPLYMYAELDPRNTRRVTRA-VRLYTTRDD------------ 162 (480) Q Consensus 123 ---d------------------------~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~-~~~~~~~~~------------ 162 (480) | ..|.+++..|+|.++++-.+........+. .+++....+ T Consensus 162 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~~~~~ 241 (705) T protein:vir:88 162 ILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPEDVIE 241 (705) T ss_pred hhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCChhHhh Confidence 1 116788899999988743221111111122 111111000 Q ss_pred ------------------------Cc--------------eEEEEEEEeCCcEEEEEecCCCcccccc---cccc--ccc Q lcl|NC_021324. 163 ------------------------VA--------------VPDRATLYLPDETVPLRRNGGLNDQWVV---DGDV--IKH 199 (480) Q Consensus 163 ------------------------~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~--~~~ 199 (480) .+ .+++.++|. .+...+.+...+.. ..+. ..- T Consensus 242 ~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~-----~~d~~~d~~~~~~~~~~~g~~il~~~ 316 (705) T protein:vir:88 242 ELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYT-----LLDVDGDGISELRRILYVGDYIISNE 316 (705) T ss_pred hhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeee-----EecccCCcceeeEEEEEeCccccccc Confidence 00 001111111 11111111111100 0000 012 Q ss_pred cccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcch Q lcl|NC_021324. 200 GLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGR 279 (480) Q Consensus 200 ~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~ 279 (480) ++|++|++.++..+..+..||.|-+. .+.++++.+|..++.+.+++...++|...+...... ....+...+|. T Consensus 317 ~~~~~PF~~~~~~p~~~~~~G~g~~~-~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~------~~d~~~~~pg~ 389 (705) T protein:vir:88 317 PWDCRPFADLNAYRIAHKFHGMSVYD-KIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVN------LEDLLTNEAAG 389 (705) T ss_pred cCCCCCEEEecceeecCccccCChHH-HHhHHHHHHHHHHHHHHHHHHhccCCceeccccccC------cccccccCCCe Confidence 57889999998888889999999774 599999999999999999999888887665311111 11112334555 Q ss_pred heecCCC-CceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc---cchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 280 ILTLASE-AAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE---NPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 280 ~~~~~g~-~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~---~~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) ++...+. ...+...+.. ...+...+..+...+-..+|+++...|.+.. +.-++.++..+............+.|. T Consensus 390 vv~~~~~~~i~~~~~~~~-~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~~~~~r~~a 468 (705) T protein:vir:88 390 IVRVKSMNSITPLETPQL-SGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQIDLIARMFA 468 (705) T ss_pred eEEecCCCccccccCCcC-cHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 5544332 2233322221 2233344455555556678888888774422 223556666666666666666666665 Q ss_pred H-HHHHHHHH----HHHHhCCCC----cccc-----------eeeeEEecCCCCCCHHHHHHHH---HHHHhcccc---- Q lcl|NC_021324. 356 G-AWERAMRI----AMQIMGREV----TEEY-----------TRLETVWRDPSTPTVAAKADAV---SKLYANGQG---- 408 (480) Q Consensus 356 ~-~l~~~~~l----i~~~~~~~~----~~~~-----------~~~~v~f~~~~~~~~~~~ad~~---~kl~~~~~~---- 408 (480) . ++++++++ +..+.+... ...+ .++.+.-. +...+..+....+ .++.+.... T Consensus 469 ~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~-~~~~~~eq~~a~l~~ll~~~q~l~~~~~~ 547 (705) T protein:vir:88 469 ETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVG-IGNMNKDQQMLHLMRIWEMAQAVVGGGGL 547 (705) T ss_pred HHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeec-cccchHHHHHHHHHHHHHHHHHhhcccch Confidence 3 44555444 443333210 0101 11222111 1111222222222 222111100 Q ss_pred --CCCHHHH-------HHhCCCCh------h--HHHHHH-HHHHH--HHHH---HHHHHhhhcccCCCcCCCCCCCCCCc Q lcl|NC_021324. 409 --PIPKEQA-------RIDLGYTA------T--QREQMR-DWDKQ--ETED---MIDTLYSTTKAQADATPKPTVTETKT 465 (480) Q Consensus 409 --~~s~e~~-------~~~~~~~~------d--~~~~~~-~~~~~--~~~~---~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) .++.... .+.+++-. + ..+.+. +...+ +.+. ...+.......+.+..... . T Consensus 548 ~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q------~ 621 (705) T protein:vir:88 548 GVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQ------A 621 (705) T ss_pred hhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH------H Confidence 0111100 01111100 0 000000 00000 0000 0000000000000000000 0 Q ss_pred cCCCCCCCCCcccCC Q lcl|NC_021324. 466 ETQTSPSGFNRTKTR 480 (480) Q Consensus 466 ~~~~~~~~~~~~~~~ 480 (480) +.+..- -..+.| T Consensus 622 e~q~~q---~E~q~~ 633 (705) T protein:vir:88 622 EAQMKQ---VEAQIR 633 (705) T ss_pred HHHHHH---HHHHHH Confidence 000000 000000 No 94 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.61 E-value=5.5e-15 Score=98.77 Aligned_cols=472 Identities=10% Similarity=0.003 Sum_probs=205.6 Q ss_pred CCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |+|-+..+.++...+..... ...+-.+||+|.|-....... ....-+.++|.++.+|+..+++-.-+-.. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~~~~~~~---l~~q~rp~~N~i~~~v~~v~g~e~~nr~d 77 (725) T protein:vir:10 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRVSQWDDWLSQY---TTLQYRGQFDVVRPVVRKLVSEMRQNPID 77 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHH---HHhcCCCcccchHHHHHHHHhhHHhCCcc Confidence 99988888887766544222 223446899998854321110 01123456899999999998876433211 Q ss_pred ---c---CCCchhHHHHHH----HHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEE----cccceE Q lcl|NC_021324. 74 ---I---SEDSEGLEELWN----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE----SPLYMY 139 (480) Q Consensus 74 ---~---~~d~~~~~~l~~----~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~----~p~~~~ 139 (480) . ++|.+..+.|.. +...++.+.....++.+++++|.||+-|..+-......++..+|+.+ ++.+++ T Consensus 78 ~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~v~ 157 (725) T protein:vir:10 78 VLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSHVI 157 (725) T ss_pred eEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCHhHcc Confidence 1 234444444444 45568899999999999999999998875332111122334444433 344443 Q ss_pred EEEeCCCCCc----eEEE-EEEEEEe------------------------------cCCceEEEEEEEeCCcE--EEEEe Q lcl|NC_021324. 140 AELDPRNTRR----VTRA-VRLYTTR------------------------------DDVAVPDRATLYLPDET--VPLRR 182 (480) Q Consensus 140 ~v~d~~~~~~----~~~~-~~~~~~~------------------------------~~~~~~~~~~~~~~~~~--~~~~~ 182 (480) ||+..... ..++ ++.|.+. ........+++|+...+ ..|.. T Consensus 158 --~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~~~~~~ 235 (725) T protein:vir:10 158 --WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIY 235 (725) T ss_pred --cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEeeEEEEe Confidence 55532110 0111 1111110 00112233344432211 01100 Q ss_pred c---CCCc--------------------------------------cccccccccccccccccceEEeeccc--ccCccC Q lcl|NC_021324. 183 N---GGLN--------------------------------------DQWVVDGDVIKHGLGVVPVVPLTNDP--RLGNRY 219 (480) Q Consensus 183 ~---~~~~--------------------------------------~~~~~~~~~~~~~~g~iPvv~~~n~~--~~~~~~ 219 (480) . ++.. .+..+..++.+.+-++||+|+|.-.. ..+.++ T Consensus 236 ~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~~g~~~ 315 (725) T protein:vir:10 236 QDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEV 315 (725) T ss_pred ccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeeccCCcce Confidence 0 0000 00001112223333456666654221 123344 Q ss_pred CCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhhcchheecCCC--CceEEEec- Q lcl|NC_021324. 220 GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIYYGRILTLASE--AAKISEFK- 294 (480) Q Consensus 220 G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~--~~~~~~~~~~~~~~~~~g~--~~~~~~~~- 294 (480) +.+-| +.+++.+|.+|..+|.....+........+.-....+..... ......-+..+.+....|. ...+...+ T Consensus 316 ~~G~v-r~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~i~~~~~ 394 (725) T protein:vir:10 316 YEGVV-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQPLAYYEN 394 (725) T ss_pred eeeee-ccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCcccccccCcccCC Confidence 23333 679999999999999998877543332222111011100000 0000000000000001111 01122222 Q ss_pred cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhC Q lcl|NC_021324. 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ----IMG 370 (480) Q Consensus 295 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~----~~~ 370 (480) ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-.............+..+.+++.++++. +.+ T Consensus 395 ~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~-n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~ 473 (725) T protein:vir:10 395 PEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYD 473 (725) T ss_pred CCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC Confidence 223456777888888888889999999998754 558999999988777777777777777777776665443 322 Q ss_pred CC---------Ccccc------------------------eeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCH--HH Q lcl|NC_021324. 371 RE---------VTEEY------------------------TRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPK--EQ 414 (480) Q Consensus 371 ~~---------~~~~~------------------------~~~~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~--e~ 414 (480) .. ....+ .++.|.=.+ ...+ ..+.+..++++.++.....+. .+ T Consensus 474 ~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p-~~~s~r~~~~~~l~qll~~~~~~~~~~~~~ 552 (725) T protein:vir:10 474 VPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP-SFQSMKQQNRSEILELLGKTPQGTPEYQLL 552 (725) T ss_pred CCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeecc-CcHHHHHHHHHHHHHHHHhccccchhHHHH Confidence 11 00000 111111111 1222 235566677777665443332 22 Q ss_pred HHHhCCCChhH-HHHHHHHHHHHH--------------HHHHHHHhhhcccCCCcCCCCCC------CCC-C--ccCCCC Q lcl|NC_021324. 415 ARIDLGYTATQ-REQMRDWDKQET--------------EDMIDTLYSTTKAQADATPKPTV------TET-K--TETQTS 470 (480) Q Consensus 415 ~~~~~~~~~d~-~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~------~d~-~--~~~~~~ 470 (480) +...+...+-+ ++++.+..+.+. +....+..........+..+... .+. + .+.... T Consensus 553 l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~qae~~ka~aE~~k~ 632 (725) T protein:vir:10 553 LLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSL 632 (725) T ss_pred HHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222222111 112221111100 00000000000000000000000 000 0 000000 Q ss_pred CCCCCcccC--C Q lcl|NC_021324. 471 PSGFNRTKT--R 480 (480) Q Consensus 471 ~~~~~~~~~--~ 480 (480) -...-..++ + T Consensus 633 ~~~a~~~~~~a~ 644 (725) T protein:vir:10 633 QIDAAKVEAQNQ 644 (725) T ss_pred HHHHHHHHHHHH Confidence 000000000 0 No 95 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.60 E-value=4.3e-13 Score=88.39 Aligned_cols=431 Identities=10% Similarity=0.007 Sum_probs=213.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchh------cCccc--chh-----------hhcceeccchHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT------IGIGA--PPE-----------LAYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~------~~~~~--~~~-----------~~~~~~~~n~~~~iVd 61 (480) +-.+. ............ +.|+|-..-.. .+... ... -+++...+.|++-+|+ T Consensus 14 ~i~~~--~~~~~~~~~~~~-------~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~ 84 (505) T protein:vir:96 14 MVNWA--WYRYVEPQKNAA-------RAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQ 84 (505) T ss_pred ccchh--hhhhHHHHHHhh-------hhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 21111 111111111111 23443221100 00000 000 0122334568999999 Q ss_pred HHHhhhcc-CccccC---------CCchhHHHHHHHHHh------------cCHHHHHHHHHHHHhhcCeeEEEEecCcc Q lcl|NC_021324. 62 TLSDRLDI-EGFRIS---------EDSEGLEELWNWWQA------------NDLDEESVLGHDDSLTFGRAYITVSHPDV 119 (480) Q Consensus 62 ~~~~~l~~-~~~~~~---------~d~~~~~~l~~~~~~------------n~~~~~~~~~~~~~~~~G~a~~~v~~~~~ 119 (480) ..+...+| .|+... -+++..+.+...|+. .+|...+..+++.....|.+|+....... T Consensus 85 ~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~ 164 (505) T protein:vir:96 85 LLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYP 164 (505) T ss_pred HHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCC Confidence 99999998 587542 144555555554432 13667788899999999999987754321 Q ss_pred ccCCCCceeEEEEEcccceEEEEeC--CCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccc Q lcl|NC_021324. 120 ESGDPAGIPLIRVESPLYMYAELDP--RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVI 197 (480) Q Consensus 120 ~~~d~~g~~~~~~~~p~~~~~v~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (480) ..--+++..++|+.+-.-++. ..+..+...| ..+..|.+..+-++..+- +.. ....... T Consensus 165 ----~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GI----e~d~~Gr~~aY~i~~~hP--------gd~---~~~~~~~ 225 (505) T protein:vir:96 165 ----NKWGYALQILECDRLDLNYNADLQNGNRIRMSI----ELDAWERPVAYHLLVNHP--------GDN---SYCYHYA 225 (505) T ss_pred ----CCcceEEEEechhhcCCCCCcccCCcCeEEece----EECCCCceEEEEEeecCC--------Ccc---ccccccc Confidence 111258899999876432211 1222333334 234455544443333211 000 0000111 Q ss_pred cccccccc---eEEeecccccCccCCCccchHHHHHHH--HHHHHHHHHHHHHHHHhcchhhhhcCCCc---cccccccc Q lcl|NC_021324. 198 KHGLGVVP---VVPLTNDPRLGNRYGRSEISPELRKVT--DAASRTLMNLQSASQILGTPLRVISGVTT---DELTNDGE 269 (480) Q Consensus 198 ~~~~g~iP---vv~~~n~~~~~~~~G~s~i~~~v~~l~--D~~~~~~s~~~~~~~~~~~p~l~i~G~~~---~~~~~~~~ 269 (480) ...+.+|| |+|+......++..|.|.|++-+..|- +.|.......+.....++ .+|+.... +...+..+ T Consensus 226 ~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a---~fi~~~~~~~~~~~~~~~~ 302 (505) T protein:vir:96 226 GQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKV---GFYEQDPEAYDQPPEDDQG 302 (505) T ss_pred cccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhhe---eeeecCCccCCCccccccC Confidence 12345666 578877777888999999988444442 333333333333332222 23442111 11112222 Q ss_pred cchhhhhcchheecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 270 NTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAE 348 (480) Q Consensus 270 ~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~ 348 (480) .....+.+|.+..+. |.+.++.+.+. ...+|...++.+++.|+...++|-+.|.++..+ +|-.|.|+.+........ T Consensus 303 ~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~-~nYSS~R~~~~e~~r~~~ 380 (505) T protein:vir:96 303 EIVEEVEAGTYQLLPYGIRFKEHKIDH-PHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEG-VNFSSLRSGELDERDLYK 380 (505) T ss_pred ccccccCCceeeecCCCCeeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc-ccHHHHHHHHHHHHHHHH Confidence 334456677777664 44455544332 234677778888999999999999988776422 234456777777777777 Q ss_pred HHHHHHHHHHHH-HHHHH--HHHhCCCCcccc----eeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhC Q lcl|NC_021324. 349 RKGRIFGGAWER-AMRIA--MQIMGREVTEEY----TRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDL 419 (480) Q Consensus 349 ~~~~~~~~~l~~-~~~li--~~~~~~~~~~~~----~~~~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~ 419 (480) ..+..|...+-+ +++.. ..++.+.++... .-+.+.|..|-. .|....+.+....+.+| +.|.+..+... T Consensus 381 ~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G--~~t~~~~~a~~ 458 (505) T protein:vir:96 381 LLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNR--TRSRSSIIRAA 458 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHc Confidence 777666654433 33322 223333332111 124577865543 46667777777777776 66777888888 Q ss_pred CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 420 GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 420 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) |...+++-+ ++.++.+. .+.+. .. ......+ ...+....+.++.++| T Consensus 459 G~D~~~v~~--q~a~e~~~--~~~~G--l~--~~~~~~~-~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 459 GDDPEDVFD--EIAWEEQL--MRDKG--VN--PTPPEQE-SKDATTDEEDDSASDD 505 (505) T ss_pred CCCHHHHHH--HHHHHHHH--HHHcC--CC--CCCCCCC-CCCCCCCCCCCCCCCC Confidence 987665432 22222111 11111 11 1111111 1111222222222333 No 96 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.58 E-value=2.5e-13 Score=89.69 Aligned_cols=458 Identities=10% Similarity=0.009 Sum_probs=194.0 Q ss_pred CCCHHHHHHHHHHHHHHHHH-------HH----------HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLP-------NL----------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~-------~~----------~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) |.|.+.+...+...|..... +. .++.+||.|...-...+ +......+++.+.+...|+.. T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~~~---~~~~~rs~~~~~~v~~~ve~~ 91 (651) T protein:vir:80 15 YDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSVGD---VNADWRHKITTGKAFEAIETI 91 (651) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccccccCC---CCCCCCccccChhHHHHHHHH Confidence 77777754444444432221 11 13456776653211111 111135578999999999987 Q ss_pred HhhhccC---c---cc--cCCCchhH----HHHHHHHH----hcCHHHHHHHHHHHHhhcCeeEEEEecCcccc------ Q lcl|NC_021324. 64 SDRLDIE---G---FR--ISEDSEGL----EELWNWWQ----ANDLDEESVLGHDDSLTFGRAYITVSHPDVES------ 121 (480) Q Consensus 64 ~~~l~~~---~---~~--~~~d~~~~----~~l~~~~~----~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~------ 121 (480) ...|... + |. ...+++.. +.+..++. .++|......+..+++++|.|++.|+...... T Consensus 92 ~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~~~~~~~~~~ 171 (651) T protein:vir:80 92 HAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRVETAEVKKKV 171 (651) T ss_pred HHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecceeeeeehhe Confidence 7766321 1 11 11233322 23555543 56788888899999999999999887532100 Q ss_pred ----------C---------CCCceeEEEEEcccceEEEEeCCCCC--ceEEEEEEEEE-ec------------------ Q lcl|NC_021324. 122 ----------G---------DPAGIPLIRVESPLYMYAELDPRNTR--RVTRAVRLYTT-RD------------------ 161 (480) Q Consensus 122 ----------~---------d~~g~~~~~~~~p~~~~~v~d~~~~~--~~~~~~~~~~~-~~------------------ 161 (480) . ...|.|++..++|.++++ |+.... ...+.++.+.. .+ T Consensus 172 ~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~--dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~~~~~~~~ 249 (651) T protein:vir:80 172 QVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY--DPNVTDPNRGAFIRKLTKTKADILNLLSEGYYYGVDPLDV 249 (651) T ss_pred eccccccccccceeeeccceeeeceeEEEEecHHHeee--cCCCcCccccceeeeeeeeHHHHHHHHhcccccchhhHHH Confidence 0 013678999999999874 554322 12222222211 00 Q ss_pred -CC---------------------------ceEEEEEEEeCCcEEEEEecCCCcccccc------cccccccc-ccccce Q lcl|NC_021324. 162 -DV---------------------------AVPDRATLYLPDETVPLRRNGGLNDQWVV------DGDVIKHG-LGVVPV 206 (480) Q Consensus 162 -~~---------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~-~g~iPv 206 (480) +. .....+++|.+ +...+......++ ......++ +..+|+ T Consensus 250 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~-----~d~e~~~~~~~~v~~~g~~il~~~~~~~~~~~Pf 324 (651) T protein:vir:80 250 VEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGD-----IHLENKTYHDVVVTIMGNEVLRFEQNPYWCGRPF 324 (651) T ss_pred HhhhccccccCCccccccccCCCccccccccceEEEEEEEE-----eeccCCceEEEEEEEcCcEEecccccCCCCCCCe Confidence 00 00112223221 1111111100000 01111233 345799 Q ss_pred EEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCC- Q lcl|NC_021324. 207 VPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLAS- 285 (480) Q Consensus 207 v~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g- 285 (480) ++++..+..++.||+|.+. .+.+.+..+|.+...+...+...++|.+.+..-.... ...+...++.++.... T Consensus 325 ~~~~~~~~~~~~yG~g~~~-~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~------~~~l~~~pg~vi~~~~~ 397 (651) T protein:vir:80 325 VIGTYIPTARQPYAMGALQ-PNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQ------PEDVYTEPGKVFLVSDH 397 (651) T ss_pred eeecceecCccccCCChHH-HHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCcccc------HHHhhcCCCceEEecCC Confidence 9998888999999999875 5899999999999999999999999986664211111 0112334555544322 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHH-HH---- Q lcl|NC_021324. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS--ENPASAEAIIATDSRIVKMAERKGRIFGG-AW---- 358 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~--~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l---- 358 (480) ++....+....++......+..+...+...+++++...|.+. ....++.+++.....+.......-+.|.. .+ T Consensus 398 ~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~ 477 (651) T protein:vir:80 398 GDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLL 477 (651) T ss_pred CCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222223332334444445555555566666777766555422 11123334444444444444444444443 33 Q ss_pred HHHHHHHHHHhCCCCc----cc------c-----eeeeEEec--CCCCCCHHHHHHHHHHH---Hhcccc--CCCH---- Q lcl|NC_021324. 359 ERAMRIAMQIMGREVT----EE------Y-----TRLETVWR--DPSTPTVAAKADAVSKL---YANGQG--PIPK---- 412 (480) Q Consensus 359 ~~~~~li~~~~~~~~~----~~------~-----~~~~v~f~--~~~~~~~~~~ad~~~kl---~~~~~~--~~s~---- 412 (480) +++++++.+....... .. + .++.+.+. ..-+....+....+.++ .+.+.. .+.. T Consensus 478 ~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~ 557 (651) T protein:vir:80 478 EKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDY 557 (651) T ss_pred HHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhH Confidence 4445554433221100 00 0 11222221 11111112222222222 221100 0000 Q ss_pred -H---HHHHhCCCChhHHHHHHH------HHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 413 -E---QARIDLGYTATQREQMRD------WDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 413 -e---~~~~~~~~~~d~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) . -+++.+|+-+.. .-+.. ...+++.....+.... +.... .....-......+.-...-+++++ T Consensus 558 ~~~~~~l~~~~g~~~~~-~~l~~~~q~~~~~~~~~~~~q~~~~~~---~a~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 630 (651) T protein:vir:80 558 KRILVDLLQHWGFEEPE-AYLKQQDQQAPANPQEALLSQAKDVGG---QAMSN-MLQNQLQADGGTQMMSEMYGTPNA 630 (651) T ss_pred HHHHHHHHHHcCCCCcH-HhcCCCccchhhhhhHHHHhhHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 011222221100 00000 0000000000000000 00000 000000000000000000000000 No 97 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.57 E-value=1.7e-14 Score=96.13 Aligned_cols=472 Identities=11% Similarity=0.012 Sum_probs=201.9 Q ss_pred CCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLP-------NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~-------~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |+|-+..+.++...+..... ...+-.+||+|.|-....... .....+..+|.++.+|+..+++-.-+-.. T Consensus 1 m~d~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~~---l~~q~rp~~N~i~~~i~~v~g~e~~nr~d 77 (725) T protein:vir:92 1 MADNENRLESILSRFDADWTASDEARREAKNDLFFSRISQWDDWLSQY---TTLQYRGQFDVVRPVVRKLVSEMRQNPID 77 (725) T ss_pred CCchHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCHHHHHH---HHhcCCCcccchHHHHHHHHhhHHhCCcc Confidence 99988888887766544222 123346899998854321110 01133456899999999988875333211 Q ss_pred ---c---CCCchhHHHHHH----HHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEE---cccceEE Q lcl|NC_021324. 74 ---I---SEDSEGLEELWN----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE---SPLYMYA 140 (480) Q Consensus 74 ---~---~~d~~~~~~l~~----~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~---~p~~~~~ 140 (480) . ++|.+..+.|.. +...++.+.....++.+++++|.||+-|..+-......++.++|+.. +|... + T Consensus 78 ~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~~~-V 156 (725) T protein:vir:92 78 VLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSACSH-V 156 (725) T ss_pred eEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCChhh-c Confidence 1 234444444444 45568899999999999999999998875432111222344444433 23221 2 Q ss_pred EEeCCCCCc----eE-EEEEEEEEec------------------------------CCceEEEEEEEeCCcE-------- Q lcl|NC_021324. 141 ELDPRNTRR----VT-RAVRLYTTRD------------------------------DVAVPDRATLYLPDET-------- 177 (480) Q Consensus 141 v~d~~~~~~----~~-~~~~~~~~~~------------------------------~~~~~~~~~~~~~~~~-------- 177 (480) +||+..... .. ++++.|.+.+ ....+..+++|+...+ T Consensus 157 ~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~~~~~~~ 236 (725) T protein:vir:92 157 IWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKETAFIYQ 236 (725) T ss_pred ccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEeeeEEeec Confidence 355532210 00 0011111100 0122334444442211 Q ss_pred -------EEEEecC----------CC------------------ccccccccccccccccccceEEeeccc--ccCccCC Q lcl|NC_021324. 178 -------VPLRRNG----------GL------------------NDQWVVDGDVIKHGLGVVPVVPLTNDP--RLGNRYG 220 (480) Q Consensus 178 -------~~~~~~~----------~~------------------~~~~~~~~~~~~~~~g~iPvv~~~n~~--~~~~~~G 220 (480) ..|.... .+ ..+.....++.+-+-+++|+|+|.-.. ..+.+++ T Consensus 237 d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~g~~~~ 316 (725) T protein:vir:92 237 DPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVEDKEVY 316 (725) T ss_pred CCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccCCcccc Confidence 1111000 00 000011112223333556777654221 2233443 Q ss_pred CccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccc--ccccchhhhhcchheecCCCC---ceEEEec- Q lcl|NC_021324. 221 RSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTN--DGENTTLDIYYGRILTLASEA---AKISEFK- 294 (480) Q Consensus 221 ~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~--~~~~~~~~~~~~~~~~~~g~~---~~~~~~~- 294 (480) .|-| +.+++.++.+|..+|.....+........++--...+.... ...+. ............++. ..+...+ T Consensus 317 ~G~v-r~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~~~~~~~~g~~~~~~i~~~~~ 394 (725) T protein:vir:92 317 EGVV-RLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDD-YPYYLLNRTDENNGEMPTQPLAYYEN 394 (725) T ss_pred ccee-ccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHHHHhccCc-cceeeccccccccccccccCCcccCC Confidence 3333 67999999999999998877754433221211000000000 00000 000000001111111 1122222 Q ss_pred cccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HhC Q lcl|NC_021324. 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ----IMG 370 (480) Q Consensus 295 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~----~~~ 370 (480) ..-...+...++.....|-.++|+++..+|..+ |..||+|+..+-..-..........+..+.+++.++++. +.+ T Consensus 395 ~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~-n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~ 473 (725) T protein:vir:92 395 PEVPQANAYMLEAATAAVKEVATLGVDAEAVNG-GQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIVNDIYD 473 (725) T ss_pred CCchHHHHHHHHHHHHHHHHHhCCCHHHhccCc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 223456777888888888889999999998754 558999999887776666666666667776666555443 332 Q ss_pred CC---------Ccccc------------------------eeeeEEecCCCCCC-HHHHHHHHHHHHhccccCCCH--HH Q lcl|NC_021324. 371 RE---------VTEEY------------------------TRLETVWRDPSTPT-VAAKADAVSKLYANGQGPIPK--EQ 414 (480) Q Consensus 371 ~~---------~~~~~------------------------~~~~v~f~~~~~~~-~~~~ad~~~kl~~~~~~~~s~--e~ 414 (480) .. ....+ .++.|.=.+ ...+ ..+.+..++++.++.....+. -+ T Consensus 474 ~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p-~~~s~r~~~~~~l~ql~~~~~~~~~~~~~~ 552 (725) T protein:vir:92 474 VPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP-SFQSMKQQNRAEILELLGKTPQGTPEYQLL 552 (725) T ss_pred CCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeecc-ChHHHHHHHHHHHHHHHHhcccchhHHHHH Confidence 11 01000 111111111 1122 234555666776654433222 12 Q ss_pred HHHhCCCChhH-HHHHHHHHHHHH--------------HHHHHHHhhhcccCCCcCCCCCC------CC-CCccCCCCCC Q lcl|NC_021324. 415 ARIDLGYTATQ-REQMRDWDKQET--------------EDMIDTLYSTTKAQADATPKPTV------TE-TKTETQTSPS 472 (480) Q Consensus 415 ~~~~~~~~~d~-~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~------~d-~~~~~~~~~~ 472 (480) +...+...+-+ +.++.+..+.+. +....+..........+..+... .+ .+.+.+..-. T Consensus 553 l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~~kaqaE~~k~ 632 (725) T protein:vir:92 553 LLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAKAQNQTLSL 632 (725) T ss_pred HHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222221111 112211111100 00000000000000000000000 00 0000000000 Q ss_pred CCCcccCC Q lcl|NC_021324. 473 GFNRTKTR 480 (480) Q Consensus 473 ~~~~~~~~ 480 (480) ..+-.+++ T Consensus 633 q~~a~~~~ 640 (725) T protein:vir:92 633 QIDAAKVE 640 (725) T ss_pred HHHHHHHH Confidence 00000000 No 98 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.55 E-value=1.4e-14 Score=96.53 Aligned_cols=471 Identities=13% Similarity=0.059 Sum_probs=200.0 Q ss_pred CCCH-HHHHHHHHHHHHHH----HHHH-----HHHHHHhhccCcchhcCccc-chh--hhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARD----LPNL-----LEAEAYRNGTRRLKTIGIGA-PPE--LAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~----~~~~-----~~~~~YY~G~~~i~~~~~~~-~~~--~~~~~~~~n~~~~iVd~~~~~l 67 (480) |++. .+++.++...+..- .... ++--+||+|.|--...-... ... .+.-.+++|.++.+|+..+++- T Consensus 1 ma~~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~v~g~e 80 (708) T protein:vir:17 1 MAETLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcchHHHHHHHHhhH Confidence 8887 56677666655431 1111 22236899987443211100 000 0123577899999999998876 Q ss_pred ccCccc---cCC----CchhHHHH----HHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc---CCCCceeEEEEE Q lcl|NC_021324. 68 DIEGFR---ISE----DSEGLEEL----WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES---GDPAGIPLIRVE 133 (480) Q Consensus 68 ~~~~~~---~~~----d~~~~~~l----~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~---~d~~g~~~~~~~ 133 (480) .-+-.. .+. |.+..+.| ..+...|+.+.....++.++.++|.||+-+..+-..+ .+.+..+.+..+ T Consensus 81 ~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:17 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred hhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccceEee Confidence 433221 122 33333444 4455668999999999999999999998775432111 112223333333 Q ss_pred -cc-cceEEEEeCCCCCc----eEEE-EEEEEEec---------------------------CCceEEEEEEEeCC---- Q lcl|NC_021324. 134 -SP-LYMYAELDPRNTRR----VTRA-VRLYTTRD---------------------------DVAVPDRATLYLPD---- 175 (480) Q Consensus 134 -~p-~~~~~v~d~~~~~~----~~~~-~~~~~~~~---------------------------~~~~~~~~~~~~~~---- 175 (480) +| ..++ ||+..... ..+. .+.|.+.+ +.+.+...++|... T Consensus 161 ~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~~ 238 (708) T protein:vir:17 161 YDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVRKESV 238 (708) T ss_pred ccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEEeeeee Confidence 33 4544 66653210 0000 11110000 11223333333211 Q ss_pred -----------cEEEEEecCC----------C------------------ccccccccccccccccccceEEeecc---- Q lcl|NC_021324. 176 -----------ETVPLRRNGG----------L------------------NDQWVVDGDVIKHGLGVVPVVPLTND---- 212 (480) Q Consensus 176 -----------~~~~~~~~~~----------~------------------~~~~~~~~~~~~~~~g~iPvv~~~n~---- 212 (480) .++.|..... + ..+......+.|-+++++|+|+|... T Consensus 239 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~ 318 (708) T protein:vir:17 239 DVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFI 318 (708) T ss_pred EEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEecccccc Confidence 1111111100 0 00001122334455677888887532 Q ss_pred cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhh-----cCCCcccccccccc---chhhhhcchh-eec Q lcl|NC_021324. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI-----SGVTTDELTNDGEN---TTLDIYYGRI-LTL 283 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i-----~G~~~~~~~~~~~~---~~~~~~~~~~-~~~ 283 (480) .....+||. .+.+++.++.||..+|.+...+......+.++ .|............ ..+....+.+ +.. T Consensus 319 d~~~~~yG~---vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~v~ 395 (708) T protein:vir:17 319 DDIERVEGH---IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGNII 395 (708) T ss_pred cCCCcccch---hhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccccc Confidence 222334663 46789999999999999988775554433222 22211100000000 0001001111 111 Q ss_pred CCCCceEEEe-ccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 284 ASEAAKISEF-KAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) Q Consensus 284 ~g~~~~~~~~-~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 362 (480) ++..+. ... +..-.+.+...++.....|-.++|+++..+|.. .| .||+|+...-..-..........+..+.+++. T Consensus 396 ~~a~~~-~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~-sn-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g 472 (708) T protein:vir:17 396 AGATPA-GYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMP-SN-IAQETVNNLMNRADMASFIYLDNMAKSLKRAG 472 (708) T ss_pred cccCCc-ccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCc-cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 122111 111 122345677778888888888899999988853 34 79999998877766666666666666666655 Q ss_pred HHHH----HHhCC---------CCcccce-----------------------eeeEEecC-CCCCCH-HHHHHHHHHHHh Q lcl|NC_021324. 363 RIAM----QIMGR---------EVTEEYT-----------------------RLETVWRD-PSTPTV-AAKADAVSKLYA 404 (480) Q Consensus 363 ~li~----~~~~~---------~~~~~~~-----------------------~~~v~f~~-~~~~~~-~~~ad~~~kl~~ 404 (480) ++++ .+.+. ....++. +.+|.-.. |...+. .+..+.++++.. T Consensus 473 ~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll~ 552 (708) T protein:vir:17 473 EVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVLS 552 (708) T ss_pred HHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHHH Confidence 5443 33321 1000000 01111111 111222 244556677766 Q ss_pred ccccCCCHH-----HHHHhCCCC--hhHHHHHH---------------------HHHHHHHHH---HH---HHHhhhcc- Q lcl|NC_021324. 405 NGQGPIPKE-----QARIDLGYT--ATQREQMR---------------------DWDKQETED---MI---DTLYSTTK- 449 (480) Q Consensus 405 ~~~~~~s~e-----~~~~~~~~~--~d~~~~~~---------------------~~~~~~~~~---~~---~~~~~~~~- 449 (480) ++....+.. .+++.+.+. ++-.+++. +..+.+.+. .+ .+.....+ T Consensus 553 ~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~qA 632 (708) T protein:vir:17 553 SMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQA 632 (708) T ss_pred hcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 543221111 112222211 11111111 000000000 00 00000000 Q ss_pred ----cCCCcCCCCCCCCCCccCCCC---------CCCCCcccC---C Q lcl|NC_021324. 450 ----AQADATPKPTVTETKTETQTS---------PSGFNRTKT---R 480 (480) Q Consensus 450 ----~~~~~~~~~~~~d~~~~~~~~---------~~~~~~~~~---~ 480 (480) ..+++.. -.+.......+.. +.-.+..+- + T Consensus 633 e~~ka~aea~~-~q~~a~q~~~~~~~a~~~a~q~~~q~~~~~~~~~~ 678 (708) T protein:vir:17 633 EAQKATNETAQ-TQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVM 678 (708) T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 0000000000000 000000000 0 No 99 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.55 E-value=2.3e-13 Score=89.87 Aligned_cols=435 Identities=15% Similarity=0.105 Sum_probs=211.1 Q ss_pred CCC-------------HHHHHHHHHHHHHH-HH---HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTT-------------YHEHVERLQGLLAR-DL---PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~-------------~~~~i~~l~~~~~~-~~---~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) |++ ...+|.+++..+.. +. .+..++.+||.+-.. ...+...-+ ..+++.+|.+.-++++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~-~~~~~~~~~--~r~~~~~~k~~~~~~~i 77 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDT-TTTSNQGLP--WKNSTTLPKLCQIRDNL 77 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhh-hhhhhcccc--cccccchhHHHHHHHHH Confidence 322 22233333333322 21 233678899988432 222211111 14588899998888888 Q ss_pred Hhhhc----cC-------ccccCCCchh--HHHHHHHH----HhcCHHHHHHHHHHHHhhcCeeEEEEecCccc------ Q lcl|NC_021324. 64 SDRLD----IE-------GFRISEDSEG--LEELWNWW----QANDLDEESVLGHDDSLTFGRAYITVSHPDVE------ 120 (480) Q Consensus 64 ~~~l~----~~-------~~~~~~d~~~--~~~l~~~~----~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~------ 120 (480) +.+|. ++ |+. ++|.+. .+.+.... ...++...+.++..++.++|.|++.+...... T Consensus 78 ~~~l~~~~Fp~~~w~~~v~~~-~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e~ 156 (584) T protein:vir:95 78 HSNYFSSLFPNDDWLRWVGYG-KGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTDG 156 (584) T ss_pred HHHHHHhhcCccceeeeecCC-CchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeecc Confidence 87773 11 222 222222 34455544 45589999999999999999999998753321 Q ss_pred -cCCCCceeEEEEEcccceEEEEeCCCCCc--eEEEEEEEEEe-----------------------------------cC Q lcl|NC_021324. 121 -SGDPAGIPLIRVESPLYMYAELDPRNTRR--VTRAVRLYTTR-----------------------------------DD 162 (480) Q Consensus 121 -~~d~~g~~~~~~~~p~~~~~v~d~~~~~~--~~~~~~~~~~~-----------------------------------~~ 162 (480) ....-..|++..+||.++| ||+.-... ..+.++.+... ++ T Consensus 157 ~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~~ 234 (584) T protein:vir:95 157 TLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVED 234 (584) T ss_pred ccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCcccc Confidence 1122346999999999988 66654221 11112221100 00 Q ss_pred CceE---------EEEEEEeCCcEEEEEe-------cCCCcc----------ccccccccccccccccceEEeecccccC Q lcl|NC_021324. 163 VAVP---------DRATLYLPDETVPLRR-------NGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 163 ~~~~---------~~~~~~~~~~~~~~~~-------~~~~~~----------~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) .... ...+.|.++.+.-+.. ..+... +.....+..|.+.|.+|++.+...+... T Consensus 235 ~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~~ 314 (584) T protein:vir:95 235 FDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRPD 314 (584) T ss_pred cccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeeec Confidence 0000 0111122221211110 000000 0001122345667999999999999999 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCC-CceEEEecc Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASE-AAKISEFKA 295 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~~~~~~~~~ 295 (480) +.||.|...- +.++|+.+|.++-.+.+++..+.+|.+...+.. .. +...++..+..... ...+...+. T Consensus 315 s~yG~gi~~l-l~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~--~~--------~~~~pg~~~~~~~~~~~q~~~p~a 383 (584) T protein:vir:95 315 NLWAMGPLDN-LVGMQYRIDHLENAKADAVDLIIQPPLKIIGEV--EE--------FVWGPGAEIHLDQGGDVQEIAKNV 383 (584) T ss_pred cccCCCchhh-hhhHHHHHhHHHHHHHHHHHHhcCcceeecccc--ch--------hcccCCceeecCCCCCcceecCch Confidence 9999998754 899999999999999999999999955544321 11 22334544443222 222333332 Q ss_pred ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhCCCC- Q lcl|NC_021324. 296 AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAW-ERAMRIAMQIMGREV- 373 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l-~~~~~li~~~~~~~~- 373 (480) ..+-.....|..+...+-..+|+|...-|..+....++..+......+..-...+.+.|...+ ++++.++..+-.... T Consensus 384 ~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~nmd 463 (584) T protein:vir:95 384 NYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLETATRNMD 463 (584) T ss_pred hhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 222222244555556667788999888886654444444566666677777777778888776 777777775521100 Q ss_pred cccce------------------eeeEEe--cCCCCCCHHHHHHHH---HHHHh--cccc---CCCHHHHHH----h--C Q lcl|NC_021324. 374 TEEYT------------------RLETVW--RDPSTPTVAAKADAV---SKLYA--NGQG---PIPKEQARI----D--L 419 (480) Q Consensus 374 ~~~~~------------------~~~v~f--~~~~~~~~~~~ad~~---~kl~~--~~~~---~~s~e~~~~----~--~ 419 (480) ..+.. +++-.| ......-+..+++.+ .++.+ .|.. .++...+.. . + T Consensus 464 ~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ladl~~~ 543 (584) T protein:vir:95 464 GSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATFVDDVTGL 543 (584) T ss_pred ccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHHHHHHhCC Confidence 00111 111111 111111112222222 22222 1111 122211111 1 1 Q ss_pred C---C-Chh--HHHH--HHHHHHHHHHHHHHHHhhhcccCCCcCC Q lcl|NC_021324. 420 G---Y-TAT--QREQ--MRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 420 ~---~-~~d--~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) + + .++ ..+. ....... ++..+....+..+.-+- T Consensus 544 p~~~~~~~~~~~~~Q~~~q~~~~~----~q~~~~~~~~~~~~~~~ 584 (584) T protein:vir:95 544 QGYEIFRPNVAVAEQAETQSLVAQ----AQEDLQLQAQMPAEGAI 584 (584) T ss_pred CcccccCCCcccchhHHHHhhhHH----HHHHHHHHHhhhhccCC Confidence 1 1 111 0010 1110000 11111111111000011 No 100 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.55 E-value=2.3e-14 Score=95.33 Aligned_cols=406 Identities=12% Similarity=0.101 Sum_probs=194.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHh---hccCcch----hcCccc-chhhhcceeccchHHHHHHHHHhhhccCcc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYR---NGTRRLK----TIGIGA-PPELAYLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY---~G~~~i~----~~~~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) |+.- ..+..+-+.|. -++-... ..+... -.........+.++++|||..+.-+.-+|+ T Consensus 5 m~~~--------------~~~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~ 70 (435) T protein:vir:79 5 MSDK--------------VKAITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGF 70 (435) T ss_pred cccc--------------cccchhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCc Confidence 4332 12222223332 2211110 011111 112223344567789999999999999999 Q ss_pred ccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccc----cCCCCcee-EEEEEcccceEEEEeCCCC Q lcl|NC_021324. 73 RISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE----SGDPAGIP-LIRVESPLYMYAELDPRNT 147 (480) Q Consensus 73 ~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~----~~d~~g~~-~~~~~~p~~~~~v~d~~~~ 147 (480) .+.+++ ..+.+...|++-++...+.++.+.+..||.|++++-..+.. +.+.+|.+ .+.+++|.++.|-.-..+. T Consensus 71 ~i~g~~-~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp 149 (435) T protein:vir:79 71 KVDGVK-NEKSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNA 149 (435) T ss_pred eecCCC-hHHHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccCC Confidence 887665 34667788887788899999999999999998887643321 12233433 4555666555432100111 Q ss_pred CceEEE-EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 148 RRVTRA-VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 148 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) ..+.+. ..+|......+. .-.. +-+.++++|. .-|+-.. .......||.|.|.+ T Consensus 150 ~sp~fg~P~~y~v~~~~~~-~~~~-iH~SRli~~~---------------------g~~~p~~--~~~~~~~~G~S~l~e 204 (435) T protein:vir:79 150 RSVRYGEPKLYKISPGGDI-PEFF-VHYSRICIID---------------------GERVSNE--KRRQNDGWGASILNK 204 (435) T ss_pred cccccCcceEEEEecCCCC-CceE-EcceeEEEec---------------------CCcchhh--hccccCcccchHHHH Confidence 111000 011111100000 0000 0112222211 0111001 012245778887755 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc---h---hh--hhcchheecCCCCceEEEeccccH Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT---T---LD--IYYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~---~---~~--~~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) .+.+-+..++.+.......+..+....+.+.|............. . +. ...+..+.+.+++.++.+++. ++ T Consensus 205 ~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~-~l 283 (435) T protein:vir:79 205 RLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNS-DV 283 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEec-cc Confidence 566666667777666666665555544444442111100111100 0 11 111222333444445554442 34 Q ss_pred HHHHHHHHHHHHHHHhhcCCChHHhccc-ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021324. 299 RNFAEEMEVFRKEAASITGLPPQYLSSS-SEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~-~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 376 (480) ....+.+.....+|++.++||..-|.+. ... ++||..-..-|...+.. ..+..+.+.|++++.+++.- T Consensus 284 sgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~--~Qe~~l~p~l~~l~~li~~s-------- 353 (435) T protein:vir:79 284 SGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDR--KRVEDYKPILEFLLPFMISE-------- 353 (435) T ss_pred CCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhcC-------- Confidence 4556667777889999999998665443 322 35676555555554442 33567788888888876521 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCC-hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcC Q lcl|NC_021324. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYT-ATQREQMRDWDKQETEDMIDTLYSTTKAQADAT 455 (480) Q Consensus 377 ~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (480) .++.+.|.+...++..+.|+...+.+++. .++ -..|.+ .+++.+. .+... ........... T Consensus 354 -~d~~~~f~pL~~~sekEkAei~~~~a~a~------~~~-~~~g~i~~~e~r~~---L~~~~-------~~~~~~~~~~~ 415 (435) T protein:vir:79 354 -TEWSIEFEPLSVPSDKDKAEIMAKNVESV------VKL-KAEQAINLKETRDT---LRSIC-------PDLKIMDNDNI 415 (435) T ss_pred -CCCeEEeCCCCCCCHHHHHHHHHHHHHHH------HHH-HhcCCCCHHHHHHH---HHHhc-------cccCCCCcccc Confidence 35789999999999999998766655432 122 233433 3333211 11100 00000000000 Q ss_pred CCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 456 PKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 456 ~~~~~~d~~~~~~~~~~~~~~ 476 (480) +-++ .+.....+..++|.|+ T Consensus 416 ~~~~-~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 416 ELPE-PEDLDPEPGQEGGLNK 435 (435) T ss_pred cCCc-cccCCCCCCCCCCCCC Confidence 1111 1111112222334444 No 101 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.53 E-value=2e-14 Score=95.72 Aligned_cols=429 Identities=12% Similarity=0.052 Sum_probs=190.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHH-----HHH-------HHhhccCcchh----cCcccch-hhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLL-----EAE-------AYRNGTRRLKT----IGIGAPP-ELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~-----~~~-------~YY~G~~~i~~----~~~~~~~-~~~~~~~~~n~~~~iVd~~ 63 (480) =.+ .. ...|+..|..+-..+. -+. .+.-|...-.. .+..... .+-.....+.+++++||.. T Consensus 27 ~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~r~~Vd~~ 104 (532) T protein:vir:94 27 RAT-HT-SLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWPGFPTLALLAQLPEYRTMHETP 104 (532) T ss_pred hhh-hh-hhhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccchHHHHHHHHcCchhhhhhccc Confidence 000 00 0012222222111100 000 01111100000 0000110 1111223356679999999 Q ss_pred HhhhccCccccCC--C----chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc---C----------CC Q lcl|NC_021324. 64 SDRLDIEGFRISE--D----SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES---G----------DP 124 (480) Q Consensus 64 ~~~l~~~~~~~~~--d----~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~---~----------d~ 124 (480) ++-+.-+|+.+.. + ++..+.|...|++-++...+.++.+.+..||.|++++....... . -. T Consensus 105 aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~~I~ 184 (532) T protein:vir:94 105 ADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPSFVQ 184 (532) T ss_pred hHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccccccccccccccccc Confidence 9988888887633 2 22334567777777788899999999999999987765321110 0 01 Q ss_pred Ccee-EEEEEcccceEEE-EeCCCCCceEEEEEEEEEecCCceEEEEEE-----EeCCcEEEEEecCCCccccccccccc Q lcl|NC_021324. 125 AGIP-LIRVESPLYMYAE-LDPRNTRRVTRAVRLYTTRDDVAVPDRATL-----YLPDETVPLRRNGGLNDQWVVDGDVI 197 (480) Q Consensus 125 ~g~~-~~~~~~p~~~~~v-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (480) .|.+ .+.+++|..+.|- |+..+...+ +.+.+.++.+ +-+.++++|... T Consensus 185 ~g~~~~l~vld~~~v~p~~~~~~dp~sp-----------~fg~P~~y~v~~g~~iH~SRli~f~g~-------------- 239 (532) T protein:vir:94 185 RGCLIGFATIEPMWLSPNAYNATDPTLP-----------SFYKPDSWIATSGKKIHSSRIHTVVGR-------------- 239 (532) T ss_pred cceeeEEEeechheeccccccccccccc-----------ccCCceeEEEccCeeeccceEEEecCC-------------- Confidence 1222 3556666666553 221111111 1111111111 113333333211 Q ss_pred cccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc-----ch Q lcl|NC_021324. 198 KHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN-----TT 272 (480) Q Consensus 198 ~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~-----~~ 272 (480) .+|-.. ......+|.|.+.+ +..-+..++.+.-.....+..+....+.+ +.......+.... .. T Consensus 240 -----~~p~~~----~~~~~~~G~Svlq~-~~~~l~~~~~t~~~~~~l~~~~~~~v~k~-~~a~~ls~~~~~~~~~r~~~ 308 (532) T protein:vir:94 240 -----PVGDML----KAAYSFRGVSISQL-AMPYVDNWLRTRQSVSDTVKQFSMTNLAT-DMAQLLAPGGAQSLDARLQL 308 (532) T ss_pred -----Cchhhh----ccccccccccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCceeee-chHHhhcchhHHHHHHHHHH Confidence 011100 01123568887754 55556666666666655554444333222 2111000011010 01 Q ss_pred hhhh--cchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHh-cccccc-chHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 273 LDIY--YGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL-SSSSEN-PASAEAIIATDSRIVKMAE 348 (480) Q Consensus 273 ~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~~~~-~~Sg~Al~~~~~~l~~k~~ 348 (480) +... ...++.++++..++.+++ .++....+.+.....+|+..++||..-| |..... +++|..-..-|...+. . T Consensus 309 ~~~~~~n~g~~~id~~~e~~e~~~-~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~--s 385 (532) T protein:vir:94 309 FNLYRDNRNIGALDKGTEEIQQTN-TPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIA--G 385 (532) T ss_pred HHhhcCCccceEEcCCCceeEEEe-cccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHH--H Confidence 1111 123444544444565554 2344555667777789999999998755 433222 3456554444444443 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH-------HHHhccccCCCHHHHHHhCCC Q lcl|NC_021324. 349 RKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS-------KLYANGQGPIPKEQARIDLGY 421 (480) Q Consensus 349 ~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~-------kl~~~~~~~~s~e~~~~~~~~ 421 (480) ..+..+...|++++.+++....+.... ++.+.|.+....+..+.|+... +++++ +.++.+.+++.++. T Consensus 386 ~Qe~~l~p~le~l~~~l~~s~~g~~~~---d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~--Gvi~~~Evr~~l~~ 460 (532) T protein:vir:94 386 YQATNLTPLMEWIIDLIQLSEYGQIDP---GLAWEWSPLMELDDKELAEVRQLNASTDSTLMEL--GVIDAKMVQQRLAA 460 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCC---CceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHhc Confidence 344667888999998887654444332 4889999988889988877543 34443 36777777776653 Q ss_pred Chh------HH--HHHHHHHHHHHHHHHHHHhhhcccCCCcC-----CCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 422 TAT------QR--EQMRDWDKQETEDMIDTLYSTTKAQADAT-----PKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 422 ~~d------~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) .+. .. +.+.+.... .+.........++.+ +.+.+.+.....+.....+.+..-| T Consensus 461 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 527 (532) T protein:vir:94 461 DPTSGYAGALGERDELDDVEEI-----AKQLMAAALNPPATAPQTPNPQPDSEDDQTDNQPDAQADPAQNDQ 527 (532) T ss_pred CCccccccccccccccccccch-----hhhhcccccCCCCCCCCCCCCCCCCCCCCCCCccCCCccccccCC Confidence 321 11 011111111 111111111111111 1111111111111111111221222 No 102 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.52 E-value=8.3e-13 Score=86.81 Aligned_cols=445 Identities=12% Similarity=0.061 Sum_probs=217.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC--c--chhc-C--cccch----hh-------hcceeccchHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR--R--LKTI-G--IGAPP----EL-------AYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~--~--i~~~-~--~~~~~----~~-------~~~~~~~n~~~~iVd~ 62 (480) |.++.--..... .+.........||.|.. . +... + ..... .. +++-..+.|++-+|+. T Consensus 1 ~~~p~~~~~~~~----~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~ 76 (533) T protein:vir:34 1 MKTPTIPTLLGP----DGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAIQL 76 (533) T ss_pred CCCchhhhhhcc----cccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 666632221111 11222345567876531 1 1110 0 00101 00 1122345689999999 Q ss_pred HHhhhccCccccCC-------------CchhHHHHHHHHH---h-----------cCHHHHHHHHHHHHhhcCeeEEEEe Q lcl|NC_021324. 63 LSDRLDIEGFRISE-------------DSEGLEELWNWWQ---A-----------NDLDEESVLGHDDSLTFGRAYITVS 115 (480) Q Consensus 63 ~~~~l~~~~~~~~~-------------d~~~~~~l~~~~~---~-----------n~~~~~~~~~~~~~~~~G~a~~~v~ 115 (480) .+.+++|.|++... +++..+.+...|+ + .+|...+..+++.....|.+|+... T Consensus 77 ~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~ 156 (533) T protein:vir:34 77 HQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQAT 156 (533) T ss_pred HHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEee Confidence 99999999987522 1222344444442 1 1466778889999999999999876 Q ss_pred cCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccc Q lcl|NC_021324. 116 HPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGD 195 (480) Q Consensus 116 ~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (480) ..... +..--+++..++|+.+---++......+...| ..+..|.+..+-++..+. .+.....+... T Consensus 157 ~~~~~--g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GI----e~d~~Gr~~aY~i~~~~~------~~~~~~~~~~~-- 222 (533) T protein:vir:34 157 WDTSS--SRLFRTQFRMVSPKRISNPNNTGDSRNCRAGV----QINDSGAALGYYVSEDGY------PGWMPQKWTWI-- 222 (533) T ss_pred eccCC--CCccceEEEEechhhcCCCCCCCCCCceEeee----EECCCCCeEEEEEeecCC------CCcccccccee-- Confidence 54321 11123588999998865433332333344444 234455444333333211 00000000000 Q ss_pred cccccccccceEEeecccccCccCCCccchHHHHHHH--HHHHHHHHHHHHHHHHhcchhhhhcCCCccc---------- Q lcl|NC_021324. 196 VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT--DAASRTLMNLQSASQILGTPLRVISGVTTDE---------- 263 (480) Q Consensus 196 ~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~--D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~---------- 263 (480) +.....+.-=|+|+......++..|.|.+++-+..|- +.|.......+.....++. +|+...... T Consensus 223 ~~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~---fi~~~~~~~~~~~~~~~~~ 299 (533) T protein:vir:34 223 PRELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAA---TIESELDTQSAMDFILGAN 299 (533) T ss_pred eeeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhhee---eeecCCCcccccccccCCC Confidence 0001122223788888778889999999987554442 3333333333333332221 222111100 Q ss_pred ---cccc-----------cccchhhhhcchheecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc Q lcl|NC_021324. 264 ---LTND-----------GENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE 328 (480) Q Consensus 264 ---~~~~-----------~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~ 328 (480) ..+. .....+.+.+|.+..+. |.+.++.+.+. ...+|...++.+++.|+...|+|-+.|.++.. T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~-p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s 378 (533) T protein:vir:34 300 SQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQD-TDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYA 378 (533) T ss_pred cccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcc Confidence 0000 00112245667776654 44455544332 23456677888889999999999999987642 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH--HHHHhCCCCcc------cc-----eeeeEEecCCCC--CCH Q lcl|NC_021324. 329 NPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRI--AMQIMGREVTE------EY-----TRLETVWRDPST--PTV 392 (480) Q Consensus 329 ~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~-~~~l--i~~~~~~~~~~------~~-----~~~~v~f~~~~~--~~~ 392 (480) + +|-.|.|+.+..........+..|...+-+ +++. -.+++.+.++. ++ .-+.+.|..|-. .|. T Consensus 379 ~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP 457 (533) T protein:vir:34 379 Q-MSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDG 457 (533) T ss_pred c-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccCh Confidence 2 233456777666666666666666554422 2221 12233332221 11 124577755443 466 Q ss_pred HHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCC Q lcl|NC_021324. 393 AAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPS 472 (480) Q Consensus 393 ~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 472 (480) ...+.+....+.+| +.|.+..+...|...+++-+ ++.++.+. .+.+. ..-..+++..+. ......+.+|+ T Consensus 458 ~Ke~~a~~~~i~~G--~~s~~~~~a~~G~D~~ev~~--q~a~e~~~--~~~~g--l~~~~~~~~~~~--s~~~~~~~~~~ 527 (533) T protein:vir:34 458 LKEVQEAVMLIEAG--LSTYEKECAKRGDDYQEIFA--QQVRETME--RRAAG--LKPPAWAAAAFE--SGLRQSTEEEK 527 (533) T ss_pred HHHHHHHHHHHHcC--CCCHHHHHHHcCCCHHHHHH--HHHHHHHH--HHhcC--CCCCCCCCcCcc--CCCCCCCCCCc Confidence 67777777777776 66888888888987665432 22222111 11110 111111111111 11112222233 Q ss_pred CCCccc Q lcl|NC_021324. 473 GFNRTK 478 (480) Q Consensus 473 ~~~~~~ 478 (480) .++.++ T Consensus 528 ~~~~~~ 533 (533) T protein:vir:34 528 SDSRAA 533 (533) T ss_pred ccCCCC Confidence 333333 No 103 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.52 E-value=4.3e-13 Score=88.41 Aligned_cols=456 Identities=11% Similarity=0.056 Sum_probs=214.9 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHhhcc--Cc--chh-c--Ccccch----hh-------hcceeccchHHHHHH Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLPNLLEAEAYRNGT--RR--LKT-I--GIGAPP----EL-------AYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~~~~~~~~YY~G~--~~--i~~-~--~~~~~~----~~-------~~~~~~~n~~~~iVd 61 (480) |.+. .-.+.......... +...-...|+|- +. ... . ...... .. +++-..+.|++-+|+ T Consensus 1 m~~~~~r~~~~~a~~~~~~--~~~~~~~~y~gA~~~~r~~~~w~~~~~s~~~~~~~~~~~lr~RaRdL~rNn~~a~~av~ 78 (553) T protein:vir:63 1 MTKVTVRKLSEVTSGRPEQ--SASLGGGGLEGASRLSRETVSWNPSLRSPDALINPLKRIADARGRDMADNDGFTNGAVG 78 (553) T ss_pred Ccchhhhhhcccccccchh--hhhhhcccccccccCCCcccccccCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 4443 22232222222111 111112345542 11 110 0 000000 11 112234568999999 Q ss_pred HHHhhhccCccccC----------CCc----hhHHHHHHH---HHh-----------cCHHHHHHHHHHHHhhcCeeEEE Q lcl|NC_021324. 62 TLSDRLDIEGFRIS----------EDS----EGLEELWNW---WQA-----------NDLDEESVLGHDDSLTFGRAYIT 113 (480) Q Consensus 62 ~~~~~l~~~~~~~~----------~d~----~~~~~l~~~---~~~-----------n~~~~~~~~~~~~~~~~G~a~~~ 113 (480) ..+..++|.|++.. .++ +..+.+... |.+ .+|...+..+++.....|.+|+. T Consensus 79 ~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 158 (553) T protein:vir:63 79 YQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLAT 158 (553) T ss_pred HHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEE Confidence 99999999998742 112 222333333 322 14677788899999999999988 Q ss_pred EecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccc Q lcl|NC_021324. 114 VSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVD 193 (480) Q Consensus 114 v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (480) ....... +..--+++..++|+.+-.-++......+...| +.+..|.+..+-++..+---.|...... ..+.-. T Consensus 159 ~~~~~~~--~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GV----E~d~~Gr~vaY~i~~~hPgd~~~~~~~~-~~~~r~ 231 (553) T protein:vir:63 159 AEWDRAA--NRPYATCFQMVSTDRLSNPYQQLDTPTLRRGV----QYDKRGRPQGYWIQVAHPGDLYQMAPDM-YKWKFV 231 (553) T ss_pred eeeccCC--CCcccceEEEechhhcCCCCCCCCCCeeEeee----EECCCCceEEEEeeccCCCccccccccc-cceeee Confidence 6543311 11123688999998875544433333444444 3345555544444442211111111100 000000 Q ss_pred cccccccccccceEEeecccccCccCCCccchHHHHHHH--HHHHHHHHHHHHHHHHhcchhhhhcCCCcccc------- Q lcl|NC_021324. 194 GDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT--DAASRTLMNLQSASQILGTPLRVISGVTTDEL------- 264 (480) Q Consensus 194 ~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~--D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~------- 264 (480) .. ...++.-=|+|+......++.-|.|.|++-+..|- +.|.......+.....++ .+|+...++.. T Consensus 232 ~~--~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a---~fi~~~~~~~~~~~~~~~ 306 (553) T protein:vir:63 232 QQ--SKPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYA---AAIESELPPEFIHSQMSG 306 (553) T ss_pred cc--ccccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhhe---eeeecCCChhhhhhhccc Confidence 00 11223334577777777888999999988555443 333333333333333222 12221111000 Q ss_pred ------c-------------cccccchhhhhcchheecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhc Q lcl|NC_021324. 265 ------T-------------NDGENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLS 324 (480) Q Consensus 265 ------~-------------~~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g 324 (480) . ...+.....+.+|.+..+. |.+.++.+.+. ...+|...++.+++.|+...|+|-+.|. T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~-p~~~~~~F~~~~lr~iaaglGi~Ye~lt 385 (553) T protein:vir:63 307 GSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGT-PGGVGSEFEASLNRHLASAFGMSYEEFT 385 (553) T ss_pred ccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHh Confidence 0 0011123445677776664 44455544332 2345667788889999999999999887 Q ss_pred cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH--HHHhCCCCccc--------------ceeeeEEecCC Q lcl|NC_021324. 325 SSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER-AMRIA--MQIMGREVTEE--------------YTRLETVWRDP 387 (480) Q Consensus 325 ~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~-~~~li--~~~~~~~~~~~--------------~~~~~v~f~~~ 387 (480) ++..+ +|-.|.|+.+..........+..|...+-+ +++.. .+++.+.++.. ..-+.+.|..| T Consensus 386 ~D~s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p 464 (553) T protein:vir:63 386 RDFSK-ANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGA 464 (553) T ss_pred hhccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecC Confidence 76422 233356666666666666666666554433 33321 22333322110 01245778665 Q ss_pred CC--CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcC-CCCCCCCCC Q lcl|NC_021324. 388 ST--PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADAT-PKPTVTETK 464 (480) Q Consensus 388 ~~--~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~ 464 (480) -. .|....+.+....+.+| +.|.+..+...|...+++-+ ++.++.+ ..+.+. .....++. ....+.+.. T Consensus 465 ~~~~iDP~Ke~~A~~~~i~~G--~~t~~~~~a~~G~D~~~v~~--q~a~e~~--~~~~~G--l~~~~~~~~~~~~~~~~~ 536 (553) T protein:vir:63 465 SQGQIDQLKETQAAVMRIDAG--LSTYEREIARLGGDFRKSFA--QRAREDA--LLKKYG--LTFNLSAKRSLGDGRDAA 536 (553) T ss_pred CccccChHHHHHHHHHHHHcC--CCCHHHHHHHhCCCHHHHHH--HHHHHHH--HHHHcC--CCCCCCCccccCCCcccC Confidence 54 35566777777777776 66788888888987665432 2222111 111111 01011111 101111111 Q ss_pred ccCCCCCCCCCcccCC Q lcl|NC_021324. 465 TETQTSPSGFNRTKTR 480 (480) Q Consensus 465 ~~~~~~~~~~~~~~~~ 480 (480) ++....|...+++++- T Consensus 537 ~~~~~~~~~~~~~~~~ 552 (553) T protein:vir:63 537 TGIAEDPAAAQTSQQG 552 (553) T ss_pred CCCCCCCCCCCccccc Confidence 1111112122222222 No 104 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.50 E-value=7.3e-14 Score=92.61 Aligned_cols=471 Identities=12% Similarity=0.074 Sum_probs=203.2 Q ss_pred CCCH-HHHHHHHHHHHHHHHH-----HHHH--HHHHh--hccCcchhcCc-ccchhh--hcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLP-----NLLE--AEAYR--NGTRRLKTIGI-GAPPEL--AYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~-----~~~~--~~~YY--~G~~~i~~~~~-~~~~~~--~~~~~~~n~~~~iVd~~~~~l 67 (480) |++. ++++.++...+..... |-.. -.+|| .|+|--..... ...... +.-.+++|.++.+|+..+++- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINKVATELNRIIAEY 80 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcchHHHHHHHHHHH Confidence 8887 6677777776544221 1111 12344 68774332100 000100 123577899999999998877 Q ss_pred ccCccc---cCC----CchhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc----CCCCceeEEEE Q lcl|NC_021324. 68 DIEGFR---ISE----DSEGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES----GDPAGIPLIRV 132 (480) Q Consensus 68 ~~~~~~---~~~----d~~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~----~d~~g~~~~~~ 132 (480) .-+... .+. |.+..+. +..+++.|+.+.....++.++.++|.||+-|..+-..+ .++.+.+.... T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i~i~~~ 160 (708) T protein:vir:10 81 RNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIAIEPI 160 (708) T ss_pred HhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCccccceEEe Confidence 544322 122 3333344 44456678999999999999999999998875432111 12233343344 Q ss_pred EcccceEEEEeCCCCC--c--eEEE-EEEEEEec---------------------------CCceEEEEEEEeCCc---- Q lcl|NC_021324. 133 ESPLYMYAELDPRNTR--R--VTRA-VRLYTTRD---------------------------DVAVPDRATLYLPDE---- 176 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~--~--~~~~-~~~~~~~~---------------------------~~~~~~~~~~~~~~~---- 176 (480) .+|... +.||+.... . ..++ ++.|.+.+ ........++|.... T Consensus 161 ~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~~~~~~ 239 (708) T protein:vir:10 161 YDPSRS-VWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVRKESVD 239 (708) T ss_pred ecchhh-cccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEEEEEEE Confidence 445321 235543211 0 0111 11111100 001122233332211 Q ss_pred -----------EEEEEecC-----------C-----------------CccccccccccccccccccceEEeecc----c Q lcl|NC_021324. 177 -----------TVPLRRNG-----------G-----------------LNDQWVVDGDVIKHGLGVVPVVPLTND----P 213 (480) Q Consensus 177 -----------~~~~~~~~-----------~-----------------~~~~~~~~~~~~~~~~g~iPvv~~~n~----~ 213 (480) +..|.... + ...+........+-+++++|+|+|.-. . T Consensus 240 ~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~r~~~d 319 (708) T protein:vir:10 240 VISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFID 319 (708) T ss_pred EEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeeeeeccC Confidence 11111100 0 000001112334456677888887532 2 Q ss_pred ccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc-----ccccccchh------hhhcchhee Q lcl|NC_021324. 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL-----TNDGENTTL------DIYYGRILT 282 (480) Q Consensus 214 ~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~-----~~~~~~~~~------~~~~~~~~~ 282 (480) ....+||. .+.+++.++.||...|.....+....-...++-....... .....+..+ ....|.+. T Consensus 320 ~~~~~yG~---vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~~~- 395 (708) T protein:vir:10 320 DIERVEGH---IAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGNII- 395 (708) T ss_pred CCccccee---ecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhccccccccccccc- Confidence 22334553 3668999999999999998877654433322211110000 000000000 00011111 Q ss_pred cCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 LASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 283 ~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) .+.. ....++. .....+...++.....|-.++|+++..+|. ..| .||+|+..+-..-..........+..+.+++ T Consensus 396 -~~~~-~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~-~sn-~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~~ 471 (708) T protein:vir:10 396 -AGAT-PAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSN-IAQETVNNLMNRADMASFIYLDNMAKSLKRA 471 (708) T ss_pred -cccC-CccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccC-ccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1110 1111222 235567788888888888899999998884 334 7999999987777777777777777777776 Q ss_pred HHHHHH----HhCC---------CCcccce-----------------------eeeEEe--cCCCCCCHHHHHHHHHHHH Q lcl|NC_021324. 362 MRIAMQ----IMGR---------EVTEEYT-----------------------RLETVW--RDPSTPTVAAKADAVSKLY 403 (480) Q Consensus 362 ~~li~~----~~~~---------~~~~~~~-----------------------~~~v~f--~~~~~~~~~~~ad~~~kl~ 403 (480) .++++. +.+. ....++. ..+|.- .+..+.-..+.+..++++. T Consensus 472 g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~qll 551 (708) T protein:vir:10 472 GEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVL 551 (708) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHHH Confidence 665543 3221 1000000 011211 1111222335666777777 Q ss_pred hccccCCCHH-----HHHHhCCCC--hhHHHHHHHH---------------------HHHHHHHH---H---HHHhhhcc Q lcl|NC_021324. 404 ANGQGPIPKE-----QARIDLGYT--ATQREQMRDW---------------------DKQETEDM---I---DTLYSTTK 449 (480) Q Consensus 404 ~~~~~~~s~e-----~~~~~~~~~--~d~~~~~~~~---------------------~~~~~~~~---~---~~~~~~~~ 449 (480) +.+....+.. .+++.+.+. ++-.+++.+. .+.+.+.. + .+.....+ T Consensus 552 ~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~q 631 (708) T protein:vir:10 552 SSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) T ss_pred HhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7653321111 112222221 1112222110 00000000 0 00000000 Q ss_pred cCCCcCCCC----CCCCCCccCCCC---------CCCCCccc---CC Q lcl|NC_021324. 450 AQADATPKP----TVTETKTETQTS---------PSGFNRTK---TR 480 (480) Q Consensus 450 ~~~~~~~~~----~~~d~~~~~~~~---------~~~~~~~~---~~ 480 (480) ++...+... .....+...+.. +.-.+..+ -+ T Consensus 632 Ae~~ka~a~a~~~~~~a~q~~~~~~~a~~~a~q~~~~a~~~~~~~~~ 678 (708) T protein:vir:10 632 AEAQKATNETAQTQIKAFTAQQDAMESQANTVYKLAQARNIDDKAVM 678 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 000000000000 00000000 00 No 105 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.50 E-value=1.4e-13 Score=90.98 Aligned_cols=397 Identities=12% Similarity=0.082 Sum_probs=184.3 Q ss_pred HHHHHHHHHHHHHHHHHhhccCcchhcC--cccc-hhhhcceeccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHH Q lcl|NC_021324. 12 QGLLARDLPNLLEAEAYRNGTRRLKTIG--IGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWW 88 (480) Q Consensus 12 ~~~~~~~~~~~~~~~~YY~G~~~i~~~~--~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~~~ 88 (480) ++.+ +-+-|..+--|.++-...+ .... -..-.....+.+++++||..+.-+.-+|+.+.++++ .+.+...| T Consensus 1 ~~~~-----~~d~~~~~~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~g~~~-~~~~~~~~ 74 (427) T protein:vir:10 1 MKIV-----KHDGYNDIFNGGADGSPKPFFMSDASYHVGSFYNDNATAKRIVDVIPEEMVTAGFKMSGVKD-EKEFKSLW 74 (427) T ss_pred CCcc-----ccchHHHHhhcCCCCcccCccccCchHHHHHHHHcCchhhhhhccchHHhhcCCccccCccH-HHHHHHHH Confidence 1111 1111122222211100000 0011 011233445677899999999999999999877653 35677888 Q ss_pred HhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc----CCCCce-eEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCC Q lcl|NC_021324. 89 QANDLDEESVLGHDDSLTFGRAYITVSHPDVES----GDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDV 163 (480) Q Consensus 89 ~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~----~d~~g~-~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~ 163 (480) ++-++...+.++.+.+..||.|++++...+... .+..|. ..+.++++.++.|-....+... .+. T Consensus 75 ~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s-----------~~f 143 (427) T protein:vir:10 75 DSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARS-----------PRY 143 (427) T ss_pred HHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccc-----------ccc Confidence 877899999999999999999998876443221 111222 2344455544433211111100 011 Q ss_pred ceEEEEEE----------EeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 164 AVPDRATL----------YLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 164 ~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +.+..+.+ +-+.++++|. .-|+-.+. ......||.|.|.+.+..-+. T Consensus 144 g~P~~y~v~~~~~~~~~~iH~SRli~~~---------------------g~~~p~~~--~~~~~~~G~S~l~~~~~~~i~ 200 (427) T protein:vir:10 144 GEPEIYKVSPGDNMQPYLIHHSRVFIAD---------------------GERVAQQA--RKQNQGWGASVLNKSLIDAIC 200 (427) T ss_pred CcceEEEEecCCCCcceEEccccEEEec---------------------CCCchhhh--cccCCcccchhhhHHHHHHHH Confidence 11211111 0111121111 11110000 122456888887654555455 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc-h--h---hh--hcchheecCCCCceEEEeccccHHHHHHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT-T--L---DI--YYGRILTLASEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~-~--~---~~--~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l 305 (480) .++++.......+..+....+.+.|............. . + .. .....+.+.+++.++.+++. ++...-+.+ T Consensus 201 ~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~-~lsgl~~~~ 279 (427) T protein:vir:10 201 DYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS-DISGVPEFL 279 (427) T ss_pred HHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEec-ccCChHHHH Confidence 66666665555555555544444442111001111100 0 0 00 11122233344445544432 244455667 Q ss_pred HHHHHHHHhhcCCChHHhccc-ccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021324. 306 EVFRKEAASITGLPPQYLSSS-SEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~-~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~ 383 (480) .....+|++.++||..-|-+. ... ++||..-..-|...+. ...+..+.+.|++++.+++.- .++.+. T Consensus 280 ~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~--~~Qe~~l~p~l~~l~~~i~~s---------~~~~~~ 348 (427) T protein:vir:10 280 SSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLLEFLLPFIVDE---------EEWSIE 348 (427) T ss_pred HHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhcC---------CCcEEE Confidence 777789999999998766443 222 3566654444444444 234467888899888876621 258899 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC-CChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG-YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) |.+-...+..+.|+...+.+++. . .+...| ++++++.+ ..+... ....+........+..++....+ T Consensus 349 f~pL~~~s~kEkaei~~~~a~a~------~-~~~~~gvi~~~e~r~---~L~~~~--~~~~~~~~~~~~~e~~~~~~e~~ 416 (427) T protein:vir:10 349 FEPLSVPSKKEESEITKNNVESV------T-KAITEQIIDLEEARD---TLRSIA--PEFKLKDGNNINIREPEETTEPE 416 (427) T ss_pred eCCCCCCCHHHHHHHHHHHHHHH------H-HHHhcCCCCHHHHHH---HHHhhh--ccccCCCCccccccccchhcCCC Confidence 99999999999988655544431 1 222234 33333221 111110 01111111100000001111111 Q ss_pred C-CccCCCCCCCCC Q lcl|NC_021324. 463 T-KTETQTSPSGFN 475 (480) Q Consensus 463 ~-~~~~~~~~~~~~ 475 (480) | .+++... .| T Consensus 417 p~~~e~~~d---~~ 427 (427) T protein:vir:10 417 PGLGEKLED---EN 427 (427) T ss_pred CCCCCCCCC---CC Confidence 1 1111111 11 No 106 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.50 E-value=5.1e-12 Score=82.51 Aligned_cols=448 Identities=15% Similarity=0.076 Sum_probs=218.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchh---c--Ccccchh----h-------hcceeccchHHHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT---I--GIGAPPE----L-------AYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~---~--~~~~~~~----~-------~~~~~~~n~~~~iVd~~~ 64 (480) |.=+..+|.-+.-....++.+-....+-|+|-..-.. . +...... . +++-..+.|++-+|+..+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 4433333333332333333333333455766321110 0 0000000 0 112234568899999999 Q ss_pred hhhccC-cccc-----CCCch----hHHHH---HHHHHh-------cCHHHHHHHHHHHHhhcCeeEEEEecCccccCCC Q lcl|NC_021324. 65 DRLDIE-GFRI-----SEDSE----GLEEL---WNWWQA-------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 65 ~~l~~~-~~~~-----~~d~~----~~~~l---~~~~~~-------n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~ 124 (480) +.++|. |+.+ ..|++ ..+.+ |+-|.. .+|..++..+++.....|.+|+.......... . T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~-~ 159 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNY-T 159 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccc-c Confidence 999874 4322 22322 22333 333432 34778888899999999999988764332110 0 Q ss_pred Cc---eeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccc Q lcl|NC_021324. 125 AG---IPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGL 201 (480) Q Consensus 125 ~g---~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (480) .| -+++..++|+.+-.-++.. ...+...| ..+..|.+..+-++..+---.+.... . .-+ T Consensus 160 ~g~~~~~~lqliepd~l~~~~~~~-~~~i~~GI----E~D~~Grp~aY~i~~~hPgd~~~~~~--~-----------~~~ 221 (548) T protein:vir:95 160 FATSVPFALELLEPDYLPFSYNNL-SKGIVQGI----ERDTWRRKRAYHLLKDHPGNLQTLGG--S-----------LAV 221 (548) T ss_pred CCcccceEEEEechhhcCCCCCCC-CCceeeee----EECCCCceEEEEEeecCCCccccccc--c-----------cce Confidence 11 2488999998864223221 22333333 33455554444343322110000000 0 011 Q ss_pred cccc---eEEeecccccCccCCCccchHHHHHHH--HHHHHHHHHHHHHHHHhcchhhhhcCCCcccc----ccccccch Q lcl|NC_021324. 202 GVVP---VVPLTNDPRLGNRYGRSEISPELRKVT--DAASRTLMNLQSASQILGTPLRVISGVTTDEL----TNDGENTT 272 (480) Q Consensus 202 g~iP---vv~~~n~~~~~~~~G~s~i~~~v~~l~--D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~----~~~~~~~~ 272 (480) -+|| |+|+....+.++..|.|.+++-+..|- +.|.......+.....|+ .+|+...++.. .+...... T Consensus 222 ~rvpA~~VlHif~~~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a---~fi~~~~~~~~~~~~~~~~~~~~ 298 (548) T protein:vir:95 222 KRVEAERIIHIAYRKRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALA---MYIKKGNPDSYTVEPGKDRKNRT 298 (548) T ss_pred eeechhHheecccccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhhe---eeeecCCCccccCCCCccccccc Confidence 1222 577777777788999999998555553 333333333333333222 23332211111 11222333 Q ss_pred hhhhcchhee-c-CCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 273 LDIYYGRILT-L-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERK 350 (480) Q Consensus 273 ~~~~~~~~~~-~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~ 350 (480) ..+.+|.++. + +|++.++.+.+. ...+|...++.+++.|+...++|-+.|.++. + .|-.|.|+.+.......... T Consensus 299 ~~~~pG~iv~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~IAaglGipYe~ltgD~-s-~nYSS~R~~l~e~~r~~~~~ 375 (548) T protein:vir:95 299 IPIAPGMVFDDLEPGEDVGMIESNR-PNPFLEGFRNGQLRMIGAGTRSTYSSVSRAY-D-GTYSAQRQELVEGWLGYDLL 375 (548) T ss_pred ccccCCccccccCCCceeeecCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccc-c-hhHHHHHHHHHHHHHHHHHH Confidence 4566776553 4 455656654332 2346777788888999999999999998875 2 35556787777777777777 Q ss_pred HHHHHHHHHH-HHHHH--HHHhCCCCcc-----cceeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhCC Q lcl|NC_021324. 351 GRIFGGAWER-AMRIA--MQIMGREVTE-----EYTRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDLG 420 (480) Q Consensus 351 ~~~~~~~l~~-~~~li--~~~~~~~~~~-----~~~~~~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~ 420 (480) +..|...+-+ +++.. .+++.+.+.. ...-+.+.|..|-. .|....+.+....+.+| +.|.+..+...| T Consensus 376 q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~G--l~T~~~~~a~~G 453 (548) T protein:vir:95 376 QHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAG--FADEAEVARARG 453 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHhC Confidence 6666655433 33322 2333333221 11235678865443 46677777777777776 667888888889 Q ss_pred CChhHHH-HHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCC------CCc----------------- Q lcl|NC_021324. 421 YTATQRE-QMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSG------FNR----------------- 476 (480) Q Consensus 421 ~~~d~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~------~~~----------------- 476 (480) ...+++- ++.++.+...+ +.=.....+.......+.++..+.+++..+ ++. T Consensus 454 ~D~~ev~~q~a~E~~~~~~-----~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (548) T protein:vir:95 454 RDPRELKKSRETEIKANRA-----AGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGP 528 (548) T ss_pred CCHHHHHHHHHHHHHHHHH-----cCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCC Confidence 8777643 22222211111 110000000000101112222222222111 111 Q ss_pred ------------ccCC Q lcl|NC_021324. 477 ------------TKTR 480 (480) Q Consensus 477 ------------~~~~ 480 (480) +|.- T Consensus 529 ~~~~~~~~~~~~~~~~ 544 (548) T protein:vir:95 529 DFPNESNNGGADGQPS 544 (548) T ss_pred CCCcccccCCCCCCCC Confidence 1110 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.49 E-value=5e-14 Score=93.48 Aligned_cols=426 Identities=10% Similarity=0.050 Sum_probs=197.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCC--- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED--- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d--- 77 (480) |+.-......+.... ...+-..+..||-... .. + .++-.....+.++++|||..+.-+.-+|+.+..+ T Consensus 70 ~d~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-~~--~----~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~ 140 (537) T protein:vir:10 70 MDGLDVEGGTFSAYA--NPNLSEGLVLWYAQQA-FI--G----HQMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGN 140 (537) T ss_pred ccccccchhhhhhhc--cccccchhhhhccccC-Cc--c----HHHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcc Confidence 221111111000000 0011111222222211 10 0 1111233456778999999999998888875332 Q ss_pred ---chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccc------cCC----CCce-eEEEEEcccceEEEEe Q lcl|NC_021324. 78 ---SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE------SGD----PAGI-PLIRVESPLYMYAELD 143 (480) Q Consensus 78 ---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~------~~d----~~g~-~~~~~~~p~~~~~v~d 143 (480) .+..+.+...|++-++...+.++.+.+..||.+++++...... ..+ ..|. ..+.+++|..+.|... T Consensus 141 ~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~ 220 (537) T protein:vir:10 141 ELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLD 220 (537) T ss_pred cccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEEEechhhcccccc Confidence 1334567778888889999999999999999998877643211 001 1111 2355666666555321 Q ss_pred CCCCCceEEEEEEEEEecCCceEEEEEE----EeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccC Q lcl|NC_021324. 144 PRNTRRVTRAVRLYTTRDDVAVPDRATL----YLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRY 219 (480) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~ 219 (480) ......+. ..+.+.+..+.+ +-+.++++|.... +|-+. ......+ T Consensus 221 ~~~~~dp~--------sp~fg~P~~y~v~g~~iH~SRli~f~g~~-------------------~p~~~----~~~~~~~ 269 (537) T protein:vir:10 221 AQASSNPV--------SMHFYEPTYWLINGKKYHRSHLAIYINDE-------------------VVDFL----KPSYIYG 269 (537) T ss_pred hhhhccCC--------ccccCCceeeeecCeEecceeEEEecCCC-------------------Cchhh----hcccCcc Confidence 11000000 001111111110 1123333322100 11100 1112357 Q ss_pred CCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-cc---hhhh--hcchheecCCCCceEEEe Q lcl|NC_021324. 220 GRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-NT---TLDI--YYGRILTLASEAAKISEF 293 (480) Q Consensus 220 G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~-~~---~~~~--~~~~~~~~~g~~~~~~~~ 293 (480) |.|.+.. +..-+..++++.-..+..+..+....+.+.|...- ..... .. .+.. ....++.++.+.-++.++ T Consensus 270 G~Svlq~-~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l--~~~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~ 346 (537) T protein:vir:10 270 GVPLPQQ-IMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVL--ANKQQFDETMSWWTATRDNYQVRVVDKDNEDVVQI 346 (537) T ss_pred cccHHHH-HHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhh--cCHHHHHHHHHHHHhhcCCcceeEecCCCceeEEE Confidence 8887754 66666667777776666666555555444442211 11111 00 1111 122344555444455554 Q ss_pred ccccHHHHHHHHHHHHHHHHhhcCCChHHh-cccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_021324. 294 KAAELRNFAEEMEVFRKEAASITGLPPQYL-SSSS-ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR 371 (480) Q Consensus 294 ~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~~-~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 371 (480) + .++...-+.+......|++.++||..-| |... +-++||..-..-|...+ +.++..+...+++++.+++....+ T Consensus 347 ~-~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I---~~~Qe~l~p~l~~l~~ll~~~~~~ 422 (537) T protein:vir:10 347 D-TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEEC---ESTQDDMRPLIDRHHQLVCRSHLR 422 (537) T ss_pred e-ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhcCC Confidence 4 2344555666777788999999998755 4332 22356765554444444 333345788999999988765443 Q ss_pred CCcccceeeeEEecCCCCCCHHHHHHHH-------HHHHhccccCCCHHHHHHhCCCChhH-HHHHHH-HHHHHHHHH-H Q lcl|NC_021324. 372 EVTEEYTRLETVWRDPSTPTVAAKADAV-------SKLYANGQGPIPKEQARIDLGYTATQ-REQMRD-WDKQETEDM-I 441 (480) Q Consensus 372 ~~~~~~~~~~v~f~~~~~~~~~~~ad~~-------~kl~~~~~~~~s~e~~~~~~~~~~d~-~~~~~~-~~~~~~~~~-~ 441 (480) . ..++++.|.+....+..+.|+.. .+++++| .++.+.++..|+-.++- ...+-. ...+..++. . T Consensus 423 ~----~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G--~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~ 496 (537) T protein:vir:10 423 K----RIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMG--AVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDV 496 (537) T ss_pred C----CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcC--CCCHHHHHHHHhccCccccccccCCCChhhhhcccC Confidence 2 23588999999889999888753 3444443 56666666554321110 000000 000000000 0 Q ss_pred HHHhhhcccCCCcCCCC--CCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 442 DTLYSTTKAQADATPKP--TVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 442 ~~~~~~~~~~~~~~~~~--~~~d~~~~~~~~~~~~~~~~~~ 480 (480) +....... .....+.+ ..+....+.++.+..++++++- T Consensus 497 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 536 (537) T protein:vir:10 497 DDEGKPVR-IIEDQPAPSEMFGATSSGESANDPRDSGAAFE 536 (537) T ss_pred CccCCcCC-CCCCCCCccccCCCCccccccCCCccCccccC Confidence 00000000 00000100 1111112223334455566666 No 108 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.49 E-value=1.6e-12 Score=85.18 Aligned_cols=398 Identities=10% Similarity=0.053 Sum_probs=187.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhc--cCcc-----hhcCcccc-hhhhcceeccchHHHHHHHHHhhhccCcc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNG--TRRL-----KTIGIGAP-PELAYLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G--~~~i-----~~~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) |-. .+-|...--| .... ...+.... ..+......+.+++++||..+.-+.-+|+ T Consensus 1 ~~~------------------~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~ 62 (437) T protein:vir:52 1 MKF------------------FDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWR 62 (437) T ss_pred Cch------------------hhhhHhHHhcCCCccccceeecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCc Confidence 111 1111111111 1000 00111110 11122344567789999999999999998 Q ss_pred ccCCC---chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccc---cCCCCcee-EEEEEcccceEEEEeCC Q lcl|NC_021324. 73 RISED---SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE---SGDPAGIP-LIRVESPLYMYAELDPR 145 (480) Q Consensus 73 ~~~~d---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~---~~d~~g~~-~~~~~~p~~~~~v~d~~ 145 (480) .+..+ ++..+.++..|++-++...+.++.+.+-.||.|++++..+... ..+..|.+ .++++++.++.|+.-.. T Consensus 63 ~i~~~d~~~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~ 142 (437) T protein:vir:52 63 EIYSNDLNSKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKD 142 (437) T ss_pred eEecCCCCHHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccccccc Confidence 87442 2333578888888788999999999999999999988654321 11112333 35566666555432111 Q ss_pred -CCCceEEEEEEEEEecCCceEEEEEE--------EeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 146 -NTRRVTRAVRLYTTRDDVAVPDRATL--------YLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 146 -~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) +...+ +.+.+..+.+ +-+.++++|... .+| + ... T Consensus 143 ~dp~s~-----------~fg~p~~y~v~~~~~~~~iH~SRii~~~~~-------------------~~~---~----~~~ 185 (437) T protein:vir:52 143 DDVLSP-----------NFGRYSEYSILGGSQSITVHHSRLIILNAN-------------------DAP---L----SDN 185 (437) T ss_pred cccccc-----------ccCcceEEEEecCCcceeEccceeEEecCc-------------------cCC---C----ccc Confidence 11111 1111111111 111222222110 011 1 123 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhh--hcchheecCCCCc Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDI--YYGRILTLASEAA 288 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~------~~~~~--~~~~~~~~~g~~~ 288 (480) ..+|.|.+.. +..-+..++++.-.....+..+..+.+.+.|.... ....... ..+.. ....++.++. +. T Consensus 186 ~~~G~s~le~-~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~-~~ 262 (437) T protein:vir:52 186 DIWGVSDLEK-IIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDK-IAAGMENEVASVISAVQEIKSATNSLLLDA-EN 262 (437) T ss_pred cccCCchHHH-HHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHH-hcCCcHHHHHHHHHHHHHhcCCCceEEEcC-Cc Confidence 4578887754 66666667777666666555555554444442110 1111010 11111 1223334432 23 Q ss_pred eEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccc-cccchHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Q lcl|NC_021324. 289 KISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS-SENPASAEAIIATDSRIVKMAERK-GRIFGGAWERAMRIAM 366 (480) Q Consensus 289 ~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~-~~~~~Sg~Al~~~~~~l~~k~~~~-~~~~~~~l~~~~~li~ 366 (480) ++.+++. ++....+.+.....+|+..++||...|.+. ...-+||..-..-|... +... +..+.+.+++++.+++ T Consensus 263 ~~e~~~~-~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yyd~---i~~~Qe~~l~p~le~l~~~i~ 338 (437) T protein:vir:52 263 EYDRKEL-TFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYHEA---IRRLQETRLRPIFEIIDPLIC 338 (437) T ss_pred ceEEEec-CcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 4544432 233445566677789999999998777554 33335665444344433 3333 3568889999999877 Q ss_pred HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCC-ChhHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021324. 367 QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGY-TATQREQMRDWDKQETEDMIDTLY 445 (480) Q Consensus 367 ~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~-~~d~~~~~~~~~~~~~~~~~~~~~ 445 (480) .-..+.... ++.+.|.+....+..+.|+...+..++- . .+-..|+ +++++.+. + ++. +.+. T Consensus 339 ~~~~g~~~~---~~~~~f~pL~~~s~kekae~~~~~a~a~------~-~~~~~g~i~~~e~r~~--L-~~~-----g~~~ 400 (437) T protein:vir:52 339 NELFGGLPA---DWWFEFVPLTTVKQEQQINMLNTFATAA------N-TLIQNGVLNEYQIANE--L-RES-----GLFA 400 (437) T ss_pred HHhcCCCCC---cceEEeCCcCCcCHHHHHHHHHHHHHHH------H-HHHhcCCCCHHHHHHH--H-Hhc-----CCCC Confidence 655444433 5889999998899998888655544331 1 1122342 33332211 1 110 0000 Q ss_pred hhcccCCCc--CCCCCCC--CCCccCCCCCCCCCccc Q lcl|NC_021324. 446 STTKAQADA--TPKPTVT--ETKTETQTSPSGFNRTK 478 (480) Q Consensus 446 ~~~~~~~~~--~~~~~~~--d~~~~~~~~~~~~~~~~ 478 (480) ........+ ...+.++ ++....++.+.+....+ T Consensus 401 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 401 NISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred CCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 000000000 0000000 00111111111111111 No 109 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.49 E-value=6.1e-12 Score=82.07 Aligned_cols=440 Identities=13% Similarity=0.047 Sum_probs=216.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC----cchhc-C--cccch----hh-------hcceeccchHHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR----RLKTI-G--IGAPP----EL-------AYLDVQPGWVATYLRT 62 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~----~i~~~-~--~~~~~----~~-------~~~~~~~n~~~~iVd~ 62 (480) |-++.=+- + ..++-......||.|.- ..... + ..... .. +++-....|++-+|+. T Consensus 1 ~~~~~~~~--~-----~~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av~~ 73 (530) T protein:vir:38 1 MKIPSLVG--P-----DGKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAVQL 73 (530) T ss_pred Cccceeec--C-----ccccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 43321110 0 11222344556775431 11110 0 00000 00 1222345689999999 Q ss_pred HHhhhccCccccCC-------------CchhHHHHHHHHH---h-----------cCHHHHHHHHHHHHhhcCeeEEEEe Q lcl|NC_021324. 63 LSDRLDIEGFRISE-------------DSEGLEELWNWWQ---A-----------NDLDEESVLGHDDSLTFGRAYITVS 115 (480) Q Consensus 63 ~~~~l~~~~~~~~~-------------d~~~~~~l~~~~~---~-----------n~~~~~~~~~~~~~~~~G~a~~~v~ 115 (480) .+..++|.|++... +++..+.+.+.|+ . .+|...+..+++.....|.+|+.+. T Consensus 74 ~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~ 153 (530) T protein:vir:38 74 HQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQAT 153 (530) T ss_pred HHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEEee Confidence 99999999986421 2223344444443 2 2467788889999999999999876 Q ss_pred cCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccc Q lcl|NC_021324. 116 HPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGD 195 (480) Q Consensus 116 ~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (480) ..... ...--+++..++|+.+---++......+...| +.+..|.+..+-++..+. .+.... ... T Consensus 154 ~~~~~--g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GI----e~d~~Gr~~aY~i~~~~~-------~~~~~~---~~~ 217 (530) T protein:vir:38 154 WDSDS--TRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGV----KINDSGAALGYYVSDDGY-------PGWMAQ---NWT 217 (530) T ss_pred eccCC--CCccceEEEEechhhcCCCCCCCCCCeeEeee----EECCCCceEEEEEeeccC-------CCcccc---ccc Confidence 53211 01112588999998764333322333344444 234455444333332211 000000 001 Q ss_pred ccc--ccccccceEEeecccccCccCCCccchHHHHHHH--HHHHHHHHHHHHHHHHhcchhhhhcCCCc---------- Q lcl|NC_021324. 196 VIK--HGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT--DAASRTLMNLQSASQILGTPLRVISGVTT---------- 261 (480) Q Consensus 196 ~~~--~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~--D~~~~~~s~~~~~~~~~~~p~l~i~G~~~---------- 261 (480) .++ ..++.--|+|+......++..|.|.+++.+..|- +.|.......+.....++. +|+.... T Consensus 218 ~~~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~---fi~~~~~~~~~~~~~~~ 294 (530) T protein:vir:38 218 YIPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAA---TIESELDTQSAMDFILG 294 (530) T ss_pred eeeeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhhee---eeeccCCcccccccccc Confidence 111 2234445889988888889999999988554443 2333333333333322221 2221111 Q ss_pred ---ccccc-----------ccccchhhhhcchheecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccc Q lcl|NC_021324. 262 ---DELTN-----------DGENTTLDIYYGRILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS 326 (480) Q Consensus 262 ---~~~~~-----------~~~~~~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~ 326 (480) ..... ......+.+.+|.+..+. |.+.++.+.+. ...+|...++.+++.|+...++|-+.|.++ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~ye~lt~D 373 (530) T protein:vir:38 295 ADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQD-TDNGYSTFEQSLLRYIAAGLGVSYEQLSRN 373 (530) T ss_pred CCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhcc Confidence 00000 000112345666666653 44445544332 234566778888999999999999999776 Q ss_pred cccchHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--HHHHhCCCCcc------cc-----eeeeEEecCCCC--C Q lcl|NC_021324. 327 SENPASAEAIIATDSRIVKMAERKGRIFGGAW-ERAMRI--AMQIMGREVTE------EY-----TRLETVWRDPST--P 390 (480) Q Consensus 327 ~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l-~~~~~l--i~~~~~~~~~~------~~-----~~~~v~f~~~~~--~ 390 (480) ..+ +|-.|.|+.+..........+..|...+ ..+++. -.+++.+.++. ++ .-+.+.|..|-. . T Consensus 374 ~s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~i 452 (530) T protein:vir:38 374 YSQ-MSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAI 452 (530) T ss_pred ccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCcccc Confidence 422 2333567777777776766666665543 222221 12233332221 11 124567755443 4 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCC Q lcl|NC_021324. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTS 470 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~ 470 (480) |....+.+....+.+| +.|.+..+...|...+++-+ ++.++.+. .+.+. ..-...+...+..+ ....++.+ T Consensus 453 DP~Ke~~a~~~~i~~G--~~s~~~~~a~~G~D~~~v~~--q~a~e~~~--~~~~G--l~~~~~~~~~~~~~-~~~~~~~~ 523 (530) T protein:vir:38 453 DGLKEVQEAVMLIEAG--LSTYEKECAKRGDDYQEIFA--QQVRESME--RRAAG--LNPPAWAAAAFEAG-VKKSNEEE 523 (530) T ss_pred ChHHHHHHHHHHHHcC--CCCHHHHHHHcCCCHHHHHH--HHHHHHHH--HHHcC--CCCCCCcccccCCC-CCCCCCCC Confidence 5566777777777776 66777888888987665432 22222111 11111 11111111122111 12222222 Q ss_pred CCCCCcc Q lcl|NC_021324. 471 PSGFNRT 477 (480) Q Consensus 471 ~~~~~~~ 477 (480) +.+++++ T Consensus 524 ~d~~~~a 530 (530) T protein:vir:38 524 QDGARAA 530 (530) T ss_pred CCCCCCC Confidence 3344444 No 110 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.48 E-value=1.3e-13 Score=91.25 Aligned_cols=401 Identities=13% Similarity=0.122 Sum_probs=187.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc-hhcC--cc-cchhhhcceeccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRL-KTIG--IG-APPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i-~~~~--~~-~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |+- .+-|....-|-+.- ...+ .. ...........+.+++++||..+.-+.-+|+.+.+ T Consensus 1 ~~~------------------~D~~~n~~~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~~~~Vd~~aed~~r~g~~i~~ 62 (422) T protein:vir:10 1 MVK------------------TDSYANIFLGGSDGSEIYGSLQNQAPTILASLYADNALVRRIIDTIPETALAAGFHIDG 62 (422) T ss_pred Ccc------------------chhhHHHHcCCCCCccccCcccccCHHHHHHHHHhChhhHHHHhhhhHHHhcCCccccC Confidence 322 22222222232110 0000 00 01112233445677899999999999999999877 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc----CCCCcee-EEEEEcccceEEEEeCCCCCceE Q lcl|NC_021324. 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES----GDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 77 d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~----~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 151 (480) ++. .+.+..-|++-++...+.++.+.+..||.|++++-..+... .+..|.+ .+.+++|.++.|..-..+...+. T Consensus 63 ~~~-~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~ 141 (422) T protein:vir:10 63 IDD-EPAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNAR 141 (422) T ss_pred CCH-HHHHHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccc Confidence 653 34567777777889999999999999999988886533221 1233333 45556665554421001111111 Q ss_pred EEE-EEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHH Q lcl|NC_021324. 152 RAV-RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 152 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) +.. .+|......+.. -..+ -+.++++|... .+|-. -......||.|.|.+.+.. T Consensus 142 fg~P~~y~v~~~~~~~-~~~i-H~SRli~~~g~-------------------~~p~~----~~~~~~~~G~S~l~~~~~~ 196 (422) T protein:vir:10 142 FGEPLTYRITTNESDM-FYDV-HYSRIHIIDGE-------------------RIPNV----MRRQNDGWGRSVLSSDILD 196 (422) T ss_pred cCcceEEEEecCCCCc-ceee-ccceeEEeCCC-------------------Cchhh----hcccCCcccchhHHHHHHH Confidence 100 111110000000 0000 11122221100 01110 0122446899887654555 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc------chhhhh--cchheecCCCCceEEEeccccHHHHH Q lcl|NC_021324. 231 VTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN------TTLDIY--YGRILTLASEAAKISEFKAAELRNFA 302 (480) Q Consensus 231 l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~------~~~~~~--~~~~~~~~g~~~~~~~~~~~~~~~~~ 302 (480) -+..++++.......+..+....+.+.|............ ..+... ....+.+.+++.++.+++. ++.... T Consensus 197 ~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~-~lsgl~ 275 (422) T protein:vir:10 197 SIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS-DIGGID 275 (422) T ss_pred HHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEec-ccCChH Confidence 5566667666666655555554444443211000111110 001111 1112233344344544432 244455 Q ss_pred HHHHHHHHHHHhhcCCChHHhcccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceee Q lcl|NC_021324. 303 EEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRL 380 (480) Q Consensus 303 ~~l~~~~~~i~~~~~~p~~~~g~~~-~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~ 380 (480) +.+.....+|++.++||..-|.+.+ .. ++||..-..-|...+. ..++..+.+.+++++.+++. . .++ T Consensus 276 ~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~--~~Qe~~l~p~l~~l~~~i~~------s---~~~ 344 (422) T protein:vir:10 276 AFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVD--RKRNAELLPILEFLIPFIVN------A---EEW 344 (422) T ss_pred HHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhcc------c---CCc Confidence 6677777899999999987664443 22 2456554444444443 23456778889998888763 1 258 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCC-ChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGY-TATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~-~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) .+.|.+-..++..+.|+...+.+++- .++. ..|. +++++.+. + +.... .........+ .... T Consensus 345 ~~~f~pL~~~sekekaei~~~~a~a~------~~~~-~~g~i~~~e~r~~--L-~~~~~------~~~~~~~~~~-~~~~ 407 (422) T protein:vir:10 345 SVEFNPLAQESSKDKAEILEKNVNSI------AALI-AAGAMDIDEARDT--L-RTIAP------EVKINDGSVE-TEVT 407 (422) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHH------HHHH-hcCCCCHHHHHHH--h-hhhcc------cccCCCCCCc-cccc Confidence 89999999999999998765554432 2222 2343 34433211 1 11100 0000000001 0010 Q ss_pred CCCCCccCCCCCCCC Q lcl|NC_021324. 460 VTETKTETQTSPSGF 474 (480) Q Consensus 460 ~~d~~~~~~~~~~~~ 474 (480) ..+........|+++ T Consensus 408 ~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 408 ISETSNDPLEVPTDD 422 (422) T ss_pred hhhcCCCCCCCCCCC Confidence 011111111222222 No 111 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.46 E-value=8.8e-14 Score=92.16 Aligned_cols=468 Identities=13% Similarity=0.030 Sum_probs=204.9 Q ss_pred CCC-HHHHHHHHHHHHHHHHH-----H--HHHHHHHh--hccCcchhcCcccc-hh--hhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTT-YHEHVERLQGLLARDLP-----N--LLEAEAYR--NGTRRLKTIGIGAP-PE--LAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~-~~~~i~~l~~~~~~~~~-----~--~~~~~~YY--~G~~~i~~~~~~~~-~~--~~~~~~~~n~~~~iVd~~~~~l 67 (480) |++ -.+++.++...+..... | ...-.+|| +|.|-......... .. ...-.+++|.++.+|+..+++. T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~v~g~~ 80 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINKVATELNRIISEY 80 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecchHHHHHHHhhHH Confidence 998 46688887776653221 1 12223555 67775433211110 00 0123678899999999999887 Q ss_pred ccCccc---cC----CCchhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccC---CCCceeEEEEE Q lcl|NC_021324. 68 DIEGFR---IS----EDSEGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG---DPAGIPLIRVE 133 (480) Q Consensus 68 ~~~~~~---~~----~d~~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~---d~~g~~~~~~~ 133 (480) .-+-.. .+ +|.+..+. +..++..|+.+.....++.+++++|.||+-|..+-..+. ..++.+.+..+ T Consensus 81 ~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i~i~~v 160 (706) T protein:vir:10 81 RNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRIAVEPI 160 (706) T ss_pred HhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccceeeee Confidence 543221 12 23333444 344556789999999999999999999988854422221 12334444433 Q ss_pred -cccceEEEEeCCCCC----ceEEEEEE-EEEec--------------------------CCceEEEEEEEeCCc----E Q lcl|NC_021324. 134 -SPLYMYAELDPRNTR----RVTRAVRL-YTTRD--------------------------DVAVPDRATLYLPDE----T 177 (480) Q Consensus 134 -~p~~~~~v~d~~~~~----~~~~~~~~-~~~~~--------------------------~~~~~~~~~~~~~~~----~ 177 (480) +|.+. +.||+.... ...++++. |.+.+ ........+.|+... + T Consensus 161 ~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~~~~~~ 239 (706) T protein:vir:10 161 YDPARS-VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVRKESVDV 239 (706) T ss_pred ccchhc-eecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccccceeEEE Confidence 45532 246654211 11122211 11110 001112222233221 1 Q ss_pred EEEEecCCCcc---------------------------------------ccccccccccccccccceEEeecc----cc Q lcl|NC_021324. 178 VPLRRNGGLND---------------------------------------QWVVDGDVIKHGLGVVPVVPLTND----PR 214 (480) Q Consensus 178 ~~~~~~~~~~~---------------------------------------~~~~~~~~~~~~~g~iPvv~~~n~----~~ 214 (480) ..|........ +.....++.|-+.+++|+|+|.-. .. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r~~~d~ 319 (706) T protein:vir:10 240 ISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKRWFIDD 319 (706) T ss_pred EEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeeccccccc Confidence 11111100000 000111222333477888877532 22 Q ss_pred cCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc-cccccchhhhhcchh-e----ecCCCC- Q lcl|NC_021324. 215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT-NDGENTTLDIYYGRI-L----TLASEA- 287 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~-~~~~~~~~~~~~~~~-~----~~~g~~- 287 (480) ...+||. .+.+++.++.+|..+|.+.+.+..... ..-.|....... ...+..........+ + ..+|.- T Consensus 320 ~~~~~G~---vr~~~d~Q~~~N~~~s~~~~~~~~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~i~ 394 (706) T protein:vir:10 320 VERVEGH---IAKAMDPQRLYNLQVSMLADAAAQDPG--QTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGNVV 394 (706) T ss_pred cCcccce---eccchhhHHHHHHHHHHHHHHHHhcCC--cccccchhHHHHHHHHhhhcccccccchhcccccCCCCccc Confidence 3446664 367899999999999999987643322 222221110000 000000000000000 0 001110 Q ss_pred ---ceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 288 ---AKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 288 ---~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) ..+..++. .-.+.+...++.....|-.++|+++..+|..+ | .||+|+...-..-..........|..+.+++.+ T Consensus 395 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~s-n-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~ 472 (706) T protein:vir:10 395 APANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPS-N-VARETVNSLLNRSDMASFIYLDNMAKSLKRAGE 472 (706) T ss_pred ccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCcc-c-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111122 23445677777778888889999999888543 4 799999998887777777777788888877766 Q ss_pred HHHHH----hCC---------CCcccc-------------------------eeeeEEecCCCCCCHHHHHHHHHHHHhc Q lcl|NC_021324. 364 IAMQI----MGR---------EVTEEY-------------------------TRLETVWRDPSTPTVAAKADAVSKLYAN 405 (480) Q Consensus 364 li~~~----~~~---------~~~~~~-------------------------~~~~v~f~~~~~~~~~~~ad~~~kl~~~ 405 (480) +++.+ .+. ....++ .++.|.=.+..+.-..+..+.++++.++ T Consensus 473 ~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el~~~ 552 (706) T protein:vir:10 473 IWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQLLQG 552 (706) T ss_pred HHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHHHHh Confidence 55432 221 111000 0111111112222234566677777776 Q ss_pred cccCCCHH-----HHHHhCCC--ChhHHHHHHHHH---------------------HHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 406 GQGPIPKE-----QARIDLGY--TATQREQMRDWD---------------------KQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 406 ~~~~~s~e-----~~~~~~~~--~~d~~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) +....+.. .+++.+.+ .++-.+++.+.. +++......++.. . +..... T Consensus 553 ~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~-~--~aq~~~- 628 (706) T protein:vir:10 553 MLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLL-A--QAQMVV- 628 (706) T ss_pred cCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHH-H--HHHHHH- Confidence 53321111 12333322 122222221100 0000000000000 0 000000 Q ss_pred CCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) .++.-.+.+.+..-......+.+ T Consensus 629 ~qA~~~k~~a~~~q~~~~a~~a~ 651 (706) T protein:vir:10 629 AQAEAQKSQNETVQTQIKAFTAQ 651 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH Confidence 00000000000000000000000 No 112 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.43 E-value=3.8e-12 Score=83.22 Aligned_cols=442 Identities=11% Similarity=0.019 Sum_probs=216.3 Q ss_pred CCCHHHHHHHHHHHHHHHH-------HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC--- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDL-------PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE--- 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~-------~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~--- 70 (480) |++..+|+.+++.+|..-. ...+.+.+|-... +.+..+... -.+ .+++.+|-.-.+++++..+++.- T Consensus 15 ~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~-~tr~t~~~~-~~w-~~s~t~~k~~~~~~~l~a~~~~~~fp 91 (599) T protein:vir:31 15 RDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDAT-DTRKTSNSK-LPF-KNSTTINKLAHLHLMITTSYMEHLLP 91 (599) T ss_pred cCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhh-cccccccCC-CCc-ccccchHHHHHHHHHHHHHHHhhhcC Confidence 8999999777777764322 2246677775442 333332221 111 35677787788888877766422 Q ss_pred --------ccccCCCch-hHHHH----HHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc-------CCCCceeEE Q lcl|NC_021324. 71 --------GFRISEDSE-GLEEL----WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES-------GDPAGIPLI 130 (480) Q Consensus 71 --------~~~~~~d~~-~~~~l----~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~-------~d~~g~~~~ 130 (480) |+...++.. ..+.+ ++-+.+.+|...+..+..+...+|-||..+......- ...-..|++ T Consensus 92 ~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~~~d~~v~~~~~~P~~ 171 (599) T protein:vir:31 92 NRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTVTAENQVIKNYSGTVT 171 (599) T ss_pred CccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcceeecccccccccccceE Confidence 222222211 12223 3334456888999999999999999988876221110 011234899 Q ss_pred EEEcccceEEEEeCCCCCceEEEEEEEEEecCCc---------------------eE----------------------- Q lcl|NC_021324. 131 RVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVA---------------------VP----------------------- 166 (480) Q Consensus 131 ~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~---------------------~~----------------------- 166 (480) ..++|.++|+=-+........+++|.+....+.. .+ T Consensus 172 ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~~d~~~~~~g~D~~~~d 251 (599) T protein:vir:31 172 ERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREALADGYNGRRKFDSLHKK 251 (599) T ss_pred EeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccccchhhhhhhccccccc Confidence 9999998874332222223333344443111000 00 Q ss_pred ---EEEEEEeCCcE--E-----EEEecCCCcc----------ccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 167 ---DRATLYLPDET--V-----PLRRNGGLND----------QWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 167 ---~~~~~~~~~~~--~-----~~~~~~~~~~----------~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) ...++|.++.+ . .|...++... +.....+..|.+.|..|++.....+...+.||.+.+.. T Consensus 252 ~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~~~P~~~~~yG~G~l~~ 331 (599) T protein:vir:31 252 GYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAVYEFQKDTLCPIGPLHR 331 (599) T ss_pred cccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEEeeeeccccCCCCCchh Confidence 00011111110 0 0111111100 00112233456678899999999999999999998854 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCCCceEEEe-ccc---cHHHHH Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEF-KAA---ELRNFA 302 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~---~~~~~~ 302 (480) +..+++.+|.+.-.+.+..+.+..|++...|.. ...+ +...+++++-.. +.+.++.+ +++ +++..+ T Consensus 332 -~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl-~~eD-------~~~~P~~v~~~~-d~~~vq~~~p~s~~~~a~~~i 401 (599) T protein:vir:31 332 -LTGMQYKLDKRENFREDLHDRFLHPSLKKVGDV-REKG-------MRGGPNHVFEVE-ETGDVQYMTPPAEVLQPDNQL 401 (599) T ss_pred -cchHHHHHHHHHHHhhhhhhhhhcccccccccc-cccC-------ccCCCCcceeec-CCCccccccCchhhhhHHHHH Confidence 889999999998888888888889988876642 1111 122355555432 22333322 222 344444 Q ss_pred HHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH----HhCCCC---- Q lcl|NC_021324. 303 EEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE-RAMRIAMQ----IMGREV---- 373 (480) Q Consensus 303 ~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~----~~~~~~---- 373 (480) +.++.. +-..+|+|....|..+.....+..+..+..........+.+.|...+- .+++.++. +.+... T Consensus 402 s~~e~~---mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~l~e~~~~f~D~~~tiri 478 (599) T protein:vir:31 402 SITLQL---MEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLNDYLEQGRNHLDASDTIKT 478 (599) T ss_pred HHHHHH---HHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceee Confidence 444443 445678889888876554445556676777777766777777777653 35553332 222110 Q ss_pred ------cccceee-------eEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCH---HHHHHhC---------CCCh Q lcl|NC_021324. 374 ------TEEYTRL-------ETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPK---EQARIDL---------GYTA 423 (480) Q Consensus 374 ------~~~~~~~-------~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~---e~~~~~~---------~~~~ 423 (480) ...+.++ .+.+.+.-..-+.++++.+.++.+- |.++.+. +-+...+ ++-. T Consensus 479 ~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l~~~l~~~~~l~~~~~~~ 558 (599) T protein:vir:31 479 FNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKLFNAVEYLGDLDAYGIFT 558 (599) T ss_pred ecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHHHHHHHHHHhccccccCC Confidence 0111111 1122222233345555555544432 2333332 1111111 1111 Q ss_pred hH-----HHHHHHHHHHHHHH-HHHHHhhhcccCCCcCCCCCCCCCCccCC Q lcl|NC_021324. 424 TQ-----REQMRDWDKQETED-MIDTLYSTTKAQADATPKPTVTETKTETQ 468 (480) Q Consensus 424 d~-----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 468 (480) .. .+.+.++.+++.++ .-.++....-+. ...+.++ T Consensus 559 ~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~----------~~~~~~~ 599 (599) T protein:vir:31 559 FGIGVQEDQQLARMAQKSTQQTEETALTQEEVGG----------PTTDTGQ 599 (599) T ss_pred CchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCC----------CCcccCC Confidence 11 11111111111110 011111111111 1111111 No 113 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.37 E-value=2.1e-12 Score=84.55 Aligned_cols=467 Identities=13% Similarity=0.062 Sum_probs=198.9 Q ss_pred CCCH-HHHHHHHHHHHHHHHH-----HHHH--HHHHh--hccCcchhcCc---ccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLP-----NLLE--AEAYR--NGTRRLKTIGI---GAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~-----~~~~--~~~YY--~G~~~i~~~~~---~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |++. ++++.++...+..... |-+. -.+|| +|.+--...-. ...+.-..-.+++|.++.+|+..+++. T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~v~g~~ 80 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINKISTELNRIISEY 80 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEccHHHHHHHHHhHH Confidence 9987 6788888776654331 2222 23566 57764321100 000000122467899999999998877 Q ss_pred ccCccc---cC----CCchhHHHH----HHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCC---ceeEEEEE Q lcl|NC_021324. 68 DIEGFR---IS----EDSEGLEEL----WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPA---GIPLIRVE 133 (480) Q Consensus 68 ~~~~~~---~~----~d~~~~~~l----~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~---g~~~~~~~ 133 (480) .-+-.. .+ +|.+..+.| ..+...|+.+.....++.+++++|.||+-|..+-....|.+ ..+++..+ T Consensus 81 ~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i~i~~v 160 (720) T protein:vir:35 81 RHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRICLEPI 160 (720) T ss_pred HhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccceeeEecc Confidence 433221 12 233334444 44556689999999999999999999998865422222222 23444432 Q ss_pred -cc-cceEEEEeCCCCCc----eEEE-EEEEEEec--------------------------CCceEEEEEEEeCCcE--- Q lcl|NC_021324. 134 -SP-LYMYAELDPRNTRR----VTRA-VRLYTTRD--------------------------DVAVPDRATLYLPDET--- 177 (480) Q Consensus 134 -~p-~~~~~v~d~~~~~~----~~~~-~~~~~~~~--------------------------~~~~~~~~~~~~~~~~--- 177 (480) +| .++ .||+..... ..++ ++.|.+.+ +......+++|....+ T Consensus 161 ~~~~~~v--~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~~~~~~ 238 (720) T protein:vir:35 161 YDPARSV--WFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVKKESVD 238 (720) T ss_pred cCchhhe--eecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEEEEEEEE Confidence 22 233 345432210 0111 11111100 1112233333332221 Q ss_pred ------------EEEEecCC----------C----------c--------cccccccccccccccccceEEeecc----c Q lcl|NC_021324. 178 ------------VPLRRNGG----------L----------N--------DQWVVDGDVIKHGLGVVPVVPLTND----P 213 (480) Q Consensus 178 ------------~~~~~~~~----------~----------~--------~~~~~~~~~~~~~~g~iPvv~~~n~----~ 213 (480) +.|..... + . .+.....++.+-+++++|+|+|.-. . T Consensus 239 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g~r~~~d 318 (720) T protein:vir:35 239 VVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYGKRWFID 318 (720) T ss_pred EEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEeeeeccC Confidence 11111000 0 0 0001112223445677888887532 2 Q ss_pred ccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccc-cccccch-------hh-----hhcchh Q lcl|NC_021324. 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELT-NDGENTT-------LD-----IYYGRI 280 (480) Q Consensus 214 ~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~-~~~~~~~-------~~-----~~~~~~ 280 (480) ....+||. .+.+++.+|.+|..+|.+...+. ..+...-.|....... ....... +. ...|.+ T Consensus 319 ~~~~~~G~---vr~~kd~Q~~~N~~~s~~~~~~~--~~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~G~~ 393 (720) T protein:vir:35 319 DIERVEGH---IAKAMDAQRLYNLQVSMLADSAT--QDTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQGNI 393 (720) T ss_pred CCccccee---eecchhHHHHHHHHHHHHHHHHH--cCCccccccCcchHHHHHHHhhccccccccccccccccccCccc Confidence 22223553 35689999999999999999885 3443333332111000 0000000 00 001111 Q ss_pred eecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 281 LTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 281 ~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) ..- ...+...+. .-.+.+...++.....|-.++|+++..+|..+ | .||+|+..+-..-..........+..+.+ T Consensus 394 ~~~---~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~s-n-~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~ 468 (720) T protein:vir:35 394 IAP---PTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPS-N-IAKETVNHLMHRSDMSSFIYLDNMAKSLK 468 (720) T ss_pred ccC---CCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCccc-c-hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 112322222 23445677777778888889999999998643 4 79999998766666666666666666666 Q ss_pred HHHHHHH----HHhCC---------CCcccce-----------------------eeeEEec-CCCCCCH-HHHHHHHHH Q lcl|NC_021324. 360 RAMRIAM----QIMGR---------EVTEEYT-----------------------RLETVWR-DPSTPTV-AAKADAVSK 401 (480) Q Consensus 360 ~~~~li~----~~~~~---------~~~~~~~-----------------------~~~v~f~-~~~~~~~-~~~ad~~~k 401 (480) ++.++++ .+.+. ....... +.+|.-. -|...+. .+.+..+++ T Consensus 469 ~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~q 548 (720) T protein:vir:35 469 RAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLTN 548 (720) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHHH Confidence 6655544 33221 1111100 0111111 1222332 344556666 Q ss_pred HHhccccCCCH-----HHHHHhCCCC--hhHHHHHHHHH--------------------HHHHHHH---HHHHhhh-ccc Q lcl|NC_021324. 402 LYANGQGPIPK-----EQARIDLGYT--ATQREQMRDWD--------------------KQETEDM---IDTLYST-TKA 450 (480) Q Consensus 402 l~~~~~~~~s~-----e~~~~~~~~~--~d~~~~~~~~~--------------------~~~~~~~---~~~~~~~-~~~ 450 (480) +..+...-.+. ..+++.+.+. ++-.+++.+.. .++.+.. .....+. .++ T Consensus 549 ll~~~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~qa 628 (720) T protein:vir:35 549 LLAGMLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLMQG 628 (720) T ss_pred HHHhcCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHHHH Confidence 65543211110 1122333222 11111111100 0000000 0000000 000 Q ss_pred CCCcCCCCCCCCCCccCC--CCCCCCCcccCC Q lcl|NC_021324. 451 QADATPKPTVTETKTETQ--TSPSGFNRTKTR 480 (480) Q Consensus 451 ~~~~~~~~~~~d~~~~~~--~~~~~~~~~~~~ 480 (480) +.... ............ ..-+...-.+.| T Consensus 629 qae~~-kaqa~~~~~qa~a~~aqa~a~~~~a~ 659 (720) T protein:vir:35 629 QAEVQ-KAKNEELAIQVKAFQAQTEARVAEAK 659 (720) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000 000000000000 000000000000 No 114 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.35 E-value=7.4e-11 Score=76.12 Aligned_cols=439 Identities=13% Similarity=0.088 Sum_probs=207.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--CcchhcCcc--cchh-------hhcceeccchHHHHHHHHHhhhcc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGT--RRLKTIGIG--APPE-------LAYLDVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~--~~i~~~~~~--~~~~-------~~~~~~~~n~~~~iVd~~~~~l~~ 69 (480) |++- -++. +......+..+ .-|+..=.|. +..+..+.. +... -+++-..+.|++-+|+..++.++| T Consensus 3 ~~~~-~~~a-~~~~~~~~~~~-~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~~~vVG 79 (495) T protein:vir:10 3 MTPS-GYQS-LASGLLVPVGA-SAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWVAAAVG 79 (495) T ss_pred cccc-cccc-cchhhhhHHHh-hhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHhhcC Confidence 4332 2211 11111111100 1111111111 111111100 0000 112233456899999999999999 Q ss_pred Ccccc---CCCchhHHHHHHHH---Hh-------cCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEccc Q lcl|NC_021324. 70 EGFRI---SEDSEGLEELWNWW---QA-------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPL 136 (480) Q Consensus 70 ~~~~~---~~d~~~~~~l~~~~---~~-------n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~ 136 (480) .||+. ..+++..+.|...| .+ .+|...+..+++.....|.+|+.+...... .....-+++..++|+ T Consensus 80 ~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~-~g~~~~~~lqliepd 158 (495) T protein:vir:10 80 NGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLS-EGLSVPLQLQIIEPD 158 (495) T ss_pred CCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccC-CCCccceEEEEechh Confidence 99874 34555555555544 32 357778888999999999999876543211 001123689999999 Q ss_pred ceE-EEEeC--CCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc---eEEee Q lcl|NC_021324. 137 YMY-AELDP--RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP---VVPLT 210 (480) Q Consensus 137 ~~~-~v~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~ 210 (480) .+- |.-+. .....+...|. .+..|.+..+-++..+- +.. .... ....+-+|| |+|+. T Consensus 159 ~l~~~~~~~~~~~g~~i~~GIe----~d~~Gr~vaY~i~~~hp--------gd~---~~~~--~~~~~~rvpA~~vlH~f 221 (495) T protein:vir:10 159 MLASDIPDETLPSGGYVKGGIR----FSNGGKRKAYCFYRNHP--------AES---SLIG--DPVDTVWIKAEHVLHVT 221 (495) T ss_pred hcCCCCCCCCCCCCCEEEeceE----ECCCCceEEEEEeecCC--------Ccc---cccc--cccceeeechhheEecc Confidence 874 33221 12223444443 24444443333333211 100 0000 011233455 67775 Q ss_pred cccccCccCCCccchHHHHHHH--HHHHHHHHHHHHHHHHhcchhhhhcCCCcccc---------ccccccchhhhhcch Q lcl|NC_021324. 211 NDPRLGNRYGRSEISPELRKVT--DAASRTLMNLQSASQILGTPLRVISGVTTDEL---------TNDGENTTLDIYYGR 279 (480) Q Consensus 211 n~~~~~~~~G~s~i~~~v~~l~--D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~---------~~~~~~~~~~~~~~~ 279 (480) . .+.++..|.|.++. ++.|- +.|.......+.....+ -.+|+...++.. .+........+.+|. T Consensus 222 ~-~r~gQ~RGis~la~-i~~l~~l~~y~dael~~a~i~A~~---~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~ 296 (495) T protein:vir:10 222 V-LTVRSDAGAPWFQL-LLRLNELDQYEDAELVRKKTAALF---AAFIQEATADSTGGPTIGQPKRSKGGKRITGLNPGT 296 (495) T ss_pred c-cCCCcccCcchhHH-HHHHHHhhHHHHHHHHHHHHhhhh---eeeeecCCCccccccccCccccccCcccceecCCce Confidence 3 56788889998864 66652 23333333333322222 123332211111 111122234466777 Q ss_pred heecC-CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_021324. 280 ILTLA-SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGR-IFGGA 357 (480) Q Consensus 280 ~~~~~-g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~-~~~~~ 357 (480) +..+. |.+.++.+.+. ...+|...++.+++.|+...|+|-+.|.++..+ +|-.|+|..+......+...+. .+... T Consensus 297 i~~L~pGe~i~~~~p~~-p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~-~nYSS~R~~~~e~~r~~~~~q~~~~~~~ 374 (495) T protein:vir:10 297 LQYLQPGQEVKFSNPAD-VGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRG-VNYSSIRAGLLEFRRLCQQVQHHMIIHQ 374 (495) T ss_pred eeecCCCCeeeeeCCCC-CCCCHHHHHHHHHHHHHhhcCCCHHHHhccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 76664 44455544322 234677778888899999999999998776432 2333567666666666655443 34433 Q ss_pred H-HHHHHHH--HHHhCCCCc-ccc-----eeeeEEecCCCC--CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH Q lcl|NC_021324. 358 W-ERAMRIA--MQIMGREVT-EEY-----TRLETVWRDPST--PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR 426 (480) Q Consensus 358 l-~~~~~li--~~~~~~~~~-~~~-----~~~~v~f~~~~~--~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~ 426 (480) + +.+++.. ..++.+.+. .++ .-+.+.|..|-. .|....+.+....+.+| +.|.+..+...|...+++ T Consensus 375 ~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G--~~s~~~~~a~~G~D~~~v 452 (495) T protein:vir:10 375 FCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAG--FAPISDKQAERGYDMEEL 452 (495) T ss_pred HHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcC--CCCHHHHHHHcCCCHHHH Confidence 3 2233321 223333222 111 124567755543 46667777777777776 667788888889877654 Q ss_pred HHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 427 EQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 427 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) -+. +.++.+. .+.+. ..-..++...+..+ ...+...+++..|+ T Consensus 453 ~~q--~a~e~~~--~~~~G--l~~~~~p~~~~~~~-~~~~~~~~~~~~~e 495 (495) T protein:vir:10 453 FDM--ISDANQL--IDEYD--LRLDSDPRYVNGSG-AEQKSVMEAALNNE 495 (495) T ss_pred HHH--HHHHHHH--HHHcC--CCCCCCCCcCCCcc-CCCCCCCCCCCCCC Confidence 322 2222111 11111 11111111111100 00011111111111 No 115 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.32 E-value=1.2e-12 Score=85.86 Aligned_cols=424 Identities=14% Similarity=0.062 Sum_probs=183.3 Q ss_pred CCCHHHHHHHHHHHHHHHH-HHHHHHHHHhhccCcchhcCcccch-hhhcceeccchHHHHHHHHHhhhccCccccCC-- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGIGAPP-ELAYLDVQPGWVATYLRTLSDRLDIEGFRISE-- 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~-~~~~~~~~YY~G~~~i~~~~~~~~~-~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~-- 76 (480) |+.+..++..|-..+.... .....+..+|... ..+. ++-.+...+.++++|||..++-+.-+|+.+.. T Consensus 100 ~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~--------~f~gyql~alY~~~~larkiVd~pAeDatR~g~~I~~~~ 171 (862) T protein:vir:99 100 MDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQ--------GFIGHQACALIAQHWLVDKACSLAGEDAIRNGWHLKSLG 171 (862) T ss_pred hhcchhhhhhccccccccccccchhcccccccc--------CcccHHHHHHHHhCchhhhhhhhhhHHHhhCCceEeecC Confidence 1222222221111111000 0000111111110 0000 11123344677899999999999888887643 Q ss_pred -----CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecC--ccc----cCC----CCcee-EEEEEcccceEE Q lcl|NC_021324. 77 -----DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHP--DVE----SGD----PAGIP-LIRVESPLYMYA 140 (480) Q Consensus 77 -----d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~--~~~----~~d----~~g~~-~~~~~~p~~~~~ 140 (480) +++..+.|.+.|++-++...+.++.+.+-.||.+++++... +.. ..+ ..|.+ -|.+++|..+.| T Consensus 172 d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p 251 (862) T protein:vir:99 172 EGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMP 251 (862) T ss_pred cccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeEEEEechhhhcc Confidence 22345678888888888999999999999999886665321 110 011 11222 355566655554 Q ss_pred EEe---CCCCCceEEEEEEEEEecCCceEEEEEE----EeCCcEEEEEecCCCccccccccccccccccccceEEeeccc Q lcl|NC_021324. 141 ELD---PRNTRRVTRAVRLYTTRDDVAVPDRATL----YLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP 213 (480) Q Consensus 141 v~d---~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~ 213 (480) .-. ..+.... +.+.+..+.+ +-+.++++|.. . .+|- +. . T Consensus 252 ~~v~~~~~Dp~sp-----------~yGkP~~y~I~g~~IH~SRliif~g------------~-------~vpd--~l--k 297 (862) T protein:vir:99 252 MLTAESTADPSSQ-----------FFYEPEFWIISGQKYHRSHLIIARG------------P-------QPAD--IL--K 297 (862) T ss_pred ccccccccccccc-----------ccCCceeeeecCeeeccceeEEecC------------C-------Cchh--hh--h Confidence 210 0010000 1111111111 11222222211 0 0111 00 0 Q ss_pred ccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc---chhhhhc--chheecCCCCc Q lcl|NC_021324. 214 RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN---TTLDIYY--GRILTLASEAA 288 (480) Q Consensus 214 ~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~---~~~~~~~--~~~~~~~g~~~ 288 (480) .....+|.|.+.. +...+..++++.......+..+....+.+.+..... .++.-. ..+.... ..++.++. +- T Consensus 298 ~ay~f~G~SvLe~-iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~-~ed~l~~r~~~~~~~rdN~Gi~liD~-eE 374 (862) T protein:vir:99 298 PTYIFGGIPLVQR-IYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIA-NEDKFIQRLMFWVRYRDNHAVKVLGT-DE 374 (862) T ss_pred ccCCccCccHHHH-HHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhc-cHHHHHHHHHHHHhccCcceeEEecC-CC Confidence 1123579988754 666666777776666666655555444443332111 001000 1111111 12333332 23 Q ss_pred eEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHh-ccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 289 KISEFKAAELRNFAEEMEVFRKEAASITGLPPQYL-SSS-SENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM 366 (480) Q Consensus 289 ~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-g~~-~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 366 (480) ++.+++. ++...-+.+.....+|++.++||..-| |.. .+-++||..-..-|...+. ...+..+.+.|++++.+++ T Consensus 375 e~e~ls~-slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~--s~QE~~L~P~LerL~~li~ 451 (862) T protein:vir:99 375 TMEQFDT-SLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELE--SIQEHVYMPFLQRHYLISR 451 (862) T ss_pred ceeEEec-ccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH Confidence 4443332 233445556666778999999998755 433 2334677654444444444 2234668888888877664 Q ss_pred HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHH-------HHHhccccCCCHHHHHHhC------CCChhHHHHHHHHH Q lcl|NC_021324. 367 QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVS-------KLYANGQGPIPKEQARIDL------GYTATQREQMRDWD 433 (480) Q Consensus 367 ~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~-------kl~~~~~~~~s~e~~~~~~------~~~~d~~~~~~~~~ 433 (480) .-.+ .. .++.+.|.+....+..+.|+... +++++| .++.+.++..| |+..-+.+.+++.. T Consensus 452 ~~lg--~~---~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sG--vispdEvR~~L~~~~~~g~~~l~ded~E~d~ 524 (862) T protein:vir:99 452 LSLG--IQ---HEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGG--VISPDEERNRIRDDKRSGYNRLTKEDAEETP 524 (862) T ss_pred HhcC--CC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHHhcCCcCCCCCCcccccccC Confidence 3333 22 35889999999999988887643 344443 55665555543 33211111111000 Q ss_pred HHHHHHHHHHHhhhcccCCCc-------CCC--C-CCC-------CCCccCCCCCCCCCcccCC Q lcl|NC_021324. 434 KQETEDMIDTLYSTTKAQADA-------TPK--P-TVT-------ETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 434 ~~~~~~~~~~~~~~~~~~~~~-------~~~--~-~~~-------d~~~~~~~~~~~~~~~~~~ 480 (480) -...+ .............++ +.+ + +++ ++.-.+++.+..+...++. T Consensus 525 ~~~~e-~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~~p~~~~~~~g~~~~~t~~~~a~~ 587 (862) T protein:vir:99 525 GASPE-NLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPNVQMVPSMKPGQMVGPEVGITAPM 587 (862) T ss_pred CCCcc-cccccccCCcccccccccccccccCCccccCCcccccccCCCCCCCccccccccccCC Confidence 00000 000000000000000 000 0 000 0000000111111111111 No 116 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.32 E-value=3.8e-12 Score=83.20 Aligned_cols=452 Identities=12% Similarity=0.045 Sum_probs=177.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHH--hhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc---cCc--c- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAY--RNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD---IEG--F- 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~Y--Y~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~---~~~--~- 72 (480) ++.+...+....+.|...+.+.....+| |+|+.+- ... + .+-+++..-.+..|+.....|. ..+ + T Consensus 29 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~-~--grs~vv~~~v~~~ve~~~~~l~~~f~~~~~~~ 101 (763) T protein:vir:95 29 LQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKP----PKV-K--GRSQVQPKLVRRQAEWRYSALTEPFLGSNKLF 101 (763) T ss_pred HHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcc----ccc-C--CCccccCHHHHHHHHHHHHHHHHhhcCCCcEE Confidence 2222222333333344444443444444 4554321 111 1 1335566666777775554441 111 1 Q ss_pred c----cCCCchhHHHHHH-----HHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccc----------------------- Q lcl|NC_021324. 73 R----ISEDSEGLEELWN-----WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE----------------------- 120 (480) Q Consensus 73 ~----~~~d~~~~~~l~~-----~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~----------------------- 120 (480) . ..+|.+..+..+. ++..|+-.......+++++++|.+++.||..... T Consensus 102 ~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 (763) T protein:vir:95 102 KVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLFPIQTQEQADA 181 (763) T ss_pred EEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhccccchhHHHH Confidence 1 1344444443333 3445665566778999999999999998753100 Q ss_pred ---------------------cC-------C---------------------CCceeEEEEEcccceEEEEeCCCC-Cce Q lcl|NC_021324. 121 ---------------------SG-------D---------------------PAGIPLIRVESPLYMYAELDPRNT-RRV 150 (480) Q Consensus 121 ---------------------~~-------d---------------------~~g~~~~~~~~p~~~~~v~d~~~~-~~~ 150 (480) .. + ..++|++..++|.++++=.+...+ ... T Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da 261 (763) T protein:vir:95 182 LQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIIDPSCQGDINKA 261 (763) T ss_pred HHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheecCCCCCchhhC Confidence 00 0 125678888999998843321111 111 Q ss_pred EEE-EEEEEEecCC-----------------------------------------ceEEEEEEEeCCcEEEEEecCCCcc Q lcl|NC_021324. 151 TRA-VRLYTTRDDV-----------------------------------------AVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 151 ~~~-~~~~~~~~~~-----------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) .++ .+.+....+. ..+...++|.. +...+.+.. T Consensus 262 ~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~-----~d~~gdg~~ 336 (763) T protein:vir:95 262 MFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGF-----WDIEGNGVL 336 (763) T ss_pred ceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeee-----eccCCccee Confidence 222 1211111000 00111122211 111111111 Q ss_pred cc---------ccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-C Q lcl|NC_021324. 189 QW---------VVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-G 258 (480) Q Consensus 189 ~~---------~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~-G 258 (480) .+ +......|...|++|++.|+..+..+.+||.|-+ +.++++++.+|..++.+.+++...++|...+- | T Consensus 337 ~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~-~~~~d~Qr~~N~~~~~~~d~l~~~~~~~~~v~~g 415 (763) T protein:vir:95 337 EPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDA-ELLGDNQAVLGAVMRGMIDLLGRSANGQRGMPKG 415 (763) T ss_pred EEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchH-HHhhHHHHHHHHHHHHHHHHHHhhcCCcEEeecc Confidence 11 1112234455688999999988888999999877 45999999999999999999988888764432 2 Q ss_pred CCccccccccccchhhhhcchheec-CCCCce----EEEecc--ccHHHHHHHHHHHHHHHHhhcCCChHHhccccc--- Q lcl|NC_021324. 259 VTTDELTNDGENTTLDIYYGRILTL-ASEAAK----ISEFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSE--- 328 (480) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~----~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~--- 328 (480) . .... + .+...++.++.+ +|..+. +...+. ......+.+++..+. ..+|++....|.+.. T Consensus 416 a-v~~~-d-----~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l~~~~~~~e---~~TGv~~~~~G~~~~~~~ 485 (763) T protein:vir:95 416 M-LDAL-N-----SRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMATLQNQEAE---SLTGVKAFAGGVTGESYG 485 (763) T ss_pred c-ccch-h-----hhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHHHHHHHHHH---HhhCcchhhcCcCccccc Confidence 2 1111 1 112233443332 222211 122221 233444555555544 445555554443221 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhCCCCc----c-cc-----------eeeeEEecCCC Q lcl|NC_021324. 329 NPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI----AMQIMGREVT----E-EY-----------TRLETVWRDPS 388 (480) Q Consensus 329 ~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l----i~~~~~~~~~----~-~~-----------~~~~v~f~~~~ 388 (480) ..++|++.. ...-........+.|..+++.+++. +..+.+.... . ++ .++.|.-. T Consensus 486 ~tat~v~~l--~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~--- 560 (763) T protein:vir:95 486 DVAAGIRGV--LDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS--- 560 (763) T ss_pred chhHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc--- Confidence 123444332 2222233334445555566555554 4443332100 0 01 11222111 Q ss_pred CCCHH-HHHHHHHHHHhccccCCCH----HHHHHhCCCC------h---------hHHHH-HHHHHHHHHHHHHHHHhh- Q lcl|NC_021324. 389 TPTVA-AKADAVSKLYANGQGPIPK----EQARIDLGYT------A---------TQREQ-MRDWDKQETEDMIDTLYS- 446 (480) Q Consensus 389 ~~~~~-~~ad~~~kl~~~~~~~~s~----e~~~~~~~~~------~---------d~~~~-~~~~~~~~~~~~~~~~~~- 446 (480) +.+.. +.+..+..|.+.....++. ..+.....+. . ++.+. +.+....+.+........ T Consensus 561 ~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~~ak 640 (763) T protein:vir:95 561 TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEELRSK 640 (763) T ss_pred cchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHHHHH Confidence 22222 2233334433321111221 1111110100 0 00000 000000000000000000 Q ss_pred hcccCCCcCCCCCCCCCC--------ccCCCC-CCCCCcccCC Q lcl|NC_021324. 447 TTKAQADATPKPTVTETK--------TETQTS-PSGFNRTKTR 480 (480) Q Consensus 447 ~~~~~~~~~~~~~~~d~~--------~~~~~~-~~~~~~~~~~ 480 (480) +....+.+.......+.. .+.+.+ ....-.++.+ T Consensus 641 aq~~qaqa~~~~aq~e~~~~d~~~~e~~~Q~~~e~~~~~~~~e 683 (763) T protein:vir:95 641 IRLNDAQAQKAMAERDNKNLDYLEQESGTKHARDLEKMKAQSQ 683 (763) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000000000000 000000 0000000000 No 117 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.28 E-value=5.8e-12 Score=82.19 Aligned_cols=429 Identities=13% Similarity=0.062 Sum_probs=188.4 Q ss_pred CCCHHH--HHHHHHHHHHHHH-HH-HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MTTYHE--HVERLQGLLARDL-PN-LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~~~--~i~~l~~~~~~~~-~~-~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |+.... ....+.......- .+ ...+..||....-+-+ ++-.....+.++++|||..+.-+.-+|+.+.. T Consensus 71 ~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~gy-------ql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~ 143 (765) T protein:vir:96 71 MDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQGFIGY-------QACAIISQHWLVDKACSMSGEDAARNGWELKS 143 (765) T ss_pred ccccccccccchHHHhhhccCccchhhHHHhhhcccCCccH-------HHHHHHHhCchhhhhhhcchHHhhcCCceeec Confidence 544321 1222222221110 01 1123334433221111 12233445677899999999988888887644 Q ss_pred Cc-----hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccc------cCC----CCcee-EEEEEcccceEE Q lcl|NC_021324. 77 DS-----EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE------SGD----PAGIP-LIRVESPLYMYA 140 (480) Q Consensus 77 d~-----~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~------~~d----~~g~~-~~~~~~p~~~~~ 140 (480) +. +..+.|...|++-++...+.++.+.+-.||.+|+.+...... ..+ ..|.+ .|.+++|..+.| T Consensus 144 ~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~ 223 (765) T protein:vir:96 144 DGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMP 223 (765) T ss_pred CccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhhccc Confidence 32 233567777887788999999999999999998776532110 000 11222 244455544444 Q ss_pred EEeCCCCCceEEEEEEEEEecCCceEEEEEE----EeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 141 ELDPRNTRRVTRAVRLYTTRDDVAVPDRATL----YLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 141 v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) .-.......+. ..+.+.+..+.+ +-+.++++|. .-|+- .. ..... T Consensus 224 ~~v~e~~~Dp~--------sp~fg~P~~y~i~g~~IH~SRli~~~---------------------g~~lp-d~-lk~~~ 272 (765) T protein:vir:96 224 QLTAESTADPS--------AEHFYEPDFWIISGKKYHRSHLVVVR---------------------GPQPP-DI-LKPTY 272 (765) T ss_pred ccchhcccccc--------ccccCcceeeeecCceeccceEEEec---------------------CCCch-hh-hcccc Confidence 21000000000 001111111000 0011111110 00110 00 11223 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc--cccchhhhh--cchheecCCCCceEEE Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND--GENTTLDIY--YGRILTLASEAAKISE 292 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~--~~~~~~~~~--~~~~~~~~g~~~~~~~ 292 (480) ..+|.|.+.. +..-+..++++.-.....+..+....+.+.+.......+. .....+... ...++.++. +.++.+ T Consensus 273 ~~~G~Svlq~-~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g~~~id~-ee~~e~ 350 (765) T protein:vir:96 273 IFGGIPLTQR-IYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHGVKVIGI-DETMEQ 350 (765) T ss_pred CccCccHHHH-HHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCceeEEecC-CcceeE Confidence 4579987754 6666666777666666555554444443333211000000 000111111 112333332 335544 Q ss_pred eccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc--ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021324. 293 FKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS--ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) Q Consensus 293 ~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~--~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 370 (480) ++. ++....+.+.....+|+..++||..-|-+.+ +-++||..-..-|...+. ...+..+...|++++.+++... T Consensus 351 ~s~-~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~--s~Qe~~l~p~le~L~~li~~s~- 426 (765) T protein:vir:96 351 FDT-NLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELE--SIQEHIFDPLLERHYLLLAKSE- 426 (765) T ss_pred Eec-ccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhc- Confidence 443 3445566677778899999999975554432 224677654444444333 2344678889999999877542 Q ss_pred CCCcccceeeeEEecCCCCCCHHHHHHHHH-------HHHhccccCCCHHHHHHhC------CCC---hhHHHHHHHHHH Q lcl|NC_021324. 371 REVTEEYTRLETVWRDPSTPTVAAKADAVS-------KLYANGQGPIPKEQARIDL------GYT---ATQREQMRDWDK 434 (480) Q Consensus 371 ~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~-------kl~~~~~~~~s~e~~~~~~------~~~---~d~~~~~~~~~~ 434 (480) ... .++++.|.+....+..+.|+... +++++| .++...+++.| |+. +++++....+.. T Consensus 427 -~i~---~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~G--vis~dEvR~~L~~~~~~g~~~l~d~~~e~~~~~~p 500 (765) T protein:vir:96 427 -SID---VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSG--VVSPDEVRERLRDDPRSGYNRLTDDQAETEPGMSP 500 (765) T ss_pred -CCC---CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC--CCCHHHHHHHHhccccCCCCCCCccccccccCCCc Confidence 222 25889999999899988887543 334443 56666666655 222 222111000000 Q ss_pred HHHHHHHHHHhhhc--------ccCCCcCCCCCCCCCCccCC----CCCCCCCcccCC Q lcl|NC_021324. 435 QETEDMIDTLYSTT--------KAQADATPKPTVTETKTETQ----TSPSGFNRTKTR 480 (480) Q Consensus 435 ~~~~~~~~~~~~~~--------~~~~~~~~~~~~~d~~~~~~----~~~~~~~~~~~~ 480 (480) +..++ .+...... ...+.++...+.+++.+..+ .....+..++++ T Consensus 501 e~~~~-~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~p~~~~~~~~~g~ 557 (765) T protein:vir:96 501 ENLAE-LEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTKPLAKAAEEGAGE 557 (765) T ss_pred ccccc-ccCCCcccccccCccccccCCCCccCCCCcccccCCcccCCccccccccCcc Confidence 00000 00000000 00000000000000000000 000000001111 No 118 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.16 E-value=9e-10 Score=70.17 Aligned_cols=401 Identities=8% Similarity=0.073 Sum_probs=149.3 Q ss_pred CCC-----------------------H---------HHHHHHHHHHHHHHHHHHHHHHHHhhccC--cchhcCcccchhh Q lcl|NC_021324. 1 MTT-----------------------Y---------HEHVERLQGLLARDLPNLLEAEAYRNGTR--RLKTIGIGAPPEL 46 (480) Q Consensus 1 ~~~-----------------------~---------~~~i~~l~~~~~~~~~~~~~~~~YY~G~~--~i~~~~~~~~~~~ 46 (480) ++. . .+++..+++........+.....+..+.- .+......... T Consensus 54 ~~~~~~~~~~~~~~~~~r~~~~~~~~l~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~-- 131 (551) T protein:vir:80 54 YSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKP-- 131 (551) T ss_pred eecccccceecCcccccCccccChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCccc-- Confidence 100 0 12223333333333332222222221110 00000000000 Q ss_pred hcceeccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHHHhc---------CHHHHHHHHHHHHhhcCeeEEEEecC Q lcl|NC_021324. 47 AYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQAN---------DLDEESVLGHDDSLTFGRAYITVSHP 117 (480) Q Consensus 47 ~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~~~~~n---------~~~~~~~~~~~~~~~~G~a~~~v~~~ 117 (480) ...+....+.+.+++.+- .+..+...+..+.+.+|.+|+.+..+ T Consensus 132 ---------------------------~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~f~~~lv~dlll~Gnay~~i~rd 184 (551) T protein:vir:80 132 ---------------------------TSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFN 184 (551) T ss_pred ---------------------------ChhHHHHHHHHHHHHHhcCCCCCCccchHHHHHHHHHHHHHhcCCEEEEEEEC Confidence 000111112233333221 23345666788889999998877653 Q ss_pred ccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccc Q lcl|NC_021324. 118 DVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDV 196 (480) Q Consensus 118 ~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (480) ..|+| .+..++|..+.++.++... .....++++.. .+++. ...|.++. T Consensus 185 ------~~G~~~~L~~l~p~~V~v~~~~~g~-~~~~~~~y~~~-~~g~~---~~~~~~~e-------------------- 233 (551) T protein:vir:80 185 ------RNQSMVRFVAKDPTTIFFATTADGK-IPDNGNRFVQV-IDQKI---VATFNARE-------------------- 233 (551) T ss_pred ------CCCcEEEEEEeCCceeEEEECCccc-cccCceEEEEE-eCCcE---EEEEcccc-------------------- Confidence 35555 5778899988887655421 11111111111 11110 01122222 Q ss_pred ccccccccceEEeecccc---cCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHh---cchhhhh--cCCC-ccccccc Q lcl|NC_021324. 197 IKHGLGVVPVVPLTNDPR---LGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLRVI--SGVT-TDELTND 267 (480) Q Consensus 197 ~~~~~g~iPvv~~~n~~~---~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~---~~p~l~i--~G~~-~~~~~~~ 267 (480) |+||+.++. ....+|.|.|. .+.+++...+.-......+| +.|-.+| .|.. +.+...+ T Consensus 234 ---------iiH~~~n~~~~~~~~~~G~spi~----~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~ 300 (551) T protein:vir:80 234 ---------MAFAVRNPRSDIYATGYGYPELE----IALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALE 300 (551) T ss_pred ---------eEEecccCCCCcccccccccHHH----HHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHH Confidence 244443322 22457887653 33444443333333333333 3454333 3321 1111001 Q ss_pred cccchhhh------hcchheecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH-H Q lcl|NC_021324. 268 GENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-T 339 (480) Q Consensus 268 ~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~-~ 339 (480) .-...|.. ..+++..+.+++.++..+.... -..|++..+....+|+.+-++|+..+|....+.+.|..... - T Consensus 301 ~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t 380 (551) T protein:vir:80 301 IFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLN 380 (551) T ss_pred HHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccc Confidence 00111211 1234444445667888776332 22488888999999999999999999853322111100000 0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh Q lcl|NC_021324. 340 DSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID 418 (480) Q Consensus 340 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~ 418 (480) +..... .....+...|.-++..+...+... .......+.+.|......+..+.+. +.++..+| +++...++++ T Consensus 381 ~sn~e~---~~~~f~~~tL~P~~~~ie~~ln~~L~~~~~~~~~f~f~~~~~~~~~~~~~-~~~~~~~g--~lT~NE~R~~ 454 (551) T protein:vir:80 381 EGNSAE---KNQASKNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVK-ILAEKAKV--AMTVNEVRKE 454 (551) T ss_pred hhhHHH---HHHHHHHHHHHHHHHHHHHHHHhhhccccCCceEEEeeccChhhHHHHHH-HHHHHhcC--CcCHHHHHHH Confidence 000000 011111222222222111110000 0001124667777666666666554 44555544 5777788888 Q ss_pred CCCChh-H-HHHH----------HHHHHH--HHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCc-------- Q lcl|NC_021324. 419 LGYTAT-Q-REQM----------RDWDKQ--ETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR-------- 476 (480) Q Consensus 419 ~~~~~d-~-~~~~----------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~-------- 476 (480) +|+.+. + -+.+ ....++ +.+.............+.+...++..++++.....+.+.++ T Consensus 455 ~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 534 (551) T protein:vir:80 455 LNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQRKDKDNA 534 (551) T ss_pred hCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccccCcccc Confidence 877431 1 0000 001000 00000000000000000000000001111110101111111 Q ss_pred ----ccCC Q lcl|NC_021324. 477 ----TKTR 480 (480) Q Consensus 477 ----~~~~ 480 (480) .-+| T Consensus 535 ~~~~~~~~ 542 (551) T protein:vir:80 535 NAGKQGMK 542 (551) T ss_pred chhhhhcC Confidence 0111 No 119 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.13 E-value=7.5e-10 Score=70.59 Aligned_cols=400 Identities=7% Similarity=0.050 Sum_probs=150.1 Q ss_pred CCCH--------------------------------HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhc Q lcl|NC_021324. 1 MTTY--------------------------------HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY 48 (480) Q Consensus 1 ~~~~--------------------------------~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~ 48 (480) ++.+ ..++..+++........+.....|.........+-+.... T Consensus 50 ~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~---- 125 (547) T protein:vir:63 50 YSQPVIGSMSANPGFKTKPSIRNNQDLHGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDK---- 125 (547) T ss_pred hhchhhheeecccccccCCccCChhHHHHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEeccccc---- Confidence 1110 1223333333333333333222222211111000000000 Q ss_pred ceeccchHHHHHHHHHhhhccCccccCCCchhHHHHHHHHHhc---------CHHHHHHHHHHHHhhcCeeEEEEecCcc Q lcl|NC_021324. 49 LDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQAN---------DLDEESVLGHDDSLTFGRAYITVSHPDV 119 (480) Q Consensus 49 ~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~~~~~n---------~~~~~~~~~~~~~~~~G~a~~~v~~~~~ 119 (480) + ....+......+.+++.+- .+..+...+..+.+.+|.+|+.+..+ T Consensus 126 -~----------------------~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~~~lv~d~ll~Gn~~~~i~rd-- 180 (547) T protein:vir:63 126 -K----------------------PTSHDEATIKRIESFIEKTGVDNDINRDSFSSFVKKIVRDTYMYDQVNFEKVFN-- 180 (547) T ss_pred -c----------------------cChhhHHHHHHHHHHHHhhCCCCCCccchHHHHHHHHHHHHHhhCCEEEEEEEC-- Confidence 0 0001111112233332220 23356666788999999999887653 Q ss_pred ccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccc Q lcl|NC_021324. 120 ESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIK 198 (480) Q Consensus 120 ~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (480) ..|++ .+..++|..+.++.++... .....++++.. .+.+. ...|.++. T Consensus 181 ----~~G~~~~L~~l~p~~V~~~~~~~g~-~~~~~~~y~~~-~~~~~---~~~~~~~e---------------------- 229 (547) T protein:vir:63 181 ----RNQSMVRFVAKDPTTIFFATTADGK-IPDNGNRFVQV-IDQKI---VATFNARE---------------------- 229 (547) T ss_pred ----CCCcEEEEEEecCceeEEEECCccc-cccCceEEEEE-cCCcE---EEEecccc---------------------- Confidence 34555 5678889888877654321 10000111111 11110 01122222 Q ss_pred ccccccceEEeeccccc---CccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcc---hhhhh--cCCC-ccccccccc Q lcl|NC_021324. 199 HGLGVVPVVPLTNDPRL---GNRYGRSEISPELRKVTDAASRTLMNLQSASQILGT---PLRVI--SGVT-TDELTNDGE 269 (480) Q Consensus 199 ~~~g~iPvv~~~n~~~~---~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~---p~l~i--~G~~-~~~~~~~~~ 269 (480) |+||+.++.. ...+|.|.|. .+.+.+...+.-......+|.+ |--+| .|.. +.+...+.- T Consensus 230 -------iih~r~n~~~~~~~~~~G~Spi~----~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~l 298 (547) T protein:vir:63 230 -------MAFAVRNPRSDIYATGYGYPELE----IALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIF 298 (547) T ss_pred -------EEEecccCCCCcccccccccHHH----HHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHH Confidence 3444433222 2457887653 3344444333333333344433 44333 3321 111110000 Q ss_pred cchhhh------hcchheecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH-HHH Q lcl|NC_021324. 270 NTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-TDS 341 (480) Q Consensus 270 ~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~-~~~ 341 (480) ...|.. ..+++..+.+++.++.++.... -..|++..+..+.+|+.+-++|+..+|....+.+++..-.. -+. T Consensus 299 k~~~~~~~~G~~nagk~~vl~~~g~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~s 378 (547) T protein:vir:63 299 KREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEG 378 (547) T ss_pred HHHHHHHhcCcccccccccccCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchh Confidence 111211 1233444445667888776432 22388888999999999999999999853322111100000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCC Q lcl|NC_021324. 342 RIVKMAERKGRIFGGAWERAMRIAMQIMGREV-TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLG 420 (480) Q Consensus 342 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~ 420 (480) .+. ......+...|.-++..+...+.... ......+.+.|......+..+.+. +.++..+| +++...+++++| T Consensus 379 n~e---~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~~~~~~~~f~~~~~~~~~~~~~-~~~~~~~g--~lT~NE~R~~~g 452 (547) T protein:vir:63 379 NSA---EKNQASKNKGLQPLLGFIEDFINKHIVAEFGDKYTFQFVGGDIKSELESVK-ILAEKAKV--AMTVNEVRKELN 452 (547) T ss_pred hHH---HHHHHHHHHHHHHHHHHHHHHHHhhcccccCCceEEEeeccccccHHHHHH-HHHHHhCC--CcCHHHHHHHhC Confidence 000 00111112222222222211111000 000123667787666666666554 44555544 567777888887 Q ss_pred CChh-H-HHHH----------HHHHHHH-----HHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCC-CcccCC Q lcl|NC_021324. 421 YTAT-Q-REQM----------RDWDKQE-----TEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGF-NRTKTR 480 (480) Q Consensus 421 ~~~d-~-~~~~----------~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~-~~~~~~ 480 (480) +.+. + -..+ ....++. .++..+.......+ +...++.+++.......+.+. +..++. T Consensus 453 l~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 527 (547) T protein:vir:63 453 LPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGN---RVSTDVEDIPDGKDTTGDIGKDGQRKDK 527 (547) T ss_pred CCCCCCCCceeecccccccccccccccCCccccchhhccccccccCC---CCCCCCCCCCCCcccCCCcCccccccCc Confidence 7432 1 0000 0000000 00111111111000 000000111111000011111 111111 No 120 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.96 E-value=9.4e-09 Score=64.60 Aligned_cols=453 Identities=12% Similarity=0.025 Sum_probs=196.0 Q ss_pred CCCH-HHHHHHHHHHHHHHHHHH-HHHHHHhhccCcchhcCc----cc-chhhhcceeccchHHHHHHHHHhhhcc---- Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLPNL-LEAEAYRNGTRRLKTIGI----GA-PPELAYLDVQPGWVATYLRTLSDRLDI---- 69 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~~~-~~~~~YY~G~~~i~~~~~----~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~---- 69 (480) |.+. .+.+++..+.+...+..+ ...+++|+ +.+|.... .. ....++.++..+-+...|+++++.|.. T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltp 78 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSD--YINPRGSRFLTSEVNRNDRRNTRIIDSTGTMAARTLASGMMSGITS 78 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhccccCCcCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcC Confidence 8773 455555555555444322 23333442 22222211 11 112224466677788888888877731 Q ss_pred --Cccc---cCCCc-----hhH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEE Q lcl|NC_021324. 70 --EGFR---ISEDS-----EGL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 70 --~~~~---~~~d~-----~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~ 132 (480) .+|+ +++++ +.. +.+.+.+..++|.....++.++...+|.|.+++.. |.++-++++. T Consensus 79 p~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~------d~~~~~r~~~ 152 (559) T protein:vir:95 79 PARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLD------DDEDIIRTMP 152 (559) T ss_pred CCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeec------CCCceeEEEE Confidence 2232 22211 111 22455667788999999999999999999877653 3345578888 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEec--------------------CC-ceEEEEEEEe----CCcE---------- Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRD--------------------DV-AVPDRATLYL----PDET---------- 177 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~--------------------~~-~~~~~~~~~~----~~~~---------- 177 (480) ++..++++.-|.. .++...+|.+...- +. ....+++++. .... T Consensus 153 ~~l~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~ 230 (559) T protein:vir:95 153 FPIGSYYLANSPR--GSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNK 230 (559) T ss_pred eecCeEEEeeCCC--CCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccc Confidence 9988888776653 34444444332110 00 1011233321 1000 Q ss_pred ----EEEEecCCCccccccccccccccccccceEEeecccccCccCCCcc-chHHHHHHHHHHHHHHHHHHHHHHHhcch Q lcl|NC_021324. 178 ----VPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSE-ISPELRKVTDAASRTLMNLQSASQILGTP 252 (480) Q Consensus 178 ----~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~-i~~~v~~l~D~~~~~~s~~~~~~~~~~~p 252 (480) ++|...... ... ..+.+|..+|++.++-+...++.||+|- . .+..+-+..+|.+.-.....++...+| T Consensus 231 pf~s~~~e~~~~~---~~~---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~-~~al~d~k~L~~l~~~~l~~~~~~~~p 303 (559) T protein:vir:95 231 PFKSVYYEVGGDN---DKL---LRESGFDEFPIMAPRWEVNGEDVYGSSCPG-MLALGPVKALQLLQKRKSQLIDKATNP 303 (559) T ss_pred eEEEEEEEecCCC---cee---eecCCcccCCccceeeeecCCccccccchH-HHhhHHHHHHHHHHHHHHHHHHHHhcC Confidence 111111000 000 1234567788888888888899999994 4 346677788888888888888988888 Q ss_pred hhhhcCCCccccccccccchhhhhcchheecCCC--CceEEEe--ccccHHHH---HHHHHHHHHHHHhhcCCChHHhcc Q lcl|NC_021324. 253 LRVISGVTTDELTNDGENTTLDIYYGRILTLASE--AAKISEF--KAAELRNF---AEEMEVFRKEAASITGLPPQYLSS 325 (480) Q Consensus 253 ~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~--~~~~~~~~---~~~l~~~~~~i~~~~~~p~~~~g~ 325 (480) .+.+... .....++..+|.+...... ...+.-+ -..++... ++.++.-|...+.... ...+.. T Consensus 304 p~~v~~~--------~~~~~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~--~~~l~~ 373 (559) T protein:vir:95 304 PMVAPTS--------LKNQRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDL--FMMLQN 373 (559) T ss_pred ceecccc--------ccccceeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhh--HHHhhc Confidence 6665321 1111234445544432211 1122211 11223222 3334433433333211 111221 Q ss_pred ccccchHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHHHhCCC-C--cccceeeeEEecCCCCCCH----- Q lcl|NC_021324. 326 SSENPASAEAIIATDSRIVKM----AERK-GRIFGGAWERAMRIAMQIMGRE-V--TEEYTRLETVWRDPSTPTV----- 392 (480) Q Consensus 326 ~~~~~~Sg~Al~~~~~~l~~k----~~~~-~~~~~~~l~~~~~li~~~~~~~-~--~~~~~~~~v~f~~~~~~~~----- 392 (480) ......+++.++.....+... ..+. ...+.+-+.+++.++.+..-.. . ......++|.+..++..-. T Consensus 374 r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~ 453 (559) T protein:vir:95 374 INTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGL 453 (559) T ss_pred CCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHH Confidence 111112444444333222222 1111 2223334445555444321001 1 1112457777755443211 Q ss_pred ---HHHHHHHHHHHhcccc---CCCHHH----HHHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcC- Q lcl|NC_021324. 393 ---AAKADAVSKLYANGQG---PIPKEQ----ARIDLGYT------ATQREQMRDWDKQETEDMIDTLYSTTKAQADAT- 455 (480) Q Consensus 393 ---~~~ad~~~kl~~~~~~---~~s~e~----~~~~~~~~------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 455 (480) ...++.+..+.+.++. .+.... +...+|+. +++++++++.+.++++..+.+......++.... T Consensus 454 ~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~ 533 (559) T protein:vir:95 454 SSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTL 533 (559) T ss_pred HHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 0111122222222221 122222 23444543 344444433332222221111111111110000 Q ss_pred -CCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 456 -PKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 456 -~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) +....+...-..-....++++++.+ T Consensus 534 ~~~~~~~~~~l~~~~~~~~~~~~~~~ 559 (559) T protein:vir:95 534 SEAKTSDPSVLSAMANAVSGQGGQSQ 559 (559) T ss_pred ccccCCChhHHHHHHHhhcCccccCC Confidence 0000000000111123344555555 No 121 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=98.95 E-value=7.1e-09 Score=65.26 Aligned_cols=411 Identities=14% Similarity=0.131 Sum_probs=158.5 Q ss_pred CCCHH--HHHHHHHHHHHHHHHHHHHHHHHhhccCcc-hhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc---c Q lcl|NC_021324. 1 MTTYH--EHVERLQGLLARDLPNLLEAEAYRNGTRRL-KTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---I 74 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i-~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~ 74 (480) ++++. |... .+...|.+.... ....... ..+......+.....+|+..++-+-.-++. . T Consensus 9 ~~~p~~~e~~~--------------~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~~~~V~acV~~IA~~iA~lpl~l~~~ 73 (518) T protein:vir:10 9 LSAPAMAELSP--------------QMQDSYYYAPAVGMQLERQF-SLYGGIYKNQPWVRTVIAKRAQALARLPVKCMFT 73 (518) T ss_pred ecCchhhhhhh--------------hhhcccccccccceeccccc-chhhHHHhhhHHHHHHHHHHHHhhccCceEEEEE Confidence 33331 1111 111112111100 0000000 000011111233344444444333222222 1 Q ss_pred CCCc---hhHHHHHHHHHh-cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCC Q lcl|NC_021324. 75 SEDS---EGLEELWNWWQA-ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRN 146 (480) Q Consensus 75 ~~d~---~~~~~l~~~~~~-n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~ 146 (480) .++. .....+..++.+ |. ...+...+..+.+.+|.||+++..+ .+|.+ .+..++|..+.+..+... T Consensus 74 ~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~------~~G~~~~L~~l~p~~v~v~~~~~~ 147 (518) T protein:vir:10 74 SGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN------KSGTPEKLMPMHPSRVAIKRNSRT 147 (518) T ss_pred cCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEECCCceEEEEcCCC Confidence 1111 112234444433 22 2356777788899999999998753 35665 578889998888876543 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) . . +.+++.. ..+.......|.++.+ +||++....+..+|.|.|.- T Consensus 148 ~-~----~~y~~~~-~~~~~~~~~~~~~~eV-----------------------------iHir~~s~dg~~~G~spi~~ 192 (518) T protein:vir:10 148 G-R----YEYYFQA-GAGVGTQLVSFADDEV-----------------------------VPIRFFNPDGLERGLSLMES 192 (518) T ss_pred C-E----EEEEEEe-cCCccceEEEecCCcE-----------------------------EEecCCCCCcccccccHHHH Confidence 2 1 1111111 1111111122333333 34443222223457765532 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEeccccH- Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAEL- 298 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~~- 298 (480) ....+.....+.....+.+...+.|-.++.-- .+.+...+.-...|+. ..++++.++ ++.++.++..... T Consensus 193 -a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~k~~~~~~~~G~~nag~v~vL~-~G~~~~~l~~s~~D 270 (518) T protein:vir:10 193 -LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSEAAQQRLREQFDRAHSGSSNTGKTMVVE-EGMEPIPLQLTAVE 270 (518) T ss_pred -HHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCccccCcceEcC-CCceEEEccCChhH Confidence 22222222222222222233333454444321 1111111111111211 123455565 4467777664322 Q ss_pred HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021324. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 378 (480) ..|++..+..+.+|+..-++|+..+|....+.-|. +......+...+ -.-+-..|+..+.. .+...... .. T Consensus 271 ~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn--~eq~~~~f~~~t---L~P~l~~ie~~ln~--~L~~~~~~--~~ 341 (518) T protein:vir:10 271 MQFIEARQLNREEVCGVYDIAPPIVHILDRATFSN--ISAQMRAFYRDT---MAIPIARIQSAMDK--YVGQYWVR--KN 341 (518) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchh--HHHHHHHHHHHH---HHHHHHHHHHHHHH--hhcccccC--Cc Confidence 23788888889999999999999997543221121 111111111111 00111111111110 01111111 12 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhc-ccC-C---- Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT-KAQ-A---- 452 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~---- 452 (480) .+++........|..++++++.+++.+| +++...+++.+|+.+-+-....+......-..+....... .++ + T Consensus 342 ~~~fd~~~llr~D~~~r~~~~~~~~~~G--~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~ 419 (518) T protein:vir:10 342 RMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPK 419 (518) T ss_pred eEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCC Confidence 3444445566788899999999998876 6677778888876432100000000000000000000000 000 0 Q ss_pred CcCCCC--CCCC--CCccCCCCCCCCCcccCC Q lcl|NC_021324. 453 DATPKP--TVTE--TKTETQTSPSGFNRTKTR 480 (480) Q Consensus 453 ~~~~~~--~~~d--~~~~~~~~~~~~~~~~~~ 480 (480) .++..+ ..++ +......++...++...+ T Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (518) T protein:vir:10 420 RPASTPVASLDQSPPTSVPGLSPTNSDRSTDS 451 (518) T ss_pred CCCccccccccccccccCCCCCcccccccccc Confidence 001111 0000 011111111111111111 No 122 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.95 E-value=1.1e-08 Score=64.17 Aligned_cols=390 Identities=13% Similarity=0.102 Sum_probs=160.9 Q ss_pred hhcceeccchHHHHHHHHHhhhccCccccC------CCc---hhHHHHHHHHHh---c-----------CHHHHHHHHHH Q lcl|NC_021324. 46 LAYLDVQPGWVATYLRTLSDRLDIEGFRIS------EDS---EGLEELWNWWQA---N-----------DLDEESVLGHD 102 (480) Q Consensus 46 ~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~------~d~---~~~~~l~~~~~~---n-----------~~~~~~~~~~~ 102 (480) ++++--...+...+|+..++.+.+-|+.+- ... ...+.+.+++.. | .+..+...+.. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 333322346777788877777766666431 011 111223333321 1 23456677888 Q ss_pred HHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCC------ Q lcl|NC_021324. 103 DSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPD------ 175 (480) Q Consensus 103 ~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 175 (480) +...+|.||+.+..+. .|++ .+..++|..+.+.-|...- +.. .+.. ..++.+|... T Consensus 81 ~l~l~Gn~~i~~~r~~------~G~~~~l~~l~~~~v~~~~d~~~~------~~~----~~~~-~~~~~~~~~~~~~~~~ 143 (467) T protein:vir:31 81 DYEAIGWLTIEILTQT------DGTPTGLAYVPGHTIRKRMDERGF------VQL----LEEK-EKYFGVAGDRYQTNGN 143 (467) T ss_pred HHHhcCCeEEEEEECC------CCcEEEEEEeCCceeEeeeeccee------Eee----cCCc-eeeEEeccccceeecc Confidence 9999999999876532 4554 5778888888776544210 100 0100 1111111110 Q ss_pred -cEEE-EEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhc--- Q lcl|NC_021324. 176 -ETVP-LRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG--- 250 (480) Q Consensus 176 -~~~~-~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~--- 250 (480) .... +...... .......+..==|+||++.....+.+|.|.+.- +.+++.....-......+|. T Consensus 144 ~~~~~~~~~~~~~-------~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~----~~~~i~~~~~~~~~~~~~f~ng~ 212 (467) T protein:vir:31 144 GDLDPVFVDADDG-------STGTSVSNPANELIFKRNHSPLYPHYGAPDIIP----AVKTIRGDSAAQDYNIDFFENDG 212 (467) T ss_pred cceeeeeeeeccc-------cccceeEeccccEEEecCCCCCCCcccccHHHH----HHHHHHHHHHHHHHHHHHHhccC Confidence 0000 0000000 000001111123577776555677889887643 33444333322222233333 Q ss_pred chhhhh--cCCCccccccccccchhhh-----------------hcchheecCCC-C-----ceEEEecccc--HHHHHH Q lcl|NC_021324. 251 TPLRVI--SGVTTDELTNDGENTTLDI-----------------YYGRILTLASE-A-----AKISEFKAAE--LRNFAE 303 (480) Q Consensus 251 ~p~l~i--~G~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~g~-~-----~~~~~~~~~~--~~~~~~ 303 (480) .|-.++ +|........+.-...|+- ..+....+.++ + .++..++... -..|++ T Consensus 213 ~p~gil~~~~~~l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e 292 (467) T protein:vir:31 213 VPRIAIIVKGAELTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLE 292 (467) T ss_pred CCceEEEecCcCCCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHH Confidence 343333 3432221111100011110 01122222221 1 1222222211 234778 Q ss_pred HHHHHHHHHHhhcCCChHHhccccc-cchH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 304 EMEVFRKEAASITGLPPQYLSSSSE-NPAS-AEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~-~~~S-g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) ..+....+|+..-++|+..+|.... +..| .......+. .....-.... |++.+.. .+...........++ T Consensus 293 ~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~--~~~l~P~~~~----ie~~ln~--~l~~~~~~~~~~~i~ 364 (467) T protein:vir:31 293 FRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKEFA--EETIQPKQHD----FGELLYE--LVHKQGLDAPDWTIE 364 (467) T ss_pred HHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHHHH--HHHHHHHHHH----HHHHHHH--hhcchhhccCCceEE Confidence 8888899999999999998875322 2112 212111111 1111111111 1111110 111111111223466 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhc--cc-CCCcCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT--KA-QADATPKP 458 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~~~~ 458 (480) +.+......+..+.++++.+++.+| +++...+++.+|+.+-.-+.+.. ........... .. ..+....+ T Consensus 365 f~~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~Gl~pi~d~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~ 436 (467) T protein:vir:31 365 FELAKPDTKLQDVEIASQRVQAMQG--LLTVNELRDEFGFEPFPEEHVYG------GETLVAEVTGGSGPGGGIGDQIEQ 436 (467) T ss_pred EecchhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCcccccC------CcccccccccccCCCCcccCcCCC Confidence 6667777889999999998888766 67777888888874321000000 00000000000 00 00000000 Q ss_pred CCCCCC----------ccCCCCCCCCCcccC Q lcl|NC_021324. 459 TVTETK----------TETQTSPSGFNRTKT 479 (480) Q Consensus 459 ~~~d~~----------~~~~~~~~~~~~~~~ 479 (480) +..+.. -++++.-.-++++-| T Consensus 437 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 467 (467) T protein:vir:31 437 LVEDRADEIIDSYQADLETEQLIEIGANADS 467 (467) T ss_pred CCCCcccchHhhhhhccccchhhhhccccCC Confidence 000000 001111111112222 No 123 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.94 E-value=1.1e-08 Score=64.14 Aligned_cols=420 Identities=15% Similarity=0.073 Sum_probs=159.9 Q ss_pred HHHHHHHHHHHHHHH-HHHHHH------HHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc--- Q lcl|NC_021324. 5 HEHVERLQGLLARDL-PNLLEA------EAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--- 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~-~~~~~~------~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~--- 74 (480) .=+++.|........ ...... ...+... .....+..+.+.. -+++.. ...+|+..++-+-.-++.+ T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~g~~V~~~~-al~~~~--V~~~v~~Ia~~iA~lp~~~~~~ 76 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLG-AVAASGETVTPHD-ALQVSA--VFASVRLLSETIATLPLSTYSK 76 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhc-ccccCCceechHH-hhccHH--HHHHHHHHHHhhccCceEEEEe Confidence 222222222111000 000000 0000000 0000011111100 011100 1122333332222223322 Q ss_pred CCC---chhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCce-eEEEEEcccceEEEEeCCC Q lcl|NC_021324. 75 SED---SEGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPRN 146 (480) Q Consensus 75 ~~d---~~~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~-~~~~~~~p~~~~~v~d~~~ 146 (480) .++ +.....+..++.. + ....+...+..+.+.+|.||+.+... .|+ ..+..++|..+.++.+... T Consensus 77 ~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-------~g~~~~l~~l~p~~v~v~~~~~~ 149 (457) T protein:vir:13 77 RGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-------GPNIVGLDVLDPTKIHVHMVMVD 149 (457) T ss_pred cCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEccCceEEEEecCC Confidence 111 1112234444332 1 23466777888899999999887532 233 3567788888877654433 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) ...... .+.|. ....+.......|.++.+++ +++....+..+|.|.+.. T Consensus 150 ~~~~~~-~~~y~-~~~~~~~~~~~~~~~~diih-----------------------------~~~~~~~~~~~G~s~i~~ 198 (457) T protein:vir:13 150 GLRRKV-FEAYD-IDADGNEVLLGWFTPRDVLH-----------------------------IPGMMLPGDFVGCSPISY 198 (457) T ss_pred Ccccee-EEEEE-EecCCceeeEEeeCccceEE-----------------------------ecCCCCCCccccccHHHH Confidence 211111 11121 12222222223344444444 333222233567775532 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecccc-H Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-L 298 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~ 298 (480) +...++....+.....+.+...+.|-.+|.-. ...+...+.-...|.. ..++++.++ ++.++.++.... - T Consensus 199 -~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d 276 (457) T protein:vir:13 199 -ARESIGLALAAQKYGSKFFANGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLT-EGAKFSKVAMSPDE 276 (457) T ss_pred -HHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEccCChhH Confidence 33333322222222223233334455444421 1111110100111111 123455665 446787775432 2 Q ss_pred HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021324. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 378 (480) ..+++..+....+|+..-++|+..+|....+..++..+......+...+- .-+-..|+..+. ..+...... ... T Consensus 277 ~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~tl---~P~~~~ie~~ln--~~L~~~~~~-~~~ 350 (457) T protein:vir:13 277 AQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFSL---RPWLERIEAGFN--RLLFAETAD-RFR 350 (457) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHHH---HHHHHHHHHHHH--HhhcCcccc-Cce Confidence 23677778888999999999999997644332222222222222211110 011111221111 111111111 223 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH---HHH-HHHHHHHHHHHHHHHhhhcccCCCc Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR---EQM-RDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~---~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) .+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+- +++ .-..-. ..+.......+...+ T Consensus 351 ~i~fd~~~l~~~D~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~----~~~~~~~~~~~~~~~ 424 (457) T protein:vir:13 351 FVKFNLDEIKRGAPKERMELWSLGLQNG--IYSIDEVRAAEDMTPLPDGLGEKYRVPLNLG----EVGEEPEPEPAPAPP 424 (457) T ss_pred eEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcccceeeccccc----cccccccccccCCCC Confidence 3555556667778899999999998876 566666777777643210 111 000000 000000001111111 Q ss_pred CCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 455 TPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) +.++...+++.+.+.+...++..++- T Consensus 425 ~~~~~~~~~~~~~~~~g~~d~~~~~~ 450 (457) T protein:vir:13 425 AIEPPAEEPDEEPEPEGKPDDEGATE 450 (457) T ss_pred CCCCCccccCCCCCCCCCCccccCCC Confidence 11111122221111111111111111 No 124 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.93 E-value=1.1e-08 Score=64.27 Aligned_cols=385 Identities=12% Similarity=0.048 Sum_probs=155.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchhHHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEEL 84 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~l 84 (480) ..+..++-.........-..+..+..+... +..+.... -+.+.-...+|+..++-+-.-++.+.++ .+ T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~~~---al~~~~V~~~v~~ia~~ia~~p~~~~~~-----~~ 68 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLTGGEA----QKYVSADT---ALKNSDIFSLIMQLSGDLAMVRYTSESD-----RS 68 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhcCCcC----CceechHH---hhccHHHHHHHHHHHHHHhhCccccccc-----HH Confidence 111111100000000000001111111100 01111110 0001111222333322222223443221 12 Q ss_pred HHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 85 WNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 85 ~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ..+..+ | ....+.+.+..+.+.+|.||+.+-.+ ..|.+ .+..++|..+.++.+.... .+. |... T Consensus 69 ~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~l~~l~~~~v~i~~~~~~~-~~~----y~~~ 137 (397) T protein:vir:38 69 QSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKN------TNGVDLSWEYLRPSQVQPMLLQDGS-GLI----YNIN 137 (397) T ss_pred HHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eEE----EEEE Confidence 233322 2 34467788888999999999887553 34554 6778899888777654432 111 1111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) ....+. .....+.++. |+||++....+..+|.|.+.. +...++....+. T Consensus 138 ~~~~~~-~~~~~~~~~e-----------------------------iih~~~~~~~~~~~G~s~i~~-~~~~i~~~~~~~ 186 (397) T protein:vir:38 138 FDEPAI-GYMENVPAAD-----------------------------VIHIRLLSKNGGKTGISPLSA-LINEQQIKDASN 186 (397) T ss_pred eccccc-cceeEecCcc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHHHHH Confidence 011000 0001112222 456655544455678887643 333344333333 Q ss_pred HHHHHHHHHhcchhhhhcCCC-ccccccccccchhhh-----hcchheecCCCCceEEEeccc-cHHHHHHHHHHHHHHH Q lcl|NC_021324. 240 MNLQSASQILGTPLRVISGVT-TDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRKEA 312 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~G~~-~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i 312 (480) ....+.+...+.|-.+++-.. ......+.....++. ..+.++.++ ++.++.++..+ .-..|++..+....+| T Consensus 187 ~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~~~~I 265 (397) T protein:vir:38 187 ELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVID-ALEDYKPLEVKGNIASLLNQVDWTRDQI 265 (397) T ss_pred HHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecC-CCceEEecCCChhHHHHHHHHHHHHHHH Confidence 333333344445555554211 111111111111111 123344454 45678777643 2334788888999999 Q ss_pred HhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCH Q lcl|NC_021324. 313 ASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTV 392 (480) Q Consensus 313 ~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~ 392 (480) +..-++|+..+|+...+.+|....+..+ ...|.-++..+..........+ ....+.| ....+. T Consensus 266 a~afgVp~~~lg~~~~~~~~~e~~~~~~--------------~~~l~P~~~~ie~~ln~~l~~~-~~~~~~~--~~~~d~ 328 (397) T protein:vir:38 266 AKVYGVPDSYLNGQGDQQSSITQISGQY--------------AKSLNRYVQAIVGELNDKLHAN-ISANIRF--AIDAMG 328 (397) T ss_pred HHHhCCCHHHhCCCCCcccHHHHHHHHH--------------HHHHHHHHHHHHHHHHHhccCh-hcccccc--cccCCH Confidence 9999999999987654433332222211 1222222222211111000000 1122333 334577 Q ss_pred HHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCC Q lcl|NC_021324. 393 AAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPS 472 (480) Q Consensus 393 ~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~ 472 (480) .+.++.+.+++.+| +++...+++.+|..+-+-..+-.. ............+.+++.+++...+++ T Consensus 329 ~~~~~~~~~~~~~G--~~t~nE~R~~lg~~p~~~~d~~~~-------------~~~~~~~~~~~~~~~g~~~~~~~~e~~ 393 (397) T protein:vir:38 329 DQYASTISSSVKGG--TIAGNQARFILQNSGYLAKDLPDP-------------EKEPQQAIQLIQQEGGENDGNNSDERG 393 (397) T ss_pred HHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCccccc-------------cccccccccccccccCCCCCCCCCCCC Confidence 88888888888775 667777777766532110000000 000000111111112222222222222 Q ss_pred CCCc Q lcl|NC_021324. 473 GFNR 476 (480) Q Consensus 473 ~~~~ 476 (480) +.++ T Consensus 394 ~~~~ 397 (397) T protein:vir:38 394 SDPE 397 (397) T ss_pred CCCC Confidence 2222 No 125 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.92 E-value=1.5e-08 Score=63.52 Aligned_cols=460 Identities=11% Similarity=-0.020 Sum_probs=180.6 Q ss_pred CCCH--HHHHHHHHHHHHHHH----HHHHHHHHHhhccCcchh-----cCc--ccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY--HEHVERLQGLLARDL----PNLLEAEAYRNGTRRLKT-----IGI--GAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~--~~~i~~l~~~~~~~~----~~~~~~~~YY~G~~~i~~-----~~~--~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |++. ..+|.++++.+...+ ++++.+.+||........ ... .......+.++..+.+...|+.++..| T Consensus 20 ~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~L 99 (641) T protein:vir:94 20 LSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAYF 99 (641) T ss_pred CCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhHH Confidence 5553 223444444443322 355667777755332211 000 001111234788888888888877766 Q ss_pred ccC----c--ccc----CCCchhHHH----HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcc------------c- Q lcl|NC_021324. 68 DIE----G--FRI----SEDSEGLEE----LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDV------------E- 120 (480) Q Consensus 68 ~~~----~--~~~----~~d~~~~~~----l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~------------~- 120 (480) ..- . |.+ .+|.+..+. +...+.++++.....+..++++.+|.+++.++.... + T Consensus 100 m~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~ 179 (641) T protein:vir:94 100 KGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVETGD 179 (641) T ss_pred hhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhcccchh Confidence 321 1 111 233333333 334445677778788999999999999988763210 0 Q ss_pred --c-------CCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe----------------------cCCc----- Q lcl|NC_021324. 121 --S-------GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR----------------------DDVA----- 164 (480) Q Consensus 121 --~-------~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~----------------------~~~~----- 164 (480) . ......+++..++|.+++ +|+..+..-..++++.... .+.. T Consensus 180 ~~~~~~~~~v~~~~~~~r~~~v~~~di~--~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~d 257 (641) T protein:vir:94 180 IFGGWEDVAVNRQRSELRIEPLSPYDVW--LDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADPD 257 (641) T ss_pred hcccccccceecccceeeEEecchhhee--ecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhcccccccccc Confidence 0 012344566667777655 4443321111111111000 0000 Q ss_pred -----------eEEEEEEEe---CCc--EEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHH Q lcl|NC_021324. 165 -----------VPDRATLYL---PDE--TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 165 -----------~~~~~~~~~---~~~--~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) .+..+++|. .+. .+.|.....+ ..+ .....-+.|..+|++.++..+..++.||+|.. ..+ T Consensus 258 ~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g--~~i-l~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~-~~~ 333 (641) T protein:vir:94 258 TPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYG--KQL-IRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVL-HPN 333 (641) T ss_pred cccccccccccccceeeeeeeeccCCCceeeEEEEEeC--CEE-eecccccccCcCCeEEecceecCCcccCCChH-HHH Confidence 001111111 000 0000000000 001 11111123667899999888888999999976 458 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheecCCCCceEEEec--cccHH---HHHH Q lcl|NC_021324. 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFK--AAELR---NFAE 303 (480) Q Consensus 229 ~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~--~~~~~---~~~~ 303 (480) .+.+..+|.+.-...+.+....+|.+.+..... .....+...+|.++..... ..++-+. ..+.. ..++ T Consensus 334 l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~------~~~~~l~~~PG~ii~~~~~-~~v~pl~~~~~~~~~~~~~~~ 406 (641) T protein:vir:94 334 LGALHVLNVLTNGRLDNLVLHINKMWTLVEDGI------LKREDVKAKPGAVFKVAQH-GSLQPIDMGRQDFVVTYQEAQ 406 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCeeeeccccc------cccceeeccCCcceeeCCC-CcceeecCCccccchhHHHHH Confidence 899999999999999999999888765432100 0111234455555443322 1222221 12222 2233 Q ss_pred HHHHHHHHHHhhcCCChHHhccc--cccchHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHh----------- Q lcl|NC_021324. 304 EMEVFRKEAASITGLPPQYLSSS--SENPASAEAIIATDSRIVKMAERKGRIFGG-AWERAMRIAMQIM----------- 369 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~--~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~-~l~~~~~li~~~~----------- 369 (480) .++..+.+..... ....+.. +...-++..+..+......+....-+.|.. .+..+++.++.++ T Consensus 407 ~~~~~i~~~~~~~---~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R 483 (641) T protein:vir:94 407 VQESSVYRNTSTG---PLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIR 483 (641) T ss_pred HHHHHHHHhhhhh---hhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhh Confidence 3443333332222 1111111 111113444554455555555555555553 4444444332211 Q ss_pred --C------CCCcccceeeeEEecCCCCCCH---HHHHHHHH---HHHhccccCCC--------H---HHHHHhCCCC-- Q lcl|NC_021324. 370 --G------REVTEEYTRLETVWRDPSTPTV---AAKADAVS---KLYANGQGPIP--------K---EQARIDLGYT-- 422 (480) Q Consensus 370 --~------~~~~~~~~~~~v~f~~~~~~~~---~~~ad~~~---kl~~~~~~~~s--------~---e~~~~~~~~~-- 422 (480) + +-....-..+...|.- .+... .+.+..+. ++.+.. +..+ . +.+++..|+. T Consensus 484 ~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~-a~~P~v~d~~d~~~~~~~~~~~~g~~~p 561 (641) T protein:vir:94 484 MYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDIS-GRVPQIGQSLDYALILEDLLRQMRFTDP 561 (641) T ss_pred hhchhhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHh-hcChhhhhcCCHHHHHHHHHHHhCCCCc Confidence 0 0011111122222210 12221 22222232 222221 1111 1 2223333321 Q ss_pred ------hh-HHHHHHHHHHHHHHHHHHHHhh---hc---------ccCCCcCCCC---CCCCCCccCCCC--CCCCCccc Q lcl|NC_021324. 423 ------AT-QREQMRDWDKQETEDMIDTLYS---TT---------KAQADATPKP---TVTETKTETQTS--PSGFNRTK 478 (480) Q Consensus 423 ------~d-~~~~~~~~~~~~~~~~~~~~~~---~~---------~~~~~~~~~~---~~~d~~~~~~~~--~~~~~~~~ 478 (480) ++ +.+......++.++..+..... .. +.+..+.... ...+...+.-.+ |+=..+ + T Consensus 562 ~~~ir~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 640 (641) T protein:vir:94 562 MRYIKKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIAGMTPEDVSDLASRIGIDTSDVAPEAMAAATQQITSG-A 640 (641) T ss_pred hhhccCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHHHHhhcCCchhhhHHHHhccccccccc-C Confidence 11 0000000001001111100000 00 0000000000 000111100000 100000 0 Q ss_pred C Q lcl|NC_021324. 479 T 479 (480) Q Consensus 479 ~ 479 (480) - T Consensus 641 ~ 641 (641) T protein:vir:94 641 L 641 (641) T ss_pred C Confidence 0 No 126 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.91 E-value=1.6e-08 Score=63.36 Aligned_cols=433 Identities=13% Similarity=0.110 Sum_probs=193.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCc----ccc----hhhhcceeccchHHHHHHHHHhhhcc--- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGI----GAP----PELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~----~~~----~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |+ .+.+++++-.-...|.+.....++||+- .+|.+.. ... ...++.++..+-+...|+++++.|.+ T Consensus 1 ~~-~~~l~~r~~~l~~~R~~~e~~w~e~~~~--~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~lt 77 (547) T protein:vir:10 1 ME-NSKIVKRLDFLKTDRKNVEQIWDCIRKY--IMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLT 77 (547) T ss_pred CC-HHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhc Confidence 54 3555555443333343333333444422 2222211 111 11234567778888888888887732 Q ss_pred ---Cccc---cCCC-----ch-------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEE Q lcl|NC_021324. 70 ---EGFR---ISED-----SE-------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 70 ---~~~~---~~~d-----~~-------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~ 131 (480) .+|+ ..+. .+ ..+.+...+...+|.....++.++...+|.|-+++..+ .+..+.++++ T Consensus 78 Pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d----~~~~~~~r~~ 153 (547) T protein:vir:10 78 SPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEED----EDEEGSVVFQ 153 (547) T ss_pred CCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccC----CCCCCceeEE Confidence 2332 1111 11 11234556677889999999999999999997777532 2345678899 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEe-----------------------cCCceEEEEEEEe----CCc-------- Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR-----------------------DDVAVPDRATLYL----PDE-------- 176 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~----~~~-------- 176 (480) .++..++++.-|.. .++...+|.+... +.+.....+++++ ... T Consensus 154 ~~pl~~~~v~~d~~--G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~ 231 (547) T protein:vir:10 154 SSPIQDSYFEEDSR--GQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAG 231 (547) T ss_pred EeecceEEEeeCCC--cCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCcccc Confidence 99988888776653 3344444332210 0000001122221 100 Q ss_pred -----------EEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 177 -----------TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSA 245 (480) Q Consensus 177 -----------~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~ 245 (480) .+++...+. ... ..+.+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-..... T Consensus 232 ~~~~~~~~p~~s~~~e~~~~----~~~---l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~ 303 (547) T protein:vir:10 232 TVLAPTERPFGKKWILKEGA----VQL---GEEGGYYEMPAYAIRWRKSAGSQWGFGPSH-LALPDVLTANRYVELVLRS 303 (547) T ss_pred ceeeccccceeEEEEEecCc----eee---eecCCcccCCeeeeeeeecCCcccccchHH-HHHHHHHHHHHHHHHHHHH Confidence 001111100 000 123467788999998888889999999764 4667778888888888888 Q ss_pred HHHhcchhhhhcCCCccccccccccchhhhhcchheecCCCCceEEEecc-ccHH---HHHHHHHHHHHHHHhhcCCChH Q lcl|NC_021324. 246 SQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQ 321 (480) Q Consensus 246 ~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~---~~~~~l~~~~~~i~~~~~~p~~ 321 (480) ++....|.+.+.-. .-...++..+|.+...+ +...+..++. +++. ..++.++.-|...+....+. T Consensus 304 ~~~~~~pp~~v~~~--------g~~~~~~~~pgg~~~~~-~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~-- 372 (547) T protein:vir:10 304 SEKVIDPAIMVTER--------GLISDIDLGASGLTVVR-DMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQ-- 372 (547) T ss_pred HHHHhcCceecccc--------cccccceecCCeeeecC-CcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhh-- Confidence 89888888655310 00112344555554432 2223333332 2333 34444444444444332211 Q ss_pred HhccccccchHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHHhCCCC--------cccceeeeEEecCCC Q lcl|NC_021324. 322 YLSSSSENPASAEAIIATDSRIVKM----AERKG-RIFGGAWERAMRIAMQIMGREV--------TEEYTRLETVWRDPS 388 (480) Q Consensus 322 ~~g~~~~~~~Sg~Al~~~~~~l~~k----~~~~~-~~~~~~l~~~~~li~~~~~~~~--------~~~~~~~~v~f~~~~ 388 (480) .. +... -+++-++.....+... ..+.. ..+.+-+.+.+.++.+. +.. ..+...++|++..++ T Consensus 373 -~~-~~~~-~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~--g~lP~~p~~l~~~~~~~~~v~~is~L 447 (547) T protein:vir:10 373 -MK-DSPA-MTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRA--GKLGELPSKLLESGKAAMDIVYTGPL 447 (547) T ss_pred -cC-CCcc-ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCchhhhccCcceEEEEeccHH Confidence 11 1111 2333333332222221 11111 22233334444443321 111 113456778886555 Q ss_pred CCCHHH-HHHHH---HH----HHhcccc---CCCHHH----HHHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021324. 389 TPTVAA-KADAV---SK----LYANGQG---PIPKEQ----ARIDLGYT------ATQREQMRDWDKQETEDMIDTLYST 447 (480) Q Consensus 389 ~~~~~~-~ad~~---~k----l~~~~~~---~~s~e~----~~~~~~~~------~d~~~~~~~~~~~~~~~~~~~~~~~ 447 (480) .+.... .+.++ .+ +.+.++. .+.... +...+|.. +++++++.+.+.+.++....+ +. T Consensus 448 araq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qa--a~ 525 (547) T protein:vir:10 448 SRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQA--AI 525 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHH--HH Confidence 433211 11111 12 2222211 122222 23445653 444444433322222211111 11 Q ss_pred cccCCCcCCCCCCCCCC-ccCC Q lcl|NC_021324. 448 TKAQADATPKPTVTETK-TETQ 468 (480) Q Consensus 448 ~~~~~~~~~~~~~~d~~-~~~~ 468 (480) ..+.+++.+..+.++.+ -+.+ T Consensus 526 ~~~~g~~m~~~~~~~a~~~~~~ 547 (547) T protein:vir:10 526 AEAEGNAMEAQGKGQAALKENQ 547 (547) T ss_pred HHHHHHHHHhhcCcccchhccC Confidence 11111111111111111 0000 No 127 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.90 E-value=1.8e-08 Score=63.09 Aligned_cols=418 Identities=16% Similarity=0.102 Sum_probs=159.8 Q ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHhhc-cCcchhc------CcccchhhhcceeccchHHHHHHHHHhhhccCcccc-- Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLE-AEAYRNG-TRRLKTI------GIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-- 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~-~~~YY~G-~~~i~~~------~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-- 74 (480) .=|++.|..+... +.... -.+.|.. ...+... +..+.+.. -++... ...+|+..++-+-.-++.+ T Consensus 1 Mg~~~~l~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~-al~~~~--v~~~i~~ia~~iA~lp~~~~~ 75 (457) T protein:vir:62 1 MGFWSALFGRGHS--PALDAAEGRAWEPYDPSIYNLGATASSGERVTPHD-ALQVSA--VFASVRLLSETIATLPLSTYS 75 (457) T ss_pred Cchhhhhhccccc--cccccccccccccchhhhhhccccccCCceechHH-hhccHH--HHHHHHHHHHhHhhCceEEEE Confidence 2223322221100 00000 0000000 0000000 01111100 011111 1112222222221223321 Q ss_pred -CCC-c--hhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCce-eEEEEEcccceEEEEeCC Q lcl|NC_021324. 75 -SED-S--EGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPR 145 (480) Q Consensus 75 -~~d-~--~~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~-~~~~~~~p~~~~~v~d~~ 145 (480) .++ . .....+..++.. | ....+...+..+.+.+|.||+.+..+ .|. ..+..++|..+.+..+.. T Consensus 76 ~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-------~g~~~~l~~l~p~~v~v~~~~~ 148 (457) T protein:vir:62 76 KRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-------GPNIAGLDVLDPTKIHVHMVMV 148 (457) T ss_pred ecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-------CCcEEEEEEEcCcceEEEEecc Confidence 111 1 111223333322 2 24567778888999999999888532 233 356678888877655433 Q ss_pred CCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccch Q lcl|NC_021324. 146 NTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .... ....+.|. ....+.......|.++.+ +||++....+..+|.|.+. T Consensus 149 ~~~~-~~~~~~y~-~~~~g~~~~~~~~~~~ei-----------------------------ih~r~~~~~~~~~G~sp~~ 197 (457) T protein:vir:62 149 DGLR-RKVFEAYD-IDADGNEVLLGWFTPRDV-----------------------------LHIPGMMLPGDFVGCSPIS 197 (457) T ss_pred CCcc-ceeEEEEE-EccCCceeEEEeeCccce-----------------------------EEecCCCCCCceecccHHH Confidence 2211 11111121 112222222223344444 3444332223356777553 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcC-CCccccccccccchhhh------hcchheecCCCCceEEEecccc- Q lcl|NC_021324. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISG-VTTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE- 297 (480) Q Consensus 226 ~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G-~~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~- 297 (480) - +...+.....+.....+.+...+.|-.+|.- ..+.++..+.-...|+. ..++++.++ ++.++.++.... T Consensus 198 ~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~-~g~~~~~l~~~~~ 275 (457) T protein:vir:62 198 Y-ARESIGLALAAQKYGAHFFRNGAMPGAVVEVPGTMSEEGLARAREAWRAANSGVDNAHRVALLT-EGAKFSKVAMSPD 275 (457) T ss_pred H-HHHHHHHHHHHHHHHHHHHhccCCcceEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEccCChh Confidence 2 2233332222222222333333345444431 11111111111111211 123455565 446777765432 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 377 (480) -..|++..+..+.+|+..-++|+..+|....+..++..+......+...+ -.-+-..|+..+.. .+..... ... T Consensus 276 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~---l~P~~~~ie~~ln~--~L~~~~~-~~~ 349 (457) T protein:vir:62 276 EAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFS---LRPWLERIEAGFNR--LLFAETA-DRF 349 (457) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHH---HHHHHHHHHHHHHh--hhcCccc-cCc Confidence 22467877888999999999999998764433322222322222222211 01111112211111 1111111 112 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH---HHH-HHHHHHHHHHHHHHHhhhcccCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR---EQM-RDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~---~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) ..+++.+......|..++++++.+++++| +++...+++.+|+.+-+- .++ .-..-.. .+........... T Consensus 350 ~~i~fd~~~l~~~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~----~~~~~~~~~~~~~ 423 (457) T protein:vir:62 350 RFVKFNLDEIKRGAPKERMELWSLGLQNG--IYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGE----IGEEPEPEPAPAP 423 (457) T ss_pred eEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceeeecccccc----ccccccccccCCC Confidence 33455555666678899999999998876 667777888877643210 111 0000000 0000000000001 Q ss_pred cCCCCCC-----CCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 454 ATPKPTV-----TETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 454 ~~~~~~~-----~d~~~~~~~~~~~~~~~~~~ 480 (480) ++.++.. +...++.+++| ..+.++++ T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~ 454 (457) T protein:vir:62 424 PAIDPPAEEPADDEEPDNAEGDP-DEGETEDD 454 (457) T ss_pred ccCCCCccCCCCCCCCCCCCCCC-cccccccc Confidence 1111111 11112222332 22233334 No 128 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.84 E-value=1.7e-08 Score=63.18 Aligned_cols=397 Identities=11% Similarity=0.039 Sum_probs=162.4 Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc--- Q lcl|NC_021324. 5 HEHVERLQGLLARDLPN-------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--- 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~-------~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~--- 74 (480) .-+++++.....+.... ...+..+.-+... +..+.... -.+ +.-...+|+..++-+-.-+|.+ T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~~~-al~--~~~v~~~i~~ia~~ia~l~~~~~~~ 73 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKGKN-ALK--VATVFACIKILSESVSKLPLKIYQE 73 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCC----cceechhh-hhc--cHHHHHHHHHHHHhhccCceEEEEe Confidence 44444444322211100 0111111111000 00110100 001 1112233333333222223322 Q ss_pred CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCC Q lcl|NC_021324. 75 SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPR 145 (480) Q Consensus 75 ~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~ 145 (480) .++ +.....+..++.. | ....+...+..+.+.+|.||+++..+ ..|++ .+..++|..+.++.+.. T Consensus 74 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~ 147 (429) T protein:vir:10 74 DEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYIDDV 147 (429) T ss_pred cCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEcCc Confidence 111 1112235555532 3 24467788888999999999998653 34554 67788888887766543 Q ss_pred CCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccch Q lcl|NC_021324. 146 NTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS 225 (480) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~ 225 (480) .. +....+.|+.....+.. ..|.++ -|+||++.....+.+|.|.+. T Consensus 148 ~~--~~~~~~~~~~~~~~g~~---~~~~~~-----------------------------evih~~~~~~~~~~~G~s~i~ 193 (429) T protein:vir:10 148 GL--LNSKTKMWYVVNTGGQQ---RVLKPE-----------------------------EILHFKNGITLDGLVGVPTME 193 (429) T ss_pred cc--ccccceEEEEEccCCeE---EEEccc-----------------------------cEEEecCCCCCCCcccccHHH Confidence 21 11111111111111110 011111 256666544455677877653 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecccc- Q lcl|NC_021324. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE- 297 (480) Q Consensus 226 ~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~- 297 (480) . +...++....+.......+...+.|-.+++.- .+.+...+.-...|+. ..++++.++ ++.++.+++... T Consensus 194 ~-~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~ 271 (429) T protein:vir:10 194 Y-LKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPISLNMS 271 (429) T ss_pred H-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhccccccCceeecC-CCceEEEccCChh Confidence 2 33333332222222222233333355444321 1111111111111211 123455554 345777775432 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCC Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGRE 372 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~~ 372 (480) -..+++..+....+|+..-++|+..+|....+.-|. +......+ +...|.-++..+.. +.... T Consensus 272 d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn--~e~~~~~f----------~~~~l~P~~~~ie~~ln~kl~~~~ 339 (429) T protein:vir:10 272 DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQQQQF----------YTDTLQATLTMYEQEMTYKLFLDS 339 (429) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHHHHH----------HHHHHHHHHHHHHHHHHHhhcChh Confidence 223677778888999999999999997533221121 11111111 11122222221111 11111 Q ss_pred CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCC Q lcl|NC_021324. 373 VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQA 452 (480) Q Consensus 373 ~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (480) .......+++.+......|..++++++.+++.+| +++...+++.+|+.+-+ ...+...-..-..++.... T Consensus 340 ~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~--ggD~~~~~~n~~~~d~~~~------ 409 (429) T protein:vir:10 340 ELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEA--GGDRLLVNGNMLPIDMAGQ------ 409 (429) T ss_pred hcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CcCeeeecccccchhhccc------ Confidence 1111122444444556678899999999998876 66777778888775422 0000000000000110000 Q ss_pred CcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 453 DATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 453 ~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) ....+++.+++...+.+.+| T Consensus 410 ---~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 410 ---AYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred ---cccCCCCCCCCCCCCCCCCC Confidence 01112222222222221222 No 129 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.83 E-value=3.4e-08 Score=61.53 Aligned_cols=423 Identities=11% Similarity=0.080 Sum_probs=160.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC--CCc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS--EDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~--~d~ 78 (480) +++..|.-..++.+.. ..-...-+ |.+++..-+... ..+.+..-...+...+|+..+..+..-++.+. ++. T Consensus 51 ~~~~~d~~~~~~~r~g-----~~~~~~~~-g~~~~~epp~d~-~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~ 123 (648) T protein:vir:79 51 GSAKRDPKMSLVKRIG-----LAIMDGGG-GGRDFEEPEFDF-NEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPN 123 (648) T ss_pred ccccccchhHHHHHhH-----HHHHhhcC-CccccccCCcCH-HHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCc Confidence 2232222111111110 11111112 222331111111 11222222345667777777666655565432 221 Q ss_pred hhHH-HHHH-HHHhc---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-E---------------EEEEcccc Q lcl|NC_021324. 79 EGLE-ELWN-WWQAN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-L---------------IRVESPLY 137 (480) Q Consensus 79 ~~~~-~l~~-~~~~n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~---------------~~~~~p~~ 137 (480) .... .... +...| ....+...+..+.+.+|.||+.+-.+. +|.+ + +..++|.. T Consensus 124 ~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~------~G~~~~~l~~~~~~~~~~v~~l~pl~p~~ 197 (648) T protein:vir:79 124 AVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAK------DALPFQGMNVMGVGDSMPVAGYFPLNLAS 197 (648) T ss_pred cchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecC------CCccchhhhhhhhccccceeeeEeecCce Confidence 1111 1111 11222 345677888899999999999876543 2211 0 11112222 Q ss_pred eEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCc Q lcl|NC_021324. 138 MYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGN 217 (480) Q Consensus 138 ~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~ 217 (480) +.+.. ++.+.+.. |.|...++.. ...|..=-|+||+......+ T Consensus 198 v~v~~------------------d~~g~~~~---------Y~y~~~g~~~----------~~~~~~~dIIHik~~~~~d~ 240 (648) T protein:vir:79 198 MKVKR------------------DKFGMIKG---------WQQEQEGQDK----------PQKFKPEDIVHIYYKREKGR 240 (648) T ss_pred eEEEE------------------cCCCceee---------eEEEecCCce----------eEEecCccEEEEccCCCCCC Confidence 21111 11111110 1111111100 00011113678876656677 Q ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc--chhhhhcchheecCCC-CceEEEec Q lcl|NC_021324. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN--TTLDIYYGRILTLASE-AAKISEFK 294 (480) Q Consensus 218 ~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~--~~~~~~~~~~~~~~g~-~~~~~~~~ 294 (480) .+|.|.|.- +...++....+.....+.+...+.|-.++.-.......+.... ..+.-....+...++. ..+...++ T Consensus 241 ~~GlSpi~~-a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i~gg~v~~~~~~i~ 319 (648) T protein:vir:79 241 AFGTPWLLP-ALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQEGFGAEEGEVDLVRGEVENMDVEGGMVTTERVNIS 319 (648) T ss_pred ceeccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCccchHHHHHHHHHHHHhcccccccccccccceeecc Confidence 899987643 3333333233323333334444555544431011110010000 1111111112222221 22222222 Q ss_pred c-cc--HHHHHHHHHHHHHHHHhhcCCChHHhcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH-H- Q lcl|NC_021324. 295 A-AE--LRNFAEEMEVFRKEAASITGLPPQYLSSSS-ENPASAEAIIATDSRIVKMAERKGRIFGGAWERA-MRIAM-Q- 367 (480) Q Consensus 295 ~-~~--~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~-~~li~-~- 367 (480) . .+ .-.|++..+....+|+...++|+..+|... .+.+.+.+....+.. .+.-....+...+... ++.++ . T Consensus 320 ~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~~---~i~~l~~~i~~~le~~~~~~ll~e~ 396 (648) T protein:vir:79 320 SIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFKD---RIKALQKVMATFINEFMVKEILMEG 396 (648) T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhh Confidence 1 11 113777778888999999999999997532 222233333322222 1111112222222211 11111 0 Q ss_pred HhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH--HH--HHH-HHHHHHHHHHHH Q lcl|NC_021324. 368 IMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ--RE--QMR-DWDKQETEDMID 442 (480) Q Consensus 368 ~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~--~~--~~~-~~~~~~~~~~~~ 442 (480) ........ ...+++.|.+....+....++.+.++.++| +++...+++++|+.+-+ .. .+- .+.....+...+ T Consensus 397 ~l~~~l~~-d~~ieF~~~~Llr~D~~~~a~~~~~l~~~G--ilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~ 473 (648) T protein:vir:79 397 GFDPVLNP-DDKVEFRFNEIDMDSKIKLENQAVFLYEHN--AISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALA 473 (648) T ss_pred hccccccc-cceEEEeecccchhhHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCccccccccccchhccccc Confidence 01111111 134677777777778888888888888876 67888888888875422 00 010 000000000000 Q ss_pred HHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 443 TLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 443 ~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) .......+........++....++....|.+.++.+++ T Consensus 474 ~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~ 511 (648) T protein:vir:79 474 ALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTS 511 (648) T ss_pred cCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCC Confidence 00000000000000000000001111112222222222 No 130 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.77 E-value=5.5e-08 Score=60.38 Aligned_cols=453 Identities=12% Similarity=0.040 Sum_probs=190.2 Q ss_pred CCCHHHH--HHHHHHHHHHHHHH-HHHHHHHhhccCcchhcC----ccc-chhhhcceeccchHHHHHHHHHhhhcc--- Q lcl|NC_021324. 1 MTTYHEH--VERLQGLLARDLPN-LLEAEAYRNGTRRLKTIG----IGA-PPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~~~~~-~~~~~~YY~G~~~i~~~~----~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+..+. +.+-.+.+..++.. ....+.+|+ +.+|..+ ... ....+..++..+-+...++++++.|.+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhc Confidence 8877552 33333333333322 233334442 2233321 111 112224456677778888888877632 Q ss_pred ---Cccc-cC-CCch-------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEE Q lcl|NC_021324. 70 ---EGFR-IS-EDSE-------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 70 ---~~~~-~~-~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~ 131 (480) .+|+ .. .|.+ ..+.+...+..++|.....++.++...+|.|-+++.+ |.++.++++ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~------d~~~~~rf~ 152 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP------DFDAVVYHH 152 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec------CCCceEEEE Confidence 2232 11 1111 1123455667788999999999999999999887654 335667888 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEe-------c--------------CCceEEEEEEEeCCcEEEEEecCCC---- Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR-------D--------------DVAVPDRATLYLPDETVPLRRNGGL---- 186 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~--------------~~~~~~~~~~~~~~~~~~~~~~~~~---- 186 (480) .++..++++.-|.. .++...+|.+.-. . ......++++++. .|.+.... T Consensus 153 ~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~ 226 (555) T protein:vir:10 153 SLTAGEYAIAADNQ--GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKR 226 (555) T ss_pred EeecceeEEeeCCC--CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCC Confidence 88888887766544 3444444432110 0 0011122333211 01000000 Q ss_pred -----ccc-ccccc----c--cccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhh Q lcl|NC_021324. 187 -----NDQ-WVVDG----D--VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLR 254 (480) Q Consensus 187 -----~~~-~~~~~----~--~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l 254 (480) +.. ++++. . ..+-+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-.....++...+|.+ T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~-~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (555) T protein:vir:10 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAM-EALGDVRQLQHEQLRKAQAIDYKSNPPL 305 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 000 11110 0 122356788999888888889999999664 3556667777776667777888788766 Q ss_pred hhcCCCccccccccccchhhhhcchheec-CCCCce--EEEecc-ccHHHHHHHHHHHHHHHHhhcCCC-hHHhcccccc Q lcl|NC_021324. 255 VISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAK--ISEFKA-AELRNFAEEMEVFRKEAASITGLP-PQYLSSSSEN 329 (480) Q Consensus 255 ~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~--~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p-~~~~g~~~~~ 329 (480) .+.- +.....+...+|.+... .|...+ .-.++. .+++...+.++.+...|....-.+ ...+...... T Consensus 306 ~v~~--------~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~ 377 (555) T protein:vir:10 306 QLPV--------SAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNP 377 (555) T ss_pred eecc--------ccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Confidence 5531 11111234445544222 121111 112222 234444444444433332222111 0112111111 Q ss_pred chHHHHHHHHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHHHhCCC---CcccceeeeEEecCCCCCCHH-HHH---- Q lcl|NC_021324. 330 PASAEAIIATDSRIVKMA----ER-KGRIFGGAWERAMRIAMQIMGRE---VTEEYTRLETVWRDPSTPTVA-AKA---- 396 (480) Q Consensus 330 ~~Sg~Al~~~~~~l~~k~----~~-~~~~~~~~l~~~~~li~~~~~~~---~~~~~~~~~v~f~~~~~~~~~-~~a---- 396 (480) .-+++.++.....+.... .+ ....+.+-+.+.+.++.+..-.. .......++|.|..++..... ..+ T Consensus 378 ~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~ 457 (555) T protein:vir:10 378 QMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVD 457 (555) T ss_pred cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHH Confidence 123444433322222111 11 11222333344444433210000 011123477777555432211 111 Q ss_pred ---HHHHHHHhcccc---CCCH----HHHHHhCCCC------hhHHHHHHHHHHHHHHHH-HHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 397 ---DAVSKLYANGQG---PIPK----EQARIDLGYT------ATQREQMRDWDKQETEDM-IDTLYSTTKAQADATPKPT 459 (480) Q Consensus 397 ---d~~~kl~~~~~~---~~s~----e~~~~~~~~~------~d~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 459 (480) ..+..+.+.++. .+.. ..+...+|+. +++++++.+.+.+.++.. ..++..... +.+..-. T Consensus 458 ~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~---~~~~~~~ 534 (555) T protein:vir:10 458 RFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGA---DTAAKLG 534 (555) T ss_pred HHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhc Confidence 111122222211 1222 2233445653 444554443332222211 111111111 1111000 Q ss_pred CCCC-CccCCCCCCCCCcccC Q lcl|NC_021324. 460 VTET-KTETQTSPSGFNRTKT 479 (480) Q Consensus 460 ~~d~-~~~~~~~~~~~~~~~~ 479 (480) ..+. ..+....-.++.++=| T Consensus 535 ~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 535 SVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred ccccCcchhHHHHHhhhccCC Confidence 0111 1111222233334444 No 131 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.77 E-value=5.5e-08 Score=60.38 Aligned_cols=453 Identities=12% Similarity=0.040 Sum_probs=190.2 Q ss_pred CCCHHHH--HHHHHHHHHHHHHH-HHHHHHHhhccCcchhcC----ccc-chhhhcceeccchHHHHHHHHHhhhcc--- Q lcl|NC_021324. 1 MTTYHEH--VERLQGLLARDLPN-LLEAEAYRNGTRRLKTIG----IGA-PPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~~~~~-~~~~~~YY~G~~~i~~~~----~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+..+. +.+-.+.+..++.. ....+.+|+ +.+|..+ ... ....+..++..+-+...++++++.|.+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhc Confidence 8877552 33333333333322 233334442 2233321 111 112224456677778888888877632 Q ss_pred ---Cccc-cC-CCch-------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEE Q lcl|NC_021324. 70 ---EGFR-IS-EDSE-------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 70 ---~~~~-~~-~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~ 131 (480) .+|+ .. .|.+ ..+.+...+..++|.....++.++...+|.|-+++.+ |.++.++++ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~------d~~~~~rf~ 152 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP------DFDAVVYHH 152 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec------CCCceEEEE Confidence 2232 11 1111 1123455667788999999999999999999887654 335667888 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEe-------c--------------CCceEEEEEEEeCCcEEEEEecCCC---- Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR-------D--------------DVAVPDRATLYLPDETVPLRRNGGL---- 186 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~--------------~~~~~~~~~~~~~~~~~~~~~~~~~---- 186 (480) .++..++++.-|.. .++...+|.+.-. . ......++++++. .|.+.... T Consensus 153 ~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~ 226 (555) T protein:vir:10 153 SLTAGEYAIAADNQ--GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKR 226 (555) T ss_pred EeecceeEEeeCCC--CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCC Confidence 88888887766544 3444444432110 0 0011122333211 01000000 Q ss_pred -----ccc-ccccc----c--cccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhh Q lcl|NC_021324. 187 -----NDQ-WVVDG----D--VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLR 254 (480) Q Consensus 187 -----~~~-~~~~~----~--~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l 254 (480) +.. ++++. . ..+-+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-.....++...+|.+ T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~-~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (555) T protein:vir:10 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAM-EALGDVRQLQHEQLRKAQAIDYKSNPPL 305 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 000 11110 0 122356788999888888889999999664 3556667777776667777888788766 Q ss_pred hhcCCCccccccccccchhhhhcchheec-CCCCce--EEEecc-ccHHHHHHHHHHHHHHHHhhcCCC-hHHhcccccc Q lcl|NC_021324. 255 VISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAK--ISEFKA-AELRNFAEEMEVFRKEAASITGLP-PQYLSSSSEN 329 (480) Q Consensus 255 ~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~--~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p-~~~~g~~~~~ 329 (480) .+.- +.....+...+|.+... .|...+ .-.++. .+++...+.++.+...|....-.+ ...+...... T Consensus 306 ~v~~--------~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~ 377 (555) T protein:vir:10 306 QLPV--------SAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNP 377 (555) T ss_pred eecc--------ccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Confidence 5531 11111234445544222 121111 112222 234444444444433332222111 0112111111 Q ss_pred chHHHHHHHHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHHHhCCC---CcccceeeeEEecCCCCCCHH-HHH---- Q lcl|NC_021324. 330 PASAEAIIATDSRIVKMA----ER-KGRIFGGAWERAMRIAMQIMGRE---VTEEYTRLETVWRDPSTPTVA-AKA---- 396 (480) Q Consensus 330 ~~Sg~Al~~~~~~l~~k~----~~-~~~~~~~~l~~~~~li~~~~~~~---~~~~~~~~~v~f~~~~~~~~~-~~a---- 396 (480) .-+++.++.....+.... .+ ....+.+-+.+.+.++.+..-.. .......++|.|..++..... ..+ T Consensus 378 ~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~ 457 (555) T protein:vir:10 378 QMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVD 457 (555) T ss_pred cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHH Confidence 123444433322222111 11 11222333344444433210000 011123477777555432211 111 Q ss_pred ---HHHHHHHhcccc---CCCH----HHHHHhCCCC------hhHHHHHHHHHHHHHHHH-HHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 397 ---DAVSKLYANGQG---PIPK----EQARIDLGYT------ATQREQMRDWDKQETEDM-IDTLYSTTKAQADATPKPT 459 (480) Q Consensus 397 ---d~~~kl~~~~~~---~~s~----e~~~~~~~~~------~d~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 459 (480) ..+..+.+.++. .+.. ..+...+|+. +++++++.+.+.+.++.. ..++..... +.+..-. T Consensus 458 ~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~---~~~~~~~ 534 (555) T protein:vir:10 458 RFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGA---DTAAKLG 534 (555) T ss_pred HHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhc Confidence 111122222211 1222 2233445653 444554443332222211 111111111 1111000 Q ss_pred CCCC-CccCCCCCCCCCcccC Q lcl|NC_021324. 460 VTET-KTETQTSPSGFNRTKT 479 (480) Q Consensus 460 ~~d~-~~~~~~~~~~~~~~~~ 479 (480) ..+. ..+....-.++.++=| T Consensus 535 ~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:10 535 SVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred ccccCcchhHHHHHhhhccCC Confidence 0111 1111222233334444 No 132 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.77 E-value=5.5e-08 Score=60.38 Aligned_cols=453 Identities=12% Similarity=0.040 Sum_probs=190.2 Q ss_pred CCCHHHH--HHHHHHHHHHHHHH-HHHHHHHhhccCcchhcC----ccc-chhhhcceeccchHHHHHHHHHhhhcc--- Q lcl|NC_021324. 1 MTTYHEH--VERLQGLLARDLPN-LLEAEAYRNGTRRLKTIG----IGA-PPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~~~~~-~~~~~~YY~G~~~i~~~~----~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.+..+. +.+-.+.+..++.. ....+.+|+ +.+|..+ ... ....+..++..+-+...++++++.|.+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQDRNRGEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCCCCCcchhcccccccccHHHHHHHHHHHHHHhhc Confidence 8877552 33333333333322 233334442 2233321 111 112224456677778888888877632 Q ss_pred ---Cccc-cC-CCch-------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEE Q lcl|NC_021324. 70 ---EGFR-IS-EDSE-------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 70 ---~~~~-~~-~d~~-------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~ 131 (480) .+|+ .. .|.+ ..+.+...+..++|.....++.++...+|.|-+++.+ |.++.++++ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~------d~~~~~rf~ 152 (555) T protein:vir:98 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP------DFDAVVYHH 152 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec------CCCceEEEE Confidence 2232 11 1111 1123455667788999999999999999999887654 335667888 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEe-------c--------------CCceEEEEEEEeCCcEEEEEecCCC---- Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTR-------D--------------DVAVPDRATLYLPDETVPLRRNGGL---- 186 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~--------------~~~~~~~~~~~~~~~~~~~~~~~~~---- 186 (480) .++..++++.-|.. .++...+|.+.-. . ......++++++. .|.+.... T Consensus 153 ~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~ 226 (555) T protein:vir:98 153 SLTAGEYAIAADNQ--GRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKR 226 (555) T ss_pred EeecceeEEeeCCC--CCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCC Confidence 88888887766544 3444444432110 0 0011122333211 01000000 Q ss_pred -----ccc-ccccc----c--cccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhh Q lcl|NC_021324. 187 -----NDQ-WVVDG----D--VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLR 254 (480) Q Consensus 187 -----~~~-~~~~~----~--~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l 254 (480) +.. ++++. . ..+-+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-.....++...+|.+ T Consensus 227 ~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~~~~ge~YGrgp~~-~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (555) T protein:vir:98 227 DDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWALVGGDIYGNSPAM-EALGDVRQLQHEQLRKAQAIDYKSNPPL 305 (555) T ss_pred CccccceEEEEEEeccCCccccccCCcccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 000 11110 0 122356788999888888889999999664 3556667777776667777888788766 Q ss_pred hhcCCCccccccccccchhhhhcchheec-CCCCce--EEEecc-ccHHHHHHHHHHHHHHHHhhcCCC-hHHhcccccc Q lcl|NC_021324. 255 VISGVTTDELTNDGENTTLDIYYGRILTL-ASEAAK--ISEFKA-AELRNFAEEMEVFRKEAASITGLP-PQYLSSSSEN 329 (480) Q Consensus 255 ~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~--~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p-~~~~g~~~~~ 329 (480) .+.- +.....+...+|.+... .|...+ .-.++. .+++...+.++.+...|....-.+ ...+...... T Consensus 306 ~v~~--------~~~~~~~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~ 377 (555) T protein:vir:98 306 QLPV--------SAKNQDISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNP 377 (555) T ss_pred eecc--------ccccccceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCC Confidence 5531 11111234445544222 121111 112222 234444444444433332222111 0112111111 Q ss_pred chHHHHHHHHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHHHhCCC---CcccceeeeEEecCCCCCCHH-HHH---- Q lcl|NC_021324. 330 PASAEAIIATDSRIVKMA----ER-KGRIFGGAWERAMRIAMQIMGRE---VTEEYTRLETVWRDPSTPTVA-AKA---- 396 (480) Q Consensus 330 ~~Sg~Al~~~~~~l~~k~----~~-~~~~~~~~l~~~~~li~~~~~~~---~~~~~~~~~v~f~~~~~~~~~-~~a---- 396 (480) .-+++.++.....+.... .+ ....+.+-+.+.+.++.+..-.. .......++|.|..++..... ..+ T Consensus 378 ~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~ 457 (555) T protein:vir:98 378 QMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVD 457 (555) T ss_pred cccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHH Confidence 123444433322222111 11 11222333344444433210000 011123477777555432211 111 Q ss_pred ---HHHHHHHhcccc---CCCH----HHHHHhCCCC------hhHHHHHHHHHHHHHHHH-HHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 397 ---DAVSKLYANGQG---PIPK----EQARIDLGYT------ATQREQMRDWDKQETEDM-IDTLYSTTKAQADATPKPT 459 (480) Q Consensus 397 ---d~~~kl~~~~~~---~~s~----e~~~~~~~~~------~d~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 459 (480) ..+..+.+.++. .+.. ..+...+|+. +++++++.+.+.+.++.. ..++..... +.+..-. T Consensus 458 ~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~---~~~~~~~ 534 (555) T protein:vir:98 458 RFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGA---DTAAKLG 534 (555) T ss_pred HHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhc Confidence 111122222211 1222 2233445653 444554443332222211 111111111 1111000 Q ss_pred CCCC-CccCCCCCCCCCcccC Q lcl|NC_021324. 460 VTET-KTETQTSPSGFNRTKT 479 (480) Q Consensus 460 ~~d~-~~~~~~~~~~~~~~~~ 479 (480) ..+. ..+....-.++.++=| T Consensus 535 ~~~~~~~~~~~~~~~~~~~~~ 555 (555) T protein:vir:98 535 SVDTSKQNALTDVTRAFSGYT 555 (555) T ss_pred ccccCcchhHHHHHhhhccCC Confidence 0111 1111222233334444 No 133 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.77 E-value=5.5e-08 Score=60.36 Aligned_cols=409 Identities=15% Similarity=0.130 Sum_probs=158.0 Q ss_pred CCCHH--HHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchh---hhcceeccchHHHHHHHHHhhhccCcccc- Q lcl|NC_021324. 1 MTTYH--EHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPE---LAYLDVQPGWVATYLRTLSDRLDIEGFRI- 74 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~---~~~~~~~~n~~~~iVd~~~~~l~~~~~~~- 74 (480) .+++. ++.. .+..-|-|... .+...... .......+.....+|+..+.-+-.-+|.+ T Consensus 9 ~~~p~~~~~~~--------------~~~~~~~~~~~---~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~lp~~l~ 71 (518) T protein:vir:78 9 LSAPAMAELSP--------------QMQDSYYYAPA---VGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALARLPVKCM 71 (518) T ss_pred eccchhhhhhh--------------hhhhcccccce---eceecccccchhhHHhhhhHHHHHHHHHHHHhhccCceEEE Confidence 33332 1111 01111111110 00000000 00011112233444444444332223321 Q ss_pred --CCCc---hhHHHHHHHHHh-cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeC Q lcl|NC_021324. 75 --SEDS---EGLEELWNWWQA-ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDP 144 (480) Q Consensus 75 --~~d~---~~~~~l~~~~~~-n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~ 144 (480) .++. .....+..++.+ |. ...+...+..+.+.+|.+|+++-.+ ..|.+ .+..++|..+.+..+. T Consensus 72 ~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~------~~G~~~~L~~l~p~~Vtv~~~~ 145 (518) T protein:vir:78 72 FTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKN------KSGTPEKLMPMHPSRVAIKRNS 145 (518) T ss_pred EEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEECCCceEEEEcC Confidence 1111 111233444443 32 3356677888899999999998653 34555 5778889888888765 Q ss_pred CCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccc Q lcl|NC_021324. 145 RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i 224 (480) ... . +.+++...+ +.......+.++++ +||++....+..+|.|.| T Consensus 146 ~~~-~----~~y~~~~~~-~~~~~~~~~~~~eI-----------------------------iHir~~~~dg~~~G~Spi 190 (518) T protein:vir:78 146 RTG-R----YEYYFQAGA-GVGTQLVSFADDEV-----------------------------VPIRFFNPDGLERGLSLM 190 (518) T ss_pred CCC-E----EEEEEEecC-CccceeEEecCCcE-----------------------------EEecCCCCCcccccccHH Confidence 432 1 111111111 11111112233333 344332222224566655 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecccc Q lcl|NC_021324. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE 297 (480) Q Consensus 225 ~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~ 297 (480) .. +...+.....+.....+.+...+.|-.+|+.- ...+...+.-...|+. ..++++.++ ++.++..+.... T Consensus 191 ~~-~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~ls~e~~~~~k~~~~~~~~G~~nag~~~vL~-~G~~~~~l~~~~ 268 (518) T protein:vir:78 191 ES-LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKRLSPEAQQRLREQFDRAHAGSSNTGKTMVVE-EGMEPIPLQLTA 268 (518) T ss_pred HH-HHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccCCceeEcC-CCceEEeccCCh Confidence 32 22223322222222333333344455555421 1111111111111211 123455665 346777765432 Q ss_pred -HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccc Q lcl|NC_021324. 298 -LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEE 376 (480) Q Consensus 298 -~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 376 (480) -..|++..+....+|+..-++|+..+|....+.-|. +......+...+ -.-+-..++..+.. .+...... T Consensus 269 ~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn--~e~~~~~f~~~t---L~P~~~~ie~eln~--~L~~~~~~-- 339 (518) T protein:vir:78 269 VEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSN--ISAQMRAFYRDT---MAIPIARIQSAMDK--YVGQYWVR-- 339 (518) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchh--HHHHHHHHHHHH---HHHHHHHHHHHHHH--hhcccccC-- Confidence 224778888888999999999999997543221121 111111111110 00001111111110 01111111 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhc--ccCCCc Q lcl|NC_021324. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT--KAQADA 454 (480) Q Consensus 377 ~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~ 454 (480) ...+++........|..++++++.+++.+| +++...+++.+|+.+-+-....+......-..++...... .+++.. T Consensus 340 ~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G--~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~ 417 (518) T protein:vir:78 340 KNRMKFDIDDVIQPDWEAKSESTQKMVNSG--VATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPA 417 (518) T ss_pred cceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCC Confidence 123444445566788899999999998876 6677678888776432100000000000000000000000 000000 Q ss_pred CCCCCC------CC--CCccCCCCCCCC----CcccCC Q lcl|NC_021324. 455 TPKPTV------TE--TKTETQTSPSGF----NRTKTR 480 (480) Q Consensus 455 ~~~~~~------~d--~~~~~~~~~~~~----~~~~~~ 480 (480) .+.+.. ++ +++....++... +++++. T Consensus 418 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (518) T protein:vir:78 418 PKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKTE 455 (518) T ss_pred CCCCCcccccccccCccccCCCCCcccccccccccccc Confidence 011110 00 011111111111 111111 No 134 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.75 E-value=6.3e-08 Score=60.06 Aligned_cols=439 Identities=13% Similarity=0.114 Sum_probs=187.1 Q ss_pred CCCHH------HHHHHHHHHHHHH----HHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhcc- Q lcl|NC_021324. 1 MTTYH------EHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI- 69 (480) Q Consensus 1 ~~~~~------~~i~~l~~~~~~~----~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~- 69 (480) |++-+ +.++.+.+.+..+ ..+.+.+.+|..-.- .. ...........++..+-+...++++++.|.+ T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~-~~--~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL-FP--KESDNESTDYTTPWQAVGARGLNNLASKLMLA 77 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-cC--CCCCcccccccccccccHHHHHHHHHHHHHHh Confidence 65543 3344455555443 344455555542210 10 0111111112244555667777777776632 Q ss_pred ----Cccc-c--CC--------Cc-----------hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCC Q lcl|NC_021324. 70 ----EGFR-I--SE--------DS-----------EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD 123 (480) Q Consensus 70 ----~~~~-~--~~--------d~-----------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d 123 (480) .+|+ . .+ +. +..+.+...+..++|.....++.++...+|.|-+++.. + T Consensus 78 ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~------~ 151 (535) T protein:vir:15 78 LFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPE------P 151 (535) T ss_pred hcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeec------C Confidence 1222 1 11 11 01123444567788999999999999999998776643 3 Q ss_pred CCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEec-----C-----------CceEEEEEEEeC-----C--cEEEE Q lcl|NC_021324. 124 PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD-----D-----------VAVPDRATLYLP-----D--ETVPL 180 (480) Q Consensus 124 ~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~-----~-----------~~~~~~~~~~~~-----~--~~~~~ 180 (480) .++..+++.++-.+++..-|.. .++...+|.+...- + ......+++|+. + .+..| T Consensus 152 ~~~~~~f~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~ 229 (535) T protein:vir:15 152 EGSYNPMKLYRLSSYVVQRDAY--GNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKY 229 (535) T ss_pred CCCceeeEEEEcCeeEEeeCCC--CCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEE Confidence 3566778888777766665543 33444444432210 0 000112223221 1 01111 Q ss_pred EecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--C Q lcl|NC_021324. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G 258 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G 258 (480) ....+. .........+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-......+....|.+.+. | T Consensus 230 ~e~~g~----~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g 304 (535) T protein:vir:15 230 EEVEDV----EIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCE-EYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAG 304 (535) T ss_pred EEeeCc----cccccccccccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 111110 01111123457789999888888889999999764 4667788888888888888888888876652 1 Q ss_pred CCccccccccccchhhhhcchheecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHH Q lcl|NC_021324. 259 VTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII 337 (480) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~ 337 (480) ... ........++.+..-..++..+.++.. ++++...+.++.+...|.... +.+ .+.......-+++-++ T Consensus 305 ~~~-------~~~l~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~ 375 (535) T protein:vir:15 305 ITQ-------PRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLN-SAVQRTGERVTAEEIR 375 (535) T ss_pred ccc-------chhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH-hhh-hcccCCCccccHHHHH Confidence 100 001112222333332333444544432 344444444444433332221 111 1211111112333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHh--CCCCc-ccceeeeEEecCCCCCCH-HHHHHHHHHHH-- Q lcl|NC_021324. 338 ATDSRIVKMAERKGRIFGGAWERA--------MRIAMQIM--GREVT-EEYTRLETVWRDPSTPTV-AAKADAVSKLY-- 403 (480) Q Consensus 338 ~~~~~l~~k~~~~~~~~~~~l~~~--------~~li~~~~--~~~~~-~~~~~~~v~f~~~~~~~~-~~~ad~~~kl~-- 403 (480) . ++..+...++..+.++ +...+.++ .+..+ .....++++|.-++..-- ...++.+.+.+ T Consensus 376 ~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~ 448 (535) T protein:vir:15 376 Y-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISA 448 (535) T ss_pred H-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHH Confidence 3 3333344444433332 22222222 11111 112336777755543211 11222222222 Q ss_pred --hcccc----CCCHHH----HHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhhhcc--cCCCcCCCCCCCCCC Q lcl|NC_021324. 404 --ANGQG----PIPKEQ----ARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTK--AQADATPKPTVTETK 464 (480) Q Consensus 404 --~~~~~----~~s~e~----~~~~~~~~-------~d~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~d~~ 464 (480) +.++. .+.... +...+|.. +++++++.+...+++ .. .+..+... ..+.++..|+..... T Consensus 449 la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~-~~-~~~a~~~g~~~~~~~~~~p~~~~~~ 526 (535) T protein:vir:15 449 WAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQT-GI-ENAAATGGAGVGALATSSPEAMQGA 526 (535) T ss_pred HHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHH-HH-HHHHHHHHhhccchhccChHHHHHH Confidence 22111 122222 22334543 233333322111111 11 11111111 111222222222222 Q ss_pred ccCCCCCCC Q lcl|NC_021324. 465 TETQTSPSG 473 (480) Q Consensus 465 ~~~~~~~~~ 473 (480) ...-|.+++ T Consensus 527 ~~~~g~~~~ 535 (535) T protein:vir:15 527 AAQAGLDAT 535 (535) T ss_pred HhccCCCCC Confidence 222222222 No 135 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=98.75 E-value=6.4e-08 Score=60.02 Aligned_cols=439 Identities=13% Similarity=0.086 Sum_probs=179.0 Q ss_pred CCCHH-----HHHHHHHHHHHHHH----HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC- Q lcl|NC_021324. 1 MTTYH-----EHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE- 70 (480) Q Consensus 1 ~~~~~-----~~i~~l~~~~~~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~- 70 (480) |++.+ +-++++.+++..++ .+.+.+.+|..-.- .+. .......+..++..+-+...++++++.|.+- T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~-~~~--~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL-FPK--DSDNASTDYQTPWQAVGARGLNNLASKLMLAL 77 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-cCC--CCCcccccccccccccHHHHHHHHHHHHHHhh Confidence 98854 34555666655443 34444445542210 111 1111111223556666777777777766321 Q ss_pred ----ccc-cC-CCch--------------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCC Q lcl|NC_021324. 71 ----GFR-IS-EDSE--------------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 71 ----~~~-~~-~d~~--------------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~ 124 (480) +|+ .. .|.+ ..+.+...+..++|.....++.++...+|.|.+++-.+ .. T Consensus 78 tP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~-----~~ 152 (536) T protein:vir:21 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-----EG 152 (536) T ss_pred cCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC-----CC Confidence 222 11 1100 11235556677889999999999999999988766432 12 Q ss_pred CceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecC----------------CceEEEEEEEe-----CC--cEEEEE Q lcl|NC_021324. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDD----------------VAVPDRATLYL-----PD--ETVPLR 181 (480) Q Consensus 125 ~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~-----~~--~~~~~~ 181 (480) .+...++.++-.++++.-|.. .++...+|.+..... ......+++|+ ++ .+..|. T Consensus 153 ~~~~~f~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~ 230 (536) T protein:vir:21 153 SNYNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYE 230 (536) T ss_pred CceeeEEEEEcCeEEEeeCCC--CCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEe Confidence 334456777766666555533 344444444321100 00011222221 11 111111 Q ss_pred ecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc Q lcl|NC_021324. 182 RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTT 261 (480) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~ 261 (480) ...+ .....+....+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-...........|...+. + T Consensus 231 e~~g----~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~---p 302 (536) T protein:vir:21 231 EVEG----MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIE-EYLGDLRSLENLQEAIVKMSMISSKVIGLVN---P 302 (536) T ss_pred ccCC----eeeccccCccccccCCeeeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCcccC---c Confidence 1110 011122223457889999999888889999999664 3556667777766666666666565543331 0 Q ss_pred cccccccccchhhhhcchheecCCCCceEEEec-cccHH---HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHH Q lcl|NC_021324. 262 DELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELR---NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII 337 (480) Q Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~ 337 (480) +.. ..........+|.++.-..++..+.++. .++++ ..++.++.-|.+.+...+ +.......-+++.++ T Consensus 303 ~g~--~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----l~~~~~~r~TAtEV~ 375 (536) T protein:vir:21 303 AGI--TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGERVTAEEIR 375 (536) T ss_pred ccc--cchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCccHHHHH Confidence 000 0011112233344433222333344443 23333 344444444444333321 111111112344444 Q ss_pred HHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHHHhCCCC---cccceeeeEEecCCCC-CCHHHHHHHHHHHHh---- Q lcl|NC_021324. 338 ATDSRIVKMA----ER-KGRIFGGAWERAMRIAMQIMGREV---TEEYTRLETVWRDPST-PTVAAKADAVSKLYA---- 404 (480) Q Consensus 338 ~~~~~l~~k~----~~-~~~~~~~~l~~~~~li~~~~~~~~---~~~~~~~~v~f~~~~~-~~~~~~ad~~~kl~~---- 404 (480) .+...+.... .+ ....+.+-+.+++.++.+ .+.. +.+. +++.+.-++. -...+.++.+....+ T Consensus 376 ~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r--~g~lP~~p~~~--v~~~~vs~l~~l~r~~~~~~l~~~~~~la~ 451 (536) T protein:vir:21 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA--TQQIPELPKEA--VEPTISTGLEAIGRGQDLDKLERCVTAWAA 451 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--CCCCCCCChhh--ccceEEecHHHHHHHHHHHHHHHHHHHHHh Confidence 3332222211 11 112222223333333221 1111 2222 3444433322 111122233322222 Q ss_pred cccc----CCCHHHH----HHhCCCC-------hhHHHHHHHHHHHHHHH--HHHHHhhhcccCCCcCCCCCCC----CC Q lcl|NC_021324. 405 NGQG----PIPKEQA----RIDLGYT-------ATQREQMRDWDKQETED--MIDTLYSTTKAQADATPKPTVT----ET 463 (480) Q Consensus 405 ~~~~----~~s~e~~----~~~~~~~-------~d~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----d~ 463 (480) .++. .+....+ ...+|++ +++++++.+.+.++++. ...++.... .......|+.. ++ T Consensus 452 ~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~--~~~~~~~~~~~~~~~~~ 529 (536) T protein:vir:21 452 LAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGM--AAQATASPEAMAAAADS 529 (536) T ss_pred hchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHhcChhhHHhhhhc Confidence 1111 1232222 2345652 34444443322222111 111111111 11111111111 11 Q ss_pred CccCCCC Q lcl|NC_021324. 464 KTETQTS 470 (480) Q Consensus 464 ~~~~~~~ 470 (480) .+-.++- T Consensus 530 ~g~~~~~ 536 (536) T protein:vir:21 530 VGLQPGI 536 (536) T ss_pred cccCCCC Confidence 1111111 No 136 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.75 E-value=6.7e-08 Score=59.90 Aligned_cols=450 Identities=11% Similarity=0.035 Sum_probs=194.3 Q ss_pred CCCH-HHHHHHHHHHHHHHHHH-HHHHHHHhhccCcchhcC----ccc-chhhhcceeccchHHHHHHHHHhhhc----c Q lcl|NC_021324. 1 MTTY-HEHVERLQGLLARDLPN-LLEAEAYRNGTRRLKTIG----IGA-PPELAYLDVQPGWVATYLRTLSDRLD----I 69 (480) Q Consensus 1 ~~~~-~~~i~~l~~~~~~~~~~-~~~~~~YY~G~~~i~~~~----~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~----~ 69 (480) |.+. .+.+++..+.+...+.. ....+++|+ +.+|... ... ....+.-++..+-+...|+++++.|. + T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~dst~~~a~~~Las~l~~~ltp 78 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSD--FINPRGSRFLTSDVNRDDRRNTKIVDPTGSMAQRILSSGMMSGITS 78 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhccccCCcCCCCCCcchhhcCccccchHHHHHHHHHHHHHHhhcC Confidence 8773 55566666666544432 233334442 1222221 111 11223446677778888888887773 2 Q ss_pred --Cccc---cCCCc-----h-------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEE Q lcl|NC_021324. 70 --EGFR---ISEDS-----E-------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 70 --~~~~---~~~d~-----~-------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~ 132 (480) .+|+ +.+.+ + ..+.+.+.+..++|.....++.++...+|.|-+++.. +..+.++++. T Consensus 79 p~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~------~~~~~~r~~~ 152 (556) T protein:vir:73 79 PARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVME------DDQDVIRTMP 152 (556) T ss_pred CCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeee------cCCceEEEEE Confidence 2222 22211 1 1123455667788999999999999999999887653 3355678888 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEe----------c----------CCce-EEEEEEEeCCcEEEEEecCCCc---- Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTR----------D----------DVAV-PDRATLYLPDETVPLRRNGGLN---- 187 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~----------~----------~~~~-~~~~~~~~~~~~~~~~~~~~~~---- 187 (480) ++..+++.--|.. .++...+|.+... + +.+. ...++++.. .|.+..... T Consensus 153 ~~l~~~~~~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~----V~pr~~~~~~~~~ 226 (556) T protein:vir:73 153 FPIGSYYLANSPR--GSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHC----ITPNVNRDSGKMD 226 (556) T ss_pred eecceeEEeeCCC--CCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEE----EeccccccccccC Confidence 8888887766543 3444444443211 0 0010 112222210 000000000 Q ss_pred -----cc-ccccc--c----cccccccccceEEeecccccCccCCCcc-chHHHHHHHHHHHHHHHHHHHHHHHhcchhh Q lcl|NC_021324. 188 -----DQ-WVVDG--D----VIKHGLGVVPVVPLTNDPRLGNRYGRSE-ISPELRKVTDAASRTLMNLQSASQILGTPLR 254 (480) Q Consensus 188 -----~~-~~~~~--~----~~~~~~g~iPvv~~~n~~~~~~~~G~s~-i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l 254 (480) .. ++++. + ..+-+|..+|++.++-+...++.||+|- .. +..+-+..+|.+.-......+....|.+ T Consensus 227 ~~~~p~~s~~~~~~~~~~~vl~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~-~~lgD~k~L~~l~~~~l~~~~~~~~pp~ 305 (556) T protein:vir:73 227 SKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRWEVNGEDVYASSCPGM-LALGQVKALQVEQKRKAQLIDKATNPPM 305 (556) T ss_pred cccceEEEEEEEecCCCceecccCCcccCCceeeeeeecCCcccccCccHH-HhHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 00 00110 0 1224577889998888888899999994 54 3556678888888888888888888876 Q ss_pred hhcCCCccccccccccchhhhhcchhe--ecCCCCceEEEe--ccccHHHHHHHHHHHHHHHHhhcCCC-hHHhcccccc Q lcl|NC_021324. 255 VISGVTTDELTNDGENTTLDIYYGRIL--TLASEAAKISEF--KAAELRNFAEEMEVFRKEAASITGLP-PQYLSSSSEN 329 (480) Q Consensus 255 ~i~G~~~~~~~~~~~~~~~~~~~~~~~--~~~g~~~~~~~~--~~~~~~~~~~~l~~~~~~i~~~~~~p-~~~~g~~~~~ 329 (480) .+... .....++..+|.+. ...++...+.-+ -.+++....+.++.+-..|....-.. ...++..... T Consensus 306 ~v~~~--------~~~~~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~ 377 (556) T protein:vir:73 306 VAPTS--------LKNQRVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTR 377 (556) T ss_pred ecccc--------ccccceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCC Confidence 65321 11122445555432 122322223322 12334444444444433332222111 0112221111 Q ss_pred chHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHHHhCCCCc---ccceeeeEEecCCCCCCHH-------- Q lcl|NC_021324. 330 PASAEAIIATDSRIVKM----AERK-GRIFGGAWERAMRIAMQIMGREVT---EEYTRLETVWRDPSTPTVA-------- 393 (480) Q Consensus 330 ~~Sg~Al~~~~~~l~~k----~~~~-~~~~~~~l~~~~~li~~~~~~~~~---~~~~~~~v~f~~~~~~~~~-------- 393 (480) .-+++.++.+...+... ..+. ...+.+-+.+++.++.+..-.... .....++|.|..++..... T Consensus 378 r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~ 457 (556) T protein:vir:73 378 SMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLS 457 (556) T ss_pred CccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHH Confidence 12343333332222221 1111 222333444555544431100111 1124577777555432111 Q ss_pred HHHHHHHHHHhcccc---CCCHHH----HHHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhhhc--ccCCCcC--- Q lcl|NC_021324. 394 AKADAVSKLYANGQG---PIPKEQ----ARIDLGYT------ATQREQMRDWDKQETEDMIDTLYSTT--KAQADAT--- 455 (480) Q Consensus 394 ~~ad~~~kl~~~~~~---~~s~e~----~~~~~~~~------~d~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~--- 455 (480) ..++.+..+.+.++. .+.... +...+|+. +++++++.+.+.++++.......... ++...-+ T Consensus 458 ~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~ 537 (556) T protein:vir:73 458 QTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQ 537 (556) T ss_pred HHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 111122222222221 122222 23444543 44555554333332222111111110 0000000 Q ss_pred -CCCCCCCCCccCCCCCCC Q lcl|NC_021324. 456 -PKPTVTETKTETQTSPSG 473 (480) Q Consensus 456 -~~~~~~d~~~~~~~~~~~ 473 (480) +.+.+...-...-+.|-. T Consensus 538 ~~~~~~l~~~~~~~g~~~~ 556 (556) T protein:vir:73 538 TSDPSALTAIANAAGAPQQ 556 (556) T ss_pred CCCHHHHHHHHHhhcCCCC Confidence 111100000001111111 No 137 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=98.70 E-value=8.2e-08 Score=59.43 Aligned_cols=421 Identities=12% Similarity=0.083 Sum_probs=201.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc-cCc-------- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD-IEG-------- 71 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~-~~~-------- 71 (480) .-+..++| .+|..+..+++-+..+. .||+..+-+=. -++ T Consensus 50 ~~~~~eLI-----------~~YR~ma~~pEvd~Av~---------------------eIVneaiv~d~~~~pV~i~Ld~~ 97 (537) T protein:vir:10 50 IRNDHELI-----------TRYREMVLNPECDSAVD---------------------DVVNETICGNFDDVPISIDLHNL 97 (537) T ss_pred cchHHHHH-----------HHHHHHhhccchhhHHH---------------------HhhcceeEecCCCceEEEEeccc Confidence 11111121 23333434433332221 11211110000 001 Q ss_pred -cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 72 -FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 72 -~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) ++-.-.+...+.+..|++--+|+....+..+...+.|+.|++...+.. ...+|-..++.++|..+..|.-.... . T Consensus 98 ~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k--~pk~GI~ELr~lDPr~i~~vR~i~~~--~ 173 (537) T protein:vir:10 98 KQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPK--KPRQGLVELRYVDPRKIRKVTEYEAK--R 173 (537) T ss_pred ccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC--CccccceeeeeeCCccceeeEeeccc--C Confidence 111112234455666766668999999999999999999988764321 23568888999999988766432111 1 Q ss_pred EEEEEEEEEecCC-ceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc--eEEeecc--cccCccCCCccch Q lcl|NC_021324. 151 TRAVRLYTTRDDV-AVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP--VVPLTND--PRLGNRYGRSEIS 225 (480) Q Consensus 151 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~n~--~~~~~~~G~s~i~ 225 (480) ...++...-.... .....+.+|.+...+ +. + +.-=+|| .|.|... .+.......|-|. T Consensus 174 ~~~~~~~~~~~~v~~~~~eyf~ynp~g~~-~~--~--------------~~~vkI~~dAI~y~hSGl~d~n~~~i~syLh 236 (537) T protein:vir:10 174 PEALRTQDLNQQLTQQSASYFLYNPKGLK-NS--T--------------NQGMKIAPDSIAYCHSGIQDLNKNMVLSHLH 236 (537) T ss_pred CccceEEecceeeeecccceeeecccccc-cc--C--------------CCceeccHhheeeecccceeCCCCeeeeeeh Confidence 1111111000000 000111223332211 00 0 0001222 1333331 1112233445555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------h Q lcl|NC_021324. 226 PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------R 279 (480) Q Consensus 226 ~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~ 279 (480) +.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| - T Consensus 237 kAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlED 314 (537) T protein:vir:10 237 KAIKAVNQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLED 314 (537) T ss_pred hhhHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhh Confidence 66666533 2667777777777778877776555555543322110 011111 1 Q ss_pred hee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc-cchHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 280 ILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIFG 355 (480) Q Consensus 280 ~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~~~l~~k~~~~~~~~~ 355 (480) +|. .+|.+.++.+++..+.=.-++-++-....++..-++|.+-+...+. +..-|..|--.......-+.+.+..|. T Consensus 315 yWLPRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs 394 (537) T protein:vir:10 315 FWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFS 394 (537) T ss_pred hcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHH Confidence 111 1344567788887653345666677777888888999888865432 222233455566677777888888888 Q ss_pred HHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhccccCCCHHHHHHh-CCCCh Q lcl|NC_021324. 356 GAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQGPIPKEQARID-LGYTA 423 (480) Q Consensus 356 ~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~~~s~e~~~~~-~~~~~ 423 (480) .-+.++++.=+-+.|.--..+| ..+.+.|..-........++. +.++..-..-.+|.++++.. |..++ T Consensus 395 ~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tD 474 (537) T protein:vir:10 395 ELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTE 474 (537) T ss_pred HHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCH Confidence 8888888754444443333333 346677754433333333332 22222221224588888755 79999 Q ss_pred hHHHHHHHHHHHHHHHHHH----HHhhhcccC------CCcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 424 TQREQMRDWDKQETEDMID----TLYSTTKAQ------ADATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 424 d~~~~~~~~~~~~~~~~~~----~~~~~~~~~------~~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) ++++++.+.++++..+..- +......+. +...++|+.++..++.+..|.++.= T Consensus 475 eeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 475 SEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred HHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCCC Confidence 9998887666665543110 000001000 0111122222222222222222222 No 138 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.69 E-value=1.1e-07 Score=58.75 Aligned_cols=439 Identities=13% Similarity=0.081 Sum_probs=179.5 Q ss_pred CCCHH-----HHHHHHHHHHHHHH----HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC- Q lcl|NC_021324. 1 MTTYH-----EHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE- 70 (480) Q Consensus 1 ~~~~~-----~~i~~l~~~~~~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~- 70 (480) |++.+ +-++++.+++..++ .+.+.+.+|..-.- .+.-+ .....+..++..+-+...++++++.|.+- T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~-~~~~~--~~~~~~~~~~~dst~~~a~~~Laa~l~~~l 77 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL-FPKDS--DNASTDYQTPWQAVGARGLNNLASKLMLAL 77 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-cCCCC--CcccccccccccccHHHHHHHHHHHHHhhh Confidence 88854 34555556555443 34444555542211 11111 11111223455666777777777766321 Q ss_pred ----ccc-cC-CCch--------------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCC Q lcl|NC_021324. 71 ----GFR-IS-EDSE--------------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 71 ----~~~-~~-~d~~--------------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~ 124 (480) +|+ .. .|.+ ..+.+...+..++|.....++.++...+|.|.+++-.+ .. T Consensus 78 tP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~-----~~ 152 (536) T protein:vir:10 78 FPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEP-----EG 152 (536) T ss_pred cCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeC-----CC Confidence 222 11 1100 11234556677889999999999999999988766432 12 Q ss_pred CceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEec-------C---------CceEEEEEEEe---C----CcEEEEE Q lcl|NC_021324. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD-------D---------VAVPDRATLYL---P----DETVPLR 181 (480) Q Consensus 125 ~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~-------~---------~~~~~~~~~~~---~----~~~~~~~ 181 (480) .+...++.++-.++++.-|.. .++...+|.+...- . ......+++|+ + +.+..|. T Consensus 153 ~~~~~~~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~ 230 (536) T protein:vir:10 153 SNYNPMKLYRLSSYVVQRDAF--GNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYE 230 (536) T ss_pred CceeeEEEEEcCeEEEeeCCC--CCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEE Confidence 334456777766666555533 34444444432110 0 00011222222 1 1111111 Q ss_pred ecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc Q lcl|NC_021324. 182 RNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTT 261 (480) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~ 261 (480) ...+ .....+....+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-...........|...+. + T Consensus 231 e~~g----~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~---p 302 (536) T protein:vir:10 231 EVEG----MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIE-EYLGDLRSLENLQEAIVKMSMISSKVIGLVN---P 302 (536) T ss_pred eecC----ccccccccccccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCcccC---c Confidence 1111 011122223467789999998888889999999664 3556667777766666666666565543331 0 Q ss_pred cccccccccchhhhhcchheecCCCCceEEEec-cccHH---HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHH Q lcl|NC_021324. 262 DELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELR---NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII 337 (480) Q Consensus 262 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~ 337 (480) +.. ..........+|.++.-..++..+.++. .++++ ..++.++.-|.+.+...+ +.......-+++.++ T Consensus 303 ~g~--~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----l~~~~~~r~TAtEV~ 375 (536) T protein:vir:10 303 AGI--TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNS-----AVQRTGERVTAEEIR 375 (536) T ss_pred ccc--cchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhhh-----cccCCCCCccHHHHH Confidence 000 0011112233344433222333344443 23333 334444444444333321 111111112344444 Q ss_pred HHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHHHhCCCC---cccceeeeEEecCCCC-CCHHHHHHHHHHHHh---- Q lcl|NC_021324. 338 ATDSRIVKMA----ER-KGRIFGGAWERAMRIAMQIMGREV---TEEYTRLETVWRDPST-PTVAAKADAVSKLYA---- 404 (480) Q Consensus 338 ~~~~~l~~k~----~~-~~~~~~~~l~~~~~li~~~~~~~~---~~~~~~~~v~f~~~~~-~~~~~~ad~~~kl~~---- 404 (480) .+...+.... .+ ....+.+-+.+++.++.+ .+.. +.+. +++.+.-++. -...+.++.+....+ T Consensus 376 ~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r--~g~lP~~p~~~--v~~~~vs~l~~l~r~~~~~~l~~~~~~la~ 451 (536) T protein:vir:10 376 YVASELEDTLGGVYSILSQELQLPLVRVLLKQLQA--TQQIPELPKEA--VEPTISTGLEAIGRGQDLDKLERCVTAWAA 451 (536) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--CCCCCCCChhh--ccceEEecHHHHHHHHHHHHHHHHHHHHHh Confidence 3333222211 11 112222223333333221 1211 2222 3444433322 111122223322222 Q ss_pred cccc----CCCHHHH----HHhCCCC-------hhHHHHHHHHHHHHHHH--HHHHHhhhcccCCCcCCCCCCC----CC Q lcl|NC_021324. 405 NGQG----PIPKEQA----RIDLGYT-------ATQREQMRDWDKQETED--MIDTLYSTTKAQADATPKPTVT----ET 463 (480) Q Consensus 405 ~~~~----~~s~e~~----~~~~~~~-------~d~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----d~ 463 (480) .++. .+....+ ...+|++ +++++++.+.+.++++. ...++... .....+..|+.. ++ T Consensus 452 ~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~--~~~~~~~~~~~~~~~~~~ 529 (536) T protein:vir:10 452 LAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQG--MAAQATASPEAMAAAADS 529 (536) T ss_pred hchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHhcCchhHHhhhhc Confidence 1111 1232222 2345652 34444443322222111 11111111 111111112111 11 Q ss_pred CccCCCC Q lcl|NC_021324. 464 KTETQTS 470 (480) Q Consensus 464 ~~~~~~~ 470 (480) .+-.++- T Consensus 530 ~g~~~~~ 536 (536) T protein:vir:10 530 VGLQPGI 536 (536) T ss_pred cccCCCC Confidence 1111111 No 139 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=98.66 E-value=2.3e-08 Score=62.41 Aligned_cols=435 Identities=12% Similarity=0.073 Sum_probs=200.2 Q ss_pred CCCHHHHHHH-----------HHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc- Q lcl|NC_021324. 1 MTTYHEHVER-----------LQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD- 68 (480) Q Consensus 1 ~~~~~~~i~~-----------l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~- 68 (480) ++...++..- -++.-.....+|..+..+++-+..+. .||+..+-+=. T Consensus 27 ~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~---------------------eIVneaiv~d~~ 85 (533) T protein:vir:10 27 LDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVD---------------------DIVNETICGNFD 85 (533) T ss_pred ccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHH---------------------HhhcceeeecCC Confidence 1111111110 00000011123333333333332221 11211110000 Q ss_pred cCc---------cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceE Q lcl|NC_021324. 69 IEG---------FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMY 139 (480) Q Consensus 69 ~~~---------~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~ 139 (480) .++ ++-.-.+...+.+..|++--+|+....+..+...+.|+.|.+.-.+. ....+|-..++.+||+.+- T Consensus 86 ~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~--~~pk~GI~ELr~lDPr~i~ 163 (533) T protein:vir:10 86 DVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDP--DNPQGGLIELRYIDPRKIR 163 (533) T ss_pred CceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecC--CCccccceeeeecccccee Confidence 001 11111223445566666666899999999999999999998754321 1234688889999999887 Q ss_pred EEEeC--CCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc--eEEeeccccc Q lcl|NC_021324. 140 AELDP--RNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP--VVPLTNDPRL 215 (480) Q Consensus 140 ~v~d~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~n~~~~ 215 (480) +|.-- .......+. +......+....+-+|.+.... +. ++ + ++ +|| .|.|.+.-.. T Consensus 164 ~vr~i~~~~~~~~~~~---~~~~~v~~~~~eyf~Ynp~g~~-~~--~~---~----------~v-kI~~dAI~y~hSGl~ 223 (533) T protein:vir:10 164 KINETEQKRPEQLRGL---PLNQQLSPKSAEYFLYDPKGLK-NS--TT---Q----------GL-KIAPDSICYVHSGIM 223 (533) T ss_pred eeeeeeccCCCcccee---ecchhhhccceeeeeecccccc-cc--CC---C----------ce-ecchhheeeeeccce Confidence 76521 111111100 0000011111222334432221 10 00 0 00 122 1233321111 Q ss_pred C--ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-- Q lcl|NC_021324. 216 G--NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-- 278 (480) Q Consensus 216 ~--~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-- 278 (480) . ...-.|-|.+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| T Consensus 224 d~~~~~i~syLhkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev 301 (533) T protein:vir:10 224 DLNKNMTLSHLHKAIKAVNQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEI 301 (533) T ss_pred eCCCCceeccchHhHHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCcee Confidence 1 11112334455555422 2667777777777777777776555555543322110 011111 Q ss_pred -----------hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc-cchHHHHHHHHHHHH Q lcl|NC_021324. 279 -----------RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRI 343 (480) Q Consensus 279 -----------~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~~~l 343 (480) -+|. .+|.+.++.+++..+.=.-++-++-....++..-++|.+-+...+. +..-|..|--..... T Consensus 302 ~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF 381 (533) T protein:vir:10 302 KDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKF 381 (533) T ss_pred cccchhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHH Confidence 1111 2344567888887653345666677777888888999888865432 222233465566677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhccccCCCH Q lcl|NC_021324. 344 VKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQGPIPK 412 (480) Q Consensus 344 ~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~~~s~ 412 (480) ..-+.+.+..|..-+.++++.=+-+.|.--..+| ..+.+.|..-........++. +.++-.-..-.+|. T Consensus 382 ~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~ 461 (533) T protein:vir:10 382 QKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSV 461 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccch Confidence 7778888888888888888854444443333333 346677754433333333332 22222222224599 Q ss_pred HHHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc--CCCCCCCCCcccCC Q lcl|NC_021324. 413 EQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE--TQTSPSGFNRTKTR 480 (480) Q Consensus 413 e~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~--~~~~~~~~~~~~~~ 480 (480) ++++.. |..++++++++.+.++++..+..-.-..........+.+|+.+....+ .+.-|....+++.. T Consensus 462 dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (533) T protein:vir:10 462 EYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEGPDPSDERKAE 532 (533) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCCCCcchhhccC Confidence 998755 799999998887666666543211000000000011122222222111 11222233333444 No 140 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.64 E-value=1.6e-07 Score=57.90 Aligned_cols=429 Identities=12% Similarity=0.090 Sum_probs=173.5 Q ss_pred HHHHHHHHHHHHHH----HHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhcc------Cccc Q lcl|NC_021324. 4 YHEHVERLQGLLAR----DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI------EGFR 73 (480) Q Consensus 4 ~~~~i~~l~~~~~~----~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~------~~~~ 73 (480) -++.+++..+.+.. ...+.+.+.+|..-.- ... .......+.-++..+-+...++++++.|.+ .+|+ T Consensus 1 m~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~-~~~--~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~~WF 77 (555) T protein:vir:17 1 MKHSAQAKYMMLRADREDYLDSGRQSARLTLPYI-LTD--EGHVQGGYLPTPWQSVGSKGVNVLASKLMLSLFPVNTSFF 77 (555) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-cCC--CCCcccccccccccccHHHHHHHHHHHHHHhhcCCCCccc Confidence 23334444444433 3445555555542211 110 111111223355667777888887776631 2232 Q ss_pred -cC-CC---------chhH-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEE Q lcl|NC_021324. 74 -IS-ED---------SEGL-----------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 74 -~~-~d---------~~~~-----------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~ 131 (480) .. .| .+.. +.+...+..++|.....++.++...+|.+.+++.. ++ ++ T Consensus 78 ~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~--------~~---~~ 146 (555) T protein:vir:17 78 KLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLYQGK--------KN---LK 146 (555) T ss_pred ccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEecC--------Cc---ee Confidence 11 11 1111 12444456688999999999999999998765532 11 44 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEec-------C------------------------------CceEEEEEEEeC Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTRD-------D------------------------------VAVPDRATLYLP 174 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~-------~------------------------------~~~~~~~~~~~~ 174 (480) +++-.+.++-.|.. ..+...+|.+...- . ......+++|+. T Consensus 147 ~~pl~~y~v~~d~~--G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t~ 224 (555) T protein:vir:17 147 LYPLDRFVVSRDGE--GNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYTY 224 (555) T ss_pred EEEcCeEEEeeCCC--cCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEeec Confidence 45444444333432 34444444332110 0 000111223321 Q ss_pred ----C-cEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021324. 175 ----D-ETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQIL 249 (480) Q Consensus 175 ----~-~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~ 249 (480) + .++.|....+. ........-+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-.....++.. T Consensus 225 ~~~~~~~~~~~~e~~~~----~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~~ 299 (555) T protein:vir:17 225 VCRKDGQVKWHQECDGK----VIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVE-EFMGDLKSLEALSQAMVEGSAAS 299 (555) T ss_pred ccccCCeeEEEEecCce----eccccccccCcccCCeeeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 1 11111111110 11111224577889999998888889999999764 46677888888888888888888 Q ss_pred cchhhhhc--CCCccccccccccchhhhhc-chheecCCCCceEEEecc-ccHH---HHHHHHHHHHHHHHhhcCCChHH Q lcl|NC_021324. 250 GTPLRVIS--GVTTDELTNDGENTTLDIYY-GRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQY 322 (480) Q Consensus 250 ~~p~l~i~--G~~~~~~~~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~-~~~~---~~~~~l~~~~~~i~~~~~~p~~~ 322 (480) ..|.+.+. |.... ..+...+ +.+..-..++....++.. .+++ ..++.++.-|.+.+...+.. T Consensus 300 ~~pp~lv~~~g~~~~--------~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~~~~--- 368 (555) T protein:vir:17 300 AKVVFMVSPSATTKP--------QNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLMLQVR--- 368 (555) T ss_pred hCCceeeccccccCc--------ceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhcCCC--- Confidence 88886652 11110 0111111 222211112222233222 2233 44455554454444332211 Q ss_pred hccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHhCC-CCcccceeeeEEecCCCC Q lcl|NC_021324. 323 LSSSSENPASAEAIIATDSRIVKMAERKGRIFGGA------------WERAMRIAMQIMGR-EVTEEYTRLETVWRDPST 389 (480) Q Consensus 323 ~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~------------l~~~~~li~~~~~~-~~~~~~~~~~v~f~~~~~ 389 (480) +... -+++-++. ++..+...++.. +.+++.++.+..-. ..+.+...+.+. -++. T Consensus 369 ---d~~r-~TAtEV~~-------r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~--~~l~ 435 (555) T protein:vir:17 369 ---QSER-TTATEVQA-------TVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVV--AGLW 435 (555) T ss_pred ---Cccc-chHHHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhcccee--ehHH Confidence 1111 13333333 233333333333 33334433321100 122222223222 1111 Q ss_pred -CCHHHHHHHHHHHH----hcccc-----CCCH----HHHHHhCCC-------ChhHHHHHHHHHHHHHHHH-HHHHhhh Q lcl|NC_021324. 390 -PTVAAKADAVSKLY----ANGQG-----PIPK----EQARIDLGY-------TATQREQMRDWDKQETEDM-IDTLYST 447 (480) Q Consensus 390 -~~~~~~ad~~~kl~----~~~~~-----~~s~----e~~~~~~~~-------~~d~~~~~~~~~~~~~~~~-~~~~~~~ 447 (480) ....+.++.+...+ +.+.. .+.. ..+...+|+ ++++++++++.++++++.. .-+.... T Consensus 436 ~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~ 515 (555) T protein:vir:17 436 GVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQ 515 (555) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01111222222222 21100 1121 223455565 3444554433222221111 0001000 Q ss_pred cccC--CCc-----CCCCCCCCCCc---cCCCCCCCCCcc Q lcl|NC_021324. 448 TKAQ--ADA-----TPKPTVTETKT---ETQTSPSGFNRT 477 (480) Q Consensus 448 ~~~~--~~~-----~~~~~~~d~~~---~~~~~~~~~~~~ 477 (480) ..+. .++ ..++....+++ ..-.+|.+..++ T Consensus 516 ~~~~~~~~~~~~~~~~~~~~a~~~~~a~~~~~~~~~~~~~ 555 (555) T protein:vir:17 516 LAKTPMAEQAMQLIQQQQEGAQDAGAAESETSSAEAQAGA 555 (555) T ss_pred HHhhhhhhhHHhccccchhhhhHHHHHHhhcCCcccccCC Confidence 0000 000 00111111111 111234444443 No 141 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=98.63 E-value=1.8e-07 Score=57.62 Aligned_cols=433 Identities=13% Similarity=0.085 Sum_probs=180.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc----c--Cccc- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD----I--EGFR- 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~--~~~~- 73 (480) |+ .+...+.|-.+......+.+.+.+|..-.- ....+.......+..++..+-+...++++++.|. + .+|+ T Consensus 1 m~-~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~-~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~ 78 (522) T protein:vir:10 1 MK-ARERYNQLTTARQMFLDKAVECSELTLPYL-IDDDISSRPNHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFFK 78 (522) T ss_pred Cc-hHHHHHHHHHHhhHHHHHHHHHHHHhhhcc-cCCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCcccc Confidence 77 455666666666666667777777753211 1111111111112235666777778887777663 1 2332 Q ss_pred c--CC-------Cch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEE Q lcl|NC_021324. 74 I--SE-------DSE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) Q Consensus 74 ~--~~-------d~~----~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~ 133 (480) . .+ +.+ .. +.+...+..++|.....++.++...+|.|.+++.. ++ ++++ T Consensus 79 l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~~--------~~---~~~~ 147 (522) T protein:vir:10 79 LQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFMGK--------DG---LKTF 147 (522) T ss_pred ccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEEcC--------CC---ceEE Confidence 1 11 100 11 12344566788999999999999999998876643 21 4455 Q ss_pred cccceEEEEeCCCCCceEEEEEEEEEe-------cC-----------CceEEEEEEEeCCcEEEEEecCCCcccccc--- Q lcl|NC_021324. 134 SPLYMYAELDPRNTRRVTRAVRLYTTR-------DD-----------VAVPDRATLYLPDETVPLRRNGGLNDQWVV--- 192 (480) Q Consensus 134 ~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 192 (480) +-.+.++--|. . ..+...++.+... .. ......+++|+. .|.+.......+.. T Consensus 148 pl~~y~v~~d~-~-G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~----v~p~~~~~~~~~~~~~~ 221 (522) T protein:vir:10 148 PLTRYVINRDG-D-GNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTY----VKLDKSSGRWVWHQEAF 221 (522) T ss_pred EcceEEEeeCC-C-CCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEE----EEeeccCCceEEEEccC Confidence 55554443333 2 3444444433211 00 001112333321 01111111111110 Q ss_pred ----ccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCcccccc Q lcl|NC_021324. 193 ----DGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTN 266 (480) Q Consensus 193 ----~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~ 266 (480) ......-+|..+|++.++-+.-.++.||+|-.. +..+-+..+|.+.-......+....|.+.+. |.... T Consensus 222 ~~~~~~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~---- 296 (522) T protein:vir:10 222 DKIIPDSRSTAPKNASPWLPLRFNTVDGEDYGRGRVE-EFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKP---- 296 (522) T ss_pred CccccccccccccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccc---- Confidence 011123478889999888888889999999764 3556667777777777777788888876652 11111 Q ss_pred ccccchhhhhcchheecCCCCceEEEec-cccHHH---HHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHH Q lcl|NC_021324. 267 DGENTTLDIYYGRILTLASEAAKISEFK-AAELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSR 342 (480) Q Consensus 267 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~ 342 (480) ........+.+..-..++....++. ..+++. .++.++.-+...+...+. -+... -+++.++..... T Consensus 297 ---~~l~~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~~------~d~~r-vTAtEV~~r~~E 366 (522) T protein:vir:10 297 ---ATIAKAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMNV------RNAER-VTAEEVRLTQLE 366 (522) T ss_pred ---ccccCCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhccC------CCCCC-CCHHHHHHHHHH Confidence 0111112222222112223333433 234443 334444444433332211 01111 134444433332 Q ss_pred HHHH----HHH-HHHHHHHHHHHHHHHHHHHhCC--CCcccc-eeeeEEecCCCCCCHHHHHHHHHHHHhcc-----cc- Q lcl|NC_021324. 343 IVKM----AER-KGRIFGGAWERAMRIAMQIMGR--EVTEEY-TRLETVWRDPSTPTVAAKADAVSKLYANG-----QG- 408 (480) Q Consensus 343 l~~k----~~~-~~~~~~~~l~~~~~li~~~~~~--~~~~~~-~~~~v~f~~~~~~~~~~~ad~~~kl~~~~-----~~- 408 (480) +... ..+ ....+.+-+.+.+.++.+- |. ..+.+. ....|++..++.+ .+.++.+....+.. +. T Consensus 367 ~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~~~i~~~~~p~~ 443 (522) T protein:vir:10 367 LEQQLGGIFSLLVIEFLIPYLNRTLLVLQRS-NQIPKLPKDIVRPTIVAGVNALGR--GQDRESLTAFVGTIAQTLGPEA 443 (522) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCccccccccccchhHHHH--HHHHHHHHHHHHHHHHhhCchh Confidence 2221 111 1233333444445443321 10 111111 2233455444432 22333333222211 11 Q ss_pred ---CCCHHH----HHHhCCCChh----HHHHHHHHHHHHHHH-HHHHHhhhc-ccC----CCcCCCCCCCCCCccCCCC Q lcl|NC_021324. 409 ---PIPKEQ----ARIDLGYTAT----QREQMRDWDKQETED-MIDTLYSTT-KAQ----ADATPKPTVTETKTETQTS 470 (480) Q Consensus 409 ---~~s~e~----~~~~~~~~~d----~~~~~~~~~~~~~~~-~~~~~~~~~-~~~----~~~~~~~~~~d~~~~~~~~ 470 (480) .+.... +...+|.... ..+++.+++++.++. ...++..+. +.. ++++.++.+.++-...+.+ T Consensus 444 ~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (522) T protein:vir:10 444 LMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTKNPQLMDEEQPPMEE 522 (522) T ss_pred hhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccccHHHHHHhCCCCCC Confidence 111111 2233454311 112222222222221 122222211 111 1222222211111111111 No 142 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=98.61 E-value=1.9e-07 Score=57.40 Aligned_cols=373 Identities=12% Similarity=0.083 Sum_probs=149.8 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHHhhcc--Ccchh-----cCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCC Q lcl|NC_021324. 7 HVERLQGLLARDLPN--LLEAEAYRNGT--RRLKT-----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) Q Consensus 7 ~i~~l~~~~~~~~~~--~~~~~~YY~G~--~~i~~-----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d 77 (480) ++-.+.+.+...... -.....+.... ..+.. .+..+.+. .-+.+.=...+|+..++-+-.-++.+... T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~~v~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSAR---AALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchh---hhhcchHHHHHHHHHHHhhccCceeeccc Confidence 211222211110000 00000000000 00000 00011110 00011112334444444333334544322 Q ss_pred chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEE Q lcl|NC_021324. 78 SEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 78 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) . ....+.+-........+...+..+.+.+|.||+++-.+ .+|++ .+..++|..+.+..+.... .+. T Consensus 78 ~-~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~~~-~~~----- 144 (392) T protein:vir:74 78 K-NQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN------ANGADMKWEYLRPSQVNTYYFEYEN-GMY----- 144 (392) T ss_pred h-hhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eEE----- Confidence 1 11111111111123466777788999999999887653 34554 6778888888777654322 111 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHH Q lcl|NC_021324. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~ 236 (480) |......+.......+.+++ |+||++....+..+|.|.|.- +...++... T Consensus 145 y~~~~~~~~~~~~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~i~~-~~~~i~~~~ 194 (392) T protein:vir:74 145 YNITFDDPKIEPILQAPQSD-----------------------------LIHMKLLSIDGGKTGISPLYS-LRRESKIQR 194 (392) T ss_pred EEEEecCCccceeEEEcCcc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHH Confidence 11011111001111122222 355554433444678886532 333333333 Q ss_pred HHHHHHHHHHHHhcchhhhhc--CCCc-cccccccccchhh--hhcchheecCCCCceEEEeccc-cHHHHHHHHHHHHH Q lcl|NC_021324. 237 RTLMNLQSASQILGTPLRVIS--GVTT-DELTNDGENTTLD--IYYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRK 310 (480) Q Consensus 237 ~~~s~~~~~~~~~~~p~l~i~--G~~~-~~~~~~~~~~~~~--~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~ 310 (480) .+.....+.+...+.|-.+++ +... ++.........+. ...++++.++ ++.++.+++.. .-..+++..+.... T Consensus 195 ~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~ 273 (392) T protein:vir:74 195 ASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-DLEEFTALEIKSNVAQLLSQTDWTSK 273 (392) T ss_pred HHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecC-CCceEEEccCChhHHHHHHHHHHHHH Confidence 332223333333344444443 2111 1100000011111 0123445554 44688777643 23357888888999 Q ss_pred HHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCC Q lcl|NC_021324. 311 EAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTP 390 (480) Q Consensus 311 ~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~ 390 (480) +|+..-++|+..+|....+.++..+.+..+ ...|.-+++.+..-...... ..+++.+...... T Consensus 274 ~Ia~~fgVPp~~lg~~~~~~~~~e~~~~~~--------------~~~l~p~~~~ie~~l~~~l~---~~~~~~~~~~~~~ 336 (392) T protein:vir:74 274 QYAKVYGLPDSYIGGQGDQQSSIQQISGMY--------------ASALNRYLRPAISELEYKLS---DHISVNMRPAIDP 336 (392) T ss_pred HHHHHhCCCHHHhCCCCCcccHHHHHHHHH--------------HHHHHHHHHHHHHHHHHhcc---chhcccchhhhcC Confidence 999999999999987554443333332211 11122111111110000000 0112222222234 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 391 TVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~e~~~~~---~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) +..+.++.+.+++.+| +++...++++ .|+.++++.+.+. +.. . + ++++. T Consensus 337 d~~~~~~~~~~l~~~g--~~t~near~~~~~~g~~pne~r~~en------------l~~-~---------~----~Gd~~ 388 (392) T protein:vir:74 337 LGDNYLSTISTATRWG--ALAENQATFVLQEAGYIPKDLPAPEN------------TNK-K---------T----TGQSN 388 (392) T ss_pred CHHHHHHHHHHHHhCC--CcCHHHHHHHHHhCCCCccccchhcC------------CCC-C---------C----CCCCC Confidence 5667778888888765 5666555544 3666555432110 000 0 0 01112 Q ss_pred CCCC Q lcl|NC_021324. 468 QTSP 471 (480) Q Consensus 468 ~~~~ 471 (480) +..| T Consensus 389 ~p~p 392 (392) T protein:vir:74 389 EPVP 392 (392) T ss_pred CCCC Confidence 2222 No 143 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.60 E-value=2.1e-07 Score=57.23 Aligned_cols=428 Identities=14% Similarity=0.135 Sum_probs=180.3 Q ss_pred CCCHH----HHHHHHHHHHHHH----HHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhcc--- Q lcl|NC_021324. 1 MTTYH----EHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI--- 69 (480) Q Consensus 1 ~~~~~----~~i~~l~~~~~~~----~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~--- 69 (480) |.--. +-++++.+.+..+ ..+.+.+.+|..-.- ... .......+..++..+-+...++++++.|.+ T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~-~~~--~~~~~~~~~~~~~dst~~~a~~~Las~l~~~lt 77 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSL-FPK--ESDNSSTEYTTPWQAVGARCLNNLAAKLMLALF 77 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-cCC--CCCcccccccccccccHHHHHHHHHHHHHhhcC Confidence 54422 2244455555443 344445555542211 111 111111122345566677778877776632 Q ss_pred -C-cc-cc--C--------CCchhH-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCC Q lcl|NC_021324. 70 -E-GF-RI--S--------EDSEGL-----------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPA 125 (480) Q Consensus 70 -~-~~-~~--~--------~d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~ 125 (480) . +| +. . .+.... +.+...+..++|.....++.++...+|.|.+++..+. ++ T Consensus 78 P~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~-----~~ 152 (522) T protein:vir:94 78 PQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPE-----QG 152 (522) T ss_pred CCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccC-----CC Confidence 1 22 11 1 011111 2234456668899999999999999999988765321 22 Q ss_pred ceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe--------------cCCceEEEEEEEe-----CCcEEEEEecCCC Q lcl|NC_021324. 126 GIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR--------------DDVAVPDRATLYL-----PDETVPLRRNGGL 186 (480) Q Consensus 126 g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~-----~~~~~~~~~~~~~ 186 (480) +...++.++-.+.++--|.. .++...++.+... +.......+++|+ .+++..|....+. T Consensus 153 ~~~~~~~~pl~~y~v~~d~~--G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~~~~~~~~~~~~g~ 230 (522) T protein:vir:94 153 TYSPMRMYRLVSYVVQRDAF--GNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQDDEYLRYEEVEGI 230 (522) T ss_pred ceeeEEEEEcceEEEeeCCC--cCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEeeCCceeEEeeccCc Confidence 33457777766654444432 3444444443211 1111122334433 1222222211111 Q ss_pred ccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCcccc Q lcl|NC_021324. 187 NDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDEL 264 (480) Q Consensus 187 ~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~~ 264 (480) .........+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-......+...+|.+.+. |.... T Consensus 231 ----~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~~-- 303 (522) T protein:vir:94 231 ----EVTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCE-EYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQP-- 303 (522) T ss_pred ----eecccCCCCccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhCCceeecccccccc-- Confidence 11111122357889999888888889999999764 4667788889888888888888888886653 11110 Q ss_pred ccccccchhhhhcchheecCCCCceEEEecc-ccHHH---HHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHH Q lcl|NC_021324. 265 TNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATD 340 (480) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~ 340 (480) .......++.+..-..++..+.++.. ++++. .++.++.-|...+.... ++......-+++.++.+ T Consensus 304 -----~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~TAtEV~~r- 372 (522) T protein:vir:94 304 -----RRLNKAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNS-----AVQRNAERVTAEEIRYV- 372 (522) T ss_pred -----hheeccCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----hccCCCccccHHHHHHH- Confidence 01112233333332223334444332 33433 34444444444433321 21111111233333332 Q ss_pred HHHHHHHHHHHHHHHHHH------------HHHHHHHHHHhCC--CCcccceeeeEEecCCCCCCH-HHHHHHHHHHHh- Q lcl|NC_021324. 341 SRIVKMAERKGRIFGGAW------------ERAMRIAMQIMGR--EVTEEYTRLETVWRDPSTPTV-AAKADAVSKLYA- 404 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l------------~~~~~li~~~~~~--~~~~~~~~~~v~f~~~~~~~~-~~~ad~~~kl~~- 404 (480) +..+...++..+ .+++.++.+. |. ..+. ..+++.+.-++..-- ...++.+....+ T Consensus 373 ------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~-g~lP~~p~--~~v~v~~~s~La~~qr~~~~~~l~~~~~~ 443 (522) T protein:vir:94 373 ------AGELEATLGGVYSVQSQELQLPIVRVLMNQLQSA-GMIPDLPK--EAVEPTVSTGLEALGRGQDLEKLTQAVNM 443 (522) T ss_pred ------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCCCCc--ccEEeeEecHHHHHHHHHHHHHHHHHHHH Confidence 233333333333 3333332211 11 1122 236666654443211 112222222222 Q ss_pred ---cccc----CCCH----HHHHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 405 ---NGQG----PIPK----EQARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 405 ---~~~~----~~s~----e~~~~~~~~~-------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) .++. .+.. ..+...+|+. +++++.+.+.+. .+..+.+......++..+....+. .+ T Consensus 444 ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~--~~~~~~~~~~~~~~~~~a~~~~~~----~~ 517 (522) T protein:vir:94 444 MTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQS--SQQAVVQGASAAGANMGAAVGQGA----GE 517 (522) T ss_pred HHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhhhhccc----ch Confidence 1111 1121 2233445652 222332222111 111111111111111111000000 00 Q ss_pred CCCCCCCCCcc Q lcl|NC_021324. 467 TQTSPSGFNRT 477 (480) Q Consensus 467 ~~~~~~~~~~~ 477 (480) ..++ + T Consensus 518 ~~~~------~ 522 (522) T protein:vir:94 518 DMAQ------A 522 (522) T ss_pred hhhc------C Confidence 0000 0 No 144 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.59 E-value=2.3e-07 Score=57.01 Aligned_cols=402 Identities=13% Similarity=0.073 Sum_probs=157.8 Q ss_pred HHHHHHHHHHH-----------H-HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---C Q lcl|NC_021324. 11 LQGLLARDLPN-----------L-LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---S 75 (480) Q Consensus 11 l~~~~~~~~~~-----------~-~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~ 75 (480) |.+-+...... + ..+....++--.....+..+.+.. -.++ .=...+|+..++-+-.-++.+ . T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~-al~~--~~V~~~v~~Ia~~iA~lp~~~~~~~ 77 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEA-VLSF--HAVFACISLISQDIAKMRLRLMQTD 77 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHH-hhcc--HHHHHHHHHHHHhhccCceEEEEec Confidence 22222110000 0 000000000000000111111110 0111 001122333322222223321 1 Q ss_pred CCc--h--hHHHHHHHHHh-cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCC Q lcl|NC_021324. 76 EDS--E--GLEELWNWWQA-ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRN 146 (480) Q Consensus 76 ~d~--~--~~~~l~~~~~~-n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~ 146 (480) .+. + ....+..++.+ |. ...+...+..+.+.+|.||+++-.+. .|++ .+..++|..+.+++++.. T Consensus 78 ~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~------~G~~~~L~~i~~~~v~v~~~~~g 151 (454) T protein:vir:93 78 AQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNA------RGQIKELRILDWNRVEPLVADDG 151 (454) T ss_pred cCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECC------CCcEEEEEEEcCcceEEEEcCCC Confidence 111 1 11223444433 32 34667778889999999999886532 4554 677889998888775432 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) .+.+ .+......+... ...+..+. |+||+......+.+|.|.+. T Consensus 152 --~~~y---~~~~~~~~~~~~-~~~~~~~e-----------------------------ViH~k~~~~~~~~~G~sp~~- 195 (454) T protein:vir:93 152 --EVFY---RITPDRNCGITE-AVTVPARE-----------------------------VIHDRFNCFFHPLIGLPPVY- 195 (454) T ss_pred --cEEE---EEEeccccccce-eEEecCcc-----------------------------eEEeccCCCCCCceeccHHH- Confidence 2111 111111111000 11122222 45555443445567777653 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcC-CCccccccccccchhhh-----hcchheecCCCCceEEEecccc Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILG---TPLRVISG-VTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAAE 297 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i~G-~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~ 297 (480) .+.+.+.....-......+|. .|-.+++- ..+.+...+.-...|+. ..++++.++ ++.++.++.... T Consensus 196 ---~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~-~g~~~~~l~~~~ 271 (454) T protein:vir:93 196 ---AAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPGSITEENAKKLKSNWDSGYTGENAGKTAILS-NGAKYNPTTFSP 271 (454) T ss_pred ---HHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcccccCCceecc-CCceEEEcccCh Confidence 233333333222222223333 34444431 11111111111111211 123455554 346777665432 Q ss_pred -HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHH-HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021324. 298 -LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA-EAI--IATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 298 -~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg-~Al--~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 373 (480) -..|++..+....+|+..-++|+..+|....+.-|. ... .+....|.--+...+ ..+.+ .+.. T Consensus 272 ~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~ie----~~ln~------~L~~--- 338 (454) T protein:vir:93 272 VDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIESIE----LLLDE------ALET--- 338 (454) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHHHH----HHHHH------hhcC--- Confidence 224778888888999999999999998543221121 111 111111111111111 11111 1111 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HHHHHHHHHHHHHHHHHHhhhc---- Q lcl|NC_021324. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQMRDWDKQETEDMIDTLYSTT---- 448 (480) Q Consensus 374 ~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~---- 448 (480) .....+++.+......|..++++.+.+++++| +++...+++++|+.+-+- +++- + ...--..+.+.... T Consensus 339 -~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~ggD~~~-~--~~~~~~~~~~~~~~~~~~ 412 (454) T protein:vir:93 339 -GENESTEFDVTTLLRMDSERRMKTLGDAVKNT--LLTPNEARKRENLPPLAGGDALY-L--QQQNYSLEALSRRDARED 412 (454) T ss_pred -CCCcEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCeee-e--ccCccchHhhhccCcccC Confidence 11123555556666788899999999998876 667777777877753210 0000 0 00000011111100 Q ss_pred ----ccCCCcCCC-CCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 449 ----KAQADATPK-PTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 449 ----~~~~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) .+.+.+.++ +...|++...........++..| T Consensus 413 ~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~~ 449 (454) T protein:vir:93 413 PFASSGKTASVPQAVAASDGNKAITETEHDAVKAMFR 449 (454) T ss_pred CCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhhh Confidence 000111100 01111111111222333344444 No 145 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=98.59 E-value=3.2e-08 Score=61.67 Aligned_cols=441 Identities=15% Similarity=0.093 Sum_probs=170.8 Q ss_pred CCCHHHHHHH-HHH---------HHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC Q lcl|NC_021324. 1 MTTYHEHVER-LQG---------LLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~~i~~-l~~---------~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |||-+--++. .+. .......+-..-.++|.++.-|+ +.--+..++++.-..++.+.+|+.++..+.+- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~ 78 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQIPDHRIQSHNVGVN--PPYNPDRLAAFLELNETLATGIRKKSRYEVGF 78 (651) T ss_pred CCCccceeeeeEEEeecccccccccccccccccchhhhcccCCCCC--CCCCHHHHHHHHhcChHHHHHHHHHhhhhhcc Confidence 7775422111 000 00000011112235565554332 22234455566556789999999999999888 Q ss_pred cccc------CCC---chhHHHHHHHHHh---------------cCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCc Q lcl|NC_021324. 71 GFRI------SED---SEGLEELWNWWQA---------------NDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG 126 (480) Q Consensus 71 ~~~~------~~d---~~~~~~l~~~~~~---------------n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g 126 (480) ||.+ ..+ ++..+.++++|.. ..+......+..+...+|.+|+-|..+.. | T Consensus 79 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~------g 152 (651) T protein:vir:99 79 GFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIE------G 152 (651) T ss_pred CceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCc------c Confidence 8753 111 1122344555432 12334555666677778887776654332 1 Q ss_pred ee-EEEEEcccceEEEEeC-------------CCCCceEE-------------EEEEEEEecCCceEEEEE--------- Q lcl|NC_021324. 127 IP-LIRVESPLYMYAELDP-------------RNTRRVTR-------------AVRLYTTRDDVAVPDRAT--------- 170 (480) Q Consensus 127 ~~-~~~~~~p~~~~~v~d~-------------~~~~~~~~-------------~~~~~~~~~~~~~~~~~~--------- 170 (480) +| .+..+++..+-..-+. ..+..... ....+...+..+...... T Consensus 153 ~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~ 232 (651) T protein:vir:99 153 RPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTI 232 (651) T ss_pred chhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeE Confidence 11 1222222211100000 00000000 000000001111000000 Q ss_pred EEeCCcEEEEEecC-CCccccccccccccccccccc---eEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 171 LYLPDETVPLRRNG-GLNDQWVVDGDVIKHGLGVVP---VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSAS 246 (480) Q Consensus 171 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~ 246 (480) .+..+....+.... ......+.... .++...+| |+||++.....+.+|.|.+. .+.+.+....+-..... T Consensus 233 ~~~~d~~~~~~~~~~~~~~g~~~~~~--~~~~~~~~~~eViHir~~~~~~g~~G~spl~----~a~~~i~~a~~a~~~~~ 306 (651) T protein:vir:99 233 RYREDEESEREPIFVDRETGDVTTGD--ANGLENRPANELIFIPNPSILEDDYGVPDWV----SAIRTISADEAAKDYNR 306 (651) T ss_pred EeccCcceeeeeecccceeeeEEEcC--CCceeEecccceEEecCCCCCCCcccccHHH----HHHHHHHHHHHHHHHHH Confidence 01111100000000 00000000000 01111233 57887765567789998764 34444443333333333 Q ss_pred HHhc---chhhhhc--CCCccccccccccchhh---hhcchheecCC----------CCceEEEecccc--HHHHHHHHH Q lcl|NC_021324. 247 QILG---TPLRVIS--GVTTDELTNDGENTTLD---IYYGRILTLAS----------EAAKISEFKAAE--LRNFAEEME 306 (480) Q Consensus 247 ~~~~---~p~l~i~--G~~~~~~~~~~~~~~~~---~~~~~~~~~~g----------~~~~~~~~~~~~--~~~~~~~l~ 306 (480) .+|. .|-.+|. |..+.+...+.-...|+ -..++++.+++ .+.++..++... -..|++..+ T Consensus 307 ~~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~ 386 (651) T protein:vir:99 307 DFFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFRE 386 (651) T ss_pred HHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHH Confidence 3333 3554543 32222111111011111 12234444432 245676665432 234788888 Q ss_pred HHHHHHHhhcCCChHHhccccc-cchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021324. 307 VFRKEAASITGLPPQYLSSSSE-NPASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~ 383 (480) ....+|+..-++|+..+|.... +-++..... +....|.-.+ . .|++.+. ..+...........+.+. T Consensus 387 ~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P~~----~----~ie~eln--~kLl~~~e~~~~~~i~~e 456 (651) T protein:vir:99 387 KNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQPEQ----H----TFAEWLY--QIIHQQALGVTDWTIEYE 456 (651) T ss_pred HHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHHHH----H----HHHHHHH--HhhcCccccccCceEEEE Confidence 8899999999999999986432 212221111 1111111111 1 1111111 011111111112235555 Q ss_pred ec--CCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 384 WR--DPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 384 f~--~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) |. .....+....++.+.+++++| +++...+++.+|+.+-+-+... ..+..+.....++ + ..++ T Consensus 457 f~~~~llr~D~~~~~e~~~~~i~~G--~~T~NE~R~~lglppi~~~~gd--------~~l~~~~~~~~g~--~---~~gg 521 (651) T protein:vir:99 457 LRGADQPKQEAQLAEQRVRAMRLAG--VGLVDEAREELGLDPLGEPYGE--------MTLSEFEAEVAGD--V---AGGG 521 (651) T ss_pred eccchhhhccHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCcccc--------ccccccccccccc--c---ccCC Confidence 53 344467778888888888776 6677778888876431100000 0000000000000 0 0001 Q ss_pred CCCccCCCCC-C--------CCCcccCC Q lcl|NC_021324. 462 ETKTETQTSP-S--------GFNRTKTR 480 (480) Q Consensus 462 d~~~~~~~~~-~--------~~~~~~~~ 480 (480) +..++.+.++ + .....-++ T Consensus 522 e~~~~~~~~~~~~~~~~e~~~~~~~~~~ 549 (651) T protein:vir:99 522 ETEAVHEPPEENKIGEREWDTVKSELTT 549 (651) T ss_pred CCcccccCccccccccchhhhhhhhhcc Confidence 1111100000 0 00000011 No 146 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.58 E-value=2.4e-07 Score=56.89 Aligned_cols=432 Identities=11% Similarity=0.049 Sum_probs=186.8 Q ss_pred CCCHHH-HHHHHHHHHH---HHH-HHHHHHHHHhhccCcchhcCc--------ccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTYHE-HVERLQGLLA---RDL-PNLLEAEAYRNGTRRLKTIGI--------GAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~~~-~i~~l~~~~~---~~~-~~~~~~~~YY~G~~~i~~~~~--------~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |....+ +.++|.+.|. ..+ +.....+++|+ +.+|.++. ..+...++-++..+-+...++++++.| T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAs~l 78 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVID--YLMPRLDKFGQLPRPDSEKGRERSQKMFDSTAPLALRNFVAAM 78 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHH--HhccccccccccCCCCCCcccccccccccchHHHHHHHHHHHH Confidence 887754 4444444332 222 22333344542 22332211 111112233566677788888887776 Q ss_pred cc------Cccc-cC-CCchh------HHHH-------HHHH--HhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCC Q lcl|NC_021324. 68 DI------EGFR-IS-EDSEG------LEEL-------WNWW--QANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 68 ~~------~~~~-~~-~d~~~------~~~l-------~~~~--~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~ 124 (480) .+ .+|+ .. .++.. ..-| ..++ ...+|.....++.++...+|.|-+++.+ +. T Consensus 79 ~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~------~~ 152 (549) T protein:vir:10 79 DSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEH------DV 152 (549) T ss_pred HhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEee------cC Confidence 32 1232 11 11111 1112 2222 2467888899999999999999887753 33 Q ss_pred CceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe-------cC-------------CceEEEEEEEeC----Cc---- Q lcl|NC_021324. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-------DD-------------VAVPDRATLYLP----DE---- 176 (480) Q Consensus 125 ~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~~-------------~~~~~~~~~~~~----~~---- 176 (480) .+.++++.++-.++++.-|.. ..+...+|.+... .. .+....+++|+. .. T Consensus 153 ~~~~~f~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~V~pr~~~~~~ 230 (549) T protein:vir:10 153 GKGIVYRNVPMQRLWFAENNS--GLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHAVEPRADRDPR 230 (549) T ss_pred CCeeEEEEEEcCeEEEeeCCC--CCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEEeecCCCCCcc Confidence 456788888888877766643 3444444432110 00 011123344321 00 Q ss_pred ----------EEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 177 ----------TVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSAS 246 (480) Q Consensus 177 ----------~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~ 246 (480) .+++.. ++ ... ..+.+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-.....+ T Consensus 231 ~~~~~~~pf~sv~~e~-~~----~~i---l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~ 301 (549) T protein:vir:10 231 KLDGRNMQFASYWLDE-GR----DRI---VQNSGFRTFPFAIGRFYVGTDDVYGGSPAY-DAMPDVRMANDMAKTNIRGA 301 (549) T ss_pred ccccccCceEEEEEEe-cC----CEe---eccCCcccCCcceeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHH Confidence 011111 00 000 123467788998888888889999999764 46677788888888888888 Q ss_pred HHhcchhhhhcCCCccccccccccchhhhhcchhe---ecCCCCceEEEecc-ccHHH---HHHHHHHHHHHHHhhcCCC Q lcl|NC_021324. 247 QILGTPLRVISGVTTDELTNDGENTTLDIYYGRIL---TLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLP 319 (480) Q Consensus 247 ~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~---~~~g~~~~~~~~~~-~~~~~---~~~~l~~~~~~i~~~~~~p 319 (480) +...+|.+.+.- +.....+...+|.+. ...+++..+..+.. .+++. .++.++.-|...+...-+. T Consensus 302 ~~~~~p~~~v~~--------~g~~~~~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~ 373 (549) T protein:vir:10 302 QKLVDPPLLANE--------DGVLDGFDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQ 373 (549) T ss_pred HHHhcCceeecc--------ccccccceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhh Confidence 888888776531 001111222333221 11233344555532 23333 3444444444433322111 Q ss_pred hHHhccccccchHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHHHhCCCCc-------ccceeeeEEecCC Q lcl|NC_021324. 320 PQYLSSSSENPASAEAIIATDSRIVKM----AERK-GRIFGGAWERAMRIAMQIMGREVT-------EEYTRLETVWRDP 387 (480) Q Consensus 320 ~~~~g~~~~~~~Sg~Al~~~~~~l~~k----~~~~-~~~~~~~l~~~~~li~~~~~~~~~-------~~~~~~~v~f~~~ 387 (480) . . .+... -+++.++.....+... ..+. ...+.+-+.+.+.++.+. +..+ .....++|+|..+ T Consensus 374 ~--~-~~~~~-~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~--g~lP~~p~~l~~~~~~~~i~yis~ 447 (549) T protein:vir:10 374 I--L-VDSGD-MTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEA--GQLPDMPQELIDAGADVDVEYDSP 447 (549) T ss_pred h--h-cCCCC-ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCCCChhhhcCCceeEEEeecH Confidence 1 1 12222 2344444333222222 1121 223333444555544431 1111 1234577777544 Q ss_pred CCCCHH-HHH-------HHHHHHHhcccc---CCCHHH----HHHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021324. 388 STPTVA-AKA-------DAVSKLYANGQG---PIPKEQ----ARIDLGYT------ATQREQMRDWDKQETEDMIDTLYS 446 (480) Q Consensus 388 ~~~~~~-~~a-------d~~~kl~~~~~~---~~s~e~----~~~~~~~~------~d~~~~~~~~~~~~~~~~~~~~~~ 446 (480) +.+... ..+ +.+..+.+.++. .+.... +...+|.. +++++++.+..+++++.... ... T Consensus 448 La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~-~~~ 526 (549) T protein:vir:10 448 LNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQM-LAA 526 (549) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHH-HHH Confidence 432111 111 112222222221 122222 23444543 44444443322222221111 111 Q ss_pred hcccCCCcCC---CCCCCCCCccC Q lcl|NC_021324. 447 TTKAQADATP---KPTVTETKTET 467 (480) Q Consensus 447 ~~~~~~~~~~---~~~~~d~~~~~ 467 (480) + ...++.+. +.....+.... T Consensus 527 a-~~a~~~a~~~~~~~ta~~~~~~ 549 (549) T protein:vir:10 527 A-PVAAGAIKDLSDAQTAAQTARV 549 (549) T ss_pred H-HHHHHHHHhhhhhcCCCcccCC Confidence 0 00000000 00001111111 No 147 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=98.58 E-value=8.4e-08 Score=59.36 Aligned_cols=448 Identities=10% Similarity=0.072 Sum_probs=204.1 Q ss_pred CCCHHHHHHH-H----HHH--------HHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTYHEHVER-L----QGL--------LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~~~~i~~-l----~~~--------~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) +.+....+.. . ... -..-..+|+.+..+++-+ +-...||+..+-+- T Consensus 25 ~~~~~~~i~~g~~g~~v~~~g~~~~~n~~eLI~~YR~ma~~pEVd---------------------~Av~eIVneaIv~d 83 (564) T protein:vir:10 25 DEASVSTVAGGYFGTYVDTSGGQNSRNEYELIRRYRDMSLHPEVD---------------------SAIDEIVNEFVVND 83 (564) T ss_pred cCCChhhhhccccceeeecccccchhhHHHHHHHHHHHhhccchh---------------------hHHHHhhcceeEec Confidence 3333222210 0 000 000111222222222211 11122222211000 Q ss_pred -ccCcccc---------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccc Q lcl|NC_021324. 68 -DIEGFRI---------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLY 137 (480) Q Consensus 68 -~~~~~~~---------~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~ 137 (480) ..+++.+ .-.++..+.+..|++--+|+....+..+...+.|+.|++.-.+. ....+|-..++.+||.. T Consensus 84 ~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~--~~pk~GI~eLr~lDPr~ 161 (564) T protein:vir:10 84 GDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDL--DNPKKGILELRYIDSLK 161 (564) T ss_pred CCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeC--CChhhhhhhhhhhcccc Confidence 0011111 11122445566666666899999999999999999998854321 12346777889999998 Q ss_pred eEEEEeCCCCC--ceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccccc-ccccce--EEeecc Q lcl|NC_021324. 138 MYAELDPRNTR--RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHG-LGVVPV--VPLTND 212 (480) Q Consensus 138 ~~~v~d~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~iPv--v~~~n~ 212 (480) +-.||....+. ......+-+......+....+.+|.+... . +..+ ........+.+ -=+||. |.|.+. T Consensus 162 i~~vr~i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~---~--g~~~--~~~~~~~~~~~~~ikI~~daI~y~hS 234 (564) T protein:vir:10 162 IRKVRQKLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGF---A--GNIP--MVTGSMDWSNQEGIKIASDAIAQSTS 234 (564) T ss_pred eeeeeeeccccccccceeeeeeeeeccccccccceeeccccc---c--Cccc--ccccccccccccceeechhhcceecc Confidence 88877332211 11111111111111111111222222110 0 0000 00000000000 002222 112221 Q ss_pred cccC--ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhc Q lcl|NC_021324. 213 PRLG--NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYY 277 (480) Q Consensus 213 ~~~~--~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~ 277 (480) -... ...-.|-|.+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- ..... T Consensus 235 GL~d~~~~~i~gyLhkAIKp~NQ--LkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~T 312 (564) T protein:vir:10 235 GLMDLNKKMTLSFLHKAIKSLNQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQT 312 (564) T ss_pred cceeCCCCceeccchhhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccC Confidence 1111 11112234444554422 2667777777777777777776555555543322110 01111 Q ss_pred c-------------hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc--ccchHHHHHHHH Q lcl|NC_021324. 278 G-------------RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS--ENPASAEAIIAT 339 (480) Q Consensus 278 ~-------------~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~--~~~~Sg~Al~~~ 339 (480) | -+|. .+|.+.++.+++..+.=.-++-++-....++..-++|.+-+.... .+..-+..|--. T Consensus 313 Gevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRD 392 (564) T protein:vir:10 313 GEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRD 392 (564) T ss_pred ceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHH Confidence 1 1111 234456778888664334555666677788888899988776542 122123345555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhcccc Q lcl|NC_021324. 340 DSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQG 408 (480) Q Consensus 340 ~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~ 408 (480) +.....-+.+.+..|..-+.++++.=+-+.|.--..+| ..+.+.|..-....+...++. +.++-.-..- T Consensus 393 EiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGk 472 (564) T protein:vir:10 393 ELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGK 472 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 66677778888888888888888854444443333333 346677754433333333332 2222222222 Q ss_pred CCCHHHHHHh-CCCChhHHHHHHHHHHHHHHHHHHH------HhhhcccCC------------CcCCCCCCCCCCccCCC Q lcl|NC_021324. 409 PIPKEQARID-LGYTATQREQMRDWDKQETEDMIDT------LYSTTKAQA------------DATPKPTVTETKTETQT 469 (480) Q Consensus 409 ~~s~e~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~------~~~~~~~~~------------~~~~~~~~~d~~~~~~~ 469 (480) .+|.++++.. |..++++++++.+.++++..+..-. .......++ +..++++.....+.+++ T Consensus 473 y~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~ 552 (564) T protein:vir:10 473 YFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKP 552 (564) T ss_pred ccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccCCCC Confidence 4599998755 7999999988776666654431100 000001111 11112222222334455 Q ss_pred CCCCCCcccCC Q lcl|NC_021324. 470 SPSGFNRTKTR 480 (480) Q Consensus 470 ~~~~~~~~~~~ 480 (480) +|...++.++| T Consensus 553 ~~~~~~~~~~~ 563 (564) T protein:vir:10 553 PPSQQSKSQSN 563 (564) T ss_pred CCCCCCcCcCC Confidence 56666777777 No 148 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.56 E-value=2.8e-07 Score=56.48 Aligned_cols=397 Identities=10% Similarity=0.036 Sum_probs=165.0 Q ss_pred HHHHHHHHHHHHH--HH-HH-------HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 5 HEHVERLQGLLAR--DL-PN-------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 5 ~~~i~~l~~~~~~--~~-~~-------~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-+++++...+.- +. +. ...+..+.-+... +..+... .-..+.-...+|+..++-+-.-+|.+ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKGK---NALKVATVFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC----ccccchh---hhhccHHHHHHHHHHHHhhccCceEE Confidence 3344444332210 00 00 0011111100000 0001000 00011112223333333222223332 Q ss_pred ---CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEE Q lcl|NC_021324. 75 ---SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAEL 142 (480) Q Consensus 75 ---~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~ 142 (480) .++ +.....+..++.. | ....+...+..+.+.+|.+|+.+..+ ..|++ .+..++|..+.++. T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~ 147 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYI 147 (432) T ss_pred EEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEE Confidence 111 1122335555532 2 34567788889999999999998653 34554 67788888887776 Q ss_pred eCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCc Q lcl|NC_021324. 143 DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s 222 (480) |+... +....+.|+.....+.. ..+.+++ |+||++.....+.+|.| T Consensus 148 d~~~~--~~~~~~~~y~~~~~g~~---~~~~~~e-----------------------------iih~r~~~~~~~~~G~s 193 (432) T protein:vir:10 148 DDVGL--LNSKTKMWYVVNTGGQQ---RVLKPEE-----------------------------ILHFKNGITLDGLVGVP 193 (432) T ss_pred cCccc--ccccceEEEEEecCCeE---EEEcccc-----------------------------EEEecCCCCCCCccccc Confidence 54321 11111111111111110 0111111 46666544445677887 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecc Q lcl|NC_021324. 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKA 295 (480) Q Consensus 223 ~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~ 295 (480) .+.. +...++....+.......+...+.|-.+++-- .+.+...+.-...|+. ..++++.++ ++.++.+++. T Consensus 194 ~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~ 271 (432) T protein:vir:10 194 TMEY-LKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPISL 271 (432) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecC-CCceEEEccC Confidence 6532 33333333222222233333333455554421 1111111111111211 123455555 3467777764 Q ss_pred cc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHh Q lcl|NC_021324. 296 AE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIM 369 (480) Q Consensus 296 ~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~ 369 (480) +. -..+++..+....+|+..-++|+..+|....+.-|. +......+ +...|+-++..+. .+. T Consensus 272 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e~~~~~~----------~~~~l~P~~~~ie~~ln~kLl 339 (432) T protein:vir:10 272 NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQQQQF----------YTDTLQATLTMYEQEMTYKLF 339 (432) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHHHHH----------HHHHHHHHHHHHHHHHHHhhc Confidence 32 224677778888999999999999997533221111 11111111 1112222222211 111 Q ss_pred CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_021324. 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 370 ~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) ..........+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+ -..+..-.....+++.+.. T Consensus 340 ~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~--ggD~~~~~~n~~~~~~~~~--- 412 (432) T protein:vir:10 340 LDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEA--GGDRLLVNGNMLPIDMAGQ--- 412 (432) T ss_pred ChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CCCeEeecccccchhhccc--- Confidence 1111111123444555666788899999999998875 67777778888875432 0000000000001111000 Q ss_pred cCCCcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 450 AQADATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 450 ~~~~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) ....+++.+.+..++.+.+| T Consensus 413 ------~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 413 ------AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ------cccCCCCCCCCCCCCCCCCC Confidence 11112222222222222222 No 149 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.56 E-value=2.8e-07 Score=56.48 Aligned_cols=397 Identities=10% Similarity=0.036 Sum_probs=165.0 Q ss_pred HHHHHHHHHHHHH--HH-HH-------HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 5 HEHVERLQGLLAR--DL-PN-------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 5 ~~~i~~l~~~~~~--~~-~~-------~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-+++++...+.- +. +. ...+..+.-+... +..+... .-..+.-...+|+..++-+-.-+|.+ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKGK---NALKVATVFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC----ccccchh---hhhccHHHHHHHHHHHHhhccCceEE Confidence 3344444332210 00 00 0011111100000 0001000 00011112223333333222223332 Q ss_pred ---CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEE Q lcl|NC_021324. 75 ---SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAEL 142 (480) Q Consensus 75 ---~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~ 142 (480) .++ +.....+..++.. | ....+...+..+.+.+|.+|+.+..+ ..|++ .+..++|..+.++. T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~ 147 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYI 147 (432) T ss_pred EEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEE Confidence 111 1122335555532 2 34567788889999999999998653 34554 67788888887776 Q ss_pred eCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCc Q lcl|NC_021324. 143 DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s 222 (480) |+... +....+.|+.....+.. ..+.+++ |+||++.....+.+|.| T Consensus 148 d~~~~--~~~~~~~~y~~~~~g~~---~~~~~~e-----------------------------iih~r~~~~~~~~~G~s 193 (432) T protein:vir:10 148 DDVGL--LNSKTKMWYVVNTGGQQ---RVLKPEE-----------------------------ILHFKNGITLDGLVGVP 193 (432) T ss_pred cCccc--ccccceEEEEEecCCeE---EEEcccc-----------------------------EEEecCCCCCCCccccc Confidence 54321 11111111111111110 0111111 46666544445677887 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecc Q lcl|NC_021324. 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKA 295 (480) Q Consensus 223 ~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~ 295 (480) .+.. +...++....+.......+...+.|-.+++-- .+.+...+.-...|+. ..++++.++ ++.++.+++. T Consensus 194 ~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~ 271 (432) T protein:vir:10 194 TMEY-LKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPISL 271 (432) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecC-CCceEEEccC Confidence 6532 33333333222222233333333455554421 1111111111111211 123455555 3467777764 Q ss_pred cc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHh Q lcl|NC_021324. 296 AE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIM 369 (480) Q Consensus 296 ~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~ 369 (480) +. -..+++..+....+|+..-++|+..+|....+.-|. +......+ +...|+-++..+. .+. T Consensus 272 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e~~~~~~----------~~~~l~P~~~~ie~~ln~kLl 339 (432) T protein:vir:10 272 NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQQQQF----------YTDTLQATLTMYEQEMTYKLF 339 (432) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHHHHH----------HHHHHHHHHHHHHHHHHHhhc Confidence 32 224677778888999999999999997533221111 11111111 1112222222211 111 Q ss_pred CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_021324. 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 370 ~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) ..........+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+ -..+..-.....+++.+.. T Consensus 340 ~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~--ggD~~~~~~n~~~~~~~~~--- 412 (432) T protein:vir:10 340 LDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEA--GGDRLLVNGNMLPIDMAGQ--- 412 (432) T ss_pred ChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CCCeEeecccccchhhccc--- Confidence 1111111123444555666788899999999998875 67777778888875432 0000000000001111000 Q ss_pred cCCCcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 450 AQADATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 450 ~~~~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) ....+++.+.+..++.+.+| T Consensus 413 ------~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 413 ------AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ------cccCCCCCCCCCCCCCCCCC Confidence 11112222222222222222 No 150 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.56 E-value=2.8e-07 Score=56.48 Aligned_cols=397 Identities=10% Similarity=0.036 Sum_probs=165.0 Q ss_pred HHHHHHHHHHHHH--HH-HH-------HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 5 HEHVERLQGLLAR--DL-PN-------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 5 ~~~i~~l~~~~~~--~~-~~-------~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .-+++++...+.- +. +. ...+..+.-+... +..+... .-..+.-...+|+..++-+-.-+|.+ T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~ 73 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKGK---NALKVATVFACIKILSESVSKLPLKI 73 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC----ccccchh---hhhccHHHHHHHHHHHHhhccCceEE Confidence 3344444332210 00 00 0011111100000 0001000 00011112223333333222223332 Q ss_pred ---CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEE Q lcl|NC_021324. 75 ---SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAEL 142 (480) Q Consensus 75 ---~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~ 142 (480) .++ +.....+..++.. | ....+...+..+.+.+|.+|+.+..+ ..|++ .+..++|..+.++. T Consensus 74 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~i~~~~v~v~~ 147 (432) T protein:vir:10 74 YQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFD------RKGKVQALWPIDASKVTVYI 147 (432) T ss_pred EEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEE Confidence 111 1122335555532 2 34567788889999999999998653 34554 67788888887776 Q ss_pred eCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCc Q lcl|NC_021324. 143 DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRS 222 (480) Q Consensus 143 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s 222 (480) |+... +....+.|+.....+.. ..+.+++ |+||++.....+.+|.| T Consensus 148 d~~~~--~~~~~~~~y~~~~~g~~---~~~~~~e-----------------------------iih~r~~~~~~~~~G~s 193 (432) T protein:vir:10 148 DDVGL--LNSKTKMWYVVNTGGQQ---RVLKPEE-----------------------------ILHFKNGITLDGLVGVP 193 (432) T ss_pred cCccc--ccccceEEEEEecCCeE---EEEcccc-----------------------------EEEecCCCCCCCccccc Confidence 54321 11111111111111110 0111111 46666544445677887 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecc Q lcl|NC_021324. 223 EISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKA 295 (480) Q Consensus 223 ~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~ 295 (480) .+.. +...++....+.......+...+.|-.+++-- .+.+...+.-...|+. ..++++.++ ++.++.+++. T Consensus 194 ~~~~-~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~ 271 (432) T protein:vir:10 194 TMEY-LKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESMSSGLQNSHRIALMP-VGYQFQPISL 271 (432) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcccccCCcceecC-CCceEEEccC Confidence 6532 33333333222222233333333455554421 1111111111111211 123455555 3467777764 Q ss_pred cc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHh Q lcl|NC_021324. 296 AE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIM 369 (480) Q Consensus 296 ~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~ 369 (480) +. -..+++..+....+|+..-++|+..+|....+.-|. +......+ +...|+-++..+. .+. T Consensus 272 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~--~e~~~~~~----------~~~~l~P~~~~ie~~ln~kLl 339 (432) T protein:vir:10 272 NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQQQQQF----------YTDTLQATLTMYEQEMTYKLF 339 (432) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHHHHH----------HHHHHHHHHHHHHHHHHHhhc Confidence 32 224677778888999999999999997533221111 11111111 1112222222211 111 Q ss_pred CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_021324. 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 370 ~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) ..........+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+ -..+..-.....+++.+.. T Consensus 340 ~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~--ggD~~~~~~n~~~~~~~~~--- 412 (432) T protein:vir:10 340 LDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGG--FLKPNEARSKEDLPPEA--GGDRLLVNGNMLPIDMAGQ--- 412 (432) T ss_pred ChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CCCeEeecccccchhhccc--- Confidence 1111111123444555666788899999999998875 67777778888875432 0000000000001111000 Q ss_pred cCCCcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 450 AQADATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 450 ~~~~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) ....+++.+.+..++.+.+| T Consensus 413 ------~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 413 ------AYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred ------cccCCCCCCCCCCCCCCCCC Confidence 11112222222222222222 No 151 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.55 E-value=3.1e-07 Score=56.30 Aligned_cols=419 Identities=11% Similarity=0.062 Sum_probs=152.1 Q ss_pred CCCHHHHHHHHHHH-HH---HH------HHHHHHHHHHhhccCcch--------hcCc--ccchhhh---cc----e--e Q lcl|NC_021324. 1 MTTYHEHVERLQGL-LA---RD------LPNLLEAEAYRNGTRRLK--------TIGI--GAPPELA---YL----D--V 51 (480) Q Consensus 1 ~~~~~~~i~~l~~~-~~---~~------~~~~~~~~~YY~G~~~i~--------~~~~--~~~~~~~---~~----~--~ 51 (480) -++.+|.-+-+... |. .. ..+-+.+.+.-+++.... .... ...+..+ ++ + . T Consensus 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 92 (574) T protein:vir:80 13 KSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFG 92 (574) T ss_pred hhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhc Confidence 22233322211111 10 00 011122334433332110 0000 0000000 00 1 0 Q ss_pred ccchHHHHHHHHHhhhc-----------cCcccc---CCC-------chhHHHHHHHHHh-----c----CHHHHHHHHH Q lcl|NC_021324. 52 QPGWVATYLRTLSDRLD-----------IEGFRI---SED-------SEGLEELWNWWQA-----N----DLDEESVLGH 101 (480) Q Consensus 52 ~~n~~~~iVd~~~~~l~-----------~~~~~~---~~d-------~~~~~~l~~~~~~-----n----~~~~~~~~~~ 101 (480) .......+|++.++.+. +-||.+ ..+ ......|.+++.. | .+..+...+. T Consensus 93 ~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv 172 (574) T protein:vir:80 93 NNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLV 172 (574) T ss_pred cChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHH Confidence 11223344443322111 123322 111 1112235555532 1 2345677788 Q ss_pred HHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEE Q lcl|NC_021324. 102 DDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPL 180 (480) Q Consensus 102 ~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (480) .+.+.+|.+|+.+-.+ ..|+| .+..++|..+.++.+..... .....++|...++ +. ...|.++.+ T Consensus 173 ~~lll~Gnayi~i~r~------~~G~~~~L~pl~p~~V~v~~d~~~~~-~~~~~~y~~~~~g-~~---~~~~~~~ei--- 238 (574) T protein:vir:80 173 RATYMYDQVNFEKVFD------KDGNFIKFDTVDPTTIFLATNGEGKL-IKNGERFVQVIDN-RI---VAKFNEREL--- 238 (574) T ss_pred HHHHhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEcCcccc-ccCceEEEEEeCC-ce---EEEEccccE--- Confidence 8899999999877543 34554 57788999888876543210 0001112111111 10 111222222 Q ss_pred EecCCCccccccccccccccccccceEEeecccc---cCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc Q lcl|NC_021324. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR---LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS 257 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~---~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~ 257 (480) +||+.++. ..+.+|.|.|.- +...++....+.....+.+...+.|-.+|. T Consensus 239 --------------------------ih~~~~~~~~~~~~~~G~spi~~-a~~~i~~~~~a~~~~~~~f~ng~~p~gil~ 291 (574) T protein:vir:80 239 --------------------------AFAVRNPRADIEVGQYGYPELEI-ALKQFIAHENTEVFNDRFFSHGGTTRGILH 291 (574) T ss_pred --------------------------EEEeccCCCCcccccccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEE Confidence 34433222 224578876632 333333322222222232333334554442 Q ss_pred --CCC-ccccccccccchhhh------hcchheecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhcccc Q lcl|NC_021324. 258 --GVT-TDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSS 327 (480) Q Consensus 258 --G~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~ 327 (480) +.. +.+...+.-...|.- ..+++..+.+++.++..+.... -..|++..+....+|+..-++|+..+|... T Consensus 292 ~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~ 371 (574) T protein:vir:80 292 VKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPN 371 (574) T ss_pred eCCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccc Confidence 211 111110111111211 1223334445667888776432 224788888899999999999999998643 Q ss_pred ccchHH--------HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHH Q lcl|NC_021324. 328 ENPASA--------EAIIATDSRIVKM-AERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADA 398 (480) Q Consensus 328 ~~~~Sg--------~Al~~~~~~l~~k-~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~ 398 (480) .+...| ..+.......... ..=....+...|.+ .++ . .. ...+.+.|......+..+.+ . T Consensus 372 ~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~---~Ll---~-~~---~~~~~~~f~~~d~~~~~~~~-~ 440 (574) T protein:vir:80 372 NGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNT---YIV---A-EF---GEKYQFQFRGGDLSAQLDKL-K 440 (574) T ss_pred cccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHh---hhh---h-hc---CCceEEEecccchhhHHHHH-H Confidence 221111 0111111111110 00011111111111 011 1 00 12355677655444433333 2 Q ss_pred HHHHHhccccCCCHHHHHHhCCCChhH----H------HHHHHHHH------HHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 399 VSKLYANGQGPIPKEQARIDLGYTATQ----R------EQMRDWDK------QETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 399 ~~kl~~~~~~~~s~e~~~~~~~~~~d~----~------~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) +.+++.+ ++++..-+++.+|+.+-+ . ..+..... +..++..+.... ...+.++ .+ T Consensus 441 ~~~~~~~--G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~-~~ 511 (574) T protein:vir:80 441 IIEQEGK--VFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLE------LSGGDVE-QP 511 (574) T ss_pred HHHHHhC--CccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhcccccccc------ccCCCCC-CC Confidence 3344444 477777788777664221 0 00000000 000000000000 0000000 00 Q ss_pred CCccCCCCCCCCCcccCC Q lcl|NC_021324. 463 TKTETQTSPSGFNRTKTR 480 (480) Q Consensus 463 ~~~~~~~~~~~~~~~~~~ 480 (480) +.+++..+.+++++...+ T Consensus 512 ~~~~p~~~~~d~~~~~~~ 529 (574) T protein:vir:80 512 EPEEPKDSQNDTDVSFQD 529 (574) T ss_pred CCCCCCCccccccchhhh Confidence 001111111111111111 No 152 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.54 E-value=3.2e-07 Score=56.21 Aligned_cols=420 Identities=10% Similarity=0.089 Sum_probs=154.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---------------------cchhcCcccchhhhcceeccchHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---------------------RLKTIGIGAPPELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~---------------------~i~~~~~~~~~~~~~~~~~~n~~~~i 59 (480) =+++..-+++.-.. -..+..+..--++++ ....-+...+...+.+ ........+ T Consensus 26 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~-~~n~i~~~~ 100 (563) T protein:vir:99 26 DEGLQANIKKIEQD----NKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKF-GNNPILNAI 100 (563) T ss_pred cCChhhhHhhhhcc----chhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHh-hcchHHHHH Confidence 12233333321111 001111111111111 1100001111111111 112344555 Q ss_pred HHHHHhhhcc-----------Ccccc-----CC---Cc--hhHHHHHHHHH-----hc----CHHHHHHHHHHHHhhcCe Q lcl|NC_021324. 60 LRTLSDRLDI-----------EGFRI-----SE---DS--EGLEELWNWWQ-----AN----DLDEESVLGHDDSLTFGR 109 (480) Q Consensus 60 Vd~~~~~l~~-----------~~~~~-----~~---d~--~~~~~l~~~~~-----~n----~~~~~~~~~~~~~~~~G~ 109 (480) |++.++.+.. -++.+ .. ++ .....+..++. .+ .+..+...+..+.+.+|. T Consensus 101 I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn 180 (563) T protein:vir:99 101 ILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ 180 (563) T ss_pred HHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCC Confidence 6555443311 11111 01 11 11122333332 11 245677788899999999 Q ss_pred eEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc Q lcl|NC_021324. 110 AYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 110 a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) ||+.+.... +..|++ .+..++|..+.++.++... ......+++...+ ++. ...|.++.+++ T Consensus 181 ~~~~~~~~r----d~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~-g~~---~~~~~~~evI~--------- 242 (563) T protein:vir:99 181 VNFEKVFNK----NNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD-KRV---VASFTSRELAM--------- 242 (563) T ss_pred eEEEEEEEe----cCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC-Cce---eEEecCcceEE--------- Confidence 988754221 234554 5778899988887765421 1111111111111 110 01112222111 Q ss_pred ccccccccccccccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHH---HHHhcchhhhhc--CC-C Q lcl|NC_021324. 189 QWVVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSA---SQILGTPLRVIS--GV-T 260 (480) Q Consensus 189 ~~~~~~~~~~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~---~~~~~~p~l~i~--G~-~ 260 (480) +++|. ....+.+|.|.|. .+.+.+...+.-.... +...+.|-.+|. |. . T Consensus 243 -------------------~~~~~~~d~~~~~~G~Spi~----~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ 299 (563) T protein:vir:99 243 -------------------GIRNPRTELSSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQ 299 (563) T ss_pred -------------------EeccCCCCcccCcccchHHH----HHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCC Confidence 11111 1123567887654 3334443333332222 333334554442 32 1 Q ss_pred ccccccccccchhhh------hcchh-eecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccch- Q lcl|NC_021324. 261 TDELTNDGENTTLDI------YYGRI-LTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA- 331 (480) Q Consensus 261 ~~~~~~~~~~~~~~~------~~~~~-~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~- 331 (480) +.+...+.-...|+. ..+++ +.+ +++.+|..+.... -..|++..+....+|+..-++|+..+|....+.. T Consensus 300 ls~e~~~~~~~~~~~~~~G~~nagk~~~vl-~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~ 378 (563) T protein:vir:99 300 QSQHALENFKREWKSSLSGINGSWQIPVVM-ADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGAT 378 (563) T ss_pred CCHHHHHHHHHHHHHHhccccccccceEEc-CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccccccc Confidence 111111111112221 11233 333 4557888776432 2247888899999999999999999985321111 Q ss_pred -----HH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCCHHHHHHHHHHH Q lcl|NC_021324. 332 -----SA---EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKL 402 (480) Q Consensus 332 -----Sg---~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl 402 (480) |+ ..+..... ..+...|.-++..+....... .......+.+.|.+....+..+.. ++.++ T Consensus 379 ~~~~~ss~~~sn~e~~~~----------~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~r~D~~~~~e~~-~~~~~ 447 (563) T protein:vir:99 379 GSKGGSTLNEADPGKKQQ----------QSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFVGGDTKSATDKL-NILKL 447 (563) T ss_pred ccccccchhhccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhhchhcccccEEEeccCCHHHHHHHH-HHHHH Confidence 10 01111111 111112221111111100000 000112355667655443333322 23344 Q ss_pred HhccccCCCHHHHHHhCCCChhHH-HH------------HHHHH---HHHHHHHHHHHhhhcccCCCcCCCCC-----CC Q lcl|NC_021324. 403 YANGQGPIPKEQARIDLGYTATQR-EQ------------MRDWD---KQETEDMIDTLYSTTKAQADATPKPT-----VT 461 (480) Q Consensus 403 ~~~~~~~~s~e~~~~~~~~~~d~~-~~------------~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 461 (480) +.+ ++++...+++.+|+.+-+- .. ..... .+..++..+.+........+..++.+ .+ T Consensus 448 ~~~--G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (563) T protein:vir:99 448 ETQ--IFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSND 525 (563) T ss_pred hcC--CccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCC Confidence 444 4677777777777642210 00 00000 00111111111111111111111111 11 Q ss_pred CCCccCCCCCCCCCcccCC Q lcl|NC_021324. 462 ETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 462 d~~~~~~~~~~~~~~~~~~ 480 (480) +...+.++.+.+.++..+. T Consensus 526 ~~~~~~~~~~~~~~~~~~~ 544 (563) T protein:vir:99 526 DKEIGTDAQIKGDDNVYRT 544 (563) T ss_pred ccccccccccccccccccc Confidence 1112222333333332222 No 153 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.54 E-value=3.2e-07 Score=56.21 Aligned_cols=420 Identities=10% Similarity=0.089 Sum_probs=154.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---------------------cchhcCcccchhhhcceeccchHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---------------------RLKTIGIGAPPELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~---------------------~i~~~~~~~~~~~~~~~~~~n~~~~i 59 (480) =+++..-+++.-.. -..+..+..--++++ ....-+...+...+.+ ........+ T Consensus 26 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~-~~n~i~~~~ 100 (563) T protein:vir:95 26 DEGLQANIKKIEQD----NKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKF-GNNPILNAI 100 (563) T ss_pred cCChhhhHhhhhcc----chhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHh-hcchHHHHH Confidence 12233333321111 001111111111111 1100001111111111 112344555 Q ss_pred HHHHHhhhcc-----------Ccccc-----CC---Cc--hhHHHHHHHHH-----hc----CHHHHHHHHHHHHhhcCe Q lcl|NC_021324. 60 LRTLSDRLDI-----------EGFRI-----SE---DS--EGLEELWNWWQ-----AN----DLDEESVLGHDDSLTFGR 109 (480) Q Consensus 60 Vd~~~~~l~~-----------~~~~~-----~~---d~--~~~~~l~~~~~-----~n----~~~~~~~~~~~~~~~~G~ 109 (480) |++.++.+.. -++.+ .. ++ .....+..++. .+ .+..+...+..+.+.+|. T Consensus 101 I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~Gn 180 (563) T protein:vir:95 101 ILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYDQ 180 (563) T ss_pred HHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcCC Confidence 6555443311 11111 01 11 11122333332 11 245677788899999999 Q ss_pred eEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcc Q lcl|NC_021324. 110 AYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 110 a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) ||+.+.... +..|++ .+..++|..+.++.++... ......+++...+ ++. ...|.++.+++ T Consensus 181 ~~~~~~~~r----d~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~-g~~---~~~~~~~evI~--------- 242 (563) T protein:vir:95 181 VNFEKVFNK----NNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVD-KRV---VASFTSRELAM--------- 242 (563) T ss_pred eEEEEEEEe----cCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeC-Cce---eEEecCcceEE--------- Confidence 988754221 234554 5778899988887765421 1111111111111 110 01112222111 Q ss_pred ccccccccccccccccceEEeecc--cccCccCCCccchHHHHHHHHHHHHHHHHHHHH---HHHhcchhhhhc--CC-C Q lcl|NC_021324. 189 QWVVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRKVTDAASRTLMNLQSA---SQILGTPLRVIS--GV-T 260 (480) Q Consensus 189 ~~~~~~~~~~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~---~~~~~~p~l~i~--G~-~ 260 (480) +++|. ....+.+|.|.|. .+.+.+...+.-.... +...+.|-.+|. |. . T Consensus 243 -------------------~~~~~~~d~~~~~~G~Spi~----~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ 299 (563) T protein:vir:95 243 -------------------GIRNPRTELSSSGYGLSEVE----IAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQ 299 (563) T ss_pred -------------------EeccCCCCcccCcccchHHH----HHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCC Confidence 11111 1123567887654 3334443333332222 333334554442 32 1 Q ss_pred ccccccccccchhhh------hcchh-eecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccch- Q lcl|NC_021324. 261 TDELTNDGENTTLDI------YYGRI-LTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA- 331 (480) Q Consensus 261 ~~~~~~~~~~~~~~~------~~~~~-~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~- 331 (480) +.+...+.-...|+. ..+++ +.+ +++.+|..+.... -..|++..+....+|+..-++|+..+|....+.. T Consensus 300 ls~e~~~~~~~~~~~~~~G~~nagk~~~vl-~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~ 378 (563) T protein:vir:95 300 QSQHALENFKREWKSSLSGINGSWQIPVVM-ADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGAT 378 (563) T ss_pred CCHHHHHHHHHHHHHHhccccccccceEEc-CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccccccc Confidence 111111111112221 11233 333 4557888776432 2247888899999999999999999985321111 Q ss_pred -----HH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCCHHHHHHHHHHH Q lcl|NC_021324. 332 -----SA---EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKL 402 (480) Q Consensus 332 -----Sg---~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl 402 (480) |+ ..+..... ..+...|.-++..+....... .......+.+.|.+....+..+.. ++.++ T Consensus 379 ~~~~~ss~~~sn~e~~~~----------~f~~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~f~r~D~~~~~e~~-~~~~~ 447 (563) T protein:vir:95 379 GSKGGSTLNEADPGKKQQ----------QSQNKGLQPLLRFIEDLVNRHIISEYGDKYTFQFVGGDTKSATDKL-NILKL 447 (563) T ss_pred ccccccchhhccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhhchhcccccEEEeccCCHHHHHHHH-HHHHH Confidence 10 01111111 111112221111111100000 000112355667655443333322 23344 Q ss_pred HhccccCCCHHHHHHhCCCChhHH-HH------------HHHHH---HHHHHHHHHHHhhhcccCCCcCCCCC-----CC Q lcl|NC_021324. 403 YANGQGPIPKEQARIDLGYTATQR-EQ------------MRDWD---KQETEDMIDTLYSTTKAQADATPKPT-----VT 461 (480) Q Consensus 403 ~~~~~~~~s~e~~~~~~~~~~d~~-~~------------~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 461 (480) +.+ ++++...+++.+|+.+-+- .. ..... .+..++..+.+........+..++.+ .+ T Consensus 448 ~~~--G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (563) T protein:vir:95 448 ETQ--IFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSND 525 (563) T ss_pred hcC--CccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCC Confidence 444 4677777777777642210 00 00000 00111111111111111111111111 11 Q ss_pred CCCccCCCCCCCCCcccCC Q lcl|NC_021324. 462 ETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 462 d~~~~~~~~~~~~~~~~~~ 480 (480) +...+.++.+.+.++..+. T Consensus 526 ~~~~~~~~~~~~~~~~~~~ 544 (563) T protein:vir:95 526 DKEIGTDAQIKGDDNVYRT 544 (563) T ss_pred ccccccccccccccccccc Confidence 1112222333333332222 No 154 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.54 E-value=3.4e-07 Score=56.07 Aligned_cols=421 Identities=13% Similarity=0.120 Sum_probs=156.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..+++.+.+. ....+.-.......||+- ++ -...+.+..-...+...+|+..++.+...++.+..++ T Consensus 11 ~~~~~~i~~~---~~~s~~~~~~~~~~~~~p--p~------~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~~~~~~-- 77 (542) T protein:vir:41 11 LEKYKAIKRE---EVESQALGETRFEEYVEP--KV------NPLVLLSLLQVNPYHASACSIKANDIIRTGYILEGDD-- 77 (542) T ss_pred cccchhhhhc---cccccccccccCCccccC--CC------CHHHHHHHHhhcHHHHHHHHHHHHHHhhCceeeeccc-- Confidence 1111111111 000000000111122210 01 0112222222234567778877777767777764332 Q ss_pred HHHHHHHHHhc--CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQAN--DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 81 ~~~l~~~~~~n--~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ...+..++-.. ....+...+..+.+.+|.||+.+..+. .|++ .+..++|..+.+..|... +...+ T Consensus 78 ~~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~------~G~~~~L~~l~~~~v~v~~d~~~------~~~~~ 145 (542) T protein:vir:41 78 EGVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDD------RGDPIRFEYIPSHTIRVHKDGSR------YRQTW 145 (542) T ss_pred chhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcC------CCcEEEEEEEcCcceEEEEcCCe------eEeee Confidence 23344443221 345667788889999999999886533 4554 577888887776554321 11111 Q ss_pred EEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHH Q lcl|NC_021324. 158 TTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASR 237 (480) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~ 237 (480) +..+ ..++..|.....+. ...+.. ...+..=-|+||++.....+.+|.|.+.. +.+.+.. T Consensus 146 ---~~~~-~~~~~~y~~~~~~~--~~~g~~----------~~~~~~~eIiHir~~~~~~~~~Glspi~~----~~~~i~~ 205 (542) T protein:vir:41 146 ---DGVN-ITHFKDYRYEGEIN--PETGED----------QDSVGANELVFIHIPSPVCSYYGVPRYVS----AAPAILA 205 (542) T ss_pred ---cCCc-ceeEEeeccccccc--cccccc----------ccccCcccEEEecCCCCCCCcccccHHHH----HHHHHHH Confidence 1111 12222222211110 000000 00111113578887666677899987643 3344433 Q ss_pred HHHHHHHHHHHhcc---hh--hhhcCCCccccccccc---------cchhhh-------hcchheecC-----CCCceEE Q lcl|NC_021324. 238 TLMNLQSASQILGT---PL--RVISGVTTDELTNDGE---------NTTLDI-------YYGRILTLA-----SEAAKIS 291 (480) Q Consensus 238 ~~s~~~~~~~~~~~---p~--l~i~G~~~~~~~~~~~---------~~~~~~-------~~~~~~~~~-----g~~~~~~ 291 (480) ...-......+|.+ |- +.+.|...++...+.. ...|+. ..++++.++ .++.++. T Consensus 206 ~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~ 285 (542) T protein:vir:41 206 MQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFT 285 (542) T ss_pred HHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCCcccceeEE Confidence 33222223333433 33 3334433222111100 001110 112333332 2345676 Q ss_pred Eeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021324. 292 EFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVK-MAERKGRIFGGAWERAMRIAMQIM 369 (480) Q Consensus 292 ~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~-k~~~~~~~~~~~l~~~~~li~~~~ 369 (480) .++.. .-..|++..+....+|+..-++|+..+|....+..++..+......... ...-....+...|.+ .. T Consensus 286 pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P~~~~ie~~ln~-------~L 358 (542) T protein:vir:41 286 PLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRPQQNIISSILTD-------FF 358 (542) T ss_pred EcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHh-------hc Confidence 66532 2234778888889999999999999998643221111111111111111 111111111122211 11 Q ss_pred CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHHH-------HHHHHH---H Q lcl|NC_021324. 370 GREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMRD-------WDKQET---E 438 (480) Q Consensus 370 ~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~-~~~~d~~~~~~~-------~~~~~~---~ 438 (480) .... .....+.|....... ...+..+.+++++| +++...+++.+ |+.+-+...+.- ....+. . T Consensus 359 ~~~~---~~~~~~~f~~~~ll~-~d~~~~~~~~v~~G--ilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~ 432 (542) T protein:vir:41 359 QVKF---NPKTRFKFNDETLLE-SDSVRNCALLVQSG--VLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNYEK 432 (542) T ss_pred cccc---CCceEEEecchhhcc-hHHHHHHHHHHhCC--CCCHHHHHHhhCCCCCCCccccccccccccccccCCcCCCC Confidence 1011 123445564322111 12233344555554 55555566644 443211000100 000000 0 Q ss_pred HHHHHHhhhcccCCCcCCCCC-----CCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 439 DMIDTLYSTTKAQADATPKPT-----VTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 439 ~~~~~~~~~~~~~~~~~~~~~-----~~d~~~~~~~~~~~~~~~~~~ 480 (480) +....+..... ..++.-++. .+..+.....+|.+-..+-+- T Consensus 433 ~~~~~~~k~~~-k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 478 (542) T protein:vir:41 433 NQIREIRKIYA-KYRPRFNEIISSKLSAEEKKKKIDESLAEFRAEAY 478 (542) T ss_pred Cchhhhhhccc-ccCccccccccccccchhhcccccchhhhhHHhHH Confidence 00000000000 000000000 000000000000000000000 No 155 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.52 E-value=3.8e-07 Score=55.81 Aligned_cols=385 Identities=11% Similarity=0.011 Sum_probs=159.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCCch-h Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSE-G 80 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~-~ 80 (480) .-++.++.......+ .......+...-......+..+... .-+.+.....+|+..++-+-.-+|.+ .++.+ . T Consensus 1 Mgl~~~~f~~~~~~~-~~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~ 76 (409) T protein:vir:84 1 MSLFTRIFSGPSEER-TLTKISGIPSPAEDWAMHGDRPGAN---SAMTLGAFYACVTLLADTVASLSIDAYRKKDNVRIP 76 (409) T ss_pred CchhhhhhcCCCccc-ccccccccccccchhhccCcccchh---hhhccHHHHHHHHHHHHhhhhCceEEEEecCCcccc Confidence 222222222111000 0000000000000000000011000 00111223344444444333333322 22211 1 Q ss_pred HHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 81 LEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 81 ~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ...+.+++.. | ....+...+..+.+.+|.+|+++... +..|.+ .+..++|..+.+......... + T Consensus 77 ~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~-----~~~g~~~~L~~l~p~~v~v~~~~~~~~~--~-- 147 (409) T protein:vir:84 77 VSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISAR-----DEANRPTAIMPIHPDCIHVTDAKDEDGD--W-- 147 (409) T ss_pred cchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEE-----CCCCceEEEEEEcCceeEEEEcCCCcce--E-- Confidence 2234444432 2 34567788888999999999876421 234554 567788887766543322211 1 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHH Q lcl|NC_021324. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~ 234 (480) ++......+. .|.++. |+||++....+..+|.|.+. .+.+. T Consensus 148 -~~~~~~~~g~-----~~~~~d-----------------------------vih~~~~~~~~~~~G~s~i~----~~~~~ 188 (409) T protein:vir:84 148 -IEPVYRIDGK-----VVPNHR-----------------------------IMHIKRYPVAGCALGMSPIE----KAASA 188 (409) T ss_pred -EEEEecCCce-----EEchhh-----------------------------EEEecCCCCCcccccccHHH----HHHHH Confidence 0000001110 011222 35565544445567887653 33344 Q ss_pred HHHHHHH---HHHHHHHhcchhhhhcCC-Cccccccccccchhh---hhcchheecCCCCceEEEecccc-HHHHHHHHH Q lcl|NC_021324. 235 ASRTLMN---LQSASQILGTPLRVISGV-TTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAE-LRNFAEEME 306 (480) Q Consensus 235 ~~~~~s~---~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~ 306 (480) +...+.- ..+.+...+.|-.+|+.- ...+...+.-...+. ...++++.++ ++.++.++..+. -..+++..+ T Consensus 189 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~~~~~~~d~q~~e~~~ 267 (409) T protein:vir:84 189 IGLGLAAERYGLRWFRDSANPSGILSSDADLTPDQVKQTQKQWIQSHHNRRLPAVMS-AGIKWQSVSITPNESQFLETRS 267 (409) T ss_pred HHHHHHHHHHHHHHHhcCCCccEEEecCCCCCHHHHHHHHHHHHHHhccCCCeeecC-CCceEEEccCChhHHHHHHHHH Confidence 4433332 222233334455555421 111111010001111 1223455554 456787776432 224677778 Q ss_pred HHHHHHHhhcCCChHHhccccccchHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATD-----SRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~-----~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) ....+|+..-++|+.-+|....++.++..++... ..|.-.+...+..+ .+ .+.. ...++ T Consensus 268 ~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l----~~-------~L~~-----g~~i~ 331 (409) T protein:vir:84 268 FQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQAL----DT-------FLPR-----GQFVK 331 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHH----HH-------hccC-----CCeEE Confidence 8899999999999998875433222222222222 22222222222221 11 1111 12355 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......|..++++++.+++.+| +++...+++.+|+.+-+ .-. .+..... ..+....... T Consensus 332 fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~p~~--ggD------------~~~~~~n--~~~~~~~~~~ 393 (409) T protein:vir:84 332 FNVDGLMRGDVTARFTAYQMGLQNG--IWSVNEVRAWEDAPPIP--EGD------------IHLQPMN--FVPLGYVPPE 393 (409) T ss_pred EechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--Ccc------------eeeeccc--ccccccCCcc Confidence 5566666778899999999998876 66766777787775432 100 0000000 0000001111 Q ss_pred CCCc-cCCCCCCCCCc Q lcl|NC_021324. 462 ETKT-ETQTSPSGFNR 476 (480) Q Consensus 462 d~~~-~~~~~~~~~~~ 476 (480) ++.. ..+.+.+++|+ T Consensus 394 ~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 394 EPAQEPQPNSATEGNK 409 (409) T ss_pred ccCcCCCCCCccCCCC Confidence 1111 11122223333 No 156 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.52 E-value=3.8e-07 Score=55.76 Aligned_cols=441 Identities=13% Similarity=0.122 Sum_probs=183.4 Q ss_pred CCCHH------HHHHHHHHHHHHH----HHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhcc- Q lcl|NC_021324. 1 MTTYH------EHVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI- 69 (480) Q Consensus 1 ~~~~~------~~i~~l~~~~~~~----~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~- 69 (480) |+.-+ +-++++.+.+... ..+.+.+.+|..-.. .+ ...........++..+-+...++++++.|.+ T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~-~~--~~~~~~~~~~~~~~dst~~~a~~~Laa~l~~~ 77 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL-FP--KESDNESTDYTTPWQAVGARGLNNLASKLMLA 77 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc-cC--CCCCcccccccccccccHHHHHHHHHHHHHHh Confidence 55432 3345555555443 344445555542211 10 0111111111234455567777777776632 Q ss_pred ----Cccc-cC-CC---------c-----------hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCC Q lcl|NC_021324. 70 ----EGFR-IS-ED---------S-----------EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD 123 (480) Q Consensus 70 ----~~~~-~~-~d---------~-----------~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d 123 (480) .+|+ .. .| . +..+.+...+..++|.....++.++...+|.|.+++.. + T Consensus 78 ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~------~ 151 (535) T protein:vir:33 78 LFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPE------P 151 (535) T ss_pred hcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeec------C Confidence 1222 11 11 0 01123445567788999999999999999999777653 2 Q ss_pred CCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEec-----CC-----------ceEEEEEEEe-----CC--cEEEE Q lcl|NC_021324. 124 PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD-----DV-----------AVPDRATLYL-----PD--ETVPL 180 (480) Q Consensus 124 ~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~-----~~-----------~~~~~~~~~~-----~~--~~~~~ 180 (480) ..+..+++.++-.++++.-|.. .++...+|.+...- .- .....+++|+ .+ .+..| T Consensus 152 ~~~~~~f~~~pl~~~~v~~d~~--G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~~ 229 (535) T protein:vir:33 152 EGSYNPMKLYRLSSYVVQRDAY--GNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLKY 229 (535) T ss_pred CCCceeeEEEEcCeeEEeeCCC--CCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEEE Confidence 3456778888776665554433 34444444432210 00 0000111221 11 01111 Q ss_pred EecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--C Q lcl|NC_021324. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G 258 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G 258 (480) ....+. .........+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-......+....|.+.+. | T Consensus 230 ~~~~~~----~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~g 304 (535) T protein:vir:33 230 EEVEDV----EIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCE-EYLGDLRSLENLQEAIVKMSMISAKVIGLVNPAG 304 (535) T ss_pred EEEeCc----cccccccccccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeecccc Confidence 111000 00111112357789999888888889999999764 4667788888888888888888888876652 1 Q ss_pred CCccccccccccchhhhhcchheecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHH Q lcl|NC_021324. 259 VTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII 337 (480) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~ 337 (480) ... .........+.+..-..++..+.++.. ++++...+.++.+...|.... +.+ .+.......-+++-++ T Consensus 305 ~~~-------~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~-~~~~~~~~r~TAtEV~ 375 (535) T protein:vir:33 305 ITQ-------PRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLN-SAVQRTGERVTAEEIR 375 (535) T ss_pred ccc-------hhhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH-hhh-hcccCCCccccHHHHH Confidence 100 001112222333332333444544432 344444444444433332221 101 1211111112333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHh--CCCCc-ccceeeeEEecCCCCCCH-HHHHHHHHHHH-- Q lcl|NC_021324. 338 ATDSRIVKMAERKGRIFGGAWERA--------MRIAMQIM--GREVT-EEYTRLETVWRDPSTPTV-AAKADAVSKLY-- 403 (480) Q Consensus 338 ~~~~~l~~k~~~~~~~~~~~l~~~--------~~li~~~~--~~~~~-~~~~~~~v~f~~~~~~~~-~~~ad~~~kl~-- 403 (480) . ++..+...++..+.++ +...+.++ .+..+ .....++++|.-++..-- ...++.+.+.+ T Consensus 376 ~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~ 448 (535) T protein:vir:33 376 Y-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISA 448 (535) T ss_pred H-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHH Confidence 3 3333334444433332 22222222 11111 112346777755543211 11222222222 Q ss_pred --hcccc----CCCHHH----HHHhCCCChhH----HHHHHHHHHHHHHH--HHHHHhhhcccC-CCcCCCCCCCCCCcc Q lcl|NC_021324. 404 --ANGQG----PIPKEQ----ARIDLGYTATQ----REQMRDWDKQETED--MIDTLYSTTKAQ-ADATPKPTVTETKTE 466 (480) Q Consensus 404 --~~~~~----~~s~e~----~~~~~~~~~d~----~~~~~~~~~~~~~~--~~~~~~~~~~~~-~~~~~~~~~~d~~~~ 466 (480) +.++. .+.... +...+|..... .+++.++.+++++. ...++....++. +.+...|+. .+ T Consensus 449 la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~----~~ 524 (535) T protein:vir:33 449 WAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEA----MQ 524 (535) T ss_pred HHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChh----HH Confidence 22111 122222 22334543221 11222222222211 111111111111 111111211 11 Q ss_pred CCCCCCCCCcc Q lcl|NC_021324. 467 TQTSPSGFNRT 477 (480) Q Consensus 467 ~~~~~~~~~~~ 477 (480) +-.+..|.+-+ T Consensus 525 ~~~~~~g~~~~ 535 (535) T protein:vir:33 525 GAAAKAGLNAT 535 (535) T ss_pred HHHHhccCCCC Confidence 11122222222 No 157 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.45 E-value=6.1e-07 Score=54.64 Aligned_cols=388 Identities=11% Similarity=0.043 Sum_probs=158.8 Q ss_pred HHHHHHHHHHHHHHH-HHHHH---------HHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc Q lcl|NC_021324. 5 HEHVERLQGLLARDL-PNLLE---------AEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI 74 (480) Q Consensus 5 ~~~i~~l~~~~~~~~-~~~~~---------~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~ 74 (480) .=+..+|..+..... .+... ...|+.+- .+. .+..+... .-+...-...+|+..++-+-.-++.+ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~-~~~~v~~~---~al~~~~v~~ci~~ia~~iA~lp~~~ 75 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKF-GIK-LNFSVRGK---RALKENTVYVCTKIRAESIGKLSLKI 75 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhc-ccc-CCcccchh---hhhccHHHHHHHHHHHHhhhhCceEE Confidence 223333333221100 00000 00111110 000 00001010 00111222233333333332233432 Q ss_pred -CC-CchhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCC Q lcl|NC_021324. 75 -SE-DSEGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRN 146 (480) Q Consensus 75 -~~-d~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~ 146 (480) .. ++.....+..++.. |. ...+...+..+.+.+|.||+.+..+. .|++ .+..++|..+.+++|+.. T Consensus 76 ~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~------~G~~~~L~~i~~~~v~~~~~~~~ 149 (422) T protein:vir:13 76 YKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDR------KGKIIGLYPINSDNVTKIIDDDN 149 (422) T ss_pred EecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECC------CCcEEEEEEECCcceEEEEcCCc Confidence 11 11111234444432 32 34678888899999999999986532 4554 677889999988886543 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) .......+ +|...+..+.. ..+.++ -|+|++.+....+.+|.|.+.. T Consensus 150 ~~~~~~~~-~y~~~~~~g~~---~~~~~~-----------------------------eiih~~~~~~~~~~~G~s~~~~ 196 (422) T protein:vir:13 150 FLSSLSKV-WYVVTDKNGKE---HKLLPD-----------------------------EMLHFIGDITLDGLIGIKPLDY 196 (422) T ss_pred ceeccceE-EEEEEeCCCeE---EEEccc-----------------------------ceEEEcCCCCCCCcccccHHHH Confidence 21100011 11111111110 011122 2345554334456778876532 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEeccc-cH Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAA-EL 298 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~-~~ 298 (480) +...++....+.....+.+...+.|-.++.-. ...+...+.-...|.. ..+.++.++ ++.++.+++.+ .- T Consensus 197 -~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d 274 (422) T protein:vir:13 197 -LRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLP-FGYQFQPISLSMAD 274 (422) T ss_pred -HHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecC-CCceeeeccCChhH Confidence 33333333333222223333333455555321 1111111111111111 123455554 44677776543 22 Q ss_pred HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCC Q lcl|NC_021324. 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREV 373 (480) Q Consensus 299 ~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~ 373 (480) ..+++..+....+|+..-++|+..+|....+.-| .+......+ +...|.-+++.+. .+..... T Consensus 275 ~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~s--n~e~~~~~f----------~~~~l~P~~~~ie~~l~~~Ll~~~~ 342 (422) T protein:vir:13 275 AQFLENSKLTKRELAATFGMKSYHLNDLERATFN--NLTEQQKDF----------YVTTLQSSLTVYEQEIQDKLFSQYE 342 (422) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHH----------HHHHHHHHHHHHHHHHHHhhCChhh Confidence 2467777888899999999999999864322111 111111111 1111222211111 1111111 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCC- Q lcl|NC_021324. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQA- 452 (480) Q Consensus 374 ~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 452 (480) ......+++.+......|..+.++.+.+++++| +++...+++++|+.+-+ -..+. .......+ T Consensus 343 ~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~--ggD~~------------~~~~n~~~l 406 (422) T protein:vir:13 343 TLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGG--FIEANEARRRENLPPVE--GGDRL------------LVNGNMIPI 406 (422) T ss_pred hcCCceEEeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CcCee------------eeccCccch Confidence 111122444444555678888999999998876 67777788888875432 00000 00000000 Q ss_pred Cc--CCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 453 DA--TPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 453 ~~--~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) +. ..+..++ ..|++ T Consensus 407 ~~~~~~~~~~g------------~~~g~ 422 (422) T protein:vir:13 407 EMAGEQYKKGG------------EKGGK 422 (422) T ss_pred hhcccccccCC------------CcCCC Confidence 00 0111111 11111 No 158 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=98.45 E-value=6.2e-07 Score=54.61 Aligned_cols=410 Identities=14% Similarity=0.087 Sum_probs=146.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHH-HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPN-LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~-~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~ 76 (480) |+ .++.+++.+....... ....-+ +-|-. +.-.+..........-......-.+|+..++-+-.-++.+ .. T Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~~-~~g~~-~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~ 75 (460) T protein:vir:10 1 MA---NRIIRALRELTGLDNKFNDAFIK-YIGQT-FTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKD 75 (460) T ss_pred Cc---hhHHHHHhhhhccCCCchHHHHH-hhccc-cCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccC Confidence 43 3445554433221111 111111 22211 1101111111111111122333444444444333333321 11 Q ss_pred Cch-------------------------------hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCcccc Q lcl|NC_021324. 77 DSE-------------------------------GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVES 121 (480) Q Consensus 77 d~~-------------------------------~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~ 121 (480) +.. .......++.+ | ....+...+..+.+.+|.||+++..+..+ T Consensus 76 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~- 154 (460) T protein:vir:10 76 TKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDG- 154 (460) T ss_pred CccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCC- Confidence 110 00111222222 2 23456677788999999999998765422 Q ss_pred CCCCceeE-EEEEcccceEEEEeCCCCCceEE-EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccc Q lcl|NC_021324. 122 GDPAGIPL-IRVESPLYMYAELDPRNTRRVTR-AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKH 199 (480) Q Consensus 122 ~d~~g~~~-~~~~~p~~~~~v~d~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (480) +..|.++ +..++|..+.+..+......... .+..|.... .+. ...+.++.+++++.... T Consensus 155 -~~~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~-~g~---~~~~~~~evih~r~~~~-------------- 215 (460) T protein:vir:10 155 -INAGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQ-GDQ---FIEFNEDEVIHTKYANP-------------- 215 (460) T ss_pred -ccCceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEec-Cce---eEEecccceEEEecCCC-------------- Confidence 3356664 67788888877665433211100 011111111 110 01223333333221100 Q ss_pred cccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-CCCccccccccccchhhh--- Q lcl|NC_021324. 200 GLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-GVTTDELTNDGENTTLDI--- 275 (480) Q Consensus 200 ~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~-G~~~~~~~~~~~~~~~~~--- 275 (480) .+.....+.+|.|.+.. +...+.....+.....+.+...+.|-.++. +..+.+...+.-...|+- T Consensus 216 ----------~~~~~~~~~~G~sp~~~-~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~ 284 (460) T protein:vir:10 216 ----------NFDLQGSHLYGMSPIRA-ILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTGLTQPQADSLKQRLTEMDK 284 (460) T ss_pred ----------CcccccCccccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCCCCHHHHHHHHHHHHHHhc Confidence 01112234677776532 333333222222222222222233333322 211111111111111211 Q ss_pred ---hcchheecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHH-HHH Q lcl|NC_021324. 276 ---YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA-ERK 350 (480) Q Consensus 276 ---~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~-~~~ 350 (480) ..++++.++ ++.++.++.... -..+++..+....+|+..-++|++.+|.......++..++.....+...+ .-. T Consensus 285 g~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~l~P~ 363 (460) T protein:vir:10 285 SPDRLSQIAGAS-GEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDNIQPD 363 (460) T ss_pred CccccCCceecC-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHHHHHH Confidence 123455555 446887776432 22467888888999999999999999853221111112222211111111 001 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHH Q lcl|NC_021324. 351 GRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMR 430 (480) Q Consensus 351 ~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~ 430 (480) ...+...|.+ .+...........+++.|... ..+.+...+..+++.+| +++...+++.+|+.+-+-+-. T Consensus 364 ~~~ie~~ln~------kl~~~~~~~~~~~i~~d~~~l--~~l~~d~~~~~~~~~~g--~~T~NE~R~~~g~~pi~~~~g- 432 (460) T protein:vir:10 364 LVILKQAFDK------KFIKRFKGYENAVIEWDISEL--PEMQTDMVAMASWLNTI--PVTPNEIRIAMKYETLNQDGM- 432 (460) T ss_pred HHHHHHHHHH------hhcCcccccCCceEEeecchh--hhHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCCCC- Confidence 1111111111 111111111112233333221 11233333444555544 556555666665542110000 Q ss_pred HHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 431 DWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) +.+.......+- +. ...+..+.+.|..| T Consensus 433 -----------D~~~~~~n~~~~--------~~-~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 433 -----------DIVFMPSNKVRI--------DD-VSNNLIDSAFNQNQ 460 (460) T ss_pred -----------Ceeeecccccch--------hh-cccccCCCcccCCC Confidence 000000000000 00 00011111222222 No 159 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.44 E-value=6.3e-07 Score=54.59 Aligned_cols=372 Identities=10% Similarity=0.005 Sum_probs=146.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccC-cc---hhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTR-RL---KTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~-~i---~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) .-|.+++..... ++-.....+..... .. -..+..+.+.. .....-...+|+..++-+-.-++.+.... . T Consensus 1 M~~f~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---al~~~~v~~~i~~ia~~ia~~p~~~~~~~-~ 73 (386) T protein:vir:49 1 MPIFNITNLATE---SPPINQESFFDIADSDFLASLNSSEWVSAEN---ALKNSDLFSIISQLSNDLATAKITTSRKQ-L 73 (386) T ss_pred CchhhhhccCCC---CcccchhhhhhhhhccccccccCCceechhh---hhccHHHHHHHHHHHHHhhhCceeeccch-h Confidence 112221111100 00000000100000 00 00011111110 01111112233333332323344432211 1 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ...+.+-........+...+..+.+.+|.||+.+-.+ ..|++ .+..++|..+.++.++... .+ ++ .|.. T Consensus 74 ~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~l~~i~~~~v~v~~~~~~~-~~-~y--~~~~ 143 (386) T protein:vir:49 74 QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------DNGRDMKWEYLRPSQVSFNRLDNQN-GL-YY--NITF 143 (386) T ss_pred hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEecCceeEEEEcCCCc-eE-EE--EEEE Confidence 1111111111134566777888999999999987653 24554 6677888888776654321 11 11 1111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) .+..+.. ...+..++ |+||++....+..+|.|.+.. +...++....+. T Consensus 144 ~~~~~~~--~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~~ 191 (386) T protein:vir:49 144 DDPHIAP--KQHVPQND-----------------------------ILHFRLLSVDGGLTSVSPLMA-LGREFNIQKASD 191 (386) T ss_pred cCccccc--eeEEcccc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHHHHH Confidence 1111100 01111122 456655444445678876532 333333333232 Q ss_pred HHHHHHHHHhcchhhhhc--CCCccccccccccch---hhhhcchheecCCCCceEEEeccc-cHHHHHHHHHHHHHHHH Q lcl|NC_021324. 240 MNLQSASQILGTPLRVIS--GVTTDELTNDGENTT---LDIYYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAA 313 (480) Q Consensus 240 s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~---~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~ 313 (480) ....+.+...+.|-.+++ |...++ ........ .....++++.++ ++.++.+++.. .-..+++..+....+|+ T Consensus 192 ~~~~~~~~ng~~~~~il~~~~~~~~~-~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 269 (386) T protein:vir:49 192 KLTISALKNALNANGILKIKGGGLLD-FKTKVSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLSQADWTTGQFA 269 (386) T ss_pred HHHHHHHHccCCccEEEEeCCCCChH-HHHHHHHHHHHhccCCCCceecC-CCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 223333333444555543 211111 00000011 111234555555 34678777643 22347788888999999 Q ss_pred hhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHH Q lcl|NC_021324. 314 SITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVA 393 (480) Q Consensus 314 ~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~ 393 (480) ..-++|+..+|+...+.+++..++..+...+.. .++.+..-+-..++. .+++......-.+.. T Consensus 270 ~~fgVPp~~lg~~~~~~~~~~~~~~~~~~~i~~----------~l~~i~~~~~~~l~~-------~~~~~~~~~~~~d~~ 332 (386) T protein:vir:49 270 KVYGIPESIVGGDGDQQSSLEMIYNIYFKSVSR----------YLRPFVSEMSKKLSC-------EVDVDISPAVDPTGS 332 (386) T ss_pred HHhCCCHHHhCCCCCccchHHHHHHHHHHHHHH----------HHHHHHHHHHHHhcc-------hhcccchhhhccCHH Confidence 999999999987655544554444333222211 111111111111111 122222223334556 Q ss_pred HHHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 394 AKADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 394 ~~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) +.+..+.+++.+| +++...+++++ |+.++++.+.. . + ......++|.+++. T Consensus 333 ~~~~~~~~l~~~g--~~t~nE~r~~l~~~~~~~~~~~~~~------------------~--~-~~~~~~gGd~~~~~ 386 (386) T protein:vir:49 333 NYISLINSMVKSG--TLAQNQGLYILQQAEILPKELPDGK------------------N--P-NRTSLKGGEINEQD 386 (386) T ss_pred HHHHHHHHHHhCC--CcCHHHHHHHHhhCCCCCCcCcchh------------------c--c-CCCCCCCCCCCCCC Confidence 6677777777665 44444444433 23222211100 0 0 00011111111111 No 160 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.44 E-value=6.5e-07 Score=54.51 Aligned_cols=390 Identities=14% Similarity=0.055 Sum_probs=162.5 Q ss_pred HHHHHHHHHHHHHH----HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCC- Q lcl|NC_021324. 6 EHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED- 77 (480) Q Consensus 6 ~~i~~l~~~~~~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d- 77 (480) =+++++..+..... .-...+..++-|..... +..+... .-+.......+|+..++-+-.-++.+ .++ T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~---~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~~ 75 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTAS--GERVSES---NSLVQPDIFACVNVLSDDIAKLPIHTYKRTDGG 75 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCccccc--Cceechh---hhhccHHHHHHHHHHHHhhhhCceEEEEecCCc Confidence 12222222211100 01122334443322111 1111110 11112223344444433332333322 111 Q ss_pred -chh-HHHHHHHHH-h-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCc Q lcl|NC_021324. 78 -SEG-LEELWNWWQ-A-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 78 -~~~-~~~l~~~~~-~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~ 149 (480) +.. ...+..++. + | ....+...+..+.+.+|.||+++..+ ..|.+ .+..++|..+.++.++... . T Consensus 76 ~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~~-~ 148 (416) T protein:vir:12 76 IERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFG------SHGYPEALFPLRPDYTNAYVHPTTG-M 148 (416) T ss_pred cccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEECCcceEEEEeCCCc-E Confidence 111 112333332 2 3 34467788888999999999988653 34555 5777889888877654432 1 Q ss_pred eEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHH Q lcl|NC_021324. 150 VTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELR 229 (480) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~ 229 (480) . +|....+++. ..+.++. |+||++.. ..+.+|.|.|.. +. T Consensus 149 -~----~~~~~~~g~~----~~~~~~e-----------------------------iih~~~~~-~~~~~G~s~i~~-~~ 188 (416) T protein:vir:12 149 -L----WYQTVLNGKA----IELYDYE-----------------------------VLHFKGLS-TDGIHGKSPIGV-VR 188 (416) T ss_pred -E----EEEEecCCeE----EEecCcc-----------------------------EEEecCcC-CCCcccccHHHH-HH Confidence 1 1111111110 0112222 34444332 344678776532 33 Q ss_pred HHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh--hcchheecCCCCceEEEecccc-HHHHHHHH Q lcl|NC_021324. 230 KVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI--YYGRILTLASEAAKISEFKAAE-LRNFAEEM 305 (480) Q Consensus 230 ~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l 305 (480) ..++....+.....+.+...+.|-.++.-. ...+...+.-...|+- ..+.++.+++ +.++.+++... -..+++.. T Consensus 189 ~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e~~~~~~~~~~~~~~~~~~~vl~~-g~~~~~l~~~~~d~q~~e~~ 267 (416) T protein:vir:12 189 EHIGAQAAATKYNAKLYKNEATPRGILKVPAFLDEKPKENVRKEWKRVNKVENIAIIDY-GLEYQSISMPLQEAQFVESM 267 (416) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhcCCCeeecCC-CceEEEccCChhhHHHHHHH Confidence 333333333333333334444455555421 1111111111111211 2344555553 45787776432 22467778 Q ss_pred HHHHHHHHhhcCCChHHhccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCCCccccee Q lcl|NC_021324. 306 EVFRKEAASITGLPPQYLSSSSENP-ASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGREVTEEYTR 379 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~~~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~~~~~~~~~ 379 (480) +....+|+..-++|+..+|....+. ++....... .+...|.-++..+.. +........... T Consensus 268 ~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~-------------f~~~~l~P~~~~ie~~l~~~l~~~~~~~~g~~ 334 (416) T protein:vir:12 268 KFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIE-------------YVRNTLQPWIVNFEQELNVKLFLDHDQKSGHY 334 (416) T ss_pred HHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHH-------------HHHHHHHHHHHHHHHHHHHhhcCchhhcCCce Confidence 8889999999999999998543222 221111111 111122222221111 111111111123 Q ss_pred eeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 380 LETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 380 ~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +++.+......|..++++++.+++.+| +++...+++.+|..+-+- ..+...-..-...+.+.. .....+..... T Consensus 335 i~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~Pi~g--gd~~~~~~n~~~~~~~~~--~~~~~~~~~~~ 408 (416) T protein:vir:12 335 VKFNIDSELRGDSKTQAEYLKTLHETG--VLNKDEIRELLERNPIEN--GDKYISSLNYVFLDFLEE--YQRLKAGGAMK 408 (416) T ss_pred EEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cceeeeccccccccccch--hhccccccccC Confidence 555555666778899999999998876 667777787877654310 000000000000000000 00000011111 Q ss_pred CCCCCccCCCCCCC Q lcl|NC_021324. 460 VTETKTETQTSPSG 473 (480) Q Consensus 460 ~~d~~~~~~~~~~~ 473 (480) ++|++.+ | T Consensus 409 gge~~~~------g 416 (416) T protein:vir:12 409 GGDNKNE------G 416 (416) T ss_pred CCCCcCC------C Confidence 1222211 1 No 161 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.40 E-value=8.5e-07 Score=53.86 Aligned_cols=416 Identities=10% Similarity=0.042 Sum_probs=160.7 Q ss_pred CCCHHHHHHH--------------------------HHHHHHHHHHHHH-HHHHHhhccCcchhcCccc-chh------h Q lcl|NC_021324. 1 MTTYHEHVER--------------------------LQGLLARDLPNLL-EAEAYRNGTRRLKTIGIGA-PPE------L 46 (480) Q Consensus 1 ~~~~~~~i~~--------------------------l~~~~~~~~~~~~-~~~~YY~G~~~i~~~~~~~-~~~------~ 46 (480) -.--++++.+ ++.-+..+.+... .+. -+.|.. ....+... +.. . T Consensus 40 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f-~~s~es-~s~vtsls~pdaf~~vnVs 117 (945) T protein:vir:10 40 YPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLF-EYSPES-LMYLPSISDPDAFFLINLF 117 (945) T ss_pred CCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhh-hccCcc-ceecccccCccceeeehhh Confidence 0111111111 1111111111100 000 111211 11111000 100 0 Q ss_pred hcceeccchHHHHHHHHHhhhccCcccc---CCCc---------hhHHHHHHHHHh-cC-------HHHHHHHHHHHHhh Q lcl|NC_021324. 47 AYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDS---------EGLEELWNWWQA-ND-------LDEESVLGHDDSLT 106 (480) Q Consensus 47 ~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~---------~~~~~l~~~~~~-n~-------~~~~~~~~~~~~~~ 106 (480) .+......-...+|+..++-+-.-++.+ ..+. .....+..++.+ |. +..+...+..+.+. T Consensus 118 ~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL 197 (945) T protein:vir:10 118 RKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILT 197 (945) T ss_pred hhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhh Confidence 0111111122334444444333333321 1111 112345556543 32 12355667889999 Q ss_pred cCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCC Q lcl|NC_021324. 107 FGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGG 185 (480) Q Consensus 107 ~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 185 (480) +|.+|+.+..+ .+|++ .+..++|..+.++.++... . ..+|....+++. ...+.++..++ T Consensus 198 ~GNAYieIiRd------~~G~ii~L~pLdPs~Vti~~ddDG~-~----~y~Yv~~idG~~---~~~v~a~DvIl------ 257 (945) T protein:vir:10 198 IDRGAIVKIRD------EQGNLVAITPVDGTTIKPILSEDTG-I----VVGYVQEVDGAI---VAHFDKRDVVL------ 257 (945) T ss_pred cCCeEEEEEEC------CCCcEEEEEEECCcceEEEEcCCCc-E----EEEEEEecCCce---EEEecCCceEE------ Confidence 99999988653 35665 5788999988887765432 1 111111111111 11122222111 Q ss_pred CccccccccccccccccccceEEeecccc--cCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhc----chhhhh--c Q lcl|NC_021324. 186 LNDQWVVDGDVIKHGLGVVPVVPLTNDPR--LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG----TPLRVI--S 257 (480) Q Consensus 186 ~~~~~~~~~~~~~~~~g~iPvv~~~n~~~--~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~----~p~l~i--~ 257 (480) ++++... ....+|.|.| ..+.+++...+.-......+|. .|-.++ . T Consensus 258 ----------------------hirn~s~DG~~~GyGlSPI----eaa~~aI~~alAaek~aar~FskNGa~PsGILsvk 311 (945) T protein:vir:10 258 ----------------------FRQNLTPDVYMYGYSLPPI----EILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIE 311 (945) T ss_pred ----------------------EeccCCCCcccccCCchHH----HHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEec Confidence 1111111 1123466644 3444555554444443444442 343333 2 Q ss_pred CCCcccc------ccc---cccchhhh-----hcchheecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHH Q lcl|NC_021324. 258 GVTTDEL------TND---GENTTLDI-----YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQY 322 (480) Q Consensus 258 G~~~~~~------~~~---~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~ 322 (480) |...... .++ .-...|.. ..+..+.+ +++.++.+++... -..+++..+..+.+|+..-++|+.. T Consensus 312 g~~~~d~k~~~~LseEq~erlKe~wee~~sG~NnG~piVL-deGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~l 390 (945) T protein:vir:10 312 PPSYKEGDIYPQLSREQLESIQRQLQAIMMGDYTQVPILS-GGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQD 390 (945) T ss_pred CccccccccccccCHHHHHHHHHHHHHHhCCcccccceec-CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHH Confidence 2211100 000 00011111 11223333 3456887776432 2246788888899999999999999 Q ss_pred hccccccchHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHH Q lcl|NC_021324. 323 LSSSSENPASAEAIIATDSRIVK-MAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSK 401 (480) Q Consensus 323 ~g~~~~~~~Sg~Al~~~~~~l~~-k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~k 401 (480) +|....++-|. +......... ...-....+...+.+. +. . ......+.+.|......+..+.++.+.+ T Consensus 391 LG~~e~st~SN--iEqq~~~Fv~~tL~Pil~~IEqeLNrk---Ll---~---~~eg~~i~fdFd~ldl~D~ksraEal~k 459 (945) T protein:vir:10 391 VGILEGSNKAT--AEVMASLTKAKGLEPLMATISKGFDEV---VS---E---FRNEKDIKLWFKEDDLEKERDWWNIIQG 459 (945) T ss_pred cccCCCCCcch--HHHHHHHHHHHHHHHHHHHHHHHHHHh---cc---c---cccCceeEEEecchhccCHHHHHHHHHH Confidence 97543322121 1211111111 1111111112222211 11 0 1112346778866666677888998988 Q ss_pred HHhccccCCCHHHHHHhCCCChhHH-HHHHHHHHHHHHHHHHHHhhhc-ccCCCcCCCCCCCCCCccCC-CCCCCCCccc Q lcl|NC_021324. 402 LYANGQGPIPKEQARIDLGYTATQR-EQMRDWDKQETEDMIDTLYSTT-KAQADATPKPTVTETKTETQ-TSPSGFNRTK 478 (480) Q Consensus 402 l~~~~~~~~s~e~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~d~~~~~~-~~~~~~~~~~ 478 (480) ++++| +++...+++.+|+.+-+- +... .....-.+.+...... .+.+.+..++.++.+..++. .++++...+. T Consensus 460 li~sG--iLTiNEvRe~lGLpPIeGGD~ll--i~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE 535 (945) T protein:vir:10 460 QLNTG--FRSINEARMEKGLEPVPWGDVPF--SGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSE 535 (945) T ss_pred HHhCC--CcCHHHHHHHhCCCCCCCcceee--eccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCc Confidence 88876 667777888887653210 0000 0000000000000111 11111111111111111111 1111222222 Q ss_pred CC Q lcl|NC_021324. 479 TR 480 (480) Q Consensus 479 ~~ 480 (480) +| T Consensus 536 ~k 537 (945) T protein:vir:10 536 QK 537 (945) T ss_pred cc Confidence 33 No 162 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.38 E-value=9.2e-07 Score=53.67 Aligned_cols=437 Identities=12% Similarity=0.073 Sum_probs=180.9 Q ss_pred CCC-HHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc--cchhhhcceeccchHHHHHHHHHhhhc----c--Cc Q lcl|NC_021324. 1 MTT-YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG--APPELAYLDVQPGWVATYLRTLSDRLD----I--EG 71 (480) Q Consensus 1 ~~~-~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~--~~~~~~~~~~~~n~~~~iVd~~~~~l~----~--~~ 71 (480) |-+ .+...+.|-.+......+.+.+.+|..- ++... .....+..++..+-+...++++++.|. + .+ T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP-----~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~~ 75 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLP-----YLLTEDGHASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQTS 75 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----ccCCCCCCcccccccccccchHHHHHHHHHHHHHHhhcCCCCc Confidence 443 3445555555555556666677777532 21110 001111224556667778888777663 2 23 Q ss_pred cc-c--CC---------CchhH-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee Q lcl|NC_021324. 72 FR-I--SE---------DSEGL-----------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 72 ~~-~--~~---------d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~ 128 (480) |+ . ++ +++.. ..+...+..++|.....++.++...+|.|.+++.. + T Consensus 76 WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~--------~--- 144 (542) T protein:vir:78 76 FFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFAGK--------K--- 144 (542) T ss_pred cccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEecC--------C--- Confidence 32 1 11 11111 23455666788999999999999999999776532 1 Q ss_pred EEEEEcccceEEEEeCCCCCceEEEEEEEEEec-------C----------------CceEEEEE-EEeCCcEEEEEecC Q lcl|NC_021324. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD-------D----------------VAVPDRAT-LYLPDETVPLRRNG 184 (480) Q Consensus 129 ~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~-------~----------------~~~~~~~~-~~~~~~~~~~~~~~ 184 (480) .+++++-.+.++--|. . .++...+|.+...- . ......+. ++..+..-.|.... T Consensus 145 ~~~~~pl~~y~v~~d~-~-G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~ 222 (542) T protein:vir:78 145 TLKVYPLDRYVIERDG-D-GNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCK 222 (542) T ss_pred CceEEecceeEEeeCC-C-CCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccc Confidence 1455554553333332 2 33444444432110 0 00011111 11111111111100 Q ss_pred --CCccccccc--cc-----cccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhh Q lcl|NC_021324. 185 --GLNDQWVVD--GD-----VIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV 255 (480) Q Consensus 185 --~~~~~~~~~--~~-----~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~ 255 (480) .....+..+ .. ....+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-......+....|.+. T Consensus 223 ~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~a~~pp~l 301 (542) T protein:vir:78 223 LVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVE-EFFGDLSSLDALTRSLIEGSAAAAKVVFM 301 (542) T ss_pred cCCCeEEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 000011111 11 223478889999888888889999999764 46666788888888888888888888755 Q ss_pred hcCCCccccccccccchhhhhcchheecCCCCceEEEec-cccHHH---HHHHHHHHHHHHHhhcCCChHHhccccccch Q lcl|NC_021324. 256 ISGVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPA 331 (480) Q Consensus 256 i~G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~ 331 (480) +.- + .-..........+|.+..-..++....++. ..+++. .++.++.-|.+.+...+. - +... - T Consensus 302 v~~-~----g~~~~~~~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~~~-----~-d~~r-v 369 (542) T protein:vir:78 302 VSP-S----ATTKPQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLILNV-----R-QSER-T 369 (542) T ss_pred ecc-c----cccchhhcccCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhccccc-----C-Cccc-c Confidence 520 0 000011112233333333222333444433 224433 344444444433332211 0 1111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHhCC-CCcccceeeeEEecCCCCCCH-HHHHH Q lcl|NC_021324. 332 SAEAIIATDSRIVKMAERKGRIFGGAWER------------AMRIAMQIMGR-EVTEEYTRLETVWRDPSTPTV-AAKAD 397 (480) Q Consensus 332 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~------------~~~li~~~~~~-~~~~~~~~~~v~f~~~~~~~~-~~~ad 397 (480) +++.++ ..+..+...++..+.+ .+.++.+..-. ..+.+ -+++.+.-++..-- ...++ T Consensus 370 TAtEV~-------~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~--lv~~~~~s~La~~~r~~~~~ 440 (542) T protein:vir:78 370 TATEVR-------EVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKG--LVMPTVVAGLGGVGRGEDRA 440 (542) T ss_pred cHHHHH-------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh--ceeeeeechHHHHHHHHHHH Confidence 333333 3334444444444333 33332221000 12222 25666654442111 11122 Q ss_pred HHHHH---Hhc--ccc----CCCHHHH----HHhCCCChhH----HHHHHHHHHH-HHHHHHHHHhhhcc-cCCCcCCCC Q lcl|NC_021324. 398 AVSKL---YAN--GQG----PIPKEQA----RIDLGYTATQ----REQMRDWDKQ-ETEDMIDTLYSTTK-AQADATPKP 458 (480) Q Consensus 398 ~~~kl---~~~--~~~----~~s~e~~----~~~~~~~~d~----~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~~~~~ 458 (480) .+.+. +.+ +.. .+....+ ...+|..... .+++..++++ +++..+.++..... ..+.+..++ T Consensus 441 ~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~ 520 (542) T protein:vir:78 441 ALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEK 520 (542) T ss_pred HHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 22211 111 111 1122222 2334543211 1222222221 11122222211111 111111111 Q ss_pred ---CCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 459 ---TVTETKTETQTSPSGFNRT 477 (480) Q Consensus 459 ---~~~d~~~~~~~~~~~~~~~ 477 (480) ....+++..+++|..+..- T Consensus 521 ~~~~~~a~~~~~~~~~~~~~~~ 542 (542) T protein:vir:78 521 MMQQINAPGQEAPAGPQTGEDL 542 (542) T ss_pred hhhhcCCCCcCCCCCCcccccC Confidence 1111222233334444443 No 163 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=98.35 E-value=1.1e-06 Score=53.17 Aligned_cols=393 Identities=9% Similarity=0.031 Sum_probs=144.3 Q ss_pred CC----------------CHHHHH---------HHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccch Q lcl|NC_021324. 1 MT----------------TYHEHV---------ERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGW 55 (480) Q Consensus 1 ~~----------------~~~~~i---------~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~ 55 (480) |. +.++++ ..+++........|.....+.. T Consensus 53 ~~~~~~g~~~~~~~~~~~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~------------------------- 107 (535) T protein:vir:10 53 ADGNVAGQYSVASISDVLSTKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNR------------------------- 107 (535) T ss_pred ccCCcccccccCccccccCHHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhc------------------------- Confidence 22 111211 1122222111112211111111 Q ss_pred HHHHHHHHHhhhccCcccc---CCCc--hhHHHHHHHHHh--cC-------HHHHHHHHHHHHhhcC-eeEEEEecCccc Q lcl|NC_021324. 56 VATYLRTLSDRLDIEGFRI---SEDS--EGLEELWNWWQA--ND-------LDEESVLGHDDSLTFG-RAYITVSHPDVE 120 (480) Q Consensus 56 ~~~iVd~~~~~l~~~~~~~---~~d~--~~~~~l~~~~~~--n~-------~~~~~~~~~~~~~~~G-~a~~~v~~~~~~ 120 (480) ..+++ ...-+.. +..+ .....+..++.. |. +..+...+..+++.+| .+|+.+..+ T Consensus 108 ------~~~~~-~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~--- 177 (535) T protein:vir:10 108 ------NGVGF-KVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKN--- 177 (535) T ss_pred ------ccCcc-eeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEEC--- Confidence 10000 0000000 0000 011123333321 11 2234556667777665 578887653 Q ss_pred cCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccc Q lcl|NC_021324. 121 SGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKH 199 (480) Q Consensus 121 ~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (480) ..|+| .+..++|..+.+..++........ +|... +.+.. ..|.++.+ T Consensus 178 ---~~G~~~~L~~l~p~~V~v~~d~~~~~~~~~---~~~~~-~~~~~---~~~~~~ei---------------------- 225 (535) T protein:vir:10 178 ---DSNELDHFNAVDASKVVISYSPRSKDQPRK---FEQFV-SETKS---VKFSERNL---------------------- 225 (535) T ss_pred ---CCCcEEEEEEeCCceeEEEEcCccccCceE---EEEEe-cCcee---EEECcccE---------------------- Confidence 35655 478899999888776543222111 11111 11110 11222322 Q ss_pred cccccceEEeeccc---ccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhhhc--CCCcccccccccc- Q lcl|NC_021324. 200 GLGVVPVVPLTNDP---RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVIS--GVTTDELTNDGEN- 270 (480) Q Consensus 200 ~~g~iPvv~~~n~~---~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i~--G~~~~~~~~~~~~- 270 (480) +||+.++ ...+.+|.|.|. .+.+++...+.-......+|. .|-.+|. +.......++... T Consensus 226 -------ih~~~~~~~~~~~~~~G~Spi~----~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~ 294 (535) T protein:vir:10 226 -------TFINYWNLSDTDRRGYGYSPVE----ASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAG 294 (535) T ss_pred -------EEEeccCCCCcccccccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHH Confidence 3443322 123567887653 334444444333333334433 3443433 2111111111000 Q ss_pred --chhhh------hcchheecCCCCceEEEeccc--cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHH-H-HHHH Q lcl|NC_021324. 271 --TTLDI------YYGRILTLASEAAKISEFKAA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA-E-AIIA 338 (480) Q Consensus 271 --~~~~~------~~~~~~~~~g~~~~~~~~~~~--~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg-~-Al~~ 338 (480) ..|+. ..+++..+.+++.++..+... .. .|++..+....+|+.+.++|+..+|....++-|. . +-.. T Consensus 295 lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~-qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~ 373 (535) T protein:vir:10 295 IRRQWTSQGSGLGGAWKIPILAAKDAKFVNMTQNSRDM-EFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSV 373 (535) T ss_pred HHHHHHHHhcCcccccccccccCCCceEEecCCChhHH-HHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhh Confidence 11111 113344455667788877643 33 4788888999999999999999998633211111 0 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHH Q lcl|NC_021324. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARI 417 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~ 417 (480) .+..... ......+...|.-++..+....... ...-...+++.|......+..+.++.. ++..+| +++...+++ T Consensus 374 ~~~s~~E--~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~~~~~f~f~~l~~~d~~~r~~~~-~~~~~g--~lT~NE~R~ 448 (535) T protein:vir:10 374 NEGSTAK--AKLESSKDKGLTPLLSFIEQVINDKIMRYVDTDYRFSFTLGDAQDKLQEEQVW-KLKLAN--GYFINEYRK 448 (535) T ss_pred hhhhhHH--HHHHHHHHHHHHHHHHHHHHHHhhhcccccCCeEEEEeccccccCHHHHHHHH-HHHHcC--CCCHHHHHH Confidence 1111000 1111122222333222222111100 011112466778777777777666544 444443 467777888 Q ss_pred hCCCChhHH-HHHHHHHHHHHHHHHHHHhhhcccCCCcC-----------CCCCC----------CCCC-----ccCCCC Q lcl|NC_021324. 418 DLGYTATQR-EQMRDWDKQETEDMIDTLYSTTKAQADAT-----------PKPTV----------TETK-----TETQTS 470 (480) Q Consensus 418 ~~~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~----------~d~~-----~~~~~~ 470 (480) ++|+.+-+- +..-.... .......-.......+++. +++.. +|+. ....++ T Consensus 449 ~~gl~piegGD~~~~~~~--~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 526 (535) T protein:vir:10 449 DHGLKTVDGLDVPGFIGS--AENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSESDD 526 (535) T ss_pred HhCCCCCCCccccccccc--hhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCCCc Confidence 877643210 00000000 0000000000000000000 00000 0000 000000 Q ss_pred CCCCCcccC Q lcl|NC_021324. 471 PSGFNRTKT 479 (480) Q Consensus 471 ~~~~~~~~~ 479 (480) ++...++-| T Consensus 527 ~~~~~~~~~ 535 (535) T protein:vir:10 527 VSNNEDADT 535 (535) T ss_pred cccccccCC Confidence 011111111 No 164 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.34 E-value=1.2e-06 Score=52.97 Aligned_cols=381 Identities=14% Similarity=0.091 Sum_probs=158.0 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCC Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLL----EAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED 77 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~----~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d 77 (480) .=|.+++...+..+.+... .+-.+. |-.. ..+ ..-+.+.-...+|+..++-+-.-++.+ .++ T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~-------~~~---~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~ 69 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLLLQWL-GVDP-------DTP---RNQLSEATYFACLKILSESLGKLPLKMYQKTER 69 (411) T ss_pred CchHHHHHhhccCcccccccchHHHHHHh-cCcc-------cCh---hhhhccHHHHHHHHHHHHhHhhCceeEEEecCC Confidence 2233332222111110000 000011 1000 000 000111112233333333222223322 211 Q ss_pred c---hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCC Q lcl|NC_021324. 78 S---EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTR 148 (480) Q Consensus 78 ~---~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~ 148 (480) . .....+..++.. | ....+...+..+.+.+|.||+++..+ .|.+ .+..++|..+.++.|+.... T Consensus 70 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-------~g~~~~l~~l~~~~v~~~~~~~~~~ 142 (411) T protein:vir:81 70 GIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-------GPQLQALWILPSQYVTIVVDDRGLL 142 (411) T ss_pred ceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCceEEEEEECCceEEEEEcCcccc Confidence 1 112234454432 3 34567778888999999999987643 2333 46788999988887654321 Q ss_pred ceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHH Q lcl|NC_021324. 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) .....+.+.+.....+. . ..+.++. |+||+++....+.+|.|.+.. + T Consensus 143 ~~~~~~~~~~~~~~~g~-~--~~~~~~e-----------------------------iih~k~~~~~~~~~G~s~~~~-~ 189 (411) T protein:vir:81 143 GEKNAIWYRYNDPYDGK-M--YVFRNDE-----------------------------ILHFKTSVTFDGITGLSVRDV-L 189 (411) T ss_pred cccceEEEEEEecCCce-E--EEEcccc-----------------------------EEEEcCCCCCCCcccccHHHH-H Confidence 11111111000000010 0 0011111 466664444556778876532 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC-ccccccccccchhhh------hcchheecCCCCceEEEecccc-HHH Q lcl|NC_021324. 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVT-TDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRN 300 (480) Q Consensus 229 ~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~ 300 (480) ...++....+.....+.+...+.|-.++..-. ..++..+.-...|.. ..++++.++ ++.++.+++.+. -.. T Consensus 190 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~q 268 (411) T protein:vir:81 190 KHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAGKIIPVP-LGMKLVPLDIKLTDSQ 268 (411) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecC-CCceEEEccCCHHHHH Confidence 33333333332222233333334665554311 111111111111211 123455554 446787776432 224 Q ss_pred HHHHHHHHHHHHHhhcCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCc Q lcl|NC_021324. 301 FAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVT 374 (480) Q Consensus 301 ~~~~l~~~~~~i~~~~~~p~~~~g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~ 374 (480) +++..+....+|+..-++|+..+|....+ .++...... ..+...|.-++..+. .+...... T Consensus 269 ~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~-------------~f~~~~l~P~~~~ie~~l~~~ll~~~~~ 335 (411) T protein:vir:81 269 FFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNL-------------AFYVDTLLYVLKQYEEEITYKILSNDLI 335 (411) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHH-------------HHHHHHHHHHHHHHHHHHHhhcCChhhc Confidence 66777788899999999999999764322 122221111 111222222222221 11111111 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCc Q lcl|NC_021324. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 375 ~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) .....+++.+......|..++++++.+++.+| +++...+++.+|+.+.+- -.+........+++.+... T Consensus 336 ~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g--~~t~NE~R~~~gl~p~~g--gD~~~~~~n~~pl~~~~~~------- 404 (411) T protein:vir:81 336 SQGHYFKFNVNVILRADIKTQMDSLSTAVQNG--IMTPNEARDYLDMPADDY--GNNLMANGNYIPLSMLGAN------- 404 (411) T ss_pred CCCcEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeeeeccCccchhhhhhh------- Confidence 11123455555556678889999999998876 667666778888754321 0000000000000000000 Q ss_pred CCCCCCCCC Q lcl|NC_021324. 455 TPKPTVTET 463 (480) Q Consensus 455 ~~~~~~~d~ 463 (480) ...++|. T Consensus 405 --~~kgGd~ 411 (411) T protein:vir:81 405 --YGKGGDS 411 (411) T ss_pred --hccCCCC Confidence 0011111 No 165 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=98.32 E-value=1.3e-06 Score=52.78 Aligned_cols=449 Identities=10% Similarity=0.080 Sum_probs=203.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) +-+.+.- ++.-..-..+|+.+..+++-+..+...-..+ ++.+-....|..-.+ ...++-+-.+.. T Consensus 43 ~~~~~~~----~~~~~eLI~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~i~Ld---~~~~s~~iK~kI 107 (558) T protein:vir:10 43 YVDIEGA----YRSEYDLIRRYREMALHPEADGAIEDVVNEA--------IVSDLYDSPVEVELS---NLNASNTLKKKI 107 (558) T ss_pred eecccch----hhhHHHHHHHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEEec---ccCcchHHHHHH Confidence 1111000 0001112234444444444333222110000 000000000000000 001111112233 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+...+.|+.|++...+.. ...+|-..++.++|..+-.|..-.....-. -...... T Consensus 108 ~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k--~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~--~~~~~~~ 183 (558) T protein:vir:10 108 REEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTK--NPQEGIQDLRYIDPLKIKFIRQEKRKPGNQ--DPAIRVR 183 (558) T ss_pred HHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC--CccccceeeeeeCcccceeeeeeccccccc--cceeeee Confidence 455666666668999999999999999999998764321 235688889999999887765421110000 0011111 Q ss_pred cCCc-----eEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeecccccC--ccCCCccchHHHHHH Q lcl|NC_021324. 161 DDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLG--NRYGRSEISPELRKV 231 (480) Q Consensus 161 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~~--~~~G~s~i~~~v~~l 231 (480) +..+ .+..+-+|.+.........+. +. ..+++ +||- |.|....... ...-.|-|.+.|+++ T Consensus 184 ~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~-----~~----~~~~v-kI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~ 253 (558) T protein:vir:10 184 SEQDVVPNPEFEEFYIYTPKVQHPTGMVGQ-----MG----GKNSI-KIAKDSITMCTSGLVDRNKNRVLSYLHKAIKAL 253 (558) T ss_pred cccceeeccceeEeeeecCCccccccccee-----ec----CCCce-eechhheeeecccceecCCCeeeecchHhhHhH Confidence 1111 111222333332222111110 00 01111 2332 3343321111 112223344555554 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hhee--- Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RILT--- 282 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~~~--- 282 (480) .. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -+|. T Consensus 254 NQ--LkmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRR 331 (558) T protein:vir:10 254 NQ--LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRR 331 (558) T ss_pred Hh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhccccc Confidence 32 2667777777777778877776555555543322110 011111 1111 Q ss_pred cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) .+|.+.++.+++..+.=.-++-++-....++..-++|.+-+...+. +..-|..|--.......-+.+.+..|..-+.++ T Consensus 332 eGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~ 411 (558) T protein:vir:10 332 EGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDM 411 (558) T ss_pred CCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2344567788886643345566676777888888999888865432 222233455556667777888888888888888 Q ss_pred HHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhccccCCCHHHHHHh-CCCChhHHHHH Q lcl|NC_021324. 362 MRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQGPIPKEQARID-LGYTATQREQM 429 (480) Q Consensus 362 ~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~~~s~e~~~~~-~~~~~d~~~~~ 429 (480) ++.=+-+.|.--..+| ..+.+.|..-........++. +.++-.-..-.+|.++++.. |..++++++++ T Consensus 412 Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~ 491 (558) T protein:vir:10 412 LKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEI 491 (558) T ss_pred HHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHH Confidence 8754444443333333 346677754433333333332 22222211124589988755 79999999888 Q ss_pred HHHHHHHHHHHHH-------HHh-hhcccCCCcCC-CCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 430 RDWDKQETEDMID-------TLY-STTKAQADATP-KPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 430 ~~~~~~~~~~~~~-------~~~-~~~~~~~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) .+.++++..+..- -+. ...+.+++++. ..+......+.++++..++-...| T Consensus 492 ~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 551 (558) T protein:vir:10 492 DTQIEDEIQKGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSK 551 (558) T ss_pred HHHHHHHHhCCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhhhhhhhhh Confidence 7666665442110 000 00001111110 011011111222333233333344 No 166 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.32 E-value=1.4e-06 Score=52.73 Aligned_cols=370 Identities=13% Similarity=0.118 Sum_probs=148.4 Q ss_pred HHHHHHHHHHHHH--HHHHHHHHHhhccCc--c-hh----cCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCC Q lcl|NC_021324. 7 HVERLQGLLARDL--PNLLEAEAYRNGTRR--L-KT----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) Q Consensus 7 ~i~~l~~~~~~~~--~~~~~~~~YY~G~~~--i-~~----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d 77 (480) ++-.+.+.+.+.. +.......+.-.... + .. .+..+.+.. -+.+.-...+|+..++-+-.-++.+-.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---al~~~~v~~~i~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARA---ALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHH---hhccHHHHHHHHHHHHhhccCceeeccc Confidence 2222222221110 000000011100000 0 00 000011000 0011222334444433333334544322 Q ss_pred chhHHHHHHHHHhc---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 78 SEGLEELWNWWQAN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 78 ~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ...|.+ +-| ....+...+..+.+.+|.||+++..+ ..|.+ .+..++|..+.+..+.... .+ + T Consensus 78 ~--~~~l~~--~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~~~~~~~~~-~~-~- 144 (392) T protein:vir:39 78 K--NQGIID--NPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN------ANGADMKWEYLRPSQVNTYYFEYEN-GM-Y- 144 (392) T ss_pred h--hhhHhh--cCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eE-E- Confidence 1 111111 122 23466777888999999999988653 34554 6777888888777654322 11 1 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +.....+.+ ......|.++. |+|+++....+..+|.|.|.- +...++ T Consensus 145 --y~~~~~~~~-~~~~~~~~~~e-----------------------------iih~~~~~~~~~~~G~s~i~~-~~~~i~ 191 (392) T protein:vir:39 145 --YNITFDDPK-IEPILQAPQSD-----------------------------LIHMKLLSIDGGKTGISPLYS-LRRESK 191 (392) T ss_pred --EEEEecCcc-cceeEEEcccc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHH Confidence 100111110 00011122222 455555444445678876532 333333 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhc--CCCccc-cccccccchhh--hhcchheecCCCCceEEEeccc-cHHHHHHHHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVIS--GVTTDE-LTNDGENTTLD--IYYGRILTLASEAAKISEFKAA-ELRNFAEEMEV 307 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~-~~~~~~~~~~~--~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~ 307 (480) ....+.....+.+...+.|-.++. +..... .........+. ...++++.++ ++.++.++... .-..+++..+. T Consensus 192 ~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~ 270 (392) T protein:vir:39 192 IQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-DLEEFTALEIKSNVAQLLSQTDW 270 (392) T ss_pred HHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecC-CCceEEEccCChhHHHHHHHHHH Confidence 322222222222333334443332 211111 00000011111 1123445554 44688777643 22357788888 Q ss_pred HHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCC Q lcl|NC_021324. 308 FRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDP 387 (480) Q Consensus 308 ~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~ 387 (480) ...+|+..-++|+..+|....+.++..+.+. .+...|.-+++.+..-...... ..+++..... T Consensus 271 ~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~--------------f~~~~l~P~~~~ie~~l~~~L~---~~~~~d~~~~ 333 (392) T protein:vir:39 271 TSKQYAKVYGLPDSYIGGQGDQQSSIQQISG--------------MYASALNRYLRPAISELEYKLS---DHISVNMRPA 333 (392) T ss_pred HHHHHHHHhCCCHHHhCCCCCcccHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhcc---ccccccchhh Confidence 8999999999999999875444333322221 1111222222211110000000 0111111122 Q ss_pred CCCCHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 388 STPTVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 388 ~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~---~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) .-.+..+.++.+.+++.+| +++...+++. .|+.++++.+++. +.. . + ++ T Consensus 334 ~~~d~~~~~~~~~~l~~~g--~~t~nE~r~~l~~~g~~p~e~r~~e~------------l~~-~----------~---~G 385 (392) T protein:vir:39 334 IDPLGDNYLSTISTATRWG--ALAENQATFVLQEAGYIPKDLPAPEN------------TNK-K----------T---TG 385 (392) T ss_pred hccCHHHHHHHHHHHHhCC--CcCHHHHHHHHHhcCCCccccchhcC------------CCC-C----------C---CC Confidence 2345566777788888765 5566555544 4777665432110 000 0 0 11 Q ss_pred ccCCCCC Q lcl|NC_021324. 465 TETQTSP 471 (480) Q Consensus 465 ~~~~~~~ 471 (480) ++.+..| T Consensus 386 d~~~p~p 392 (392) T protein:vir:39 386 QSNEPVP 392 (392) T ss_pred CCCCCCC Confidence 1122222 No 167 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.32 E-value=1.4e-06 Score=52.73 Aligned_cols=370 Identities=13% Similarity=0.118 Sum_probs=148.4 Q ss_pred HHHHHHHHHHHHH--HHHHHHHHHhhccCc--c-hh----cCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCC Q lcl|NC_021324. 7 HVERLQGLLARDL--PNLLEAEAYRNGTRR--L-KT----IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISED 77 (480) Q Consensus 7 ~i~~l~~~~~~~~--~~~~~~~~YY~G~~~--i-~~----~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d 77 (480) ++-.+.+.+.+.. +.......+.-.... + .. .+..+.+.. -+.+.-...+|+..++-+-.-++.+-.. T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---al~~~~v~~~i~~ia~~ia~lp~~~~~~ 77 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARA---ALRNSDLFSIILQLSSDLAIVKINAEKK 77 (392) T ss_pred CcchhhhhhhcccccccccccccccccCchhhhhhhhcCCCCceechHH---hhccHHHHHHHHHHHHhhccCceeeccc Confidence 2222222221110 000000011100000 0 00 000011000 0011222334444433333334544322 Q ss_pred chhHHHHHHHHHhc---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 78 SEGLEELWNWWQAN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 78 ~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) . ...|.+ +-| ....+...+..+.+.+|.||+++..+ ..|.+ .+..++|..+.+..+.... .+ + T Consensus 78 ~--~~~l~~--~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~~~~~~~~~-~~-~- 144 (392) T protein:vir:10 78 K--NQGIID--NPSTNANKHGFWQSMFAQLLLGGEAFAYRWRN------ANGADMKWEYLRPSQVNTYYFEYEN-GM-Y- 144 (392) T ss_pred h--hhhHhh--cCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-eE-E- Confidence 1 111111 122 23466777888999999999988653 34554 6777888888777654322 11 1 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +.....+.+ ......|.++. |+|+++....+..+|.|.|.- +...++ T Consensus 145 --y~~~~~~~~-~~~~~~~~~~e-----------------------------iih~~~~~~~~~~~G~s~i~~-~~~~i~ 191 (392) T protein:vir:10 145 --YNITFDDPK-IEPILQAPQSD-----------------------------LIHMKLLSIDGGKTGISPLYS-LRRESK 191 (392) T ss_pred --EEEEecCcc-cceeEEEcccc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHH Confidence 100111110 00011122222 455555444445678876532 333333 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhc--CCCccc-cccccccchhh--hhcchheecCCCCceEEEeccc-cHHHHHHHHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVIS--GVTTDE-LTNDGENTTLD--IYYGRILTLASEAAKISEFKAA-ELRNFAEEMEV 307 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~-~~~~~~~~~~~--~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~ 307 (480) ....+.....+.+...+.|-.++. +..... .........+. ...++++.++ ++.++.++... .-..+++..+. T Consensus 192 ~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~~~~e~~~~ 270 (392) T protein:vir:10 192 IQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-DLEEFTALEIKSNVAQLLSQTDW 270 (392) T ss_pred HHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecC-CCceEEEccCChhHHHHHHHHHH Confidence 322222222222333334443332 211111 00000011111 1123445554 44688777643 22357788888 Q ss_pred HHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCC Q lcl|NC_021324. 308 FRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDP 387 (480) Q Consensus 308 ~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~ 387 (480) ...+|+..-++|+..+|....+.++..+.+. .+...|.-+++.+..-...... ..+++..... T Consensus 271 ~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~--------------f~~~~l~P~~~~ie~~l~~~L~---~~~~~d~~~~ 333 (392) T protein:vir:10 271 TSKQYAKVYGLPDSYIGGQGDQQSSIQQISG--------------MYASALNRYLRPAISELEYKLS---DHISVNMRPA 333 (392) T ss_pred HHHHHHHHhCCCHHHhCCCCCcccHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhcc---ccccccchhh Confidence 8999999999999999875444333322221 1111222222211110000000 0111111122 Q ss_pred CCCCHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCC Q lcl|NC_021324. 388 STPTVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETK 464 (480) Q Consensus 388 ~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~---~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 464 (480) .-.+..+.++.+.+++.+| +++...+++. .|+.++++.+++. +.. . + ++ T Consensus 334 ~~~d~~~~~~~~~~l~~~g--~~t~nE~r~~l~~~g~~p~e~r~~e~------------l~~-~----------~---~G 385 (392) T protein:vir:10 334 IDPLGDNYLSTISTATRWG--ALAENQATFVLQEAGYIPKDLPAPEN------------TNK-K----------T---TG 385 (392) T ss_pred hccCHHHHHHHHHHHHhCC--CcCHHHHHHHHHhcCCCccccchhcC------------CCC-C----------C---CC Confidence 2345566777788888765 5566555544 4777665432110 000 0 0 11 Q ss_pred ccCCCCC Q lcl|NC_021324. 465 TETQTSP 471 (480) Q Consensus 465 ~~~~~~~ 471 (480) ++.+..| T Consensus 386 d~~~p~p 392 (392) T protein:vir:10 386 QSNEPVP 392 (392) T ss_pred CCCCCCC Confidence 1122222 No 168 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=98.31 E-value=1.4e-06 Score=52.64 Aligned_cols=382 Identities=10% Similarity=0.061 Sum_probs=146.7 Q ss_pred HHHHHHHHHHH---H-HHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc---cCCCchhHHHH Q lcl|NC_021324. 12 QGLLARDLPNL---L-EAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISEDSEGLEEL 84 (480) Q Consensus 12 ~~~~~~~~~~~---~-~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~~~~~~l 84 (480) +.-|..+..+. . -+..+.-|...-......+ .+. .-.-.+|+..++-+-.-++. ..++......+ T Consensus 1 m~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A------l~~--~~V~~~i~~Ia~~iA~lp~~~~~~~g~~~~~~~~ 72 (406) T protein:vir:97 1 MSFFQPLGTSKVSYDDYISSVLAGDVSQKYLGVSA------LKN--SDILTATSIIAGDIARFPLVKKDVNGDIIHDEDI 72 (406) T ss_pred CccccccCCCCCCcchHHHHHhcCCCCcccccchh------hcc--HHHHHHHHHHHHhhhhCeeEEEecCccccccchH Confidence 11111111100 0 0111111111100000000 010 11111233222222111232 12222222345 Q ss_pred HHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 85 WNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 85 ~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ..++.. |. ...+...+....+.+|.||+++..+. ..|.+ .+..++|..+.+..++.. . +.|.. T Consensus 73 ~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~-----~~g~~~~L~~i~p~~v~v~~~~~~--~----~~y~~ 141 (406) T protein:vir:97 73 NYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDP-----KTNQALQFQFYRPSETTVEETDNH--E----IVYTF 141 (406) T ss_pred HHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-----CCCeEEEEEEECCCeeEEEEcCCc--e----EEEEE Confidence 555532 32 34667778889999999999886532 23444 677788888876654321 1 11111 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) .....+.. ..+.++. |+||++.. ..+..|.|.+. .+.+.+... T Consensus 142 ~~~~~~~~---~~~~~~e-----------------------------vih~r~~~-~dg~~G~spi~----~~~~~i~~~ 184 (406) T protein:vir:97 142 TDMLTAKQ---VKCFAHD-----------------------------VIHWKFFS-HDTILGRSPLL----SLGDEIDLQ 184 (406) T ss_pred EecCCceE---EEEcccc-----------------------------EEEecCCC-CCCcccccHHH----HHHHHHHHH Confidence 11111110 0122222 34444432 23345777553 333333332 Q ss_pred HHHHHHHHHHhc---chhhhh-cCCCccccccccccchhhh-----hcchheecCCCCceEEEecccc-HHHHHHHHHHH Q lcl|NC_021324. 239 LMNLQSASQILG---TPLRVI-SGVTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVF 308 (480) Q Consensus 239 ~s~~~~~~~~~~---~p~l~i-~G~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~ 308 (480) ..-......+|. .|-.++ .+....+...+.-...|+- ..++++.++ ++.++.+++... -..+++..+.. T Consensus 185 ~a~~~~~~~~f~ng~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~ 263 (406) T protein:vir:97 185 TGGINTLIKFFKDGFSSGILTMKGAQLSGDARQRARQEFEKMREGSVGGSPLVFD-STMEYTPLEIDTNVLQLITSNNFS 263 (406) T ss_pred HHHHHHHHHHHhccCCCceEEecCCCCCHHHHHHHHHHHHHHhcccccCceeecC-CCceEEEccCCHHHHHHHHHHHhh Confidence 222222222222 222221 1221111111111111211 123455554 346777765332 22367777778 Q ss_pred HHHHHhhcCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCC Q lcl|NC_021324. 309 RKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDP 387 (480) Q Consensus 309 ~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~ 387 (480) +.+|+..-++|+..+|....+..+....+ +....|.-.+. .|+..+.. .+..... .....+.|. T Consensus 264 ~~~Ia~afgVPp~~lg~~~~~~~~e~~~~~f~~~~l~P~~~--------~ie~~l~~--kll~~~~---~~~~~i~fd-- 328 (406) T protein:vir:97 264 TAQIAKALRVPSYKLGVNSPNQSVAQLMEDYVTNDLPFYFD--------AITSELGL--KTLNDKD---RRLYHIEFD-- 328 (406) T ss_pred HHHHHHHhCCCHHHcCCCCCcchHHHHHHHHHHHHHHHHHH--------HHHHHHhh--hhcChhh---ccceeEEEe-- Confidence 89999999999999986533211111111 11111111111 11111110 1111111 112334452 Q ss_pred CCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 388 STPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 388 ~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ...+....++++.++++.| +++...+++.+|+.+-+-... +.+.......+-...+...+..++.. T Consensus 329 ~~~~~~~~~~~~~~~~~~g--~~T~NE~R~~~g~~p~~~~~g------------D~~~~~~n~~~~~~~~~~~~~~~~~~ 394 (406) T protein:vir:97 329 TRSVTGRNVDEIVKLVNNQ--ILTPNQGLVELGKQKSTDPNM------------DRYQSSLNYVFLDKKEEYQDKVGIKG 394 (406) T ss_pred cCccchhhHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCC------------CeEeeccCccchhccccccccccccc Confidence 1234556677788888765 567777777777653211000 00000000000000011111222233 Q ss_pred CCCCCCCCcccC Q lcl|NC_021324. 468 QTSPSGFNRTKT 479 (480) Q Consensus 468 ~~~~~~~~~~~~ 479 (480) ++.++++.+.+| T Consensus 395 ~gg~~~~~~~~~ 406 (406) T protein:vir:97 395 KGGEVNAEEDKS 406 (406) T ss_pred CCCCCCCCCCCC Confidence 344444444455 No 169 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=98.27 E-value=1.8e-06 Score=52.03 Aligned_cols=368 Identities=12% Similarity=0.067 Sum_probs=141.0 Q ss_pred HHHHHHHHHHHHHHHHHH----HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNL----LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~----~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) .-++.++.-.-....... .....+.-|.-.... +.. +..+.+.-...+|+..++-+-.-++.+..+. . T Consensus 1 Mg~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~----v~~---~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~-~ 72 (383) T protein:vir:10 1 MGLLTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQLSY----VSA---LSALQNTNVYSVINRIASDVSSAHFKTENTA-T 72 (383) T ss_pred CCcccccccccccccccccccchhhhhhhccCccccc----cch---hHhhcchHHHHHHHHHHHhhccCceeecccc-h Confidence 111110000000000000 000001111000000 000 0001111223334444333333355553332 1 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) ...|..-...-....+...+..+.+.+|.||+.+.... ..+...+|.++.+..+. +. +.++... T Consensus 73 ~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---------~~~~p~~~~~v~~~~~~--~~-----~~~~~~~ 136 (383) T protein:vir:10 73 LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---------LEHIPNSDVQINYLPGN--MG-----IVYTVLE 136 (383) T ss_pred hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---------eeEeecCcceEEEEEcC--Cc-----eEEEEEE Confidence 12222111111345667778889999999999875321 11222223222222211 10 0011111 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeeccc--ccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP--RLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~--~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) ...+. ...|.+++ |+||++.. .....+|.|.+.. +...++....+ T Consensus 137 ~~~~~---~~~~~~~e-----------------------------vih~r~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~ 183 (383) T protein:vir:10 137 SNDRP---KMVLRQDQ-----------------------------MLHFRLMPDPQYRYLIGRSPLES-LQNALNLDDKA 183 (383) T ss_pred cCCce---EEEEcccc-----------------------------eEEeccCCCCcccccccccHHHH-HHHHHHHHHHH Confidence 11110 01112222 35554321 2234578876643 33334433333 Q ss_pred HHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh-----hcchheecCCCCceEEEeccc--cHHHHHHHHHHHH Q lcl|NC_021324. 239 LMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAA--ELRNFAEEMEVFR 309 (480) Q Consensus 239 ~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~--~~~~~~~~l~~~~ 309 (480) .......+...+.|-.++. |...++...+.-...++. ..++++.++ ++.++..++.+ ..+.+.+..+... T Consensus 184 ~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~~~l~e~~~~~~ 262 (383) T protein:vir:10 184 SKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLP-DGFDYTQLEMKTDVFKALADNSAYSA 262 (383) T ss_pred HHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccC-CCceEEecCCChhHHHHHHHHHHHHH Confidence 3333333343444544443 211111111110111211 123445554 45677766543 3443446777778 Q ss_pred HHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCC Q lcl|NC_021324. 310 KEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST 389 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~ 389 (480) .+|+..-++|+..+|....+.+.+..++.. +..|...|.-+++.+..-....... ..+++.+..... T Consensus 263 ~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~-----------~~~~~~~l~P~~~~ie~~l~~~l~~--~~~~f~~~~l~~ 329 (383) T protein:vir:10 263 DQISKAFGVPSDILGGGTSTESQHSNIDQI-----------KATYLANLNSYVNPIVDELRLKMNA--PDLELDIKDMLD 329 (383) T ss_pred HHHHHHhCCCHHHcCCccCCCCccccHHHH-----------HHHHHHHHHHHHHHHHHHHHHhhCC--ceEEeechhhhc Confidence 999999999999998642211111111111 1111112222222221111000001 135666677777 Q ss_pred CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCC Q lcl|NC_021324. 390 PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQT 469 (480) Q Consensus 390 ~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) .|..+.++++.+++.+| +++...+++.+|..+-+- + ..+......... T Consensus 330 ~d~~~~~~~~~~~~~~G--~~t~nE~R~~lg~~p~~~-----------------------~-d~~~~~~~~~~~------ 377 (383) T protein:vir:10 330 VDDSILINQVSNLAKSG--VLGAEQAQFILTRSGFLP-----------------------D-NLPEFKPLTNET------ 377 (383) T ss_pred cCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCcccC-----------------------C-cccccCCCcccC------ Confidence 89999999999998876 566666777766533110 0 000000000000 Q ss_pred CCCCCCc Q lcl|NC_021324. 470 SPSGFNR 476 (480) Q Consensus 470 ~~~~~~~ 476 (480) ..|+++ T Consensus 378 -~gGd~e 383 (383) T protein:vir:10 378 -KGGDDK 383 (383) T ss_pred -CCCCCC Confidence 112222 No 170 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.25 E-value=2.1e-06 Score=51.75 Aligned_cols=396 Identities=11% Similarity=0.101 Sum_probs=141.5 Q ss_pred CCCHH-----------------------------------HHHHHHHHHHHHHHHHHHHHHHHhhccC--cchhcCcccc Q lcl|NC_021324. 1 MTTYH-----------------------------------EHVERLQGLLARDLPNLLEAEAYRNGTR--RLKTIGIGAP 43 (480) Q Consensus 1 ~~~~~-----------------------------------~~i~~l~~~~~~~~~~~~~~~~YY~G~~--~i~~~~~~~~ 43 (480) |+..+ +++..+++........+.....+....- ++..+..... T Consensus 54 ~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~ 133 (576) T protein:vir:96 54 QAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAE 133 (576) T ss_pred chhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCc Confidence 21111 1233333333333333222222211110 0000000000 Q ss_pred hhhhcceeccchHHHHHHHHHhhhccCccccCCCch--hHHHHHHHHH-----h----cCHHHHHHHHHHHHhhcCeeEE Q lcl|NC_021324. 44 PELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE--GLEELWNWWQ-----A----NDLDEESVLGHDDSLTFGRAYI 112 (480) Q Consensus 44 ~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~--~~~~l~~~~~-----~----n~~~~~~~~~~~~~~~~G~a~~ 112 (480) . .+.. ....+..++. . ..+..+...+..+.+.+|.||+ T Consensus 134 ~-------------------------------~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~ 182 (576) T protein:vir:96 134 P-------------------------------GKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNF 182 (576) T ss_pred c-------------------------------chhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEE Confidence 0 0000 0011111111 0 1244566778889999999998 Q ss_pred EEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccc Q lcl|NC_021324. 113 TVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWV 191 (480) Q Consensus 113 ~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (480) .+.... +..|++ .+..++|..+.++.++... .......|....+.+ ....+.++.+++ T Consensus 183 ~i~~~r----d~~g~~~~L~pl~p~~V~v~~~~dg~--~~~~~~~~~~~~~~~---~~~~~~~~dii~------------ 241 (576) T protein:vir:96 183 EKVFNK----KNATTMDKFIAVDPSTIFYATDKNGK--IIKGGKRFVQVINKK---VVASFTSREMAM------------ 241 (576) T ss_pred EEEEec----CCCCceEEEEEeCCceeEEEECCCCc--eeeeeeEEEEecCCc---eEEEecccceEE------------ Confidence 875322 223444 5778999998888765431 111111111111111 111122222222 Q ss_pred cccccccccccccceEEeeccc---ccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CC-Cccccc Q lcl|NC_021324. 192 VDGDVIKHGLGVVPVVPLTNDP---RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV-TTDELT 265 (480) Q Consensus 192 ~~~~~~~~~~g~iPvv~~~n~~---~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~-~~~~~~ 265 (480) +..++ ...+.+|.|.|.- +...++....+.....+.+...+.|-.+|. |. .+.+.. T Consensus 242 -----------------~~~~~~~d~~~~~~G~Spi~~-a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~ 303 (576) T protein:vir:96 242 -----------------GIRNPRTELSSSGYGLSEVEI-AMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRA 303 (576) T ss_pred -----------------EeecCCCCcccCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHH Confidence 21111 1224578776532 333333222222222333333344544443 32 111111 Q ss_pred cccccchhhh------hcchh-eecCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHH---- Q lcl|NC_021324. 266 NDGENTTLDI------YYGRI-LTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASA---- 333 (480) Q Consensus 266 ~~~~~~~~~~------~~~~~-~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg---- 333 (480) .+.-...|.. ..+++ +.++ ++.+|..+... .-..|++..+....+|+..-++|+..+|....+.++| T Consensus 304 ~~~lr~~~~~~~~G~~nag~~p~vl~-~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~ 382 (576) T protein:vir:96 304 LENFKREWKSSFSGINGSWQVPVVMA-DDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGG 382 (576) T ss_pred HHHHHHHHHHHhccccccccceeecC-CCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccc Confidence 1111112221 11232 3343 45788777643 2335788889999999999999999998532211111 Q ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccc Q lcl|NC_021324. 334 -----EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE-VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQ 407 (480) Q Consensus 334 -----~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~-~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~ 407 (480) ..+...... .+...|.-++..+....... .......+.+.|.+.... ..++.......... T Consensus 383 ~s~t~sn~e~~~~~----------f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~f~r~d~~---~~~e~~~~~~~~~~ 449 (576) T protein:vir:96 383 NTLNEADPGKKQQQ----------SQNKGLQPLLRFIEDLINTHIISEYSDKYVFQFVGGDTK---SELDKIKILQEEVK 449 (576) T ss_pred cccccccHHHHHHH----------HHHHHHHHHHHHHHHHHHhhhchhccCceEEEeccCCHH---HHHHHHHHHHHHhc Confidence 011111111 11111222111111100000 001112355667555443 34443322222223 Q ss_pred cCCCHHHHHHhCCCChhHH-HHH---------HHHH------HHHHHHHHHHHhhhcccCCCcCCCCCCCCCC--ccCCC Q lcl|NC_021324. 408 GPIPKEQARIDLGYTATQR-EQM---------RDWD------KQETEDMIDTLYSTTKAQADATPKPTVTETK--TETQT 469 (480) Q Consensus 408 ~~~s~e~~~~~~~~~~d~~-~~~---------~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~--~~~~~ 469 (480) ++++...+++.+|+.+-+- .++ -... .+.+++..+.......+.....+++.+.+++ ++... T Consensus 450 G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~~~~ 529 (576) T protein:vir:96 450 TYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGRESN 529 (576) T ss_pred CccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCCcccccccc Confidence 5677777788877643210 000 0000 0000111111111001100000111111111 11111 Q ss_pred CCCCCCcccCC Q lcl|NC_021324. 470 SPSGFNRTKTR 480 (480) Q Consensus 470 ~~~~~~~~~~~ 480 (480) ++++-++.=++ T Consensus 530 ~~~~~~~~~~~ 540 (576) T protein:vir:96 530 DPTKIDSPVGT 540 (576) T ss_pred cCCCCCCcccc Confidence 11111110001 No 171 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.23 E-value=2.3e-06 Score=51.48 Aligned_cols=369 Identities=10% Similarity=-0.004 Sum_probs=144.9 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchhHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDLP-NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~-~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~ 83 (480) .-|++++......... ........+-+. .. .+..+.... -....-...+|+..++-+-.-+|.+.... .+. T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~v~~~~---~l~~~~v~~~i~~ia~~ia~~~~~~~~~~--~~~ 72 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLAS-LK--GNEWVSAET---ALRNSDLFSIINQLSNDLATVKLITSRKK--LQG 72 (382) T ss_pred CccccccccCCcccccccccchhhhcccc-cc--CCcccchHh---hhccHHHHHHHHHHHHhhccCceeeecch--hhh Confidence 2222222111100000 000000111000 00 000011000 01111223334444433333345443322 111 Q ss_pred HHH-HHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEec Q lcl|NC_021324. 84 LWN-WWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD 161 (480) Q Consensus 84 l~~-~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~ 161 (480) |.. =...-....+...+..+.+.+|.||+.+-.+ ..|++ .+..++|..+.++.++... .+ .|....+ T Consensus 73 L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd------~~G~~~~l~~i~~~~v~v~~~~~~~-~~----~y~~~~~ 141 (382) T protein:vir:48 73 IVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------ENGRDMKWEYLRPSQVSFNRLDNKD-GI----YYNITFD 141 (382) T ss_pred hhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCC-eE----EEEEEec Confidence 211 1111134567778888999999999988653 34554 7778888888877654322 11 1111111 Q ss_pred CCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 162 DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMN 241 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~ 241 (480) +... .....+.++. |+||++....+..+|.|.+.. +...++....+... T Consensus 142 ~~~~-~~~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~l~~-~~~~i~~~~~~~~~ 190 (382) T protein:vir:48 142 DPRI-PPKQHVPQND-----------------------------VLHFRLLSVDGGMTSVSPLMA-LSRELDIQKASGNL 190 (382) T ss_pred Cccc-cceeEEcCcc-----------------------------EEEecCCCCCCccccccHHHH-HHHHHHHHHHHHHH Confidence 1100 0001111111 466665544455788886642 44444433333333 Q ss_pred HHHHHHHhcchhhhhc--CCCccccccccccch---hhhhcchheecCCCCceEEEeccc-cHHHHHHHHHHHHHHHHhh Q lcl|NC_021324. 242 LQSASQILGTPLRVIS--GVTTDELTNDGENTT---LDIYYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRKEAASI 315 (480) Q Consensus 242 ~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~---~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~i~~~ 315 (480) ..+.+...+.|-.+++ |...+ +........ .....++++.+++ +.++.+++.. .-..+++..+....+|+.. T Consensus 191 ~~~~~~ng~~p~~il~~~~~~~~-e~~~~~~~~~~~~~~n~g~~~vl~~-g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~a 268 (382) T protein:vir:48 191 TINSLKNALNANGILKIKGGGLL-DFKTKLSRSRQAMKQMQGGPLVLDD-LEDFTPLEIKSNVSQLLKQADWTTGQFAKV 268 (382) T ss_pred HHHHHhccCCCceEEEeCCCCCh-HHHHHHHHHHHhhccCCCCeeEcCC-CceEEEccCChhHHHHHHHHHHHHHHHHHH Confidence 3343444455655553 21111 100000000 1112345566653 4678777633 2234678888889999999 Q ss_pred cCCChHHhccccccchHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021324. 316 TGLPPQYLSSSSENPASAEAIIATD-SRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 316 ~~~p~~~~g~~~~~~~Sg~Al~~~~-~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~ 394 (480) -++|+..+|....+..+..+.+..+ ..|.-.+...+..+... + ...... ++...+ + .+... T Consensus 269 fgVp~~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~----------l-~~~~~~---~~~~~~-~---~~~~~ 330 (382) T protein:vir:48 269 YGIPDNVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQK----------L-SCDVDA---DIFPAV-D---PTGSN 330 (382) T ss_pred hCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHH----------h-cChhhh---hhhhhh-c---cchhH Confidence 9999999986543332222222211 11111111111111111 1 101000 011111 1 11223 Q ss_pred HHHHHHHHHhccccCCCHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 395 KADAVSKLYANGQGPIPKEQARIDL---GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 395 ~ad~~~kl~~~~~~~~s~e~~~~~~---~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ....+.++..++ +++...+++.+ |+.++++.+.+. .. +..+++|.+++. T Consensus 331 ~~~~~~~l~~~g--~~t~~e~r~~l~~~g~~~~~~~~~~~------------~~----------~~~~GGd~~~~~ 382 (382) T protein:vir:48 331 YISRINSLVKTG--TLAQNQGLYILQQAEILPKELPNGEN------------PN----------STLKGGEEDGQD 382 (382) T ss_pred HHHHHHHHhhcC--ccCHHHHHHHHhhCCCCCcchhhhhc------------CC----------CCCCCCCCCCCC Confidence 334455565554 45555555443 555544322110 00 001111111111 No 172 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.22 E-value=2.4e-06 Score=51.40 Aligned_cols=382 Identities=12% Similarity=0.071 Sum_probs=152.7 Q ss_pred HHHHHHHHHHH-----HHHHHHh---hccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCch--hH Q lcl|NC_021324. 12 QGLLARDLPNL-----LEAEAYR---NGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE--GL 81 (480) Q Consensus 12 ~~~~~~~~~~~-----~~~~~YY---~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~--~~ 81 (480) +.-+.....|- .-+.... -|..... +..+.+.. -++. .-.-.+|+..++-+-.-++.+-.+.+ .. T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-al~~--~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~ 75 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK--LRQYKDIE-AIRH--SDIFTAVMMIASDLARMPIRVTVNGQINYS 75 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccC--ccccchhh-hhcc--hHHHHHHHHHHHhhccCceEEecCcccccc Confidence 11111110000 0001111 1111000 00010100 0111 11111334333333333444322211 12 Q ss_pred HHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEE Q lcl|NC_021324. 82 EELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 82 ~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ..+..++.. |. ...+...+....+.+|.||+++..+ ..|++ .+..++|..+.++.+... .+.+ T Consensus 76 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~g--~~~~--- 144 (416) T protein:vir:81 76 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTFRKTSEIELKSDARG--RLYY--- 144 (416) T ss_pred chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEECCCc--cEEE--- Confidence 234444432 32 3466777888889999999998753 34665 477888988887775432 2211 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHH Q lcl|NC_021324. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~ 235 (480) ++...+..+. .....|.++. |+||++.. ..+.+|.|.|. .+.+.+ T Consensus 145 ~~~~~~~~~~-~~~~~~~~~e-----------------------------vihir~~~-~d~~~G~s~i~----~~~~~i 189 (416) T protein:vir:81 145 FHQRIDSNGN-NIERNVKFED-----------------------------MLDIKFYS-LDGINGLSLLD----TLSRTI 189 (416) T ss_pred EEEEecCCCc-eeEEEEcccc-----------------------------EEEeccCC-CCCccccCHHH----HHHHHH Confidence 1111111111 1111122222 24554432 34467777553 333444 Q ss_pred HHHHHHHH---HHHHHhcchhhhhc--CCCccccccccccchhhh------hcchheecCCCCceEEEecccc-HHHHHH Q lcl|NC_021324. 236 SRTLMNLQ---SASQILGTPLRVIS--GVTTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 236 ~~~~s~~~---~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~ 303 (480) ....+-.. +.+...+.|..++. |...++...+.-...|+. ..++++.++ ++.++..++.+. -..|++ T Consensus 190 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~~e 268 (416) T protein:vir:81 190 ESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQLEVDTEVLKLIR 268 (416) T ss_pred HHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC-CCceeEeccCCHHHHHHHH Confidence 33332222 22333334544443 221111100000111211 113455565 345777765432 224777 Q ss_pred HHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--cceeee Q lcl|NC_021324. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTE--EYTRLE 381 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~--~~~~~~ 381 (480) ..+....+|+..-++|+..+|....+. |-......+ ...|.-++..+..-....... ....++ T Consensus 269 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~~~~--------------~~~l~P~~~~ie~~ln~~l~~~~~~~~~~ 333 (416) T protein:vir:81 269 ENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDY--------------LSTLKPYITCVCAELNFKFNDEYVNREFK 333 (416) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHH--------------HHHHHHHHHHHHHHHhhhccccccCceEE Confidence 778888999999999999998654331 211111111 111222222111111100111 112344 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH---HHHH-HHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ---REQM-RDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) +.+......|..++++++.+++.+| +++...+++.+|+.+-+ .... ....-. ..+.+.......+..... T Consensus 334 f~~~~l~~~D~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~----~~~~~~~~~~~~~~~~~~ 407 (416) T protein:vir:81 334 FDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHV----NIELVDEYQMNKSRATDK 407 (416) T ss_pred EechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeeccccc----ccccccccCccccccccc Confidence 5555556678889999998888876 66777777787765321 0000 000000 000000000000000000 Q ss_pred CCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFNR 476 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~~ 476 (480) ..+++. .|+ T Consensus 408 ----~~kgGe------~n~ 416 (416) T protein:vir:81 408 ----KLKGGE------ENE 416 (416) T ss_pred ----ccCCCC------CCC Confidence 000100 111 No 173 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.22 E-value=2.4e-06 Score=51.40 Aligned_cols=382 Identities=12% Similarity=0.071 Sum_probs=152.7 Q ss_pred HHHHHHHHHHH-----HHHHHHh---hccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCch--hH Q lcl|NC_021324. 12 QGLLARDLPNL-----LEAEAYR---NGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE--GL 81 (480) Q Consensus 12 ~~~~~~~~~~~-----~~~~~YY---~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~--~~ 81 (480) +.-+.....|- .-+.... -|..... +..+.+.. -++. .-.-.+|+..++-+-.-++.+-.+.+ .. T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-al~~--~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~ 75 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK--LRQYKDIE-AIRH--SDIFTAVMMIASDLARMPIRVTVNGQINYS 75 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccC--ccccchhh-hhcc--hHHHHHHHHHHHhhccCceEEecCcccccc Confidence 11111110000 0001111 1111000 00010100 0111 11111334333333333444322211 12 Q ss_pred HHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEE Q lcl|NC_021324. 82 EELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 82 ~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) ..+..++.. |. ...+...+....+.+|.||+++..+ ..|++ .+..++|..+.++.+... .+.+ T Consensus 76 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~v~v~~~~~g--~~~~--- 144 (416) T protein:vir:45 76 DRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTFRKTSEIELKSDARG--RLYY--- 144 (416) T ss_pred chHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEECCCc--cEEE--- Confidence 234444432 32 3466777888889999999998753 34665 477888988887775432 2211 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHH Q lcl|NC_021324. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~ 235 (480) ++...+..+. .....|.++. |+||++.. ..+.+|.|.|. .+.+.+ T Consensus 145 ~~~~~~~~~~-~~~~~~~~~e-----------------------------vihir~~~-~d~~~G~s~i~----~~~~~i 189 (416) T protein:vir:45 145 FHQRIDSNGN-NIERNVKFED-----------------------------MLDIKFYS-LDGINGLSLLD----TLSRTI 189 (416) T ss_pred EEEEecCCCc-eeEEEEcccc-----------------------------EEEeccCC-CCCccccCHHH----HHHHHH Confidence 1111111111 1111122222 24554432 34467777553 333444 Q ss_pred HHHHHHHH---HHHHHhcchhhhhc--CCCccccccccccchhhh------hcchheecCCCCceEEEecccc-HHHHHH Q lcl|NC_021324. 236 SRTLMNLQ---SASQILGTPLRVIS--GVTTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 236 ~~~~s~~~---~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~ 303 (480) ....+-.. +.+...+.|..++. |...++...+.-...|+. ..++++.++ ++.++..++.+. -..|++ T Consensus 190 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~~e 268 (416) T protein:vir:45 190 ESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-ESMTFDQLEVDTEVLKLIR 268 (416) T ss_pred HHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC-CCceeEeccCCHHHHHHHH Confidence 33332222 22333334544443 221111100000111211 113455565 345777765432 224777 Q ss_pred HHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc--cceeee Q lcl|NC_021324. 304 EMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTE--EYTRLE 381 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~--~~~~~~ 381 (480) ..+....+|+..-++|+..+|....+. |-......+ ...|.-++..+..-....... ....++ T Consensus 269 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~~~~--------------~~~l~P~~~~ie~~ln~~l~~~~~~~~~~ 333 (416) T protein:vir:45 269 ENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDY--------------LSTLKPYITCVCAELNFKFNDEYVNREFK 333 (416) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHH--------------HHHHHHHHHHHHHHHhhhccccccCceEE Confidence 778888999999999999998654331 211111111 111222222111111100111 112344 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH---HHHH-HHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ---REQM-RDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) +.+......|..++++++.+++.+| +++...+++.+|+.+-+ .... ....-. ..+.+.......+..... T Consensus 334 f~~~~l~~~D~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~----~~~~~~~~~~~~~~~~~~ 407 (416) T protein:vir:45 334 FDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLNHV----NIELVDEYQMNKSRATDK 407 (416) T ss_pred EechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeeccccc----ccccccccCccccccccc Confidence 5555556678889999998888876 66777777787765321 0000 000000 000000000000000000 Q ss_pred CCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFNR 476 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~~ 476 (480) ..+++. .|+ T Consensus 408 ----~~kgGe------~n~ 416 (416) T protein:vir:45 408 ----KLKGGE------ENE 416 (416) T ss_pred ----ccCCCC------CCC Confidence 000100 111 No 174 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=98.20 E-value=2.4e-06 Score=51.39 Aligned_cols=398 Identities=13% Similarity=0.059 Sum_probs=193.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) +.+..++|+ +|..+..+++-+..+...-..+ ++.+--..+|.--.+ -..++-.-.++. T Consensus 64 ~~~~~~LI~-----------~YR~ma~~pEvd~Av~eIvnea--------iv~d~~~~pV~l~l~---~~e~s~sik~kI 121 (516) T protein:vir:10 64 ISGTKDLIN-----------TYRQLTNNPEVERAVANIVNEA--------VVYEKGHKVVSLDLD---DTEFSSSIKDKI 121 (516) T ss_pred cccHHHHHH-----------HHHHhhhccchhHHHHHhhcce--------eEecCCCceEEEEec---ccccchHHHHHH Confidence 444444433 4445555554443332210000 000000000000000 000111112234 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+...+.|+.|++...+ ...+|-..++.++|..+..+.- .... T Consensus 122 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid----~~k~GI~elr~lDPr~i~~vR~-------------i~~~ 184 (516) T protein:vir:10 122 LEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMP----NPKEGIVELRRLDPRHVEYYRE-------------IVTS 184 (516) T ss_pred HHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEec----CcccceeeeeeeCCcceeeEEe-------------eecc Confidence 4556666666688999999999999999999886543 3457888899999988766532 1111 Q ss_pred cCCc-----eEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeecccccCccCCC----ccchHHHH Q lcl|NC_021324. 161 DDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRYGR----SEISPELR 229 (480) Q Consensus 161 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~~~~~G~----s~i~~~v~ 229 (480) +..+ ....+-+|.+... .|...+.... |+.-=+||- |.|.+.-. .+.+. |-|.+.|+ T Consensus 185 ~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~~~---------~~~~ikI~~daI~y~hSGl--~d~~~~~i~syLhkAiK 252 (516) T protein:vir:10 185 DVGGTSVVKGYREFFVYTTGNE-GYAYNGRLFE---------PNTRIKIPRSAIVYAHSGL--QDCSDRGIVGYLHNAVK 252 (516) T ss_pred cCcchhhhhceeeeeeeecCcc-ceeccccccC---------CCCceecchhheeeeecCc--ccCCCCceeceehhhhH Confidence 1111 1112223333221 2222211100 010012221 33332111 11112 33445555 Q ss_pred HHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hhee- Q lcl|NC_021324. 230 KVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RILT- 282 (480) Q Consensus 230 ~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~~~- 282 (480) ++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -+|. T Consensus 253 p~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLp 330 (516) T protein:vir:10 253 PANQ--LKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLM 330 (516) T ss_pred hHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccc Confidence 5422 2567777777777777877776555555543322110 011111 1111 Q ss_pred --cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 --LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENP---ASAEAIIATDSRIVKMAERKGRIFGGA 357 (480) Q Consensus 283 --~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~---~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 357 (480) .+|.+.++.++++.+.=.-++-++-....++..-++|.+-+....+.+ ..+..|--.+.....-+.+.+..|..- T Consensus 331 RReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~l 410 (516) T protein:vir:10 331 RRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEI 410 (516) T ss_pred ccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 234456788888765444566667777888888899988886543211 233345445556666677778888887 Q ss_pred HHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHH-------HHHHHHHHhccccCCCHHHHHHh-CCCChhH Q lcl|NC_021324. 358 WERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAK-------ADAVSKLYANGQGPIPKEQARID-LGYTATQ 425 (480) Q Consensus 358 l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~-------ad~~~kl~~~~~~~~s~e~~~~~-~~~~~d~ 425 (480) +..+++.=+-+.|---..++ ..+.+.|..-....+... ++.+.++..-....+|.++++.. |..++++ T Consensus 411 F~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDee 490 (516) T protein:vir:10 411 FLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQ 490 (516) T ss_pred HHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhH Confidence 77777753333443333333 346667754433333322 33333333322356799998755 7899999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 426 REQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 426 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) ++++.+.++++..+. .-..++ ++..- T Consensus 491 i~~~~k~I~~E~~~~-------~~~~p~----~e~~f 516 (516) T protein:vir:10 491 IAQEEKQIEKEANVK-------RFQNPE----NEDDF 516 (516) T ss_pred HHHHHHHHHHhhhCC-------CCCCCC----ccccC Confidence 887776666554211 000011 11001 No 175 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.19 E-value=2.9e-06 Score=50.94 Aligned_cols=441 Identities=15% Similarity=0.122 Sum_probs=179.2 Q ss_pred CCC------HHHHHHHHHHHHHHHHH----HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhcc- Q lcl|NC_021324. 1 MTT------YHEHVERLQGLLARDLP----NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDI- 69 (480) Q Consensus 1 ~~~------~~~~i~~l~~~~~~~~~----~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~- 69 (480) |+. .++.++++.+.+...+. +++.+.+|..-.- .+.-+. ... ....++..+-+...++++++.|.+ T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~-~~~~~~-~~~-~~~~~~~dst~~~a~~~Laa~l~~~ 77 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSL-FPKDSD-NSS-TDYTTPWQAVGARGLNNLSAKVMLA 77 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhcccc-CCCCCC-ccc-ccccccccchHHHHHHHHHHHHHHh Confidence 554 35566666666655443 3344444443210 100000 001 111245556667777777776632 Q ss_pred ----Cccc-c--CC--------CchhH-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCC Q lcl|NC_021324. 70 ----EGFR-I--SE--------DSEGL-----------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD 123 (480) Q Consensus 70 ----~~~~-~--~~--------d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d 123 (480) .+|+ . .+ +.... +.+...+..++|.....++.++...+|.|.+++..+... T Consensus 78 ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~--- 154 (543) T protein:vir:88 78 LFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDAS--- 154 (543) T ss_pred hcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCccc--- Confidence 1222 1 11 11111 224445566889999999999999999998776532200 Q ss_pred CCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEec---------------CCceEEEEEEEeCCcEEEEEecCCCcc Q lcl|NC_021324. 124 PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD---------------DVAVPDRATLYLPDETVPLRRNGGLND 188 (480) Q Consensus 124 ~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (480) ..+...++.++-.+.+.-.|. . .++...++.+...- +.+....+++|+. .|.+..+... T Consensus 155 ~~~~~~~~~~pl~~y~v~~d~-~-G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~----V~pr~~~~~~ 228 (543) T protein:vir:88 155 SNSYNPMKLYTLHNHVVQRDA-F-GNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTH----IYIDDESGDF 228 (543) T ss_pred cceecceEEeEcceEEEeeCC-C-CCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEE----EEeecCCCcc Confidence 001112344433333333333 2 34444444432110 0111112333321 1111111111 Q ss_pred c-------cccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CC Q lcl|NC_021324. 189 Q-------WVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GV 259 (480) Q Consensus 189 ~-------~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~ 259 (480) . ..+..+....++..+|++.++-+...++.||+|-.. +..+-+..+|.+.-.....++....|.+.+. |. T Consensus 229 ~~~~~~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~ 307 (543) T protein:vir:88 229 LSYQEIEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVE-EYLGDLNSLESLNEAMIKFAMISSKVVGLVNPNGI 307 (543) T ss_pred cccccccCeeeecCCCccccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeeccccc Confidence 0 111122222346788999888888889999999764 4667778888888888888888888886652 11 Q ss_pred CccccccccccchhhhhcchheecCCCCceEEEecc-ccHHHH---HHHHHHHHHHHHhhcCCChHHhccccccchHHHH Q lcl|NC_021324. 260 TTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNF---AEEMEVFRKEAASITGLPPQYLSSSSENPASAEA 335 (480) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~---~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~A 335 (480) .. .........+.+..-..++..+.++.. +++... ++.++.-|.+.+...+ +.......-+++- T Consensus 308 ~~-------~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~~~~r~TAtE 375 (543) T protein:vir:88 308 TQ-------VRRLVKAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNS-----AVQRSGERVTAEE 375 (543) T ss_pred cc-------hhhcccCCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----hccCCCCcccHHH Confidence 00 001111222333222233444444432 344443 3444444443333221 2111111123433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHhCC-CCcccceeeeEEecCCC-CCCHHHHHHHHHH Q lcl|NC_021324. 336 IIATDSRIVKMAERKGRIFGGAW------------ERAMRIAMQIMGR-EVTEEYTRLETVWRDPS-TPTVAAKADAVSK 401 (480) Q Consensus 336 l~~~~~~l~~k~~~~~~~~~~~l------------~~~~~li~~~~~~-~~~~~~~~~~v~f~~~~-~~~~~~~ad~~~k 401 (480) ++. ++..+...++..+ .+++.++.+..-. ..+.+ .+++.+.-++ +-.....++.+.. T Consensus 376 V~~-------r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~--~v~~~~vs~l~~l~r~~~~~~l~~ 446 (543) T protein:vir:88 376 IRY-------VASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQE--AVEPTVTTGAEALGRGQDLDKLTQ 446 (543) T ss_pred HHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh--ceeeeEEecHHHHHHHHHHHHHHH Confidence 333 2333333334333 3333333221000 11222 3445543222 1112222333333 Q ss_pred HHhccccC--------CCHHHH----HHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 402 LYANGQGP--------IPKEQA----RIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 402 l~~~~~~~--------~s~e~~----~~~~~~~-------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) ..+....+ +....+ ...+|.. +++++++.+.+.++++....+.........+....++..+ T Consensus 447 ~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~ 526 (543) T protein:vir:88 447 FLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATASPEAME 526 (543) T ss_pred HHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhccChHHHH Confidence 32211111 222222 2334652 3334433322222221111111111111112222222211 Q ss_pred C---CccCCCCCCCCCcccCC Q lcl|NC_021324. 463 T---KTETQTSPSGFNRTKTR 480 (480) Q Consensus 463 ~---~~~~~~~~~~~~~~~~~ 480 (480) . +.+++..| ++++ T Consensus 527 ~~~~~~~~~~~p-----~~~~ 542 (543) T protein:vir:88 527 SAMDTAGVQPGP-----IATQ 542 (543) T ss_pred HHhhhcCCCCCC-----CCCC Confidence 1 11122222 1222 No 176 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.18 E-value=2.9e-06 Score=50.90 Aligned_cols=433 Identities=14% Similarity=0.112 Sum_probs=170.8 Q ss_pred CCCHHH-------HHHHHHHHHHHH----HHHHHHHHHHhhccCcchhcCcc--cchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTYHE-------HVERLQGLLARD----LPNLLEAEAYRNGTRRLKTIGIG--APPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~~~-------~i~~l~~~~~~~----~~~~~~~~~YY~G~~~i~~~~~~--~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |++... -.++..+.+..+ ..+.+.+.+|.. |.+... .....+..++..+-+...++++++.| T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~l-----P~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~l 75 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTI-----PSLFPKDSDNASTDYTTPWQAVGARGLNNLASKL 75 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc-----cccCCCCCCccccccCCcccccHHHHHHHHHHHH Confidence 776533 233344444433 233344444432 221110 01111123455666777777777766 Q ss_pred cc-----Cccc-cC-CC---------ch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccc Q lcl|NC_021324. 68 DI-----EGFR-IS-ED---------SE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVE 120 (480) Q Consensus 68 ~~-----~~~~-~~-~d---------~~----~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~ 120 (480) .+ .+|+ .. .| .. .. ..+...+..++|.....++.++...+|.|.+++... T Consensus 76 ~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~--- 152 (535) T protein:vir:94 76 MLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEP--- 152 (535) T ss_pred HhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccC--- Confidence 32 1222 11 11 01 11 123344566899999999999999999997776432 Q ss_pred cCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEec---------------CCceEEEEEEEeCCcEEEEEecCC Q lcl|NC_021324. 121 SGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD---------------DVAVPDRATLYLPDETVPLRRNGG 185 (480) Q Consensus 121 ~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~ 185 (480) .+...+++.++-.+.++--|. . .++...++.+...- +......+++|+. .|....+ T Consensus 153 ---~~~~~~f~~~pl~~y~v~~d~-~-G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~~~~ 223 (535) T protein:vir:94 153 ---EGTYNPMKLYRLSSYVVQRDA-F-GTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTH----IYLDEES 223 (535) T ss_pred ---cCcccceEEEEcCeEEEeeCC-C-CCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEE----EEeeCCC Confidence 222345677765554444443 2 34444444432110 0111122333331 0111111 Q ss_pred Cccccc-------cccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc- Q lcl|NC_021324. 186 LNDQWV-------VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS- 257 (480) Q Consensus 186 ~~~~~~-------~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~- 257 (480) ....++ +.......+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-...........|.+.+. T Consensus 224 ~~~~~~~e~~g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~p 302 (535) T protein:vir:94 224 GEYLKYEEIDGVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCE-EYLGDLRSLENLQEAIVKMSMISAKVIGLVNP 302 (535) T ss_pred CcEEEEEEecCeeeccccccCccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhccCCccccc Confidence 111111 0111123578889999999888889999999664 3455566666665555555555555554432 Q ss_pred -CCCccccccccccchhhhhcchheecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHH Q lcl|NC_021324. 258 -GVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEA 335 (480) Q Consensus 258 -G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~A 335 (480) |.. .........+|.++.-..++..+.++.. ++++.....++.+...|....-+ ..+.......-+++. T Consensus 303 ~g~~-------~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~d~~rvTAtE 373 (535) T protein:vir:94 303 AGIT-------QVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFML--NSAVQRTGERVTAEE 373 (535) T ss_pred cccc-------chhhcccCCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhH--hhhccCCCCCccHHH Confidence 111 0111112233443332223344555542 34444444444433333222110 112111111123433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHhCCC---CcccceeeeEEecCCCCC-CHHHHHHHH Q lcl|NC_021324. 336 IIATDSRIVKMAERKGRIFGGAW------------ERAMRIAMQIMGRE---VTEEYTRLETVWRDPSTP-TVAAKADAV 399 (480) Q Consensus 336 l~~~~~~l~~k~~~~~~~~~~~l------------~~~~~li~~~~~~~---~~~~~~~~~v~f~~~~~~-~~~~~ad~~ 399 (480) ++.. +..++..++..+ .+++.++.+ .+. .+.+. +++.+.-++.. ...+.++.+ T Consensus 374 V~~r-------~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r--~g~lP~~p~~~--v~~~~vs~la~l~r~~~~~~l 442 (535) T protein:vir:94 374 IRYV-------ASELEDTLGGVYSILSQELQLPMVRVLLKQLQA--TNQIPELPKEA--VEPTISTGMEALGRGQDLDKL 442 (535) T ss_pred HHHH-------HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHh--CCCCCCCChhh--ccceEeehHHHHHHHHHHHHH Confidence 3332 233333334333 333333221 121 12222 34444323211 011122222 Q ss_pred HHHH----hcccc----CCCHHH----HHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhhhcc-cCCCcCCCCC Q lcl|NC_021324. 400 SKLY----ANGQG----PIPKEQ----ARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTK-AQADATPKPT 459 (480) Q Consensus 400 ~kl~----~~~~~----~~s~e~----~~~~~~~~-------~d~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 459 (480) ...+ +.++. .+.... +...+|.. +++++.+.+..+++++ ...++..... ..+..+.+|. T Consensus 443 ~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~-~~~~~~~~g~~~~~~~~~~~~ 521 (535) T protein:vir:94 443 ERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTA-MQNAAASAGAGAGTMATASPE 521 (535) T ss_pred HHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHH-HHHHHHHHHHhhhcccccChH Confidence 2222 22211 122222 22334442 3333332222211111 1111111111 1111122222 Q ss_pred CCCCCc-cCCCCCC Q lcl|NC_021324. 460 VTETKT-ETQTSPS 472 (480) Q Consensus 460 ~~d~~~-~~~~~~~ 472 (480) ...... ..--.|+ T Consensus 522 ~~~~~~~~~g~~~~ 535 (535) T protein:vir:94 522 NMKAAAAQAGMAPN 535 (535) T ss_pred HHHHHHHHhccCCC Confidence 111110 0111233 No 177 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.16 E-value=3.4e-06 Score=50.57 Aligned_cols=432 Identities=13% Similarity=0.074 Sum_probs=174.7 Q ss_pred CCCH------HHHHHHHHHHHHHHH----HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc-- Q lcl|NC_021324. 1 MTTY------HEHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD-- 68 (480) Q Consensus 1 ~~~~------~~~i~~l~~~~~~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~-- 68 (480) |+.. ++-++++.+.+..++ .+.+.+.+|..-.. .. ....+...+..++..+-+...++++++.|. T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~-~~--~~~~~~~~~~~~~~dst~~~a~~~LAa~L~~~ 77 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV-FP--SATADGSTSYTTPWQSIGARGLNNLASKLMLA 77 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc-cC--CCCCcchhhccccccchHHHHHHHHHHHHHHh Confidence 6652 344555556554443 33444445542211 10 111111222345666667777887777663 Q ss_pred --c--Cccc-cC-CCch--------------------hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccC Q lcl|NC_021324. 69 --I--EGFR-IS-EDSE--------------------GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESG 122 (480) Q Consensus 69 --~--~~~~-~~-~d~~--------------------~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~ 122 (480) + .+|+ .. .|.. ..+.+...+..++|.....++.++...+|.|.+++..++ . T Consensus 78 ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~---~ 154 (532) T protein:vir:99 78 LFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTE---Q 154 (532) T ss_pred hcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccc---c Confidence 1 2332 11 1110 112234556678999999999999999999988776432 1 Q ss_pred CCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe-cC---------------CceEEEEEEEeCCcEEEEEecCCC Q lcl|NC_021324. 123 DPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-DD---------------VAVPDRATLYLPDETVPLRRNGGL 186 (480) Q Consensus 123 d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-~~---------------~~~~~~~~~~~~~~~~~~~~~~~~ 186 (480) +..+...++.++-.+.++.-|.. .++...++.+.-. +. ......+++|+. .+...++. T Consensus 155 ~~~~~~~f~~~pl~~y~v~~d~~--G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~----v~~~~~~~ 228 (532) T protein:vir:99 155 VEGQSNAPKLYKLHNFVVERDAY--DNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTH----VYRDPEAM 228 (532) T ss_pred ccCcccceEEEEcCeEEEeeCCC--CCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEE----EEecCCCC Confidence 23345567777766644444432 3344444332211 00 011112333331 01111111 Q ss_pred ccccc-------cccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC Q lcl|NC_021324. 187 NDQWV-------VDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV 259 (480) Q Consensus 187 ~~~~~-------~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~ 259 (480) ...++ ........+|..+|++.++-+...++.||+|-.. +..+-+..+|.+.-...........|.+.+. T Consensus 229 ~~~~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~-- 305 (532) T protein:vir:99 229 VFRSYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVE-EYLGDLKSLENLYEAIVKMSMISSKVLFFVN-- 305 (532) T ss_pred eeEEEEeecCceecccccccccccCCceeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHHcCCCceec-- Confidence 11111 1111112236678999888888889999999664 3556667777776666666677677764442 Q ss_pred CccccccccccchhhhhcchheecCCCCceEEEec-cccHHHH---HHHHHHHHHHHHhhcCCChHHhccccccchHHHH Q lcl|NC_021324. 260 TTDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRNF---AEEMEVFRKEAASITGLPPQYLSSSSENPASAEA 335 (480) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~~---~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~A 335 (480) ++ .-.......+..+|.+..-..++..+.++. .++++.. ++.++.-|.+.+...+ +.......-+++. T Consensus 306 -p~--g~~~~~~~~~~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~-----~~~~d~~r~TAtE 377 (532) T protein:vir:99 306 -PN--GVTQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNS-----AVQRGGDRVTAEE 377 (532) T ss_pred -cc--cccchhhhccCCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----cccCCCCcccHHH Confidence 00 000011111223333333222223344433 2344433 3444444443333221 1111111123433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHhCCC---Ccccceeee-EEecCCCCCCHHHHHHHH Q lcl|NC_021324. 336 IIATDSRIVKMAERKGRIFGGAW------------ERAMRIAMQIMGRE---VTEEYTRLE-TVWRDPSTPTVAAKADAV 399 (480) Q Consensus 336 l~~~~~~l~~k~~~~~~~~~~~l------------~~~~~li~~~~~~~---~~~~~~~~~-v~f~~~~~~~~~~~ad~~ 399 (480) ++.+ +..+...++..+ .+++.++.+ .+. .+.+..... +++-.++. ..+.++.+ T Consensus 378 V~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r--~g~lP~~p~~~~~~~iv~~is~La--raq~~~~l 446 (532) T protein:vir:99 378 IRYV-------AGELEDTLGGVYSLLSQELQLPLVKILLKELQA--TSKIPNLPKEAVEPAIATGLEALG--RGHDLNKL 446 (532) T ss_pred HHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh--cCCCCCCChhhcccceeecchHHH--HHHHHHHH Confidence 3332 233333333333 333333332 111 122222222 23322222 22223333 Q ss_pred HHHHhcc----c---cCCCH----HHHHHhCCCC-------hhHHHHHHHHHHHHHH-HHHHHHhhhcccCCCcCCCCCC Q lcl|NC_021324. 400 SKLYANG----Q---GPIPK----EQARIDLGYT-------ATQREQMRDWDKQETE-DMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 400 ~kl~~~~----~---~~~s~----e~~~~~~~~~-------~d~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 460 (480) +...+.. . -.+.. ..+...+|+. +++++.+.+.+++++. ........+..+++... +.+ T Consensus 447 ~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~--~~~ 524 (532) T protein:vir:99 447 NVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA--MMQ 524 (532) T ss_pred HHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcch--hHH Confidence 2222211 1 11221 2223444552 3333333221111111 11111111111111111 110 Q ss_pred CCCCccCCCC Q lcl|NC_021324. 461 TETKTETQTS 470 (480) Q Consensus 461 ~d~~~~~~~~ 470 (480) .-. +.+++ T Consensus 525 ~~~--~~~~~ 532 (532) T protein:vir:99 525 QQA--GMPTQ 532 (532) T ss_pred hhc--CCCCC Confidence 000 00111 No 178 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=98.12 E-value=4.2e-06 Score=50.05 Aligned_cols=419 Identities=13% Similarity=0.075 Sum_probs=161.4 Q ss_pred CCCH----HHHHHHHHHHHHHHH----HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc---- Q lcl|NC_021324. 1 MTTY----HEHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD---- 68 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~---- 68 (480) |+=+ ..-+.+..+.+..++ .+.+.+.+|. +|++........+.-++..+-+...++++++.|. T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~-----lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~lt 75 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFT-----LPYLMADVNDDLSSQNAWQDDGASATNFLSNKLSQVLF 75 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHh-----ccccccCCCCCccccccccchHHHHHHHHHHHHHHhhc Confidence 4322 334444555554333 3344444443 3332211111112224555666777777777663 Q ss_pred c--Cccc-cC-CC---------ch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCC Q lcl|NC_021324. 69 I--EGFR-IS-ED---------SE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDP 124 (480) Q Consensus 69 ~--~~~~-~~-~d---------~~----~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~ 124 (480) + .+|+ .. .+ .+ .. ..+...+..++|.....++.++...+|.|.+++- T Consensus 76 pp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~--------- 146 (517) T protein:vir:10 76 PAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYHP--------- 146 (517) T ss_pred CCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEe--------- Confidence 1 2232 11 11 11 11 1234456677999999999999999999865431 Q ss_pred CceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe-c------CC-----------ceEEEEEEEe-----CCcEEE-E Q lcl|NC_021324. 125 AGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-D------DV-----------AVPDRATLYL-----PDETVP-L 180 (480) Q Consensus 125 ~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-~------~~-----------~~~~~~~~~~-----~~~~~~-~ 180 (480) ++...++.++-.+.++.-|. .+ ++...++..... . .. .....+++|+ ++..+. | T Consensus 147 ~~~~~~~~~pl~~y~v~~d~-~G-~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~ 224 (517) T protein:vir:10 147 DKTSPIQAVPLHHYCVRRDN-NG-TVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTKDGKYLIR 224 (517) T ss_pred CCCCcEEEEEcCeEEEeeCC-Cc-CeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeCCCceEEE Confidence 22234566655554443343 33 333333222110 0 00 0001223333 122111 1 Q ss_pred EecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--C Q lcl|NC_021324. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G 258 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G 258 (480) ...++. ........+|..+|++.++-+...++.||+|-.. +..+=+..+|.+.-...........|.+.+. | T Consensus 225 ~~~d~~-----~~~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~~~~ 298 (517) T protein:vir:10 225 QSADDV-----PVGKESTVTEDKSPFLILTWKRSYGEDYGRGMAE-DHAGAFFVIQFLSEALARGMALMADVKYLVKPGS 298 (517) T ss_pred EEeCce-----eeccccccccccCCeeeeeeeecCCCCcccchHH-HhHHHHHHHHHHHHHHHHHHHHhccCCcccCccc Confidence 111110 0011111235788999888888889999999653 3445556666665555555555565554442 1 Q ss_pred CCccccccccccchhhhhcchheecCCCCceEEEec-cccHHH---HHHHHHHHHHHHHhhcCCChHHhcc-ccccchHH Q lcl|NC_021324. 259 VTTDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRN---FAEEMEVFRKEAASITGLPPQYLSS-SSENPASA 333 (480) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~-~~~~~~Sg 333 (480) ... .........+.+..-..++....++. ..+++. .++.++.-|...+.... +.. ++.. -++ T Consensus 299 ~~~-------~~~l~~~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~-----l~~~~~~r-vTA 365 (517) T protein:vir:10 299 YTD-------INQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEA-----MTRRDAER-VTA 365 (517) T ss_pred ccc-------hhhccCCCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-----hhccCCcc-ccH Confidence 110 00111112222222111222333333 223433 34444444444443322 111 1111 133 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhCCCCcccceeeeEEecCCCC-CCHHHHHHHHHHHHh Q lcl|NC_021324. 334 EAIIATDSRIVKMAERKGRIFGGAWERA--------MRIAMQIMGREVTEEYTRLETVWRDPST-PTVAAKADAVSKLYA 404 (480) Q Consensus 334 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~li~~~~~~~~~~~~~~~~v~f~~~~~-~~~~~~ad~~~kl~~ 404 (480) +.++ ..+..++..++..+.++ +...+.++....... .+++.+--++. -.....++.+.+..+ T Consensus 366 tEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~--~v~~~~~s~la~l~r~~~~~~i~~~~~ 436 (517) T protein:vir:10 366 YEIQ-------RDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSK--NVSPTILTGIEALGRMAELDKLGTFNG 436 (517) T ss_pred HHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCC--CccceeeccHHHHHHHHHHHHHHHHHH Confidence 3333 34445555555554442 222222222222211 23333322221 011112222322211 Q ss_pred c------ccc----CCCHHH----HHHhCCCC------hhHHHHHHHHHHHHHHHHHHH--HhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 405 N------GQG----PIPKEQ----ARIDLGYT------ATQREQMRDWDKQETEDMIDT--LYSTTKAQADATPKPTVTE 462 (480) Q Consensus 405 ~------~~~----~~s~e~----~~~~~~~~------~d~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~d 462 (480) . +.. .+.... +...+|.. +++++++ ++++++..+.+ ...+..+.+.+..+++. T Consensus 437 ~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~---~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~-- 511 (517) T protein:vir:10 437 YVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAE---AQAQQEQEATKYAAEQAGKAIPDMVKNGQI-- 511 (517) T ss_pred HHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-- Confidence 1 011 111222 22344543 2333332 22222111111 11111111111111111 Q ss_pred CCccCC Q lcl|NC_021324. 463 TKTETQ 468 (480) Q Consensus 463 ~~~~~~ 468 (480) +..+++ T Consensus 512 ~~~~~~ 517 (517) T protein:vir:10 512 NPQGGQ 517 (517) T ss_pred CCCCCC Confidence 111111 No 179 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=98.11 E-value=4.5e-06 Score=49.91 Aligned_cols=379 Identities=12% Similarity=0.076 Sum_probs=155.2 Q ss_pred CCCHHHHHHHHH-HHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhc------cee-ccchHHHHHHHHHhhhccCcc Q lcl|NC_021324. 1 MTTYHEHVERLQ-GLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY------LDV-QPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~-~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~------~~~-~~n~~~~iVd~~~~~l~~~~~ 72 (480) |+--+++-+.+. +-+.........- +-+.-....+......+..... .+. .......+|+..++-+-.-+| T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~~ 79 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEES-KAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMTI 79 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhh-hhhccccccccccccchhhHhhhccchhHHHhhchHHHHHHHHHHHhhccCce Confidence 332222222111 1111111100000 0000000000000000000000 011 123445555555554444455 Q ss_pred cc---CC--CchhHHHHHHHHH--hc---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCc-ee-EEEEEcccceEE Q lcl|NC_021324. 73 RI---SE--DSEGLEELWNWWQ--AN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG-IP-LIRVESPLYMYA 140 (480) Q Consensus 73 ~~---~~--d~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g-~~-~~~~~~p~~~~~ 140 (480) .+ .+ .......+..++. -| ....+...+..+.+.+|.||+++..+. .| .+ .+..++|..+.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~------~g~~~~~L~~l~~~~v~~ 153 (413) T protein:vir:96 80 QLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQV------SGDKIIGLTPISPYKVTF 153 (413) T ss_pred EEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcC------CCCceEEEEEecCceeEE Confidence 32 11 1122233444443 23 235677888899999999999987632 33 33 577888888777 Q ss_pred EEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccc-cCccC Q lcl|NC_021324. 141 ELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR-LGNRY 219 (480) Q Consensus 141 v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~-~~~~~ 219 (480) .+++.. ++ |......+ .+.++ -|+||+.+.. ..... T Consensus 154 ~~~~~~-------~~-y~~~~~~~------~~~~~-----------------------------evih~k~~~~~~~~~~ 190 (413) T protein:vir:96 154 NVSDDD-------LD-YSITFDNK------EYDPS-----------------------------TLLHFVLNPSIERPFI 190 (413) T ss_pred EEcCCe-------EE-EEEeecCc------EEchh-----------------------------hEEEEeccCCCCCccc Confidence 664321 11 11111110 01111 1456653322 23456 Q ss_pred CCccchHHHHHHHHHHHHHHHH---HHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCce Q lcl|NC_021324. 220 GRSEISPELRKVTDAASRTLMN---LQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAK 289 (480) Q Consensus 220 G~s~i~~~v~~l~D~~~~~~s~---~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~ 289 (480) |.|.+. .+.+.+.....- ..+.+...+.|-.++.-- ...+...+.-...|+- ..++++.++++... T Consensus 191 G~s~~~----~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~ 266 (413) T protein:vir:96 191 GTGYKV----ALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVN 266 (413) T ss_pred cccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCceeeecCCccc Confidence 777553 333443333222 333333344455555421 1111111111111211 12344455444444 Q ss_pred EEEe---ccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 290 ISEF---KAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM 366 (480) Q Consensus 290 ~~~~---~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~ 366 (480) +.++ +.... .+++..+....+|+..-++|+..+|..+. .+..+..+. ...|.-+++.+. T Consensus 267 ~~~~~~~~~~d~-q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~--~~~~~~~~~---------------~~~l~P~~~~ie 328 (413) T protein:vir:96 267 VQQIKPLTLNDL-AINDAVTLDKKTVAGIFGVPAFLLGVGTY--NKDEFNNFI---------------NTKIMSIAQVIQ 328 (413) T ss_pred ccccccCChhHH-HHHHHHHHHHHHHHHHhCCCHHHcCCCcc--hHHHHHHHH---------------HHHHHHHHHHHH Confidence 4433 22233 36777788889999999999999975321 122222221 112222222221 Q ss_pred HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021324. 367 QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYS 446 (480) Q Consensus 367 ~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 446 (480) .......-.....+++.+......+..+.++++.+++.+| +++...+++++|+.+.+- .. .+.. T Consensus 329 ~~ln~~ll~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~p~~~--gd------------~~~~ 392 (413) T protein:vir:96 329 QTYNKLIVEEDMYFSLNPRSLYNYSLTEMVSAGAQMTQLN--ALRRNEFRNWVGMPPDAE--MD------------DLLV 392 (413) T ss_pred HHHHHhhCCCCcEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cc------------eeee Confidence 1111000011233555556666788899999999998876 667777788888765421 00 0000 Q ss_pred hcccCCCcCCCCCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 447 TTKAQADATPKPTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 447 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 477 (480) ...- .+. .. .++.+ +.+.+++ T Consensus 393 ~~n~--~~~--~~----~~~~~--~~~~~dt 413 (413) T protein:vir:96 393 LENY--LQQ--KD----LVNQK--KLIQDET 413 (413) T ss_pred cccc--cch--hh----ccccc--CCCCCCC Confidence 0000 000 00 00000 0011111 No 180 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=98.10 E-value=4.6e-06 Score=49.83 Aligned_cols=400 Identities=13% Similarity=-0.008 Sum_probs=142.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHH-Hhh-----ccCcch-h----cCcccc-hhhhcceeccchHHHHHHHHHhhhc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEA-YRN-----GTRRLK-T----IGIGAP-PELAYLDVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~-YY~-----G~~~i~-~----~~~~~~-~~~~~~~~~~n~~~~iVd~~~~~l~ 68 (480) ||+--+ |.-.|..+..|.....+ |.. |..+.. . .+.... ..+...+-....++.|||..++.+- T Consensus 1 ~~~~~~----~~~~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~~~ 76 (449) T protein:vir:10 1 MTDKLT----LAVNHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGKCW 76 (449) T ss_pred CchhhH----HHHhhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhhhh Confidence 888633 32333333333322222 221 111111 0 011111 1111112223346899998888663 Q ss_pred cCc-cccCCCch----hHHHHHHHHH---hcCHHHHHHHHHHHHhhcCeeEEEEecCccccCC----CCceeEEEEEccc Q lcl|NC_021324. 69 IEG-FRISEDSE----GLEELWNWWQ---ANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD----PAGIPLIRVESPL 136 (480) Q Consensus 69 ~~~-~~~~~d~~----~~~~l~~~~~---~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d----~~g~~~~~~~~p~ 136 (480) -+. ..+.+++. ....+...++ .+++...+.++.+.+..+|.|++++..++....+ ..+.+ . T Consensus 77 ~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~~Pl~~~~~i--~----- 149 (449) T protein:vir:10 77 QTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIRDEKDWNLPATKGRGL--Q----- 149 (449) T ss_pred hcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEecCCCCCCcccccCcce--e----- Confidence 222 12222211 1112222222 2345566778888999999988776543221111 11111 1 Q ss_pred ceEEEEeCCCCCceEEEEEEEEEe--cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccc Q lcl|NC_021324. 137 YMYAELDPRNTRRVTRAVRLYTTR--DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~ 214 (480) ++.|+|-.... . -.++.+. .+.+.+.++.+-+. ..++...+.. -|.--.| +|.-.+ T Consensus 150 ~i~v~~~~~i~----~-~~~~~dp~sp~yg~P~~y~v~~~-------~~g~~~~~~~------iH~SRl~---~~~~~~- 207 (449) T protein:vir:10 150 KVSVSWAGSLK----V-AEWDTGINSKTYGQPKLWKYTER-------LPNGSSRRVD------IHPDRVF---ILGDYS- 207 (449) T ss_pred eEEeeccccCC----h-hhhhcCCCCCCCCCceEEEEeee-------ccCCCcccee------eccceeE---eecCCC- Confidence 11112211000 0 0000100 11222222111100 0000000000 1221111 121000 Q ss_pred cCccCCCccchHHHHHHHHHH---HHHH-----HHHHHHHHHhcch---hhhhcC------CCcccccccc--ccchhhh Q lcl|NC_021324. 215 LGNRYGRSEISPELRKVTDAA---SRTL-----MNLQSASQILGTP---LRVISG------VTTDELTNDG--ENTTLDI 275 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~l~D~~---~~~~-----s~~~~~~~~~~~p---~l~i~G------~~~~~~~~~~--~~~~~~~ 275 (480) ..|.|.+. +..+++ +.+. +-+.+......-. -..+.| ...+...+.- ....+.. T Consensus 208 ---~~g~~~L~----~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~ 280 (449) T protein:vir:10 208 ---EDAIGFLE----PAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINR 280 (449) T ss_pred ---CCChhHHH----HHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhc Confidence 11333222 111111 1110 0001111100000 001111 1111000000 0000111 Q ss_pred hcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccc-cccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS-SENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 276 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~-~~~~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ..+.++...+++..+.+.+.+ ...+.+......+|+.++||..-|-|. .+..+|..-++. |... +..++..+ T Consensus 281 ~~~~~~i~~~~d~~~~~~~~s---gl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D~~n-yyd~---i~~~Q~~l 353 (449) T protein:vir:10 281 GNDVLMTTQGATVTPLVTSVA---DPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTEDQKY-FNAR---CQSRRVDL 353 (449) T ss_pred cchheeecCCcceEEEecccC---ChhHHHHHHHHHHHHHhCCCeeeeeccCccccccchhHHH-HHHH---HHHHHHhh Confidence 122223333333333333333 444556667788999999997666444 322222222332 3333 33344468 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHh--C-CCChhHHHHHHH Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID--L-GYTATQREQMRD 431 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~--~-~~~~d~~~~~~~ 431 (480) ++.|++++.+++...-+... .++.+.|.+-..++..+.|+...+.+++. .++... . .++.+++.+.. T Consensus 354 ~p~le~l~~~l~~s~~g~~~---~d~~i~f~pL~~~t~kEkAei~k~~A~a~------~~~~~ag~~~~~~~~EiR~~~- 423 (449) T protein:vir:10 354 SFEIEDFCDKLIELKIIDAV---AKKAVIWDDLNEQTGTEKLTNAKTMGEIN------QTMLGSGDNPAFSREEIRTAA- 423 (449) T ss_pred hHHHHHHHHHHHHhhcCCCC---CceeEEeCCCCCCCHHHHHHHHHHHHHHH------HHHHHccccCCcCHHHHHHHh- Confidence 99999999987654332222 35999999999999999988766555432 111211 1 12333221110 Q ss_pred HHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 432 WDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 432 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) + + .. +...+.+. +...+. .++.++- + T Consensus 424 ----------~-~---~~--~~~~~~~~--e~~de~--~~~~d~~-a 449 (449) T protein:vir:10 424 ----------G-Y---DN--DDEEPLGE--EDGDEE--DKATDSA-A 449 (449) T ss_pred ----------c-c---cC--CCCCCCCC--CCCccc--cccCCcC-C Confidence 0 0 00 00111111 111111 1111111 1 No 181 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.07 E-value=5.5e-06 Score=49.43 Aligned_cols=386 Identities=11% Similarity=0.035 Sum_probs=152.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHh----hccCcchh-cCcccchhhhcceeccchHHHHHHHHHhhhccCccccC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYR----NGTRRLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY----~G~~~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~ 75 (480) |... -++.++-+.+ +..+. .+-..... .+.....-..+....+.-...+|+..++-+-.-++.+- T Consensus 1 ~~~~-~~~~~~k~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~Ia~~ia~lp~~~~ 70 (409) T protein:vir:94 1 MAKE-NIVTRIKKKL---------IDNWIDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMY 70 (409) T ss_pred Cccc-ccchhhhhHH---------hhhhhcCCcccccccccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEe Confidence 5442 1222111111 11111 01000000 00000000000011112223333333332222233321 Q ss_pred -CCchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCC Q lcl|NC_021324. 76 -EDSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTR 148 (480) Q Consensus 76 -~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~ 148 (480) ..+.....+..++.. | .-..+...+..+.+.+|.||+++..+ ..|.| .+..++|..+.++.+.... T Consensus 71 ~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~~- 143 (409) T protein:vir:94 71 EDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQSR- 143 (409) T ss_pred ecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCCc- Confidence 111122334444432 3 23455677788999999999988753 34554 6677888888877764321 Q ss_pred ceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHH Q lcl|NC_021324. 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) . +. |......+.. + .+.++. |+||++.....+.+|.|.|.- + T Consensus 144 ~----~~-y~~~~~~g~~--~-~~~~~d-----------------------------vih~r~~~~~~~~~G~s~l~~-~ 185 (409) T protein:vir:94 144 E----LY-YSIHAATGNK--L-IVHNMD-----------------------------MLHFKHIVASNMVQGISPIDV-L 185 (409) T ss_pred E----EE-EEEEcCCceE--E-EEcccc-----------------------------EEEecCCCCCCccccccHHHH-H Confidence 1 11 1111111110 0 112222 355554333455678876532 3 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhh--cCCCccccccccccchhh---hhcchheecCCCCceEEEecccc-HHHHH Q lcl|NC_021324. 229 RKVTDAASRTLMNLQSASQILGTPLRVI--SGVTTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAE-LRNFA 302 (480) Q Consensus 229 ~~l~D~~~~~~s~~~~~~~~~~~p~l~i--~G~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~-~~~~~ 302 (480) ...++ ...+-....+..+..+-.++ .+....+...+.-...|+ ...++++.++ ++.++.+++... -..++ T Consensus 186 ~~~i~---~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~ 261 (409) T protein:vir:94 186 KNTTD---FDNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIV 261 (409) T ss_pred HHHHH---HHHHHHHHHHHhcCCCCeeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHH Confidence 33333 22221111233333332222 222221111111111111 1223455554 446787776432 22467 Q ss_pred HHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCcccc Q lcl|NC_021324. 303 EEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEEY 377 (480) Q Consensus 303 ~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~~~~ 377 (480) +..+....+|+..-++|+.-+|....+.-| .+......+.. ..|.-++..+. .+......... T Consensus 262 e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s--n~e~~~~~f~~----------~~l~P~~~~ie~~ln~~Ll~~~~~~~~ 329 (409) T protein:vir:94 262 ASENLTRERVANVFQLPSVFLNARSNTNFA--KNEELNRFYLQ----------HTLLPIVKQYEEEFNRKLLTKTDREKN 329 (409) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHH----------HHHHHHHHHHHHHHHHhhCCcccccCc Confidence 777788899999999999999864322111 12211111111 11222222111 11111111111 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ..+++........+..++++++.+++.+| +++...+++.+|+.+-+- . +.+.-... ..+-.. T Consensus 330 ~~i~fd~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--g------------D~~~~~~n--~~~~~~ 391 (409) T protein:vir:94 330 RYFKFNVKSYLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVEG--G------------DKPLISGD--LYPIDT 391 (409) T ss_pred ceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------CeEeeccc--cccccc Confidence 22333334445578888999999998876 566666677777643320 0 00000000 000000 Q ss_pred CCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~~~ 477 (480) +...... ..+...+.+++ T Consensus 392 ~~~~~~~--~kGG~~n~~e~ 409 (409) T protein:vir:94 392 PLELRKS--LKGGDKNVNES 409 (409) T ss_pred chhhccc--ccCCCCCcCCC Confidence 0000000 00000111111 No 182 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.06 E-value=5.6e-06 Score=49.39 Aligned_cols=387 Identities=10% Similarity=-0.010 Sum_probs=160.8 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCC--- Q lcl|NC_021324. 5 HEHVERLQGLLARD-LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED--- 77 (480) Q Consensus 5 ~~~i~~l~~~~~~~-~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d--- 77 (480) .-++++|..+.... ......+...+.+... ...+..+.++. -.+ ..-...+|+..++-+-.-++.+ .++ T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~-~~~g~~v~~~~-al~--~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~~~ 76 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYD-TYTGKQISSQR-AMR--LTAVFSCVRVLAESVGMLPCNLYHLNGSLKQ 76 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCcc-ccCCceechhh-hhc--cHHHHHHHHHHHHHhccCceEEEEecCCcee Confidence 22233332221110 0011111222211110 00111111110 011 1122333333333332223322 211 Q ss_pred chhHHHHHHHHH--hc---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceE Q lcl|NC_021324. 78 SEGLEELWNWWQ--AN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 78 ~~~~~~l~~~~~--~n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 151 (480) ......+..++. -| ....+...+....+.+|.||+++..+ .|.+ .+..++|..+.+.++... .+. T Consensus 77 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-------~g~~~~L~~l~~~~v~~~~~~~~--~~~ 147 (414) T protein:vir:44 77 RATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-------FGEVAELLPVDPGCVVPKLNSSW--EPV 147 (414) T ss_pred ecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-------CCcEEEEEEEcCceEEEEECCCC--cEE Confidence 111223444442 12 34567788888999999999988642 3455 577888888887775432 111 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHH Q lcl|NC_021324. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) | ......+.. ..+.++. |+||++. ...+++|.|.+.. +... T Consensus 148 ----y-~~~~~~g~~---~~~~~~e-----------------------------vih~~~~-~~d~~~G~s~i~~-~~~~ 188 (414) T protein:vir:44 148 ----Y-QVTFPDGST---DVLSQED-----------------------------IWHVRTL-TLDGLVGLNPIAY-AREA 188 (414) T ss_pred ----E-EEEecCceE---EEEcccc-----------------------------EEEecCC-CCCCcccccHHHH-HHHH Confidence 1 111111111 1122222 3455443 2345678876542 3333 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecccc-HHHHHH Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~ 303 (480) ++....+.....+.+...+.|-.++.-- ...++..+.-...|.. ..++++.++ ++.++.+++... -..|++ T Consensus 189 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d~~~~e 267 (414) T protein:vir:44 189 ISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMILE-MGLDWKSMALNAEDSQFLE 267 (414) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEEccCChHHHHHHH Confidence 3332333222333333333455444321 1111111111111111 113455554 446787776432 224778 Q ss_pred HHHHHHHHHHhhcCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCcccc Q lcl|NC_021324. 304 EMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEEY 377 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~~~~ 377 (480) ..+....+|+..-++|+..+|....+ .++..... .. .+...|+-+++.+. .+...... .. T Consensus 268 ~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~---~~----------~~~~~l~P~~~~ie~~ln~~L~~~~~~-~~ 333 (414) T protein:vir:44 268 TRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG---LG----------FINYSLVPYLTRIEQRINTGLVRKSKQ-GV 333 (414) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH---HH----------HHHHHHHHHHHHHHHHHHhhcCCcccc-Cc Confidence 88888899999999999999764322 12211111 11 11112222222111 11111111 11 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ..+++.+......|..++++++.+++++| +++...+++.+|+.+-+- - +.+.........+. T Consensus 334 ~~i~fd~~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g--g------------D~~~~~~n~~~~~~-- 395 (414) T protein:vir:44 334 FYAKFNAGALLRGDMKSRFEAYATGINWG--IYSPNDCRDLEDMNPRPG--G------------DVYLTPMNMTTKPS-- 395 (414) T ss_pred eEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------ceecccccccccCC-- Confidence 23444445566678889999999988875 667777777877754220 0 11111011000000 Q ss_pred CCCCCCCccCCCCCCCCCcccC Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFNRTKT 479 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~~~~~ 479 (480) +....+.+++++..+.+++ T Consensus 396 ---~~~~~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 396 ---DGSKAGKQKDNANADETTS 414 (414) T ss_pred ---ccccCCCCCCCCCCCCCCC Confidence 1111222233333333333 No 183 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=98.06 E-value=5.6e-06 Score=49.36 Aligned_cols=400 Identities=14% Similarity=0.091 Sum_probs=163.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHH-HHHHhhccCc-c-------------hhcCcccchhhhcceeccchHHHHHHHHHh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLE-AEAYRNGTRR-L-------------KTIGIGAPPELAYLDVQPGWVATYLRTLSD 65 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~-~~~YY~G~~~-i-------------~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~ 65 (480) |-...+ +.+.+... +.. |-|..- . ...+..+... .-+.+.-...+|+..++ T Consensus 1 ~~~~~~----------~~~~~~~~~~~~-~~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~ci~~Ia~ 66 (437) T protein:vir:10 1 MKQGKQ----------RALGRIKSSFLK-WLGVPISLTDGSFWSAWGGMGSSSGETVTAD---SALQLSAVWSCVRLIAE 66 (437) T ss_pred CCcchh----------hhhhhhHHhhhh-hcCCcccCCchhHHHhhcccccCCCceechH---hhhccHHHHHHHHHHHH Confidence 322211 11112111 111 112110 0 0001111110 00111112223333333 Q ss_pred hhccCccc---cCCC--ch--hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEE Q lcl|NC_021324. 66 RLDIEGFR---ISED--SE--GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRV 132 (480) Q Consensus 66 ~l~~~~~~---~~~d--~~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~ 132 (480) -+-.-++. ...+ .. ....+..++.. | ....+...+..+.+.+|.||+++..+ .|++ .+.. T Consensus 67 ~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-------~g~~~~L~~ 139 (437) T protein:vir:10 67 TIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-------AGVLIGLEL 139 (437) T ss_pred HHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEE Confidence 22222332 1111 11 12234444432 2 34567777888999999999988652 2454 4677 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecc Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~ 212 (480) ++|..+.+..+... . +.++.. ...+.. ..+.++. |+||++. T Consensus 140 l~p~~v~i~~~~~g--~----~~y~~~-~~~g~~---~~~~~~d-----------------------------Iih~r~~ 180 (437) T protein:vir:10 140 MLPQRTTVKRLTSG--A----LQYTYR-NVDGTV---STLAEDD-----------------------------VFHVRGF 180 (437) T ss_pred EcCcceEEEECCCC--e----EEEEEE-ecCceE---EEEcccc-----------------------------EEEecCc Confidence 88888777654321 1 111111 111110 1112222 3455543 Q ss_pred cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC-ccccccccccchhhh------hcchheecCC Q lcl|NC_021324. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVT-TDELTNDGENTTLDI------YYGRILTLAS 285 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~-~~~~~~~~~~~~~~~------~~~~~~~~~g 285 (480) . ..+.+|.|.|.. +...++....+.....+.+...+.|-.++.-.. ..+...+.-...|.- ..++++.++ T Consensus 181 ~-~d~~~G~spi~~-~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~- 257 (437) T protein:vir:10 181 S-LDGLMGLTPIQY-AREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMVLE- 257 (437) T ss_pred C-CCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecc- Confidence 2 345678876532 333333222222223333333344555554211 111111111111211 113455565 Q ss_pred CCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 286 EAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) Q Consensus 286 ~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l 364 (480) ++.++.++.... -..|++..+....+|+.+-++|+..+|....++..+..++.....+...+- .-+-..++..+.. T Consensus 258 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~tl---~P~~~~ie~~l~~ 334 (437) T protein:vir:10 258 AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFTL---RPWLTRIEQAARR 334 (437) T ss_pred CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHH---HHHHHHHHHHHHh Confidence 446888776432 224788888888999999999999997643322222223222222211110 0011111111110 Q ss_pred HHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH-HHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 365 AMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ-REQMRDWDKQETEDMIDT 443 (480) Q Consensus 365 i~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~-~~~~~~~~~~~~~~~~~~ 443 (480) .+........ ..+++.+......|..++++++.+++.+| +++...+++.+|+.+-+ -....-. ... T Consensus 335 --kll~~~e~~~-~~~~fd~~~ll~~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~gg~~~~~~--------~~~ 401 (437) T protein:vir:10 335 --SLLRPGERDQ-FYAEFSVEGLLRADSAGRAAFYSTMTQNG--LMTRDECRAKENLPPMGGNAAVLTV--------QSA 401 (437) T ss_pred --hccCccccCc-eEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcceEee--------cCc Confidence 1111111111 23455555666778889999999988875 66777777777764321 0000000 000 Q ss_pred HhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 444 LYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 444 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) +..... .....+...+.+...+++..++.+.+++.| T Consensus 402 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 437 (437) T protein:vir:10 402 LLPIDK-LGEHTTATAAQDALKAWLYQEEKTRATQER 437 (437) T ss_pred ccchhh-ccCcCCCcchhccccccCCCCCCCCccccC Confidence 000000 011111111122223333334445555556 No 184 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=98.05 E-value=6e-06 Score=49.20 Aligned_cols=415 Identities=11% Similarity=0.066 Sum_probs=197.2 Q ss_pred CCCH-----------HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc--chhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY-----------HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA--PPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~-----------~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~--~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.-+ -++-.. ++.-..-..+|+.+..+++-+..+...-..+ .+... .=+.+++- T Consensus 46 ~~~~~~~~gg~~~~~~~~e~~-~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~-~pV~l~L~----------- 112 (521) T protein:vir:81 46 DNLPASAWNSLTQQFYSTDQK-ISTTKQLVNTYRGLMNNHEVENAVQNIVNDAIVFEEGH-EVVSLNLE----------- 112 (521) T ss_pred CCCcceeecceeeeecccccc-hhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCC-ceEEEEec----------- Confidence 1100 000000 0001122234444444444433332210000 00000 00000110 Q ss_pred ccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCC Q lcl|NC_021324. 68 DIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNT 147 (480) Q Consensus 68 ~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~ 147 (480) -..++-.-.++..+.+..|++--+|+....+..+...+.|+.|++.-.+. ...+|-..++.++|..+..+.--... T Consensus 113 -~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~---~pk~GI~Elr~lDPr~i~~vr~i~k~ 188 (521) T protein:vir:81 113 -ATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGK---NPKDGIVELRQLDPRNLEYVREIITE 188 (521) T ss_pred -ccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEcC---CccccceeeeeeCCcceeeeeeeccc Confidence 00111111223445666677666899999999999999999998865431 24678889999999988776522111 Q ss_pred CceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc--eEEeecccccCcc--CCCcc Q lcl|NC_021324. 148 RRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP--VVPLTNDPRLGNR--YGRSE 223 (480) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP--vv~~~n~~~~~~~--~G~s~ 223 (480) ......+ .+....+-+|.++... |...+... .|+.-=+|| .|.|.+.-..... .-.|- T Consensus 189 ~~~~~~v--------~~~~~e~f~Y~~~~~~-~~~~g~~~---------~~~~~vkI~~dAI~y~hSGl~d~~~~~i~sy 250 (521) T protein:vir:81 189 DTPEGKI--------YKATKEYFIYTVGNSS-YCAGGQVF---------SPNSRVKIPRSAITYAHSGLMDCDDKYIIGY 250 (521) T ss_pred ccCccce--------ecceeeeeeeecCCcc-ccccceee---------cCCcceeechhheeeeeccceeCCCCeeeec Confidence 1100000 0011222334443221 11111100 000001122 1233322111111 11233 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc------------ Q lcl|NC_021324. 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG------------ 278 (480) Q Consensus 224 i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~------------ 278 (480) |.+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| T Consensus 251 LhkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMl 328 (521) T protein:vir:81 251 LHRAVKPANQ--LKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMT 328 (521) T ss_pred chhhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchh Confidence 4455555432 2667777777777778877776555555443322110 011111 Q ss_pred -hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc---cchHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 279 -RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE---NPASAEAIIATDSRIVKMAERKG 351 (480) Q Consensus 279 -~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~---~~~Sg~Al~~~~~~l~~k~~~~~ 351 (480) -.|. .+|.+.++.+++..+.=.-++-++-....++..-++|.+-++...+ +..-|..|--.+.....-+.+.+ T Consensus 329 EDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR 408 (521) T protein:vir:81 329 EDYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQ 408 (521) T ss_pred hhhcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHH Confidence 1111 2344567888887544445666677778888888999888843322 11233455556667777788888 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhccccCCCHHHHHHh-C Q lcl|NC_021324. 352 RIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQGPIPKEQARID-L 419 (480) Q Consensus 352 ~~~~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~~~s~e~~~~~-~ 419 (480) ..|..-+.++++.=+-++|.--..++ ..+.+.|..-....+...++. +.++..-..-.+|.++++.. | T Consensus 409 ~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~IL 488 (521) T protein:vir:81 409 SQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDIL 488 (521) T ss_pred HHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHh Confidence 88888888888754434443333333 346677754433333333332 22222211124588888755 7 Q ss_pred CCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 420 GYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 420 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) ..++++++++.+.++++..+. .-.++++..+.- T Consensus 489 r~tDeei~~~~k~I~~E~~~~-------~~~~p~~~~~~f 521 (521) T protein:vir:81 489 KYTDDQMDTEKKQIEEEANDP-------RFKQTPDEIEDF 521 (521) T ss_pred ccCHHHHHHHHHHHHHHhhCC-------CCCCCcccccCC Confidence 899999888776666554311 111122211111 No 185 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=98.04 E-value=5.6e-06 Score=49.37 Aligned_cols=404 Identities=10% Similarity=0.090 Sum_probs=194.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc--chhhhcceeccchHHHHHHHHHhhhccCccccCCCc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA--PPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~--~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~ 78 (480) +.+..++|+ +|+.+..+++-+..+...-..+ .+... .=+.+++ +. ..++-.-.+ T Consensus 71 ~~~~~eLI~-----------~YR~ma~~pEvd~Av~eIVneaIv~~~~~-~pV~l~L-----~~-------~~~s~~iK~ 126 (524) T protein:vir:98 71 IQNKEQLIN-----------TYRGIMSYPEVENAVSEIIDDAIVNEQGK-DIITMDL-----AK-------TNFSKAIQD 126 (524) T ss_pred cchHHHHHH-----------HHHHHhhccchhhHHHhhhcceeEecCCC-ceEEEEe-----cc-------cccchHHHH Confidence 112222222 3444444444333222110000 00000 0000011 00 011111122 Q ss_pred hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 79 EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 79 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) +..+.+..|++--+|+....+..+...+.|+.|++.-.+. ...+|-..++.++|..+-.|..--.. .+...+..+ T Consensus 127 kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~---~~~kGI~ELr~lDPr~i~~vr~~~~~-~~~~~~~v~- 201 (524) T protein:vir:98 127 KIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHK---DESKGIRELRQLDPRCMELIRESITE-TLDGGVKVF- 201 (524) T ss_pred HHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcC---CCCcceeeeeeeCCccceeeeecccc-ccccchhhc- Confidence 3445566676666899999999999999999998865432 23458888999999888666422111 111111111 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeecccccCccCCC---ccchHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNRYGR---SEISPELRKVTD 233 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~~~~~G~---s~i~~~v~~l~D 233 (480) .....+-+|.+... .|...+.. .. ++.-=+||- |.|.+.-.. +.+. |-|.+.|+++.. T Consensus 202 -----~~~~e~f~Y~~~~~-~~~~~g~~----~~-----~~~~ikI~~dAIvy~hSGL~--d~~~~iisyLhkAiKp~NQ 264 (524) T protein:vir:98 202 -----RGYREFFVYSAPKA-GYTYNGQI----YQ-----ANQKIKIPRSAIVYAHSGLE--DCSNNIIGYLHRAVKPANQ 264 (524) T ss_pred -----cceeeeeeeccCCC-ccccccce----ec-----CCCceeechhheeeeccCcc--cCCCCeeeehhHhhHhHHh Confidence 00122233333222 11111100 00 010012221 222221111 1111 334454555422 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hhee---cC Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RILT---LA 284 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~~~---~~ 284 (480) -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -+|. .+ T Consensus 265 --Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRReG 342 (524) T protein:vir:98 265 --LRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDG 342 (524) T ss_pred --hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhhhcccccCC Confidence 2667777777777778877776555555544322110 011111 1111 23 Q ss_pred CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcc-c-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 285 SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSS-S-SENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) Q Consensus 285 g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~-~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 362 (480) |.+.++.++++.+.-.-++-++-....++..-++|.+-+.. + +.+..-|..|--.+.....-+.+.+..|..-+.+++ T Consensus 343 grgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L 422 (524) T protein:vir:98 343 KAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLSDPL 422 (524) T ss_pred CCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44567888887653345666677777888888998877752 2 223222335655666777778888888888888888 Q ss_pred HHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhccccCCCHHHHHHh-CCCChhHHHHHH Q lcl|NC_021324. 363 RIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQGPIPKEQARID-LGYTATQREQMR 430 (480) Q Consensus 363 ~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~~~s~e~~~~~-~~~~~d~~~~~~ 430 (480) +.=+-+.|.--..++ ..+.+.|..-....+...++. +.++..-..-.+|.++++.. |..++++++++. T Consensus 423 ~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~ 502 (524) T protein:vir:98 423 KTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQA 502 (524) T ss_pred HHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHHHHHHH Confidence 754434443333333 346677754433333333332 23332212236788888755 789999988877 Q ss_pred HHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 431 DWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +.++++..+. .-.++++..+.- T Consensus 503 k~I~~E~k~~-------~~~~p~~e~~~f 524 (524) T protein:vir:98 503 KLIEEESKEE-------RFKNPEAEEENF 524 (524) T ss_pred HHHHHHHhCC-------CCcCCccccccC Confidence 6666664321 111111111111 No 186 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=98.04 E-value=6.4e-06 Score=49.07 Aligned_cols=427 Identities=14% Similarity=0.068 Sum_probs=171.0 Q ss_pred CCCHHHHHHHHHHHHHH--HHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC-------- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR--DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE-------- 70 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~--~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~-------- 70 (480) ..++.+.-. -++.. ...-++-+ .+|.|..-+-+.-- ..+.. +.-.++++.+.+..+.-+ T Consensus 79 ~~~~~~~~~---~~~~~~~~~~~~~~l-~~~~~~~F~Gy~~l---a~laQ----~~eyr~~~~~ia~e~~R~w~~~~~~~ 147 (695) T protein:vir:36 79 NYTPRERRA---ASYALDFNGTSMDAL-SFVTSSGFPGFPTL---VLLAQ----LPEYRAMHEVLADECIRTWGEAIGGT 147 (695) T ss_pred ccCccccch---hhhhhcccccccccc-hhhhccCcchHHHH---HHHhh----ccchhhHHHHHHHHhhcccceecccc Confidence 222222110 00111 11112222 24544322211000 00000 111122222222222111 Q ss_pred -------cccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEec-Cccc----------cCCCCc Q lcl|NC_021324. 71 -------GFRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSH-PDVE----------SGDPAG 126 (480) Q Consensus 71 -------~~~~------~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~-~~~~----------~~d~~g 126 (480) |+.+ ..+.+..+.|..-+++-++...+.++.+.+-.||.+.+++.. ++.. +....| T Consensus 148 ~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kG 227 (695) T protein:vir:36 148 KEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 227 (695) T ss_pred hhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCc Confidence 1221 123345567788788778899999999999999998755543 2100 001122 Q ss_pred ee-EEEEEcccceEEEE-eCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccccccccc Q lcl|NC_021324. 127 IP-LIRVESPLYMYAEL-DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVV 204 (480) Q Consensus 127 ~~-~~~~~~p~~~~~v~-d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 204 (480) .+ -+.+++|..+.|-. +..+.-. .+.+.+.++.+- +..++..+- +.|..- T Consensus 228 slKGl~ViDp~~vtP~~~n~~dP~s-----------pdfgkP~~y~V~--G~kIH~SRL---------------~~f~g~ 279 (695) T protein:vir:36 228 SFQGLRVVEPYWVTPNNYNSINPVA-----------DDFYKPSTWWMI--GTEVHATRL---------------HTIVSR 279 (695) T ss_pred ceeeeEeecccccccchhhhccchh-----------hccCCCceEEEe--ceEEeeeeE---------------EEecCC Confidence 22 25667777666632 1111100 111112222111 111111100 000000 Q ss_pred ceEE-eecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccc-ccccc------cchhhh- Q lcl|NC_021324. 205 PVVP-LTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDEL-TNDGE------NTTLDI- 275 (480) Q Consensus 205 Pvv~-~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~-~~~~~------~~~~~~- 275 (480) |+-. ++ ....++|.|-. .-+.+-+++.+++.-.....+..+....+.. + .... ..... ...+.. T Consensus 280 plPd~LK---p~y~~~GiSv~-q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~-d--la~aL~~g~~~~l~~R~eli~~~ 352 (695) T protein:vir:36 280 PVGDMLK---PTYSFAGISMT-QLAMPYIDNWLRTRQSVSDIVKQFSVSGILM-D--LAQALMPGANVDLSMRAELINRY 352 (695) T ss_pred Cchhhhh---cccccCcccHH-HHHHHHHHHHHHHHhHHHHHHHhhhHHHHHH-H--HHHhhcChhHHHHHHHHHHHHHh Confidence 1100 11 11224566644 3355555666666555555553333222111 1 1000 00000 011111 Q ss_pred -hcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc-cc-chHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 -YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKGR 352 (480) Q Consensus 276 -~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~~-~~Sg~Al~~~~~~l~~k~~~~~~ 352 (480) ....++.+++++-+|.+++. ++..+-+.+-.....+|..++||.--|-+.+ .. |+||++-..-|...+. ..++. T Consensus 353 Rsn~G~~llDk~~Eefeq~st-slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~ 429 (695) T protein:vir:36 353 RDNRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQRN 429 (695) T ss_pred cCccceEEEecCCcceEEEec-ccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHH--HHHHH Confidence 11233445544457777763 4667777788888999999999976664443 22 4688876666666665 56688 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCHHHHHHhC------CC Q lcl|NC_021324. 353 IFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPKEQARIDL------GY 421 (480) Q Consensus 353 ~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~e~~~~~~------~~ 421 (480) .+...|++++.++.+-.-+..+. .+.+.|.+-...+..+.|+.-.|-+++ ..+.++......+| +| T Consensus 430 ~L~p~L~rl~~ii~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y 506 (695) T protein:vir:36 430 ALQQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPY 506 (695) T ss_pred HHHHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccc Confidence 99999999999876544333332 478899999999999998864433221 11122222222221 11 Q ss_pred ChhHHHHHHH--HHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCC---CCC----CCcccCC Q lcl|NC_021324. 422 TATQREQMRD--WDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTS---PSG----FNRTKTR 480 (480) Q Consensus 422 ~~d~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~---~~~----~~~~~~~ 480 (480) ... ++.-.. +..+.+.+-...+......+.+.++..++....+.+++- +.. ..++++. T Consensus 507 ~~~-~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~ 573 (695) T protein:vir:36 507 AGK-LDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDA 573 (695) T ss_pred ccc-cccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCCCc Confidence 100 000000 000000000000000000000000000000000000000 000 0111111 No 187 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.01 E-value=7.3e-06 Score=48.73 Aligned_cols=411 Identities=14% Similarity=0.116 Sum_probs=155.8 Q ss_pred HHHHHHH--HHHHHHHHHH------HhhccC-cchhcCccc-chhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 11 LQGLLAR--DLPNLLEAEA------YRNGTR-RLKTIGIGA-PPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 11 l~~~~~~--~~~~~~~~~~------YY~G~~-~i~~~~~~~-~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) |.+.|.. .+.|+..... -+.++- .+ ..... ...+++..-...+...+|+..+..+...++.+..+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~pp~~~~~La~~~~~n~~v~scI~~ia~~ia~~~~~i~~~~~~ 78 (540) T protein:vir:41 1 MFNYHLSIKSLEKYRAIKGDTDSQALKEDRFEEY--VEPKVHPLVLLSLLQVNPYHASACSIKANDILRTGYLIDGDDGG 78 (540) T ss_pred CCCcccChhhccchhhhhccccccccccCCCCcc--ccCCCCHHHHHHHHHhcHHHHHHHHHHHHHHhcCCceEecCccc Confidence 2222210 1111111111 011100 00 00000 01112222223456777887777777777765433222 Q ss_pred HHHHHHHHHhc---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEE Q lcl|NC_021324. 81 LEELWNWWQAN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 81 ~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) .. .+. .| ....+...+..+.+.+|.||+.+..+. .|.+ .+..++|..+.+..+... + T Consensus 79 ~~---~~l-pN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~------~G~~~~L~~i~~~~V~v~~~~~~---------~ 139 (540) T protein:vir:41 79 VE---ELL-RACRPSFEFILLQALEDLQVFNYCTLEVVRDD------QGEPVRLDYIPAHTVRVHRDGSR---------Y 139 (540) T ss_pred hh---hhc-cCCCCCHHHHHHHHHHHHHhcCCeEEEEEECC------CCcEEEEEEeCCcceEEeEcCce---------e Confidence 21 111 22 345667778889999999999887533 4554 577788887766543321 1 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHH Q lcl|NC_021324. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~ 236 (480) +...+... ..++..|.....+. ..++. ....+..=-|+||++.....+.+|.|.+. .+..++. T Consensus 140 ~~~~d~~~-~~~~~~~~~~~~~~--~~~g~----------~~~~~~~~eViHir~~~~~~~~~G~Spi~----~~~~~i~ 202 (540) T protein:vir:41 140 MQTWDGIH-VTYFKDYRYEGEVN--PDNGE----------DQDGVGANEIIFIHLPSPICSYYGVPRYL----SAAPSIL 202 (540) T ss_pred EeeecCce-eeeeecccccceee--ccccc----------cceeecccceEEecCCCCCCCcccccHHH----HHHHHHH Confidence 11111111 11222222111111 00100 00111112367887665566789998764 3334443 Q ss_pred HHHHHHHHHHHHhc---chhhhh--cCCCcccccc--cc---ccchhh-----------hhcchheecC-----CCCceE Q lcl|NC_021324. 237 RTLMNLQSASQILG---TPLRVI--SGVTTDELTN--DG---ENTTLD-----------IYYGRILTLA-----SEAAKI 290 (480) Q Consensus 237 ~~~s~~~~~~~~~~---~p~l~i--~G~~~~~~~~--~~---~~~~~~-----------~~~~~~~~~~-----g~~~~~ 290 (480) ....-......+|. .|-.+| .|...++... +. ....++ -..++++.++ +++.++ T Consensus 203 ~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~~~~~~g~~~ 282 (540) T protein:vir:41 203 AMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIPGGDTVEVTF 282 (540) T ss_pred HHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecCCCcccceeE Confidence 33322222233333 343333 2322111110 00 000111 0123334332 234577 Q ss_pred EEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhcccc---ccchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 291 SEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) Q Consensus 291 ~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~~~~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~l 364 (480) ..+.... -..|++..+....+|+..-++|+..+|... .+.++..... +....|.-.+ ..+...|.+ . T Consensus 283 ~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~----~~ie~~ln~---~ 355 (540) T protein:vir:41 283 TPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQ----EIVSSVLTD---F 355 (540) T ss_pred EecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHH----HHHHHHHHH---h Confidence 7665332 224788888999999999999999987432 1112222222 1111122111 122222221 1 Q ss_pred HHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC-CCChhHHHHHH--HHHHHHHHHHH Q lcl|NC_021324. 365 AMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL-GYTATQREQMR--DWDKQETEDMI 441 (480) Q Consensus 365 i~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~-~~~~d~~~~~~--~~~~~~~~~~~ 441 (480) ++. . .. ....+.|....... .+.+..+.+++++| +++...+++.+ |..+-+..-+. .....+ .. T Consensus 356 L~~--~--~~---~~~~i~f~~~~ll~-~D~~~~~~~lv~~G--~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~---~~ 422 (540) T protein:vir:41 356 IQL--K--LD---PGARFVFNEEILME-SEFVHNYALLVQCG--VLTPSEVREKLFGLDGGPDMFMVPSSIGKSA---MK 422 (540) T ss_pred hhh--c--cC---CceEEEecchhhcc-hHHHHHHHHHHhCC--CCCHHHHHHHhCcCcCCCccccccccccccc---cc Confidence 110 1 11 12455664332211 23344455666655 55555555543 54321100010 000000 00 Q ss_pred HHHhhhcccCCCcCCCC-CCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 442 DTLYSTTKAQADATPKP-TVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 442 ~~~~~~~~~~~~~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 480 (480) +.-....+.++.+.... ...++..++..+ ++.+..+++ T Consensus 423 ~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~-~~~~~~~~~ 461 (540) T protein:vir:41 423 RQKRNYEKNQINEIKRTYAKYKPRIQEIIS-SESPLEDKK 461 (540) T ss_pred ccccccCCCCccccccccchhcccccCccc-ccccccccc Confidence 00000000000000000 000111110000 000001111 No 188 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=98.00 E-value=7.6e-06 Score=48.65 Aligned_cols=427 Identities=14% Similarity=0.081 Sum_probs=175.1 Q ss_pred CCCHHHH--HHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC-------- Q lcl|NC_021324. 1 MTTYHEH--VERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE-------- 70 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~-------- 70 (480) .-++.+- ..+-++. ...-++-+ .+|.|..-+-+.-- ..+.. +.-.++++.+.+..+.-+ T Consensus 79 ~~~~~~~~~~~~~~~~---~~~~~~~l-~~~~~~~F~Gy~~l---a~laQ----~~eyr~~~~~ia~e~~R~w~~~~~~~ 147 (698) T protein:vir:10 79 NYTPRERRAASYALDF---NGTSMDAL-SFVTSSGFPGFPTL---VLLAQ----LPEYRAMHEVLADECIRTWGEAIGGT 147 (698) T ss_pred cCCccccchhhhhhcc---cccccccc-hhhhccCcchHHHH---HHHhh----ccchhhHHHHHHHHhhcccceecccc Confidence 2222111 1100000 11112222 34544322211000 00000 111222222222222111 Q ss_pred -------cccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEec-Ccccc----------CCCCc Q lcl|NC_021324. 71 -------GFRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSH-PDVES----------GDPAG 126 (480) Q Consensus 71 -------~~~~------~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~-~~~~~----------~d~~g 126 (480) |+.+ ..|.+..+.|..-+++-++...+.++.+.+-.||.+..++.. ++... ....| T Consensus 148 ~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kG 227 (698) T protein:vir:10 148 KEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 227 (698) T ss_pred chhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccccccccccccccCc Confidence 1221 122244566777777778899999999999999988655432 11000 00111 Q ss_pred ee-EEEEEcccceEEEE-eCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccccccc-- Q lcl|NC_021324. 127 IP-LIRVESPLYMYAEL-DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG-- 202 (480) Q Consensus 127 ~~-~~~~~~p~~~~~v~-d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-- 202 (480) .+ -+.+++|..+.|-. +..+.-. .+.+.+.++.+-. ..++..+- +.|. T Consensus 228 slKGL~ViDp~~vtP~~~n~~dP~s-----------pdfgkP~~y~V~G--~~IH~SRL---------------~~~vg~ 279 (698) T protein:vir:10 228 SFQGLRVVEPYWVTPNNYNSINPVA-----------DDFYKPSTWWMIG--SEVHATRL---------------HTIVSR 279 (698) T ss_pred cceeeeeecccccccchhhhccchh-----------hccCCCceEEEec--ceecceeE---------------EEecCC Confidence 21 25566666665521 1111100 0111111111110 01110000 0000 Q ss_pred ccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc-------chhhh Q lcl|NC_021324. 203 VVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN-------TTLDI 275 (480) Q Consensus 203 ~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~-------~~~~~ 275 (480) .+|-. ++ ....++|.|-. .-+..-+++++++.-.....+..+....+. ++ .......+.. ..+.. T Consensus 280 pvpd~-LK---p~y~f~G~Sv~-q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~-~d--la~aL~~g~~~~l~~R~eli~~ 351 (698) T protein:vir:10 280 PVGDM-LK---PTYSFAGISMT-QLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MD--LAQALTPGANVDLSMRAELINR 351 (698) T ss_pred Cchhh-hc---chhccCCccHH-HHHHHHHHHHHHHhhhHHHHHHHhhHHHHH-HH--HHHhcCChhhHHHHHHHHHHHH Confidence 01111 11 11224566644 335555666666655555555433332221 11 1100000000 01111 Q ss_pred --hcchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc-cc-chHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 276 --YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKG 351 (480) Q Consensus 276 --~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~~-~~Sg~Al~~~~~~l~~k~~~~~ 351 (480) ....++.+++++-+|.+++. ++..+-+.+......+|..++||.--|-+.+ .. ++||++-..-|...+. ..++ T Consensus 352 ~Rsn~G~~llDk~~Eefeq~st-~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe 428 (698) T protein:vir:10 352 YRDNRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQR 428 (698) T ss_pred hcCccceEEEecCCcceEEEec-CcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHH--HHHH Confidence 11233445544457777763 4667777788888999999999976664443 22 4688876666666655 5668 Q ss_pred HHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCHHHHHHhC------C Q lcl|NC_021324. 352 RIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPKEQARIDL------G 420 (480) Q Consensus 352 ~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~e~~~~~~------~ 420 (480) ..+...|++++.++.+-.-+..+. .+.+.|.+-...+..+.|+.-.+-+++ ..+.++....+.+| | T Consensus 429 ~~L~p~L~rl~~ii~rS~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~ 505 (698) T protein:vir:10 429 NALQQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGP 505 (698) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccCCCc Confidence 889999999999876544333332 488899999999999999864433221 12233333333332 2 Q ss_pred CCh--hHHHHHHHHHHHHHHHHHHHHhhhc-ccCCCcCCCCC-----CCCCCccCC-CCCCCCCcccCC Q lcl|NC_021324. 421 YTA--TQREQMRDWDKQETEDMIDTLYSTT-KAQADATPKPT-----VTETKTETQ-TSPSGFNRTKTR 480 (480) Q Consensus 421 ~~~--d~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-----~~d~~~~~~-~~~~~~~~~~~~ 480 (480) |.. |..++--.-.....+.....++... +++..+...|. .++|....+ ..++....+.++ T Consensus 506 Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 574 (698) T protein:vir:10 506 YAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGARAGATAPPAAANVNANANPREAGAQ 574 (698) T ss_pred cccccCCcccCCCCCCCcchHHHhhhcCCcCCCCcccccccccccCCCCCCcccccccCCCCccccCcc Confidence 210 0000000000000111111111211 11111111111 111111111 112222222222 No 189 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=97.98 E-value=8.1e-06 Score=48.49 Aligned_cols=429 Identities=14% Similarity=0.066 Sum_probs=170.5 Q ss_pred CCCHHHH--HHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC-------- Q lcl|NC_021324. 1 MTTYHEH--VERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE-------- 70 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~-------- 70 (480) ..++.+- ..+-++. ...-++-+ .+|.|..-+-+.-- ..+.. +.-.++++.+.+..+.-+ T Consensus 79 ~~~~~~~~~~~~~~~~---~~~~~~~l-~~~~~~~F~Gy~~l---a~laQ----~~eyr~~~~~ia~e~~R~w~~~~~~~ 147 (695) T protein:vir:78 79 NYTPRERRAASYALDF---NGTSMDAL-SFVTSSGFPGFPTL---VLLAQ----LPEYRAMHEVLADECIRTWGEAIGGT 147 (695) T ss_pred cCCccccchhhhhhcc---cccccccc-hhhhccCcchHHHH---HHHhh----ccchhhHHHHHHHHhhcccceecccc Confidence 2222111 1100000 11112222 34544322211000 00000 111222222222222111 Q ss_pred -------cccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEec-Cccc----------cCCCCc Q lcl|NC_021324. 71 -------GFRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSH-PDVE----------SGDPAG 126 (480) Q Consensus 71 -------~~~~------~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~-~~~~----------~~d~~g 126 (480) |+.+ ..|.+..+.|..-+++-++...+.++.+.+-.||.+.+++.. ++.. +....| T Consensus 148 ~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~kG 227 (695) T protein:vir:78 148 KEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 227 (695) T ss_pred chhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccccccccccccCc Confidence 1221 122244566777777778899999999999999998755543 2100 001122 Q ss_pred ee-EEEEEcccceEEEE-eCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccccccccc Q lcl|NC_021324. 127 IP-LIRVESPLYMYAEL-DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVV 204 (480) Q Consensus 127 ~~-~~~~~~p~~~~~v~-d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 204 (480) .+ -+.+++|..+.|-. +..+.-. .+.+.+.++.+- +..++..+- +.|... T Consensus 228 slKGl~ViDp~~vtP~~~n~~dP~s-----------pdfgkP~~y~V~--G~kIH~SRL---------------~~f~g~ 279 (695) T protein:vir:78 228 SFQGLRVVEPYWVTPNNYNSINPVA-----------DDFYKPSTWWMI--GTEVHATRL---------------HTIVSR 279 (695) T ss_pred ceeeeEeecccccccchhhhccchh-----------hccCCCceEEEe--ceEEeeeeE---------------EEecCC Confidence 22 25667777666632 1111100 111112222111 111111100 000000 Q ss_pred ceEE-eecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-----Cccccccccccchhhh--h Q lcl|NC_021324. 205 PVVP-LTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-----TTDELTNDGENTTLDI--Y 276 (480) Q Consensus 205 Pvv~-~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-----~~~~~~~~~~~~~~~~--~ 276 (480) |+-. ++ ....++|.|-. .-+.+-+++.+++.-.....+..++...+.. +. ......-......+.. . T Consensus 280 plPd~LK---p~y~~~GiSv~-q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~-dla~~L~~g~~~~l~~R~eli~~~Rs 354 (695) T protein:vir:78 280 PVGDMLK---PTYSFAGISMT-QLAMPYIDNWLRTRQSVSDIVKQFSVSGILM-DLAQALMPGANVDLSMRAELINRYRD 354 (695) T ss_pred Cchhhhh---cccccCcccHH-HHHHHHHHHHHHHHhHHHHHHHhhhhHHHHH-HHHHhhcChhHHHHHHHHHHHHHhcC Confidence 1100 11 11224566644 3355556666666555555554333322111 10 0000000000011111 1 Q ss_pred cchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc-cc-chHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 277 YGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 277 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~~-~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ...++.+++++-+|.+++. ++..+-+.+-.....+|..++||.--|-+.+ .. |+||++-..-|...+. ..++..+ T Consensus 355 n~G~~llDk~~Eefeq~st-slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~L 431 (695) T protein:vir:78 355 NRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQRNAL 431 (695) T ss_pred ccceEEEecCCcceEEEec-ccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHH--HHHHHHH Confidence 1233445544457777763 4667777788888999999999976664443 22 4688876666666665 5668899 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCHHHHHHhC------CCCh Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPKEQARIDL------GYTA 423 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~e~~~~~~------~~~~ 423 (480) ...|++++.++.+-.-+..+. .+.+.|.+-...+..+.|+.-.|-+++ ..+.++...+..+| +|.. T Consensus 432 ~p~L~rl~~ii~rS~~G~idp---di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~ 508 (695) T protein:vir:78 432 QQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAG 508 (695) T ss_pred HHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccccc Confidence 999999999876544333332 478899999999999998864433221 11122222222221 1110 Q ss_pred hHHHHHHH--HHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCC-------CCcccCC Q lcl|NC_021324. 424 TQREQMRD--WDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSG-------FNRTKTR 480 (480) Q Consensus 424 d~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-------~~~~~~~ 480 (480) . ++.-.. +..+.+.+-...+......+.+.++..++..+++.+++-.+= ..++++. T Consensus 509 ~-~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ag~~~~ 573 (695) T protein:vir:78 509 K-LDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVKPREAGAQDA 573 (695) T ss_pred c-cccccCCCcCccchhhhhHhhhcCcccccccCCCCCCCCCCCCCCceeeeeccccccccCCCCc Confidence 0 000000 000000000000000000000000000000000000000000 0011111 No 190 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=97.96 E-value=9.1e-06 Score=48.23 Aligned_cols=393 Identities=10% Similarity=0.070 Sum_probs=152.1 Q ss_pred HHHHHH---HHHHHHHHHHHH--HHHHHHhhccC-cc--h------------hc----CcccchhhhcceeccchHHHHH Q lcl|NC_021324. 5 HEHVER---LQGLLARDLPNL--LEAEAYRNGTR-RL--K------------TI----GIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 5 ~~~i~~---l~~~~~~~~~~~--~~~~~YY~G~~-~i--~------------~~----~~~~~~~~~~~~~~~n~~~~iV 60 (480) .-|..- .++-..++.+|- ..+--|...+. .+ + .. ...+.+.. -++... .-.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-al~~~~--V~~cv 77 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE-AIRHSD--IFTAV 77 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhh-hhccHH--HHHHH Confidence 111111 001001111111 10011111000 00 0 00 00000000 011110 11123 Q ss_pred HHHHhhhccCccccCCCc--hhHHHHHHHHH--hcC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEE Q lcl|NC_021324. 61 RTLSDRLDIEGFRISEDS--EGLEELWNWWQ--AND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRV 132 (480) Q Consensus 61 d~~~~~l~~~~~~~~~d~--~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~ 132 (480) +..++-+-.-++.+-.+. .....+..++. -|. -..+...+..+.+.+|.||+.+..+ ..|+| .+.. T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~ 151 (441) T protein:vir:94 78 MMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTF 151 (441) T ss_pred HHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEE Confidence 322222222234332111 11223344442 232 2356677888899999999998653 35665 5788 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecc Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~ 212 (480) ++|..+.++.|+.. .+.+.. ...+..+. .....|.++. |+||++. T Consensus 152 i~~~~v~v~~d~~g--~~~~~~---~~~~~~~~-~~~~~~~~~d-----------------------------vih~k~~ 196 (441) T protein:vir:94 152 RKTSEIELKSDARG--RLYYFH---QRIDSNGN-NIERNVKFED-----------------------------MLDIKFY 196 (441) T ss_pred EcCceeEEEECCCc--cEEEEE---EEeccCCc-eeEEEEcccc-----------------------------EEEeccC Confidence 99999888776432 221111 11111110 1111122222 3455543 Q ss_pred cccCccCCCccchHHHHHHHHHHHHHHHHH---HHHHHHhcchhhhhc--CCCccccccccccchhhh------hcchhe Q lcl|NC_021324. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNL---QSASQILGTPLRVIS--GVTTDELTNDGENTTLDI------YYGRIL 281 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~---~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~------~~~~~~ 281 (480) . ..+.+|.|.|. .+.+.+.....-. .+.+...+.|-.+|. |...++...+.-...|.. ..++++ T Consensus 197 ~-~dg~~G~spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~ 271 (441) T protein:vir:94 197 S-LDGINGLSLLD----TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVV 271 (441) T ss_pred C-CCCccccCHHH----HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcce Confidence 2 23467777553 3334443332222 222333344554443 321111100000111211 113455 Q ss_pred ecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 282 TLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 282 ~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) .+++ +.++.+++.+. -..|++..+....+|+..-++|+..+|....+. |-......+ ...|.- T Consensus 272 vl~~-G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~q~~~~~--------------~~tl~P 335 (441) T protein:vir:94 272 VLDE-SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDY--------------LSTLKP 335 (441) T ss_pred ecCC-CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHH--------------HHHHHH Confidence 5653 45777775432 224778888889999999999999998654332 211111111 111221 Q ss_pred HHHHHHHHhCCCCccc--ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HH-HHHHHHHH Q lcl|NC_021324. 361 AMRIAMQIMGREVTEE--YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQ-MRDWDKQE 436 (480) Q Consensus 361 ~~~li~~~~~~~~~~~--~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~-~~-~~~~~~~~ 436 (480) ++..+..-........ ...+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+- .. +.... T Consensus 336 ~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G--~~T~NE~R~~~gl~Pi~ggd~~~~~~~--- 410 (441) T protein:vir:94 336 YITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVD--- 410 (441) T ss_pred HHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeec--- Confidence 1111111000000011 123444444555678888899898888775 667777777777753210 00 00000 Q ss_pred HHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 437 TEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 437 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) -....... ..+.+...++ +.+....|+++.. T Consensus 411 -----~n~~~~~~-----~~~~~~~~~~-~~~~~~kgGe~~e 441 (441) T protein:vir:94 411 -----LNHVNIEL-----VDEYQMNKSR-ATDKKLKGGEENE 441 (441) T ss_pred -----cccccccc-----cccccccccc-ccccccCCCCCCC Confidence 00000000 0000000000 0011111111111 No 191 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=97.96 E-value=9.1e-06 Score=48.23 Aligned_cols=393 Identities=10% Similarity=0.070 Sum_probs=152.1 Q ss_pred HHHHHH---HHHHHHHHHHHH--HHHHHHhhccC-cc--h------------hc----CcccchhhhcceeccchHHHHH Q lcl|NC_021324. 5 HEHVER---LQGLLARDLPNL--LEAEAYRNGTR-RL--K------------TI----GIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 5 ~~~i~~---l~~~~~~~~~~~--~~~~~YY~G~~-~i--~------------~~----~~~~~~~~~~~~~~~n~~~~iV 60 (480) .-|..- .++-..++.+|- ..+--|...+. .+ + .. ...+.+.. -++... .-.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-al~~~~--V~~cv 77 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE-AIRHSD--IFTAV 77 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhh-hhccHH--HHHHH Confidence 111111 001001111111 10011111000 00 0 00 00000000 011110 11123 Q ss_pred HHHHhhhccCccccCCCc--hhHHHHHHHHH--hcC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEE Q lcl|NC_021324. 61 RTLSDRLDIEGFRISEDS--EGLEELWNWWQ--AND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRV 132 (480) Q Consensus 61 d~~~~~l~~~~~~~~~d~--~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~ 132 (480) +..++-+-.-++.+-.+. .....+..++. -|. -..+...+..+.+.+|.||+.+..+ ..|+| .+.. T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~ 151 (441) T protein:vir:79 78 MMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTF 151 (441) T ss_pred HHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEE Confidence 322222222234332111 11223344442 232 2356677888899999999998653 35665 5788 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecc Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~ 212 (480) ++|..+.++.|+.. .+.+.. ...+..+. .....|.++. |+||++. T Consensus 152 i~~~~v~v~~d~~g--~~~~~~---~~~~~~~~-~~~~~~~~~d-----------------------------vih~k~~ 196 (441) T protein:vir:79 152 RKTSEIELKSDARG--RLYYFH---QRIDSNGN-NIERNVKFED-----------------------------MLDIKFY 196 (441) T ss_pred EcCceeEEEECCCc--cEEEEE---EEeccCCc-eeEEEEcccc-----------------------------EEEeccC Confidence 99999888776432 221111 11111110 1111122222 3455543 Q ss_pred cccCccCCCccchHHHHHHHHHHHHHHHHH---HHHHHHhcchhhhhc--CCCccccccccccchhhh------hcchhe Q lcl|NC_021324. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNL---QSASQILGTPLRVIS--GVTTDELTNDGENTTLDI------YYGRIL 281 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~---~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~------~~~~~~ 281 (480) . ..+.+|.|.|. .+.+.+.....-. .+.+...+.|-.+|. |...++...+.-...|.. ..++++ T Consensus 197 ~-~dg~~G~spl~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag~~~ 271 (441) T protein:vir:79 197 S-LDGINGLSLLD----TLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVV 271 (441) T ss_pred C-CCCccccCHHH----HHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcce Confidence 2 23467777553 3334443332222 222333344554443 321111100000111211 113455 Q ss_pred ecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 282 TLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 282 ~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) .+++ +.++.+++.+. -..|++..+....+|+..-++|+..+|....+. |-......+ ...|.- T Consensus 272 vl~~-G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~q~~~~~--------------~~tl~P 335 (441) T protein:vir:79 272 VLDE-SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDY--------------LSTLKP 335 (441) T ss_pred ecCC-CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHH--------------HHHHHH Confidence 5653 45777775432 224778888889999999999999998654332 211111111 111221 Q ss_pred HHHHHHHHhCCCCccc--ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HH-HHHHHHHH Q lcl|NC_021324. 361 AMRIAMQIMGREVTEE--YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQ-MRDWDKQE 436 (480) Q Consensus 361 ~~~li~~~~~~~~~~~--~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~-~~-~~~~~~~~ 436 (480) ++..+..-........ ...+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+- .. +.... T Consensus 336 ~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G--~~T~NE~R~~~gl~Pi~ggd~~~~~~~--- 410 (441) T protein:vir:79 336 YITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVD--- 410 (441) T ss_pred HHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeec--- Confidence 1111111000000011 123444444555678888899898888775 667777777777753210 00 00000 Q ss_pred HHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 437 TEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 437 ~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) -....... ..+.+...++ +.+....|+++.. T Consensus 411 -----~n~~~~~~-----~~~~~~~~~~-~~~~~~kgGe~~e 441 (441) T protein:vir:79 411 -----LNHVNIEL-----VDEYQMNKSR-ATDKKLKGGEENE 441 (441) T ss_pred -----cccccccc-----cccccccccc-ccccccCCCCCCC Confidence 00000000 0000000000 0011111111111 No 192 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=97.96 E-value=9.3e-06 Score=48.18 Aligned_cols=331 Identities=11% Similarity=0.051 Sum_probs=141.8 Q ss_pred chhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc-CCCchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcC Q lcl|NC_021324. 35 LKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI-SEDSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFG 108 (480) Q Consensus 35 i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~-~~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G 108 (480) |-.. ++.+ ..++.....+.+++.. | .-..+...++...+.+| T Consensus 1 ia~l--------------------------------p~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~G 48 (348) T protein:vir:93 1 MASL--------------------------------PLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKG 48 (348) T ss_pred Cccc--------------------------------ceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcC Confidence 1111 1111 1111122334444431 2 23355667788889999 Q ss_pred eeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCc Q lcl|NC_021324. 109 RAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLN 187 (480) Q Consensus 109 ~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (480) .||+++..+. .|+| .+..++|..+.++.+.... .+ + |......+.. ..|.++. T Consensus 49 na~~~i~r~~------~G~~~~L~~l~~~~v~~~~~~~~~-~~-~----y~~~~~~g~~---~~~~~~e----------- 102 (348) T protein:vir:93 49 NAYVLIERDI------YHQPSKLFLLNPDVVEMLIENQSR-EL-Y----YSIHAATGNK---LIVHNMD----------- 102 (348) T ss_pred CeEEEEEECC------CCcEEEEEEEcCCceEEEEeCCCc-EE-E----EEEEcCCCeE---EEEcccc----------- Confidence 9999987533 4555 5667888877776654321 11 1 1111111110 0122222 Q ss_pred cccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhh--cCCCccccc Q lcl|NC_021324. 188 DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI--SGVTTDELT 265 (480) Q Consensus 188 ~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i--~G~~~~~~~ 265 (480) |+||++.......+|.|.+. .+.+.++...+-....+..+..+-.++ .+....++. T Consensus 103 ------------------iih~r~~~~~~~~~G~s~~~----~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~ 160 (348) T protein:vir:93 103 ------------------MLHFKHIVASNMVQGISPID----VLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVSTEK 160 (348) T ss_pred ------------------EEEecCCCCCCceeeccHHH----HHHHHHHHHHHHHHHHHHhcCCCceeEEecCCCCCHHH Confidence 35665544445667877553 333333333222222233334332222 122211111 Q ss_pred cccccchhh---hhcchheecCCCCceEEEeccccHH-HHHHHHHHHHHHHHhhcCCChHHhccccc-cchHHHHHHHHH Q lcl|NC_021324. 266 NDGENTTLD---IYYGRILTLASEAAKISEFKAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATD 340 (480) Q Consensus 266 ~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~ 340 (480) .+.-...|+ -..++++.++ ++.++.+++.+..+ .+++..+....+|+..-++|+..+|.... +.++..+ .. T Consensus 161 ~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~---~~ 236 (348) T protein:vir:93 161 RQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEE---LN 236 (348) T ss_pred HHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH---HH Confidence 110011111 1223445554 44678777643222 46777788889999999999999986432 2222222 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhC Q lcl|NC_021324. 341 SRIVKMAERKGRIFGGAWERAMRIAM-QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDL 419 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~~li~-~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~ 419 (480) ..+...| +.+-++.+-..+. .+...........+++.+......+..+.++++.+++.+| +++...+++.+ T Consensus 237 ~~~~~~~------l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G--~~T~NE~R~~~ 308 (348) T protein:vir:93 237 RFYLQHT------LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSG--YYTINDIREWE 308 (348) T ss_pred HHHHHHH------HHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHh Confidence 1111111 1111111111111 1111111111223444455555678889999999998876 66777778888 Q ss_pred CCChhHH-HHHHHHHHHHHHHHHHHHhhhcccCCCcC-CCCCCCCCCccC Q lcl|NC_021324. 420 GYTATQR-EQMRDWDKQETEDMIDTLYSTTKAQADAT-PKPTVTETKTET 467 (480) Q Consensus 420 ~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~~~~ 467 (480) |+.+-+- ++.- + ...+..... ..+.. ...+++++..++ T Consensus 309 g~~p~~ggD~~~-~--------~~n~~~~~~-~~~~~~~~~gg~~n~~~~ 348 (348) T protein:vir:93 309 DLPPVEGGDKPL-I--------SGDLYPIDT-PLELRKSLKGGDKNVNES 348 (348) T ss_pred CCCCCCCcCeEe-e--------ccccccccc-chhhcccccCCCCCcCCC Confidence 7753220 0000 0 000000000 00000 000111111111 No 193 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=97.94 E-value=1e-05 Score=47.96 Aligned_cols=429 Identities=14% Similarity=0.068 Sum_probs=171.1 Q ss_pred CCCHHHH--HHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC-------- Q lcl|NC_021324. 1 MTTYHEH--VERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE-------- 70 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~-------- 70 (480) --++.+. ..+.++. ...-++-+ .+|.|..-+-+.-- ..+.. +.-.++++.+.+..+.-+ T Consensus 78 ~~~~~~~~~~~~~~~~---~~~~~~~l-~~~~~~~F~Gy~~l---a~laQ----~~eyr~~~~~ia~e~~R~w~~~~~~~ 146 (694) T protein:vir:10 78 NYTPRERRAASYALDF---NGTSMDAL-SFVTSSGFPGFPTL---VLLAQ----LPEYRAMHEVLADECIRTWGEAIGGT 146 (694) T ss_pred CCCccccchhhhhhcc---Ccccccch-hhhhccCcchHHHH---HHHhh----ccchhhHHHHHHHHhhcccceecccc Confidence 1122111 1111110 11112222 34554322211000 00000 111222233222222111 Q ss_pred -------cccc------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEec-Cccc----------cCCCCc Q lcl|NC_021324. 71 -------GFRI------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSH-PDVE----------SGDPAG 126 (480) Q Consensus 71 -------~~~~------~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~-~~~~----------~~d~~g 126 (480) |+.+ ..+.+..+.|..-+++-++...+.++.+.+-.||.+.+++.. ++.. .....| T Consensus 147 ~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~~~I~kG 226 (694) T protein:vir:10 147 KEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVPKG 226 (694) T ss_pred chhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccccccccccccCc Confidence 1221 122244566777777778899999999999999998755542 2110 001122 Q ss_pred ee-EEEEEcccceEEEE-eCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccccccccc Q lcl|NC_021324. 127 IP-LIRVESPLYMYAEL-DPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVV 204 (480) Q Consensus 127 ~~-~~~~~~p~~~~~v~-d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~i 204 (480) .+ -+.+++|..+.|-. +..+.-. .+.+.+.++.+- +..++..+- +.|..- T Consensus 227 slKGl~ViDp~~vtP~~~n~~dP~s-----------pdfgkP~~y~V~--G~~IH~SRL---------------~~f~g~ 278 (694) T protein:vir:10 227 SFQGLRVVEPYWVTPNNYNSINPVA-----------DDFYKPSTWWMI--GTEVHATRL---------------HTIVSR 278 (694) T ss_pred ceeeeEeecccccccchhhhccchh-----------hccCCCceEEEe--ceEEeeeeE---------------EEecCC Confidence 22 25667777666632 1111100 111112221111 111111100 000000 Q ss_pred ceEE-eecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-----Cccccccccccchhhh--h Q lcl|NC_021324. 205 PVVP-LTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-----TTDELTNDGENTTLDI--Y 276 (480) Q Consensus 205 Pvv~-~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-----~~~~~~~~~~~~~~~~--~ 276 (480) |+-. ++ ....++|.|-. .-+.+-+++.+++.-.....+..++...+.. +. ......-......+.. . T Consensus 279 plPd~LK---p~y~~~G~Sv~-q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~-dla~~L~~g~~~~l~~R~eli~~~Rs 353 (694) T protein:vir:10 279 PVGDMLK---PTYSFAGISMT-QLAMPYIDNWLRTRQSVSDIVKQFSVSGILM-DLAQALMPGANVDLSMRAELINRYRD 353 (694) T ss_pred Cchhhhh---cccccCcccHH-HHHHHHHHHHHHHHhHHHHHHHhhhhHHHHH-HHHHhhcChhHHHHHHHHHHHHHhcC Confidence 1100 11 11224566644 3355556666666555555554333322111 10 0000000000011111 1 Q ss_pred cchheecCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc-cc-chHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 277 YGRILTLASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 277 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~~-~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) ...++.+++++-+|.+++. ++..+-+.+-.....+|..++||.--|-+.+ .. |+||++-..-|...+. ..++..+ T Consensus 354 n~G~~llDk~~Eefeq~st-slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~--s~Qe~~L 430 (694) T protein:vir:10 354 NRNILFLDKATEEFFQFNT-PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVR--AYQRNAL 430 (694) T ss_pred ccceEEEecCCcceEEEec-ccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHH--HHHHHHH Confidence 1233445544457777763 4667777788888999999999976664443 22 4688876666666665 5668899 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhc-----cccCCCHHHHHHhC------CCCh Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYAN-----GQGPIPKEQARIDL------GYTA 423 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~-----~~~~~s~e~~~~~~------~~~~ 423 (480) ...|++++.++.+-.-+..+. .+.+.|.+-...+..+.|+.-.|-+++ ..+.++......+| +|.. T Consensus 431 ~p~L~rl~~ii~rS~~G~idp---~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~ 507 (694) T protein:vir:10 431 QQLMNDVIVMIQLSLFGAVDP---SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAG 507 (694) T ss_pred HHHHHHHHHHHHHHhcCCCCC---cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCccccc Confidence 999999999876544333332 478899999999999998864433221 11122222222221 1110 Q ss_pred hHHHHHHH--HHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCC---CCC----CCcccCC Q lcl|NC_021324. 424 TQREQMRD--WDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTS---PSG----FNRTKTR 480 (480) Q Consensus 424 d~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~---~~~----~~~~~~~ 480 (480) . ++.-.. ...+.+.+-...+......+.+.++..++....+.+++- +.. ..++++. T Consensus 508 ~-~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~~~~ag~~~~ 572 (694) T protein:vir:10 508 K-LDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGGARAGATAPPTVANVNANVNPREAGAQDA 572 (694) T ss_pred c-cccccCCCcCccchhhhhHhhhcCcccccccCCCCcccccccCCCcccccccccCccccCCCCc Confidence 0 000000 000000000000000000000000000000000000000 000 0111111 No 194 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=97.93 E-value=1.1e-05 Score=47.87 Aligned_cols=416 Identities=11% Similarity=0.073 Sum_probs=195.0 Q ss_pred CCCHHH--------HHH--HHHHHHHHHHHHHHHHHHHhhccCcchhcCccc--chhhhcceeccchHHHHHHHHHhhhc Q lcl|NC_021324. 1 MTTYHE--------HVE--RLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA--PPELAYLDVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~~--------~i~--~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~--~~~~~~~~~~~n~~~~iVd~~~~~l~ 68 (480) +..+.. .+. .-++.-..-..+|+.+..+++-+..+...-..+ .+... .=+.+++- T Consensus 46 ~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~-~pV~l~L~------------ 112 (521) T protein:vir:65 46 DNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVENAVQNIVNDAIVFEEGH-EVVSLNLE------------ 112 (521) T ss_pred CCccccccccceeeeccccchhhhHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCC-ceEEEEec------------ Confidence 111000 000 000001112233444444443333222110000 00000 00000100 Q ss_pred cCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCC Q lcl|NC_021324. 69 IEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTR 148 (480) Q Consensus 69 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~ 148 (480) -..++-.-.++..+.+..|++--+|+....+..+...+.|+.|++.-.+. ...+|-..++.++|..+..+.--.... T Consensus 113 ~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~---~pk~GI~ELr~lDPr~i~~vr~i~k~~ 189 (521) T protein:vir:65 113 ATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGK---NPKDGIVELRQLDPRNLEYVREIITED 189 (521) T ss_pred ccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcC---CccccceeeeeeCCcceeeeeeecccc Confidence 00111111223445666677666899999999999999999998865431 346788899999999887765221111 Q ss_pred ceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeeccccc--CccCCCccc Q lcl|NC_021324. 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRL--GNRYGRSEI 224 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~--~~~~G~s~i 224 (480) .....+ .+....+-+|.++... |...+... . |+.-=+||- |.|.+.-.. ....-.|-| T Consensus 190 ~~~~~v--------~~~~~e~f~Y~~~~~~-~~~~g~~~----~-----~~~~vkI~~dAI~y~hSGl~d~~~~~i~syL 251 (521) T protein:vir:65 190 TPEGKI--------YKATKEYFIYTVGNSS-YCAGGQVF----S-----PNSRVKIPRSAITYAHSGLMDCDDKYIIGYL 251 (521) T ss_pred cCCcce--------ecceeeeeeeecCCcc-eeccceee----c-----CCcceeechhheeeeeccceeCCCCeeeecc Confidence 000000 0111222334332221 11111100 0 000011221 233321111 111112334 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc------------- Q lcl|NC_021324. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG------------- 278 (480) Q Consensus 225 ~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~------------- 278 (480) .+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| T Consensus 252 hkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlE 329 (521) T protein:vir:65 252 HRAVKPANQ--LKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTE 329 (521) T ss_pred hhhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhh Confidence 455555432 2667777777777778877776555555543322110 011111 Q ss_pred hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccccc---chHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 279 RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSEN---PASAEAIIATDSRIVKMAERKGR 352 (480) Q Consensus 279 ~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~---~~Sg~Al~~~~~~l~~k~~~~~~ 352 (480) -.|. .+|.+.++.++++.+.=.-++-++-....++..-++|.+-++...++ ..-|..|--.+.....-+.+.+. T Consensus 330 DyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~ 409 (521) T protein:vir:65 330 DYWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQS 409 (521) T ss_pred hhcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHH Confidence 1111 23445678888875444456666777778888889998876433221 12234555566677777888888 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhccccCCCHHHHHHh-CC Q lcl|NC_021324. 353 IFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQGPIPKEQARID-LG 420 (480) Q Consensus 353 ~~~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~~~s~e~~~~~-~~ 420 (480) .|..-+.++++.=+-+.|.--..++ ..+.+.|..-....+...++. +.++..-..-.+|.++++.. |. T Consensus 410 rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr 489 (521) T protein:vir:65 410 QFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILK 489 (521) T ss_pred HHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhc Confidence 8888888888754434443333333 346677754433333333332 22332211224589988755 78 Q ss_pred CChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 421 YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 421 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) .++++++++.+.++++..+. .-.++++..+.- T Consensus 490 ~tDeei~~~~k~I~~E~~~~-------~~~~p~~~~~~f 521 (521) T protein:vir:65 490 YTDDQMDTEKKQIEEEANDP-------RFKQTPDEIEDF 521 (521) T ss_pred cCHHHHHHHHHHHHHhhhCC-------CCCCCcccccCC Confidence 99999887776666554211 111111111111 No 195 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=97.92 E-value=1.1e-05 Score=47.80 Aligned_cols=390 Identities=10% Similarity=0.002 Sum_probs=172.0 Q ss_pred HHH--HHHHHHHHHHHHHHHHHHhhccC----cchh-cCcccchhhhcceeccchHHHHHHHHHhhhccCcccc--CCC- Q lcl|NC_021324. 8 VER--LQGLLARDLPNLLEAEAYRNGTR----RLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--SED- 77 (480) Q Consensus 8 i~~--l~~~~~~~~~~~~~~~~YY~G~~----~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~--~~d- 77 (480) |++ |..+......-.+....|..|-. +|.. ........+..+. ......-++++....+....|.+ +++ T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~-~D~~i~s~l~~rk~av~~~~w~i~p~~~~ 79 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEIL-SDAQVKTVWGQRQLAVVSREWKVEAGGDR 79 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHh-hChHHHHHHHHHHHHHhcCCceEEcCCCC Confidence 333 22222221111112222332211 1100 0000001111111 13344555555555555556654 222 Q ss_pred ---chhHHHHHHHHHhcCHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEEEEE---cccceEEEEeCCCCCce Q lcl|NC_021324. 78 ---SEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRVE---SPLYMYAELDPRNTRRV 150 (480) Q Consensus 78 ---~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~~~~---~p~~~~~v~d~~~~~~~ 150 (480) .+..+.+.+.++.-+|+..+..+. ++..+|.+ ++++|.. .+|...+..+ +|+.. .||+... T Consensus 80 ~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~------~~g~~~~~~l~~r~~~~f--~~d~~~~--- 147 (488) T protein:vir:99 80 PIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGR------DDRYITLEAIKVRNRRRF--RYDQDGG--- 147 (488) T ss_pred hHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEee------cCCeeeEeeeeeecccce--eecCCCc--- Confidence 223456777777767888888765 68889987 4556632 1344333322 22211 1111110 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHH Q lcl|NC_021324. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) ..+...... ......|.+++.+ ++ .+....+.++|.|-+.. +-. T Consensus 148 ---------------------------l~~~~~~~~-----~~g~~lp~~~~~i--~~-~~~~~~g~p~g~gLl~~-~~w 191 (488) T protein:vir:99 148 ---------------------------LRLLTPNNM-----FEGEPCPAPYFWH--FS-TGADNDDEPYGLGLAHW-LYW 191 (488) T ss_pred ---------------------------eEEeccCCC-----CCccccccCceEE--EE-eecCCCCCcccchHHHH-HHH Confidence 011111100 0111222223322 22 34456678888887654 222 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhcchheec-CCCCceEEEeccccHHHHHHHH Q lcl|NC_021324. 231 VTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTL-ASEAAKISEFKAAELRNFAEEM 305 (480) Q Consensus 231 l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~----~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~~l 305 (480) ..=-=+..+.+++.-++.|+.|+++.+- +.....++.... +..+..+....+ .|...+|.+....+...|...+ T Consensus 192 ~~~fK~~~~~~w~~f~E~yG~P~~igky-~~~~a~~~ek~~l~~av~~~~~~~~~viP~~~~ie~~ea~~~~~~~~~~li 270 (488) T protein:vir:99 192 PVFFKRNGIKFWLIFLDKFGMPTAVGRY-DDKTATPEDKAKLLAALHAIQTDSAIIMPAGMQAELLEAGRSGTADYKTLH 270 (488) T ss_pred HHHHHHhhHHHHHHHHHHcCCceeeeec-CCCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEeecCCCChHHHHHHH Confidence 2122256677888888999999876652 110111111111 222233333333 3445566665555555666677 Q ss_pred HHHHHHHHhhcCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021324. 306 EVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWE-RAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~v~ 383 (480) +..-.+|+...- -+.+.+...+ .| -|+. ....-....+..-.+.+...+. +++..++.+..... ....+. T Consensus 271 ~~~d~~Isk~iL--Gqtlts~~~~-Gs-~a~~~vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~~----~~p~~~ 342 (488) T protein:vir:99 271 DTMDATIAKVGL--GQVASTQGTP-GR-LGNDDLQADVRLDLVKADADLICESFNLGPARWLTEWNFPGA----QPPRVY 342 (488) T ss_pred HHHHHHHHHHHh--hhhhcccccc-cc-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCCc----CCceeE Confidence 666666665420 0112111110 11 1221 1222333444445566666664 46666666653221 224567 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCC Q lcl|NC_021324. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTET 463 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) |....+.++.+.++.+.+++..+...++.+.+.+.+|+......+ + . .. +.+...+ .+. T Consensus 343 ~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~-------~---~----~~-----~~~~~~~--~~~ 401 (488) T protein:vir:99 343 RVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQA-------E---A----TA-----PTPSTEF--AEG 401 (488) T ss_pred ecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCccccc-------c---c----cc-----CCCcccC--CCC Confidence 777778899999999999988633567999999999986432110 0 0 00 0000000 000 Q ss_pred CccCCCCCCCCCcccCC Q lcl|NC_021324. 464 KTETQTSPSGFNRTKTR 480 (480) Q Consensus 464 ~~~~~~~~~~~~~~~~~ 480 (480) ...+......+.+ T Consensus 402 ----~~~~~~~~~~~~~ 414 (488) T protein:vir:99 402 ----DQPSDPAAAMAPQ 414 (488) T ss_pred ----CCCCCchHHHHHH Confidence 0000000001111 No 196 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=97.90 E-value=1.2e-05 Score=47.55 Aligned_cols=407 Identities=11% Similarity=0.106 Sum_probs=194.8 Q ss_pred CCCHHH---HHHHHHHH------HHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh-ccC Q lcl|NC_021324. 1 MTTYHE---HVERLQGL------LARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-DIE 70 (480) Q Consensus 1 ~~~~~~---~i~~l~~~------~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l-~~~ 70 (480) |.+.-. +...+.+. -..-..+|+.+..+++-+..+ ..||+..+-+= ..+ T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~pEvd~Av---------------------~eIvneaiv~d~~~~ 105 (521) T protein:vir:10 47 IDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYHEVDNAI---------------------DEIINDAIVQEDNRD 105 (521) T ss_pred CCccccccchhhhhhccccccchHHHHHHHHHHHhhccchhhHH---------------------HhhhcceEEecCCCc Confidence 111100 00000000 000112233333333222221 22222111000 011 Q ss_pred cccc---------CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEE Q lcl|NC_021324. 71 GFRI---------SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAE 141 (480) Q Consensus 71 ~~~~---------~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v 141 (480) ++.+ .-.++..+.+..|++--+|+....+..+...+.|+.|.+.-.+. ....+|-..++.++|..+-.+ T Consensus 106 pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~--~~pk~GI~Elr~lDPr~i~~v 183 (521) T protein:vir:10 106 TVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDP--ARPKDGIKELRLLDPRNVEYY 183 (521) T ss_pred eEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeC--CCccccceeeeeeCCcceeee Confidence 1111 11122345566666666889999999999999999988854321 123467788889999877554 Q ss_pred EeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeeccc--ccCc Q lcl|NC_021324. 142 LDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDP--RLGN 217 (480) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~--~~~~ 217 (480) .--.... ...+..+ +....+-+|.+....+|...++. ...=+||. |.|.+.- +... T Consensus 184 r~i~k~~--~~~~~v~------~~~~e~f~Y~~~~~~~~~~~g~~------------~~~vkI~~daI~y~hSGL~d~~~ 243 (521) T protein:vir:10 184 RVNLKSN--ENGNDVY------KGVKEFFTYGATEDNRYNISGNS------------NNLVQIPIDAIVYSHSGKVDIDG 243 (521) T ss_pred eeecCCC--CCcchhh------ccceeeeeeccCCCceecCCCCC------------CcceeechhheeeecccceeCCC Confidence 3211110 0000000 01122234443332233222211 11112332 3444421 1223 Q ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc------ Q lcl|NC_021324. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG------ 278 (480) Q Consensus 218 ~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~------ 278 (480) ....|-|.+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| T Consensus 244 ~~i~syLhkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddr 321 (521) T protein:vir:10 244 KTIVGYLHNVIKPANQ--LKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSS 321 (521) T ss_pred CceeccchhhhHhHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccch Confidence 3445556666666533 2667777777777778877776555555443322110 011111 Q ss_pred -------hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc--cchHHHHHHHHHHHHHHH Q lcl|NC_021324. 279 -------RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE--NPASAEAIIATDSRIVKM 346 (480) Q Consensus 279 -------~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~--~~~Sg~Al~~~~~~l~~k 346 (480) -+|. .+|.+.++.++++.+.-.-++-++-....++..-++|.+-+..... +-.-|..|--.......- T Consensus 322 k~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KF 401 (521) T protein:vir:10 322 NNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKY 401 (521) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHH Confidence 1111 2344567888887653345666677777888888999887765421 211122355566667777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHH---HHHhcccc------CCCHH Q lcl|NC_021324. 347 AERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVS---KLYANGQG------PIPKE 413 (480) Q Consensus 347 ~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~~~---kl~~~~~~------~~s~e 413 (480) +.+.+..|..-+.++++.=+-+.|---..++ ..+.+.|..-....+...++.+. .+.+...+ .+|.+ T Consensus 402 I~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~d 481 (521) T protein:vir:10 402 IRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHE 481 (521) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchH Confidence 8888888888888888754444443333333 34667775443333333333221 12222223 67889 Q ss_pred HHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 414 QARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 414 ~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +++.. |..++++++++.+.++++..+. .-.++++..+.- T Consensus 482 yi~k~ILr~tDeeik~~~k~I~~E~~~~-------~~~~p~~e~~df 521 (521) T protein:vir:10 482 YVMKNILRMSDEDIKTEREKIDGELKDS-------VYKNPEDPMEEF 521 (521) T ss_pred HHHHHHhcCCHhHHHHHHHHHHHhhhCC-------CCCCCcchhhcC Confidence 88755 7899998887766666554211 111111111111 No 197 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.88 E-value=1.3e-05 Score=47.40 Aligned_cols=393 Identities=11% Similarity=0.066 Sum_probs=172.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhh-ccC-----cch-hcCcccchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRN-GTR-----RLK-TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~-G~~-----~i~-~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) ++...+-+...+.. +. +.. ++-. |-. .|. ..+.. ..-+..+. ......-.+++....+.+..|. T Consensus 15 ~~~~~~~~~~~ia~---~~-~~~---~~~~~~~~~~~~~~iLr~~~~~-~~~y~~m~-~D~~i~s~l~~Rk~av~~~~w~ 85 (491) T protein:vir:10 15 FGEPDKSLSSQIAT---RA-RSI---DFFALGMYLPNPDPVLKALGKD-IRVYRELR-ADAHVGGCVRRRKAAVKALEWG 85 (491) T ss_pred cccCChHHHHHHHh---hh-ccc---ccccccCCccchHHHHHhcCCC-HHHHHHHh-hChHHHHHHHHHHHHHhCCCcE Confidence 44433322222221 10 000 1100 100 010 00000 01111222 2445555555555555566676 Q ss_pred c---CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEE---EEEcccceEEEEeCCC Q lcl|NC_021324. 74 I---SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLI---RVESPLYMYAELDPRN 146 (480) Q Consensus 74 ~---~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~---~~~~p~~~~~v~d~~~ 146 (480) + .++++..+.+.++++.-.|+..+..+. ++..+|.+ ++++|.. .+|...+ ..++|+.. .||+. T Consensus 86 i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~------~~g~~~~~~l~~r~~~~f--~~d~~- 155 (491) T protein:vir:10 86 LDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGK------VGNYIVPIDVVGKPADWF--VYDPE- 155 (491) T ss_pred EecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEee------cCCeeEEEEeeeecccce--eeccC- Confidence 5 234456678888888778888888875 78889987 5556642 1333332 22233211 12211 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) ....|...++.. ..... ..++ ++.+.+....+.++|.+-+.. T Consensus 156 -----------------------------~~l~~~~~~~~~-----~g~~l-~~~k---~i~~~~~~~~~~p~g~gLl~~ 197 (491) T protein:vir:10 156 -----------------------------NQLRFRSKDHWM-----QGEEL-PARK---FLVPRQEATYLNPYGFPDLSM 197 (491) T ss_pred -----------------------------CceEEecCCCCC-----Cccee-cCCC---EEEEEecCCCCCcccchhHHH Confidence 111121111100 00000 1122 344556667778888887644 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhcchheec-CCCCceEEEecc--ccHH Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTL-ASEAAKISEFKA--AELR 299 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~----~~~~~~~~~~~~-~g~~~~~~~~~~--~~~~ 299 (480) +-...=--+..+.+++.-++.|+.|+++.+- +....++.... +..+..+....+ .|...+|.+... .+.. T Consensus 198 -~~w~~~fK~~~~~~w~~f~E~yG~P~~igky--~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~ 274 (491) T protein:vir:10 198 -CFWPTTFKKGGLKFWVQFTEKYGSPMLVGKH--PRSASDGEKNLLLDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSAD 274 (491) T ss_pred -HHHHHHHHHHHHHHHHHHHHHcCCCeEEEec--CCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeEEEecCCCCCChh Confidence 3332333467778888889999999877762 11111111111 122333333333 344566766542 3344 Q ss_pred HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccce Q lcl|NC_021324. 300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 378 (480) .|...++..-.+|+...- -+.+... + .++.|+-- ...-....++.-.+.+...+.++++-++.++..... T Consensus 275 ~y~~li~~~d~~Isk~iL--GqtlTt~--~-~gs~a~~~vh~~v~~di~~~D~~~i~~tln~li~~l~~~N~~~~~---- 345 (491) T protein:vir:10 275 VYERLLHFCRGEVSIALL--GQNQTTE--A-TSTRASAQAGLEVTDDIRDGDKAVVSEAMNMLIRWICDLNFDGAD---- 345 (491) T ss_pred HHHHHHHHHHHHHHHHHh--hhhcccC--c-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC---- Confidence 566666555555555320 0111111 1 11112211 112222333334455666677777766666654322 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) ...+.|..+. ......++.+.++++.|. .++.+.+.+.+|+...+..+. .. ............ T Consensus 346 ~p~f~~~~~~-e~~~~~a~~~~~L~~~G~-~i~~~~i~e~~Gip~~~~~~~----------~~-----~~~~~~~~~~~~ 408 (491) T protein:vir:10 346 RPVFDMWEQE-QVDEIQAGRDQKLTQAGA-RFTPAYFKRAYNLQDGDLDER----------PL-----PVSAVDTVGAAS 408 (491) T ss_pred cceEEecCcC-chhHHHHHHHHHHHhCCC-cCCHHHHHHHhCCCCCCcCcc----------cc-----ccCCCCCccccc Confidence 2456665543 333667899999998874 578999999999764432110 00 000000000000 Q ss_pred CCCCCCccCCCCCC-CCCcccCC Q lcl|NC_021324. 459 TVTETKTETQTSPS-GFNRTKTR 480 (480) Q Consensus 459 ~~~d~~~~~~~~~~-~~~~~~~~ 480 (480) . ....+..+..+. -.+....| T Consensus 409 ~-~~~~~~~~~~~d~~~~~~~~~ 430 (491) T protein:vir:10 409 F-AEFEAPDQDALDAALNTLSAR 430 (491) T ss_pred c-cccCCCCCCchHHHHHHHHHH Confidence 0 000000000000 00000011 No 198 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=97.85 E-value=1.5e-05 Score=47.02 Aligned_cols=366 Identities=13% Similarity=0.085 Sum_probs=135.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHH-H---HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPN-L---LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~-~---~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~ 76 (480) |- +..+........... + .....+.-|.-.. ..+.+. .-+...-...+|+..++-+-.-+|.+.. T Consensus 1 Mg----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~v~~~---~al~~~~v~~~i~~ia~~ia~~p~~v~~ 69 (385) T protein:vir:10 1 MG----LLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQL----SYVSAL---SALQNTNVYSVINRIASDVASAHFKTEN 69 (385) T ss_pred Cc----cccchhcccccccccccccchhhhhhhccccCc----cccCHH---HhhccHHHHHHHHHHHHHHhhCceeeec Confidence 11 111100000000000 0 0000111110000 001110 0011112233444444433344565533 Q ss_pred CchhHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 77 DSEGLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 77 d~~~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .. +..++.+ | ....+...+..+.+.+|.||+.+..+. ..+..+++..+.+..+.. . T Consensus 70 ~~-----~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---------~~~~p~~~~~v~~~~~~~--~---- 129 (385) T protein:vir:10 70 TA-----TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---------LEHIPNSDVQINYLPGNM--G---- 129 (385) T ss_pred cc-----hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---------eeEeecCCceEEEEEcCC--c---- Confidence 21 1222222 2 234566677888889999999875321 112222332222222111 0 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecc--cccCccCCCccchHHHHH Q lcl|NC_021324. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND--PRLGNRYGRSEISPELRK 230 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~--~~~~~~~G~s~i~~~v~~ 230 (480) +.++......+. ...+.++. |+||++. ......+|.|.+.. +.. T Consensus 130 -~~~~~~~~~~~~---~~~~~~~e-----------------------------iihik~~~~~~~~~~~G~s~i~~-~~~ 175 (385) T protein:vir:10 130 -IVYTVLESNDRP---QMVLRQDQ-----------------------------MLHFRLMPDPQYRYLIGRSPLES-LQN 175 (385) T ss_pred -eEEEEEEcCCce---EEEEcccc-----------------------------EEEeccCCCCcccccccccHHHH-HHH Confidence 111111111110 01112222 3555532 12234578776532 333 Q ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh-----hcchheecCCCCceEEEeccc--cHHHH Q lcl|NC_021324. 231 VTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAA--ELRNF 301 (480) Q Consensus 231 l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~--~~~~~ 301 (480) .++....+.....+.+...+.|..++. |...++...+.-...++. ..++++.++ ++.++..++.. ..+.+ T Consensus 176 ~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~~~l 254 (385) T protein:vir:10 176 ALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVLP-DGFDYTQLEMKTDVFKAL 254 (385) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCccccC-CCceEEecCCChhHHHHH Confidence 333332332223333333334544443 211111110100111111 123455554 34677766543 34434 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) .+..+....+|+..-++|+..+|......+++..+............ -+-..+.+.+. ..+.+ ..++ T Consensus 255 ~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~~l~----P~~~~ie~~l~--~~l~~-------~~~~ 321 (385) T protein:vir:10 255 ADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLN----SYVNPIVDELR--LKMNA-------PDLE 321 (385) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHHHHH----HHHHHHHHHHH--HhhCC-------ceEE Confidence 57778888999999999999998643221222122211111111111 11111111111 11111 1255 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......+..++++++.++++.| +++...+++.+|+.+=+ .++.+... .... T Consensus 322 f~~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~p-----------------------~~~~~~~~-~~~~ 375 (385) T protein:vir:10 322 LDIKDMLDVDDSALINQVSNLAKSG--VLGAEQAQFILTRSGFL-----------------------PDNLPEFK-PLTT 375 (385) T ss_pred eechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCccC-----------------------CCCCcccc-Cccc Confidence 5556666788899999999888876 45554455544432100 00000000 0000 Q ss_pred CCCccCCCCCCCCC Q lcl|NC_021324. 462 ETKTETQTSPSGFN 475 (480) Q Consensus 462 d~~~~~~~~~~~~~ 475 (480) ...+..+++| T Consensus 376 ----~~~~g~~~dn 385 (385) T protein:vir:10 376 ----QVKGGDEGDN 385 (385) T ss_pred ----ccCCCCCCCC Confidence 0011111111 No 199 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=97.84 E-value=1.5e-05 Score=46.98 Aligned_cols=382 Identities=13% Similarity=0.034 Sum_probs=156.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcc-hhcC--------------cccchhhhcceeccchHHHHHHHHHh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRL-KTIG--------------IGAPPELAYLDVQPGWVATYLRTLSD 65 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i-~~~~--------------~~~~~~~~~~~~~~n~~~~iVd~~~~ 65 (480) |..++-.+ .+..+..-+..+.....|.... +..+ ..+.+.. -+.+.-.-.+|+..++ T Consensus 1 ~~~~~~~~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~---al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYTI-----DLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDER---ILQISTVWRCVSLIST 72 (424) T ss_pred CCCCcceE-----eecCCCchHHHHHhhhcccccccccccccccccccccccccccccHHH---hhccHHHHHHHHHHHH Confidence 33221110 1122223333444444332211 1000 0011100 0011112223333333 Q ss_pred hhccCcccc---CCCc---h--hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEE Q lcl|NC_021324. 66 RLDIEGFRI---SEDS---E--GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIR 131 (480) Q Consensus 66 ~l~~~~~~~---~~d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~ 131 (480) -+-.-++.+ ..+. . ....+..++.. | ....+...+..+.+.+|.||+++-.+ ..|++ .+. T Consensus 73 ~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~ 146 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLL 146 (424) T ss_pred hhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEE Confidence 332223322 1111 1 12335555532 3 23456677888999999999988643 34554 566 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeec Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 211 (480) .++|..+.+..+.. .+ .|.+. .++. ...|.++. |+||++ T Consensus 147 pl~~~~V~v~~~~~---~~----~y~~~-~~g~----~~~~~~~e-----------------------------Iih~r~ 185 (424) T protein:vir:18 147 PLQSANMDVKLVGK---KV----VYRYQ-RDSE----YADFSQKE-----------------------------IFHLKG 185 (424) T ss_pred EecCcceEEEEcCC---eE----EEEEE-eCCe----EEEecccc-----------------------------EEEecC Confidence 77887776544321 11 11111 1110 01122222 345543 Q ss_pred ccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC--ccccccccccchhhh-----hcchheecC Q lcl|NC_021324. 212 DPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVT--TDELTNDGENTTLDI-----YYGRILTLA 284 (480) Q Consensus 212 ~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~--~~~~~~~~~~~~~~~-----~~~~~~~~~ 284 (480) .. ..+.+|.|.|.- +...++....+.....+.+...+.|-.++.-.. ..+...+.-...|+. ..++++.++ T Consensus 186 ~~-~dg~~G~spi~~-~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~g~nag~~~vl~ 263 (424) T protein:vir:18 186 FG-FTGLVGLSPIAF-ACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) T ss_pred cC-CCCcccccHHHH-HHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCceecc Confidence 22 344667775532 322233222222222222333344554553211 111111100111211 123455555 Q ss_pred CCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 285 SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 285 g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) ++.++.+++.+. -..+++..+....+|+..-++|+..+|....+...+..++.....+.. ..|.-+++ T Consensus 264 -~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~----------~tl~P~~~ 332 (424) T protein:vir:18 264 -AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQ----------YTLQPYIS 332 (424) T ss_pred -CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHH----------HHHHHHHH Confidence 346777776432 224677778888999999999999998643332222233322222221 11222222 Q ss_pred HHH-----HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHH Q lcl|NC_021324. 364 IAM-----QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETE 438 (480) Q Consensus 364 li~-----~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~ 438 (480) .+. .+...... ....+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+ -- T Consensus 333 ~ie~~l~~~L~~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~--gG--------- 398 (424) T protein:vir:18 333 RWENSIQRWLIPAKDV-GRIHAEHNLDGLLRGDSASRAAFMKAMGEAG--LRTINEMRRTDNLPPLP--GG--------- 398 (424) T ss_pred HHHHHHHhhcCCcccc-CCeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--Cc--------- Confidence 111 11111111 1123445555566778889999999988775 55666666666654321 00 Q ss_pred HHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 439 DMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 439 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 477 (480) +.+..... .. |. ++...+.++ ..|++ T Consensus 399 ---D~~~~~~n--~~----~l-~~~~~~~~p---~~~ga 424 (424) T protein:vir:18 399 ---DVAMRQSQ--YV----PI-TDLGTNKEP---RNNGA 424 (424) T ss_pred ---CeeeeccC--cc----ch-HhhhccCCC---ccCCC Confidence 00000000 00 00 000001111 11221 No 200 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.84 E-value=1.6e-05 Score=46.94 Aligned_cols=367 Identities=10% Similarity=0.011 Sum_probs=140.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhc-cCc-ch--hcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNG-TRR-LK--TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G-~~~-i~--~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) .-|.+++...- ..+... ....... ... +. ..+..+.... -....=...+|+..+.-+-.-+|.+.... . T Consensus 1 Mglf~~~~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~~~~---al~~~~V~~~i~~Ia~~ia~l~~~~~~~~-~ 73 (384) T protein:vir:49 1 MPIFNITNLAT--ESPPSN-QDSFFDITDPEFLDALNGSEWVSAET---ALKNSDLFSIISQLSNDLATAKITTSRKQ-L 73 (384) T ss_pred CccccccccCc--cccccc-chhhccccchhhcccccCCceechhh---hhccHHHHHHHHHHHHHHhhCceeeecch-h Confidence 11111000000 000000 0000000 000 00 0000010000 00011123344444443334455543221 1 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ...+.+-........+...+..+.+.+|.||+.+-.+ ..|++ .+..++|..+.++.++... . +.|... T Consensus 74 ~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~v~~~~~~~-~----~~y~~~ 142 (384) T protein:vir:49 74 QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------ENGRDMKWEYLRPSQVSFNRLDNQN-G----LYYNIT 142 (384) T ss_pred hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEEC------CCCcEEEEEEEcCceeEEEEcCCCc-e----EEEEEE Confidence 1111111111234567778889999999999988753 34554 6777888888776644321 1 111111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) ..+... .....+.++. |+||++....+..+|.|.|. .+.+.++... T Consensus 143 ~~~~~~-~~~~~~~~~e-----------------------------Vih~~~~~~~~~~~G~s~i~----~~~~~i~~~~ 188 (384) T protein:vir:49 143 FDDPRI-PPKQHVPQGD-----------------------------ILHFRLLSVDGGLTSVSPLM----ALGRELNIQK 188 (384) T ss_pred ecCccc-cceeEecCcc-----------------------------EEEecCCCCCCceeeccHHH----HHHHHHHHHH Confidence 111100 0001112222 45555443334467877553 3444444333 Q ss_pred HH---HHHHHHHhcchhhhhc--CCCccccccccccc--hhhhhcchheecCCCCceEEEeccc-cHHHHHHHHHHHHHH Q lcl|NC_021324. 240 MN---LQSASQILGTPLRVIS--GVTTDELTNDGENT--TLDIYYGRILTLASEAAKISEFKAA-ELRNFAEEMEVFRKE 311 (480) Q Consensus 240 s~---~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~--~~~~~~~~~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~ 311 (480) .- ..+.+...+.|-.+++ |...++........ ......++++.++ ++.++.+++.. .-..+++..+....+ T Consensus 189 ~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~ 267 (384) T protein:vir:49 189 ASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLD-DLEDFTPLEIKSNVAQLLSQADWTTGQ 267 (384) T ss_pred HHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecC-CCceEEEccCChhhHHHHHHHHHHHHH Confidence 33 3333333344544443 22111110000000 0111234455554 44688777643 223467888889999 Q ss_pred HHhhcCCChHHhccccccchHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCC Q lcl|NC_021324. 312 AASITGLPPQYLSSSSENPASAEAIIATDSRIVKM-AERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTP 390 (480) Q Consensus 312 i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k-~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~ 390 (480) |+..-++|+..+|....+.+++.+++..+...+.. +.-....+...|.+- ...+ +.... ++... T Consensus 268 Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~-----------l~~~---~~~~~-~~~~~ 332 (384) T protein:vir:49 268 FAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCE-----------VDAD---ILPAV-DPTGS 332 (384) T ss_pred HHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchh-----------hhhh---hhhhh-hccch Confidence 99999999999987654444454444433332221 111112221111110 0000 00000 01111 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHh---CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 391 TVAAKADAVSKLYANGQGPIPKEQARID---LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~e~~~~~---~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) ... ..+..+..++ +.++..+++. .|+.++++.+++. +... .++++.+.- T Consensus 333 ~~~---~~~~~l~~~~--~~t~~e~~~~l~~~g~~~ne~r~~~~------------~~p~--~gGd~~~~~ 384 (384) T protein:vir:49 333 NYI---GLINSMVKTG--TLAQNQGLYVLQQAEILPKDLPEGET------------DSTL--KGGETNEQY 384 (384) T ss_pred HHH---HHHHHHhhcC--cccHHHHHHHHhhCCCCChhHHHHcC------------CCCC--CCCCCCCCC Confidence 111 1222333433 3444444443 3666554333211 0111 112222222 No 201 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=97.81 E-value=1.7e-05 Score=46.69 Aligned_cols=425 Identities=11% Similarity=-0.001 Sum_probs=165.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccc--hhhhcceeccchHHHHHHHHHhhhc----c--Cccc-c Q lcl|NC_021324. 4 YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAP--PELAYLDVQPGWVATYLRTLSDRLD----I--EGFR-I 74 (480) Q Consensus 4 ~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~--~~~~~~~~~~n~~~~iVd~~~~~l~----~--~~~~-~ 74 (480) .+..++++.++++ +.+.....++|++ +.+|.+..... ......++..+-+...++++++.|. + .+|+ . T Consensus 1 mk~~~~~~~~~lk-R~~~e~~w~e~a~--~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:63 1 MKTTAAMLWEKLR-DGSVEQRAIEFAK--TTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHh-ccchHHHHHHHHH--hhccccCCCCCCccccccCCCccchHHHHHHHHHHHHHhhhcCCCCccccc Confidence 4444555555543 2222223333442 22222211100 1111123445566777777777663 2 1232 1 Q ss_pred --CC--------Cch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEE Q lcl|NC_021324. 75 --SE--------DSE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) Q Consensus 75 --~~--------d~~----~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~ 133 (480) .+ +.. .. ..+...+..++|.....++.++...+|.+-+++.. ++. +++.+ T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~~~~--------~~~-~~~~~ 148 (510) T protein:vir:63 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRDS--------DAA-TVVAW 148 (510) T ss_pred CCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEcC--------CCc-EEEEE Confidence 11 111 11 13555667789999999999999999998666532 232 45666 Q ss_pred cccceEEEEeCCCCCceEEEEEEEEEe-----c-----------CCceEEEEEEEeCCcEEEEEecCCCcc--ccccc-- Q lcl|NC_021324. 134 SPLYMYAELDPRNTRRVTRAVRLYTTR-----D-----------DVAVPDRATLYLPDETVPLRRNGGLND--QWVVD-- 193 (480) Q Consensus 134 ~p~~~~~v~d~~~~~~~~~~~~~~~~~-----~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~-- 193 (480) +-.+.++.-|. .+ .+...++.+... . +......+++|+. .++.++.... .+..+ T Consensus 149 pl~~y~v~~d~-~G-~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~----V~~~~~~~~~~~sv~~e~d 222 (510) T protein:vir:63 149 SLRSYAVRRDA-TG-RWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTH----VQRKKGTAMEYAELYHEID 222 (510) T ss_pred EcceeEEeeCC-Cc-CeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEE----EEeecCCCceEEEEEEEec Confidence 55554333333 22 333333332211 0 0011112233321 0111111000 00000 Q ss_pred ----cccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc Q lcl|NC_021324. 194 ----GDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE 269 (480) Q Consensus 194 ----~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~ 269 (480) ......+|..+|++.++-+...++.||+|-.. +..+=+..+|.+.-...........|.+.+. ++. -... T Consensus 223 g~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~---p~g--~~~~ 296 (510) T protein:vir:63 223 GVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVE-DYIGDFAKLSLLSEKLGLYELESLEVLNLVD---EAK--GAVV 296 (510) T ss_pred CceeccccccccccCceeeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhccCCcccC---ccc--ccch Confidence 11112346778999888888889999999653 3455566667666666666665566553332 000 0000 Q ss_pred cchhhhhcchheecCCCCceEEEecc-ccHH---HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHH Q lcl|NC_021324. 270 NTTLDIYYGRILTLASEAAKISEFKA-AELR---NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVK 345 (480) Q Consensus 270 ~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~---~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~ 345 (480) ........+.+..-..++....++.. ++++ ..++.++.-|...+... + ...++.. .+++.++..-..+.. T Consensus 297 ~~~~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~-l----~~~~~~r-vTAtEV~~r~~E~~~ 370 (510) T protein:vir:63 297 DDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-A----NQRDAER-VTAEEVRITAEEAEN 370 (510) T ss_pred hhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHhh-c----ccCCCCC-cCHHHHHHHHHHHHH Confidence 11112222333221112233334332 3344 34444444444443321 1 0111111 234444433222221 Q ss_pred HH----HH-HHHHHHHHHHHHHHHHHHHhCC-CCccc-ceeeeEEecCCCCCCHHHHHHHHH---HHHhccc------cC Q lcl|NC_021324. 346 MA----ER-KGRIFGGAWERAMRIAMQIMGR-EVTEE-YTRLETVWRDPSTPTVAAKADAVS---KLYANGQ------GP 409 (480) Q Consensus 346 k~----~~-~~~~~~~~l~~~~~li~~~~~~-~~~~~-~~~~~v~f~~~~~~~~~~~ad~~~---kl~~~~~------~~ 409 (480) .. .+ ....+.+-+.+++.++..- +. ....+ .....|++-.++.+. ..++.+. +.++... .. T Consensus 371 ~LGpv~~rl~~E~l~Pli~r~~~il~r~-gl~p~p~~~~~~~~v~~is~Lara--q~~~~l~~~~q~l~~~~~~aq~~~~ 447 (510) T protein:vir:63 371 TLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSRS--AAVQSMLNASQVIAGLAPIAQLDPR 447 (510) T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHhc-cCCCCCchhcccceecchhHHHHH--HHHHHHHHHHHHHHHhcCchhhhcc Confidence 11 11 1122222333333333221 10 11111 122223443333222 2222222 2222111 11 Q ss_pred CCH----HHHHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 410 IPK----EQARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 410 ~s~----e~~~~~~~~~-------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.. +.+...+|.+ +++++.+.+.+++++.....+.....++.+..+..+.+- T Consensus 448 id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 448 ISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred CCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 222 2333445652 344444433322222222222222222222233333332 No 202 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=97.79 E-value=1.9e-05 Score=46.45 Aligned_cols=379 Identities=13% Similarity=0.036 Sum_probs=156.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcch-hcC--------------cccchhhhcceeccchHHHHHHHHHh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLK-TIG--------------IGAPPELAYLDVQPGWVATYLRTLSD 65 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~-~~~--------------~~~~~~~~~~~~~~n~~~~iVd~~~~ 65 (480) |..++= ..++..+..-+..+.....|..... ..+ ..+.++. -.+ +.=...+|+..++ T Consensus 1 ~~~~~~-----~~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~-al~--~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKY-----TIDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDER-ILQ--ISTVWRCVSLIST 72 (424) T ss_pred CCCCcc-----ccccCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHH-hhc--cHHHHHHHHHHHH Confidence 333211 1112222233333444444432110 000 0011100 001 1112223333333 Q ss_pred hhccCcccc---CCCc---h--hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEE Q lcl|NC_021324. 66 RLDIEGFRI---SEDS---E--GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIR 131 (480) Q Consensus 66 ~l~~~~~~~---~~d~---~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~ 131 (480) -+-.-++.+ ..+. + ....+..++.. | .-..+...+..+.+.+|.||+++-.+ ..|++ .+. T Consensus 73 ~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~ 146 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRN------SAGDVISLL 146 (424) T ss_pred hhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEE Confidence 332223322 1111 1 12335555532 3 23456777888999999999988653 34554 566 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeec Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTN 211 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 211 (480) .++|..+.+..+.. .+ .|.+. .+ +. ...|.++. |+||++ T Consensus 147 ~l~~~~v~v~~~~~---~~----~y~~~-~~-g~---~~~~~~~e-----------------------------Vihir~ 185 (424) T protein:vir:18 147 PLQSANMDVKLVGK---KV----VYRYQ-RD-SE---YADFSQKE-----------------------------IFHLKG 185 (424) T ss_pred EecCcceEEEEcCC---eE----EEEEE-eC-Ce---EEEecccc-----------------------------EEEecC Confidence 77787776544321 11 11111 11 10 01122222 345544 Q ss_pred ccccCccCCCccchHHHHHHHHHHHHHHHHHH---HHHHHhcchhhhhcCCC--ccccccccccchhhh-----hcchhe Q lcl|NC_021324. 212 DPRLGNRYGRSEISPELRKVTDAASRTLMNLQ---SASQILGTPLRVISGVT--TDELTNDGENTTLDI-----YYGRIL 281 (480) Q Consensus 212 ~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~---~~~~~~~~p~l~i~G~~--~~~~~~~~~~~~~~~-----~~~~~~ 281 (480) .. ..+.+|.|.|. .+.+.+....+-.. +.+...+.|-.++.-.. ..+...+.-...|+. ..++++ T Consensus 186 ~~-~dg~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~nag~~~ 260 (424) T protein:vir:18 186 FG-FTGLVGLSPIA----FACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKKRLW 260 (424) T ss_pred cC-CCCcccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccCCce Confidence 32 34466777553 23344433322222 22333334544443111 111110000111211 113455 Q ss_pred ecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 282 TLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 282 ~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) .++ ++.++.+++.+. -..+++..+....+|+..-++|+..+|....+...|.+++.....+... .|.- T Consensus 261 vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~----------tl~P 329 (424) T protein:vir:18 261 ILE-AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQY----------TLQP 329 (424) T ss_pred ecc-CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHH----------HHHH Confidence 565 346787775432 2246777788889999999999999986543333233333333222221 1222 Q ss_pred HHHHHH-----HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHH Q lcl|NC_021324. 361 AMRIAM-----QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQ 435 (480) Q Consensus 361 ~~~li~-----~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~ 435 (480) +++.+. .+....... ...+++.+......|..+.++.+.++...| +++...+++.+|+.+-+- -.+. T Consensus 330 ~~~~ie~~ln~~L~~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~pi~g--gD~~--- 401 (424) T protein:vir:18 330 YISRWENSIQRWLIPSKDVG-RLHAEHNLDGLLRGDSASRAAFMKAMGESG--LRTINEMRRTDNMPPLPG--GDVA--- 401 (424) T ss_pred HHHHHHHHHHhhcCCccccC-CeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cCee--- Confidence 222111 111111111 223555556666778899999999988765 566666666766543210 0000 Q ss_pred HHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 436 ETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 477 (480) .- +.+. .|. .+.....++. .|++ T Consensus 402 ---------~~--~~n~----~~l-~~~~~~~~~~---~n~a 424 (424) T protein:vir:18 402 ---------MR--QAQY----VPI-TDLGTNKEPR---NNGA 424 (424) T ss_pred ---------ee--ccCc----cch-hhhhccCCcc---ccCC Confidence 00 0000 000 0000001111 1111 No 203 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=97.75 E-value=2.2e-05 Score=46.10 Aligned_cols=400 Identities=14% Similarity=0.086 Sum_probs=153.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhc-CcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTI-GIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~-~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~ 76 (480) ||+ .-.|....... +......-.............|+..++-+-.-+|.+ .+ T Consensus 1 ~~~------------------------~~~~~g~~~~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~ 56 (723) T protein:vir:94 1 MTT------------------------FPSGAGGWNAWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDG 56 (723) T ss_pred Ccc------------------------cccCCCccccccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCC Confidence 111 00000000000 000000000000111222333443333332223332 11 Q ss_pred CchhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 77 DSEGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 77 d~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~ 150 (480) .......+..++.. |. ...+...+..+.+.+|.+|+++..+.. +..|.| .+..++|..+.++......... T Consensus 57 ~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r---~~~g~p~~l~~l~~~~~~v~~~~~~~~~~ 133 (723) T protein:vir:94 57 ELDELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGR---TPAGVPDEIWYVYDRVTTIVATRAADAVP 133 (723) T ss_pred ccchhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCC---ccccceeEEEEecCcceEEeecCCCccce Confidence 12222345555542 32 345666677788999999999875421 223444 3445555544443322211100 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHH Q lcl|NC_021324. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRK 230 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~ 230 (480) ......|...... +..+.|. -. =|+||+......+.+|.|.|. . T Consensus 134 ~~~~~~y~~~~~~-----------G~~~~~~-------------------~~--dIiHir~~~~~dg~~G~Spi~----~ 177 (723) T protein:vir:94 134 QAQIIGYVIERTD-----------GVRVPVL-------------------AD--EMLWLRFSDPYDPLAVMAPWK----A 177 (723) T ss_pred eeeeeEEEEEecC-----------ceeEEec-------------------cc--ceEEecCCCCCCCcccccHHH----H Confidence 0000001000000 0001100 00 146666443445678888653 3 Q ss_pred HHHHHHHHHHHHHHHHHHhcc---hhhhhcCCCccccccccccchhhh------hcchheecCC---------CCceEEE Q lcl|NC_021324. 231 VTDAASRTLMNLQSASQILGT---PLRVISGVTTDELTNDGENTTLDI------YYGRILTLAS---------EAAKISE 292 (480) Q Consensus 231 l~D~~~~~~s~~~~~~~~~~~---p~l~i~G~~~~~~~~~~~~~~~~~------~~~~~~~~~g---------~~~~~~~ 292 (480) +.+.+...++-......+|.+ |-.+|.--...+...+.-...|+. ..++.+.++| ++.++.. T Consensus 178 a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~ 257 (723) T protein:vir:94 178 ARAAVDADFYAATWQRQSFKNGARPGGVVNLGDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTS 257 (723) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEE Confidence 445554444433333444443 444443111111000000111211 1234444443 3457766 Q ss_pred ecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021324. 293 FKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) Q Consensus 293 ~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 370 (480) ++.+. -..|++..+..+.+|+..-++|+..+++.+..+.+..+.+ +....|.-.+ ..|++.+.. .+.. T Consensus 258 l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~~~f~~~tL~P~~--------~~ie~~ln~--~Ll~ 327 (723) T protein:vir:94 258 LSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAKAAVWTETLIPQM--------EVMASITDL--QLLP 327 (723) T ss_pred ccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHHHHHHHHHHHHHH--------HHHHHHHhH--hhcc Confidence 65432 2247788888899999999999998876543211222211 1111111111 112211111 1111 Q ss_pred CCCcccceeeeEEecC--CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH--HHH--HH----HH------HH Q lcl|NC_021324. 371 REVTEEYTRLETVWRD--PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ--REQ--MR----DW------DK 434 (480) Q Consensus 371 ~~~~~~~~~~~v~f~~--~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~--~~~--~~----~~------~~ 434 (480) .....+++.|.. .+..|..+.++.+.+++.+| +++...+++.+|+.+-+ ... +. .. .. T Consensus 328 ----~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G--~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p 401 (723) T protein:vir:94 328 ----DIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVND--VLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAP 401 (723) T ss_pred ----cccCceEEeecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCc Confidence 111245667753 34568888899998888876 66766677777764211 000 00 00 00 Q ss_pred HHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 435 QETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..++. ..++.+...-.....+.++.....+.....+.|.+...|+ T Consensus 402 ~~~e~-~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 402 AVEEG-AARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTL 446 (723) T ss_pred cchhh-hHhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhH Confidence 00000 0011110000000011111111111111222233333444 No 204 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=97.72 E-value=2.5e-05 Score=45.81 Aligned_cols=369 Identities=15% Similarity=0.154 Sum_probs=141.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC----- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS----- 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~----- 75 (480) |- ..+|+..-++ |-...+.+ + .++.......+.........+.-....|+..++-+..-++.+. T Consensus 1 mg-~~~~~~~~~~------~~~~~~~~-~---~~~~~~~~~~~~~t~~~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~ 69 (403) T protein:vir:10 1 MG-FKSWITEKLN------PGQRIIRD-M---EPVSHRTNRKPFTTGQAYSKIEILNRTANMVIDSAAECSYTVGDKYNI 69 (403) T ss_pred Cc-chhhhhhccc------hhhhhhhc-c---cccccccCCcccccHHHHHHHHHHHHHHHHHHHHHhhCceeEeecccc Confidence 44 3333321111 00000000 1 1111110000000000000011111222222222212223221 Q ss_pred ---CCchhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCC Q lcl|NC_021324. 76 ---EDSEGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNT 147 (480) Q Consensus 76 ---~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~ 147 (480) .+......+..++.. |. ...+...+..+.+.+|.||+++.. . .+..++|..+.+..+.. T Consensus 70 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~~----------~-~l~~l~~~~~~v~~~~~-- 136 (403) T protein:vir:10 70 VTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWDG----------T-SLYHVPAALMQVEADAN-- 136 (403) T ss_pred cccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEeC----------c-eeEeecCcceEEEEcCC-- Confidence 111112334555543 32 346667778889999999976531 1 13345555444332211 Q ss_pred CceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeeccc----ccCccCCCcc Q lcl|NC_021324. 148 RRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP----RLGNRYGRSE 223 (480) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~----~~~~~~G~s~ 223 (480) .. +..|.. ..+ ..|..++ |+||++.. ...+++|.|. T Consensus 137 ~~----~~~~~~--~~~-----~~~~~~e-----------------------------iih~~~~~~~~~~~~~~~G~s~ 176 (403) T protein:vir:10 137 KF----IKKFIF--NNQ-----INYRVDE-----------------------------IIFIKDNSYVCGTNSQISGQSR 176 (403) T ss_pred ce----EEEEEe--cCc-----eeecccc-----------------------------eEEecccccccCCCCCcccccH Confidence 10 101100 000 0011111 23333211 1244677776 Q ss_pred chHHHHHHHHHHHHHHHHH---HHHHHHhcchhhhhcCCC-ccccccccccchhhh------hcchheecCCCCceEEEe Q lcl|NC_021324. 224 ISPELRKVTDAASRTLMNL---QSASQILGTPLRVISGVT-TDELTNDGENTTLDI------YYGRILTLASEAAKISEF 293 (480) Q Consensus 224 i~~~v~~l~D~~~~~~s~~---~~~~~~~~~p~l~i~G~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~ 293 (480) |. .+.+.++....-. .+.+...+.|-.++.... +.+...+.-...|+. ..++++.+++ +.++..+ T Consensus 177 i~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~~ 251 (403) T protein:vir:10 177 VA----TVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEILNKKLRERKQEELQLDYNPSTGQSSVLILDG-GMKAKPY 251 (403) T ss_pred HH----HHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhCCcccCcceeecCC-CceeEEe Confidence 53 3344444333332 333333333544444211 111100000111211 1234555654 4566665 Q ss_pred cc-ccH--HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021324. 294 KA-AEL--RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) Q Consensus 294 ~~-~~~--~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 370 (480) +. .+. ..+++..+....+|+..-++|+..+|....++.+...+.+....|.-.+...+..+... ++ T Consensus 252 ~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~-----------L~ 320 (403) T protein:vir:10 252 SQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRPNIELFYYMTIIPMLNKLTSSLTFF-----------FG 320 (403) T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHh-----------cC Confidence 42 111 13677778889999999999999997543222222223333323322222222222211 11 Q ss_pred CCCcccceeeeEEecC--CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021324. 371 REVTEEYTRLETVWRD--PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 371 ~~~~~~~~~~~v~f~~--~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) . .+.+.+.. ....+..+.++++.+++..| +++...+++.+|+.+-+-+.. +.+.... T Consensus 321 ~-------~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G--~lT~NE~R~~~gl~pi~~~~~------------d~~~~p~ 379 (403) T protein:vir:10 321 Y-------KITPNTKEVAALTPDKEAEAKHLTSLVNNG--IITGNEARSELNLEPLDDEQM------------NKIRIPA 379 (403) T ss_pred c-------eeeeccchhhhcccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCcccc------------ccccccc Confidence 1 12233322 23456778888888888765 667777888888754321111 1111111 Q ss_pred c--cCCCcCCCCCCCCCCccCCCC Q lcl|NC_021324. 449 K--AQADATPKPTVTETKTETQTS 470 (480) Q Consensus 449 ~--~~~~~~~~~~~~d~~~~~~~~ 470 (480) . +...+....++++++++++++ T Consensus 380 n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 380 NVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred ccccccccCCCCcCCCCCCCcCCC Confidence 1 111111222223333333333 No 205 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=97.68 E-value=3e-05 Score=45.40 Aligned_cols=373 Identities=9% Similarity=-0.047 Sum_probs=142.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccC-cchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchhHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTR-RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~-~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~ 83 (480) .-+..++..+-.....+.........+.. .....+..+... .-..+.-...+|+..++-+-.-++.+-.. ..... T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~---~~~~~~~v~~~i~~ia~~ia~~p~~~~~~-~~~~l 76 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAE---SALRNSDLFSIINQLSNDLATVKLTASRK-QLQGI 76 (386) T ss_pred CcccccccccccccccccccccccccchhcccccCCceechh---hhhcchHHHHHHHHHHHhhccCceeeccc-hhHHH Confidence 00000000000000000000000000000 000000000000 00001111223333333222334443221 11111 Q ss_pred HHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecC Q lcl|NC_021324. 84 LWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDD 162 (480) Q Consensus 84 l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~ 162 (480) +.+-........+...+..+.+.+|.||+.+-.+ ..|.+ .+..++|..+.+..+.... . +.|....++ T Consensus 77 ~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~g~~~~L~~l~~~~v~v~~~~~~~-~----~~y~~~~~~ 145 (386) T protein:vir:48 77 IDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRN------ENGRDMKWEYLRPSQVSFNRLDNKD-G----IYYNITFDD 145 (386) T ss_pred hhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEEC------CCCcEEEEEEecCceeEEEEcCCCc-e----EEEEEEecC Confidence 1111111234466777888999999999987653 34554 5667888887776543321 1 111111111 Q ss_pred CceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 163 VAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNL 242 (480) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~ 242 (480) .. ......+.++. |+||++....+..+|.|.+.. +...+.....+.... T Consensus 146 ~~-~~~~~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~i~~-~~~~i~~~~~~~~~~ 194 (386) T protein:vir:48 146 PR-IPPKQHVPQGD-----------------------------VLHFKLLSVDGGLTSVSPLMA-LSRELNIQKASDKLT 194 (386) T ss_pred cc-ccceeEecCcc-----------------------------EEEecCCCCCCceeeccHHHH-HHHHHHHHHHHHHHH Confidence 10 00001111111 466665544455778886643 333333322222223 Q ss_pred HHHHHHhcchhhhhcCCCccccccccccc---hh---hhhcchheecCCCCceEEEeccccH-HHHHHHHHHHHHHHHhh Q lcl|NC_021324. 243 QSASQILGTPLRVISGVTTDELTNDGENT---TL---DIYYGRILTLASEAAKISEFKAAEL-RNFAEEMEVFRKEAASI 315 (480) Q Consensus 243 ~~~~~~~~~p~l~i~G~~~~~~~~~~~~~---~~---~~~~~~~~~~~g~~~~~~~~~~~~~-~~~~~~l~~~~~~i~~~ 315 (480) ...+...+.|-.++.--.. ..++.... .+ ....++++.++ ++.++.++..+.. ..+++..+....+|+.. T Consensus 195 ~~~~~ng~~~~~ii~~~~~--~~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 271 (386) T protein:vir:48 195 LNSLKNALNANGILKIKGG--GLLDFKTKLSRSRQAMKQMQGGPLVLD-DLEEFTPLEIKSNVSQLLKQADWTTGQFAKV 271 (386) T ss_pred HHHHhccCCcceEEEeCCC--CCHHHHHHHHHHHHHhhcCCCCceecC-CCceEEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 3333333445555542111 11111000 01 11223455554 4467877764322 24778888889999999 Q ss_pred cCCChHHhccccccch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021324. 316 TGLPPQYLSSSSENPA-SAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 316 ~~~p~~~~g~~~~~~~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~ 394 (480) -++|+..+|..+.++. ....+.+....|.-.+...+..+...| ... +++.+......+... T Consensus 272 fgVPp~~lg~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l-----------~~~-------~~~~~~~~~~~d~~~ 333 (386) T protein:vir:48 272 YGIPENVVGGQGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQKL-----------SCD-------VDADILPAVDPTGSN 333 (386) T ss_pred hCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----------cch-------hhcchhhhhccChHH Confidence 9999999986543322 222222222222222222222221111 111 111111122234455 Q ss_pred HHHHHHHHHhccccCCCHHHHHHhCC---CChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 395 KADAVSKLYANGQGPIPKEQARIDLG---YTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 395 ~ad~~~kl~~~~~~~~s~e~~~~~~~---~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) .+..+.+++.+| +++...+++.+| +.+.++.+.. . .. .+...++|++++. T Consensus 334 ~~~~~~~l~~~g--~~t~nE~r~~lg~~~~~~~~~~~~~---------------~-~~-----~~~~~gGd~~~~~ 386 (386) T protein:vir:48 334 SVSRINSMVKSG--TLAQNQGLYILQQAEILPKELPEGE---------------N-PN-----KTTLKGGEINGED 386 (386) T ss_pred HHHHHHHHHhCC--CcCHHHHHHHhhcCCCCCccchhhc---------------C-CC-----CCccCCCCCCCCC Confidence 666777777765 556655666554 3333322110 0 00 0011112221111 No 206 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=97.67 E-value=3.1e-05 Score=45.31 Aligned_cols=420 Identities=13% Similarity=0.105 Sum_probs=165.6 Q ss_pred CCC-------HHHHHHHHHHHHHHHH----HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc- Q lcl|NC_021324. 1 MTT-------YHEHVERLQGLLARDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD- 68 (480) Q Consensus 1 ~~~-------~~~~i~~l~~~~~~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~- 68 (480) |-| .++.+++..+.+..++ .+.+.+.+|.. |++-...+...+.-++..+-+...++++++.|. T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tl-----P~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~ 75 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTL-----PYLMNNKGDNETSQNGWQGVGAQATNHLANKLAQ 75 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhc-----ccccCCCCCcccccccccchHHHHHHHHHHHHHH Confidence 433 1233444555544333 33444444442 222111111111113445556677777776663 Q ss_pred ---c--Cccc-cC-CCc---------hhH-----------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc Q lcl|NC_021324. 69 ---I--EGFR-IS-EDS---------EGL-----------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVES 121 (480) Q Consensus 69 ---~--~~~~-~~-~d~---------~~~-----------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~ 121 (480) + .+|+ .. .|+ ... ..+...+..++|.....++.++...+|.+.+++.. T Consensus 76 ~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d~----- 150 (515) T protein:vir:70 76 VLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYKPS----- 150 (515) T ss_pred hhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEEeC----- Confidence 2 2232 11 110 011 12444466789999999999999999998766522 Q ss_pred CCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEec------------------CCceEEEEEEEe-----CCcEE Q lcl|NC_021324. 122 GDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD------------------DVAVPDRATLYL-----PDETV 178 (480) Q Consensus 122 ~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~-----~~~~~ 178 (480) ++. +++++-.+.++--|. .+ .+...++.+...- ..+....+++|+ ++..+ T Consensus 151 ---~~~--~~~~pl~~y~v~~d~-~G-~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~~~~~ 223 (515) T protein:vir:70 151 ---KGA--MSAVPMHHYVVNRDT-NG-DLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAGEGFW 223 (515) T ss_pred ---CCC--eEEEEcCeEEEeeCC-Cc-CeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecCCCce Confidence 222 455555554333333 22 3333333321100 000001222332 22212 Q ss_pred EEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc- Q lcl|NC_021324. 179 PLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS- 257 (480) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~- 257 (480) .+....... ........+|..+|++.++-+...++.||+|-.. +..+=+..+|.+.-...........|.+.+. T Consensus 224 ~~~~e~d~~----~~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~~ 298 (515) T protein:vir:70 224 KINQSADDI----PVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAE-DYSGDLFVIQFLSEAMARGAALMADIKYLIRP 298 (515) T ss_pred EEEEecCce----eeccccccccccCCceeeeeeecCCCCcccchHH-HhhHHHHHHHHHHHHHHHHHHHhcCCCeeeCc Confidence 111111110 0111112236778998888888889999999654 3555567777777777777777777765553 Q ss_pred -CCCccccccccccchhhhhcchheecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhcc-ccccchHHH Q lcl|NC_021324. 258 -GVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAE 334 (480) Q Consensus 258 -G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~~~~Sg~ 334 (480) |.... ........+.+..-..++....++.. .+++.....++.+...|....-+. .+.. ++.. -+++ T Consensus 299 ~g~~~~-------~~l~~~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~rd~~r-vTAt 368 (515) T protein:vir:70 299 GSQTDV-------DHFVNSGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMME--TMTRRDAER-VTAV 368 (515) T ss_pred ccccch-------hhccccCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh--hhhccCCcc-ccHH Confidence 11100 00111122232221122233334332 344444444444433332221111 0111 1111 1333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHH---hCCCCcccceeeeEEecCCCC-CCHHHHHHHHHH---H Q lcl|NC_021324. 335 AIIATDSRIVKMAERKGRIFGGAWERAMR-----IAMQI---MGREVTEEYTRLETVWRDPST-PTVAAKADAVSK---L 402 (480) Q Consensus 335 Al~~~~~~l~~k~~~~~~~~~~~l~~~~~-----li~~~---~~~~~~~~~~~~~v~f~~~~~-~~~~~~ad~~~k---l 402 (480) .++ ..+..++..++..+.++-. ++... ..-..+.+. +++.+-.++. -.....++.+.. . T Consensus 369 EV~-------~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~--v~~~~vs~l~~L~r~q~~~~i~~~~q~ 439 (515) T protein:vir:70 369 EIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSEL--VDPVIVTGIEALGRMAELDKLANFAQY 439 (515) T ss_pred HHH-------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhh--cccceehhHHHHHHHHHHHHHHHHHHH Confidence 333 4455666667766665422 11111 111222222 2332211111 111112222222 2 Q ss_pred Hh---cccc----CCCH----HHHHHhCCC------ChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCc Q lcl|NC_021324. 403 YA---NGQG----PIPK----EQARIDLGY------TATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKT 465 (480) Q Consensus 403 ~~---~~~~----~~s~----e~~~~~~~~------~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 465 (480) ++ .+.. .+.. +.+...+|. ++++++++. +++++..+.+.... +...+.....++..- T Consensus 440 i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r---~q~~~~~~~~~~~~---~~~~a~~~~~~~~~~ 513 (515) T protein:vir:70 440 MSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEM---AQQAQAQQEAMLNE---GVAKAVPGVIQQEMK 513 (515) T ss_pred HHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHH---HHHHHHHHHHHHHH---hhhhhcccchhhhhc Confidence 11 1100 1111 222333332 234444332 22222211211111 111122222222222 Q ss_pred cC Q lcl|NC_021324. 466 ET 467 (480) Q Consensus 466 ~~ 467 (480) +. T Consensus 514 ~~ 515 (515) T protein:vir:70 514 EG 515 (515) T ss_pred cC Confidence 11 No 207 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.66 E-value=3.1e-05 Score=45.27 Aligned_cols=388 Identities=10% Similarity=0.018 Sum_probs=153.6 Q ss_pred CCCHHHHHHHHHHHHHHHHH--HHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLP--NLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-ED 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~--~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d 77 (480) |..+. ++.+.-..+..... .-..+..+ .+..+.....-..+.-..+.-...+|+..++-+-.-++.+- .. T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~v~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~~~~~ 73 (409) T protein:vir:93 1 MAKEN-IVTRIKKKLIDNWIDQSTSKLYDF------SPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDY 73 (409) T ss_pred CCccc-hhhhhhhhhhhhhhcccccccccc------ccccCccccccchhhhhccHHHHHHHHHHHHhhhhCceeEeecc Confidence 54421 22221111111000 00000000 00000000000000011112223333333332222233321 11 Q ss_pred chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceE Q lcl|NC_021324. 78 SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 78 ~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 151 (480) +.....+..++.. | .-..+...+..+.+.+|.||+++..+ ..|.+ .+..++|..+.++.+.... .+ T Consensus 74 ~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~~~~~~~~~-~~- 145 (409) T protein:vir:93 74 KVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQSR-EL- 145 (409) T ss_pred ccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCCc-EE- Confidence 1222334444432 3 23456677888999999999988653 34554 5677888888777654321 11 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHH Q lcl|NC_021324. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) + |......+.. ..+.++. |+||++.....+.+|.|.|. .+ T Consensus 146 ---~-y~~~~~~g~~---~~~~~~e-----------------------------Vih~r~~~~~~~~~G~s~i~----~~ 185 (409) T protein:vir:93 146 ---Y-YSIHAATGNK---LIVHNMD-----------------------------MLHFKHIVASNMVQGISPID----VL 185 (409) T ss_pred ---E-EEEEcCCceE---EEEcccc-----------------------------EEEeCCCCCCCccccccHHH----HH Confidence 1 1111111110 0122222 35555443445567887653 23 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhh--cCCCccccccccccchhh---hhcchheecCCCCceEEEecccc-HHHHHHHH Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILGTPLRVI--SGVTTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAE-LRNFAEEM 305 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~~p~l~i--~G~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l 305 (480) .+.++...+-....+..+..+-..+ .+....+...+.-...|+ -..+.++.++ ++.++.+++... -..+++.. T Consensus 186 ~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q~~e~r 264 (409) T protein:vir:93 186 KNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASE 264 (409) T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHH Confidence 3333332221111233333332222 222221111111111121 1223455554 456787776432 22467777 Q ss_pred HHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCcccceee Q lcl|NC_021324. 306 EVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEEYTRL 380 (480) Q Consensus 306 ~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~~~~~~~ 380 (480) +....+|+..-++|+.-+|....++-|. +......+.. ..|.-++..+. .+...........+ T Consensus 265 ~~~~~~Ia~~fgVPp~~lg~~~~~~~sn--~e~~~~~f~~----------~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~ 332 (409) T protein:vir:93 265 NLTRERVANVFQLPSVFLNARSNTNFAK--NEELNRFYLQ----------HTLLPIVKQYEEEFNRKLLTKTDREKNRYF 332 (409) T ss_pred HHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHHHHHHH----------HHHHHHHHHHHHHHHhhcCCcccccCcceE Confidence 7888999999999999998643321111 1111111111 11222222111 11111111111224 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCC Q lcl|NC_021324. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++.+......+..+.++++.+++.+| +++...+++.+|+.+-+- . +.+.-..+ ..+-..+.. T Consensus 333 ~fd~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--g------------D~~~~~~n--~~~~~~~~~ 394 (409) T protein:vir:93 333 KFNVKSYLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVEG--G------------DKPLISGD--LYPIDTPLE 394 (409) T ss_pred EeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------Ceeeeccc--ccccccchh Confidence 44344455568889999999998876 667777777877754320 0 00000000 000000000 Q ss_pred C-CCCccCCCCCCCCCcc Q lcl|NC_021324. 461 T-ETKTETQTSPSGFNRT 477 (480) Q Consensus 461 ~-d~~~~~~~~~~~~~~~ 477 (480) . ....+++ ++.+++ T Consensus 395 ~~~~~~gG~---~n~~e~ 409 (409) T protein:vir:93 395 LRKSLKGGD---KNVNES 409 (409) T ss_pred hcccccCCC---CCcCCC Confidence 0 0000000 011111 No 208 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=97.66 E-value=3.2e-05 Score=45.26 Aligned_cols=372 Identities=12% Similarity=0.090 Sum_probs=131.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCchhHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~ 83 (480) .-+..+|.+.. ......+.+ ..+..+.. ...........+|+..++-+-.-++.+- .++..... T Consensus 1 Mg~f~~lf~~~-------~~~~~~~~~-----~~~~~v~~---~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~ 65 (395) T protein:vir:95 1 MSILEKIFKTR-------KDITYMLDL-----DMIEDLSQ---QAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKND 65 (395) T ss_pred CchhhhhhccC-------ccccccccc-----hhccccch---hhhhhhHHHHHHHHHHHHhhccceeEeccCCccccch Confidence 11111111110 000000000 00011100 1111122233334433333333333221 12222233 Q ss_pred HHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEE--EEeCCCCCceEEEEEE Q lcl|NC_021324. 84 LWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA--ELDPRNTRRVTRAVRL 156 (480) Q Consensus 84 l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~--v~d~~~~~~~~~~~~~ 156 (480) +..++.. | ....+...+..+.+..|.+|+++..+ +.+. ..++....+ +++.. . ..+ T Consensus 66 ~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--------~~~~--~~~~~~~~~~~~~~~~----~-~~~-- 128 (395) T protein:vir:95 66 VYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--------KELL--IADSFYREEYALYDDI----F-KDV-- 128 (395) T ss_pred HHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC--------CCeE--ecCCccceeEeecCcc----e-eEE-- Confidence 4444432 3 23455666777788888887765421 1111 111111111 11100 0 000 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHH Q lcl|NC_021324. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~ 236 (480) .....+ +...+.++ -|+||+.+......+|.|.|.- .. ..+. T Consensus 129 --~~~~~~---~~~~~~~~-----------------------------evih~~~~~~~~~~~G~spi~~-~~---~~~~ 170 (395) T protein:vir:95 129 --TVKDYT---YQRTFTMQ-----------------------------EVIYLKYNNNKVTHFVESLFED-YG---KIFG 170 (395) T ss_pred --EEcCce---eeeeeccc-----------------------------cEEEEccCCCCcccccchHHHH-HH---HHHH Confidence 000000 00011112 2455554333344567665432 22 2222 Q ss_pred HHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh-------hcchheecCCCCceEEEecccc------HHHH Q lcl|NC_021324. 237 RTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI-------YYGRILTLASEAAKISEFKAAE------LRNF 301 (480) Q Consensus 237 ~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~-------~~~~~~~~~g~~~~~~~~~~~~------~~~~ 301 (480) ..+. .....+.+--++. +....+...+.-...++- ....++.++ ++.++.+++... ...+ T Consensus 171 ~~~~----~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~-~g~~~~~l~~~~~~~~~~~~q~ 245 (395) T protein:vir:95 171 RMIG----AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLI-EGFDYEELSNGGKNSNMPFSEL 245 (395) T ss_pred HHHH----HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC-CCceeeeccccccccchhHHHH Confidence 2222 2222232222222 211111111111111111 111233343 445676654321 1247 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) ++..+....+|+.+-++|+..++++..| .+...+.+....|.-.+...+..+.. .+....... ...+ T Consensus 246 ~e~~~~~~~~Ia~~f~VPp~~l~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~----------kL~~~~~~~--~~~~ 312 (395) T protein:vir:95 246 SELMRDAIKNVALMIGIPPGLIYGETAD-LEKNTLVFEKFCLTPLLKKIQNELNA----------KLITQSMYL--KDTR 312 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCcccC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hhcChhhhc--ccce Confidence 7888888999999999999999864322 12211222221222222111111111 111111111 1234 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......|..+.++++.+++.+| +++...+++++|+.+-+-....+..-- ..+...... .....+... T Consensus 313 f~~~~l~~~D~~~~~~~~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~------~n~~~~~~~--~~~~~~~~~ 382 (395) T protein:vir:95 313 IEIVGVNKKDPLQYAEAIDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLIT------KNYEKANSG--ENDEKEKDE 382 (395) T ss_pred ecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeec------ccccccccc--ccccCcccc Confidence 4555566678889999999998876 566666777777653211000000000 000000000 000000000 Q ss_pred CCCccCCCCCCCC Q lcl|NC_021324. 462 ETKTETQTSPSGF 474 (480) Q Consensus 462 d~~~~~~~~~~~~ 474 (480) ....+++...+|+ T Consensus 383 ~~~kgg~~~~~g~ 395 (395) T protein:vir:95 383 NTLKGGDEDESGD 395 (395) T ss_pred cccCCCCCCCCCC Confidence 0001111111122 No 209 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=97.66 E-value=3.2e-05 Score=45.26 Aligned_cols=372 Identities=12% Similarity=0.090 Sum_probs=131.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCchhHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~ 83 (480) .-+..+|.+.. ......+.+ ..+..+.. ...........+|+..++-+-.-++.+- .++..... T Consensus 1 Mg~f~~lf~~~-------~~~~~~~~~-----~~~~~v~~---~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~ 65 (395) T protein:vir:10 1 MSILEKIFKTR-------KDITYMLDL-----DMIEDLSQ---QAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKND 65 (395) T ss_pred CchhhhhhccC-------ccccccccc-----hhccccch---hhhhhhHHHHHHHHHHHHhhccceeEeccCCccccch Confidence 11111111110 000000000 00011100 1111122233334433333333333221 12222233 Q ss_pred HHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEE--EEeCCCCCceEEEEEE Q lcl|NC_021324. 84 LWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA--ELDPRNTRRVTRAVRL 156 (480) Q Consensus 84 l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~--v~d~~~~~~~~~~~~~ 156 (480) +..++.. | ....+...+..+.+..|.+|+++..+ +.+. ..++....+ +++.. . ..+ T Consensus 66 ~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--------~~~~--~~~~~~~~~~~~~~~~----~-~~~-- 128 (395) T protein:vir:10 66 VYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--------KELL--IADSFYREEYALYDDI----F-KDV-- 128 (395) T ss_pred HHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC--------CCeE--ecCCccceeEeecCcc----e-eEE-- Confidence 4444432 3 23455666777788888887765421 1111 111111111 11100 0 000 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHH Q lcl|NC_021324. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~ 236 (480) .....+ +...+.++ -|+||+.+......+|.|.|.- .. ..+. T Consensus 129 --~~~~~~---~~~~~~~~-----------------------------evih~~~~~~~~~~~G~spi~~-~~---~~~~ 170 (395) T protein:vir:10 129 --TVKDYT---YQRTFTMQ-----------------------------EVIYLKYNNNKVTHFVESLFED-YG---KIFG 170 (395) T ss_pred --EEcCce---eeeeeccc-----------------------------cEEEEccCCCCcccccchHHHH-HH---HHHH Confidence 000000 00011112 2455554333344567665432 22 2222 Q ss_pred HHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh-------hcchheecCCCCceEEEecccc------HHHH Q lcl|NC_021324. 237 RTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI-------YYGRILTLASEAAKISEFKAAE------LRNF 301 (480) Q Consensus 237 ~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~-------~~~~~~~~~g~~~~~~~~~~~~------~~~~ 301 (480) ..+. .....+.+--++. +....+...+.-...++- ....++.++ ++.++.+++... ...+ T Consensus 171 ~~~~----~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~-~g~~~~~l~~~~~~~~~~~~q~ 245 (395) T protein:vir:10 171 RMIG----AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLI-EGFDYEELSNGGKNSNMPFSEL 245 (395) T ss_pred HHHH----HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC-CCceeeeccccccccchhHHHH Confidence 2222 2222232222222 211111111111111111 111233343 445676654321 1247 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) ++..+....+|+.+-++|+..++++..| .+...+.+....|.-.+...+..+.. .+....... ...+ T Consensus 246 ~e~~~~~~~~Ia~~f~VPp~~l~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~----------kL~~~~~~~--~~~~ 312 (395) T protein:vir:10 246 SELMRDAIKNVALMIGIPPGLIYGETAD-LEKNTLVFEKFCLTPLLKKIQNELNA----------KLITQSMYL--KDTR 312 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCcccC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hhcChhhhc--ccce Confidence 7888888999999999999999864322 12211222221222222111111111 111111111 1234 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......|..+.++++.+++.+| +++...+++++|+.+-+-....+..-- ..+...... .....+... T Consensus 313 f~~~~l~~~D~~~~~~~~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~------~n~~~~~~~--~~~~~~~~~ 382 (395) T protein:vir:10 313 IEIVGVNKKDPLQYAEAIDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLIT------KNYEKANSG--ENDEKEKDE 382 (395) T ss_pred ecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeec------ccccccccc--ccccCcccc Confidence 4555566678889999999998876 566666777777653211000000000 000000000 000000000 Q ss_pred CCCccCCCCCCCC Q lcl|NC_021324. 462 ETKTETQTSPSGF 474 (480) Q Consensus 462 d~~~~~~~~~~~~ 474 (480) ....+++...+|+ T Consensus 383 ~~~kgg~~~~~g~ 395 (395) T protein:vir:10 383 NTLKGGDEDESGD 395 (395) T ss_pred cccCCCCCCCCCC Confidence 0001111111122 No 210 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=97.66 E-value=3.2e-05 Score=45.26 Aligned_cols=372 Identities=12% Similarity=0.090 Sum_probs=131.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCchhHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~ 83 (480) .-+..+|.+.. ......+.+ ..+..+.. ...........+|+..++-+-.-++.+- .++..... T Consensus 1 Mg~f~~lf~~~-------~~~~~~~~~-----~~~~~v~~---~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~ 65 (395) T protein:vir:10 1 MSILEKIFKTR-------KDITYMLDL-----DMIEDLSQ---QAYVKRLAIDSCIEFVARAVAQSHFKVLEGNRIQKND 65 (395) T ss_pred CchhhhhhccC-------ccccccccc-----hhccccch---hhhhhhHHHHHHHHHHHHhhccceeEeccCCccccch Confidence 11111111110 000000000 00011100 1111122233334433333333333221 12222233 Q ss_pred HHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEE--EEeCCCCCceEEEEEE Q lcl|NC_021324. 84 LWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA--ELDPRNTRRVTRAVRL 156 (480) Q Consensus 84 l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~--v~d~~~~~~~~~~~~~ 156 (480) +..++.. | ....+...+..+.+..|.+|+++..+ +.+. ..++....+ +++.. . ..+ T Consensus 66 ~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~--------~~~~--~~~~~~~~~~~~~~~~----~-~~~-- 128 (395) T protein:vir:10 66 VYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS--------KELL--IADSFYREEYALYDDI----F-KDV-- 128 (395) T ss_pred HHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC--------CCeE--ecCCccceeEeecCcc----e-eEE-- Confidence 4444432 3 23455666777788888887765421 1111 111111111 11100 0 000 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHH Q lcl|NC_021324. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAAS 236 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~ 236 (480) .....+ +...+.++ -|+||+.+......+|.|.|.- .. ..+. T Consensus 129 --~~~~~~---~~~~~~~~-----------------------------evih~~~~~~~~~~~G~spi~~-~~---~~~~ 170 (395) T protein:vir:10 129 --TVKDYT---YQRTFTMQ-----------------------------EVIYLKYNNNKVTHFVESLFED-YG---KIFG 170 (395) T ss_pred --EEcCce---eeeeeccc-----------------------------cEEEEccCCCCcccccchHHHH-HH---HHHH Confidence 000000 00011112 2455554333344567665432 22 2222 Q ss_pred HHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh-------hcchheecCCCCceEEEecccc------HHHH Q lcl|NC_021324. 237 RTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI-------YYGRILTLASEAAKISEFKAAE------LRNF 301 (480) Q Consensus 237 ~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~-------~~~~~~~~~g~~~~~~~~~~~~------~~~~ 301 (480) ..+. .....+.+--++. +....+...+.-...++- ....++.++ ++.++.+++... ...+ T Consensus 171 ~~~~----~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~-~g~~~~~l~~~~~~~~~~~~q~ 245 (395) T protein:vir:10 171 RMIG----AQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLI-EGFDYEELSNGGKNSNMPFSEL 245 (395) T ss_pred HHHH----HHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC-CCceeeeccccccccchhHHHH Confidence 2222 2222232222222 211111111111111111 111233343 445676654321 1247 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) ++..+....+|+.+-++|+..++++..| .+...+.+....|.-.+...+..+.. .+....... ...+ T Consensus 246 ~e~~~~~~~~Ia~~f~VPp~~l~~~~sn-~e~~~~~~~~~~l~P~~~~ie~~l~~----------kL~~~~~~~--~~~~ 312 (395) T protein:vir:10 246 SELMRDAIKNVALMIGIPPGLIYGETAD-LEKNTLVFEKFCLTPLLKKIQNELNA----------KLITQSMYL--KDTR 312 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCcccC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hhcChhhhc--ccce Confidence 7888888999999999999999864322 12211222221222222111111111 111111111 1234 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......|..+.++++.+++.+| +++...+++++|+.+-+-....+..-- ..+...... .....+... T Consensus 313 f~~~~l~~~D~~~~~~~~~~~~~~G--~lt~NE~R~~~g~~p~~~g~~d~~~~~------~n~~~~~~~--~~~~~~~~~ 382 (395) T protein:vir:10 313 IEIVGVNKKDPLQYAEAIDKLVSSG--SFTRNEVRIMLGEEPSDNPELDEYLIT------KNYEKANSG--ENDEKEKDE 382 (395) T ss_pred ecchhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCceeeec------ccccccccc--ccccCcccc Confidence 4555566678889999999998876 566666777777653211000000000 000000000 000000000 Q ss_pred CCCccCCCCCCCC Q lcl|NC_021324. 462 ETKTETQTSPSGF 474 (480) Q Consensus 462 d~~~~~~~~~~~~ 474 (480) ....+++...+|+ T Consensus 383 ~~~kgg~~~~~g~ 395 (395) T protein:vir:10 383 NTLKGGDEDESGD 395 (395) T ss_pred cccCCCCCCCCCC Confidence 0001111111122 No 211 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=97.66 E-value=3.2e-05 Score=45.25 Aligned_cols=404 Identities=12% Similarity=0.094 Sum_probs=192.2 Q ss_pred CCC---------------HHHHHHHHH----H---HHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHH Q lcl|NC_021324. 1 MTT---------------YHEHVERLQ----G---LLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVAT 58 (480) Q Consensus 1 ~~~---------------~~~~i~~l~----~---~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ 58 (480) -+| ..-+-+.+. . .-..-..+|+.+..+++-+..+ .. T Consensus 36 ~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av---------------------~e 94 (523) T protein:vir:68 36 LDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLMTNYEVDNAV---------------------SE 94 (523) T ss_pred CCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHhhccchhhHH---------------------HH Confidence 000 000000000 0 0011112333333333332222 11 Q ss_pred HHHHHHhhh-ccCc---------cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee Q lcl|NC_021324. 59 YLRTLSDRL-DIEG---------FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP 128 (480) Q Consensus 59 iVd~~~~~l-~~~~---------~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~ 128 (480) ||+..+-+= ..++ ++-.-.+...+.+..|++--+|+....+..+...+.|+.|++...+.. ...+|-. T Consensus 95 IVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k--~pk~GI~ 172 (523) T protein:vir:68 95 IVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPK--RPKEGIK 172 (523) T ss_pred hhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCC--Cccccce Confidence 222111000 0011 111112234456666776668999999999999999999988764321 2346778 Q ss_pred EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCce-----EEEEEEEeCCcEEEEEecCCCccccccccccccccccc Q lcl|NC_021324. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAV-----PDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGV 203 (480) Q Consensus 129 ~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 203 (480) .++.++|..+-.|.- ..+..+.+. ...+-+|.+.. ..|...+... ... +++ + T Consensus 173 Elr~lDPr~i~~vr~-------------i~~~~~~g~~vi~~~~e~f~Y~~~~-~~~~~~g~~~----~~~----~~i-k 229 (523) T protein:vir:68 173 ELRRLDPRQVQYVRE-------------VITTTEAGVKIVKGYKEYFIYDTSH-ESYACDGRIY----EAG----TKI-K 229 (523) T ss_pred eeeeeCCcceeEEEe-------------ecCCCCcchhhhhhhhhheeecccc-cccccccccc----CCC----cce-e Confidence 888999987644431 111111110 11122344332 1121111100 000 011 2 Q ss_pred cce--EEeecccccCcc--CCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch------- Q lcl|NC_021324. 204 VPV--VPLTNDPRLGNR--YGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT------- 272 (480) Q Consensus 204 iPv--v~~~n~~~~~~~--~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~------- 272 (480) ||- |.|.+.-..... .-.|-|.+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- T Consensus 230 I~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQ--LkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k 307 (523) T protein:vir:68 230 IPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQ--LKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNT 307 (523) T ss_pred cchhheeeeeccceeCCCCceeccchhhhHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHh Confidence 221 334332111111 112334455555432 2566777777777777777766555555443322110 Q ss_pred ------hhhhcc-------------hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc--c Q lcl|NC_021324. 273 ------LDIYYG-------------RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS--E 328 (480) Q Consensus 273 ------~~~~~~-------------~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~--~ 328 (480) .....| -.|. .+|.+.++.++++.+.-.-++-++-....++..-++|.+-+.... . T Consensus 308 ~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f 387 (523) T protein:vir:68 308 MKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGI 387 (523) T ss_pred hcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcce Confidence 011111 1111 234456788888764444566667777788888889887774332 1 Q ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHH---H Q lcl|NC_021324. 329 NPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVS---K 401 (480) Q Consensus 329 ~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~~~---k 401 (480) +-.-|..|--.+.....-+.+.+..|..-+.++++.=+-+.|.--..++ ..+.+.|..-....+...++.+. . T Consensus 388 ~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 467 (523) T protein:vir:68 388 QFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRIN 467 (523) T ss_pred ecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHH Confidence 2122335555666777778888888888888888854444443333333 34677775443333333333222 1 Q ss_pred HHhcccc----CCCHHHHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 402 LYANGQG----PIPKEQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 402 l~~~~~~----~~s~e~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +.+...+ .+|.++++.. |..++++++++.+.++++..+. .-.++++.++.- T Consensus 468 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~-------~~~~p~~e~~~f 523 (523) T protein:vir:68 468 MLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEA-------RFQDPDQEQEDF 523 (523) T ss_pred HHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcC-------CCCCCchhhhcC Confidence 2222223 4588888755 7899999887776666554311 111222222222 No 212 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=97.65 E-value=3.3e-05 Score=45.13 Aligned_cols=398 Identities=12% Similarity=0.085 Sum_probs=193.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh-ccCc-------- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-DIEG-------- 71 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l-~~~~-------- 71 (480) +.+..++| .+|+.+..+++-+..+.. ||+..+-+= ..++ T Consensus 69 ~~~~~eLI-----------~~YR~ma~~pEvd~Av~e---------------------IVneaiv~d~~~~pV~l~L~~~ 116 (524) T protein:vir:10 69 MKTTRELI-----------DTYRNLMNNYEVDNAVSE---------------------IVSDAIVYEDDTEVVALNLDKS 116 (524) T ss_pred cchHHHHH-----------HHHHHHhhccchhhHHHH---------------------hhcceeEecCCCceEEEEecCc Confidence 11222222 234444444433332221 111111000 0001 Q ss_pred -cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 72 -FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 72 -~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) ++-.-.+...+.+..|++--+|+....+..+...+.|+.|++...+.. ...+|-..++.++|..+-.|.--... . T Consensus 117 ~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k--~pk~GI~Elr~lDPr~i~~vr~i~~~--~ 192 (524) T protein:vir:10 117 KFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPK--RPKEGIKELRRLDPRQVQYVREIITE--T 192 (524) T ss_pred CcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCC--CccccceeeeeeCCccceeeeeeccC--C Confidence 111112234456666776668999999999999999999988764321 23467788888999877554311100 0 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeecccccCcc--CCCccchH Q lcl|NC_021324. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNR--YGRSEISP 226 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~~~~--~G~s~i~~ 226 (480) ......+ +....+-+|.+.. ..|...+... ... +++ +||- |.|.+.-..... .-.|-|.+ T Consensus 193 ~~~~~vi------~~~~e~f~Y~~~~-~~y~~~g~~~----~~~----~~i-kI~~dAI~y~hSGL~d~~~~~i~gyLhk 256 (524) T protein:vir:10 193 EAGTKIV------KGYKEYFIYDTAH-ESYACDGRMY----EAG----TKI-KIPKAAIVYAHSGLVDCCGKNIIGYLHR 256 (524) T ss_pred Cccchhh------cchhhheeeccCc-cccccCcccc----CCC----cce-ecchhheeeeeccceeCCCCceeccchh Confidence 0000000 0011222344432 2222111100 000 111 2322 334332111111 11233445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hh Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RI 280 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~ 280 (480) .|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -. T Consensus 257 AiKp~NQ--LkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDy 334 (524) T protein:vir:10 257 AVKPANQ--LKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDY 334 (524) T ss_pred hhHHHHh--hhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhh Confidence 5555432 2566777777777777777766555555443322110 011111 11 Q ss_pred ee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc---ccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 281 LT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 281 ~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~~~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) |. .+|.+.++.++++.+.-.-++-++-....++..-++|.+-+.... .+-.-|..|--.+.....-+.+.+..| T Consensus 335 WLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rF 414 (524) T protein:vir:10 335 WLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKF 414 (524) T ss_pred cccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHH Confidence 11 234456788888764444566677777888888899988883221 122234456666667777788888888 Q ss_pred HHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHH---HHHhcccc----CCCHHHHHHh-CCCC Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVS---KLYANGQG----PIPKEQARID-LGYT 422 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~~~---kl~~~~~~----~~s~e~~~~~-~~~~ 422 (480) ..-+.++++.=+-+.|.--..++ ..+.+.|..-....+...++.+. .+.+...+ .+|.++++.. |..+ T Consensus 415 s~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t 494 (524) T protein:vir:10 415 EEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMT 494 (524) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccC Confidence 88888888854444443333333 34677775444333333333222 12222223 4588888755 7899 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 423 ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 423 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +++++++.+.++++..+. .-.++++.++.- T Consensus 495 Deei~~~~k~I~~E~k~~-------~~~~~~~~~~~f 524 (524) T protein:vir:10 495 DEEIEQEAKQIEEESKEA-------RFQDPDQEQEDF 524 (524) T ss_pred HHHHHHHHHHHHHHhhcC-------CCCCCchhhhcC Confidence 999887776666554311 111222222222 No 213 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=97.63 E-value=3.5e-05 Score=44.99 Aligned_cols=398 Identities=12% Similarity=0.084 Sum_probs=193.6 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh-ccCc-------- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL-DIEG-------- 71 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l-~~~~-------- 71 (480) +.+..++| .+|+.+..+++-+..+.. ||+..+-+= ..++ T Consensus 69 ~~~~~eLI-----------~~YR~ma~~pEvd~Av~e---------------------IVneaiv~d~~~~pV~l~L~~~ 116 (524) T protein:vir:72 69 MKTTRELI-----------DTYRNLMNNYEVDNAVSE---------------------IVSDAIVYEDDTEVVALNLDKS 116 (524) T ss_pred cchHHHHH-----------HHHHHHhhccchhhHHHH---------------------hhcceeEecCCCceEEEEecCc Confidence 11222222 234444444433332221 111111000 0001 Q ss_pred -cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCce Q lcl|NC_021324. 72 -FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRV 150 (480) Q Consensus 72 -~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~ 150 (480) ++-.-.+...+.+..|++--+|+....+..+...+.|+.|++...+.. ...+|-..++.++|..+-.|.--... . T Consensus 117 ~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k--~pk~GI~Elr~lDPr~i~~vr~i~~~--~ 192 (524) T protein:vir:72 117 KFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPK--RPKEGIKELRRLDPRQVQYVREIITE--T 192 (524) T ss_pred CcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCC--CccccceeeeeeCCccceeeeeeccC--C Confidence 111112234456666776668999999999999999999988764321 23467788888999877554311100 0 Q ss_pred EEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeecccccCcc--CCCccchH Q lcl|NC_021324. 151 TRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNR--YGRSEISP 226 (480) Q Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~~~~--~G~s~i~~ 226 (480) ......+ +....+-+|.+.. ..|...+... ... +++ +||- |.|.+.-..... .-.|-|.+ T Consensus 193 ~~~~~vi------~~~~e~f~Y~~~~-~~y~~~g~~~----~~~----~~i-kI~~dAI~y~hSGL~d~~~~~i~gyLhk 256 (524) T protein:vir:72 193 EAGTKIV------KGYKEYFIYDTAH-ESYACDGRMY----EAG----TKI-KIPKAAVVYAHSGLVDCCGKNIIGYLHR 256 (524) T ss_pred Cccchhh------cchhhheeeccCc-cccccCcccc----CCC----cce-ecchhheeeeeccceeCCCCceeccchh Confidence 0000000 0011222344432 2222111100 000 111 2322 334432111111 11233445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hh Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RI 280 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~ 280 (480) .|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -. T Consensus 257 AiKp~NQ--LkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDy 334 (524) T protein:vir:72 257 AVKPANQ--LKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDY 334 (524) T ss_pred hhHhHHh--hhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhh Confidence 5555432 2566777777777777777766555555443322110 011111 11 Q ss_pred ee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc---ccchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 281 LT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 281 ~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~~~~Sg~Al~~~~~~l~~k~~~~~~~~ 354 (480) |. .+|.+.++.++++.+.-.-++-++-....++..-++|.+-+.... .+-.-|..|--.+.....-+.+.+..| T Consensus 335 WLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rF 414 (524) T protein:vir:72 335 WLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKF 414 (524) T ss_pred cccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHH Confidence 11 234456788888764444566677777888888899988883221 122234456666667777788888888 Q ss_pred HHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHH---HHHhcccc----CCCHHHHHHh-CCCC Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVS---KLYANGQG----PIPKEQARID-LGYT 422 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~~~---kl~~~~~~----~~s~e~~~~~-~~~~ 422 (480) ..-+.++++.=+-+.|.--..++ ..+.+.|..-....+...++.+. .+.+...+ .+|.++++.. |..+ T Consensus 415 s~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~t 494 (524) T protein:vir:72 415 EEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMT 494 (524) T ss_pred HHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccC Confidence 88888888854444443333333 34677775443333333333222 12222223 4588888755 7899 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 423 ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 423 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +++++++.+.++++..+. .-.++++.++.- T Consensus 495 Deei~~~~k~I~~E~k~~-------~~~~~~~~~~~f 524 (524) T protein:vir:72 495 DEEIEQEAKQIEEESKEA-------RFQDPDQEQEDF 524 (524) T ss_pred HHHHHHHHHHHHHHhhcC-------CCCCCchhhhcC Confidence 999887776666554311 111222222222 No 214 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.63 E-value=3.6e-05 Score=44.98 Aligned_cols=380 Identities=11% Similarity=0.007 Sum_probs=152.0 Q ss_pred HHHHHHHHHH-HH---HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCCch--h Q lcl|NC_021324. 10 RLQGLLARDL-PN---LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSE--G 80 (480) Q Consensus 10 ~l~~~~~~~~-~~---~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~--~ 80 (480) -+.++.-.+. .. .....-.+-|... .+..+.+. .-+.+.-...+|+..++-+-.-+|.+ .++.+ . T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~~~ 74 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWLGINP---SETYVNGK---SCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKRVP 74 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHhcCCc---Ccceechh---hhhccHHHHHHHHHHHHhhhhCceEEEEecCCeeecc Confidence 1111110000 00 0000000111100 00011110 00111112233333333332223322 11111 1 Q ss_pred HHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 81 LEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 81 ~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ...+..++.. | ....+...++.+.+.+|.||+++-.+ ..|.+ .+..++|..+.++.++........-+ T Consensus 75 ~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~ 148 (409) T protein:vir:10 75 DHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFK------KNGEIKGLYPLKSDGMKIFVDDTGLLNSENNV 148 (409) T ss_pred CchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEEcCCceEEEEcCCccccccceE Confidence 1234444432 3 23466778889999999999988653 34554 57788888888777543211111111 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHH Q lcl|NC_021324. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~ 234 (480) .+... ...+.. ..+.++. |+|+++.. ..+.+|.|.|.. +...++. T Consensus 149 ~y~~~-~~~g~~---~~~~~~e-----------------------------vih~r~~~-~d~~~G~s~i~~-~~~~i~~ 193 (409) T protein:vir:10 149 WYLYT-DDLGQR---HKFMSDE-----------------------------ILHFKGLT-ADGLAGLSVIEL-LNHLIEN 193 (409) T ss_pred EEEEE-eCCcee---EEecccc-----------------------------EEEecCcC-CCCcccccHHHH-HHHHHHH Confidence 11111 111110 0111222 35554432 345678776532 3333333 Q ss_pred HHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEecccc-HHHHHHHHH Q lcl|NC_021324. 235 ASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEEME 306 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~ 306 (480) ...+.......+...+.|-.+++-. ...+...+.-...|+- ..++++.++ ++.++.+++.+. -..+++..+ T Consensus 194 ~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~ 272 (409) T protein:vir:10 194 GKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLP-IGYKFEPISQKLVDAQFLENSQ 272 (409) T ss_pred HHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecC-CCceEEEccCChhhHHHHHHHH Confidence 2222222233333333454444321 1111111111111111 123455554 446777765432 224677778 Q ss_pred HHHHHHHhhcCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCcccceee Q lcl|NC_021324. 307 VFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEEYTRL 380 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~~~~~~~ 380 (480) ....+|+..-++|+..+|....+ .++..... ..+ +...|+-+++.+. .+...........+ T Consensus 273 ~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~---~~f----------~~~~l~P~~~~ie~~ln~kL~~~~~~~~~~~~ 339 (409) T protein:vir:10 273 LTIRQIASVFGVKMHQLNDLDRATHSNITEQN---REF----------YIDTLQSILNMYELEINYKLFLISEIKNGFYS 339 (409) T ss_pred HHHHHHHHHhCCCHHHcCCCCCCccccHHHHH---HHH----------HHHHHHHHHHHHHHHHHHhhcCchhccCCcEE Confidence 88899999999999999854322 22221111 111 1112222222111 11111111111234 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCC Q lcl|NC_021324. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++.+......|..++++++.+++.+| +++...+++.+|+.+-+- . +.+..... ..+.. .. T Consensus 340 ~fd~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~lgl~p~~g--g------------D~~~~~~n--~~~~~--~~ 399 (409) T protein:vir:10 340 KFNVDTILRADIKTRYESYKEAIQNG--FKTPNEIRELEEDEPLEG--G------------DVLLINGN--MIPVK--MA 399 (409) T ss_pred EEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------CeeeeccC--ccchh--hc Confidence 55555566678899999999998876 566666677777643210 0 00000000 00000 00 Q ss_pred CCCCccCCCCCCCCCc Q lcl|NC_021324. 461 TETKTETQTSPSGFNR 476 (480) Q Consensus 461 ~d~~~~~~~~~~~~~~ 476 (480) + +....|+.+ T Consensus 400 ~------~~~~kgGe~ 409 (409) T protein:vir:10 400 G------EQYSKGGEK 409 (409) T ss_pred c------ccccccCCC Confidence 0 000111111 No 215 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=97.61 E-value=3.8e-05 Score=44.81 Aligned_cols=396 Identities=11% Similarity=0.071 Sum_probs=149.8 Q ss_pred HHHHHH---HHHHHHHHHHHHHHH--HHHhhccCc-c--------------hhc----CcccchhhhcceeccchHHHHH Q lcl|NC_021324. 5 HEHVER---LQGLLARDLPNLLEA--EAYRNGTRR-L--------------KTI----GIGAPPELAYLDVQPGWVATYL 60 (480) Q Consensus 5 ~~~i~~---l~~~~~~~~~~~~~~--~~YY~G~~~-i--------------~~~----~~~~~~~~~~~~~~~n~~~~iV 60 (480) .-|..- .++-..++.++-.+. --+...+.+ + ... ...+.+.. -++... .-.+| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-al~~~~--V~acv 77 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIE-AIRHSD--IFTAV 77 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhh-hhccHH--HHHHH Confidence 111000 000000111111000 000000000 0 000 00000000 011100 11123 Q ss_pred HHHHhhhccCccccCCC--chhHHHHHHHHH--hcC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEE Q lcl|NC_021324. 61 RTLSDRLDIEGFRISED--SEGLEELWNWWQ--AND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRV 132 (480) Q Consensus 61 d~~~~~l~~~~~~~~~d--~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~ 132 (480) +..++-+-.-++.+-.+ ......+..++. -|. -..+...+..+.+.+|.||+.+..+ ..|+| .+.. T Consensus 78 ~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~------~~G~~~~L~~ 151 (441) T protein:vir:98 78 MMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRD------KTGEPMNLTF 151 (441) T ss_pred HHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEE Confidence 33322222223333211 112223444442 232 3356677888899999999998653 34555 5778 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecc Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~ 212 (480) ++|..+.+..+... .+.+.. +. .+..+. .....+.++. |+||++. T Consensus 152 i~~~~v~v~~~~~g--~~~~~~-~~--~~~~~~-~~~~~~~~~d-----------------------------viHir~~ 196 (441) T protein:vir:98 152 RKTSEIELKLDARG--RLYYFH-QR--IDSNGN-NIERNVKFED-----------------------------MLDIKFY 196 (441) T ss_pred EcCceeEEEECCCC--cEEEEE-EE--eccCcc-eeeEEEcccc-----------------------------EEEeccC Confidence 89998888776432 222111 11 111110 0111122222 3455443 Q ss_pred cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhhh------hcchheecC Q lcl|NC_021324. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI------YYGRILTLA 284 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~------~~~~~~~~~ 284 (480) . ..+.+|.|.|.- +...++....+.....+.+...+.|-.++. |...++...+.-...|+. ..++++.++ T Consensus 197 ~-~dg~~G~spi~~-~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~ 274 (441) T protein:vir:98 197 S-LDGINGLSLLDT-LSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD 274 (441) T ss_pred C-CCCccccCHHHH-HHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcceecC Confidence 2 234567765432 223333222222222222333334444443 221111100000111211 113455665 Q ss_pred CCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 285 SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMR 363 (480) Q Consensus 285 g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 363 (480) ++.++..++.+. -..+++..+....+|+..-++|+..+|....+. |-......+. .... -+-..++..+. T Consensus 275 -~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~q~~~~y~---~tl~----P~~~~ie~~ln 345 (441) T protein:vir:98 275 -ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDYL---STLK----PYITCVCAELN 345 (441) T ss_pred -CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHHH---HHHH----HHHHHHHHHHH Confidence 345777765432 224778888888999999999999998654332 2111111111 1111 11111211111 Q ss_pred HHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH---H-HHHHHHHHHHHHH Q lcl|NC_021324. 364 IAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ---R-EQMRDWDKQETED 439 (480) Q Consensus 364 li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~---~-~~~~~~~~~~~~~ 439 (480) . .+... .....+++........|..++++++.+++.+| +++...+++.+|+.+-+ . ....... T Consensus 346 ~--~L~~~---~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~pi~gGd~~~~~~~~n------ 412 (441) T protein:vir:98 346 F--KFNDE---YVNREFKFDTTEIRVVDEKTQAEIDKINIDSG--KMNIDEIRQRDGLAPIPGGNGSIHRVDLN------ 412 (441) T ss_pred h--hcccc---ccCceEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCcceEeeccc------ Confidence 1 11111 11122344334455678888999998888776 66777777777764321 0 0000000 Q ss_pred HHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 440 MIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) ...... ..+.+...++. .+....|+++.. T Consensus 413 ----~~~~~~-----~~~~q~~~~~~-~~~~~kgGe~ne 441 (441) T protein:vir:98 413 ----HVNIEL-----VDEYQMNKSRA-TDKKLKGGEENE 441 (441) T ss_pred ----cccccc-----ccccccccccc-cccccCCCCCCC Confidence 000000 00000000000 000000111101 No 216 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.59 E-value=4.1e-05 Score=44.63 Aligned_cols=382 Identities=13% Similarity=0.041 Sum_probs=149.3 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCC--c Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNL-LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED--S 78 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~-~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d--~ 78 (480) .-+..++.........+. .....++-+....... ..... . -........+|+..++-+-.-+|.+ .++ + T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~--~-~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~~~ 75 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDVSFL--VPGYV--R-LSDNPEVRMAVHKIADLISSMTIYLMQNTEDGDI 75 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCcccCcc--ccCHH--H-HhhcHHHHHHHHHHHHhhccCceEEEEecCCcce Confidence 111111100000000000 0001111111101000 00000 0 0112344556666555554445543 111 1 Q ss_pred hhHHHHHH-HHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 79 EGLEELWN-WWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~-~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+.. +..+ | ....+...+..+.+.+|.||.++.... +..|.+ .+..++|..+.++.+... T Consensus 76 ~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~----~~~g~~~~l~~i~~~~v~~~~~~~~------ 145 (406) T protein:vir:95 76 RIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKY----TADGLIDELVPLTPSKVNFLDTPDG------ 145 (406) T ss_pred eecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEE----CCCCcEEEEEEEcCceeEEEEcCCe------ Confidence 11122222 2221 2 345677778888888877755543211 234554 466778877766554321 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeeccc-ccCccCCCccchHHHHHH Q lcl|NC_021324. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP-RLGNRYGRSEISPELRKV 231 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~-~~~~~~G~s~i~~~v~~l 231 (480) + ++. .+ + ..|.++ =|+||+.+. .....+|.|.+.- +... T Consensus 146 ~-~~~---~~-~-----~~~~~~-----------------------------evih~~~~~~~~~~~~G~s~i~~-~~~~ 185 (406) T protein:vir:95 146 Y-QVL---YG-G-----QTFNYD-----------------------------EVLHFIYNPDPERPYIGRGYRVV-LKDI 185 (406) T ss_pred E-EEE---ec-c-----EEEchh-----------------------------HEEEeeccCCCCCCccccCHHHH-HHHH Confidence 0 000 00 0 011111 145665432 2234568876532 3333 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhhh------hcchheecCCCCceEEEe---ccccHHHH Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLDI------YYGRILTLASEAAKISEF---KAAELRNF 301 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~---~~~~~~~~ 301 (480) ++....+.....+.+...+.|-.++.-- ...+...+.....+.- ..+....++++..++.++ +.... .+ T Consensus 186 i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d~-q~ 264 (406) T protein:vir:95 186 ADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKDI-AI 264 (406) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhHH-HH Confidence 3333333223333333333444444311 1111111111111111 112233344343344433 22223 46 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCCcccceee Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG-REVTEEYTRL 380 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~-~~~~~~~~~~ 380 (480) ++..+....+|+..-++|++-+|..+.+ +.. +.. .+...|.-+++.+...+. .-.......+ T Consensus 265 ~e~~~~~~~~Ia~~fgVp~~~lg~~~~~--~~~-----~~~----------~~~~~l~P~~~~ie~~l~~~l~~~~~~~~ 327 (406) T protein:vir:95 265 NEAVELDKRTVAGMFGVPAFLLGIGEFN--RDE-----YNN----------FINSTILPIAKGIEQELTRKLLISPDLYF 327 (406) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCCCch--HHH-----HHH----------HHHHHHHHHHHHHHHHHHHhcCCCCCcEE Confidence 7888888999999999999988753211 111 111 222223333222221111 0011111235 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCC Q lcl|NC_021324. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++.+......|..+.++.+.+++.+| +++...+++++|+.+-+- ..+..-...-..++.+ .......+ T Consensus 328 ~fd~~~l~~~d~~~~~~~~~~l~~~G--~~t~NE~R~~~gl~p~~~--gd~~~~~~n~~~~~~~--------~~~~~~k~ 395 (406) T protein:vir:95 328 KFNPRSLYAYDLKELAEVGSNMYVRG--IMEGNEVRDWLGLSPKEG--LSELVILENYIPLDKI--------GDQSKLKG 395 (406) T ss_pred EeechhhhcCCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cceeeeccCccchhhc--------ccccccCC Confidence 55556666678889999998988865 667777888888764321 1000000000000000 00011111 Q ss_pred CCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 461 TETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 461 ~d~~~~~~~~~~~~~~~~~~ 480 (480) ++.++ .++ ++- T Consensus 396 g~~~~--------~~~-~~~ 406 (406) T protein:vir:95 396 GDNSG--------ADG-QTD 406 (406) T ss_pred CCCCC--------CCC-CCC Confidence 11111 111 111 No 217 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=97.57 E-value=4.3e-05 Score=44.51 Aligned_cols=400 Identities=13% Similarity=0.060 Sum_probs=195.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) +.+..+|| .+|+.+..+++-+..+...-..+ ++.+--..+|.--.+ -..++-.-.++. T Consensus 64 ~~~~~eLI-----------~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~L~---~~~~s~~ik~kI 121 (516) T protein:vir:10 64 ISGTKDLI-----------NTYRQLINNPEVERAVANIVNEA--------IVYERGHKVVSLDLD---DTDFGSNVKEKI 121 (516) T ss_pred cchHHHHH-----------HHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEEec---ccCcchHHHHHH Confidence 22222222 23444444444333222110000 000000000000000 000111112234 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+...+.|+.|++...+ ...+|-..++.++|..+..+.-- ..+ T Consensus 122 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid----~~k~GI~Elr~lDPr~i~~vR~i-------------~~~ 184 (516) T protein:vir:10 122 LEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP----NPKKGIAELRRLDPRFMEYYREI-------------VTS 184 (516) T ss_pred HHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec----CccccceeeeeeCCcceeeEeee-------------ccc Confidence 4556666666689999999999999999999885543 34578888999999887665421 111 Q ss_pred cCCc-----eEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeecccccCcc--CCCccchHHHHHH Q lcl|NC_021324. 161 DDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNR--YGRSEISPELRKV 231 (480) Q Consensus 161 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~~~~--~G~s~i~~~v~~l 231 (480) +..+ ....+-+|.++.. .|...+... ... +++ +||- |.|.+.-..... .-.|-|.+.|+++ T Consensus 185 ~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~~----~~~----~~i-kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 185 DIGGTTIVKGYREFFIYTTGNE-GYSYNGRIF----EPN----TRI-KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPA 254 (516) T ss_pred ccccchhhhhhhheeeeccCcc-cccccccee----CCC----cce-eechhheeeecccceeCCCCceeeeehhhhHhH Confidence 1111 1112233443222 222222110 000 011 1221 334432111111 1133344555554 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hhee--- Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RILT--- 282 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~~~--- 282 (480) .. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -.|. T Consensus 255 NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRR 332 (516) T protein:vir:10 255 NQ--LKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR 332 (516) T ss_pred Hh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc Confidence 32 2667777777777778877776555555543322110 011111 1111 Q ss_pred cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENP---ASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) .+|.+.++.++++.+.=.-++-++-....++..-++|.+-+....+.+ .-|..|--.+.....-+.+.+..|..-+. T Consensus 333 eGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~ 412 (516) T protein:vir:10 333 DGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFL 412 (516) T ss_pred CCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 234456788888765444566667777888888899988886543211 22334554566667777888888888888 Q ss_pred HHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHH-------HHHHHHHhccccCCCHHHHHHh-CCCChhHHH Q lcl|NC_021324. 360 RAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKA-------DAVSKLYANGQGPIPKEQARID-LGYTATQRE 427 (480) Q Consensus 360 ~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~a-------d~~~kl~~~~~~~~s~e~~~~~-~~~~~d~~~ 427 (480) ++++.=+-+.|---..++ ..+.+.|..-....+...+ +.+.++..-....+|.++++.. |..++++++ T Consensus 413 ~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~ 492 (516) T protein:vir:10 413 DPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIA 492 (516) T ss_pred HHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHH Confidence 888754444443333333 3466777544333333322 3333333322357899998755 789999988 Q ss_pred HHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 428 QMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) ++.+.++++..+. .-..+++ +.+- T Consensus 493 ~e~k~I~~E~~~~-------~~~~p~~----~~~f 516 (516) T protein:vir:10 493 QEEKQIEQEAGIK-------RFQNPEN----EDDF 516 (516) T ss_pred HHHHHHHHhhhCC-------CCCCCCc----cccC Confidence 7776666554211 0011111 1111 No 218 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=97.57 E-value=4.3e-05 Score=44.51 Aligned_cols=400 Identities=13% Similarity=0.060 Sum_probs=195.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) +.+..+|| .+|+.+..+++-+..+...-..+ ++.+--..+|.--.+ -..++-.-.++. T Consensus 64 ~~~~~eLI-----------~~YR~ma~~pEvd~Av~eIVnea--------iv~d~~~~pV~l~L~---~~~~s~~ik~kI 121 (516) T protein:vir:10 64 ISGTKDLI-----------NTYRQLINNPEVERAVANIVNEA--------IVYERGHKVVSLDLD---DTDFGSNVKEKI 121 (516) T ss_pred cchHHHHH-----------HHHHHHhhccchhhHHHHhhcce--------eEecCCCceEEEEec---ccCcchHHHHHH Confidence 22222222 23444444444333222110000 000000000000000 000111112234 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .+.+..|.+--+|+....+..+...+.|+.|++...+ ...+|-..++.++|..+..+.-- ..+ T Consensus 122 ~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid----~~k~GI~Elr~lDPr~i~~vR~i-------------~~~ 184 (516) T protein:vir:10 122 LEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMP----NPKKGIAELRRLDPRFMEYYREI-------------VTS 184 (516) T ss_pred HHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec----CccccceeeeeeCCcceeeEeee-------------ccc Confidence 4556666666689999999999999999999885543 34578888999999887665421 111 Q ss_pred cCCc-----eEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeecccccCcc--CCCccchHHHHHH Q lcl|NC_021324. 161 DDVA-----VPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRLGNR--YGRSEISPELRKV 231 (480) Q Consensus 161 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~~~~--~G~s~i~~~v~~l 231 (480) +..+ ....+-+|.++.. .|...+... ... +++ +||- |.|.+.-..... .-.|-|.+.|+++ T Consensus 185 ~~~~~~v~~~~~e~~~Y~~~~~-~~~~~g~~~----~~~----~~i-kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~ 254 (516) T protein:vir:10 185 DIGGTTIVKGYREFFIYTTGNE-GYSYNGRIF----EPN----TRI-KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPA 254 (516) T ss_pred ccccchhhhhhhheeeeccCcc-cccccccee----CCC----cce-eechhheeeecccceeCCCCceeeeehhhhHhH Confidence 1111 1112233443222 222222110 000 011 1221 334432111111 1133344555554 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hhee--- Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RILT--- 282 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~~~--- 282 (480) .. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -.|. T Consensus 255 NQ--Lkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRR 332 (516) T protein:vir:10 255 NQ--LKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRR 332 (516) T ss_pred Hh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccccc Confidence 32 2667777777777778877776555555543322110 011111 1111 Q ss_pred cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 283 LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENP---ASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 283 ~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) .+|.+.++.++++.+.=.-++-++-....++..-++|.+-+....+.+ .-|..|--.+.....-+.+.+..|..-+. T Consensus 333 eGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~ 412 (516) T protein:vir:10 333 DGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFL 412 (516) T ss_pred CCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 234456788888765444566667777888888899988886543211 22334554566667777888888888888 Q ss_pred HHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHH-------HHHHHHHhccccCCCHHHHHHh-CCCChhHHH Q lcl|NC_021324. 360 RAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKA-------DAVSKLYANGQGPIPKEQARID-LGYTATQRE 427 (480) Q Consensus 360 ~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~a-------d~~~kl~~~~~~~~s~e~~~~~-~~~~~d~~~ 427 (480) ++++.=+-+.|---..++ ..+.+.|..-....+...+ +.+.++..-....+|.++++.. |..++++++ T Consensus 413 ~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~ 492 (516) T protein:vir:10 413 DPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIA 492 (516) T ss_pred HHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHH Confidence 888754444443333333 3466777544333333322 3333333322357899998755 789999988 Q ss_pred HHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 428 QMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) ++.+.++++..+. .-..+++ +.+- T Consensus 493 ~e~k~I~~E~~~~-------~~~~p~~----~~~f 516 (516) T protein:vir:10 493 QEEKQIEQEAGIK-------RFQNPEN----EDDF 516 (516) T ss_pred HHHHHHHHhhhCC-------CCCCCCc----cccC Confidence 7776666554211 0011111 1111 No 219 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=97.52 E-value=5.1e-05 Score=44.11 Aligned_cols=428 Identities=11% Similarity=0.020 Sum_probs=165.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHH---HHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLE---AEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-- 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~---~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-- 75 (480) |++-++...-|--....+..+.-. ...||+-.-++...+... .-+..+. ......-.+++-...+....|.+. T Consensus 1 ~~~~~~~~~gl~p~rl~~i~~~~~~~~~~~~~~~~~~~Lr~~~~~-~ly~~m~-~D~hi~s~l~~Rk~av~~~~w~v~p~ 78 (488) T protein:vir:95 1 MADITETQESLPPFRMGEVGSLGLKVKNGRIYEEPRQALRFPESI-KTFQLMM-RDPAVAASVNIIKMFVRKVNWRFVPP 78 (488) T ss_pred CCCccccCCCCCHHHHHHHHHHhhccccchhhccchhhhcccchH-HHHHHHh-hChHHHHHHHHHHHHHhcCCceEecC Confidence 877766655444433222222111 123343111111111110 1122332 245555555555555555555542 Q ss_pred C----Cch---hHHHHHHHHHh--cCHHHHHHHHHHHHhhcCee-EEEEecCccccC------CCCceeEEEEEcccceE Q lcl|NC_021324. 76 E----DSE---GLEELWNWWQA--NDLDEESVLGHDDSLTFGRA-YITVSHPDVESG------DPAGIPLIRVESPLYMY 139 (480) Q Consensus 76 ~----d~~---~~~~l~~~~~~--n~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~------d~~g~~~~~~~~p~~~~ 139 (480) + +.+ ..+.+++++.. +.|..++..+ .++..+|.+ ++++|....+.. -.+|...+..+.++ T Consensus 79 ~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~-lda~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~R--- 154 (488) T protein:vir:95 79 KGKEQDPKMLERADFFNSLMDDMEHDWADFINSV-MSFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIR--- 154 (488) T ss_pred CCCchhHHHHHHHHHHHHHHhccCccHHHHHHHH-HHhhcccceeeeeeeeccccccccccccccCCeeeeeeeeec--- Confidence 1 111 22345555543 2355666665 478889977 566774321100 01222222111110 Q ss_pred EEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc---eEEeecccccC Q lcl|NC_021324. 140 AELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP---VVPLTNDPRLG 216 (480) Q Consensus 140 ~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n~~~~~ 216 (480) +..-++.+.-..+.+.. +.+...........+.. +. .....+ -.|| ++.+.+....+ T Consensus 155 ----------pq~~~~~f~~d~d~~l~----~~~~~~~~~~~~~~~~~--~~---~~~~~~-~~lP~~kfi~~~~~~~~g 214 (488) T protein:vir:95 155 ----------NQSTLDKWYFDEDFRRV----TGVRQNLRNVSHIAGAI--NL---GERPLT-RKLPRAKFMLFKYDDEYG 214 (488) T ss_pred ----------CcccccceeeccCCCce----eeccccccccccccccc--cc---cccccc-ccccccceEEEeecCCCC Confidence 00000011000011100 00000000000000000 00 000000 1233 24456777788 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc--cccccccccc----hhhhhcc--------hhee Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTT--DELTNDGENT----TLDIYYG--------RILT 282 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~--~~~~~~~~~~----~~~~~~~--------~~~~ 282 (480) .++|.+-+..-.-..+- =+..+..++.-++.|+.|..+.+|... ....+..... ...+..+ .++. T Consensus 215 ~p~g~gLlr~~~w~~~f-K~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP 293 (488) T protein:vir:95 215 NPEGRSPLLNAYVPWKY-KVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWP 293 (488) T ss_pred ccchhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeec Confidence 89988866432222211 145566677777888778777766211 1111111111 1111111 1111 Q ss_pred cCCCCceEE---------EeccccHHHHHHHHHHHHHHHHhhcCCChHHhccc--c-ccchHHHHHHH-HHHHHHHHHHH Q lcl|NC_021324. 283 LASEAAKIS---------EFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS--S-ENPASAEAIIA-TDSRIVKMAER 349 (480) Q Consensus 283 ~~g~~~~~~---------~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~--~-~~~~Sg~Al~~-~~~~l~~k~~~ 349 (480) .|-+.+++ .....+...|...++..-.+|+.+. ||.. + ....++-|+.- ...-....+.. T Consensus 294 -~g~~~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~i------LGqtLT~~~~~~Gs~Al~~vh~ev~~~i~~a 366 (488) T protein:vir:95 294 -RYIDPDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAF------MSDVLAMGQSKYGSFSLADSKTSLLAMSVDI 366 (488) T ss_pred -cccccccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHH------hccccccccCcchhhhHHHHHHHHHHHHHHH Confidence 11112221 1111223345555555445555542 2211 0 01112222221 12222333333 Q ss_pred HHHHHHHHHH-HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCC----HHHHHHhCCCChh Q lcl|NC_021324. 350 KGRIFGGAWE-RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIP----KEQARIDLGYTAT 424 (480) Q Consensus 350 ~~~~~~~~l~-~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s----~e~~~~~~~~~~d 424 (480) -.+.+...+. +++.-++.+..... ....++.|....+.++.+.++.+.+|+..|.. ++ .+.+.+.+|+... T Consensus 367 Da~~i~~tln~~li~~l~~~Nfg~~---~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~-i~~~~~~~~i~e~~gip~~ 442 (488) T protein:vir:95 367 LLKQIKNVINRDLVAQTYALNMWDD---EEHVQITYDDIETPDLEAIGSYIQKTVAVGAL-EVDKELSNKLREHIGLPPA 442 (488) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCC---CCccEEEecCcChhhHHHHHHHHHHHHhCCCc-cccHHHHHHHHHHhCCCCC Confidence 3445555553 35554555543221 12356788778888889999999999988744 44 4678888887643 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 425 QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 425 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) +..+ + . . .. ...++++...+.......+...++..-+++..| T Consensus 443 ~~~e-------~---~-~--~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 484 (488) T protein:vir:95 443 DESQ-------P---V-S--EK-LSPNSQSRSGDGYKTAGEGTAKTPSAKDPSTAN 484 (488) T ss_pred CCCc-------c---c-c--cc-CCCCCCCCCCcccCCCcccCCcccccccchhhh Confidence 2110 0 0 0 00 100111111110011111122222222333333 No 220 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=97.52 E-value=5.2e-05 Score=44.07 Aligned_cols=400 Identities=12% Similarity=0.084 Sum_probs=193.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCccc--chhhhcceeccchHHHHHHHHHhhhccCccccCCCc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA--PPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~--~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~ 78 (480) ..+ +|+| .+|..+..+++-+..+...-..+ .+... .=+.+++ +. ..++-.-.+ T Consensus 59 ~~~-~eLI-----------~~YR~ma~~pEvd~Av~eIvne~iv~d~~~-~pV~l~l-----d~-------~~~s~~iK~ 113 (511) T protein:vir:56 59 IPV-KELI-----------KSYRALAEYHEVDDAIQEIVDEAIVYENDK-EVVWLNL-----DN-------TDFSENIKA 113 (511) T ss_pred cch-HHHH-----------HHHHHHhhccchhhHHHHhhcceeEecCCC-ceEEEEe-----cc-------cCcchHHHH Confidence 111 1222 23444444444333222110000 00000 0000010 00 011111122 Q ss_pred hhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 79 EGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 79 ~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ...+.+..+++--+|+....+..+...+.|+.|.+.-.+ .++|-..++.++|+.+-.|..-..+ ..-.+.. T Consensus 114 kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid-----~k~GI~eLr~lDPr~i~~vr~i~~~--~~~~~~v-- 184 (511) T protein:vir:56 114 KINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILD-----KDNNIIELRPLNPMKMELVREIQKE--TIDGVEV-- 184 (511) T ss_pred HHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEec-----cccceeehhhcCcccchhhhhhhcc--ccccccc-- Confidence 344566667766689999999999999999998886432 2358788888999887665432111 0000000 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeeccccc----CccCCCccchHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRL----GNRYGRSEISPELRKVT 232 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~----~~~~G~s~i~~~v~~l~ 232 (480) .+....+-+|.+.......... ..... ..++ +||. |.|.+.-.. ......|-|.+.|+++. T Consensus 185 ----~~~~~ey~~Y~~~~~~~~~~~~-------~~~~~-~~~v-kI~~daI~y~hSGL~d~~~~~g~i~syLhkAiKp~N 251 (511) T protein:vir:56 185 ----VKGTLEYYVYKQSDYKMPSWMS-------ATNRA-QTSF-RIPKDAIVFAHSGLMRGCADDPYIIGYLDRAIKPAN 251 (511) T ss_pred ----ccceeeeeEecCCCcccCcccc-------ccccc-ccce-eechhheeeecccceeccCCCCeeeccchhhhHHHH Confidence 0011222333332211100000 00000 0111 2332 223332211 12234455656666553 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-------------hhee---c Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-------------RILT---L 283 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-------------~~~~---~ 283 (480) . -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| -+|. . T Consensus 252 Q--Lkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRRe 329 (511) T protein:vir:56 252 Q--LKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRRE 329 (511) T ss_pred h--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccC Confidence 2 2667777777777778877776555555443322110 011111 1111 2 Q ss_pred CCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccc----cchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 284 ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSE----NPASAEAIIATDSRIVKMAERKGRIFGGAWE 359 (480) Q Consensus 284 ~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~----~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 359 (480) +|.+.++.++++.+.-.-++-++-....++..-++|.+-+....+ +-.-|..|--.+.....-+.+.+..|..-+. T Consensus 330 GgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~ 409 (511) T protein:vir:56 330 GSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVIT 409 (511) T ss_pred CCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 344567888887654445666677777888888999888864321 1112335555666777778888888888888 Q ss_pred HHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHHHH---HHHhcccc----CCCHHHHHHh-CCCChhHHH Q lcl|NC_021324. 360 RAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADAVS---KLYANGQG----PIPKEQARID-LGYTATQRE 427 (480) Q Consensus 360 ~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~~~---kl~~~~~~----~~s~e~~~~~-~~~~~d~~~ 427 (480) ++++.=+-+.|---..++ ..+.+.|..-....+...++.+. .+.+...+ .+|.++++.. |..++++++ T Consensus 410 ~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~ 489 (511) T protein:vir:56 410 DPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQIT 489 (511) T ss_pred HHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHH Confidence 888754444443333333 34667775443333333333222 22222233 3489988755 789999988 Q ss_pred HHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 428 QMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) ++.+.++++..+. .-.+ +..+- T Consensus 490 ~~~k~I~~E~k~~-------~~~~------~e~~f 511 (511) T protein:vir:56 490 AMQSEIDEEETNP-------RFQQ------DDQGF 511 (511) T ss_pred HHHHHHHHhhcCC-------CCCC------cccCC Confidence 7776666554321 1111 11111 No 221 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.48 E-value=5.8e-05 Score=43.81 Aligned_cols=386 Identities=11% Similarity=0.053 Sum_probs=154.3 Q ss_pred CCCHH--HHHHHHHHHHHHHHHHHHHHHHHh----hccCcchh-cCcccchhhhcceeccchHHHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYH--EHVERLQGLLARDLPNLLEAEAYR----NGTRRLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~--~~i~~l~~~~~~~~~~~~~~~~YY----~G~~~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~ 73 (480) |.-.. -+++++-... ..++. .+-..... .+.....-..+.-....-...+|+..++-+-.-++. T Consensus 1 m~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~ 71 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKL---------IDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLK 71 (412) T ss_pred CccchhhhhhhhhhhhH---------hhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCcee Confidence 44431 1222111111 11111 00000000 000000000001111122233333333322222443 Q ss_pred c-CCCchhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCC Q lcl|NC_021324. 74 I-SEDSEGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRN 146 (480) Q Consensus 74 ~-~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~ 146 (480) + ...+.....+..++.. |. -..+...++.+.+.+|.||+++..+ ..|.+ .+..++|..+.+..+... T Consensus 72 ~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~ 145 (412) T protein:vir:26 72 MYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQS 145 (412) T ss_pred EeeccccccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCC Confidence 2 1122223334444432 32 3456677888999999999988764 34554 567788888888776542 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 147 TRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) . .+ . |......+.. ..+.++. |+||++.....+.+|.|.|.- T Consensus 146 ~-~~----~-y~~~~~~g~~---~~~~~~e-----------------------------vih~~~~~~~~~~~G~s~i~~ 187 (412) T protein:vir:26 146 R-EL----Y-YSIHAATGNK---LIVHNMD-----------------------------MLHFKHIVASNMVQGISPIDV 187 (412) T ss_pred c-EE----E-EEEEcCCceE---EEEcccc-----------------------------EEEeCCCCCCCCcccccHHHH Confidence 1 11 1 1111111110 1122222 245554334455678776532 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhh-hh-cCCCccccccccccchhh---hhcchheecCCCCceEEEecccc-HHH Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLR-VI-SGVTTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAE-LRN 300 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l-~i-~G~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~-~~~ 300 (480) +...++ ...+-....+..+..+-. ++ .+....+...+.-...|+ ...+.++.++ ++.++.+++... -.. T Consensus 188 -~~~~i~---~~~a~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~d~q 262 (412) T protein:vir:26 188 -LKNTTD---FDNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSED 262 (412) T ss_pred -HHHHHH---HHHHHHHHHHHhcCCCCceEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHH Confidence 222233 222111112223333221 22 121211111111111111 1233455554 446787776432 224 Q ss_pred HHHHHHHHHHHHHhhcCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCc Q lcl|NC_021324. 301 FAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVT 374 (480) Q Consensus 301 ~~~~l~~~~~~i~~~~~~p~~~~g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~ 374 (480) +++..+....+|+..-++|+..+|....+ -++..... ..+. ...|.-++..+. .+...... T Consensus 263 ~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~---~~f~----------~~~l~P~~~~ie~~ln~kLl~~~~~ 329 (412) T protein:vir:26 263 IVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN---RFYL----------QHTLLPIVKQYEEEFNRKLLTKTDR 329 (412) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH---HHHH----------HHHHHHHHHHHHHHHHhhcCCcccc Confidence 77777778899999999999999864322 12221111 1111 111222211111 11111111 Q ss_pred ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCc Q lcl|NC_021324. 375 EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADA 454 (480) Q Consensus 375 ~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (480) .....+++.+......+..++++++.+++.+| +++...+++.+|+.+-+ -. +.+.-... ..+ T Consensus 330 ~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~--gg------------D~~~~~~n--~~~ 391 (412) T protein:vir:26 330 EKNRYFKFNVKSYLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVE--GG------------DKPLISGD--LYP 391 (412) T ss_pred cCcceEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--Cc------------Ceeeeccc--ccc Confidence 11122444444555678899999999998875 66777778888765432 00 00000000 000 Q ss_pred CCCCCCCCCCccCCCCCCCCCcc Q lcl|NC_021324. 455 TPKPTVTETKTETQTSPSGFNRT 477 (480) Q Consensus 455 ~~~~~~~d~~~~~~~~~~~~~~~ 477 (480) -..+. +......+...+.+++ T Consensus 392 ~~~~~--~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 392 IDTPL--ELRKSLKGGDKNVNES 412 (412) T ss_pred cccch--hhcccccCCCCCcCCC Confidence 00000 0000000000111111 No 222 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=97.48 E-value=5.8e-05 Score=43.81 Aligned_cols=374 Identities=11% Similarity=0.074 Sum_probs=143.1 Q ss_pred HHHhhccCcchhcC-------cccchhhh-------cceeccchHHHHHHHHHhhhccCccc---cCCCchh-HHHHHHH Q lcl|NC_021324. 26 EAYRNGTRRLKTIG-------IGAPPELA-------YLDVQPGWVATYLRTLSDRLDIEGFR---ISEDSEG-LEELWNW 87 (480) Q Consensus 26 ~~YY~G~~~i~~~~-------~~~~~~~~-------~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~~~-~~~l~~~ 87 (480) .+.|.+........ ....+..+ -.+... .-.+|+..++-+-.-++. ...++.. ...+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~~--V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~l 78 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNSD--VLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYL 78 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccHH--HHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHH Confidence 11122211110000 00000000 011110 112333333322222332 2222221 1234444 Q ss_pred HHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeE-EEEEcccceEEEEeCCCCCceEEEEEEEEEec Q lcl|NC_021324. 88 WQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPL-IRVESPLYMYAELDPRNTRRVTRAVRLYTTRD 161 (480) Q Consensus 88 ~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~-~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~ 161 (480) +.. |. ...+...+..+.+.+|.||+.+..+. ..|.|. +..++|..+.+..++.. . ++|+.... T Consensus 79 L~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~-----~g~~~~~l~~l~p~~v~v~~~~~~--~----~~y~~~~~ 147 (417) T protein:vir:38 79 MNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDP-----ITNEPAMFEFYAPSQTQVDTSDPD--N----IIYRFTPY 147 (417) T ss_pred HhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcC-----CCCEEEEEEEeCCceEEEEEcCCC--e----EEEEEEEc Confidence 432 32 34566778888999999999987532 124443 55677777766543221 1 11111111 Q ss_pred CCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 162 DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMN 241 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~ 241 (480) +.+. ..++..+. |+||++.. ..+.+|.|.+. .+.+.+....+- T Consensus 148 ~~~~---~~~~~~~d-----------------------------viH~r~~~-~d~~~G~s~l~----~~~~~i~~~~~~ 190 (417) T protein:vir:38 148 NSSM---QKVCGFED-----------------------------VIHWKFFS-YDTIMGRSPLL----SLGDEIGLQESG 190 (417) T ss_pred CCcE---EEEecCcc-----------------------------eEEecCCC-CCCccccCHHH----HHHHHHHHHHHH Confidence 1110 01111111 35666543 34466877653 333443332222 Q ss_pred HHHHHHHh---cchhhhhcC-CCccccccccccchhhh-----hcchheecCCCCceEEEecccc-HHHHHHHHHHHHHH Q lcl|NC_021324. 242 LQSASQIL---GTPLRVISG-VTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKE 311 (480) Q Consensus 242 ~~~~~~~~---~~p~l~i~G-~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~ 311 (480) ..-...+| +.|-.++.- ..+.+...+.-...|+. ..++++.++ ++.+|..++... -..|++..+....+ T Consensus 191 ~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~~~~ 269 (417) T protein:vir:38 191 VSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVD-ATMDYQPLEVDTNVLNLINSNNYSTAQ 269 (417) T ss_pred HHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecc-CCceEEEccCCHHHHHHHHHHHhhHHH Confidence 22222333 334434331 11111111111111111 123455554 346777765332 22467777788899 Q ss_pred HHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCcccceeeeEEecC Q lcl|NC_021324. 312 AASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEEYTRLETVWRD 386 (480) Q Consensus 312 i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~~~~~~~~v~f~~ 386 (480) |+..-++|+..+|....+ +|...+... .+...|.-+++.+. .+..... .....+.|.. T Consensus 270 Ia~~fgVPp~~lg~~~~~-s~~e~~~~~-------------~~~~tl~P~~~~ie~~l~~~Ll~~~~---~~~~~~~fd~ 332 (417) T protein:vir:38 270 IAKALRVPAYRLAQNSPN-QSVKQLADD-------------YIRNDLPFYFEPITSEFELKLLDDAQ---RHQYCIGFDT 332 (417) T ss_pred HHHHhCCCHHHhCCCCcc-hhHHHHHHH-------------HHHHHHHHHHHHHHHHHHhhhcChhh---cccceEEech Confidence 999999999999854322 222111111 11122222222221 1111111 1234466621 Q ss_pred CCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhH---HHHHH-HHHHHHHHHHHHHHhhhcccCCCcCCCCCCCC Q lcl|NC_021324. 387 PSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQ---REQMR-DWDKQETEDMIDTLYSTTKAQADATPKPTVTE 462 (480) Q Consensus 387 ~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 462 (480) . ...... ...+.+++.+| +++...+++++|+.+-+ ..++. ... -...+.... .+..+.....+++ T Consensus 333 ~-~l~~~~-~~~~~~~~~~G--~~T~NE~R~~~gl~pi~~g~~d~~~~~~n----~~~~d~~~~---~~~~~~~~~kgg~ 401 (417) T protein:vir:38 333 K-SVNGLP-IADVNTAVNGG--LWTGNEGRAELGKKPLKDPNMDRIQSTLN----TVFLDQKEA---YQAEHAAELKGGD 401 (417) T ss_pred h-hhhHHH-HHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCeeeeccc----ccccccccc---cccccccccCCCC Confidence 1 111122 22355666654 66777777777764211 11110 000 000000000 0011112223344 Q ss_pred CCccCCCCCCCCCccc Q lcl|NC_021324. 463 TKTETQTSPSGFNRTK 478 (480) Q Consensus 463 ~~~~~~~~~~~~~~~~ 478 (480) ++.++....++.|..+ T Consensus 402 ~~~~~~~~~~~~~~~~ 417 (417) T protein:vir:38 402 TNAKGNQNGSGTNANS 417 (417) T ss_pred CCCCCCCcCCCCcCCC Confidence 4444443333333222 No 223 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=97.47 E-value=6e-05 Score=43.73 Aligned_cols=423 Identities=11% Similarity=-0.019 Sum_probs=162.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccc--hhhhcceeccchHHHHHHHHHhhhc----c--Cccc-c Q lcl|NC_021324. 4 YHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAP--PELAYLDVQPGWVATYLRTLSDRLD----I--EGFR-I 74 (480) Q Consensus 4 ~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~--~~~~~~~~~~n~~~~iVd~~~~~l~----~--~~~~-~ 74 (480) .+..++.+..+++ +.+.....++||+ +.+|.+..... ....-.++..+-+...++++++.|. + .+|+ . T Consensus 1 mk~~~~~~~~~lk-r~~~e~~w~e~a~--~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF~l 77 (510) T protein:vir:78 1 MKSTAAMLWEKLR-DGSVEQRAIEFAK--TTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAKLARSLFPTGIPFFRS 77 (510) T ss_pred ChhHHHHHHHHHh-ccchHHHHHHHHH--hhccccccCCCCcccccccCcccchHHHHHHHHHHHHHHhhcCCCCccccc Confidence 4555666666553 2222233333442 22222211000 0001113445556777777776663 2 1232 1 Q ss_pred --CC--------Cch----hH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEE Q lcl|NC_021324. 75 --SE--------DSE----GL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVE 133 (480) Q Consensus 75 --~~--------d~~----~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~ 133 (480) ++ +.. .. ..+...+..++|.....++.++...+|.+.+++..+ .+ +++.+ T Consensus 78 ~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~-------~~--~~~~~ 148 (510) T protein:vir:78 78 ELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRNSD-------EA--TVVAW 148 (510) T ss_pred CCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEeCC-------CC--eEEEE Confidence 11 001 11 124445667889999999999999999986655321 22 35556 Q ss_pred cccceEEEEeCCCCCceEEEEEEEEEe-------c---------CCceEEEEEEEeC-----Cc-----EEEEEecCCCc Q lcl|NC_021324. 134 SPLYMYAELDPRNTRRVTRAVRLYTTR-------D---------DVAVPDRATLYLP-----DE-----TVPLRRNGGLN 187 (480) Q Consensus 134 ~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~---------~~~~~~~~~~~~~-----~~-----~~~~~~~~~~~ 187 (480) +-.+.++.-|. .+ ++...++.+... . .......+++|+. +. .+++..++. T Consensus 149 pl~~y~v~~d~-~G-~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~-- 224 (510) T protein:vir:78 149 SLRSYAVRRDA-TG-RWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQRRKGTAMDYAEMYHEIDGV-- 224 (510) T ss_pred EcceeEEeeCC-Cc-CeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEEeecCCCCcEEEEEEEecCe-- Confidence 55553333333 22 333333333211 0 0011122333321 10 111111110 Q ss_pred cccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccc Q lcl|NC_021324. 188 DQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTND 267 (480) Q Consensus 188 ~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~ 267 (480) ........++..+|++.++-+...++.||+|-.. +..+=+..+|.+.-...........|.+.+. -+. . . T Consensus 225 ----~i~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~-p~g--~--~ 294 (510) T protein:vir:78 225 ----RVGETGRWPIHLCPYIVPTWNLAPGEHYGRGHVE-DYIGDFAKLSLLSEKLGLYELESLEVLNLVD-EAK--G--A 294 (510) T ss_pred ----eeccccccccccCCeeeeeeeecCCCccccchHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCcccC-Ccc--c--c Confidence 0011122345678999888888889999999653 3455556666665555555555555543331 000 0 0 Q ss_pred cccchhhhhcchheecCCCCceEEEecc-ccHHH---HHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHH Q lcl|NC_021324. 268 GENTTLDIYYGRILTLASEAAKISEFKA-AELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRI 343 (480) Q Consensus 268 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l 343 (480) ..........+.+..-..++....++.. ++++. .++.++.-|...+... +. ..++.. .+++.++..-..+ T Consensus 295 ~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~-l~----~~~~~r-vTAtEV~~r~~E~ 368 (510) T protein:vir:78 295 VVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG-AN----QRDAER-VTAEEVRITAEEA 368 (510) T ss_pred chhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhhc-cc----cCCCCC-cCHHHHHHHHHHH Confidence 0000111122222221112223333322 23443 3344444444333321 10 111111 2344344332222 Q ss_pred HHHH----HH-HHHHHHHHHHHHHHHHHHHhCC-CCcc-cceeeeEEecCCCCCCHHHHHHHHH---HHHhcccc----- Q lcl|NC_021324. 344 VKMA----ER-KGRIFGGAWERAMRIAMQIMGR-EVTE-EYTRLETVWRDPSTPTVAAKADAVS---KLYANGQG----- 408 (480) Q Consensus 344 ~~k~----~~-~~~~~~~~l~~~~~li~~~~~~-~~~~-~~~~~~v~f~~~~~~~~~~~ad~~~---kl~~~~~~----- 408 (480) .... .+ ....+.+-+.+++.++..- +. .... ......|++-.++.+ .+.++.+. +.++...+ T Consensus 369 ~~~LGpv~~rl~~E~l~Pli~r~~~il~r~-gl~p~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~q~l~~~~~~~q~~ 445 (510) T protein:vir:78 369 ENTLGGTYSLLAENLQSPLAYVCLSEVDDA-LLQGLITKQHKPAIETGLPALSR--SAAVQSMLNASQVIAGLAPIAQLD 445 (510) T ss_pred HHHhhHHHHHHHHHHHHHHHHHHHHHHHhc-cCCCCCcccccceeeecccHHHH--HHHHHHHHHHHHHHHHhcChhhhh Confidence 2111 11 1122233333334433221 10 1111 122333455333322 22222222 22221111 Q ss_pred -CCCH----HHHHHhCCCC-------hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 409 -PIPK----EQARIDLGYT-------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 409 -~~s~----e~~~~~~~~~-------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) .+.. +.+...+|.+ +++++++.+.+.+++.....+..+...+.++.+..+.+- T Consensus 446 ~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 446 PRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred hcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccCCCC Confidence 1222 2333445652 444544433222222111111122222222222222222 No 224 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=97.39 E-value=7.8e-05 Score=43.10 Aligned_cols=409 Identities=11% Similarity=0.062 Sum_probs=191.2 Q ss_pred CCCH------HHHHHH----HHH---HHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTY------HEHVER----LQG---LLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~------~~~i~~----l~~---~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) +..- .-+.+. +.. .-..-..+|+.+..+++-+..+. .||+..+-+= T Consensus 45 I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~---------------------eIVneaiv~d 103 (524) T protein:vir:10 45 IETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNYEVDNAVQ---------------------EIVSDAIVYE 103 (524) T ss_pred eccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhccchhhHHH---------------------HhhcceeEec Confidence 0000 000000 000 11112223333333333332221 1121111000 Q ss_pred -ccCc---------cccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccc Q lcl|NC_021324. 68 -DIEG---------FRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLY 137 (480) Q Consensus 68 -~~~~---------~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~ 137 (480) ..++ ++-.-.++..+.+..|++--+|+....+..+...+.|+.|.+.-.+. ....+|-..++.++|.. T Consensus 104 ~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~--~~pk~GI~Elr~lDPr~ 181 (524) T protein:vir:10 104 DDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINP--KKMKDGVQELRRLDPRQ 181 (524) T ss_pred CCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeC--CCccccceeeeeeCCcc Confidence 0001 11111223445566676666899999999999999999998854321 12346778888899987 Q ss_pred eEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccce--EEeeccccc Q lcl|NC_021324. 138 MYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPV--VPLTNDPRL 215 (480) Q Consensus 138 ~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPv--v~~~n~~~~ 215 (480) +-.|.--... ....+..+ +....+-+|.++.- .|...+. ...... ++ +||- |.|.+.-.. T Consensus 182 i~~vr~i~~~--~~~~~~vi------~~~~e~f~Y~~~~~-~~~~~~~----~~~~~~----~i-kI~~dAIvy~~SGL~ 243 (524) T protein:vir:10 182 VQYIREIVTR--MEDGVKIV------DGYREFFVYDTGHE-SYCADGR----IYSAGT----KV-KIPRAAVVYAHSGLL 243 (524) T ss_pred ceeeeeeccc--Ccccchhh------cchhhheeecCCCc-ccccCcc----eecCCc----ce-ecchhheeeeccCcc Confidence 7554321110 00000000 00111222332211 1111110 000000 11 2322 222221111 Q ss_pred Cc--cCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhcc-- Q lcl|NC_021324. 216 GN--RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYYG-- 278 (480) Q Consensus 216 ~~--~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~~-- 278 (480) .. ..-.|-|.+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- .....| T Consensus 244 d~~~~~i~syLhkAiKp~NQ--Lkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev 321 (524) T protein:vir:10 244 DCCGKNIIGYLQRAIKPANQ--LKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKI 321 (524) T ss_pred cCCCCceeccchHhhHHHHh--hHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCee Confidence 11 1112334455555422 2667777777777778877776555555543322110 011111 Q ss_pred -----------hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc---ccchHHHHHHHHHH Q lcl|NC_021324. 279 -----------RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAIIATDS 341 (480) Q Consensus 279 -----------~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~~~~Sg~Al~~~~~ 341 (480) -+|. .+|.+.++.+++..+.-.-++-++-....++..-++|.+-++... -+..-|..|--.+. T Consensus 322 ~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEi 401 (524) T protein:vir:10 322 KNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDEL 401 (524) T ss_pred ccchhhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHH Confidence 1111 234456788888765444566667777888888899988885332 12223345555666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc----eeeeEEecCCCCCCHHHHHHH-------HHHHHhccccCC Q lcl|NC_021324. 342 RIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTVAAKADA-------VSKLYANGQGPI 410 (480) Q Consensus 342 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~----~~~~v~f~~~~~~~~~~~ad~-------~~kl~~~~~~~~ 410 (480) ....-+.+.+..|..-+.++++.=+-+.|.--..++ ..+.+.|..-....+...++. +.++-.-..-.+ T Consensus 402 KF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~ 481 (524) T protein:vir:10 402 KFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYI 481 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 777778888888888888888854444443333333 346677754433333333332 222222111245 Q ss_pred CHHHHHHh-CCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 411 PKEQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 411 s~e~~~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) |.++++.. |..++++++++.+.++++..+. .-.++++..+.- T Consensus 482 s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~-------~~~~~~~~~~~f 524 (524) T protein:vir:10 482 SHQTAMKDFLQMTDEEINQEAKQIEEESKEA-------RFQNPDEEEEDF 524 (524) T ss_pred hhHHHHHHHhccCHHHHHHHHHHHHHHhhcC-------CCCCCChhhhcC Confidence 88888755 7899999888776666654311 111222222222 No 225 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=97.37 E-value=8.3e-05 Score=42.95 Aligned_cols=410 Identities=10% Similarity=0.043 Sum_probs=167.4 Q ss_pred CCCHHHHHHH-----HHHHHHHHHHHHHHHHHHhhccCcchhcCccc------c----------hhhhcceeccchHHHH Q lcl|NC_021324. 1 MTTYHEHVER-----LQGLLARDLPNLLEAEAYRNGTRRLKTIGIGA------P----------PELAYLDVQPGWVATY 59 (480) Q Consensus 1 ~~~~~~~i~~-----l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~------~----------~~~~~~~~~~n~~~~i 59 (480) |..+-+.--+ .+.+. ..+++.-..+-+.+ |+...+.... . +-+.++.-......-. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~--~~~~~~~~~~~~~~-~~~~gltp~~l~~iLr~a~~gd~~~~~~L~e~m~e~D~~i~s~ 77 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREP--QTSRLAGLAKEFAQ-HPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAE 77 (526) T ss_pred CCeeECCCCCccccccccch--hhhhhhhhhhhhcc-cCcCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHH Confidence 1111110000 00000 00111111122211 1111000000 0 0000000012222333 Q ss_pred HHHHHhhhccCccccC---C----CchhHHHHHHHHHhc-CHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEE Q lcl|NC_021324. 60 LRTLSDRLDIEGFRIS---E----DSEGLEELWNWWQAN-DLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLI 130 (480) Q Consensus 60 Vd~~~~~l~~~~~~~~---~----d~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~ 130 (480) +++-...+....|++. + +.+..+.+.+++..- +|...+..+. ++.-+|.+ ++++|... +|...+ T Consensus 78 l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~------~g~~~~ 150 (526) T protein:vir:99 78 MSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQ------GREWMP 150 (526) T ss_pred HHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeec------CCceeE Confidence 3333333334455441 1 223345677777553 5777777654 68889977 45555421 233222 Q ss_pred ---EEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceE Q lcl|NC_021324. 131 ---RVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVV 207 (480) Q Consensus 131 ---~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 207 (480) .+++|+.. .|++... . ...++.. .. ..... .+++. + T Consensus 151 ~~l~~r~~~~f--~~~~~~~----------------~------------~l~~~~~-~~------~g~~l-~~~k~---i 189 (526) T protein:vir:99 151 LAFHHRPQSWF--QLNPEDQ----------------N------------ELRLRDN-SP------AGEAL-QPFGW---I 189 (526) T ss_pred EEeeeecccce--eeccCCC----------------c------------EEEecCC-CC------Cceee-cCCCe---E Confidence 22222111 1111111 0 0111100 00 00111 12222 2 Q ss_pred EeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhcchheec Q lcl|NC_021324. 208 PLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTL 283 (480) Q Consensus 208 ~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~----~~~~~~~~~~~~ 283 (480) .+.+....+.++|.+-+..-.-.. =-=+..+.+++.-++.|+.|+++.+- +....++.... +..+..+....+ T Consensus 190 ~~~~~~~~g~p~g~gLlr~~~w~~-~fK~~~~~~w~~f~E~yG~P~~igky--~~~a~~~ek~~L~~av~~i~~d~~~ii 266 (526) T protein:vir:99 190 IHRPRARSGYVARSGLFRVLAWPY-LFRHYATSDLAEMLEIYGLPIRLGKY--PPGTADEEKATLLRAVTGLGHAAAGII 266 (526) T ss_pred EEeecCCcCCccccchHHHHHHHH-HHHHhhHHHHHHHHHHcCCceEEEec--CCCCCHHHHHHHHHHHHHHhhCcEEEe Confidence 334556667788887664321111 11255778888888999999888762 11111111111 122333333333 Q ss_pred -CCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc-ccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_021324. 284 -ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-ENPASAEAII-ATDSRIVKMAERKGRIFGGAWE- 359 (480) Q Consensus 284 -~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~- 359 (480) .|...+|.+....+...|...++..-.+|+...- -+.+.... .+..+..|+. ....-....+..-.+.+...|. T Consensus 267 P~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL--GqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~ 344 (526) T protein:vir:99 267 PETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAVL--GGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSR 344 (526) T ss_pred cCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHHh--hhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3445667665555556677777766667766520 01111110 0111112322 2222233344445566666674 Q ss_pred HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021324. 360 RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 360 ~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~ 439 (480) .+++.++.+............+++|....+.++.+.++.+.+++..|. .++.+.+.+.+|+......+- T Consensus 345 ~Li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~-~i~~~~i~e~~Gip~~~~~e~---------- 413 (526) T protein:vir:99 345 DLLWPLLVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGL-EIPSAWVYDKLGIPQPAKNEP---------- 413 (526) T ss_pred HHHHHHHHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCC-ccCHHHHHHHhCCCCCCCccc---------- Confidence 477777777654332222346778888888999999999999998874 479999999999853321100 Q ss_pred HHHHHhhhccc-CCCcCCCCCCC-CCCccCCCCCCCCC--c-ccCC Q lcl|NC_021324. 440 MIDTLYSTTKA-QADATPKPTVT-ETKTETQTSPSGFN--R-TKTR 480 (480) Q Consensus 440 ~~~~~~~~~~~-~~~~~~~~~~~-d~~~~~~~~~~~~~--~-~~~~ 480 (480) .+. ..... .+.+.+..... ......+..|.... . ...+ T Consensus 414 ~l~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~~ 456 (526) T protein:vir:99 414 VLR---SAAQPAILSRQHGQRVAALATIVGPRYGDQQALDKALADL 456 (526) T ss_pred ccC---CCCCCcccccccccccccccccccccCcchhhHHHHHHHH Confidence 000 00000 00000000000 00000000000000 0 0000 No 226 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=97.36 E-value=8.6e-05 Score=42.86 Aligned_cols=412 Identities=12% Similarity=0.050 Sum_probs=168.6 Q ss_pred CCCHHHHHHHHHHHH--H-HHHHHHHHHHHHhhccCcchhcCcc-------------cc---hhhhcceeccchHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLL--A-RDLPNLLEAEAYRNGTRRLKTIGIG-------------AP---PELAYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~~~~~i~~l~~~~--~-~~~~~~~~~~~YY~G~~~i~~~~~~-------------~~---~~~~~~~~~~n~~~~iVd 61 (480) |..+-+...+-++.- . ...++..-..+-+.+ |....+... .. +-+..+.-......-.++ T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~-~~~~gltp~~l~~il~~a~~gd~~~~~~L~~~m~e~D~~i~s~l~ 79 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFAN-HPAKGLTPAKLAHILIEAEQGHLQAQAELFMDMEERDAHLFAEMS 79 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcc-cCCCCCCHHHHHHHHHhhhCCCHHHHHHHHHHHHhhChHHHHHHH Confidence 222211110000000 0 000111122222211 111100000 00 000000001233333444 Q ss_pred HHHhhhccCccccC---CC----chhHHHHHHHHHhc-CHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEEE- Q lcl|NC_021324. 62 TLSDRLDIEGFRIS---ED----SEGLEELWNWWQAN-DLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIR- 131 (480) Q Consensus 62 ~~~~~l~~~~~~~~---~d----~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~~- 131 (480) +-...+....|++. ++ .+..+.+.+.+..- +|+..+.. ..++..+|.+ ++++|... +|...+. T Consensus 80 ~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~~~------~g~~~~~~ 152 (528) T protein:vir:10 80 KRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIELDWSLQ------GREWLPQA 152 (528) T ss_pred HHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEEEEEeec------CCceeEEE Confidence 44444445555542 12 22334566666542 46666665 4668889977 45556321 2333222 Q ss_pred --EEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEe Q lcl|NC_021324. 132 --VESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPL 209 (480) Q Consensus 132 --~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 209 (480) .++|+.. .|++... .+++...+.. ..... .+++ ++.+ T Consensus 153 ~~~r~~~~f--~~~~~~~------------------------------~~l~~~~~~~-----~g~~l-~~~k---~iv~ 191 (528) T protein:vir:10 153 FDHRPQSWF--QLNPDDQ------------------------------DELRLRDNSI-----AGEVL-QPFG---WIMH 191 (528) T ss_pred eeeecccce--eeccCCC------------------------------cEEeccCCCC-----Cceee-cCCC---eEEE Confidence 2222110 0111110 0111000000 00111 1222 2334 Q ss_pred ecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhcchheec-C Q lcl|NC_021324. 210 TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTL-A 284 (480) Q Consensus 210 ~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~----~~~~~~~~~~~~-~ 284 (480) .+....+.++|.+-+.. +-...=-=+..+.+++.-++.|+.|+++.+-- ....++.... +..+..+....+ . T Consensus 192 ~~~~~~g~p~g~gLlr~-~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~--~~a~~~ek~~L~~al~~i~~~~~~iiP~ 268 (528) T protein:vir:10 192 KPRSRSGYVARSGLFRV-LAWPYLFKHYSTADLAEMLEIYGLPIRLGKYP--PGTPDEEKVTLLRAVTGLGHAAAGIIPE 268 (528) T ss_pred eecCCCCCccccchHHH-HHHHHHHHHhhHHHHHHHHHHcCCCeEEEecC--CCCCHHHHHHHHHHHHHHhhCcEEEecC Confidence 45566677888876643 22222222667788888899999998877621 1111111111 122233333333 3 Q ss_pred CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccc-cccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_021324. 285 SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSS-SENPASAEAII-ATDSRIVKMAERKGRIFGGAWE-RA 361 (480) Q Consensus 285 g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~-~~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~ 361 (480) |...+|.+....+...|...++..-.+|+...- -+.+... +.+-.+..|+. ....-....+..-.+.+...|. .+ T Consensus 269 ~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iL--GqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~l 346 (528) T protein:vir:10 269 SMSIDFQEASKGSAEPFMAMMRWCDDSMSKAIL--GGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRDL 346 (528) T ss_pred CceeEEeecCCCChhHHHHHHHHHHHHHHHHHh--hhhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455677766555566677777766666666430 0111110 00001222322 1222333444445566666674 46 Q ss_pred HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 362 MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMI 441 (480) Q Consensus 362 ~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 441 (480) +.-++.+..........-.++.|....+.++.+.++.+.+++..|. .++.+.+.+.+|+......+. .. T Consensus 347 i~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~-~i~~~~i~e~~gip~p~~~e~----------~~ 415 (528) T protein:vir:10 347 LWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGV-QVPVNWVQEQLGIPLPANGEA----------VL 415 (528) T ss_pred HHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCC-CCCHHHHHHHhCCCCCCCCcc----------cc Confidence 6666666644332222345778888888999999999999998874 479999999999754321110 00 Q ss_pred HHHhhhcccCCCcCCCCCCCCCCccCCC-CCCCCCcc------cC---C Q lcl|NC_021324. 442 DTLYSTTKAQADATPKPTVTETKTETQT-SPSGFNRT------KT---R 480 (480) Q Consensus 442 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~------~~---~ 480 (480) .......+.+.............+. .+...... .+ + T Consensus 416 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 461 (528) T protein:vir:10 416 ---GDQAGAGIAQLSRRPGPRIAALAQVIGPRYRDQEALDQVLASLPAQ 461 (528) T ss_pred ---cCCCcccccccCcccccccccccccccccccccchHHHHHHHHHHH Confidence 0000000000000000000000000 00000000 00 0 No 227 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.32 E-value=9.6e-05 Score=42.61 Aligned_cols=389 Identities=11% Similarity=0.046 Sum_probs=151.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchh-cCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKT-IGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDS 78 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~-~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~ 78 (480) |.. +-+++++-+...... +..-..+-++... .+.....-..+.-....-...+|+..+.-+-.-++.+- ..+ T Consensus 1 ~~~-~~~~~~~k~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~V~~ci~~ia~~ia~lp~~~~~~~~ 74 (409) T protein:vir:96 1 MAK-ENIVTRIKKKLIDNW-----IDQSASKLYDFSPWKNKSFWGVINNTLETNETIFSAITKLSNSMASLPLKMYEDYK 74 (409) T ss_pred Ccc-ccchhhhhhHHhhhh-----hccccccccccccccCccccccchhhHhhhHHHHHHHHHHHHhhhhCceEEeeccc Confidence 444 222222211110000 0000001011100 00000000001011111222233333322222233221 111 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+.+++.. | .-..+...++.+.+.+|.||+++..+ ..|.+ .+..++|..+.++.++... . T Consensus 75 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~~~-~--- 144 (409) T protein:vir:96 75 VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPDVVEMLIENQSR-E--- 144 (409) T ss_pred ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEEC------CCCcEEEEEEEcCceeEEEEeCCCc-E--- Confidence 222334455432 3 23456677888999999999988653 34554 5667788888777654321 1 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) + +|......+.. ..|.++ -|+||++.....+.+|.|.+.- +...+ T Consensus 145 -~-~y~~~~~~g~~---~~~~~~-----------------------------evih~r~~~~~~~~~G~s~l~~-~~~~i 189 (409) T protein:vir:96 145 -L-YYSIHAATGNK---LIVHNM-----------------------------DMLHFKHIVASNMVQGISPIDV-LKNTT 189 (409) T ss_pred -E-EEEEEcCCceE---EEEccc-----------------------------cEEEeCCCCCCCccccccHHHH-HHHHH Confidence 1 11111111110 011111 1456654434456778886532 33333 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhh--cCCCccccccccccchhh---hhcchheecCCCCceEEEecccc-HHHHHHHHH Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVI--SGVTTDELTNDGENTTLD---IYYGRILTLASEAAKISEFKAAE-LRNFAEEME 306 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i--~G~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~ 306 (480) +..+.+ . ...+..++.+-.++ .+....+...+.-...|+ ...++++.++ ++.++.+++... -..+++..+ T Consensus 190 ~~~~~~-~--~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~~~ 265 (409) T protein:vir:96 190 DFDNAV-R--TFNLTEMQKPDSFMLKYGSNVSTEKRQQVLEDFKQYYEENGGILFQE-PGVEIEPLPKKYVSEDIVASEN 265 (409) T ss_pred HHHHHH-H--HHHHHhcCCCceeEEecCCCCCHHHHHHHHHHHHHHhhcCCCeeecC-CCceEEEcCCChhHHHHHHHHH Confidence 322211 1 11222223322122 222222111111111111 1223455554 446787776432 224677777 Q ss_pred HHHHHHHhhcCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCcccceee Q lcl|NC_021324. 307 VFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEEYTRL 380 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~~~~~~~ 380 (480) ....+|+..-++|+..+|....+ -++..... .. .+...|.-++..+. .+...........+ T Consensus 266 ~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~---~~----------f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~g~~i 332 (409) T protein:vir:96 266 LTRERVANVFQLPSIFLNARSNTNFAKNEELN---RF----------YLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYF 332 (409) T ss_pred HHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH---HH----------HHHHHHHHHHHHHHHHHHhhcCCcccccCcceE Confidence 78899999999999999864322 11211111 11 11111222222111 11111111111223 Q ss_pred eEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCC Q lcl|NC_021324. 381 ETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 381 ~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) ++........+..++++++.+++.+| +++...+++.+|+.+-+- . +.+..... ..+-..+.. T Consensus 333 ~fd~~~ll~~d~~~~~e~~~~~~~~G--~~T~NE~R~~~g~~pi~g--g------------D~~~~~~n--~~~~~~~~~ 394 (409) T protein:vir:96 333 KFNVKSYLRADSATQAEVYFKAVRSG--YYTINDIREWEDLPPVEG--G------------DKPLISGD--LYPIDTPLE 394 (409) T ss_pred EeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCC--c------------ceeeeccc--ccccccchh Confidence 33334455568889999999998876 566666777777643320 0 00000000 000000000 Q ss_pred CCCCccCCCCCCCCCcc Q lcl|NC_021324. 461 TETKTETQTSPSGFNRT 477 (480) Q Consensus 461 ~d~~~~~~~~~~~~~~~ 477 (480) .+....+.....+++ T Consensus 395 --~~~~~~gG~~n~~e~ 409 (409) T protein:vir:96 395 --LRKSLKGGDKNVNES 409 (409) T ss_pred --hcccccCCCCCcCCC Confidence 000000000011111 No 228 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=97.31 E-value=9.8e-05 Score=42.57 Aligned_cols=389 Identities=12% Similarity=0.005 Sum_probs=156.6 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc---cCCCc--- Q lcl|NC_021324. 6 EHVERLQGLLARD-LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISEDS--- 78 (480) Q Consensus 6 ~~i~~l~~~~~~~-~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~--- 78 (480) =+...+.++.... ......+...+-+.... ..+..+..+. -+.+.-...+|+..++-+-.-++. ..++. T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~-~~g~~v~~~~---~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~~~ 76 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDT-YTGKRISSQR---AMRLTAVYSCVRVLAESVGMLPCSLYKISGTLKTR 76 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCccc-ccCceechhh---hhccHHHHHHHHHHHHhhhhCceEEEEecCCccee Confidence 0111111111000 00011111122111100 0111111110 011122223344333333222332 22111 Q ss_pred hhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 79 EGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 79 ~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) .....+..++.. | ....+...+..+.+.+|.||+++..+ .|+| .+..++|..+.+..+... .+. T Consensus 77 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-------~g~~~~L~~l~~~~v~~~~~~~~--~~~- 146 (413) T protein:vir:48 77 VVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-------LGEVVELLPIDPGCVEPKLNSQW--QPV- 146 (413) T ss_pred ecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-------CCcEEEEEEEcCceEEEEEcCCc--eEE- Confidence 112234455432 2 34567788889999999999988643 3444 466788888887776432 111 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) |......+.. ..|.++.+ +|+++.. ..+++|.|.|.. +...+ T Consensus 147 ----y~~~~~~g~~---~~~~~~ev-----------------------------ih~~~~~-~d~~~G~s~i~~-~~~~i 188 (413) T protein:vir:48 147 ----YQVTFPDGSV---DVLTQDEI-----------------------------WHVRTLT-LDGLVGLNPIAY-AREAI 188 (413) T ss_pred ----EEEEecCceE---EEEccccE-----------------------------EEecCcC-CCCcccccHHHH-HHHHH Confidence 1111111211 11233333 3443322 234677775532 33333 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCCC-ccccccccccchhhh------hcchheecCCCCceEEEecccc-HHHHHHH Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVISGVT-TDELTNDGENTTLDI------YYGRILTLASEAAKISEFKAAE-LRNFAEE 304 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i~G~~-~~~~~~~~~~~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~ 304 (480) +....+.....+.+...+.|-.++.... ...+..+.-...|+. ..++++.++ ++.++.++.... -..+++. T Consensus 189 ~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~~~~d~q~~e~ 267 (413) T protein:vir:48 189 SLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILE-MGLDWKSMALNAEDSQFLET 267 (413) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEeccCChhHHHHHHH Confidence 3333222222222333344554544211 111111111111111 123455554 346777776432 2236788 Q ss_pred HHHHHHHHHhhcCCChHHhccccc-cchHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSE-NPASAEAII--ATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~Al~--~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) .+....+|+..-++|+..+|.... +.++..... +....|.-.+. .+++.+. ..+....... ...++ T Consensus 268 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P~~~--------~ie~~l~--~~L~~~~~~~-~~~~~ 336 (413) T protein:vir:48 268 RKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVPYLT--------RIEQRIN--TGLVRESKQG-KFYAK 336 (413) T ss_pred HHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHHHHH--------HHHHHHH--hhccCccccC-CeEEE Confidence 888899999999999999986432 212222111 11111111111 1111111 0111111111 12344 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.+......|..++++++.+++++| +++...+++++|+.+-+- . +.+.-... . .+.+..+ T Consensus 337 fd~~~l~~~d~~~~~~~~~~~~~~g--~~T~NE~R~~~g~~p~~g--g------------D~~~~~~n--~--~~~~~~~ 396 (413) T protein:vir:48 337 FNAGALLRGDMKSRFEAYATGINWG--IYSPNDCRDLEDMNPRPG--G------------DVYLTPMN--M--TTSPSAG 396 (413) T ss_pred EechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------ceeecccc--c--ccccccc Confidence 5555566678889999999988876 566666777777653320 0 00000000 0 0111111 Q ss_pred CCCccCCCCCCCCCcccC Q lcl|NC_021324. 462 ETKTETQTSPSGFNRTKT 479 (480) Q Consensus 462 d~~~~~~~~~~~~~~~~~ 479 (480) ++++ .+++.+..++++| T Consensus 397 ~~~~-~~~~~~~~~~~~~ 413 (413) T protein:vir:48 397 DDNG-KKKESGDADKTAS 413 (413) T ss_pred ccCC-CCCCCCCccccCC Confidence 1111 1111223333333 No 229 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=97.30 E-value=0.0001 Score=42.46 Aligned_cols=392 Identities=11% Similarity=0.063 Sum_probs=165.1 Q ss_pred CC--------------CHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhh Q lcl|NC_021324. 1 MT--------------TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~--------------~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 66 (480) ++ |+..+-.-|...-.-...++.. -|+ ++.. ......-.+.+-... T Consensus 25 ~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~~~~~~---L~e---dm~e--------------~D~~i~s~l~~Rk~a 84 (526) T protein:vir:79 25 LAGLAKEFAQHPAKGLTPAKLARILVEAEQGNLQAQAE---LFM---DMEE--------------RDAHLFAEMSKRKRA 84 (526) T ss_pred hhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCHHHHHH---HHH---HHHh--------------hChHHHHHHHHHHHH Confidence 11 2222222111111111111111 111 0100 012222223333333 Q ss_pred hccCccccC---C----CchhHHHHHHHHHhc-CHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEEE---EEc Q lcl|NC_021324. 67 LDIEGFRIS---E----DSEGLEELWNWWQAN-DLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIR---VES 134 (480) Q Consensus 67 l~~~~~~~~---~----d~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~~---~~~ 134 (480) +....|++. + +.+..+.+.+++..- +|...+..+ .++.-+|.+ ++++|... +|...+. ..+ T Consensus 85 v~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~-ldA~~~G~s~~Ei~w~~~------~g~~~~~~l~~r~ 157 (526) T protein:vir:79 85 ILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDA-LDGIGHGYSCIELEWALQ------GREWMPLAFHHRP 157 (526) T ss_pred HhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHH-HhhhhhcceeEEEEEeec------CCceeEEEeeeec Confidence 334445431 1 223445577777553 577777765 558889977 45555421 2332222 222 Q ss_pred ccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccc Q lcl|NC_021324. 135 PLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) Q Consensus 135 p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~ 214 (480) |+.. .||+... ..++...... ...+. .+++. +.+.+... T Consensus 158 ~~~F--~~~~~~~------------------------------~~l~~~~~~~-----~g~~l-~~~k~---iv~~~~~~ 196 (526) T protein:vir:79 158 QSWF--QLNPEDQ------------------------------NELRLRDNSP-----AGEAL-QPFGW---IIHRPRAR 196 (526) T ss_pred ccce--EeccCCC------------------------------cEEEecCCCC-----Cceee-cCCce---EEEeecCC Confidence 2111 1111111 0011000000 00111 12222 33455666 Q ss_pred cCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhcchheec-CCCCce Q lcl|NC_021324. 215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTL-ASEAAK 289 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~----~~~~~~~~~~~~-~g~~~~ 289 (480) .+.++|.+-+..-.-+.+ -=+..+.+++.-++.|+.|+++.+- ++...++.... +..+..+....+ .|...+ T Consensus 197 ~g~p~g~gLlr~~~w~~~-fK~~~~~~w~~F~E~yG~P~~igky--~~~a~~~ek~~L~~av~~i~~da~~iiP~~~~ie 273 (526) T protein:vir:79 197 SGYVARSGLFRVLAWPYL-FRHYATSDLAEMLEIYGLPIRLGKY--PPGTADEEKATLLRAVTGLGHAAAGIIPETMAID 273 (526) T ss_pred cCCccccchHHHHHHHHH-HHHhhHHHHHHHHHHcCCceEEEec--CCCCCHHHHHHHHHHHHHHhcCcEEEecCCceeE Confidence 677888876543211211 1155778888888999999887762 11111111111 122333333333 344566 Q ss_pred EEEeccccHHHHHHHHHHHHHHHHhhc-CCChHHhccc-cccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_021324. 290 ISEFKAAELRNFAEEMEVFRKEAASIT-GLPPQYLSSS-SENPASAEAII-ATDSRIVKMAERKGRIFGGAWE-RAMRIA 365 (480) Q Consensus 290 ~~~~~~~~~~~~~~~l~~~~~~i~~~~-~~p~~~~g~~-~~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~~~li 365 (480) |.+....+...|...++..-.+|+... | +.+... +.+..+.-|+. ....-....+..-.+.+...|. .+++.+ T Consensus 274 ~~ea~~~~~~~f~~li~~~d~~Isk~iLG---qtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l 350 (526) T protein:vir:79 274 FQQAAQGSSEPFLAMMRQSEDAISKAVLG---GTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQLAATLSRDLLWPL 350 (526) T ss_pred EeecCCCCHHHHHHHHHHHHHHHHHHHhh---hhhccccccCcchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 776555555667777776666776652 1 111110 00111112222 1222233334444456666674 477766 Q ss_pred HHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH-HHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 366 MQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR-EQMRDWDKQETEDMIDTL 444 (480) Q Consensus 366 ~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~-~~~~~~~~~~~~~~~~~~ 444 (480) +.+..........-.++.|....+.++.+.++.+.+++..|. .++.+.+.+.+|+..... +.+. ... T Consensus 351 ~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~-~i~~~~i~e~~gip~~~~~e~~l-----------~~~ 418 (526) T protein:vir:79 351 LVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGL-EIPSAWVYDKLGIPQPAKNEPVL-----------RPA 418 (526) T ss_pred HHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCC-cCCHHHHHHHhCCCCCCCchhhc-----------ccc Confidence 766654332222345678888888899999999999998874 479999999999853321 1110 000 Q ss_pred hhhcccCCCcCCCCCCC-CCCccCCCCCC--CCCcccCC Q lcl|NC_021324. 445 YSTTKAQADATPKPTVT-ETKTETQTSPS--GFNRTKTR 480 (480) Q Consensus 445 ~~~~~~~~~~~~~~~~~-d~~~~~~~~~~--~~~~~~~~ 480 (480) .. +..+...+..... .........+. ..+..-.+ T Consensus 419 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~l~~ 455 (526) T protein:vir:79 419 AQ--PAILSRQHGQRVAALATIVGPRYGDQQALDKALAD 455 (526) T ss_pred CC--ccccccccccccccccccccccCchhhHHHHHHHH Confidence 00 0000000000000 00000000000 00000000 No 230 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=97.29 E-value=0.0001 Score=42.45 Aligned_cols=400 Identities=13% Similarity=0.085 Sum_probs=157.1 Q ss_pred CCCHHHH--HHHHHHHHHHHHHH----------HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc Q lcl|NC_021324. 1 MTTYHEH--VERLQGLLARDLPN----------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~~~--i~~l~~~~~~~~~~----------~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~ 68 (480) |.|.+.+ ..++-.-+....+. .....+.+ |-. ....+..+.... -.+ +.-...+|+..++-+- T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~g~~v~~~~-al~--~~~V~~~i~~Ia~~ia 75 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDL-GII-ISDTGAAVNADA-IMR--LDAVAACVKLVSQAIA 75 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCccchhhh-ccc-ccccCcccchHh-hhc--cHHHHHHHHHHHHhhh Confidence 8876552 33322222111100 00000000 000 000011111100 001 1111222333222222 Q ss_pred cCcccc---CCC--ch-hHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEccc Q lcl|NC_021324. 69 IEGFRI---SED--SE-GLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPL 136 (480) Q Consensus 69 ~~~~~~---~~d--~~-~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~ 136 (480) .-++.+ ..+ .+ ....+..++.. |. -..+...+..+.+.+|.||+.+... +|++ .+..++|. T Consensus 76 ~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-------~g~~~~L~~l~~~ 148 (432) T protein:vir:81 76 AMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-------DGRIESLQYLAND 148 (432) T ss_pred hCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEcCC Confidence 223321 111 11 12334455432 32 3356777888999999999887642 2444 56678888 Q ss_pred ceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 137 YMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) .+.+..++.. .+. |..... .+.. ..+..+. |+||++.. .. T Consensus 149 ~v~v~~~~~g--~~~----y~~~~~-~g~~---~~~~~~~-----------------------------iih~r~~~-~d 188 (432) T protein:vir:81 149 RLTITTDPKG--NTA----YRYRRT-DGQM---IDIPKQQ-----------------------------IWKIMGYS-LD 188 (432) T ss_pred ceEEEECCCC--cEE----EEEEec-CceE---EEEcccc-----------------------------EEEecCCC-CC Confidence 8887776432 111 111111 1110 0112222 24444332 23 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcc---hhhhhcC-CCccccccccccchhh--hhcchheecCCCCceE Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGT---PLRVISG-VTTDELTNDGENTTLD--IYYGRILTLASEAAKI 290 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~---p~l~i~G-~~~~~~~~~~~~~~~~--~~~~~~~~~~g~~~~~ 290 (480) +.+|.|.|. .+.+.+...+.-......+|.+ |-.++.- ....+...+.....+. ...++++.+++ +.++ T Consensus 189 g~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~-g~~~ 263 (432) T protein:vir:81 189 GENGLSAIR----YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEG-GMDV 263 (432) T ss_pred CcccccHHH----HHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhcCCCceecCC-CceE Confidence 456777553 3444444444333333344433 3333321 1111110000111111 12245666654 4578 Q ss_pred EEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 291 SEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 291 ~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 368 (480) .++..+. -..+++..+..+.+|+..-++|+..+|....+ .+.|..++.....+...+ -.-+-..|+.-+. ..+ T Consensus 264 ~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~t---l~P~~~~ie~~l~--~kL 338 (432) T protein:vir:81 264 KSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMT---LSPWLRRIEQSIA--LNL 338 (432) T ss_pred EEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHH---HHHHHHHHHHHHH--hhc Confidence 7776432 22467888888999999999999999864322 122223332222221111 0001111111111 111 Q ss_pred hCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|NC_021324. 369 MGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTT 448 (480) Q Consensus 369 ~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 448 (480) ...... ....+++.+......+..++++.+.+++++| +++...+++++|+.+-+-. ...... ...+.... T Consensus 339 l~~~~~-~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~t~NE~R~~~glpp~~g~--~~~~~~-----~~~~~pl~ 408 (432) T protein:vir:81 339 LSPAER-RRYFADFDTSALLRADSAARSSYYSQLVNNG--LMTRDEAREIEGLPKLGGN--AAVLTV-----QSAMVPLD 408 (432) T ss_pred cCcccc-CceEEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCC--cceEee-----cCcccchh Confidence 111111 1122444444555678889999999998876 6677778878776432100 000000 00000000 Q ss_pred ccCCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 449 KAQADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 449 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) ....+..+++. ++.+.+...+-++ T Consensus 409 ~~~~~~~~~~~------~~~~n~~~~~~~~ 432 (432) T protein:vir:81 409 SIGLQASPEPA------SGLGNQQQDKVSK 432 (432) T ss_pred hhccCCCCCCC------CCCCCcccccccC Confidence 00000001111 1111111111112 No 231 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=97.26 E-value=0.00011 Score=42.22 Aligned_cols=393 Identities=14% Similarity=0.068 Sum_probs=159.0 Q ss_pred CCCHHHHHHHHHHHH-HHHHHHH------------HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhh Q lcl|NC_021324. 1 MTTYHEHVERLQGLL-ARDLPNL------------LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL 67 (480) Q Consensus 1 ~~~~~~~i~~l~~~~-~~~~~~~------------~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l 67 (480) |.. .+.+++... ......+ ..+...+-|.... .+..+.... -.+. .=...+|+..++-+ T Consensus 1 ~~~---~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--~g~~v~~~~-al~~--~~V~~~i~~ia~~i 72 (434) T protein:vir:43 1 MSK---SLGKVLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESS--SGKKVTVDK-AMKL--SAVWACVRLISTSV 72 (434) T ss_pred Ccc---chhhhhhhcccccchhhhcccccccccCchHHHHHHhcCCcc--CCceechhh-hhcc--HHHHHHHHHHHHhh Confidence 322 111111111 0000100 1111112221100 111111110 0111 01112233332222 Q ss_pred ccCcccc---CCC----chhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEc Q lcl|NC_021324. 68 DIEGFRI---SED----SEGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVES 134 (480) Q Consensus 68 ~~~~~~~---~~d----~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~ 134 (480) -.-++.+ ..+ ......+..++.. |. -..+...+..+.+.+|.+|+++..+ .|++ .+..++ T Consensus 73 a~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-------~G~~~~L~~l~ 145 (434) T protein:vir:43 73 AGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-------AGRPAALDFLL 145 (434) T ss_pred hhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-------CCcEEEEEEEc Confidence 2223321 211 1112245555532 33 3467777888999999999887532 3554 566788 Q ss_pred ccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccc Q lcl|NC_021324. 135 PLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR 214 (480) Q Consensus 135 p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~ 214 (480) |..+.+..+... . ++++.. ...+. ...|.++. |+||++. . T Consensus 146 p~~v~~~~~~~g--~----~~y~~~-~~~g~---~~~~~~~e-----------------------------Vih~~~~-~ 185 (434) T protein:vir:43 146 PSRVDLECDENG--R----LKYFYT-TKKGA---RREIERTN-----------------------------MLHIPAF-T 185 (434) T ss_pred CcceEEEEcCCC--e----EEEEEE-ecCce---EEEEcccc-----------------------------EEEecCc-C Confidence 888877765431 1 111111 11111 01122222 2445443 2 Q ss_pred cCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcC-CCccccccccccchhhh-----hcchheecCC Q lcl|NC_021324. 215 LGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISG-VTTDELTNDGENTTLDI-----YYGRILTLAS 285 (480) Q Consensus 215 ~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i~G-~~~~~~~~~~~~~~~~~-----~~~~~~~~~g 285 (480) ..+.+|.|.|. .+.+.+.....--.-...+|. .|-.+++- ....+...+.-...++. ..++++.++ T Consensus 186 ~dg~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~~vl~- 260 (434) T protein:vir:43 186 LDGRIGLSAIR----YGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRSPVLE- 260 (434) T ss_pred CCCccccCHHH----HHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCccccC- Confidence 34466777553 333444333322222223333 34444431 11111111110111111 123455555 Q ss_pred CCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 286 EAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI 364 (480) Q Consensus 286 ~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l 364 (480) ++.++.++.... -..+++..+....+|+..-++|+..+|....+...+.++......+. ...|.-++.. T Consensus 261 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~----------~~~L~P~~~~ 330 (434) T protein:vir:43 261 QGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFL----------TFSISSITNQ 330 (434) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHH----------HHHHHHHHHH Confidence 346887776432 22477888888999999999999988754322111222322222211 1112222221 Q ss_pred HH-----HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021324. 365 AM-----QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 365 i~-----~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~ 439 (480) +. .+....... ...+++.+......|..++++++.+++.+| +++...+++.+|+.+-+ ...+..-...-. T Consensus 331 ie~~ln~kL~~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~gl~p~~--ggD~~~~~~n~~ 405 (434) T protein:vir:43 331 IQQCVNKRLLTAPERI-RYYAEFSLEGFLKADSAGRAAWYSTMAQNG--FMTRNEGRRKENLPELP--GGDILTVQSNLV 405 (434) T ss_pred HHHHHHhhcCChhhhc-CceEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CCCeEeeccCcc Confidence 11 111111111 123444445556678899999999998876 66777777787765321 000000000000 Q ss_pred HHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCC Q lcl|NC_021324. 440 MIDTLYSTTKAQADATPKPTVTETKTETQTSPSG 473 (480) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 473 (480) .++.. ++.+..+......++..++++|.. T Consensus 406 ~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 406 PIDQL-----GQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred chhhh-----hccCCCcchhhhhhccCCCCCCCC Confidence 01100 111111112222233333333333 No 232 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=97.23 E-value=0.00012 Score=42.01 Aligned_cols=352 Identities=16% Similarity=0.126 Sum_probs=137.1 Q ss_pred HHHHHHH-----HHHHH---HHHHHhhccCcchhcCc-ccchhhh--c-------------------------ceeccch Q lcl|NC_021324. 12 QGLLARD-----LPNLL---EAEAYRNGTRRLKTIGI-GAPPELA--Y-------------------------LDVQPGW 55 (480) Q Consensus 12 ~~~~~~~-----~~~~~---~~~~YY~G~~~i~~~~~-~~~~~~~--~-------------------------~~~~~n~ 55 (480) +..|..- .+++. =..+|.-|+.++..+.. ..++..+ . .-..... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~~~ 80 (409) T protein:vir:83 1 MGFWSNLFGIPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLIDV 80 (409) T ss_pred CchhhhhcccccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhhHH Confidence 2222211 11221 12344445444322211 1110000 0 0000112 Q ss_pred HHHHHHHHHhhhccCccccCCCchhHHHHHHHHH--hcC---HHHHHHHHHHHHhhcCeeEEEE-ecCccccCCCCcee- Q lcl|NC_021324. 56 VATYLRTLSDRLDIEGFRISEDSEGLEELWNWWQ--AND---LDEESVLGHDDSLTFGRAYITV-SHPDVESGDPAGIP- 128 (480) Q Consensus 56 ~~~iVd~~~~~l~~~~~~~~~d~~~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v-~~~~~~~~d~~g~~- 128 (480) +..+|+..++-+-.-++.+-.+.+..+.+..++. -|. ...+...+.... ..|.+|+.+ .. +.+|.| T Consensus 81 v~acV~~Ia~~iA~lpl~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~l~~~l-llGnay~~~i~r------~~~G~~~ 153 (409) T protein:vir:83 81 AWACIDLNASVLSSMPIYRMRNGRIIDSVAWMSNPDPEVYTSWQEFAKQLFWDF-QLGEAFVLPMAH------GSDGYPI 153 (409) T ss_pred HHHHHHHHHHhhccCceEEeeCCccccchhhhcccCCCCCCCHHHHHHHHHHHH-hhCCcEEEEEEE------CCCCcEE Confidence 2233443333332223322111111111222222 121 223334444444 448888764 33 235655 Q ss_pred EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEE Q lcl|NC_021324. 129 LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVP 208 (480) Q Consensus 129 ~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~ 208 (480) .+..++|..+.+.++... . . .| ...+.. +.=.|+| T Consensus 154 ~L~pl~p~~v~v~~~~~g--~----~-~y-----------------------~~~~~~---------------~~~eiiH 188 (409) T protein:vir:83 154 RFRVVPPWLVNVELKKGA--R----R-EY-----------------------RIGGLN---------------VTDEILH 188 (409) T ss_pred EEEEECCcceEEEEcCCc--e----E-EE-----------------------EEcccc---------------CccceEE Confidence 466778877665554321 0 0 01 000000 0014678 Q ss_pred eecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccchhh----hhcchheec Q lcl|NC_021324. 209 LTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENTTLD----IYYGRILTL 283 (480) Q Consensus 209 ~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~----~~~~~~~~~ 283 (480) |+......+.+|.|.|.- ....++..+.+.....+.+...+.|..++.-- .+.++..+.-...|+ -..++.+.+ T Consensus 189 ir~~~~~~~~~G~spi~~-~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~~ls~e~~~~~~~~~~~~~~~nag~~~il 267 (409) T protein:vir:83 189 IRYQGNTADAHGHGPLES-AAPRQVVIGLLQKYVQNLAETGGVPLYWLGVERRLSETEAVDLMDRWIESRSKYAGHPALV 267 (409) T ss_pred eCCCCCCCCcccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceEeecCCCCCHHHHHHHHHHHHHhhCCccCcccee Confidence 876555566789887642 33333332322222222233333455444311 111110000011111 122333444 Q ss_pred CCCCceE---EEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccch---H---HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 284 ASEAAKI---SEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA---S---AEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 284 ~g~~~~~---~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~---S---g~Al~~~~~~l~~k~~~~~~~~ 354 (480) +++. ++ ..++..++ .|++..+....+|+..-++|+..+|....+.. | ...+.+....|.-.+.+.+.. T Consensus 268 ~~g~-~~~~~~~~s~~d~-q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ie~~- 344 (409) T protein:vir:83 268 TGGA-TLNQAKSMSAQDL-SLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATAVMAA- 344 (409) T ss_pred cCCc-ccccccCCCHHHH-HHHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHHHHHH- Confidence 4332 32 23333333 37787788899999999999998885332211 1 111222112222222222211 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHH Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDK 434 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~ 434 (480) |.+ .+... ...+++.+......+..++++++.+++++| +++...+++.+|..+. T Consensus 345 ---l~~------~Ll~~-----~~~~~f~~~~llr~d~~~r~~~~~~~~~~G--~lT~NE~R~~~glpp~---------- 398 (409) T protein:vir:83 345 ---LDR------WALPS-----PQHLELNRDDYTRPSLVERATAYKIMIEAG--VMEPNEARAMERLHSE---------- 398 (409) T ss_pred ---HHH------hhCCC-----CcEEEeehhhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCC---------- Confidence 111 11111 123455555555677888888888887765 4444444444333211 Q ss_pred HHHHHHHHHHhhhcccCCCcCCCCCC Q lcl|NC_021324. 435 QETEDMIDTLYSTTKAQADATPKPTV 460 (480) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 460 (480) .+++.-...++ T Consensus 399 ---------------~ggd~l~~~gv 409 (409) T protein:vir:83 399 ---------------AAAVRLSGGGV 409 (409) T ss_pred ---------------CCCcccCCCCC Confidence 11111122222 No 233 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=97.18 E-value=0.00014 Score=41.74 Aligned_cols=423 Identities=14% Similarity=0.122 Sum_probs=163.6 Q ss_pred CCCH--------HHHHHHHHHHHHHHHHHH-HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc--- Q lcl|NC_021324. 1 MTTY--------HEHVERLQGLLARDLPNL-LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD--- 68 (480) Q Consensus 1 ~~~~--------~~~i~~l~~~~~~~~~~~-~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~--- 68 (480) |-.. ++.+++..+.+..++..+ ...+++|+ +.+|++....+...+.-++..+-+...++++++.|. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~--~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~l 78 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSK--LTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVL 78 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHH--hhcccccCCCCCcccccccccchHHHHHHHHHHHHHhhh Confidence 3222 234555555554433332 23334442 223322111111111113555666777777777663 Q ss_pred -c--Cccc-c--CCC---------c---hhH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCC Q lcl|NC_021324. 69 -I--EGFR-I--SED---------S---EGL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGD 123 (480) Q Consensus 69 -~--~~~~-~--~~d---------~---~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d 123 (480) + .+|+ . ++. . +.. ..+...+..++|.....++.++...+|.|.+++. T Consensus 79 tpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d-------- 150 (516) T protein:vir:10 79 FPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKP-------- 150 (516) T ss_pred cCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEec-------- Confidence 1 2232 1 110 0 011 1244456678999999999999999999865442 Q ss_pred CCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe-------cC-----------CceEEEEEEEe-----CCcEEEE Q lcl|NC_021324. 124 PAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR-------DD-----------VAVPDRATLYL-----PDETVPL 180 (480) Q Consensus 124 ~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~-------~~-----------~~~~~~~~~~~-----~~~~~~~ 180 (480) +++. ++.++-.+.++.-|. .+ .+...++..... .. .+....+++|+ ++..+.+ T Consensus 151 ~~~~--~~~~pl~~y~v~~d~-~G-~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~~~~~~~~ 226 (516) T protein:vir:10 151 SKGA--ISAIPMHHYVVNRDT-NG-DLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYLGEGFWEL 226 (516) T ss_pred CCCC--eEEEEcCeEEEeeCC-CC-CeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEecCCCceEE Confidence 2222 455554554433333 32 333333322100 00 01111223332 2222222 Q ss_pred EecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--C Q lcl|NC_021324. 181 RRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--G 258 (480) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G 258 (480) ....+.. .......-+|..+|++.++-+...++.||+|-.. +..+=+..+|.+.-...........|.+.+. | T Consensus 227 ~~~~d~~----~~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g 301 (516) T protein:vir:10 227 KQSADDI----PVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAE-DYSGDLFVIQFLSEAVARGAALMADIKYLIRPGA 301 (516) T ss_pred EEeeCce----eeccccccccccCCeeeeeeeecCCCCcccchHH-HhhHHHHHHHHHHHHHHHHHHHhcCCCcccCccc Confidence 2211110 0111112346678998888888889999999653 3445556666665556666666666655542 1 Q ss_pred CCccccccccccchhhhhcchheecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhcc-ccccchHHHHH Q lcl|NC_021324. 259 VTTDELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAI 336 (480) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~~~~Sg~Al 336 (480) ... .........|.+..-..++....++.. .+++...+.++.+...|....-+. .+.. ++.. -+++.+ T Consensus 302 ~~~-------~~~l~~~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~rd~~r-vTAtEV 371 (516) T protein:vir:10 302 QTD-------VDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAER-VTAVEI 371 (516) T ss_pred ccc-------hhhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhh--hhhccCCcc-ccHHHH Confidence 110 000111122233221112223334332 344444444444433332221111 0111 1111 133333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-----HH---HHHhCCCCcccceeeeE-EecCCCCCCHHHHHHHHHHHHh--- Q lcl|NC_021324. 337 IATDSRIVKMAERKGRIFGGAWERAMR-----IA---MQIMGREVTEEYTRLET-VWRDPSTPTVAAKADAVSKLYA--- 404 (480) Q Consensus 337 ~~~~~~l~~k~~~~~~~~~~~l~~~~~-----li---~~~~~~~~~~~~~~~~v-~f~~~~~~~~~~~ad~~~kl~~--- 404 (480) + ..+..++..++..+.++-. ++ +....-..+.....+.+ ++-.++ .....++.+..+.+ T Consensus 372 ~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~v~~i~~L--~raq~~~~i~~~~q~i~ 442 (516) T protein:vir:10 372 Q-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVIITGIEAL--GRMAELDKLANFAQYMS 442 (516) T ss_pred H-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChhhcCcceehhHHHH--HHHHHHHHHHHHHHHHH Confidence 3 4455566666666555422 11 11112222222222222 221111 11122222222111 Q ss_pred ccccC-------CC----HHHHHHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 405 NGQGP-------IP----KEQARIDLGYT------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 405 ~~~~~-------~s----~e~~~~~~~~~------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ..... +. -+.+...+|.. +++++++.+.+++.++ ........+ .+.....++.--+. T Consensus 443 ~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~---~~~~~~~~~---~~~~~~~~~~~~~~ 516 (516) T protein:vir:10 443 LPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQ---AQMLEEGVA---KAVPGVIQQELKEA 516 (516) T ss_pred HHhcCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHH---HHHHHHHhh---hcccchhhhhhhcC Confidence 11000 11 13334444432 4444444333322222 111111111 11111111111111 No 234 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=97.17 E-value=0.00015 Score=41.63 Aligned_cols=423 Identities=14% Similarity=0.120 Sum_probs=162.4 Q ss_pred CCCHH----HHHHHHHHHHHHHHHHH-HHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc----c-- Q lcl|NC_021324. 1 MTTYH----EHVERLQGLLARDLPNL-LEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD----I-- 69 (480) Q Consensus 1 ~~~~~----~~i~~l~~~~~~~~~~~-~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~-- 69 (480) |+... +.++++.+.+...+..+ ...+++|+ +.+|+.-...+...+.-++..+-+...++++++.|. + T Consensus 5 ~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~--~~lP~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~ 82 (516) T protein:vir:96 5 IDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSK--LTLPYLMNDKGDNETSQNGWQGVGAQATNHLANKLAQVLFPAQ 82 (516) T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHH--hhcccccCCCCCccccCCcccchHHHHHHHHHHHHHhhhcCCC Confidence 44332 34555555555444332 23333432 222322111111111124555666777777777663 1 Q ss_pred Cccc-cC-CC----------c---hhH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCce Q lcl|NC_021324. 70 EGFR-IS-ED----------S---EGL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI 127 (480) Q Consensus 70 ~~~~-~~-~d----------~---~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~ 127 (480) .+|+ .. +| . +.. ..+...+..++|.....++.++...+|.|.+++. +++ T Consensus 83 ~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~d--------~~~- 153 (516) T protein:vir:96 83 RSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCMLYKP--------SKG- 153 (516) T ss_pred CcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeEEec--------CCC- Confidence 2232 11 11 0 011 1244456678899999999999999999866542 222 Q ss_pred eEEEEEcccceEEEEeCCCCCceEEEEEEEE-------Eec-----------CCceEEEEEEE-----eCCcEEEEEecC Q lcl|NC_021324. 128 PLIRVESPLYMYAELDPRNTRRVTRAVRLYT-------TRD-----------DVAVPDRATLY-----LPDETVPLRRNG 184 (480) Q Consensus 128 ~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~-------~~~-----------~~~~~~~~~~~-----~~~~~~~~~~~~ 184 (480) .++.++-.+.++.-|.. + .+...++... ... ..+....+++| .++..+.+.... T Consensus 154 -~~~~~pl~~y~v~~d~~-G-~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~ 230 (516) T protein:vir:96 154 -AISAIPMHHYVVNRDTN-G-DLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSA 230 (516) T ss_pred -CEEEEEcCeEEEeeCCC-C-CeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEe Confidence 24555545533333332 2 2222222110 000 00001122333 233322222211 Q ss_pred CCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCcc Q lcl|NC_021324. 185 GLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTD 262 (480) Q Consensus 185 ~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~ 262 (480) +.. ........+|..+|++.++-+...++.||+|-.. +..+=+..+|.+.-...........|.+.+. |... T Consensus 231 d~~----~~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~-~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~p~g~~~- 304 (516) T protein:vir:96 231 DDI----PVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAE-DYSGDLFVIQFLSEAVARGAALMADIKYLIRPGAQTD- 304 (516) T ss_pred Cce----eeccccccccccCCeeeeeeeecCCCCcccchHH-HhhHHHHHHHHHHHHHHHHHHHhcCCccccCcccccc- Confidence 110 1111122346678999888888889999999653 3445556666666666666666666665442 1110 Q ss_pred ccccccccchhhhhcchheecCCCCceEEEecc-ccHHHHHHHHHHHHHHHHhhcCCChHHhcc-ccccchHHHHHHHHH Q lcl|NC_021324. 263 ELTNDGENTTLDIYYGRILTLASEAAKISEFKA-AELRNFAEEMEVFRKEAASITGLPPQYLSS-SSENPASAEAIIATD 340 (480) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~-~~~~~~Sg~Al~~~~ 340 (480) .........+.+..-..++....++.. .+++.....++.+...|....-+. .+.. ++.. -+++.++ T Consensus 305 ------~~~l~~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~r~~~r-vTAtEV~--- 372 (516) T protein:vir:96 305 ------VDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAER-VTAVEIQ--- 372 (516) T ss_pred ------hhhhccCCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCcc-ccHHHHH--- Confidence 000111122222221112223334332 234433344443333332221110 0111 1111 1333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH--------HHHHHHhCCCCcccceeeeEEecCCCC-CCHHHHHHHHHHHHh------- Q lcl|NC_021324. 341 SRIVKMAERKGRIFGGAWERAM--------RIAMQIMGREVTEEYTRLETVWRDPST-PTVAAKADAVSKLYA------- 404 (480) Q Consensus 341 ~~l~~k~~~~~~~~~~~l~~~~--------~li~~~~~~~~~~~~~~~~v~f~~~~~-~~~~~~ad~~~kl~~------- 404 (480) ..+..++..++..+.++- ...+...+-...... +++.+-.++. -.....++.+.+..+ T Consensus 373 ----~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp~~~--v~~~~vs~l~~l~r~~~~~~i~~~~~~i~~~~~ 446 (516) T protein:vir:96 373 ----RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFTSDL--VDPVIITGIEALGRMAELDKLANFAQYMSLPLQ 446 (516) T ss_pred ----HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCcccc--ccceeechHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 444555556666555532 112222332222222 3333321111 001111222222111 Q ss_pred ccc---cCCCHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccC Q lcl|NC_021324. 405 NGQ---GPIPKE----QARIDLGYT------ATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTET 467 (480) Q Consensus 405 ~~~---~~~s~e----~~~~~~~~~------~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 467 (480) ... -.+... .+...+|.. +++++++ ++++++..........-+++.+ ...+++.-+. T Consensus 447 ~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~---~~~~~~~q~~~~~a~~~~~~~~---~~~~~~~~~~ 516 (516) T protein:vir:96 447 WPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQE---QEAQMQAQQAQMLEEGVAKAVP---GVIQQELKEA 516 (516) T ss_pred CChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHH---HHHHHHHHHHHHHHHHhhhhhh---HHhhcccccC Confidence 110 011111 223444543 3333333 2222221111111111111000 0001111100 No 235 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=97.15 E-value=0.00015 Score=41.52 Aligned_cols=394 Identities=13% Similarity=0.080 Sum_probs=167.2 Q ss_pred CCC-----HHHHH------HHHHHHHHHHHHHHHHHHHH-hhccC--cchh---cCcccchhhhcceeccchHHHHHHHH Q lcl|NC_021324. 1 MTT-----YHEHV------ERLQGLLARDLPNLLEAEAY-RNGTR--RLKT---IGIGAPPELAYLDVQPGWVATYLRTL 63 (480) Q Consensus 1 ~~~-----~~~~i------~~l~~~~~~~~~~~~~~~~Y-Y~G~~--~i~~---~~~~~~~~~~~~~~~~n~~~~iVd~~ 63 (480) ||- .-..+ +-+..+.......+ ..+ +.|-. +.+- .+... .-+..+. ......-.+++. T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~---~~~~~~~~~p~~~~il~~~~~~~-~~y~~m~-~D~~i~s~l~~R 75 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATRARSI---DFFALGMYLPNPDPVLKALGKDI-RVYRELR-ADAHVGGCVRRR 75 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhhcccc---ccccccccCcchhHHHhhccCCH-HHHHHHh-hChHHHHHHHHH Confidence 110 01111 11112211111111 111 11100 0000 00000 1111222 234445555555 Q ss_pred HhhhccCcccc---CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeE---EEEEccc Q lcl|NC_021324. 64 SDRLDIEGFRI---SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL---IRVESPL 136 (480) Q Consensus 64 ~~~l~~~~~~~---~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~---~~~~~p~ 136 (480) ...+....|.+ .++++..+.+.+.++.-+|+..+..+ .++..+|.+ ++++|.. .+|... +..++|+ T Consensus 76 k~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~------~~g~~~~~~l~~r~~~ 148 (491) T protein:vir:79 76 KAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGK------VGNYIVPIDVVGKPAD 148 (491) T ss_pred HHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEee------cCCeeeEEeeeeeccc Confidence 44455556665 23444567888888877888888776 578889987 4555632 134332 3333332 Q ss_pred ceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 137 YMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) ... ||+.. ...++..... ....+. .+++ ++.+.+....+ T Consensus 149 ~f~--~d~~~------------------------------~l~l~~~~~~-----~~g~~l-p~~k---~i~~~~~~~~g 187 (491) T protein:vir:79 149 WFV--YDPEN------------------------------QLRFRSKEHW-----VQGEEL-PARK---FLVPRQEATYL 187 (491) T ss_pred cee--eccCC------------------------------ceEEeecCCC-----CCceee-cCCC---eEEEEecCCCC Confidence 211 22111 1111111110 000111 1122 33455666677 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccc----hhhhhcchheec-CCCCceEE Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT----TLDIYYGRILTL-ASEAAKIS 291 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~----~~~~~~~~~~~~-~g~~~~~~ 291 (480) .++|.+-+.. +-...=-=+..+.+++.-++.|+.|+++.+- +....++.... ...+..+....+ .|.+.+|. T Consensus 188 ~p~g~gLl~~-~~w~~~fK~~~~~~w~~f~E~~G~P~~igky--~~~a~~~ek~~l~~al~~~~~~a~~viP~~~~ie~~ 264 (491) T protein:vir:79 188 NPYGFPDLSM-CFWPTTFKKGGLKFWVQFTEKYGSPMLVGKH--PRSASDAETNLLLDRLEDMVQDAVAVIPDDSSIEIK 264 (491) T ss_pred CcccchhHHH-HHHHHHHHHhhHHHHHHHHHHcCCCeEEEec--CCCCCHHHHHHHHHHHHHHhcCeEEEecCCceeEEE Confidence 7888886643 3222222356677888888999999877662 11111111111 122333333333 34456676 Q ss_pred Eecc--ccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 292 EFKA--AELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 292 ~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 368 (480) +... .+...|...++..-.+|+...- -+.+.. ++ .++.|+. ....-....+..-.+.+...+.++++-++.+ T Consensus 265 ea~~~~g~~~~y~~li~~~d~~Isk~iL--GqtlTt--~~-~gs~a~~~vh~~v~~~i~~~D~~~i~~tln~li~~l~~~ 339 (491) T protein:vir:79 265 EAAGKSGSADVYERLLHFCRGEVSIALL--GQNQTT--EA-TSTRASAQAGLEVTDDIRDGDKAIVVEAMNMLIRWICDL 339 (491) T ss_pred eccCCCCChhHHHHHHHHHHHHHHHHHh--hhhhcc--Cc-ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6542 3445566666555555555320 011111 11 1111221 1222233334444566666777777777777 Q ss_pred hCCCCcccceeeeEEecCCCCCCH-HHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021324. 369 MGREVTEEYTRLETVWRDPSTPTV-AAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYST 447 (480) Q Consensus 369 ~~~~~~~~~~~~~v~f~~~~~~~~-~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 447 (480) ++... ....+.|.++. +. ...++.+.++++.|. .++.+.+.+.+|+...+..+. .. ... T Consensus 340 N~~~~----~~p~f~~~e~e--e~~~~~a~~~~~L~~~G~-~i~~~~~~e~~Gip~~~~~e~----------~~---~~~ 399 (491) T protein:vir:79 340 NFDGA----ARPVFDMWEQE--QVDEIQAGRDEKLTRAGA-RFTPAYFKRAYNLQDGDLDER----------PL---PVS 399 (491) T ss_pred cCCCC----CcceEeecCcC--chhHHHHHHHHHHHhCCC-ccCHHHHHHHhCCCCCCCCcc----------cc---CcC Confidence 65432 22345554433 33 457888889998874 579999999999764332110 00 000 Q ss_pred cccCCCcCCCCCCCCCCccCCCCCCCCCccc---CC Q lcl|NC_021324. 448 TKAQADATPKPTVTETKTETQTSPSGFNRTK---TR 480 (480) Q Consensus 448 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~---~~ 480 (480) ....+.++... +..+ +..+..+.-.. .| T Consensus 400 ~~~~~~~~~~~---~~~~--~~~~~~d~~~~~~~~~ 430 (491) T protein:vir:79 400 AVDAVGAASFA---EFEA--PDQDALDAALNALSAR 430 (491) T ss_pred ccccccccccc---ccCC--CCCcchHHHHHHHHHH Confidence 00000000000 0000 00000000000 01 No 236 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=97.05 E-value=0.00019 Score=40.96 Aligned_cols=406 Identities=12% Similarity=0.012 Sum_probs=159.5 Q ss_pred CCCH--HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhh---hcceeccchHHHHHHHHHhhhccCcccc- Q lcl|NC_021324. 1 MTTY--HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPEL---AYLDVQPGWVATYLRTLSDRLDIEGFRI- 74 (480) Q Consensus 1 ~~~~--~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~---~~~~~~~n~~~~iVd~~~~~l~~~~~~~- 74 (480) |.=+ ..-...|.++.............|..-+.-++..+....... .++.-.....+-.+++.-..+....+++ T Consensus 1 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~ 80 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQ 80 (446) T ss_pred CcccccCCCchhhhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceec Confidence 1100 000111222222222222222233311111122222111111 1111112333344444333334444544 Q ss_pred CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEEEEE------cccceEEEEeCCCC Q lcl|NC_021324. 75 SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRVE------SPLYMYAELDPRNT 147 (480) Q Consensus 75 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~~~~------~p~~~~~v~d~~~~ 147 (480) +.+++..+.+.+++..-.+...... ..++..||.+ .++||....+ +..-.++. .|...-=.+|.. + T Consensus 81 p~~~~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g-----~~~p~~~~d~~~~~~~~~~r~~~~~~-~ 153 (446) T protein:vir:98 81 HGDKRIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGAR-----DNMPATVLDDIVNYHPLQVMLIANDN-G 153 (446) T ss_pred CccHHHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeeccc-----ccccchhhccccccccccceeeeccC-C Confidence 4566677788888876666555554 5788889987 5556642211 11100111 111100011111 0 Q ss_pred CceEEEEEEEEEecCCceEEEEE-EEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchH Q lcl|NC_021324. 148 RRVTRAVRLYTTRDDVAVPDRAT-LYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISP 226 (480) Q Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~ 226 (480) ..+... ..+...+.. .|.+-. .+......... ......++ ++.--++.+.++...+.|+|.|-+.. T Consensus 154 ~~~~~~--------~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~g~~~~--iP~~kfi~~~~~~~~~~p~G~gLlr~ 220 (446) T protein:vir:98 154 RIVDGD--------TVTASQYKSGYWVPLP--PYRIGDPPKKV-DVVGSHVR--LPSHKRLFINYNTKGNNPWGTSCLTS 220 (446) T ss_pred cccccc--------ccchhhcccccccCcc--cchhhhhhhhc-ccCccccc--ccccceEEEEecCCCCCccccchHHH Confidence 000000 000000000 000000 00000000000 00001111 11112355677788888999886532 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcchhhhhc---CCCcccccccccc--------chhh----hhcchheec-----C-C Q lcl|NC_021324. 227 ELRKVTDAASRTLMNLQSASQILGTPLRVIS---GVTTDELTNDGEN--------TTLD----IYYGRILTL-----A-S 285 (480) Q Consensus 227 ~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~---G~~~~~~~~~~~~--------~~~~----~~~~~~~~~-----~-g 285 (480) +--.-=-=+..+-+++.-++.|+.|.++.+ |++.++..+.+.. ..+. +..+....+ + | T Consensus 221 -~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g 299 (446) T protein:vir:98 221 -VLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQP 299 (446) T ss_pred -HHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCC Confidence 211111115566678888899999988876 3332221111100 0111 111111111 1 2 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_021324. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAII-ATDSRIVKMAERKGRIFGGAWE-RAMR 363 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~-~~~~~l~~k~~~~~~~~~~~l~-~~~~ 363 (480) -..++.+........|...++..-.+|+...--....+|.......|. |+. ....-....++.-.+.+...+. ++++ T Consensus 300 ~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~-ala~vh~~V~~d~~~aDa~~i~~tln~~Li~ 378 (446) T protein:vir:98 300 VQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTG-RASEIQLELFDGKINSIFDTVIHAFTEQVIG 378 (446) T ss_pred ceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223344433333334666666555666664322212233221111221 221 1112222223334455555563 4666 Q ss_pred HHHHHhCCCCcccc--eeeeEEecCCCCCCHHHHHHHHHHHHhccccC-CCHHHHHHhCCCChhHHHH Q lcl|NC_021324. 364 IAMQIMGREVTEEY--TRLETVWRDPSTPTVAAKADAVSKLYANGQGP-IPKEQARIDLGYTATQREQ 428 (480) Q Consensus 364 li~~~~~~~~~~~~--~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~-~s~e~~~~~~~~~~d~~~~ 428 (480) -++.++........ ....+.|....+.++...++++.++++.|... .+.+.+.+.+|+.+.+-.. T Consensus 379 ~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 379 NLIRLNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred HHHHhCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 66666654322111 11123454456678889999999999887432 1356788888764222111 No 237 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=97.00 E-value=0.00022 Score=40.67 Aligned_cols=427 Identities=14% Similarity=0.077 Sum_probs=159.4 Q ss_pred CCC---H---HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhc--CcccchhhhcceeccchHHHHHHHHHhhhccCcc Q lcl|NC_021324. 1 MTT---Y---HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTI--GIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGF 72 (480) Q Consensus 1 ~~~---~---~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~--~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~ 72 (480) ||+ + ..-+-+....=. ... .+.|+--...+.+ +... .-+..+.-......-.+++....+....| T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~--~~~----~~~~~~~e~~~~lr~~~~~-~ly~~m~e~D~~i~s~l~~rk~av~~~~w 73 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGV--VDG----WTVWDPFEQTPELQWPQSV-AVYSRMDNEDSRVTSLLEAISLPIRSTPW 73 (469) T ss_pred CCCcccCCCCccchhhhhhccc--ccc----hhhccccccccccccccch-HHHHHHHhhChHHHHHHHHHHHHHhcCCc Confidence 332 1 111111111000 000 1111100000000 0000 01111111234444444444444445555 Q ss_pred ccC---CCchhHHHHHHHHH-----------------hcCHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEEE Q lcl|NC_021324. 73 RIS---EDSEGLEELWNWWQ-----------------ANDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIR 131 (480) Q Consensus 73 ~~~---~d~~~~~~l~~~~~-----------------~n~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~~ 131 (480) ++. ++++..+.+.+.+. +..+..++.++..++..||.+ +++||..... ..+|...+. T Consensus 74 ~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~--~~dG~~~~~ 151 (469) T protein:vir:10 74 RIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQ--SPDGRFWLR 151 (469) T ss_pred eEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccc--cCCCceeee Confidence 541 22222233322211 123556777777788889988 5577753321 124555444 Q ss_pred EEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeC-CcEEEEEecCCCccccccccccccccccccceEEee Q lcl|NC_021324. 132 VESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLP-DETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLT 210 (480) Q Consensus 132 ~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~ 210 (480) .+.+.. + +.. .+..... +.+ ....+...+ ..+.... +.......+ ++.-=++.+. T Consensus 152 ~l~~rp------~----~~i--~~~~~~~-~~~-l~~~~~~~~~~~~~~~~--------~~~~~~~~~--lp~~k~i~~~ 207 (469) T protein:vir:10 152 KLAPRP------Q----WTI--SKFNVAP-DGG-LESIEQIAPPARTRGSL--------YVANIAPPE--IPVNRLVVYT 207 (469) T ss_pred eeeecC------c----ccc--eeeeecc-CCc-eeeeeecCccccccccc--------ccCCCCccc--cccCcEEEEE Confidence 333221 0 000 0011111 111 111110000 0000000 000000000 0001135566 Q ss_pred cccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch----hhh--hcchheecC Q lcl|NC_021324. 211 NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT----LDI--YYGRILTLA 284 (480) Q Consensus 211 n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~----~~~--~~~~~~~~~ 284 (480) ++...+.++|.|-+..-.... =-=+..+.+++.-++.|+.|+++.+--... .++..... -.+ +......++ T Consensus 208 ~~~~~g~p~g~gLlr~~~~~~-~fK~~~~~~w~~f~EryG~P~~vgky~~~a--~~~ek~~l~~a~~~~~~g~~a~~iip 284 (469) T protein:vir:10 208 RNKRPGQWQGKSILRSAYKHW-LLKDKLLRIEAATAERNGMGIPVGTASSAT--DEDEVRKMAALARSVRGGINAGVGLA 284 (469) T ss_pred ecCCCCCcccchhHHHHHHHH-HHHHHHHHHHHHHHHHcCCcceEEecCCCC--CHHHHHHHHHHHHHHhcCCceEEEcc Confidence 777788899988765322221 122557777888889999998876532111 11111111 111 112222332 Q ss_pred -CCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_021324. 285 -SEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE-RAM 362 (480) Q Consensus 285 -g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~ 362 (480) |.+.++.+.+ .+...|...++..-.+|+...--......+++++.+.|.. ...-....+..-.+.+...|. .++ T Consensus 285 ~~~~ie~~ea~-g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~v---h~ev~~d~~~sDa~~i~~tln~~li 360 (469) T protein:vir:10 285 QGQILELLGVS-GNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASV---LEDPFTQAVHAYATSICRIANQHII 360 (469) T ss_pred CCceEEEeecC-CCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3445555543 2344566666655566655421000001111111111211 122222333334455555664 355 Q ss_pred HHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhcccc---CCCHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021324. 363 RIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQG---PIPKEQARIDLGYTATQREQMRDWDKQETED 439 (480) Q Consensus 363 ~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~---~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~ 439 (480) +-++.+.-... ....+++|.... ......++.+.++++.|.. -++.+.+.+.+|+...+..+.. ..... T Consensus 361 ~~l~~lN~g~~---~~~P~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~--~~~~~-- 432 (469) T protein:vir:10 361 EDLVDINFGVD---TPAPVLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPS--AEPEE-- 432 (469) T ss_pred HHHHHhcCCCC---CCccEEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCccc--ccchh-- Confidence 55555542221 122556775443 4456678889999988753 2567788899987533221110 00000 Q ss_pred HHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 440 MIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 440 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+.+.++.+.+.....+..+....+|..+...-+= T Consensus 433 -----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~d 468 (469) T protein:vir:10 433 -----PAAVPNQSAAPARTRSSGNADARARAPKADQGVLFD 468 (469) T ss_pred -----cccCCCCCccccccCCCCCcccccccCCChHHhhcc Confidence 000011110111111111111111111111100000 No 238 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=96.96 E-value=0.00024 Score=40.47 Aligned_cols=344 Identities=12% Similarity=0.055 Sum_probs=131.6 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchhHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDL-PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~-~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~~~~ 83 (480) .-|+.. +.++. ...........+.... ..+..+.+.. -.+. .=.-.+|+..++-+-..++. ++ .. T Consensus 1 M~~~~~----f~~r~~~~~~~~~~~~~~~~~~-~~~~~v~~~~-al~~--~av~~cv~~ia~~ia~~p~~--~~----~~ 66 (359) T protein:vir:10 1 MSILNP----FERRSSITPNNYYPFMVQNGSI-VPNSLVDATE-ALKN--SDLYAVTSLISSDIAGTRFI--GN----QV 66 (359) T ss_pred Ccccch----hhccccCCCCcchhhhhccccc-cCCcccCHHH-hhcc--hHHHHHHHHHHHhhhcCccc--cc----hH Confidence 111110 10000 0000000111000000 0111111110 0011 00112333333322222332 11 12 Q ss_pred HHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 84 LWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 84 l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) +..++.+ | .-..+...+....+.+|.||+++-.+ ..|.+ .+..++|..+.+..++. . ++|.. T Consensus 67 ~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~------~~g~~~~l~~l~~~~v~i~~~~~---~----~~y~~ 133 (359) T protein:vir:10 67 FTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKG------DNSLMKELRLIPSNAITIDLTDD---T----LTYEV 133 (359) T ss_pred HHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEEC------CCCeEEEEEEeCCceEEEEEcCC---e----EEEEE Confidence 2233332 2 22345566777888999999988653 35654 46677777766544321 1 11111 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) .....+. ...|.++++++++... + +.....+..|.|.|.- +...+.....+ T Consensus 134 ~~~~~~~---~~~~~~~evih~~~~~------------------------~-~~~~~dg~~G~spi~~-~~~~i~~~~~~ 184 (359) T protein:vir:10 134 NQFDDYP---SAKYNASEMIHVKIMA------------------------Y-GVDTLHNLVGHSPLES-LTSEIGQQKEA 184 (359) T ss_pred EecCCce---EEEEcccceEEeccCC------------------------C-CCCccCccccccHHHH-HHHHHHHHHHH Confidence 1111111 1122333333322110 0 0011234567775532 33333322222 Q ss_pred HHHHHHHHHHhcchhhhhcC--CCccccccccccchhhh-----hcchheecCCCCceEEEeccccHH-HHHHHHHHHHH Q lcl|NC_021324. 239 LMNLQSASQILGTPLRVISG--VTTDELTNDGENTTLDI-----YYGRILTLASEAAKISEFKAAELR-NFAEEMEVFRK 310 (480) Q Consensus 239 ~s~~~~~~~~~~~p~l~i~G--~~~~~~~~~~~~~~~~~-----~~~~~~~~~g~~~~~~~~~~~~~~-~~~~~l~~~~~ 310 (480) .....+.+...+.|-.+++- ........+.-...|+. ..++++.++ ++.++..++.+..+ .+++..+.... T Consensus 185 ~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~~~ 263 (359) T protein:vir:10 185 NRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLD-QSADFSTVSINADVANYLNSMNWGRT 263 (359) T ss_pred HHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecC-CCcceeeecCCHHHHHHHHHHHHHHH Confidence 22233333333345444431 11111111111111211 113455555 44677777644333 47788888899 Q ss_pred HHHhhcCCChHHhccccccchHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCC Q lcl|NC_021324. 311 EAASITGLPPQYLSSSSENPASAEAIIATDSRIV-KMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST 389 (480) Q Consensus 311 ~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~-~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~ 389 (480) +|+..-++|++.+|+...+.++...++..+.... .........+...|.+ ....+.. ..+.| T Consensus 264 ~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~-----------~~~~~~~-~~~~~----- 326 (359) T protein:vir:10 264 QIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLISELRIKCDS-----------SIGVDMS-PITDY----- 326 (359) T ss_pred HHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----------hhcccch-hhhhc----- Confidence 9999999999999865444344444443333221 1121111111111111 0011100 01112 Q ss_pred CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHH Q lcl|NC_021324. 390 PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQR 426 (480) Q Consensus 390 ~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~ 426 (480) +.......+.+++.+| +++...+++.++..+= + T Consensus 327 -d~~~~~~~~~~~~~~G--~~t~NE~R~~l~~~pv-~ 359 (359) T protein:vir:10 327 -SNSVFKADILNWVKEG--IIEPTEAKTLLESKGI-I 359 (359) T ss_pred -CHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCC-C Confidence 2233333455666655 5666666666643211 1 No 239 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=96.95 E-value=0.00024 Score=40.39 Aligned_cols=371 Identities=10% Similarity=0.043 Sum_probs=141.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCCchhH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSEGL 81 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~~~ 81 (480) .=++.++..........-..+.....+.... .+..+... .-+.+.-+..+|+..++-+-.-++.+ .+++... T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~vt~~---~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~~~ 75 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRY--SGVYVTDS---NILQSSDVYELLQDISNQMVLADIVVEDEFGNEIKD 75 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhccccc--CccccChh---hhhccHHHHHHHHHHHHhhcccceEEEcCCCcccch Confidence 2222222222111111101112222222111 11111111 00111223444444444333334432 2222222 Q ss_pred HHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEE Q lcl|NC_021324. 82 EELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY 157 (480) Q Consensus 82 ~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~ 157 (480) ..+..++.+ | ....+...+..+.+.+|.||+++-. .+. . -+..+.|..+... ..+ T Consensus 76 ~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~----------~~~-~--~~~~~~~~~~~~~-------~~~- 134 (394) T protein:vir:62 76 DIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNG----------AQI-H--LASNVFTELDDNL-------VEH- 134 (394) T ss_pred hhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEec----------cee-e--ccccceEEECCce-------EEE- Confidence 334445443 3 2346677788899999999998732 111 0 0122333332110 000 Q ss_pred EEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHH Q lcl|NC_021324. 158 TTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASR 237 (480) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~ 237 (480) |..+. .. |..=-|+|+++.. ..+.+|.|.+.. +...++.... T Consensus 135 --------------~~~~~-~~---------------------~~~~eiih~r~~~-~d~~~G~s~~~~-~~~~i~~~~~ 176 (394) T protein:vir:62 135 --------------FNIGG-HE---------------------IPPCMIRHVKNIG-ADHLRGKGILDL-GRDTLEGVMS 176 (394) T ss_pred --------------EeeCC-EE---------------------echhheEEecCcC-CCCccccChHHH-HHHHHHHHHH Confidence 00000 00 0011146665443 345678775532 3333333222 Q ss_pred HHHHHHHHHHHhcchhhhhc--CCCc-cccccccccchhhh------hcchheecC-CCCceEEEecccc-HHHHHHHHH Q lcl|NC_021324. 238 TLMNLQSASQILGTPLRVIS--GVTT-DELTNDGENTTLDI------YYGRILTLA-SEAAKISEFKAAE-LRNFAEEME 306 (480) Q Consensus 238 ~~s~~~~~~~~~~~p~l~i~--G~~~-~~~~~~~~~~~~~~------~~~~~~~~~-g~~~~~~~~~~~~-~~~~~~~l~ 306 (480) +.......+...+.|..+++ +... +....+.-...|.- ..+++..++ |.+.++.+++... -..+++..+ T Consensus 177 ~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~ 256 (394) T protein:vir:62 177 AEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLN 256 (394) T ss_pred HHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHH Confidence 22223333333345544443 2111 10000000111111 113333343 3445666665432 223677778 Q ss_pred HHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHhCCCCcccceeee Q lcl|NC_021324. 307 VFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAM-----QIMGREVTEEYTRLE 381 (480) Q Consensus 307 ~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~-----~~~~~~~~~~~~~~~ 381 (480) ....+|+..-++|+..+|....++.+...+.+ +...|.-++..+. .+.... +...+. T Consensus 257 ~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~~---------------~~~~l~P~~~~ie~~l~~kll~~~---~~~~~~ 318 (394) T protein:vir:62 257 VYKKDLGKFLGINVDTYTELIKEDIEKAMMYI---------------HNKAVRPIMKNFEDHLSLLFYAQN---SGKRIK 318 (394) T ss_pred HHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHH---------------HHHHHHHHHHHHHHHHhhhhcCcc---ccCceE Confidence 88899999999999999864322211111111 1122222222211 122211 123466 Q ss_pred EEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCC Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVT 461 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 461 (480) +.|......+....++++.+++.+| +++...+++++|+.+-+-+..... .......+-. ...... T Consensus 319 ~~fd~~~~~~~~~~~~~~~~~~~~g--~~T~NE~R~~~gl~p~~~~~gd~~------------~~~~n~~~~~-~~~~~~ 383 (394) T protein:vir:62 319 FKINILDFVTYSNKTNIGYNLVRTA--ITSPDNVADMLGFPKQNTKESQAI------------YISNDVTEIG-KKEATD 383 (394) T ss_pred EEechhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCCee------------eccccccccc-cccccc Confidence 7776555555566777787887765 566677777777653211100000 0000000000 000000 Q ss_pred CCCccCCCCCCCCC Q lcl|NC_021324. 462 ETKTETQTSPSGFN 475 (480) Q Consensus 462 d~~~~~~~~~~~~~ 475 (480) ++..+++. ..| T Consensus 384 ~~~kgge~---~en 394 (394) T protein:vir:62 384 GSLGGGEE---NEN 394 (394) T ss_pred ccCCCCCC---CCC Confidence 00011110 011 No 240 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=96.93 E-value=0.00025 Score=40.30 Aligned_cols=402 Identities=12% Similarity=0.058 Sum_probs=157.0 Q ss_pred CCCHHH--HHHHHHHHHHHHHHHH---HHHHHHhhccC-----cchhcCcccchhhhcceeccchHHHHHHHHHhhhccC Q lcl|NC_021324. 1 MTTYHE--HVERLQGLLARDLPNL---LEAEAYRNGTR-----RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~--~i~~l~~~~~~~~~~~---~~~~~YY~G~~-----~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |.+++- ++.++-..+....+.- ........+.. .....+..+.+. .-+.+.-...+|+..++-+-.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~---~al~~~~V~~~i~~Ia~~ia~l 77 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNAD---AIMRLDAVAACVKLVSQAIAAM 77 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchh---hhhcchHHHHHHHHHHHhhhhC Confidence 555432 2222222221111100 00000000000 000011111110 0011112222333333322222 Q ss_pred cccc---CCCc--h-hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccce Q lcl|NC_021324. 71 GFRI---SEDS--E-GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYM 138 (480) Q Consensus 71 ~~~~---~~d~--~-~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~ 138 (480) +|.+ ..+. + ....+..++.. | ....+...+..+.+.+|.||+.+... +|++ .+..++|..+ T Consensus 78 p~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-------~g~~~~L~~l~~~~v 150 (432) T protein:vir:10 78 PLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-------DGRIESLQYLANDRL 150 (432) T ss_pred ceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEcCCce Confidence 3321 1111 1 12334555432 3 23456777888999999999887542 2443 5667888888 Q ss_pred EEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCcc Q lcl|NC_021324. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~ 218 (480) .++.+... .+ .|+....+ +.. ..+..+.+ +|+++.. ..+. T Consensus 151 ~v~~~~~g--~~----~y~~~~~~-g~~---~~~~~~~i-----------------------------ih~~~~~-~dg~ 190 (432) T protein:vir:10 151 TITTDTKG--NT----AYRYRRTD-GQM---IDIPKQQI-----------------------------WKIMGYS-LDGE 190 (432) T ss_pred EEEEcCCC--cE----EEEEEecC-ceE---EEEcCccE-----------------------------EEecCCC-CCCc Confidence 88776432 11 11111111 110 01122222 3444332 2345 Q ss_pred CCCccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCC-Cccccccccccchhh--hhcchheecCCCCceEEE Q lcl|NC_021324. 219 YGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGV-TTDELTNDGENTTLD--IYYGRILTLASEAAKISE 292 (480) Q Consensus 219 ~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i~G~-~~~~~~~~~~~~~~~--~~~~~~~~~~g~~~~~~~ 292 (480) +|.|.|. .+.+.+...+.-......+|. .|-.++.-- ...+...+.....+. ...++++.+++ +.++.+ T Consensus 191 ~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~nag~~~vl~~-g~~~~~ 265 (432) T protein:vir:10 191 NGLSAIR----YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLTDDQYDSFAKKVSGSVEAGRAPLLEG-GMDVKS 265 (432) T ss_pred ccccHHH----HHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHhhhhhCCCceecCC-CceEEE Confidence 6777543 344444444333332333333 344444311 111110000011111 12245566654 467877 Q ss_pred ecccc-HHHHHHHHHHHHHHHHhhcCCChHHhcccccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Q lcl|NC_021324. 293 FKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMG 370 (480) Q Consensus 293 ~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 370 (480) ++... -..+++..+....+|+...++|+..+|....+ .+.+..++.....+...+- .-+-..|+..+. ..+.. T Consensus 266 l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~tl---~P~~~~ie~~ln--~kL~~ 340 (432) T protein:vir:10 266 LGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMTL---SPWLRRIEQSIA--LNLLS 340 (432) T ss_pred ccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHHH---HHHHHHHHHHHH--hhhcC Confidence 76432 22477888888999999999999999864322 1222233322222211110 001111111111 01111 Q ss_pred CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|NC_021324. 371 REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKA 450 (480) Q Consensus 371 ~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 450 (480) ..... ...+++........|..++++.+.+++++| +++...+++.+|+.+-+-. ...... ...+...... T Consensus 341 ~~~~~-~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G--~~T~NE~R~~~glppi~g~--~~~~~~-----~~~~~pl~~~ 410 (432) T protein:vir:10 341 PAERR-RYFADFDTSALLRADSAARSSYYSQLVNNG--LMTRDEAREIEGLPKLGGN--AAVLTV-----QSAMVPLDSI 410 (432) T ss_pred ccccC-ceEEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCC--cceEee-----cCcccchhhh Confidence 11111 122333334455678889999999998875 6677777777776432100 000000 0000000000 Q ss_pred CCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 451 QADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 451 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) +.++ ..++++ +.+.....+.++ T Consensus 411 ----~~~~-~~~~~~-~~~~~~~~~~~~ 432 (432) T protein:vir:10 411 ----GLQA-SPEPAS-GLGNQQQDKVSK 432 (432) T ss_pred ----cccC-CCCCCC-CCCCcccccccC Confidence 0001 111111 111111112222 No 241 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=96.85 E-value=0.0003 Score=39.89 Aligned_cols=377 Identities=11% Similarity=0.032 Sum_probs=149.2 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHhhccCcc-hh---------------cCcccchhhhcceeccchHHHHHHHHHhhhcc Q lcl|NC_021324. 7 HVERLQGLLARD-LPNLLEAEAYRNGTRRL-KT---------------IGIGAPPELAYLDVQPGWVATYLRTLSDRLDI 69 (480) Q Consensus 7 ~i~~l~~~~~~~-~~~~~~~~~YY~G~~~i-~~---------------~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~ 69 (480) ++- |.-.|... -.-...+..++..+..- +. .+..+.+.. -.+. .=...+|+..++-+-. T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~-al~~--~~v~~cv~~Ia~~iA~ 76 (424) T protein:vir:45 1 MLY-CWWAHWLWPEGGRVLLDALFRSKSLENPSTPITGDAVDTDGLFRADVYVSPET-AMKL--AAVYSCIYVLSSSLAQ 76 (424) T ss_pred Cee-EeeeceecCcchhHHHHhhccccCCCCCccccchhhhhhhccccCCceechHH-hhcc--HHHHHHHHHHHHHHhh Confidence 111 11111111 11112222223222100 00 000010000 0011 1112233333333322 Q ss_pred Ccccc---CC-Cc-h-hHHHHHHHHH--hcC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccc Q lcl|NC_021324. 70 EGFRI---SE-DS-E-GLEELWNWWQ--AND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLY 137 (480) Q Consensus 70 ~~~~~---~~-d~-~-~~~~l~~~~~--~n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~ 137 (480) -++.+ .+ +. . ....+..++. -|. ...+...+..+.+.+|.||+.+-.+ ..|.+ .+..++|.. T Consensus 77 lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~------~~G~~~~L~~l~~~~ 150 (424) T protein:vir:45 77 MPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRN------RRGEVISLDCCMPWE 150 (424) T ss_pred CceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEecCce Confidence 23322 11 11 1 1123455443 232 3356677888999999999987653 34655 466677766 Q ss_pred eEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCc Q lcl|NC_021324. 138 MYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGN 217 (480) Q Consensus 138 ~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~ 217 (480) +.+..+. . .+ +|.. ....+. ..+.++ =|+||++. ...+ T Consensus 151 v~i~~~~--~-~~----~y~~-~~~~~~----~~~~~~-----------------------------eVih~r~~-~~d~ 188 (424) T protein:vir:45 151 TTLMNTG--G-RY----TYGL-YNEYGA----FAISPD-----------------------------DMIHIRAL-GNNQ 188 (424) T ss_pred EEEEEcC--C-eE----EEEE-EecCce----EEECcc-----------------------------cEEEecCc-CCCC Confidence 6543221 1 11 0100 001110 011111 13555543 2345 Q ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHH---HHHhcchhhhhcCC-Cccccccccccchhhh-------hcchheecCCC Q lcl|NC_021324. 218 RYGRSEISPELRKVTDAASRTLMNLQSA---SQILGTPLRVISGV-TTDELTNDGENTTLDI-------YYGRILTLASE 286 (480) Q Consensus 218 ~~G~s~i~~~v~~l~D~~~~~~s~~~~~---~~~~~~p~l~i~G~-~~~~~~~~~~~~~~~~-------~~~~~~~~~g~ 286 (480) .+|.|.+. .+.+.+....+-..-. +...+.|-.++.-. ...++..+.-...|.. ..++++.++ + T Consensus 189 ~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~-~ 263 (424) T protein:vir:45 189 KMGLSPIM----QHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLP-A 263 (424) T ss_pred cccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEcC-C Confidence 67877653 3344444333332222 23333455555421 1111100000111211 123455555 3 Q ss_pred CceEEEeccccHH-HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 287 AAKISEFKAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIA 365 (480) Q Consensus 287 ~~~~~~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li 365 (480) +.++.+++.+..+ .|++..+....+|+..-++|+..+|....+.-|. +..... ..+...|.-.++.+ T Consensus 264 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~eq~~~----------~f~~~tL~P~~~~i 331 (424) T protein:vir:45 264 DLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSN--ISAQAI----------QFVRYTMMPWVTNW 331 (424) T ss_pred CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHHHHH----------HHHHHHHHHHHHHH Confidence 4688777644322 3678888888999999999999998643221121 111111 11111222222221 Q ss_pred H-----HHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHH Q lcl|NC_021324. 366 M-----QIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDM 440 (480) Q Consensus 366 ~-----~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~ 440 (480) . .+...........+++.+......|..++++.+.+++++| +++...+++.+|+.+-+ .- T Consensus 332 e~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g--~~T~NE~R~~~gl~pi~--gg----------- 396 (424) T protein:vir:45 332 EQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDG--WMSRNEARAFEDMNPVE--GL----------- 396 (424) T ss_pred HHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--Cc----------- Confidence 1 1111111111122444444555678889999999998876 66666677777765321 00 Q ss_pred HHHHhhhcccC-CCcCCCCCCCCCCccCCCCCCC Q lcl|NC_021324. 441 IDTLYSTTKAQ-ADATPKPTVTETKTETQTSPSG 473 (480) Q Consensus 441 ~~~~~~~~~~~-~~~~~~~~~~d~~~~~~~~~~~ 473 (480) +.+....... +.....+..++...+. . T Consensus 397 -D~~~~~~n~~~~~~~~~~~~~~~~~~~-----~ 424 (424) T protein:vir:45 397 -DEMLVSVNAANPAGDFKPPKNDEGKTN-----E 424 (424) T ss_pred -ceeeecccccccccccCCCCCCCCCCC-----C Confidence 0000000000 0001111111111100 1 No 242 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=96.71 E-value=0.00039 Score=39.27 Aligned_cols=372 Identities=11% Similarity=0.004 Sum_probs=159.1 Q ss_pred HHHHHHhhccCcch----------------hcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCC--ch-- Q lcl|NC_021324. 23 LEAEAYRNGTRRLK----------------TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED--SE-- 79 (480) Q Consensus 23 ~~~~~YY~G~~~i~----------------~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d--~~-- 79 (480) ..+.++++++..-. ..+..+.+. .-+...-...+|+..+.-+-.-++.+ ..+ .+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~---~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~ 77 (419) T protein:vir:57 1 MFIPQFWKGRPSENRVNWQVVPGGMRSSSSQAGVIITPE---TALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIA 77 (419) T ss_pred CcchhhhccCCccccccccccccccccccccCCceechH---HhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceecc Confidence 22223333321100 000111110 00111112334444333332223322 111 11 Q ss_pred hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 80 GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 80 ~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ....+..++.. | ....+...+..+.+.+|.||+++..+ ..|.+ .+..++|..+.+..+... . T Consensus 78 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~------~~G~~~~L~pl~~~~v~v~~~~~g--~---- 145 (419) T protein:vir:57 78 FDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRN------GRGDITELIPINPHKVIVLKGPDG--M---- 145 (419) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEEcCcceEEEECCCc--e---- Confidence 12345555532 2 34566778888999999999988653 34554 667788877776553321 1 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) .+|.. ...+ .++..+. |+|+++.. ..+.+|.|.|.. +...++ T Consensus 146 -~~y~~-~~~~-----~~~~~~~-----------------------------vih~r~~~-~d~~~G~s~i~~-~~~~i~ 187 (419) T protein:vir:57 146 -PYYDI-PSIG-----EILPMRM-----------------------------VHHIKSFS-LDGYIGTSPIQT-NPDVLG 187 (419) T ss_pred -EEEEE-cCCc-----eEEchhh-----------------------------EEEecCcC-CCCcccccHHHH-HHHHHH Confidence 11111 1111 0111111 24454432 345778886542 333333 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhcCC-Ccccccccccc----chhhh------hcchheecCCCCceEEEecccc-HHHH Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGEN----TTLDI------YYGRILTLASEAAKISEFKAAE-LRNF 301 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~----~~~~~------~~~~~~~~~g~~~~~~~~~~~~-~~~~ 301 (480) ....+.....+.+...+.|-.+|.-- .......++.. ..|.. ..++++.++ ++.++.+++... -..| T Consensus 188 ~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~-~g~~~~~l~~~~~d~q~ 266 (419) T protein:vir:57 188 LGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQ-EGMTYKQLSQDNEKAQL 266 (419) T ss_pred HHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecC-CCceEEEcCCChhhHHH Confidence 32222222222233333454444311 01111111110 11111 123455554 446887776432 2247 Q ss_pred HHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCCCccc Q lcl|NC_021324. 302 AEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGREVTEE 376 (480) Q Consensus 302 ~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~~~~~~ 376 (480) ++..+....+|+..-++|+..+|....+.-|. +..... ..+...|.-++..+.. +...... . T Consensus 267 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~~~~----------~f~~~~l~P~~~~ie~~l~~~ll~~~~~-~ 333 (419) T protein:vir:57 267 LQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNN--IEHQGL----------QYVIYTMLAILKRHESAMMRDLLLPSER-R 333 (419) T ss_pred HHHHHHHHHHHHHHhCCCHHHhCCCCCCcccc--HHHHHH----------HHHHHHHHHHHHHHHHHHHhhccCcccc-C Confidence 78878888999999999999998643221121 111111 1112223222222211 1111111 1 Q ss_pred ceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCC Q lcl|NC_021324. 377 YTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATP 456 (480) Q Consensus 377 ~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (480) ...+++.+......|..++++.+.+++.+| +++...+++.+|+.+-+- - +.+.-.... .+.. T Consensus 334 ~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--g------------D~~~~~~n~--~~~~ 395 (419) T protein:vir:57 334 DFYIEFNVSSLLRGDQKSRYESYALGRQWG--WLSVNDIRRMENLTPIPG--G------------DKYLTPLNM--VDSK 395 (419) T ss_pred CeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------Ceeeecccc--cccc Confidence 123444445566678899999999988876 667777888888754221 0 000000000 0000 Q ss_pred CCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 457 KPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 457 ~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ... +.++...+..+.......|| T Consensus 396 ~~~-~~~~~~~~~~~~~~~~~~~~ 418 (419) T protein:vir:57 396 ALT-GIGKATPQQLKDIEAILCTR 418 (419) T ss_pred ccc-cccCCCcccCcchhhhhhcc Confidence 000 00111122223333344555 No 243 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=96.69 E-value=0.0004 Score=39.19 Aligned_cols=260 Identities=11% Similarity=0.015 Sum_probs=113.7 Q ss_pred HhhhccCcccc-CCCchhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEccc Q lcl|NC_021324. 64 SDRLDIEGFRI-SEDSEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPL 136 (480) Q Consensus 64 ~~~l~~~~~~~-~~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~ 136 (480) +. .-++.+ ..++.....+..++.. | ....+...+..+.+.+|.||+.+..+ .+|++ .+..++|. T Consensus 1 ia---~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~------~~G~~~~l~~l~~~ 71 (278) T protein:vir:78 1 MA---SLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERD------IYHQPSKLFLLNPD 71 (278) T ss_pred Cc---cceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEEC------CCCcEEEEEEECCc Confidence 11 111211 1111222333343321 2 24466778889999999999988753 34554 66778888 Q ss_pred ceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 137 YMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) .+.+..+.... ..+ |......+.. ..+.+++ |+||++..... T Consensus 72 ~v~v~~~~~~~--~~~----y~~~~~~g~~---~~~~~~e-----------------------------vih~~~~~~~~ 113 (278) T protein:vir:78 72 VVEMLIENQSR--ELY----YSIHAATGNK---LIVHNMD-----------------------------MLHFKHIVASN 113 (278) T ss_pred eeEEEEcCCCc--eEE----EEEEcCCceE---EEEcccc-----------------------------EEEECCCCCCC Confidence 88776654322 111 1111111111 0112222 45665443445 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccchhh---hhcchheecCCCCceEE Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTTLD---IYYGRILTLASEAAKIS 291 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~---~~~~~~~~~~g~~~~~~ 291 (480) +++|.|.+.. + .+.++...+-....+..++.+-..+. +....++..+.-...++ ...++++.++ ++.++. T Consensus 114 ~~~G~s~~~~-~---~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~ 188 (278) T protein:vir:78 114 MVQGISPIDV-L---KNTTDFDNAVRTFNLTEMQKPDSFMLKYGSNVGKEKRQQVLEDFKQYYEENGGILFQE-PGVEIE 188 (278) T ss_pred CeeeccHHHH-H---HHHHHHHHHHHHHHHHHhcCCCcEEEEeCCCCCHHHHHHHHHHHHHHhccCCCceecC-CCceEE Confidence 6778876543 3 33333322222223344443322222 11111111111011111 1234455555 346777 Q ss_pred Eeccc-cHHHHHHHHHHHHHHHHhhcCCChHHhccccc-cchHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 292 EFKAA-ELRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAE-AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQI 368 (480) Q Consensus 292 ~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~-Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 368 (480) +++.+ .-..+++..+....+|+..-++|+..+|.... +.++.. +.+ ..+...|.-+++.+..- T Consensus 189 ~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~--------------~~~~~~l~P~~~~i~~~ 254 (278) T protein:vir:78 189 PLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNR--------------FYLQHTLLPIVKQYEEE 254 (278) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--------------HHHHHHHHHHHHHHHHH Confidence 77643 22346777888899999999999999986432 222211 111 11111222222222111 Q ss_pred hCCC-Cc-cc-ceeeeEEecCCCC Q lcl|NC_021324. 369 MGRE-VT-EE-YTRLETVWRDPST 389 (480) Q Consensus 369 ~~~~-~~-~~-~~~~~v~f~~~~~ 389 (480) .... .. .+ .....+.|.-..- T Consensus 255 ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 255 FNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHhhcCChhHhcCCceEEEecccC Confidence 1100 00 00 0123355521111 No 244 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=96.59 E-value=0.00049 Score=38.74 Aligned_cols=372 Identities=13% Similarity=0.095 Sum_probs=136.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHHH-HHHHHHhhccCcchhcCcccchhhhcceec-cchHHHHHHHHHhhhccCcccc---C Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNL-LEAEAYRNGTRRLKTIGIGAPPELAYLDVQ-PGWVATYLRTLSDRLDIEGFRI---S 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~-~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~-~n~~~~iVd~~~~~l~~~~~~~---~ 75 (480) |- +.+.+.++...- ..-..++-........+. ..+ .++. ......+|+..++-+-.-++.+ . T Consensus 1 Mg--------~~~~f~~k~~~~~~~~~~~~~~~~~~~~~~~---~~~--~~~~~~~~V~~~I~~ia~~iA~~p~~~~~~~ 67 (403) T protein:vir:80 1 MG--------LFNFFRRKTRSEPTNAISWFLTQEAYDTLAI---PGY--TRLSDNPEVRMAVHKIAELISSMTIHLMQNT 67 (403) T ss_pred Cc--------ccccccccccccccchhhhhccccccccccc---chh--hhhhhhHHHHHHHHHHHHhhhhCceEEEEec Confidence 10 000010000000 000000000000000000 000 0111 1122334444444332223321 1 Q ss_pred CC--chhHHHHHHHHHh--cC---HHHHHHHHHHHHhhc--CeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCC Q lcl|NC_021324. 76 ED--SEGLEELWNWWQA--ND---LDEESVLGHDDSLTF--GRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPR 145 (480) Q Consensus 76 ~d--~~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~--G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~ 145 (480) ++ ......+..++.. |. -..+...++.+.+.. |.||+++.. +..|++ .+..++|..+.++.+.. T Consensus 68 ~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~------~~~g~~~~L~~l~p~~v~~~~~~~ 141 (403) T protein:vir:80 68 DNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKY------TTSGLIDELIPLAPSKVSFVDTDT 141 (403) T ss_pred CCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEE------cCCCcEEEEEEEcCCeeEEEEcCC Confidence 11 1122334444432 32 224455556667665 556776643 234555 45567777766554432 Q ss_pred CCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccc-cCccCCCccc Q lcl|NC_021324. 146 NTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPR-LGNRYGRSEI 224 (480) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~-~~~~~G~s~i 224 (480) .. ..+|. . ..|.++ -|+||+.+.. .....|.|.+ T Consensus 142 g~------~~~y~----~------~~~~~~-----------------------------eiih~~~~~~~~~~~~G~s~~ 176 (403) T protein:vir:80 142 GY------QIWYQ----G------KAYNYD-----------------------------EVLHFIVNPDPEKPYMGRGYR 176 (403) T ss_pred ce------EEEEe----e------cccchh-----------------------------hEEEEeccCCCcCccccccHH Confidence 10 00110 0 011111 2345553222 2334576643 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcC-CCccccccccccch-hhh-----hcchheecCCCCceEEEe- Q lcl|NC_021324. 225 SPELRKVTDAASRTLMNLQSASQILG---TPLRVISG-VTTDELTNDGENTT-LDI-----YYGRILTLASEAAKISEF- 293 (480) Q Consensus 225 ~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i~G-~~~~~~~~~~~~~~-~~~-----~~~~~~~~~g~~~~~~~~- 293 (480) ..+.+.++....-..-...++. .|-.++.- ....+...+..... .+. ..+..+.++++..++.++ T Consensus 177 ----~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 252 (403) T protein:vir:80 177 ----VVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQVK 252 (403) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCeeeecccccccceec Confidence 3344444444332222223333 33434321 11111111111111 111 123344444443334343 Q ss_pred --ccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_021324. 294 --KAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR 371 (480) Q Consensus 294 --~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 371 (480) +..+. .+++..+....+|+..-++|++-+|....+ +.....+. ...|.-++..+..-+.. T Consensus 253 ~l~~~d~-q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~~~~~f~---------------~~~l~P~~~~ie~~l~~ 314 (403) T protein:vir:80 253 PLSLKDL-AIHETVELDKRTVAGIFGVPAFLLGVGKYD--KDEYNNFI---------------NSTILPIAKGIEQELTR 314 (403) T ss_pred cCCHHHH-HHHHHHHHhHHHHHHHhCCCHHHcCCCCcc--HHHHHHHH---------------HHHHHHHHHHHHHHHHH Confidence 22233 367888888899999999999998743221 22111111 11222222211111100 Q ss_pred -CC-cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|NC_021324. 372 -EV-TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTK 449 (480) Q Consensus 372 -~~-~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~ 449 (480) -. ..+ ..+++........|..++++.+.+++.+| +++...+++.+|+.+.+- -. .+..... T Consensus 315 kll~~~~-~~~~f~~~~ll~~d~~~~~~~~~~~~~~G--i~t~NE~R~~~gl~p~~g--gd------------~~~~~~n 377 (403) T protein:vir:80 315 KLLISPD-LYFKFNPRSLYAYDLKELAEVGSNMYVRG--LMEGNEVRDWLGLSPKEG--LS------------ELVILEN 377 (403) T ss_pred hccCCCC-cEEEeechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CC------------eEeeccc Confidence 01 111 12333334455678889999999998876 667777788888764321 00 0000000 Q ss_pred cCCCcCCCCCCCCCCccCCCCCCCCCcccC Q lcl|NC_021324. 450 AQADATPKPTVTETKTETQTSPSGFNRTKT 479 (480) Q Consensus 450 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 479 (480) - .+ -...++......++.++.++... T Consensus 378 ~--~p--l~~~~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 378 Y--IP--LDKIGDQNKLKGGEKGGADGQTD 403 (403) T ss_pred c--cc--hhhccchhhccCCCCCCCCCCCC Confidence 0 00 00000111111111111111111 No 245 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=96.58 E-value=0.0005 Score=38.69 Aligned_cols=384 Identities=13% Similarity=0.088 Sum_probs=165.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc--CCC- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI--SED- 77 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~--~~d- 77 (480) -=|+..+-.-|...-.-...++..| ||+--. ......-.+.+-...+....|++ +.+ T Consensus 39 gltp~~l~~iL~~a~~gd~~~~~~L--~~dm~~------------------~D~hi~s~l~~Rk~av~~~~w~I~p~~~~ 98 (512) T protein:vir:19 39 GVTPNRAAQMLRDAERGDLTAQADL--AFDMEE------------------KDTHLFSELSKRRLAIQALEWRIAPARDA 98 (512) T ss_pred CCCHHHHHHHHHHhhCCCHHHHHHH--HHHHHh------------------hChHHHHHHHHHHHHHhCCCceEecCCCC Confidence 0112222221111111112222221 222110 01222223333333334445544 222 Q ss_pred ----chhHHHHHHHHHhc-CHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeE---EEEEcccceEEEEeCCCCC Q lcl|NC_021324. 78 ----SEGLEELWNWWQAN-DLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPL---IRVESPLYMYAELDPRNTR 148 (480) Q Consensus 78 ----~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~---~~~~~p~~~~~v~d~~~~~ 148 (480) .+..+.+.+.+..- +|+..+..+ .++..+|.+ ++++|... +|... +.+++|+.. .|++.... T Consensus 99 ~~~~~~~a~~v~~~l~~~~~f~~~~~~l-ldA~~~G~s~~Ei~w~~~------~g~~~~~~~~~r~~~~f--~~~~~~~~ 169 (512) T protein:vir:19 99 SAQEKKDADMLNEYLHDAAWFEDALFDA-GDAILKGYSMQEIEWGWL------GKMRVPVALHHRDPALF--CANPDNLN 169 (512) T ss_pred CHHHHHHHHHHHHHHhcCCCHHHHHHHH-HhhhhhcceeeeeEeeee------CCceeeeeeeeeccccc--eeccCCCc Confidence 13334566666543 577777765 578889987 45556321 22222 223333211 12211110 Q ss_pred ceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHH Q lcl|NC_021324. 149 RVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPEL 228 (480) Q Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v 228 (480) . + .++... . ...+. .+++ ++.+.++...+.++|.+-+.. + T Consensus 170 ~----l------------------------r~~~~~-~------~G~~l-~~~k---~i~~~~~~~~g~p~g~gLlr~-~ 209 (512) T protein:vir:19 170 E----L------------------------RLRDAS-Y------HGLEL-QPFG---WFMHRAKSRTGYVGTNGLVRT-L 209 (512) T ss_pred E----E------------------------EecCCC-C------Cceee-cCCc---eEEEeccCCCCCcccccHHHH-H Confidence 0 0 000000 0 00011 1222 233455666777888876643 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc----chhhhhcchheec-CCCCceEEEeccccHHHHHH Q lcl|NC_021324. 229 RKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN----TTLDIYYGRILTL-ASEAAKISEFKAAELRNFAE 303 (480) Q Consensus 229 ~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~----~~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~ 303 (480) --.-=-=+..+.+++.-++.|+.|+++.+- +....++... .+..+..+....+ .|...+|.+....+...|.. T Consensus 210 ~w~~~fK~~~~~~w~~f~E~yG~P~~igky--~~~a~~~ek~~L~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~ 287 (512) T protein:vir:19 210 IWPFIFKNYSVRDFAEFLEIYGLPMRVGKY--PTGSTNREKATLMQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMA 287 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHcCCCeeEEec--CCCCCHHHHHHHHHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHH Confidence 222222367778888889999999877652 1111111111 1222333333333 34455666654445556777 Q ss_pred HHHHHHHHHHhhcCCChHHhccc--cccchHH-HHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhCCCCcccce Q lcl|NC_021324. 304 EMEVFRKEAASITGLPPQYLSSS--SENPASA-EAI-IATDSRIVKMAERKGRIFGGAWE-RAMRIAMQIMGREVTEEYT 378 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~--~~~~~Sg-~Al-~~~~~~l~~k~~~~~~~~~~~l~-~~~~li~~~~~~~~~~~~~ 378 (480) .++..-.+|+... +|.+ +.+..+| -|+ .....-....+..-.+.+...+. ++++-++.+.......... T Consensus 288 li~~~d~~Isk~i------LGqtlTs~~g~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~~~~ 361 (512) T protein:vir:19 288 MIGWAEKAISKAI------LGGTLTTEAGDKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTIDINR 361 (512) T ss_pred HHHHHHHHHHHHH------hhhhhcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCccc Confidence 7776667776642 2221 1110111 122 11222233344444566666674 4666666665443322223 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) ...+.|....+.++...++.+.++. .| ..++.+.+.+.+|+......+- .. . .....+.+.... T Consensus 362 ~p~~~f~~~e~eDl~~~a~~~~~l~-~G-~~i~~~~i~e~~Gip~~~~~e~--~~--------~----~~~~~~~~~~~~ 425 (512) T protein:vir:19 362 LPGIVFDTSEAGDITALSDAIPKLA-AG-MRIPVSWIQEKLHIPQPVGDEA--VF--------T----IQPVVPDNGSQK 425 (512) T ss_pred cceEEecCCChhhHHHHHHHHHHHh-cC-CCCCHHHHHHHhCCCCCCCccc--cc--------c----CCCccccccccc Confidence 3567888788889899999998886 55 4579999999999753321100 00 0 000000000000 Q ss_pred CCCCCCccCCCCCC--CCCcc--cCC Q lcl|NC_021324. 459 TVTETKTETQTSPS--GFNRT--KTR 480 (480) Q Consensus 459 ~~~d~~~~~~~~~~--~~~~~--~~~ 480 (480) . .....+..|. ..+.. +.+ T Consensus 426 ~---~~~~~~~~~~~~~~d~~~~~~~ 448 (512) T protein:vir:19 426 E---AALSAEDIPQEDDIDRMGVSPE 448 (512) T ss_pred c---ccccccCCCchhhHhHHhhhHH Confidence 0 0000000000 00000 000 No 246 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=96.54 E-value=0.00053 Score=38.53 Aligned_cols=402 Identities=14% Similarity=0.017 Sum_probs=145.6 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHH-----------------------HHhhccCcc--hhcCcccchhhhcceeccchHHH Q lcl|NC_021324. 5 HEHVERLQGLLARD-LPNLLEAE-----------------------AYRNGTRRL--KTIGIGAPPELAYLDVQPGWVAT 58 (480) Q Consensus 5 ~~~i~~l~~~~~~~-~~~~~~~~-----------------------~YY~G~~~i--~~~~~~~~~~~~~~~~~~n~~~~ 58 (480) .-|+.+|....... ......+. .+-.|.... +..+..+.. ..-..+..... T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~---~~a~~~~~v~~ 77 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLAT---QAYQANGPVFA 77 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccch---hhhhccHHHHH Confidence 45555555543210 00111110 010110000 000000100 11122334445 Q ss_pred HHHHHHhhhccCcccc---CCC--ch-hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccC--CCCc Q lcl|NC_021324. 59 YLRTLSDRLDIEGFRI---SED--SE-GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESG--DPAG 126 (480) Q Consensus 59 iVd~~~~~l~~~~~~~---~~d--~~-~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~--d~~g 126 (480) +|+..++-+-.-++.+ .++ .+ ....+..++.+ | ....+...+..+.+.+|.||+.+..+..+.. +..| T Consensus 78 ~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~~g 157 (466) T protein:vir:81 78 CMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDWVD 157 (466) T ss_pred HHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccccccCc Confidence 5555544443334432 111 11 11234444433 2 2345677788899999999999876543321 2223 Q ss_pred e-eEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc Q lcl|NC_021324. 127 I-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP 205 (480) Q Consensus 127 ~-~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 205 (480) . ..+..++|..+.+..+....... .+++.............|.++ - T Consensus 158 ~~~~l~~l~~~~v~~~~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~~-----------------------------d 204 (466) T protein:vir:81 158 VVVEERMVRGGRGELGGGQLGWRKV----GYLYTEGGRQSGNESVGFLAE-----------------------------D 204 (466) T ss_pred ceeEEEEecCcceEEEEcCCCceEE----EEEEEecCcccccceeeeccc-----------------------------c Confidence 3 34556666666665543322111 111100000000000011111 2 Q ss_pred eEEeeccc-ccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcC-CCccccccccccchhhh------hc Q lcl|NC_021324. 206 VVPLTNDP-RLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISG-VTTDELTNDGENTTLDI------YY 277 (480) Q Consensus 206 vv~~~n~~-~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G-~~~~~~~~~~~~~~~~~------~~ 277 (480) |+||++.. ...+.+|.|.+.- ....++....+.......+...+.|-.++.- ....++..+.-...|.- .. T Consensus 205 viHir~~~~~~d~~~G~s~i~~-~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~ 283 (466) T protein:vir:81 205 VVHFAPIPDPLASYRGMSWLTP-ILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMADPAAVKKWADEVNSKHAGVDNA 283 (466) T ss_pred EEEEcCCCCcccccccccHHHH-HHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCCHHHHHHHHHHHHHHhcCcccc Confidence 46666432 2345678876532 3333332222222222223333445544431 11111111111111211 11 Q ss_pred chheecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccc-cchHHHHHHHHHHHHHHHH-HHHHHHH Q lcl|NC_021324. 278 GRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIVKMA-ERKGRIF 354 (480) Q Consensus 278 ~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~~~l~~k~-~~~~~~~ 354 (480) ++++.++ ++.++.+++... -..+++..+..+.+|+..-++|+..+|.... +.+++..++.....+...+ .-....+ T Consensus 284 g~~~vl~-~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~~tl~P~~~~i 362 (466) T protein:vir:81 284 WKNLNLY-PGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLADGTAHPLWQNL 362 (466) T ss_pred ccceEcC-CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHHHHHHHHHHHH Confidence 3455555 346777776432 2247788888999999999999999985422 1122222222222211111 1111111 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEec--CCCCCCHHHHHHH-------HHHHHhccccCCCHHHHHHhCCCChhH Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVWR--DPSTPTVAAKADA-------VSKLYANGQGPIPKEQARIDLGYTATQ 425 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~--~~~~~~~~~~ad~-------~~kl~~~~~~~~s~e~~~~~~~~~~d~ 425 (480) ...|.+ . +..... ...+.+.|. ..+-.+..+++++ +..++++ |+++++ T Consensus 363 e~~l~~---~---L~~~~~---~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~--------------g~t~nE 419 (466) T protein:vir:81 363 SGCIGH---V---MPDMGP---DVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITA--------------GYEPES 419 (466) T ss_pred HHHHHh---h---cCCccc---CcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHc--------------CCChhh Confidence 111211 1 111111 112344552 3333455555543 2222222 333333 Q ss_pred HHHHHHH-----HHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCC Q lcl|NC_021324. 426 REQMRDW-----DKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFN 475 (480) Q Consensus 426 ~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 475 (480) ...+.+- .........+.+.. +.+ .+..++......++. .+| T Consensus 420 ~r~~~~~gd~~~~~~~~~~~~~~~~~---~~~----~~~~~~~~~~~Gg~~-ngn 466 (466) T protein:vir:81 420 VVAAVNSGDLRLLKHTGLTSVQLLPP---GVS----ASASSDTPTSGGADD-NGN 466 (466) T ss_pred ccccccCCccccccCCCcchhhhccc---ccc----cccCCCCcccCCCCc-CCC Confidence 2211100 00000000000000 000 000011111111111 112 No 247 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=96.51 E-value=0.00055 Score=38.44 Aligned_cols=399 Identities=13% Similarity=0.081 Sum_probs=155.6 Q ss_pred CCCHHH--HHHHHHHHHHHHHHHHH---HHHHHhhccC-----cchhcCcccchhhhcceeccchHHHHHHHHHhhhccC Q lcl|NC_021324. 1 MTTYHE--HVERLQGLLARDLPNLL---EAEAYRNGTR-----RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE 70 (480) Q Consensus 1 ~~~~~~--~i~~l~~~~~~~~~~~~---~~~~YY~G~~-----~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~ 70 (480) |.+++- ++.++...+....+... .......+.. .....+..+.+.. -.+ +.=...+|+..++-+-.- T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~-a~~--~~aV~~~v~~Ia~~ia~l 77 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADA-IMR--LDAVAACVKLVSQAVAAM 77 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHh-hhc--chHHHHHHHHHHHhhccC Confidence 555432 22222222211111000 0000000000 0000111111110 001 111112223222222222 Q ss_pred cccc---CCC---chhHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccce Q lcl|NC_021324. 71 GFRI---SED---SEGLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYM 138 (480) Q Consensus 71 ~~~~---~~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~ 138 (480) ++.+ ..| +.....+..++.. | .-..+...+..+.+.+|.||+++... +|++ .+..++|..+ T Consensus 78 p~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-------~g~~~~L~~l~p~~v 150 (432) T protein:vir:97 78 PLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-------DGRIESLQYLANDRL 150 (432) T ss_pred ceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-------CCcEEEEEEEcCcce Confidence 3321 111 1112334555432 3 23466777888999999999887642 2443 5667888888 Q ss_pred EEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCcc Q lcl|NC_021324. 139 YAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 139 ~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~ 218 (480) .++.+... .+ .|... ...+.. ..+..+. |+|+++.. ..+. T Consensus 151 ~v~~~~~g--~~----~y~~~-~~~g~~---~~~~~~~-----------------------------iih~r~~~-~dg~ 190 (432) T protein:vir:97 151 TITTDTKG--NT----AYRYR-RTDGQM---IDIPRQQ-----------------------------IWKIMGYS-LDGE 190 (432) T ss_pred EEEEcCCC--cE----EEEEE-ecCceE---EEEcccc-----------------------------EEEecCcC-CCCc Confidence 88776432 11 11111 111110 1122222 24444432 2345 Q ss_pred CCCccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCCCccccccccccchhh------hhcchheecCCCCce Q lcl|NC_021324. 219 YGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGVTTDELTNDGENTTLD------IYYGRILTLASEAAK 289 (480) Q Consensus 219 ~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i~G~~~~~~~~~~~~~~~~------~~~~~~~~~~g~~~~ 289 (480) +|.|.|. .+.+.+...+.-......+|. .|-.++.- +. ...++ ....++ ...++++.++ ++.+ T Consensus 191 ~G~spi~----~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~-~~-~l~~e-~~~~~~~~~~~~~nag~~~vl~-~g~~ 262 (432) T protein:vir:97 191 NGLSAIR----YGAQIFGTAIAAEAQAARAFRNGQLQSVYYQI-DR-FLTDD-QYDSFSKKVSGSVEAGRAPLLE-GGMD 262 (432) T ss_pred ccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCcceeEec-CC-CCCHH-HHHHHHHHHhhhhcCCCceecC-CCce Confidence 6777543 333444443333333333333 34333331 11 11111 111111 1224556665 3467 Q ss_pred EEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 290 ISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENP-ASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ 367 (480) Q Consensus 290 ~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~ 367 (480) +.+++.+. -..+++..+....+|+..-++|+..+|....+. +.|..+......+...+- .-+-..|+..+.. . T Consensus 263 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~tl---~P~~~~ie~~ln~--k 337 (432) T protein:vir:97 263 VKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMTL---SPWLRRIEQSIAL--N 337 (432) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHHH---HHHHHHHHHHHhh--h Confidence 87776432 234677788888999999999999998643221 112223322222221110 0011111111110 1 Q ss_pred HhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_021324. 368 IMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYST 447 (480) Q Consensus 368 ~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 447 (480) +....... ...+++.+......|..++++.+.+++++| +++...+++++|+.+-+-. ...... ...+... T Consensus 338 Ll~~~e~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G--~~T~NE~R~~~glpp~~g~--~~~~~~-----~~~~~pl 407 (432) T protein:vir:97 338 LLTPAERR-RYFADFDTSALLRADSAARSSYYSQLVNNG--LMTRDEAREIEGLPKLGGN--AAVLTV-----QSAMVPL 407 (432) T ss_pred ccCccccC-ceEEEeechhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCC--cceEee-----cccccch Confidence 11111111 122444444555678889999999998876 6677777777776432100 000000 0000000 Q ss_pred cccCCCcCCCCCCCCCCccCCCCCCCCCccc Q lcl|NC_021324. 448 TKAQADATPKPTVTETKTETQTSPSGFNRTK 478 (480) Q Consensus 448 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 478 (480) .....+..+++.. +.+.+...+-++ T Consensus 408 ~~~~~~~~~~~~~------~~~~~~~~~~~~ 432 (432) T protein:vir:97 408 DSIGLQASPEPAS------GLGNQQQDKVSK 432 (432) T ss_pred hhhcccCCCCCCC------CCCCcccccccC Confidence 0000001111111 111111111111 No 248 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=96.51 E-value=0.0005 Score=38.66 Aligned_cols=196 Identities=10% Similarity=0.091 Sum_probs=89.0 Q ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcc--hheecCCCCceEEEecc Q lcl|NC_021324. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYG--RILTLASEAAKISEFKA 295 (480) Q Consensus 218 ~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~~~~~~~~~ 295 (480) -|. +..|.+.+..-......-+. . +....+ ..+.+.+.+-++.+++. T Consensus 1 V~k-------~~~l~~~~~~~~~~~~~r~~-------~-----------------~~~~~~~~~~~~ld~~~e~~e~~~~ 49 (201) T protein:vir:10 1 MWK-------AKGLADLCDDSDGAARLRLA-------Q-----------------VDNNSGVGQAIGIDADSEEYNVLNS 49 (201) T ss_pred Ccc-------chHHHHHhcCChHHHHHHHH-------H-----------------HHHhhhhhhhheeecCCcceeeeec Confidence 221 22222222110000000000 0 000000 11223333334544433 Q ss_pred ccHHHHHHHHHHHHHHHHhhcCCChHHhcccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_021324. 296 AELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREV 373 (480) Q Consensus 296 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~-~~-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~ 373 (480) ++..+-+.+......+++.++||..-|-+.+ .. ++||..=..-|...+. ..++..+.+.|+++++++.. T Consensus 50 -~lsGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~nyyd~i~--~~Qe~~l~p~le~l~~~~~~------ 120 (201) T protein:vir:10 50 -DIGGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALETFYGYVD--RKRKAELLPLLEFLLPFIVT------ 120 (201) T ss_pred -CcCChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHhhcC------ Confidence 3555666777788899999999976665443 22 3467655544544444 34457788889988886441 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCC-hhHHHHHHHHHHHHHHHHHHHHhhhcccCC Q lcl|NC_021324. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYT-ATQREQMRDWDKQETEDMIDTLYSTTKAQA 452 (480) Q Consensus 374 ~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (480) . .++.+.|.+....+..++|+...+.+++. ..+...|.+ ++++.+ .+ ++.. ....-+ T Consensus 121 ~---~~~~~~f~pL~~~s~kekAei~~~~a~a~-------~~~~~~g~i~~~e~r~--~L-~~~~---------~~~~~~ 178 (201) T protein:vir:10 121 E---QEWSVEFNPLSQVSDKDKSEILEKNVNSV-------AALIAAGIIDADEARD--TL-RAIS---------TEVKIG 178 (201) T ss_pred C---CCceEeeCCCCCCCHHHHHHHHHHHHHHH-------HHHHHcCCCCHHHHHH--HH-HhcC---------CcCCCC Confidence 2 35889999999999999998776655432 122334533 333221 11 1110 000111 Q ss_pred CcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 453 DATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 453 ~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) +...++.. +.+.+.+++..+.|+ T Consensus 179 ~~~~~~~~-~~~e~~dp~~~~~~~ 201 (201) T protein:vir:10 179 EGSIQTEV-VINESEDPLDVSANN 201 (201) T ss_pred CCCCCccc-cccccCCCCCCCCCC Confidence 11111111 111122222223333 No 249 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=96.48 E-value=0.00059 Score=38.30 Aligned_cols=392 Identities=10% Similarity=0.009 Sum_probs=142.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-hhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---C--CCc Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAY-RNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---S--EDS 78 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~Y-Y~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~--~d~ 78 (480) .-++++|. .+.........+ ..|.--.............+.-..+.....+|+..++-+-.-++.+ . ++. T Consensus 1 Mg~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg~~ 76 (423) T protein:vir:81 1 MGFLQKLG----LAPSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDGGR 76 (423) T ss_pred CchhHhhc----cccccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCCce Confidence 22222221 000000000000 0000000000000000000111112334455555554443334322 1 111 Q ss_pred -h-hHHHHHHHHHh-c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceE Q lcl|NC_021324. 79 -E-GLEELWNWWQA-N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 79 -~-~~~~l~~~~~~-n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 151 (480) . ....+..++.+ | ....+...+..+.+.+|.||+++..+... .+.. .+..+++..+.+... T Consensus 77 ~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~----~~~~~~l~p~~~~~v~~~~~-------- 144 (423) T protein:vir:81 77 ERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGV----DTPTLDIRPIPVSWVQRRAY-------- 144 (423) T ss_pred eeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCc----CcceEEEeecccceeeeeec-------- Confidence 1 12234555543 2 24566677788999999999988643211 1111 122222221111111 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHH Q lcl|NC_021324. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) .+..+...|...... ...+ ..+ .+..==|+|+++.......+|.|.+. .+ T Consensus 145 --------~~~~~~~~Y~~~~~~--------~~~g--~~~--------~~~~~evih~r~~~~~~~~~G~spi~----~~ 194 (423) T protein:vir:81 145 --------KDGWGSLDYIIIESG--------DNDG--RSV--------KVPGERVIHRHGYNPKTMKRGKSPVQ----SL 194 (423) T ss_pred --------cCCCcceEEEEEEec--------CCCc--eEE--------EEcccceEEecCCCCCCccccccHHH----HH Confidence 011111111000000 0000 000 00000145665443334457888653 34 Q ss_pred HHHHHHHHHHHHHHHHHhc---chhhhhcCC--Cc-cccccccc---cchhhh-------hcchheecCCCCceEEEecc Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILG---TPLRVISGV--TT-DELTNDGE---NTTLDI-------YYGRILTLASEAAKISEFKA 295 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~---~p~l~i~G~--~~-~~~~~~~~---~~~~~~-------~~~~~~~~~g~~~~~~~~~~ 295 (480) .+.+...+.-......+|. .|-.+|+-. .. ....++.. ...|+. ..++++.++ ++.++.+++. T Consensus 195 ~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~-~g~~~~~l~~ 273 (423) T protein:vir:81 195 RDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLE-DGMKAENFHT 273 (423) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecC-CCceEEeccC Confidence 4444444333333333333 454454311 11 01111100 011111 123555665 3467877764 Q ss_pred ccHH-HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CC Q lcl|NC_021324. 296 AELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGR-EV 373 (480) Q Consensus 296 ~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~ 373 (480) +..+ .+++..+....+|+..-++|+..+|....+.-|. ++.....+...+ -.-+-..|++.+.. .+... .. T Consensus 274 s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn--~e~~~~~f~~~~---L~P~~~~ie~~l~~--~L~~~~~~ 346 (423) T protein:vir:81 274 TSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSN--VREFRKALYGDN---LGSWIRIIQDVMNL--FLLPRVGI 346 (423) T ss_pred ChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCccc--HHHHHHHHHHHH---HHHHHHHHHHHHhh--hhcCcccc Confidence 3222 4677777888899999999999887533221121 221111111111 00011112111110 11111 11 Q ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCC Q lcl|NC_021324. 374 TEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQAD 453 (480) Q Consensus 374 ~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (480) ......+++.+......|..++++++.+++.. .++++...+++.+|+.+.+ -. +.+.......+ T Consensus 347 ~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~-~G~~T~NE~R~~~gl~p~~--gG------------D~~~~p~n~~~- 410 (423) T protein:vir:81 347 DNEKFYFEFNLEEKLRASFEEAAEIKRAAVGN-VAWMTINEVRAMDNLPSID--GG------------DDLARPLNTEF- 410 (423) T ss_pred ccCccEEEecchhhhccCHHHHHHHHHHHHhC-CCCcCHHHHHHHhCCCCCC--Cc------------ceeeccccccc- Confidence 11112233334455567888888888776543 2456666666666664332 00 00011111000 Q ss_pred cCCCCCCCCCCccCCCCCCCCCcccC Q lcl|NC_021324. 454 ATPKPTVTETKTETQTSPSGFNRTKT 479 (480) Q Consensus 454 ~~~~~~~~d~~~~~~~~~~~~~~~~~ 479 (480) ..+ .+..++...| T Consensus 411 -~~~------------~~~~~~~~~t 423 (423) T protein:vir:81 411 -GDS------------EDAPGEEVET 423 (423) T ss_pred -Ccc------------CCCCCCCCCC Confidence 000 0111122222 No 250 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=96.31 E-value=0.00076 Score=37.70 Aligned_cols=383 Identities=10% Similarity=0.019 Sum_probs=154.8 Q ss_pred HHHHHHHHHHHH-------HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCCc- Q lcl|NC_021324. 10 RLQGLLARDLPN-------LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDS- 78 (480) Q Consensus 10 ~l~~~~~~~~~~-------~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~- 78 (480) -++..+.....+ +..+.....+... ..+..+.+.. -.+ +.-...+|+..++-+-.-+|.+ ..+. T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~~~~~~~~~~--~~g~~vt~~~-al~--~~~v~~~i~~Ia~~iA~lp~~~~~~~~~g~ 75 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAMLGGVRSSHS--KAGVMITPET-ALA--LSAVRACVTLLAESVAQLPVELYRRDKNGG 75 (421) T ss_pred CCCcchhcccccccCcchhhHHHhhhhccCcc--cCCceechHH-hhc--cHHHHHHHHHHHHhhccCceEEEEEcCCCc Confidence 111111111100 0011111111100 0111111111 011 1112223333333222223322 1111 Q ss_pred -h--hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCc Q lcl|NC_021324. 79 -E--GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRR 149 (480) Q Consensus 79 -~--~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~ 149 (480) + ....+..++.. | ....+...+..+.+.+|.||+++-.+ .+|+| .+..++|..+.++.++.. . T Consensus 76 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~------~~G~~~~L~~l~~~~v~v~~~~~g--~ 147 (421) T protein:vir:10 76 RQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRD------GKGYPKELIPINPKKVIVLKGPDG--M 147 (421) T ss_pred eeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCCcEEEEEEecCceEEEEECCCc--e Confidence 1 11234454432 3 24456677788999999999988753 34555 466777877776554321 1 Q ss_pred eEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHH Q lcl|NC_021324. 150 VTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELR 229 (480) Q Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~ 229 (480) .+|... ..+ .++. .. =|+|+++.. ..+.+|.|.|.- +. T Consensus 148 -----~~y~~~-~~g-----------~~~~---------------------~~--eiih~~~~~-~d~~~G~spi~~-~~ 185 (421) T protein:vir:10 148 -----PYYEIP-EIG-----------ETLP---------------------MR--MMHHVKVFS-LDGYIGSSPIQT-NA 185 (421) T ss_pred -----EEEEEc-CCC-----------cEEc---------------------hh--hEEEecCcC-CCCcccccHHHH-HH Confidence 111111 111 1100 00 135555443 345678876542 33 Q ss_pred HHHHHHHHHHHHHHHHHHHhcchhhhhcCC-Ccccccccccc----chhhh------hcchheecCCCCceEEEeccccH Q lcl|NC_021324. 230 KVTDAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGEN----TTLDI------YYGRILTLASEAAKISEFKAAEL 298 (480) Q Consensus 230 ~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~----~~~~~------~~~~~~~~~g~~~~~~~~~~~~~ 298 (480) ..++....+.....+.+...+.|-.+|.-. .......++.. ..|+- ..++++.++ ++.++.+++.... T Consensus 186 ~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~-~g~~~~~l~~~~~ 264 (421) T protein:vir:10 186 DVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQ-EGMSYKQMSQDNE 264 (421) T ss_pred HHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecC-CCceEEecCCChh Confidence 333322222222222233334454444311 01000011111 11111 123455565 3468887764322 Q ss_pred -HHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCC Q lcl|NC_021324. 299 -RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGRE 372 (480) Q Consensus 299 -~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~~ 372 (480) ..+++..+....+|+..-++|+..+|....+.-|. +...... .+...|.-++..+.. +.... T Consensus 265 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~e~~~~~----------f~~~tl~P~~~~ie~~ln~kL~~~~ 332 (421) T protein:vir:10 265 KAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNN--IEHQGLQ----------FVMYTLLAWLKRHEGALQRDLLLPS 332 (421) T ss_pred HHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCcccc--HHHHHHH----------HHHHHHHHHHHHHHHHHhhhccCcc Confidence 24778888889999999999999987543221121 1111111 111122222222111 11111 Q ss_pred CcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCC Q lcl|NC_021324. 373 VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQA 452 (480) Q Consensus 373 ~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (480) ... ...+++.+......|..+.++.+.+++.+| +++...+++.+|+.+-+ --.+ +.-.... T Consensus 333 ~~~-~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~--ggD~------------~~~~~n~-- 393 (421) T protein:vir:10 333 ERR-DLYIEFNVSGLLRGDQKSRYESYALGRQWG--WLSVNDIRRMENLPPIA--GGDK------------YLTPLNM-- 393 (421) T ss_pred ccC-CeEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--Ccce------------eeecccc-- Confidence 111 122444444555678889999999988776 66777777888775322 0000 0000000 Q ss_pred CcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 453 DATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 453 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ....+...++. .....+++..|...+| T Consensus 394 ~~~~~~~~~~~-~~~~~~~~e~d~~~~~ 420 (421) T protein:vir:10 394 VDSAQIIPGDK-KPTAQQMAEIDTILSR 420 (421) T ss_pred ccccccccCCC-CcccccCccccccccc Confidence 00011110111 1111123334444445 No 251 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=95.65 E-value=0.0017 Score=35.80 Aligned_cols=424 Identities=11% Similarity=0.077 Sum_probs=181.7 Q ss_pred CCCHHHHHHHHHHHHHH-------------------HHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR-------------------DLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLR 61 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~-------------------~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd 61 (480) =-|+...-+.|.+++.. ..+-.-....||-|. +.+... .-..|+.+-..+.=|-..|+ T Consensus 9 ~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~--~~n~~e-LI~~YR~ma~~~pEVd~Aid 85 (533) T protein:vir:58 9 KLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGI--EFNRFF-LYDMYDRMDYTDPLISTVLD 85 (533) T ss_pred hhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccc--cccHHH-HHHHHHHhhccCcchhhHHH Confidence 11111111111111110 000111123344332 111000 01112222112223333333 Q ss_pred HHHhhhc-----cCccccC-CCchhHHHH-HHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEc Q lcl|NC_021324. 62 TLSDRLD-----IEGFRIS-EDSEGLEEL-WNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVES 134 (480) Q Consensus 62 ~~~~~l~-----~~~~~~~-~d~~~~~~l-~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~ 134 (480) ..++-.+ ..++.+. ++.+..+.+ ..+..-.+|+....+..+...+.|+.|++.-.. +.+.|-..++.+| T Consensus 86 eIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik----~~k~GI~elr~lD 161 (533) T protein:vir:58 86 IIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEK----GSDGTIEKFQVVS 161 (533) T ss_pred hhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccC----CcccchhhheecC Confidence 3332221 1122211 122222322 223344579999999999999999999887432 2346777899999 Q ss_pred ccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc---eEEeec Q lcl|NC_021324. 135 PLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP---VVPLTN 211 (480) Q Consensus 135 p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP---vv~~~n 211 (480) |..+-.+++...+. .+-+|++..... .....+ -+|| |+++.. T Consensus 162 Pr~i~~vr~~~t~~-------------------eyyvy~~~~~~~----~s~~~~------------~kI~~daI~y~~S 206 (533) T protein:vir:58 162 PYIFSKRYNPETDT-------------------WYYVITDVYRNV----VSGYFN------------EDIPEEDVIHFSH 206 (533) T ss_pred CeeeEEEEeeccce-------------------EEEeeccccccc----ccCccc------------cccchhheeeeee Confidence 99888877653320 112333321110 000000 0122 222221 Q ss_pred c-cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccch-------------hhhhc Q lcl|NC_021324. 212 D-PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTT-------------LDIYY 277 (480) Q Consensus 212 ~-~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~-------------~~~~~ 277 (480) . .+.....+.|-|-+.|+++.. -+++-|.+.+.+....|-+=++-.+..+.+..+...- ..... T Consensus 207 Gl~d~~~~~iisyLhkAiKp~NQ--LkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~T 284 (533) T protein:vir:58 207 KIDTNFFPYGRSYLESARAIWNQ--LRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQ 284 (533) T ss_pred ccccCCCCceehhhhHHHHHHHH--HHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccC Confidence 1 222345566666665555422 2556666766677777766666555544443321110 00011 Q ss_pred c----------------hhee---cCCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH Q lcl|NC_021324. 278 G----------------RILT---LASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA 338 (480) Q Consensus 278 ~----------------~~~~---~~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~ 338 (480) | -+|. .+|.+.++.+++..++ .-++-++-+...++..-++|.+-++...... -|..|-- T Consensus 285 Gev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~l-gemeDV~YF~kkLy~ALnVP~sRl~~e~~fg-r~~eItR 362 (533) T protein:vir:58 285 NQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQGSKV-DLAEDVEYMLNRLISALKVPKAFIGYEGDVN-AKNTLAT 362 (533) T ss_pred CeEeeccchhhhhhhHhhhcccccCCCccceeeecCCCCC-CcHHHHHHHHHHHHHHhCCCeeecCCCCCCc-cchhhhH Confidence 1 1111 1334567778887654 4556677777888899999988887654321 1223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHH---HHHHhccccCCCHHHH Q lcl|NC_021324. 339 TDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAV---SKLYANGQGPIPKEQA 415 (480) Q Consensus 339 ~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~---~kl~~~~~~~~s~e~~ 415 (480) .......-+.+.+..|..-|++- |+ +.+ .... .++++.|..-........++.+ ..+.+...+.+++.++ T Consensus 363 DEiKF~KFI~rLR~rF~~ll~~q--Li--lk~-iit~--eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi 435 (533) T protein:vir:58 363 QDIKFNNTIKRIQGFFVEELERM--VR--MNK-EFAD--QDFRLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVREDWI 435 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHhcc--cc--ccc-Ccch--hheeeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhHHHH Confidence 33334445555555565555441 21 122 2222 2356777544433333333333 2344455678888887 Q ss_pred HHh-CCCChhHHHHHHHHHHHHHHH-----------HHHHHhhhcccCCCcCCCCCCC---CCCccCCCCCC-------- Q lcl|NC_021324. 416 RID-LGYTATQREQMRDWDKQETED-----------MIDTLYSTTKAQADATPKPTVT---ETKTETQTSPS-------- 472 (480) Q Consensus 416 ~~~-~~~~~d~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~---d~~~~~~~~~~-------- 472 (480) +.. |..+++.+++.+. ++++... +......+.. .++.+.|... +.+++.+..+. T Consensus 436 ~k~ILr~tdei~~q~e~-ie~E~~~~~~~~~~~~~e~~~~~~~~~~--~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 512 (533) T protein:vir:58 436 YSNILQIPYDLKPQEEV-AEAAGGGGLFDTGGFGEETTPADFLGER--GSPIESPRGRTEFDFGTEGGEELGGELNLGGA 512 (533) T ss_pred HHHHhcCChhhhHHHHH-HHHhhcCCCCCCCCcccccCCcccCccc--cCcccCCCChhhHhcccCCccccccccccccc Confidence 755 6788754444332 2222111 0000001111 1111111100 11111111111 Q ss_pred -----CCCcccCC Q lcl|NC_021324. 473 -----GFNRTKTR 480 (480) Q Consensus 473 -----~~~~~~~~ 480 (480) ..+++.++ T Consensus 513 ~~~~~~~~g~~~~ 525 (533) T protein:vir:58 513 FEEFEEETGGGEE 525 (533) T ss_pred chhhhhhcCCccc Confidence 11222233 No 252 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=95.28 E-value=0.0024 Score=34.98 Aligned_cols=356 Identities=10% Similarity=0.049 Sum_probs=140.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~ 79 (480) |. +..++.++. ......|++.- . ..+.. ...........+|+..++-+-.-++.+- .+.. T Consensus 1 Mg----~f~~~f~~~-------~~~~~~~~~~~-~----~~~~~---~~a~~~~~v~~~i~~ia~~ia~~p~~~~~~~~~ 61 (385) T protein:vir:95 1 MG----LFDSVFKRH-------SELSWMYDLEF-L----QDKSK---KAYLKQIALNTVVEMVARTISQSEFRVMKNNTK 61 (385) T ss_pred Cc----hhhhhhccC-------cccccccchhh-h----hccch---hhhhhhHHHHHHHHHHHHHHcccceeeeecCcc Confidence 22 223222211 11111111110 0 00000 1111122334445544444433344331 1222 Q ss_pred hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEE-EEEcccceEEEEeCCCCCceEEE Q lcl|NC_021324. 80 GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLI-RVESPLYMYAELDPRNTRRVTRA 153 (480) Q Consensus 80 ~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~-~~~~p~~~~~v~d~~~~~~~~~~ 153 (480) ....+..++.. | ....+...++.+.+.+|.||++.... .+.+.. .++.|.. ..++.. . T Consensus 62 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~-------~~~~~~~~~~~~~~-~~~~~~------~-- 125 (385) T protein:vir:95 62 EKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDE-------GHFFVADDFEKEDE-LGLYSH------R-- 125 (385) T ss_pred ccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecC-------CCeeeccccccccc-cccccc------c-- Confidence 23344555532 3 23466777888999999999765321 111111 1111110 000000 0 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) ++ .+...+..+. ..+..--|+||++.......+|.|.+. .+.. T Consensus 126 --~~------------~~~~~~~~~~-------------------~~~~~~eiih~~~~~~~~~~~G~s~~~----~~~~ 168 (385) T protein:vir:95 126 --FT------------NVLVNDFEFK-------------------RVFTMDDVIYLKYNNQKLDAFSLGLFE----DYGE 168 (385) T ss_pred --ce------------eeeeccccee-------------------eeeccccEEEecCCCCCcccccchHHH----HHHH Confidence 00 0000000000 001111246666554444566776442 2333 Q ss_pred HHHHHHHHHHHHHHHhcchhhhhc--CC-Cccccccccccchhhh-------hcchheecCCCCceEEEeccc------- Q lcl|NC_021324. 234 AASRTLMNLQSASQILGTPLRVIS--GV-TTDELTNDGENTTLDI-------YYGRILTLASEAAKISEFKAA------- 296 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~~p~l~i~--G~-~~~~~~~~~~~~~~~~-------~~~~~~~~~g~~~~~~~~~~~------- 296 (480) .+...++ ...+.+.+--++. +. ..++...+.-...|+. ..+.++.++ ++.++.+++.. T Consensus 169 ~i~~~~~----~~~~~~~~~g~l~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~-~g~~~~~l~~~~~~~~s~ 243 (385) T protein:vir:95 169 IFGRMID----LQMLNNQIRGILKVDATKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLT-EGLAYEEHSNRGAAQSAQ 243 (385) T ss_pred HHHHHHH----HHHhcCCCceEEEeCCccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcC-CCceeEeecccccccCCH Confidence 3333222 2223333322222 11 1111110100111111 122344454 44677665421 Q ss_pred cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCC Q lcl|NC_021324. 297 ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGR 371 (480) Q Consensus 297 ~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~ 371 (480) .-..|++..+....+|+.+-++|+..++++..| .+...+. .+...|.-++..+.. +... T Consensus 244 ~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~sn-~e~~~~~---------------~~~~~l~P~~~~ie~~l~~~L~~~ 307 (385) T protein:vir:95 244 QFSELNELKKTVLTDVARMIGVPPSLVLGEMAD-LEKTIES---------------YLQFCINPLLRKIEAELNSKFFYQ 307 (385) T ss_pred HHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcC-HHHHHHH---------------HHHHHHHHHHHHHHHHHHhhcCCh Confidence 123578888889999999999999999754322 1221222 222222222222211 1111 Q ss_pred CCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccC Q lcl|NC_021324. 372 EVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQ 451 (480) Q Consensus 372 ~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 451 (480) ..... ..+++.+......|..+.++++.+++.+| +++...+++.+|+.+-+-+.-.+ +.- +.+ T Consensus 308 ~~~~~-~~~~fd~~~l~~~D~~~~~~~~~~~~~~g--~lt~NE~R~~~g~~p~~~~~gd~------------~~~--~~n 370 (385) T protein:vir:95 308 DEYLN-DDMHIKVVGIDKRDPLKLSEAIDKLVASG--TFTRNQVRIMTGEEPADDPELDK------------FII--TKN 370 (385) T ss_pred hhccc-ceEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCce------------eee--ccc Confidence 11111 23555555666778889999999998876 56667777777765321000000 000 000 Q ss_pred CCcCCCCCCCCCCcc Q lcl|NC_021324. 452 ADATPKPTVTETKTE 466 (480) Q Consensus 452 ~~~~~~~~~~d~~~~ 466 (480) ..+.....++|.+++ T Consensus 371 ~~~~~~~kgge~~~e 385 (385) T protein:vir:95 371 LQSADAFKGGESNEE 385 (385) T ss_pred ceecccccCCCCCCC Confidence 111111222222222 No 253 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=95.05 E-value=0.0029 Score=34.51 Aligned_cols=387 Identities=10% Similarity=0.004 Sum_probs=154.9 Q ss_pred HHHHHHHHHHHHH--HHHHHHhh----ccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccc---cCCCc-h Q lcl|NC_021324. 10 RLQGLLARDLPNL--LEAEAYRN----GTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFR---ISEDS-E 79 (480) Q Consensus 10 ~l~~~~~~~~~~~--~~~~~YY~----G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~---~~~d~-~ 79 (480) -+.++........ .....|.. +.... .+..+.+.. -.+ +.-...+|+..++-+-.-++. ..++. . T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~--~~~~vt~~~-al~--~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~ 75 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSD--SGQVVTPAS-ALA--LTVLQNCVTLLAESIAQLPIELYERSGEDRK 75 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCcc--CCcccchHH-hhc--cHHHHHHHHHHHHhhccCceEEEEecCCccc Confidence 0111100000000 00001111 11110 111111110 011 111233344333333222332 22211 1 Q ss_pred --hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceE Q lcl|NC_021324. 80 --GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 80 --~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 151 (480) ....+..++.. | ....+...++.+.+.+|.+|+++..+ ..|.+ .+..++|..+.+..+... .+ T Consensus 76 ~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~------~~G~~~~l~pl~~~~v~v~~~~~~--~~- 146 (419) T protein:vir:14 76 PATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRD------SDGVIQGLYPLDNEAVTVMRGSDL--KP- 146 (419) T ss_pred cccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEecCceEEEEECCCc--eE- Confidence 12235555542 3 23456677788999999999998653 34555 477788887776654321 11 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHH Q lcl|NC_021324. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) +|. + +... + +. .+ =|+|++++. ..+.+|.|.|.. +... T Consensus 147 ----~y~------------~-~~~~--------~-----~~-~~---------~i~h~~~~~-~dg~~G~s~i~~-~~~~ 184 (419) T protein:vir:14 147 ----VYR------------V-RGSD--------P-----MP-QR---------LVHHVRWMS-INGYTGLSPVLL-HANA 184 (419) T ss_pred ----EEE------------E-ccCc--------c-----cc-hh---------heeEecCcC-CCCcccccHHHH-HHHH Confidence 010 0 0000 0 00 00 134555443 345788887643 3333 Q ss_pred HHHHHHHHHHHHHHHHHhcchhhhhcCCC-ccccccccccch----hhh------hcchheecCCCCceEEEeccccHH- Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILGTPLRVISGVT-TDELTNDGENTT----LDI------YYGRILTLASEAAKISEFKAAELR- 299 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~~p~l~i~G~~-~~~~~~~~~~~~----~~~------~~~~~~~~~g~~~~~~~~~~~~~~- 299 (480) ++....+.....+.+...+.|-.+|.-.. .....+++.... |+. ..++++.+++ +.++.+++....+ T Consensus 185 i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~l~~~~~d~ 263 (419) T protein:vir:14 185 IGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQE-GMTFRPLSMTNVDA 263 (419) T ss_pred HHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCC-CceEEEccCChhhH Confidence 33322222222222333344544553111 100001111111 111 1134566653 4688777644322 Q ss_pred HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccee Q lcl|NC_021324. 300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTR 379 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 379 (480) .+++..+....+|+..-++|+..+|....+.-|+ ++.....+...+ -.-+-..++..+.. .+....... ... T Consensus 264 q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~--~E~~~~~f~~~~---L~P~~~~ie~~l~~--kll~~~~~~-~~~ 335 (419) T protein:vir:14 264 ALIDALRLSALDIARIYKIPAHMVNELERATFSN--IEHQSLQFVIYT---LLPWVKRHEQAKTR--DLLLPSERK-QYF 335 (419) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCccc--HHHHHHHHHHHH---HHHHHHHHHHHHhh--hccCccccC-CeE Confidence 3677777888999999999999997532221121 221111111111 00011111111110 111111111 123 Q ss_pred eeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 380 LETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 380 ~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +++.+......|..++++++.+++++| +++...+++.+|+.+-+- - +.+..... ......+. T Consensus 336 i~fd~~~l~r~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--G------------D~~~~~~n--~~~~~~~~ 397 (419) T protein:vir:14 336 IEYNLAGLLRGDQSSRYAAYAVGRQWG--WLSINDIRRLENMPPVKG--G------------DIYLSPMN--MVDASKPQ 397 (419) T ss_pred EEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--c------------Ceeeeccc--cccccccc Confidence 444445555678889999999988876 566666777777643220 0 00000000 00001110 Q ss_pred CCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 460 VTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 460 ~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+.++.+..+++.++...+ T Consensus 398 -~~~~~~~~~~~~~~~e~~~~ 417 (419) T protein:vir:14 398 -QLPVGKSEPTKAAIDEIGRI 417 (419) T ss_pred -cccCCCCCCccccccchhcc Confidence 11112222223333333333 No 254 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=94.85 E-value=0.0033 Score=34.15 Aligned_cols=385 Identities=14% Similarity=0.055 Sum_probs=148.4 Q ss_pred CCCHHHHHHHHHHHHH---HHHHHHHH-H----------HHHhhccCc--chh-------cCcccchhhhcceeccchHH Q lcl|NC_021324. 1 MTTYHEHVERLQGLLA---RDLPNLLE-A----------EAYRNGTRR--LKT-------IGIGAPPELAYLDVQPGWVA 57 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~---~~~~~~~~-~----------~~YY~G~~~--i~~-------~~~~~~~~~~~~~~~~n~~~ 57 (480) |- +...+...-. ...++.+- . -+.+.|-.+ +.. .+..+.+.. -+.+.-.. T Consensus 1 Mg----l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~---al~~~~V~ 73 (431) T protein:vir:10 1 MG----LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETR---ALRNMAVL 73 (431) T ss_pred Cc----chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhh---hhccHHHH Confidence 21 1111111000 00011100 0 001111000 000 011111100 00111122 Q ss_pred HHHHHHHhhhccCcccc---CCCc--hhHHHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCce Q lcl|NC_021324. 58 TYLRTLSDRLDIEGFRI---SEDS--EGLEELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI 127 (480) Q Consensus 58 ~iVd~~~~~l~~~~~~~---~~d~--~~~~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~ 127 (480) .+|+..++-+-.-++.+ .++. .....+..++.. |. -..+...+..+.+.+|.||+++..+ ...- T Consensus 74 ~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~------~g~~ 147 (431) T protein:vir:10 74 RCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWS------GNRP 147 (431) T ss_pred HHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEc------CCce Confidence 33333333222223322 1111 122345555532 32 3355677888999999999988652 1222 Q ss_pred eEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceE Q lcl|NC_021324. 128 PLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVV 207 (480) Q Consensus 128 ~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv 207 (480) ..+..++|..+.+..+... .+ .|+.. ...|.. ..+..+. |+ T Consensus 148 ~~L~pl~~~~v~~~~~~~~--~~----~y~~~-~~~g~~---~~~~~~d-----------------------------Vi 188 (431) T protein:vir:10 148 IRLIPMDRGSAKGRLTSTW--QI----VYDYT-TPTGDK---IELPARE-----------------------------VF 188 (431) T ss_pred EEEEEEcCceeEEEEcCCC--eE----EEEEE-eCCceE---EEEchhh-----------------------------EE Confidence 3566777877776654321 11 11111 111110 0111111 24 Q ss_pred EeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhhhcCC-Cccccccccccchhhh------hc Q lcl|NC_021324. 208 PLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVISGV-TTDELTNDGENTTLDI------YY 277 (480) Q Consensus 208 ~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i~G~-~~~~~~~~~~~~~~~~------~~ 277 (480) ||++. ...+.+|.|.|. .+.+.+.....-......+|. .|-.++.-- ...++..+.-...|.. .. T Consensus 189 Hir~~-~~dg~~G~spi~----~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~ls~e~~~~~~~~~~~~~~g~~n~ 263 (431) T protein:vir:10 189 HLRDL-SIDGVSGVSRVK----LSGNALELAEQAERAASRTFRTGVMAGGAIEVPKELSDNAYGRMKASVQENHTGSENA 263 (431) T ss_pred EecCc-CCCCcccccHHH----HHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCCCCHHHHHHHHHHHHHHhcCcccc Confidence 55543 234567877653 333444333333333333333 344444311 1111111111111211 12 Q ss_pred chheecCCCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 278 GRILTLASEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGG 356 (480) Q Consensus 278 ~~~~~~~g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~ 356 (480) ++++.++ ++.++.+++... -..+++..+....+|+..-++|+..+|....+ ++..++.....+...+ -.-+-. T Consensus 264 g~~~vl~-~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~--t~sn~eq~~~~f~~~t---L~P~~~ 337 (431) T protein:vir:10 264 GSWMLLE-EGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS--WGSGIEQLAIFFIQYG---LSHWFV 337 (431) T ss_pred CCceecC-CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC--ccccHHHHHHHHHHHH---HHHHHH Confidence 3455565 446888776432 22466777788899999999999999864322 1212222222222111 001111 Q ss_pred HHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccc--cCCCHHHHHHhCCCChhHHHHHHHHHH Q lcl|NC_021324. 357 AWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQ--GPIPKEQARIDLGYTATQREQMRDWDK 434 (480) Q Consensus 357 ~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~--~~~s~e~~~~~~~~~~d~~~~~~~~~~ 434 (480) .|+..+.. .+....... ...+++.+......|..++++++.+++..|. ++++...+++++|+.+-+-... T Consensus 338 ~ie~~ln~--~Ll~~~~~~-~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~g----- 409 (431) T protein:vir:10 338 SWEQAAAR--AFLPEKMLG-QRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPVA----- 409 (431) T ss_pred HHHHHHHh--hccChhhcC-CceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCccc----- Confidence 11111110 111111111 1234455555566788999999999887653 3455555666655532110000 Q ss_pred HHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccC Q lcl|NC_021324. 435 QETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKT 479 (480) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 479 (480) +.+....... +.+.++ ..| ++| T Consensus 410 -------D~~~~p~n~~----~~~~~~-------~~p-----~~~ 431 (431) T protein:vir:10 410 -------DQLRNPMTQK----QKGSGD-------EPP-----ATT 431 (431) T ss_pred -------cceecccccc----cCCCCC-------CCC-----CCC Confidence 0011111100 000000 001 111 No 255 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=94.02 E-value=0.0056 Score=32.91 Aligned_cols=312 Identities=12% Similarity=0.059 Sum_probs=113.4 Q ss_pred ceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccC Q lcl|NC_021324. 137 YMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLG 216 (480) Q Consensus 137 ~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~ 216 (480) =.-.+|...........+ .-. ......++.+...+....++..... ..+..+ +..-=++.+.+....+ T Consensus 1 v~Eivw~~~~g~~~~~~l---~~r-~~~~~~~f~~~~~~~l~~~~~~~~~------g~~~~~--lp~~kfi~~~~~~~~g 68 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKL---AWR-PPRTISRFDVAPDGGLVAIEQWGVF------GKATVR--IPVDRLVVFVNEREGA 68 (355) T ss_pred CeEEEEEeeCCeEEEeee---eec-CccceeeeeeccCCceeEEEecCCC------CCCcce--eccCCEEEEEeCCCCC Confidence 001234322211111111 000 1111111111122222222211100 001111 1111145556677788 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccc-----------cchhh------hhcch Q lcl|NC_021324. 217 NRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGE-----------NTTLD------IYYGR 279 (480) Q Consensus 217 ~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~-----------~~~~~------~~~~~ 279 (480) .++|.+-+..-.-+.+- =+..+.+++.-++.|+.|+.+.+|-......+... ..... .+... T Consensus 69 ~p~G~gLlr~~~w~~~f-K~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a 147 (355) T protein:vir:78 69 NWLGQSLLRQAYKNWLL-KDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAA 147 (355) T ss_pred CccchhhHHHHHHHHHH-HHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcce Confidence 89998876432222221 25667778888898988888877643322111110 00000 11111 Q ss_pred heec-CCCCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHH-HHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 280 ILTL-ASEAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIA-TDSRIVKMAERKGRIFGGA 357 (480) Q Consensus 280 ~~~~-~g~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~-~~~~l~~k~~~~~~~~~~~ 357 (480) ...+ .|...++.+.... ...+.+.++..-.+|+...-- +.+-.......+.-|+.. ...-....++.-.+.+... T Consensus 148 ~~iip~g~~ie~~ea~g~-~~~~~~~i~~~d~~Isk~iLG--qtlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ 224 (355) T protein:vir:78 148 GGYIPHGANFTLTGVQGK-LPEMDGPIRYHDEQIARAVLA--HFLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADV 224 (355) T ss_pred eEeecCCceEEEeecCCC-cccHHHHHHHHHHHHHHHHhh--hhhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 1122 2334455543322 222334444444455443210 011100000011112222 2222333334444555566 Q ss_pred HH-HHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH----HHHHHhCCCChhHHHHHHHH Q lcl|NC_021324. 358 WE-RAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPK----EQARIDLGYTATQREQMRDW 432 (480) Q Consensus 358 l~-~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~----e~~~~~~~~~~d~~~~~~~~ 432 (480) |. .+++-++.+...... ....++|.. .+.+..+.++.+.+++..|.. ++. +.+.+.+|+.+....+ +.. T Consensus 225 ln~~li~~l~~lN~~~~~---~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~-~~~~~~~~~~~e~~gip~p~~~~-~~~ 298 (355) T protein:vir:78 225 TQQHVVEDLVDQNWGPEE---PAPRLVPAQ-LGKEQPVTAEAIRALVECGAF-TADPELEKDLRARYGLPAPAERD-DGA 298 (355) T ss_pred HHHHHHHHHHHhcCCCCC---CCCEEEecC-cChhHHHHHHHHHHHHhCCCc-cccHHHHHHHHHHhCCCCCCCCC-ccc Confidence 64 466655655532221 224566743 445666788999999988754 342 3677888874321110 000 Q ss_pred HHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 433 DKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 433 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ................+........+..+| .....+.| T Consensus 299 ---------~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~-~a~~~~~~ 336 (355) T protein:vir:78 299 ---------DAAAAKAAGRRRAKRLPGQRQGAALPSRSP-RADPPRRR 336 (355) T ss_pred ---------CCccccccccccccccCCccccccccccCC-CCCChhhh Confidence 000000000000000000000001111111 11111111 No 256 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=93.64 E-value=0.0069 Score=32.45 Aligned_cols=374 Identities=10% Similarity=-0.006 Sum_probs=131.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCchhHHH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSEGLEE 83 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~~~~~ 83 (480) .-+++++...+.......... .... .... .... ....+...-...+|+..++-+-.-++.+- .++..... T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~-~~~~----~~~~-~~~~---~~~~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~~~~~~~ 71 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLT-DTVW----CSIP-SEKL---KELSIKKWAIDSCANKIANTLSCAEVLTYEKGEEVRKK 71 (395) T ss_pred CchHHHHHhhhcccccccccc-cchh----hccc-cccc---hhhhhhhHHHHHHHHHHHHHHhhCceeeccCCccccch Confidence 222333322222111110000 0000 0000 0000 01111111122233333332222244332 22222333 Q ss_pred HHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEE- Q lcl|NC_021324. 84 LWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLY- 157 (480) Q Consensus 84 l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~- 157 (480) +..++.. |. ...+...+..+.+.+|.||+++..+. +. .|............ .+++ T Consensus 72 ~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~---------~~----~~~~~~~~~~~~~~------~~~~~ 132 (395) T protein:vir:40 72 NWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEY---------IY----VADSFTKNDKSLYE------NTYTE 132 (395) T ss_pred HHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCc---------ee----ecCCcccccccccc------ceeee Confidence 4444432 32 34556677888999999998764321 11 01111000000000 0000 Q ss_pred EEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHH Q lcl|NC_021324. 158 TTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASR 237 (480) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~ 237 (480) ...+.. .+-..|.++. |+||+.+......++ ..+...+.. T Consensus 133 v~~~~~---~~~~~~~~~e-----------------------------vih~r~~~~~~~~~~--------~~l~~~~~~ 172 (395) T protein:vir:40 133 VTLKDL---TLKKEFKESE-----------------------------VLHLTLNNESIKSII--------DGFYLLYGD 172 (395) T ss_pred eeecCc---eeeeeecccc-----------------------------EEEeecCCCCccccc--------hhHHHHHHH Confidence 000000 0011122222 234432222222221 122233333 Q ss_pred HHHHHHHHHHHhcc--hhhhhcCC-Cccccccccccchhh-------hhcchheecCCCCceEEEecccc-HHHHHHHH- Q lcl|NC_021324. 238 TLMNLQSASQILGT--PLRVISGV-TTDELTNDGENTTLD-------IYYGRILTLASEAAKISEFKAAE-LRNFAEEM- 305 (480) Q Consensus 238 ~~s~~~~~~~~~~~--p~l~i~G~-~~~~~~~~~~~~~~~-------~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~l- 305 (480) ..+...+...+... +.+++... .+.+...+.....|+ ...+.++.++ ++.++.+++... ...+++.- T Consensus 173 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~-~g~~~~~l~~~~~d~q~~e~~~ 251 (395) T protein:vir:40 173 LLTAAVNKYKKLNSRKIIVKLKAMFGQTPEAEEKLRLMLSERMKKFLAEGDSALPVE-DGMEIDELAGDSKIAESRDIKK 251 (395) T ss_pred HHHHHHHHHHhcCCCCceEEEecccCCCHHHHHHHHHHHHHHHHHhhccCCceeecC-CCceEEeccCChhhhhHHHHHH Confidence 33333333332222 33333211 111111111111121 1223345554 446787776432 12244322 Q ss_pred --HHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEE Q lcl|NC_021324. 306 --EVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETV 383 (480) Q Consensus 306 --~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~ 383 (480) +..+.+||.+-++|+..+++...| .+...+.+....|.-.+...+..+.. .+...........+++. T Consensus 252 ~~~~~~~~Ia~~fgVPp~~l~~~~sn-~e~~~~~f~~~~L~P~~~~ie~~l~~----------kLl~~~~~~~g~~i~fd 320 (395) T protein:vir:40 252 MIDDVFEMVANSFNIPLGLAKGDTVG-LSEQVNSFLMFSINPIAEMFTDEGNR----------KFYGRDSVLERTYMKLD 320 (395) T ss_pred HHHHHHHHHHHHhCCCHHHhcCCCcC-HHHHHHHHHHHHHHHHHHHHHHHHHH----------hcCChhhhcCCceEEEe Confidence 344678999999999999765332 22222222211222211111111110 11121111112335555 Q ss_pred ecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCC Q lcl|NC_021324. 384 WRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTET 463 (480) Q Consensus 384 f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) +......+..+.++++.+++.+| +++...+++.+|+.+-+-... +.+.-.... .+ -....+. T Consensus 321 ~~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~g~~pi~~~~g------------D~~~~~~n~--~~--~~~~~~~ 382 (395) T protein:vir:40 321 TTRIKVQDIQEIASSMDVLFHIG--VNTIDDNLRMIGREPVMSPET------------QERFVTKNY--AP--LGENEED 382 (395) T ss_pred chhhhccCHHHHHHHHHHHHhCC--CCCHHHHHHHhCCCCCCCCCC------------ceeeecccc--cc--ccccccc Confidence 55666778899999999998876 667777777777653210000 000000000 00 0000111 Q ss_pred CccCCCCCCCCCc Q lcl|NC_021324. 464 KTETQTSPSGFNR 476 (480) Q Consensus 464 ~~~~~~~~~~~~~ 476 (480) ..++++.++..|. T Consensus 383 ~kgge~~~~~~~~ 395 (395) T protein:vir:40 383 LKGGDINENKGDS 395 (395) T ss_pred cCCCCCCCCcCCC Confidence 1111111111111 No 257 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=93.54 E-value=0.0072 Score=32.32 Aligned_cols=353 Identities=13% Similarity=0.051 Sum_probs=126.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccC-CCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS-EDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~-~d~~ 79 (480) |- +..+|..+ ......-+.+. ....+.. ...+...-...+|+..++-+-.-++.+- .+.. T Consensus 1 Mg----~f~~l~~~-------~~~~~~~~~~~-----~~~~~~~---~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~ 61 (376) T protein:vir:78 1 MG----FFSELFKR-------NKEIEWMWDLD-----FLEDKTT---KVYLKKMALNTCVKHIARTIAKSDFRLKNGETS 61 (376) T ss_pred Cc----hhhhhhcc-------CCccccccchh-----hccccch---hhhhhhHHHHHHHHHHHHhhcccceeecccccc Confidence 22 22222211 00000000000 0000000 0011112233344444433333344332 1222 Q ss_pred hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 80 GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 80 ~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) ....+..++.. | ....+...+..+.+.+|.||+++..+. .|.+. ..+|+..... ... T Consensus 62 ~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~------~~~~~-------~~~~~~~~~~-----~~~ 123 (376) T protein:vir:78 62 VRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTD------DFLIA-------DSYVRKEFAF-----FPD 123 (376) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCC------Ceeec-------cceeecccce-----eee Confidence 22334444432 3 244667778888999999998875432 22221 1112111000 000 Q ss_pred EEEE-EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 155 RLYT-TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 155 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) .++. ...+.+ +...|..+.+++ |+.+... +...+..+.. T Consensus 124 ~~~~~~~~~~~---~~~~~~~~evih-----------------------------~~~~~~~--------~~~~~~~~~~ 163 (376) T protein:vir:78 124 VFEGVTVKDYR---YNRNFSMDDVIF-----------------------------LEYGNER--------LSAFTDGMFE 163 (376) T ss_pred eeeeeeeecce---eeeeeccccEEE-----------------------------eccCCCC--------chhhhhHHHH Confidence 0100 000100 011122233333 3221111 1111223333 Q ss_pred HHHHHHHHHHHHHHHhc--chhhhhc-CCCccccccccccchhhh-------hcchheecCCCCceEEEecccc------ Q lcl|NC_021324. 234 AASRTLMNLQSASQILG--TPLRVIS-GVTTDELTNDGENTTLDI-------YYGRILTLASEAAKISEFKAAE------ 297 (480) Q Consensus 234 ~~~~~~s~~~~~~~~~~--~p~l~i~-G~~~~~~~~~~~~~~~~~-------~~~~~~~~~g~~~~~~~~~~~~------ 297 (480) .+..+.........+.. -+.+++. +....+...+.-...|.. ....++.++ ++.++..++... T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~-~g~~~~~l~~~~~~~~~~ 242 (376) T protein:vir:78 164 DYGELFGKMIRAQMRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQL-EGFNYEEFGTTSVNNSQS 242 (376) T ss_pred HHHHHHHHHHHHHHhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcC-CCceEEeeccCccccchh Confidence 44444333333322211 1222221 111111111101111111 112244454 345666654322 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 377 (480) ...+++..+....+|+..-++|+..+|++..+ .+...+.+....|.-.+...+..+... +.... + T Consensus 243 ~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~-~e~~~~~f~~~~l~P~~~~ie~~l~~k----------ll~~~---~- 307 (376) T protein:vir:78 243 FDEVKKLRKEMIDYVASILGIPSSLLHGDMAD-LSNNMKAYMEYCIDPLTKKLEDELNAK----------LFTFS---E- 307 (376) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHhh----------hCCcc---c- Confidence 12477888888999999999999999865322 232222222222222222222111111 11110 1 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ..++..+......|..+.++++.+++.+| +++...+++.+|+.+-+-... +...-. .+. T Consensus 308 ~~~~~~~~~ll~~d~~~~~~~~~~~~~~G--~~t~NE~R~~lg~~p~~~g~~------------d~~~~~--~n~----- 366 (376) T protein:vir:78 308 FLAGEHIKIIHKKDIIENAEAVDKLVASG--SFNRNEVRELLGAERVDNPEL------------DKYLIT--KNY----- 366 (376) T ss_pred ceecccchhhcccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCC------------ceeeec--cCc----- Confidence 11222222234467888899998888876 555555665555532110000 000000 000 Q ss_pred CCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFNR 476 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~~ 476 (480) .+.+ ..+.|| T Consensus 367 ~~~~---------~~~e~g 376 (376) T protein:vir:78 367 QSAD---------EGGEDG 376 (376) T ss_pred eehh---------ccccCC Confidence 0000 001111 No 258 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=93.50 E-value=0.0073 Score=32.29 Aligned_cols=401 Identities=11% Similarity=-0.012 Sum_probs=134.1 Q ss_pred CC----CHHHHHHHHHHHHHHHHHHHHHHHHH-----hhccC-----cchhcCcccchhhhcceeccchHHHHHHHHHhh Q lcl|NC_021324. 1 MT----TYHEHVERLQGLLARDLPNLLEAEAY-----RNGTR-----RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~----~~~~~i~~l~~~~~~~~~~~~~~~~Y-----Y~G~~-----~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 66 (480) |. -+++++..-...-....+.+...... |.|-- +|...... -.-+..+.. .....-.+++.... T Consensus 1 m~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~-~~ly~~m~~-D~hi~s~l~~Rk~a 78 (448) T protein:vir:79 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDG-LLVYHKMLS-DGTVKNALNYIFGR 78 (448) T ss_pred CCCCCCCCccccCcccccccccchhhhhhhhhhcccccccccccchhHhhccccc-hHHHHHHhh-ChHHHHHHHHHHHH Confidence 11 11111000000000000000000000 11100 00000000 001111111 23333344444444 Q ss_pred hccCccccC--CCch----hHHHHHHHHHh-------cCHHHHHHHHHHHHhhcCeeE-EEEecCccccCCCCceeEEEE Q lcl|NC_021324. 67 LDIEGFRIS--EDSE----GLEELWNWWQA-------NDLDEESVLGHDDSLTFGRAY-ITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 67 l~~~~~~~~--~d~~----~~~~l~~~~~~-------n~~~~~~~~~~~~~~~~G~a~-~~v~~~~~~~~d~~g~~~~~~ 132 (480) +....|.+. +++. ..+.+.+.+.. ..|..++.+ ..++..+|.++ +++|.. ..+|+..+.. T Consensus 79 v~~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~-~lda~~~G~s~~Eivw~~-----~~~g~~~~~~ 152 (448) T protein:vir:79 79 IRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTL-----GADGKLILDK 152 (448) T ss_pred HhcCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHH-HHHhhhhcceeEEEEeee-----cCCCceeccc Confidence 445556542 2221 22333333321 235555554 46788899884 455531 1234433222 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecc Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~ 212 (480) +.+. .+.. +.++ . .+.++ + ..+....+.........+..+-++++ ++++.+ T Consensus 153 l~~r------~~~~---~~~f--~-~~~d~-~-------------l~~~~~~~~~~~~~~~~~~~~lP~~~--~i~~~~- 203 (448) T protein:vir:79 153 IVPI------HPFN---IDEV--L-YDEEG-G-------------PKALKLSGEVKGGSQFVSGLEIPIWK--TVVFLH- 203 (448) T ss_pred cccc------CCcc---ccce--e-eecCC-c-------------eEEeecCCcccccccCCCccccccce--EEEEec- Confidence 2110 0000 0000 0 01111 0 11111111100111111222223443 345544 Q ss_pred cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhh----h--hcchheec-CC Q lcl|NC_021324. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLD----I--YYGRILTL-AS 285 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~----~--~~~~~~~~-~g 285 (480) ...+.++|.+-+.. +--..=-=+..+.+++.-++.|+.|+++.+.-......++....... + +......+ .| T Consensus 204 ~~~g~p~g~gLlr~-~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~ 282 (448) T protein:vir:79 204 NDDGSFTGQSALRA-AVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDD 282 (448) T ss_pred CccCCcccchhHHH-HHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHhcCCceEEEecCC Confidence 45567888876643 22212222667788888899999999877632211111111111111 1 12222222 33 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_021324. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE-RAMRI 364 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~l 364 (480) ...++.+..... ..+.+.++..-.+|+...- -+.+.....+.++..++.....-.......-.+.+...+. .++.- T Consensus 283 ~~ie~~ea~~~~-~~~~~~i~~~d~~Isk~iL--GqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~ 359 (448) T protein:vir:79 283 WKFDTVDLKSAM-PDAIPYLTYHDAGIARALG--IDFNTVQLNMGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPK 359 (448) T ss_pred ceEEEEecCCCc-ccHHHHHHHHHHHHHHHHh--hhhhccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 445555543322 2233444444445544321 0111111111111112221111111122223344455554 35554 Q ss_pred HHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 365 AMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTL 444 (480) Q Consensus 365 i~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 444 (480) ++.+.-+... .-..+.|....+.++.+.|+.+.+++..+ ........+.++.. T Consensus 360 l~~lNfg~~~---~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~--~~~~~~~~~~~~~p---------------------- 412 (448) T protein:vir:79 360 LVLPNWPSAT---RFPRLTFEMEERNDFSAAANLMGMLINAV--KDSEDIPTELKALI---------------------- 412 (448) T ss_pred HHHhcCCCcC---CCcEEEecCCChHHHHHHHHHhhhhhccc--hhhHHHHHHhhcCC---------------------- Confidence 4545422211 12467787666777777777666554321 11111111111110 Q ss_pred hhhcccCCCcCCCCCC---CCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 445 YSTTKAQADATPKPTV---TETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 445 ~~~~~~~~~~~~~~~~---~d~~~~~~~~~~~~~~~~~~ 480 (480) .+.+.+.+.+ .++..+. ....+.+| T Consensus 413 ------~~~~~~~~~a~~~~~~~~~~-----~~~~~~~~ 440 (448) T protein:vir:79 413 ------DALPSKMRRALGVVDEVREA-----VRQPADSR 440 (448) T ss_pred ------CCCCCccccccCCCCccccc-----ccCCcccc Confidence 0111111111 0111111 11223344 No 259 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=92.91 E-value=0.0095 Score=31.67 Aligned_cols=383 Identities=13% Similarity=0.031 Sum_probs=148.8 Q ss_pred HHHHHHH-HHH----HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCCc--h Q lcl|NC_021324. 10 RLQGLLA-RDL----PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDS--E 79 (480) Q Consensus 10 ~l~~~~~-~~~----~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~--~ 79 (480) -..++.. .+. +.-.-.....-|...- ..+..+.+.. -.+ +.-...+|+..++-+-.-++.+ ..+. . T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~g~~~s-~~~~~v~~~~-al~--~~~v~~cv~~ia~~ia~lp~~~~~~~~~~~~~ 76 (419) T protein:vir:80 1 MFFSRQLLSNLGQTQPGSGGWVSALLGSARS-EAGQVVTPAS-ALS--LTVLQNCVTLLAESIAQLPVELYERSGDDRKP 76 (419) T ss_pred CCcccccccccCcCCCCcchhhHHhhccccc-ccCcccChHH-hhc--cHHHHHHHHHHHHhhccCceEEEEecCCCccc Confidence 0000100 000 0000000011111000 0111111110 011 1112223333333332224322 1111 1 Q ss_pred -hHHHHHHHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 80 -GLEELWNWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 80 -~~~~l~~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) ....+..++.. | .-..+...+..+.+.+|.||+++..+ ..|++ .+..++|..+.+..+... .+ T Consensus 77 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~------~~G~~~~L~~i~~~~v~i~~~~~~--~~-- 146 (419) T protein:vir:80 77 ATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRD------QDGVIQGLYPLDNEAVTVMKGPDL--KP-- 146 (419) T ss_pred ccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEC------CCCcEEEEEEecCceEEEEECCCc--eE-- Confidence 12235555432 3 23466677888999999999988653 34665 467788887766554321 10 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) .| ...+.. .+. . . =|+|++++. ..+.+|.|.|.. ....+ T Consensus 147 ---~y-----------------------~~~~~~---~~~-~-------~--~i~h~~~~~-~d~~~G~s~i~~-~~~~i 185 (419) T protein:vir:80 147 ---MY-----------------------RVAGAD---PLP-Q-------R--LVHHVRWMS-INGYTGLSPVLL-HANAI 185 (419) T ss_pred ---EE-----------------------EEcCcc---ccc-h-------h--heEEecCCC-CCCcccccHHHH-HHHHH Confidence 01 000000 000 0 0 135555543 345788886642 33333 Q ss_pred HHHHHHHHHHHHHHHHhcchhhhhcCC-Cccccccccccc----hhh------hhcchheecCCCCceEEEeccccHH-H Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGTPLRVISGV-TTDELTNDGENT----TLD------IYYGRILTLASEAAKISEFKAAELR-N 300 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~p~l~i~G~-~~~~~~~~~~~~----~~~------~~~~~~~~~~g~~~~~~~~~~~~~~-~ 300 (480) +....+.....+.+...+.|-.+++-. +.....+++... .|. ...++++.++ ++.++.++.....+ . T Consensus 186 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~-~g~~~~~l~~s~~d~q 264 (419) T protein:vir:80 186 GHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQ-EGMKFKPLSMTNVDAA 264 (419) T ss_pred HHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecC-CCceEEeccCChhhHH Confidence 322222222222223333454444310 111111111111 111 0123456665 44688777643322 3 Q ss_pred HHHHHHHHHHHHHhhcCCChHHhccccccchHH-HH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021324. 301 FAEEMEVFRKEAASITGLPPQYLSSSSENPASA-EA--IIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 301 ~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg-~A--l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 377 (480) +++..+....+|+..-++|+..+|....+.-|+ .. +.+....|.-.+... ...|.+ .+....... . T Consensus 265 ~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~~i----e~~l~~------kll~~~~~~-~ 333 (419) T protein:vir:80 265 LIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVKRH----EQAKTR------DLLLPSERK-Q 333 (419) T ss_pred HHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHHHH----HHHHhh------hccCccccC-C Confidence 677778888999999999999997532211111 11 111111111111111 111110 111111111 1 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ..+++.+......|..++++.+.+++..| +++...+++.+|+.+-+- -. .+...... ..... T Consensus 334 ~~i~fd~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--GD------------~~~~~~n~--~~~~~ 395 (419) T protein:vir:80 334 YFIEYNLAGLLRGDQSSRYAAYAVGRQWG--WLSINDIRRLENMPPVKG--GD------------IYLSPMNM--VDASK 395 (419) T ss_pred eEEEEechhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--cc------------eeeecccc--ccccc Confidence 22444445556678889999998888765 566666777777643210 00 00000000 00000 Q ss_pred CCCCCCCccCCCCCCCCC-cccCC Q lcl|NC_021324. 458 PTVTETKTETQTSPSGFN-RTKTR 480 (480) Q Consensus 458 ~~~~d~~~~~~~~~~~~~-~~~~~ 480 (480) +... +.+ +..|+... ..--| T Consensus 396 ~~~~-~~~--~~~~~~~~~~~~~~ 416 (419) T protein:vir:80 396 PQPI-PMG--KTEPTKAALDEIGR 416 (419) T ss_pred cccc-cCC--CCCchhhhHHHHHh Confidence 0000 000 00010000 00011 No 260 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=91.76 E-value=0.014 Score=30.69 Aligned_cols=409 Identities=12% Similarity=0.012 Sum_probs=139.7 Q ss_pred CCCH----HHHHHHHHHHHHHHHHHHHHHHHH-----hhccC-----cchhcCcccchhhhcceeccchHHHHHHHHHhh Q lcl|NC_021324. 1 MTTY----HEHVERLQGLLARDLPNLLEAEAY-----RNGTR-----RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDR 66 (480) Q Consensus 1 ~~~~----~~~i~~l~~~~~~~~~~~~~~~~Y-----Y~G~~-----~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~ 66 (480) |+.- ++.+..-........+++...... |.|-- +|...... -.-+..+.. .....-++++-... T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr~~~~-~~ly~~m~~-D~hi~s~l~~Rk~a 78 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQGKDG-LLVYHKMLS-DGTVKNALNYIFGR 78 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhccccc-hHHHHHHhh-ChHHHHHHHHHHHH Confidence 1111 000000000000000000000000 11100 00000000 001111111 23333333333333 Q ss_pred hccCccccC--CCc----hhHHHHHHHHHh-------cCHHHHHHHHHHHHhhcCee-EEEEecCccccCCCCceeEEEE Q lcl|NC_021324. 67 LDIEGFRIS--EDS----EGLEELWNWWQA-------NDLDEESVLGHDDSLTFGRA-YITVSHPDVESGDPAGIPLIRV 132 (480) Q Consensus 67 l~~~~~~~~--~d~----~~~~~l~~~~~~-------n~~~~~~~~~~~~~~~~G~a-~~~v~~~~~~~~d~~g~~~~~~ 132 (480) +....|++. +++ +..+.+.+.+.. ..|+.++..+ .++..+|.+ ++++|.. ..+|...+.. T Consensus 79 v~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~-----~~dg~~~~~~ 152 (448) T protein:vir:77 79 IRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTL-----GADGKLILDK 152 (448) T ss_pred HhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEee-----cCCCceeecc Confidence 344455542 222 222334444432 2566777665 689999988 4556631 1244443322 Q ss_pred EcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecc Q lcl|NC_021324. 133 ESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTND 212 (480) Q Consensus 133 ~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~ 212 (480) +.+. .+.. ++++. |.++.-..++...+.........+..+-+++++ +++.+ T Consensus 153 l~~r------~~~~-------~~~f~-------------~~~~~~l~~~~~~~~~~~~~~~~~~~~lP~~~~--i~~~~- 203 (448) T protein:vir:77 153 IVPI------HPFN-------IDEVL-------------YDEEGGPKALKLSGEVKGGSQFVNGLEIPIWKT--VVFLH- 203 (448) T ss_pred cccc------CCCc-------cceee-------------eecCCceEEEecCCcccccccCCCccccccceE--EEEec- Confidence 2211 0000 00110 111111111111111001111112223344444 34443 Q ss_pred cccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhh----h--hcchheec-CC Q lcl|NC_021324. 213 PRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLD----I--YYGRILTL-AS 285 (480) Q Consensus 213 ~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~----~--~~~~~~~~-~g 285 (480) ...+.++|.+-+..-.-+.+ -=+..+.+++.-++.|+.|+++.+.-......++....... + +......+ .| T Consensus 204 ~~~g~p~g~gLlr~~~w~~~-fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~g 282 (448) T protein:vir:77 204 NDDGSFTGQSALRAAVPHWL-AKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQKPRHGIILPDD 282 (448) T ss_pred CCcCCcccchHHHHHHHHHH-HHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcCCceEEEecCC Confidence 34567888876643222121 12566778888899999999887732211111111111111 1 12222222 34 Q ss_pred CCceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_021324. 286 EAAKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWE-RAMRI 364 (480) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~l 364 (480) .+.++.+..... ..+.+.++..-.+|+...-- +.+.....+.+++.+......-.......-.+.+...+. +++.- T Consensus 283 ~~ie~~ea~~~~-~~~~~~i~~~d~~Isk~iLG--qtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~Li~~ 359 (448) T protein:vir:77 283 WKFDTVDLKSAM-PDAIPYLTYHDAGIARALGI--DFNTVQLNMGVQAVNIGEFVSLTQQTIISLQREFASAVNLYLIPK 359 (448) T ss_pred ceEEEEecCCCc-cCHHHHHHHHHHHHHHHHhc--cccccccccchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 445565543322 22334444444455443210 011111111111222221111111112223334444453 35554 Q ss_pred HHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 365 AMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTL 444 (480) Q Consensus 365 i~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 444 (480) ++.++.+... .-..+.|....+.++.+.++.+.+++.. ..+.+++.... .. . T Consensus 360 l~~lNfg~~~---~~P~~~f~~~e~eDl~~~a~~~~~l~~~---------~~~~~~ip~~~-~~---------------~ 411 (448) T protein:vir:77 360 LVLPNWPGAT---RFPRLTFEMEERNDFSAAANLMGMLINA---------VKDSEDIPTEL-KA---------------L 411 (448) T ss_pred HHHhcCCCCC---CCCEEEecCCChhhHHHHHHHhHHHHHH---------HHHHhcCCccC-Cc---------------C Confidence 4555432221 1246778777778888888777666421 22333322110 00 0 Q ss_pred hhhcccCCCcCCCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 445 YSTTKAQADATPKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 445 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ..+.. ..+.+.+..+++..+....|.+---..+| T Consensus 412 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 445 (448) T protein:vir:77 412 IDALP--SKMRRALGVVDEVREAVRQPADSRYLYTR 445 (448) T ss_pred CCCCc--hhcccccCCCCCCCchhhcchhhHHHHhh Confidence 00000 00111111112222222222221122222 No 261 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=91.03 E-value=0.018 Score=30.18 Aligned_cols=203 Identities=9% Similarity=0.024 Sum_probs=76.9 Q ss_pred EEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHH Q lcl|NC_021324. 154 VRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTD 233 (480) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D 233 (480) +|.- .+ +..++. +.... + ...+ +.....-.. |+||++-....+.+|.|.|.. ... T Consensus 1 ~r~~---~d-g~~~y~--~~~~~---~--~~~g--------~~~~~~~~e--ilH~r~~~~~~~~~Glspi~~----a~~ 55 (219) T protein:vir:98 1 MRVC---KD-GNYKYL--MKKSL---Y--DTKS--------EIYEYNKND--VIFIKLYDPMQQVYGSPDYVG----GIT 55 (219) T ss_pred Ccee---ec-CeEEEE--Eecce---e--cCCc--------eeEEecccc--EEEecCCCCCCCcceecHHHH----HHH Confidence 1110 11 111111 00000 0 0000 000001111 356665333455678886543 333 Q ss_pred HHHHHHHHHHHH---HHHhcchhhhh--cCCCccccccccccchhhhh----c-chheec-CC---CCceEEEeccc--c Q lcl|NC_021324. 234 AASRTLMNLQSA---SQILGTPLRVI--SGVTTDELTNDGENTTLDIY----Y-GRILTL-AS---EAAKISEFKAA--E 297 (480) Q Consensus 234 ~~~~~~s~~~~~---~~~~~~p~l~i--~G~~~~~~~~~~~~~~~~~~----~-~~~~~~-~g---~~~~~~~~~~~--~ 297 (480) ++....+-..-. +...+.|-.+| +|..+.+...+.-...|+.. . ..++.+ +| ++.++..+... . T Consensus 56 ~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~~~~~~d 135 (219) T protein:vir:98 56 SALLNSDATIFRRRYYSNGAHMGFILYSTDPDMTEEMEDEIAERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGQK 135 (219) T ss_pred HHHHHHHHHHHHHHHHhcCCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCcccccceeEecCCCCccceeEEEccCCHHH Confidence 333332222222 23334454444 33222221111111112111 1 112222 22 34677666532 3 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 377 (480) . .|++.-+....+|+..-++|+..+|....+.+++..+...... .+...|.-.+..+....+..... . T Consensus 136 ~-qfle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~----------f~~~tL~P~~~~ie~~ln~~~~~-~ 203 (219) T protein:vir:98 136 D-EFANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREA----------YQADEVLPLQEIIAESINSDYEI-K 203 (219) T ss_pred H-HHHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHH----------HHHHHHHHHHHHHHHHhhhhhcC-C Confidence 4 3888888889999999999999988543222222112211111 11111221111111111110000 1 Q ss_pred eeeeEEecCCCCCCHH Q lcl|NC_021324. 378 TRLETVWRDPSTPTVA 393 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~ 393 (480) ..+++.|..+.+.++. T Consensus 204 ~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 204 SALKVNFKQPEKRDKN 219 (219) T ss_pred CccEEeecCcccccCC Confidence 2356788777766554 No 262 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=90.92 E-value=0.018 Score=30.10 Aligned_cols=417 Identities=11% Similarity=-0.016 Sum_probs=151.1 Q ss_pred CCCHHHHHHHHHHHHHH--HHHHHHHHHHHhhccCcchhcCc----ccchhhhcceeccchHHHHHHHHHhhhc----c- Q lcl|NC_021324. 1 MTTYHEHVERLQGLLAR--DLPNLLEAEAYRNGTRRLKTIGI----GAPPELAYLDVQPGWVATYLRTLSDRLD----I- 69 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~--~~~~~~~~~~YY~G~~~i~~~~~----~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~- 69 (480) |-. -...|..+..+ ...+.+.+.+|. +|.+.. ......+..+...+-+...++++++.|. + T Consensus 1 m~~---~~~~l~~k~~R~~~e~~w~e~a~~~-----lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp 72 (514) T protein:vir:80 1 MRQ---QASAMWAEYRDSTAIRKAEDFAKFT-----IASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLALTLFPP 72 (514) T ss_pred Ccc---chHHHHHHhhcchHHHHHHHHHHHh-----cccccCCCCCCcccccccccccchhHHHHHHHHHHHHHhhhcCC Confidence 322 22333333321 234455555554 222111 0111111112233445666666666553 2 Q ss_pred -Cccc-cC-CCc-------------hhH-------HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCc Q lcl|NC_021324. 70 -EGFR-IS-EDS-------------EGL-------EELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG 126 (480) Q Consensus 70 -~~~~-~~-~d~-------------~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g 126 (480) .+|+ .. +|+ +.. ..+...+..++|.....++.++...+|.|.+++-.+ T Consensus 73 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~--------- 143 (514) T protein:vir:80 73 GRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPG--------- 143 (514) T ss_pred CCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecC--------- Confidence 1232 11 110 011 124445667899999999999999999987766321 Q ss_pred eeEEEEEcccceEEEEeCCCCCceEEEEEEEE-EecC---------------CceEEEEEEEe-----CCc----EEEEE Q lcl|NC_021324. 127 IPLIRVESPLYMYAELDPRNTRRVTRAVRLYT-TRDD---------------VAVPDRATLYL-----PDE----TVPLR 181 (480) Q Consensus 127 ~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~-~~~~---------------~~~~~~~~~~~-----~~~----~~~~~ 181 (480) .-.++.++-.+.++.-|. .+ .+...++.+. .... .+....+++|+ +++ +..|. T Consensus 144 ~~~~~~~pl~~y~v~~d~-~G-~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~ 221 (514) T protein:vir:80 144 TGKMLVWTMQSYTVRRTS-HG-DPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWH 221 (514) T ss_pred CCcEEEEEcCeEEEeeCC-Cc-CeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEE Confidence 123555655554433333 33 3333333221 1000 01111223332 111 11111 Q ss_pred ecCCCcccccccccccccc--ccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc-- Q lcl|NC_021324. 182 RNGGLNDQWVVDGDVIKHG--LGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS-- 257 (480) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~--~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~-- 257 (480) ...+. .. ....+ +..+|++.++-+...++.||+|-.. +..+=+..+|.+.-...........|.+.+. T Consensus 222 e~~g~-----~i--~~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~-~al~D~k~L~~l~~~~l~~~~~a~~~~~~v~~~ 293 (514) T protein:vir:80 222 ELEGK-----RV--GPESSYPAHLCPYVPVAWNVPDGEHYGRGYVE-EYSGDFARLSILSERLGLYEFEALSLLNLVDEA 293 (514) T ss_pred eccce-----ee--cccCccccccCCeeeeeeEecCCCCcccchHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCceeCcc Confidence 11100 00 11223 3458998888888889999999654 3445556666665555555555555554331 Q ss_pred CCCccccccccccchhhhhcchheecCCCCceEEEec-cccHHH---HHHHHHHHHHHHHhhcCCChHHhccccccchHH Q lcl|NC_021324. 258 GVTTDELTNDGENTTLDIYYGRILTLASEAAKISEFK-AAELRN---FAEEMEVFRKEAASITGLPPQYLSSSSENPASA 333 (480) Q Consensus 258 G~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~~~~~---~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg 333 (480) |.... ........|.+..-..++....++. .++++. .++.++.-|.+.+.... .. .+..+ -++ T Consensus 294 g~~~~-------~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~~----~~-rd~~r-vTA 360 (514) T protein:vir:80 294 KGGAV-------DDYRDAETGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYTG----QV-RDAER-VTV 360 (514) T ss_pred cccch-------hhhcccCCceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhc----cC-CCCCC-CCH Confidence 11100 0011111122222112223333433 234443 34444444433332211 01 11111 134 Q ss_pred HHHHHHHHHHHHH----HHH-HHHHHHHHHHHHHHHHHHHhCCC---CcccceeeeEEecCCC-CCCHHHHHHHHH---H Q lcl|NC_021324. 334 EAIIATDSRIVKM----AER-KGRIFGGAWERAMRIAMQIMGRE---VTEEYTRLETVWRDPS-TPTVAAKADAVS---K 401 (480) Q Consensus 334 ~Al~~~~~~l~~k----~~~-~~~~~~~~l~~~~~li~~~~~~~---~~~~~~~~~v~f~~~~-~~~~~~~ad~~~---k 401 (480) +.++.....+... ..+ ....+.+-+.+.+.++.+...+. .+.+. +++.+.-++ .-.....++.+. + T Consensus 361 tEV~~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l--~~~~~vs~la~l~r~~~~~~l~~~~~ 438 (514) T protein:vir:80 361 EEIRTVAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGV--YRPSIITGIPALTRNIETANILRATQ 438 (514) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchh--hcceeeecHHHHHHHHHHHHHHHHHH Confidence 4444332222221 111 11222223333344333221111 12222 333332221 111111222222 2 Q ss_pred HHhcccc-------CCCHHHH----HHhCCCChh-------HHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCC Q lcl|NC_021324. 402 LYANGQG-------PIPKEQA----RIDLGYTAT-------QREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTET 463 (480) Q Consensus 402 l~~~~~~-------~~s~e~~----~~~~~~~~d-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 463 (480) .++.... .+....+ ...+|.... +.+..++..++++++.+........++.+.+-- + T Consensus 439 ~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~ 513 (514) T protein:vir:80 439 EASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAGVL-----T 513 (514) T ss_pred HHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc-----C Confidence 2221111 1222222 233454322 221111111111111111110000000000000 0 Q ss_pred C Q lcl|NC_021324. 464 K 464 (480) Q Consensus 464 ~ 464 (480) + T Consensus 514 ~ 514 (514) T protein:vir:80 514 S 514 (514) T ss_pred C Confidence 0 No 263 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=90.64 E-value=0.02 Score=29.92 Aligned_cols=350 Identities=13% Similarity=0.037 Sum_probs=125.0 Q ss_pred HHHHHHHHHHhhccCcchhcCcccchhhhccee--ccchHHHHHHHHHhhhccCcccc---C-C-------CchhHHHHH Q lcl|NC_021324. 19 LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDV--QPGWVATYLRTLSDRLDIEGFRI---S-E-------DSEGLEELW 85 (480) Q Consensus 19 ~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~--~~n~~~~iVd~~~~~l~~~~~~~---~-~-------d~~~~~~l~ 85 (480) +.-+.++..+-.+.-. -...... .+....+ ....+..+|+..++-+-.-++.+ . + .+.....+. T Consensus 1 Mg~f~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLN-NDTQRVT--AWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred Cccchhhhhhhccccc-CCcceee--ecccchhhHHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccchHH Confidence 1111222222111100 0000000 0000000 01122233333333222223321 1 0 011234566 Q ss_pred HHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 86 NWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 86 ~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .++.. | ....+...+....+.+|.||++...+ +..|++.- +| . T Consensus 78 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d-----~~~g~~~~----------l~-----------------~ 125 (378) T protein:vir:16 78 EVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFD-----DNTGELLD----------LL-----------------F 125 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-----cCCceEEE----------EE-----------------e Confidence 66642 3 34566777888999999999864321 11122210 00 0 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHH Q lcl|NC_021324. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s 240 (480) .+.+ ..|..+.+ +||++. .....|.| .+..+.++++..++ T Consensus 126 ~~~~-----~~~~~~di-----------------------------ih~r~~--~~~~~~~s----~l~~~~~~i~~~~~ 165 (378) T protein:vir:16 126 ADDK-----KEYKPEEL-----------------------------VRLTSP--FYINEDTS----ILDNALASIQTKLE 165 (378) T ss_pred cCCe-----eEecccce-----------------------------EEecCc--cCccchhH----HHHHHHHHHHHHHh Confidence 0110 01122233 333321 01111222 13334444433222 Q ss_pred HHHHHHHHhcchhhhhcC-CCccccccccccchhhh---------hcchheecCCCCceEEEeccccHHHHHHHHHHHHH Q lcl|NC_021324. 241 NLQSASQILGTPLRVISG-VTTDELTNDGENTTLDI---------YYGRILTLASEAAKISEFKAAELRNFAEEMEVFRK 310 (480) Q Consensus 241 ~~~~~~~~~~~p~l~i~G-~~~~~~~~~~~~~~~~~---------~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~ 310 (480) .+.+-.++.- ....+...+.....+.. ..++++.++ ++.+|.+++......-+..++.+.. T Consensus 166 --------~~~~~g~l~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~~~~~~~~~~~~~ 236 (378) T protein:vir:16 166 --------QGKLRGLLKINAFLDIDNTQEYREKALTTIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNKDEIDLIKS 236 (378) T ss_pred --------cCccceeeEeCCcCCHHHHHHHHHHHHHHHHHhhcccccccceEcC-CCceEEEccCChhhhhHHHHHHHHH Confidence 1122222211 11111111111111111 123456665 4567877764332222345577788 Q ss_pred HHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCC Q lcl|NC_021324. 311 EAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTP 390 (480) Q Consensus 311 ~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~ 390 (480) +|+..-++|+..+++.. +....+.+....|.-.+...+..+...|-.--. +..+.. ......+++.+...... T Consensus 237 ~Ia~~fgVPp~~l~g~~---~e~~~~~f~~~tl~P~~~~ie~~l~~kLl~~~e---~~~~~~-~~~~~~~~f~~~~l~~~ 309 (378) T protein:vir:16 237 ELLTGYFMNENILLGTA---SQEQQIYFYNSTIIPLLIQLEKELTYKLISTNR---RRVVKG-NLYYERIIVDNQLFKFA 309 (378) T ss_pred HHHHHhCCCHHHhcCCc---hHHHHHHHHHHHHHHHHHHHHHHHHhhcCChhh---hhhhhh-cccccceeeccchhhhc Confidence 99999999998886532 111112221112222222222111111100000 000000 00112344555566677 Q ss_pred CHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCcc Q lcl|NC_021324. 391 TVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) Q Consensus 391 ~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 466 (480) +..+.++++.+++..| +++...+++++|+.+-+- ..+..--..-..++.. ...++..+....+++++.+ T Consensus 310 d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~p~~g--gD~~~~~~n~~~~~~~---~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 310 TLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVYIANLNAVAVKNL---SDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred CHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeEeeccccccccch---hhhcCccCCCCCCCCCCCC Confidence 8889999999998876 667777788887654320 0000000000000000 0000111111111111111 No 264 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=89.92 E-value=0.024 Score=29.50 Aligned_cols=349 Identities=12% Similarity=0.069 Sum_probs=122.5 Q ss_pred HHHHHHHHHHhhccC--cchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc----CCC-------chhHHHHH Q lcl|NC_021324. 19 LPNLLEAEAYRNGTR--RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI----SED-------SEGLEELW 85 (480) Q Consensus 19 ~~~~~~~~~YY~G~~--~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~----~~d-------~~~~~~l~ 85 (480) +.-+.++..|....- ............ .-........+|+..++-+-.-++.+ ..+ +.....+. T Consensus 1 M~if~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVTAWQNEA---VEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred CchhHHhHhhhhcccccCcceeeeeecch---hhhhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccccccchHH Confidence 222222222221110 000000000000 00011123334443333332223211 110 11223455 Q ss_pred HHHHh--c---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 86 NWWQA--N---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 86 ~~~~~--n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .++.. | .-..+...+..+.+..|.||++. ++++..+ .+.. . +|. T Consensus 78 ~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~--------------------------i~~~~~g-~~~~-~-~~~-- 126 (378) T protein:vir:94 78 EVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYP--------------------------IFDSETG-ELLD-L-LFA-- 126 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEE--------------------------EeeCCCC-cEEE-E-EEe-- Confidence 55542 3 23456666788899999998751 2222211 1100 0 010 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHH Q lcl|NC_021324. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s 240 (480) ..+ ..|.++.+ +|+.+ +++.+.+...+..+.++++..+. T Consensus 127 -~~~-----~~~~~~dv-----------------------------ih~~~------~~~~~~~~~~~~~~~~~~~~~~~ 165 (378) T protein:vir:94 127 -NDK-----KEYKPEEL-----------------------------VRLTS------PFYINEDTSILDNALASIQTKLE 165 (378) T ss_pred -cCc-----EEechhce-----------------------------eeecC------cCCcccchhHHHHHHHHHHHHHh Confidence 011 01122222 23321 11111111112222233322221 Q ss_pred HHHHHHHHhcchhhhh--cCCCccccccccccchhh---------hhcchheecCCCCceEEEeccccHHHHHHHHHHHH Q lcl|NC_021324. 241 NLQSASQILGTPLRVI--SGVTTDELTNDGENTTLD---------IYYGRILTLASEAAKISEFKAAELRNFAEEMEVFR 309 (480) Q Consensus 241 ~~~~~~~~~~~p~l~i--~G~~~~~~~~~~~~~~~~---------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~ 309 (480) .+.+--++ .+. ......+.....++ ...+.++.++ ++.++.+++......-.+.++... T Consensus 166 --------~~~~~g~l~~~~~-l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~-~g~~~~~l~~~~~~~~~~~~~~~~ 235 (378) T protein:vir:94 166 --------QGKLRGLLKINAF-LDIDNTQEYREKALATIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNKDEIDLIK 235 (378) T ss_pred --------hCCcccceeeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceecc-CCceEEEccCChHHhhHHHHHHHH Confidence 11111122 111 11110000111111 1123456665 456888776433222245667778 Q ss_pred HHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCC Q lcl|NC_021324. 310 KEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPST 389 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~ 389 (480) .+|+..-++|+..+.++.. ....+.+....|.-.+...+..+...|-.--. ...+.. ......+.+.+..-.- T Consensus 236 ~~Ia~~fgvPp~~l~g~~~---e~~~~~f~~~tl~P~~~~ie~~l~~~Ll~~~e---~~~g~~-~~~~~~~~f~~~~l~~ 308 (378) T protein:vir:94 236 SELLTGYFMNENILLGTAT---QEQQIYFYNSTIIPLLIQLEKELTYKLISTNR---RRVVKG-NLYYERIIVDNQLFKF 308 (378) T ss_pred HHHHHHhCCCHHHhcCCch---HHHHHHHHHHHHHHHHHHHHHHHHhhcCChhH---hhhhhh-hcccceeEeecchhhh Confidence 8999999999988865321 11122222222222222222221111100000 000100 1112234455566666 Q ss_pred CCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCC Q lcl|NC_021324. 390 PTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQT 469 (480) Q Consensus 390 ~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) .+..+.++++.+++..| +++.-.+++++|+.+-+- -.+..-...-..++ .....++.......+++ T Consensus 309 ~d~~~~~e~~~~~~~~G--~~t~NE~R~~~g~~p~~g--gd~~~~~~n~~~~~---~~~~~~~~~~~~~~~~e------- 374 (378) T protein:vir:94 309 ATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVYIANLNAVAVK---NLSDLQGNRKDVTSTDE------- 374 (378) T ss_pred cCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeeeecccccchh---cchhcccccCCCCCCCC------- Confidence 78889999999998876 566666777777643210 00000000000000 00000000001111111 Q ss_pred CCCCCCc Q lcl|NC_021324. 470 SPSGFNR 476 (480) Q Consensus 470 ~~~~~~~ 476 (480) +.|+ T Consensus 375 ---~~n~ 378 (378) T protein:vir:94 375 ---TNNQ 378 (378) T ss_pred ---CCCC Confidence 1111 No 265 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=89.78 E-value=0.024 Score=29.42 Aligned_cols=303 Identities=13% Similarity=0.052 Sum_probs=109.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..|++.++. ...-++.++-+|.|+.--+-.+. ..+.+.--...+..-++-...+.+... |. ++.--. T Consensus 33 ~~~p~~v~~--------~~~~~~~~~~~~~~~~~~pp~~~---~~la~~~~~~~~h~~~l~~k~n~l~~~-~~-Pn~~~t 99 (351) T protein:vir:78 33 FDDPTPVMN--------RAEILDYVECWSNGEWFEPPVSF---AGLAKSFRASTHHSSALFFKANVLAST-FR-PHRWLS 99 (351) T ss_pred cCCceeecC--------cchhhhhhhhhccCceecCCCCH---HHHHHHHhhhHhhhhhhhhhhhHHhhc-cc-CCCCCC Confidence 222222211 00012333455666421111110 111111001112222232223322111 11 111100 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCce-eEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~-~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ...+.+++.+.+.+|.||+.+-.+. .|+ ..+..++|..+.+..+.. ++|.. T Consensus 100 -------------~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~pl~~~~v~~~~~~~---------~~~~~ 151 (351) T protein:vir:78 100 -------------RHAFERWALDFLTFGNGYLERRRNM------VGGTLRLEPALAKYVRRKADFS---------GFVYV 151 (351) T ss_pred -------------HHHHHHHHHHHHhcCCeEEEEEECC------CCCEEEEEEecCcceEEeeeCC---------eEEEE Confidence 1123456678889999999877543 344 356666666554433221 01111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) ..+ +..+.|.. + -|+||++-....+.+|.|.+...+ +.+.... T Consensus 152 ~~~------------~~~~~~~~-------------------~--eVihir~~~~~~~~yGl~~~~~a~----~si~l~~ 194 (351) T protein:vir:78 152 NGW------------QERHEFAP-------------------D--SVFQLVRPDINQEVYGLPEYLSSL----HSAWLNE 194 (351) T ss_pred ecC------------CeEEEEcc-------------------c--cEEEEcCCCCCCCcccccHHHHHH----HHHHHHH Confidence 001 11111110 1 145665433345678887764433 3332222 Q ss_pred HHH---HHHHHHhcchhhh--hcCCCccccccccccchhhhhc-----chheec-C---CCCceEEEeccccH-HHHHHH Q lcl|NC_021324. 240 MNL---QSASQILGTPLRV--ISGVTTDELTNDGENTTLDIYY-----GRILTL-A---SEAAKISEFKAAEL-RNFAEE 304 (480) Q Consensus 240 s~~---~~~~~~~~~p~l~--i~G~~~~~~~~~~~~~~~~~~~-----~~~~~~-~---g~~~~~~~~~~~~~-~~~~~~ 304 (480) +-. .+.....+.|-.+ ++|..+.++..+.-...|+... +.++.+ + .++.++..+..... ..|++. T Consensus 195 ~a~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~qf~e~ 274 (351) T protein:vir:78 195 SSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNI 274 (351) T ss_pred HHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHHHHHH Confidence 211 1222222333333 3443333222111111122111 123322 2 13446766654322 247777 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEe Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVW 384 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f 384 (480) .+....+|+..-++|+.-+|....+.+++..++-.... .+...|.-+++.+..+.+. ...+ -+.| T Consensus 275 k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~----------f~~~~l~P~~~~iee~n~~-l~~~----~~~F 339 (351) T protein:vir:78 275 KNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARV----------FGRNEIRPLQARFAELNDW-LGDE----VVRF 339 (351) T ss_pred HHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHH----------HHHHHHHHHHHHHHHHHhh-cCcc----ceec Confidence 77888999999999999887533222111111111111 1111111111111111110 0111 1445 Q ss_pred cCCCCCCHHHHHHHHH Q lcl|NC_021324. 385 RDPSTPTVAAKADAVS 400 (480) Q Consensus 385 ~~~~~~~~~~~ad~~~ 400 (480) .+..- ..+|+-+ T Consensus 340 ~~~~L----lr~d~ka 351 (351) T protein:vir:78 340 DDYEI----PPAPVAA 351 (351) T ss_pred Chhhh----ccccccC Confidence 32211 1111110 No 266 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=89.63 E-value=0.025 Score=29.34 Aligned_cols=368 Identities=11% Similarity=0.036 Sum_probs=128.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCCchhH Q lcl|NC_021324. 5 HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSEGL 81 (480) Q Consensus 5 ~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~~~ 81 (480) .=+..++. .+ +-..+...+.|. .+. ...............+|+..++-+-.-++.+ ..+.... T Consensus 1 MGlf~~~~----~~--~~~~~~~~~~~~-~~~-------~~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~~~ 66 (395) T protein:vir:98 1 MGILDFFS----FK--KSGTLSDDDSGS-TTS-------EKLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEKLTENQ 66 (395) T ss_pred Ccchhhhc----CC--Ccccccccccch-hhh-------hhcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCCccccc Confidence 00111110 00 000000001110 000 0000000111222333444433332223322 2222222 Q ss_pred HHHHHHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEE Q lcl|NC_021324. 82 EELWNWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) Q Consensus 82 ~~l~~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~ 156 (480) ..+..++.. |. -..+...+....+.+|.||+++-.+. ... + |......+... + . ..+ T Consensus 67 ~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~--------~~~---~-~~~~~~~~~~~-~-~----~~~ 128 (395) T protein:vir:98 67 KDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGK--------GIY---V-ADSFTQDKKIS-G-S----QFK 128 (395) T ss_pred chHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCC--------cee---c-CCccccccccc-C-c----ccc Confidence 345555542 32 34567778889999999998765421 111 1 11111111000 0 0 000 Q ss_pred EEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHH-HHHH Q lcl|NC_021324. 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV-TDAA 235 (480) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l-~D~~ 235 (480) +...+.. .+...|.++. |+||++.......++.+-+ +....+ ...+ T Consensus 129 ~~~~~~~---~~~~~~~~~e-----------------------------vih~k~~~~~~~~~~~~~~-~~~~~~~~~~~ 175 (395) T protein:vir:98 129 VSRVQGQ---TYEKTFTFDQ-----------------------------VIYLKNDNSDLMSKVESLW-EEYGELLGHVI 175 (395) T ss_pred eeeecCc---eeeeEecCcc-----------------------------EEEecCCCCCccccccchh-hhHHHHHHHHH Confidence 0000000 0111122222 3445433222222222211 111111 1111 Q ss_pred HHHHHH-HHHHHHHhcchhhhhcCCCcccccccc---cc----ch---hhhhcchheecCCCCceEEEeccc-------c Q lcl|NC_021324. 236 SRTLMN-LQSASQILGTPLRVISGVTTDELTNDG---EN----TT---LDIYYGRILTLASEAAKISEFKAA-------E 297 (480) Q Consensus 236 ~~~~s~-~~~~~~~~~~p~l~i~G~~~~~~~~~~---~~----~~---~~~~~~~~~~~~g~~~~~~~~~~~-------~ 297 (480) +..... .......+..+...+.+. .....+.. .. .. +.....+++.++ ++.++.+++.. . T Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~-~g~~~~~l~~~~~~~~~~~ 253 (395) T protein:vir:98 176 NNQKIANQIRFTMIPPKDKVRERAQ-ENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVT-ANTNYEEYGSKNTGAVKSY 253 (395) T ss_pred HHHHHHHHHHHhhcccccccccccc-ccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecC-CCceeEecccccccccChh Confidence 111111 011111111122222111 11000000 00 00 111222344444 34566665421 2 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccc Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEY 377 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 377 (480) ...+++..+..+.+|+.+-++|+..++++..+ .+...+.+....|.-.+...+..+... +....... T Consensus 254 ~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~sn-~e~~~~~f~~~tl~P~~~~ie~~l~~k----------ll~~~~~~-- 320 (395) T protein:vir:98 254 VDDIKKLKDQYMAEFAEMLGIPISLLHGDIAD-NQKNYELLLEGPIESLITNIVDGLEYA----------IFDKSETL-- 320 (395) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhcCCccc-HHHHHHHHHHHHHHHHHHHHHHHHHHh----------cCChhhhc-- Confidence 33566777777889999999999999754322 222222222222222222222111111 11111111 Q ss_pred eeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCC Q lcl|NC_021324. 378 TRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPK 457 (480) Q Consensus 378 ~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 457 (480) ....+.|......+..+.++++.+++..| +++...+++.+|..+-+-....+. .- +.+..+. + T Consensus 321 ~g~~f~~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~g~~Pi~~~~gD~~------------~~--~~n~~~~-~ 383 (395) T protein:vir:98 321 QGSFIKVTGLKNYDLFSISNQADKLISSG--FVFIDEVREEIGLPELPDGLGKVL------------YM--TKNYESV-L 383 (395) T ss_pred CcceeeehhhhccCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCCcee------------ee--cccceec-c Confidence 12345666777788899999999998876 667777777777643211000000 00 0000000 0 Q ss_pred CCCCCCCccCCC Q lcl|NC_021324. 458 PTVTETKTETQT 469 (480) Q Consensus 458 ~~~~d~~~~~~~ 469 (480) ..+++++.+.+. T Consensus 384 ~~gge~~~~~~~ 395 (395) T protein:vir:98 384 ERGGEVDEEVET 395 (395) T ss_pred cccCCCCCCCCC Confidence 011111111111 No 267 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=88.76 E-value=0.03 Score=28.91 Aligned_cols=295 Identities=15% Similarity=0.081 Sum_probs=102.4 Q ss_pred CCCHHHHHHHH-HHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTYHEHVERL-QGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l-~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) +.|++.++..- +..+ ..-+....+||+ +++.. ..+.+.--...+..-++..... T Consensus 26 ~~~p~~~~~~~~~~~~---~~~~~~~~~~~~--pp~~~------~~la~l~~~~~~h~~~i~~k~n-------------- 80 (346) T protein:vir:10 26 FGDPIPVLDRADILNY---LECSAMYEKWYN--PPMSF------DGLAKSLRSSTHHESAIITKAN-------------- 80 (346) T ss_pred cCCcceecCchhHHHH---HHHhhcCCceEe--cCCCH------HHHHHHHHhhhhcchhhhhhhh-------------- Confidence 33443332210 0000 000000011121 11110 0011000000010001111111 Q ss_pred hHHHHHHHHHh-cCH--HHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEE Q lcl|NC_021324. 80 GLEELWNWWQA-NDL--DEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 80 ~~~~l~~~~~~-n~~--~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) .+..+++. |.. ...+.+++.+.+.+|.||+.+..+. .|++ .+..++|..+.+..+.. . T Consensus 81 ---~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~------~G~~~~L~pl~~~~v~~~~~~~---~------ 142 (346) T protein:vir:10 81 ---ILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNR------LGQVQRIESPLAKYVRKGLEAG---Q------ 142 (346) T ss_pred ---hHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcC------CCcEEEEEEecCCceEEEEcCC---e------ Confidence 11111111 100 1223456678889999999876532 4443 55666666654432211 0 Q ss_pred EEEEecCCceEEEEEEEeC-CcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHH Q lcl|NC_021324. 156 LYTTRDDVAVPDRATLYLP-DETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~ 234 (480) ..+ .++.. +..+.|.. + -|+|+++.....+.+|.|.+.. .... T Consensus 143 ----------~~~-~~~~~~g~~~~~~~-------------------~--dIih~r~~~~~~~~~G~~~~~~----a~~s 186 (346) T protein:vir:10 143 ----------FYY-VPQRFDHQEHEFAK-------------------G--SIYHLLEPDINQDIYGLPQYLS----ALQS 186 (346) T ss_pred ----------EEE-EEEccCCeEEEEec-------------------c--cEEEecCCCCCCCeeeccHHHH----HHHH Confidence 000 11111 11111110 1 1466665433456788876643 2333 Q ss_pred HHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhhc-----chheecC-CC---CceEEEeccccH-H Q lcl|NC_021324. 235 ASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIYY-----GRILTLA-SE---AAKISEFKAAEL-R 299 (480) Q Consensus 235 ~~~~~s~~~~~~~~~~---~p~l~--i~G~~~~~~~~~~~~~~~~~~~-----~~~~~~~-g~---~~~~~~~~~~~~-~ 299 (480) +....+-..-...+|. .|-.+ +++....++..+.-...|+... +.++.+. |+ +.++..++.... . T Consensus 187 i~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~~~d~ 266 (346) T protein:vir:10 187 AWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADVSAKD 266 (346) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCChhHH Confidence 3322222222223333 34333 3443322221111111122111 2233332 22 234555543322 2 Q ss_pred HHHHHHHHHHHHHHhhcCCChHHhccccccchH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_021324. 300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPAS-----AEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVT 374 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~S-----g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~ 374 (480) .|++..+....+|+..-++|+..+|....+.++ ..++.+....|.=.+.+. ++++ ..++.. T Consensus 267 qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~i--------ee~n----~~L~~e-- 332 (346) T protein:vir:10 267 EFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFFITEIEPLQERL--------KEFN----QWLGQE-- 332 (346) T ss_pred HHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHH--------HHHH----hhcccc-- Confidence 377777778899999999999988753322211 111222111111111111 1111 111111 Q ss_pred ccceeeeEEecCCCCCCHHHHHHH Q lcl|NC_021324. 375 EEYTRLETVWRDPSTPTVAAKADA 398 (480) Q Consensus 375 ~~~~~~~v~f~~~~~~~~~~~ad~ 398 (480) .|.|.+...- .++. T Consensus 333 ------~i~F~~~~ll----~~~~ 346 (346) T protein:vir:10 333 ------VIKFKPSKLL----QRTQ 346 (346) T ss_pred ------eeeechhhhc----ccCC Confidence 1445321110 0010 No 268 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=86.48 E-value=0.045 Score=27.95 Aligned_cols=338 Identities=12% Similarity=0.036 Sum_probs=123.0 Q ss_pred HHHHHHHHHHhhccCcchhcCcccchhhhcceec--cchHHHHHHHHHhhhccCcccc---C-CC-------chhHHHHH Q lcl|NC_021324. 19 LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQ--PGWVATYLRTLSDRLDIEGFRI---S-ED-------SEGLEELW 85 (480) Q Consensus 19 ~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~--~n~~~~iVd~~~~~l~~~~~~~---~-~d-------~~~~~~l~ 85 (480) +.-..++..+-.+.- . ......-.+....+. ...+..+|+..++-+-.-++.+ . ++ +.....+. T Consensus 1 Mg~f~~~~~f~~~~~--~-~~~~~~~~~~~~~~~~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:93 1 MNLFGKVVSFSRGKL--N-NDTQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred Cccchhhhhhhcccc--C-CCcceeeecccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccchHH Confidence 111111111111100 0 000000000000010 1112223333333222223321 1 00 01123456 Q ss_pred HHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 86 NWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 86 ~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .++.. |. ...+...+..+.+.+|.||+++..+ +..|++.. +|- T Consensus 78 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~-----~~~g~~~~----------l~~----------------- 125 (378) T protein:vir:93 78 EVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFD-----DNTGELLD----------LLF----------------- 125 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-----cCCceEEE----------EEe----------------- Confidence 66642 32 3466777888999999999865321 11222211 110 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHH Q lcl|NC_021324. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s 240 (480) .+.+ ..|.++.++ ||++. .....|.|- +..+.++++..++ T Consensus 126 ~~~~-----~~~~~~dii-----------------------------h~r~~--~~~~~~~s~----l~~~~~~i~~~~~ 165 (378) T protein:vir:93 126 ADDK-----KEYKTEELV-----------------------------RLTSP--FYINEDTSI----LDNALASIQTKLE 165 (378) T ss_pred cCCe-----eEeccceeE-----------------------------EecCc--cccchhhHH----HHHHHHHHHHHHh Confidence 0111 011222232 33221 011112221 2333333332221 Q ss_pred HHHHHHHHhcchhhhhc--CCCccccccccccchhhh---------hcchheecCCCCceEEEeccccHHHHHHHHHHHH Q lcl|NC_021324. 241 NLQSASQILGTPLRVIS--GVTTDELTNDGENTTLDI---------YYGRILTLASEAAKISEFKAAELRNFAEEMEVFR 309 (480) Q Consensus 241 ~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~~~~---------~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~ 309 (480) .+.+-.++. |. ..+.........+.. ..+.++.++ ++.+|.+++......-++.++... T Consensus 166 --------~~~~~g~l~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~g~~~~~l~~~~~~~~~~~~~~~~ 235 (378) T protein:vir:93 166 --------QGKLRGLLKINAF-LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNKDEIDLIK 235 (378) T ss_pred --------cCcccceeeeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceEcC-CCceEEEccCChhhhhHHHHHHHH Confidence 122222221 21 111110111111110 123455665 446788776443333345567778 Q ss_pred HHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCC-----C-cccce Q lcl|NC_021324. 310 KEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGRE-----V-TEEYT 378 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~~-----~-~~~~~ 378 (480) .+|+.+-++|+..+++.. +....+. .+...|.-+++.+.. +.... . ..... T Consensus 236 ~~Ia~~fgVPp~~l~g~~---~e~~~~~---------------f~~~tl~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~ 297 (378) T protein:vir:93 236 SELLTGYFMNENILLGTA---TQEQQIY---------------FYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYE 297 (378) T ss_pred HHHHHHhCCCHHHhcCCc---HHHHHHH---------------HHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhccccc Confidence 899999999998885432 1111111 222223322222211 11100 0 00112 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) .+++.+......|..+.++++.+++.+| +++...+++++|+.+-+- .....--..-...+.... .++.....+ T Consensus 298 ~~~fd~~~l~~~d~~~~~~~~~~~~~~G--~~t~NE~R~~~gl~p~~g--gD~~~~~~n~~~~~~~~~---~~~~~~~~~ 370 (378) T protein:vir:93 298 RIIVDNQLFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVYIANLNAVAVKNLSD---LQGSRKDVT 370 (378) T ss_pred ceeeccchhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCeeeeccccccccchhh---hcCccCCCC Confidence 3445555666788889999999998876 567777787887653321 000000000000000000 000000000 Q ss_pred CCCCCCccCCCCCCCCCc Q lcl|NC_021324. 459 TVTETKTETQTSPSGFNR 476 (480) Q Consensus 459 ~~~d~~~~~~~~~~~~~~ 476 (480) .+ .++.|+ T Consensus 371 ~~----------~e~~n~ 378 (378) T protein:vir:93 371 ST----------DETNNQ 378 (378) T ss_pred CC----------CCCCCC Confidence 01 111111 No 269 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=85.56 E-value=0.052 Score=27.63 Aligned_cols=289 Identities=14% Similarity=0.080 Sum_probs=104.7 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---cchhcCcccchhhhcc-e--eccchH-HHHHHHHHhhhccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYL-D--VQPGWV-ATYLRTLSDRLDIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~---~i~~~~~~~~~~~~~~-~--~~~n~~-~~iVd~~~~~l~~~~~~ 73 (480) -.|++..+.. ..-++.++-++.|+. ++...+ +.+. + ..|+-| ...++.++....++++. T Consensus 28 ~~~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~~~~~------la~~~~a~~~h~s~i~~k~n~l~~~~~Pnp~~ 93 (344) T protein:vir:56 28 FGEPVPVLDR--------RDILDYVECISNGRWYEPPVSFTG------LAKSLRAAVHHSSPIYVKRNILASTFIPHPWL 93 (344) T ss_pred cCCceeecCc--------chhhhHHHhhhcCccccCCCCHHH------HHHHHhhhhhhCccceehhhhHHhhcCCCCCC Confidence 1222211110 000122233334432 111100 0000 0 000000 00011111111222222 Q ss_pred cCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 74 ISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 74 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) . . ..+..++.+.+.+|.||+.+-.+. .|++ .+..++|..+-+.-+.. T Consensus 94 t------~-------------~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~pl~~~~v~~~~~~~------- 141 (344) T protein:vir:56 94 S------Q-------------QDFSRFVLDFLVFGNAFLEKRYST------TGKVIRLETSPAKYTRRGVEED------- 141 (344) T ss_pred C------H-------------HHHHHHHHHHHhcCCeEEEEEECC------CCcEEEEEEeCCceeEEeecCC------- Confidence 1 0 112346678888999999876532 3444 45555554433221110 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) ++|....+ +..+.| ..=-|+|+++-....+.+|.|.+.- .. T Consensus 142 --~~~~~~~~------------g~~~~~---------------------~~~dIiHir~~~~~~~~~Gls~~~~----a~ 182 (344) T protein:vir:56 142 --VYWWVPSF------------NEPTAF---------------------APGSVFHLLEPDINQELYGLPEYLS----AL 182 (344) T ss_pred --EEEEEecC------------CeEEEE---------------------cCccEEEECCCCCCCCcccccHHHH----HH Confidence 00100000 011111 0112466654333356788887643 33 Q ss_pred HHHHHHHHHHHHHHHHhc---chhhhh--cCCCccccccccccchhhh----hcchhee--cCC---CCceEEEecccc- Q lcl|NC_021324. 233 DAASRTLMNLQSASQILG---TPLRVI--SGVTTDELTNDGENTTLDI----YYGRILT--LAS---EAAKISEFKAAE- 297 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~---~p~l~i--~G~~~~~~~~~~~~~~~~~----~~~~~~~--~~g---~~~~~~~~~~~~- 297 (480) +++....+-..-...+|. .|-.+| +|....+...+.-...|+. ..++.+. .++ ++.++..+.... T Consensus 183 ~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~~~~~~g~~~~r~l~l~~p~g~~~G~~~~pis~~~~ 262 (344) T protein:vir:56 183 NSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVAT 262 (344) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChH Confidence 444333332222233443 344333 4433332211111111211 1222222 232 345677665432 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchH-H--H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS-A--E--AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE 372 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~S-g--~--Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 372 (480) -..|++..+....+|+..-++|+..+|....+.++ | . ++.+....|.=.+ ..++++.. .++.. T Consensus 263 d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~--------~~ie~~n~----~l~~~ 330 (344) T protein:vir:56 263 KDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQ--------DRIREING----WIGQE 330 (344) T ss_pred HHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHH--------HHHHHHHh----hhccc Confidence 22478877888899999999999998853222211 1 1 1111111111111 11111111 22211 Q ss_pred CcccceeeeEEecCCCCCCHHH Q lcl|NC_021324. 373 VTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 373 ~~~~~~~~~v~f~~~~~~~~~~ 394 (480) . +.|.+.......+ T Consensus 331 ~--------~~F~~y~l~~~~~ 344 (344) T protein:vir:56 331 V--------IRFKNYSLDTDNG 344 (344) T ss_pred c--------ccCCCccccccCC Confidence 0 3343332211111 No 270 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=85.41 E-value=0.053 Score=27.58 Aligned_cols=290 Identities=14% Similarity=0.107 Sum_probs=105.3 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---cchhcCcccchhhhcceeccchH-HHHHHHHHhhhccCccccCC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYLDVQPGWV-ATYLRTLSDRLDIEGFRISE 76 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~---~i~~~~~~~~~~~~~~~~~~n~~-~~iVd~~~~~l~~~~~~~~~ 76 (480) -.|++.++.. ..-++.++-++.|+. ++...+-. ... +-...|.-| ....+..+....++++-. T Consensus 28 f~~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~~~~~la--~~~-~a~~~h~~~i~~k~n~l~~~~~Pn~~lt-- 94 (344) T protein:vir:20 28 FGEPVPVLDR--------RDILDYVECISNGRWYEPPVSFTGLA--KSL-RAAVHHSSPIYVKRNILASTFIPHPWLS-- 94 (344) T ss_pred cCCceEecCc--------chhhhhhhhhhcCceecCCCCHHHHH--HHH-hhhhhhCccceehhhhHHHhccCCCCCC-- Confidence 1122111110 000122222333432 11111000 000 000001000 000011111112222210 Q ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEE Q lcl|NC_021324. 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVR 155 (480) Q Consensus 77 d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~ 155 (480) . ..+..++.+.+.+|.||+.+-.+. .|++ .+..++|..+-...+.. + T Consensus 95 ----~-------------~~f~~~~~d~ll~Gnay~~i~rn~------~G~~~~L~pl~~~~vr~~~~~~---------~ 142 (344) T protein:vir:20 95 ----Q-------------QDFSRFVLDFLVFGNAFLEKRYST------TGKVIRLETSPAKYTRRGVEED---------V 142 (344) T ss_pred ----H-------------HHHHHHHHHHHhcCCeEEEEEECC------CCcEEEEEEcCCceeEeeecCC---------E Confidence 0 112346677788999999876532 3444 44445444332211110 0 Q ss_pred EEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHH Q lcl|NC_021324. 156 LYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAA 235 (480) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~ 235 (480) +|.... .+..+.|. -+ -|+|+++.....+.+|.|.+. ..++.+ T Consensus 143 ~~~~~~------------~~~~~~~~-------------------~~--eIiHir~~~~~~~~yGls~~~----~a~~si 185 (344) T protein:vir:20 143 YWWVPS------------FNEPTAFA-------------------PG--SVFHLLEPDINQELYGLPEYL----SALNSA 185 (344) T ss_pred EEEEcc------------CCeEEEEc-------------------Cc--cEEEeCCCCCCCCcccccHHH----HHHHHH Confidence 010000 01111111 01 246776543346678988764 334444 Q ss_pred HHHHHHHHHHHHHh---cchhhhh--cCCCccccccccccchhhhh----cch--heecCC---CCceEEEeccccHH-H Q lcl|NC_021324. 236 SRTLMNLQSASQIL---GTPLRVI--SGVTTDELTNDGENTTLDIY----YGR--ILTLAS---EAAKISEFKAAELR-N 300 (480) Q Consensus 236 ~~~~s~~~~~~~~~---~~p~l~i--~G~~~~~~~~~~~~~~~~~~----~~~--~~~~~g---~~~~~~~~~~~~~~-~ 300 (480) ....+-..-...+| +.|-.+| +|....+...+.-...|+.. .++ ++..++ ++.++..++....+ . T Consensus 186 ~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~l~~e~~~~ik~~~~~~~g~~n~r~l~l~~p~g~~~gi~~~pis~~~~d~q 265 (344) T protein:vir:20 186 WLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQGKADGIKIIPLSEVATKDD 265 (344) T ss_pred HHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCCCCccceEEecCCCCccceeEEEcCCChhHHH Confidence 44333333333444 3333333 34333322211111122211 122 222333 24567666543222 3 Q ss_pred HHHHHHHHHHHHHhhcCCChHHhccccccchH-H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc Q lcl|NC_021324. 301 FAEEMEVFRKEAASITGLPPQYLSSSSENPAS-A----EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTE 375 (480) Q Consensus 301 ~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~S-g----~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~ 375 (480) |++..+....+|+..-++|+.-+|....+.++ | .++.+....|.=.+.+ |++ +..++|... T Consensus 266 f~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~P~~~~--------~e~----in~~lg~~~-- 331 (344) T protein:vir:20 266 FFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQDR--------IRE----INGWLGQEV-- 331 (344) T ss_pred HHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHHHH--------HHH----HHHhcCCcc-- Confidence 77887888899999999999988753222111 1 1111111111111111 111 122223211 Q ss_pred cceeeeEEecCCC--CCCH Q lcl|NC_021324. 376 EYTRLETVWRDPS--TPTV 392 (480) Q Consensus 376 ~~~~~~v~f~~~~--~~~~ 392 (480) +.|.+.. ..|. T Consensus 332 ------i~F~~~~l~~~d~ 344 (344) T protein:vir:20 332 ------IRFKNYSLDTDND 344 (344) T ss_pred ------cccCccccccCCC Confidence 3343222 1221 No 271 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=83.75 E-value=0.066 Score=27.06 Aligned_cols=309 Identities=9% Similarity=-0.025 Sum_probs=104.7 Q ss_pred CC----------------------CHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHH Q lcl|NC_021324. 1 MT----------------------TYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVAT 58 (480) Q Consensus 1 ~~----------------------~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~ 58 (480) || |++.++.. ..-++-+.-+|+|.-+.-.-+... ..+.+.--...+..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~--------~~~~~~~~~~~~~~~~~~epp~~~-~~La~l~~~n~~h~~ 71 (348) T protein:vir:26 1 MTEQLIHSHTTDGTESKSVYSFDPNPEPVDTN--------SWMTRYCELFYNDFDDYWEPPISL-KGLAEIANANGYHGS 71 (348) T ss_pred CCccccchhhccccCCceEEEecCCCeeecCc--------chHHHHHHHHhcCCCccccCCCCH-HHHHHHHhhhhhhhh Confidence 22 12211110 000112222333311110000000 111111001112222 Q ss_pred HHHHHHhhhccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccc Q lcl|NC_021324. 59 YLRTLSDRLDIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLY 137 (480) Q Consensus 59 iVd~~~~~l~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~ 137 (480) ++....+.+... |. ++.--. ...+.+++.+.+.+|.||+.+-.+. .|++ .+..++|.. T Consensus 72 ~i~~k~N~l~~~-~~-Pn~~~t-------------~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~~l~~~~ 130 (348) T protein:vir:26 72 LLKARANYVAGR-FM-NGGGLP-------------MYKMNSACWDYFGLGMSAFVKIRSY------LKNVIALEPLPMVH 130 (348) T ss_pred hHhhhhhHHhhc-cc-CCCCCC-------------HHHHHHHHHHHHhcCCeEEEEEEcC------CCcEEEEEEecCce Confidence 222222222111 11 111000 1123356678888999999876543 3443 455565544 Q ss_pred eEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCc Q lcl|NC_021324. 138 MYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGN 217 (480) Q Consensus 138 ~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~ 217 (480) +-+.-| . . .++... ++.. ..|.++. |+||++-....+ T Consensus 131 v~~~~d---~-~-----~~~~~~-~g~~----~~f~~~d-----------------------------IiHir~~~~~~~ 167 (348) T protein:vir:26 131 MRKRKN---G-D-----FVQLLR-NNEQ----KVFKAKD-----------------------------VIFIPQYDPQQQ 167 (348) T ss_pred eEeeec---C-c-----EEEEEe-cCeE----EEEcCcc-----------------------------EEEEcCCCCCCC Confidence 332111 0 0 001000 0000 0111111 355554222355 Q ss_pred cCCCccchHHHHHHHHHHHHHHHHHHHHHHHh---cchhhhh--cCCCccccccccccchhhhhc-----chheec-C-- Q lcl|NC_021324. 218 RYGRSEISPELRKVTDAASRTLMNLQSASQIL---GTPLRVI--SGVTTDELTNDGENTTLDIYY-----GRILTL-A-- 284 (480) Q Consensus 218 ~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~---~~p~l~i--~G~~~~~~~~~~~~~~~~~~~-----~~~~~~-~-- 284 (480) .+|.|.+.. .+..+....+-..-...+| +.|-.++ ++....++..+.-...|+... +.++.+ + T Consensus 168 ~~Gls~~~~----a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g 243 (348) T protein:vir:26 168 IYGLPDYLG----SIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNG 243 (348) T ss_pred cccccHHHH----HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCC Confidence 678876543 3333333222222222333 2333333 343332221111111121111 122222 2 Q ss_pred -CCCceEEEeccccHH-HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 285 -SEAAKISEFKAAELR-NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAI-IATDSRIVKMAERKGRIFGGAWERA 361 (480) Q Consensus 285 -g~~~~~~~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al-~~~~~~l~~k~~~~~~~~~~~l~~~ 361 (480) .++.++..+.....+ .|++..+....+|+..-++|+.-+|....+.++...+ +....-......=....+..+|. T Consensus 244 ~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~ln-- 321 (348) T protein:vir:26 244 KEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVYDFYEVIPVCKRFMDAVN-- 321 (348) T ss_pred CccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 223466666543322 4777777778899999999999887532222211111 11111111111111111111111 Q ss_pred HHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHH Q lcl|NC_021324. 362 MRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAV 399 (480) Q Consensus 362 ~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~ 399 (480) ...+ .. ....+++.|.+.. +...+-++ T Consensus 322 -----~~l~--~~-~~~~~~fdl~~~~---e~~~~~a~ 348 (348) T protein:vir:26 322 -----NDPE--IP-DNLKLKFNLNPGV---ESANGSAV 348 (348) T ss_pred -----hhhC--CC-CccEEEEecCccc---ccchhhcC Confidence 1111 11 1122333332111 11111112 No 272 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=83.57 E-value=0.067 Score=27.01 Aligned_cols=289 Identities=13% Similarity=0.084 Sum_probs=104.9 Q ss_pred CC---------------------------CHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---cchhcCcccchhhhcce Q lcl|NC_021324. 1 MT---------------------------TYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYLD 50 (480) Q Consensus 1 ~~---------------------------~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~---~i~~~~~~~~~~~~~~~ 50 (480) |+ +++.++. ...-++.++-+++|+. ++...+-. + -.+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~--------~~~~~~~~~~~~~~~~~~pp~~~~~la--~---~~~ 67 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGEPVPVLD--------RRDILDYVECISNGRWYEPPISFTGLA--K---SLR 67 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCCceeecC--------CcchhHHHHhhhcCccccCCCCHHHHH--H---HHH Confidence 11 1111110 0001122233334432 11110000 0 000 Q ss_pred eccchHHHHHHHHHhhh----ccCccccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCc Q lcl|NC_021324. 51 VQPGWVATYLRTLSDRL----DIEGFRISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAG 126 (480) Q Consensus 51 ~~~n~~~~iVd~~~~~l----~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g 126 (480) .. .+..-++....+.+ .++++.. ...+..++.+.+.+|.||+.+-.+. .| T Consensus 68 a~-~~h~~~i~~k~n~l~~~~~Pn~~~t-------------------~~~f~~~~~d~ll~Gnay~~i~rn~------~G 121 (344) T protein:vir:60 68 AA-VHHSSPIYVKRNILASTFIPHPWLS-------------------QQDFSRFVLDFLVFGNAFLEKRYST------TG 121 (344) T ss_pred hh-hhhccchhhhhhHHHhhccCCCCCC-------------------HHHHHHHHHHHHhcCCeEEEEEECC------CC Confidence 00 00000011111111 1221110 0123456678888999999876532 45 Q ss_pred ee-EEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccc Q lcl|NC_021324. 127 IP-LIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVP 205 (480) Q Consensus 127 ~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iP 205 (480) ++ .+..++|..+-+..+.. ++|. +...+..+.|. -+ . T Consensus 122 ~~~~L~~l~~~~vr~~~~~~---------~~~~------------v~~~~~~~~~~-------------------~~--e 159 (344) T protein:vir:60 122 KVIRLETSPAKYTRRGVEED---------VYWW------------VPSFNEPTAFA-------------------PG--S 159 (344) T ss_pred cEEEEEEcCcceEEEeecCC---------eEEE------------EccCCeEEEEc-------------------Cc--c Confidence 44 45555555443322111 0110 00001111110 01 2 Q ss_pred eEEeecccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhh---- Q lcl|NC_021324. 206 VVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIY---- 276 (480) Q Consensus 206 vv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~--i~G~~~~~~~~~~~~~~~~~~---- 276 (480) |+|+++.....+.+|.|.|.. .++++....+-..-...+|. .|-.+ ++|....++..+.-...|+.. T Consensus 160 IiHir~~~~~~~~yGlsp~~~----a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~ik~~~~~~~g~~ 235 (344) T protein:vir:60 160 VFHLLEPDINQELYGLPEYLS----ALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRN 235 (344) T ss_pred EEEEcCCCCCCCcccccHHHH----HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCCC Confidence 567765433456789887643 33444333332222233333 23333 344333322211111122211 Q ss_pred cchh--eecCC---CCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccch---HH--HHHHHHHHHHHH Q lcl|NC_021324. 277 YGRI--LTLAS---EAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPA---SA--EAIIATDSRIVK 345 (480) Q Consensus 277 ~~~~--~~~~g---~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~---Sg--~Al~~~~~~l~~ 345 (480) .++. +..++ ++.++..+.... -..|++..+....+|+..-++|+.-+|....+.+ +. .++.+....|.= T Consensus 236 ~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~L~P 315 (344) T protein:vir:60 236 NFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIP 315 (344) T ss_pred CCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHHH Confidence 1122 22233 234566665432 2247888888899999999999998875322211 11 112111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021324. 346 MAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 346 k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~ 394 (480) .+. .|++ +..++|... +.|.+........ T Consensus 316 l~~--------~~e~----ln~~lg~~~--------i~F~~~~l~~~d~ 344 (344) T protein:vir:60 316 LQD--------RIRE----INGWLGQEV--------IRFKNYSLDTDNG 344 (344) T ss_pred HHH--------HHHH----HHHhcCCcc--------cccCccccCCCCC Confidence 111 1111 122223211 2343322111111 No 273 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=83.20 E-value=0.07 Score=26.90 Aligned_cols=300 Identities=14% Similarity=0.053 Sum_probs=110.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCchh Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSEG 80 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~~ 80 (480) ..|++.++. ...-++.++-+|.|+.--+-.+. ..+.+.--...+..-++....+.+... |. ++..-. T Consensus 33 ~~~p~~v~~--------~~~~~~~~~~~~~~~~~~pp~~~---~~la~~~~~~~~h~~~l~~k~n~l~~~-~~-Pnp~~t 99 (351) T protein:vir:79 33 FDDPTPVMN--------RAEILDYVECWSNGEWFEPPVSF---AGLAKSFRASTHHSSALFFKANVLAST-FR-PHRWLS 99 (351) T ss_pred cCCceeecC--------cchhhhhhhhhhcCceecCCCCH---HHHHHHHhhhHhhhhhhhhhhhHHhhc-cc-CCCCCC Confidence 222222211 00012333455666421111110 111111001112222233233322111 11 111000 Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCce-eEEEEEcccceEEEEeCCCCCceEEEEEEEEE Q lcl|NC_021324. 81 LEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGI-PLIRVESPLYMYAELDPRNTRRVTRAVRLYTT 159 (480) Q Consensus 81 ~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~-~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~ 159 (480) ...+.+++.+.+.+|.||+.+..+. .|+ ..+..++|..+-+..+.. ++|.. T Consensus 100 -------------~~~f~~~v~d~ll~Gnay~~~~r~~------~G~~~~L~~l~~~~v~~~~~~~---------~~~~~ 151 (351) T protein:vir:79 100 -------------RHAFERWALDFLTFGNGYLERRRNM------VGGTLRLEPALAKYVRRKADFS---------GFVYV 151 (351) T ss_pred -------------HHHHHHHHHHHHhcCCeEEEEEECC------CCCEEEEEEeCCcceeeeecCC---------eEEEE Confidence 0113456678888999999886543 344 356666666554332211 01111 Q ss_pred ecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHH Q lcl|NC_021324. 160 RDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTL 239 (480) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~ 239 (480) ..+ +. ...|.++ -|+|+++.....+.+|.|.+... .+.+.... T Consensus 152 ~~~-g~---~~~~~~~-----------------------------eIihir~~~~~~~~yGl~~~~~a----~~si~l~~ 194 (351) T protein:vir:79 152 NGW-QE---RHEFEPD-----------------------------SVFQLVRPDINQEVYGLPEYLSS----LHSAWLNE 194 (351) T ss_pred ecC-ce---EEEEcCc-----------------------------cEEEeCCCCCCCCcccccHHHHH----HHHHHHHH Confidence 111 10 0011111 14566543334567888765432 33333222 Q ss_pred HHHHHHHHHh---cchhh--hhcCCCccccccccccchhhhhc-----chheec-CC---CCceEEEecccc-HHHHHHH Q lcl|NC_021324. 240 MNLQSASQIL---GTPLR--VISGVTTDELTNDGENTTLDIYY-----GRILTL-AS---EAAKISEFKAAE-LRNFAEE 304 (480) Q Consensus 240 s~~~~~~~~~---~~p~l--~i~G~~~~~~~~~~~~~~~~~~~-----~~~~~~-~g---~~~~~~~~~~~~-~~~~~~~ 304 (480) +-..-...+| +.|-. +++|....++..+.-...|+... ++++.+ ++ ++.++..+.... -..|++. T Consensus 195 ~a~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~ef~e~ 274 (351) T protein:vir:79 195 SSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDEFFNI 274 (351) T ss_pred HHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHHHHHH Confidence 2222222233 23333 33443332221111111122111 123322 22 234666655432 2247787 Q ss_pred HHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhCCCCcccceeee Q lcl|NC_021324. 305 MEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRI---AMQIMGREVTEEYTRLE 381 (480) Q Consensus 305 l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l---i~~~~~~~~~~~~~~~~ 381 (480) .+....+|+..-++|+.-+|....+.+++..++-...... ...|.-+++. +..+++.. - T Consensus 275 k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~f~----------~~~l~Pl~~~ie~ln~~lg~~--------~ 336 (351) T protein:vir:79 275 KNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFG----------RNEIRPLQARFAELNDWLGDE--------V 336 (351) T ss_pred HHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHH----------HHHHHHHHHHHHHHHhhcCcc--------e Confidence 7888899999999999998763222221111111111111 1111111111 11122211 1 Q ss_pred EEecCCCCCCHHHHHHHHHHH Q lcl|NC_021324. 382 TVWRDPSTPTVAAKADAVSKL 402 (480) Q Consensus 382 v~f~~~~~~~~~~~ad~~~kl 402 (480) +.|.+.. ....-++. T Consensus 337 ~~F~~~~------llr~d~~a 351 (351) T protein:vir:79 337 VTFDDYE------IPPAPVAA 351 (351) T ss_pred eeeChhh------hccccccC Confidence 4563321 11111111 No 274 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=79.67 E-value=0.1 Score=26.02 Aligned_cols=299 Identities=16% Similarity=0.155 Sum_probs=101.3 Q ss_pred HHHHHHHHHHHHHHHHHhhccCc-chhc--Ccccchhhhccee-ccchHHHHHH--HHHhhh--ccCc--cccCCCchhH Q lcl|NC_021324. 12 QGLLARDLPNLLEAEAYRNGTRR-LKTI--GIGAPPELAYLDV-QPGWVATYLR--TLSDRL--DIEG--FRISEDSEGL 81 (480) Q Consensus 12 ~~~~~~~~~~~~~~~~YY~G~~~-i~~~--~~~~~~~~~~~~~-~~n~~~~iVd--~~~~~l--~~~~--~~~~~d~~~~ 81 (480) +++|+ +....... .+.. ............. ..+=+..+.+ ...+|+ ..+| +..+-+-... T Consensus 1 m~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~~~~~~~y~~~~~~~~~~~pp~~~~~l 70 (350) T protein:vir:11 1 MSKRR----------SHRRQQPVTVQSAQEGEFIPRQGGRAEAFTFGDPMPVLDGRGILDYLECWPNGRWYEPPLSMEGL 70 (350) T ss_pred CCccc----------cCCCcCccccCCcchhhhccccccceEEEEeCCceeecCcchhhHHHHHhhcCccccCCCCHHHH Confidence 11110 00000000 0000 0000000000000 0000000000 001111 0111 1001110000 Q ss_pred HH------------------HHHHHHhcC-H-HHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEE Q lcl|NC_021324. 82 EE------------------LWNWWQAND-L-DEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYA 140 (480) Q Consensus 82 ~~------------------l~~~~~~n~-~-~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~ 140 (480) .. |...+.-|. + ...+.+++.+.+.+|.||+.+..+. .|++ .+..++|..+-+ T Consensus 71 a~~~~~~~~h~~~l~~k~n~l~~~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~~~rn~------~G~~~~L~~l~~~~vr~ 144 (350) T protein:vir:11 71 AKSVGSSVYLQSGLKFKRNMLAKTFIPHRLLSRATFEQFSLDWLTFGSAYLEQPRSR------LGTRMPLQAPLAKYMRR 144 (350) T ss_pred HHHHhhhhhhccchhhhhhhhhhcccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcC------CCCEEEEEEeCCceeEe Confidence 00 000111122 1 2335677888899999999887543 4443 455666654433 Q ss_pred EEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCC Q lcl|NC_021324. 141 ELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYG 220 (480) Q Consensus 141 v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G 220 (480) .-+.. . +|....+ +..+.|. -+ -|+||++-....+.+| T Consensus 145 ~~~~~----~-----~~~~~~~------------~~~~~~~-------------------~~--eVihir~~~~~~~~yG 182 (350) T protein:vir:11 145 GTDLE----T-----FYQVRSW------------KDEHEFE-------------------KG--SVIQLREADINQEIYG 182 (350) T ss_pred eecCC----e-----EEEEeeC------------CeEEEEC-------------------cc--cEEEeCCCCCCCCccc Confidence 21110 0 1110001 1111110 01 1456654333345688 Q ss_pred CccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhhc-----chheec-CC---C Q lcl|NC_021324. 221 RSEISPELRKVTDAASRTLMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIYY-----GRILTL-AS---E 286 (480) Q Consensus 221 ~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~--i~G~~~~~~~~~~~~~~~~~~~-----~~~~~~-~g---~ 286 (480) .|.+.. .++++....+-..-...+|. .|-.+ ++|....+...+.-...|+... ++++.+ ++ + T Consensus 183 ls~~~~----a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~g~~~ 258 (350) T protein:vir:11 183 VPEWFC----ALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPNGKKE 258 (350) T ss_pred ccHHHH----HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCCCCcc Confidence 876543 33333332222222223332 33333 3443333222111111222111 122222 22 2 Q ss_pred CceEEEeccccH-HHHHHHHHHHHHHHHhhcCCChHHhccccccchH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 287 AAKISEFKAAEL-RNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS-----AEAIIATDSRIVKMAERKGRIFGGAWER 360 (480) Q Consensus 287 ~~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~S-----g~Al~~~~~~l~~k~~~~~~~~~~~l~~ 360 (480) +.++..+..... ..|++..+....+|+..-++|+..+|....+.++ ..++.+....|.-.+ .. +++ T Consensus 259 g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~----~~----ie~ 330 (350) T protein:vir:11 259 GIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWASLELAPMQ----TR----LQQ 330 (350) T ss_pred ceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHH----HH----HHH Confidence 346666654322 2478887888899999999999998853222111 111111111111111 11 111 Q ss_pred HHHHHHHHhCCCCcccceeeeEEecCCCCCCH Q lcl|NC_021324. 361 AMRIAMQIMGREVTEEYTRLETVWRDPSTPTV 392 (480) Q Consensus 361 ~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~ 392 (480) +.+ .++.. .+.|.+.....+ T Consensus 331 ln~----~l~~~--------~~~F~~~~~~~l 350 (350) T protein:vir:11 331 VNE----MIGEE--------VVRFAQFDAPGL 350 (350) T ss_pred HHh----hcCcc--------ccccCcccccCC Confidence 111 12211 123432222222 No 275 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=79.38 E-value=0.1 Score=25.95 Aligned_cols=291 Identities=15% Similarity=0.095 Sum_probs=110.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccC---cchhcCcccchhhhcceeccchHHHHHHHHHhhh----ccCccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTR---RLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRL----DIEGFR 73 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~---~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l----~~~~~~ 73 (480) ..|++.++.. ..-++-+.-+|.|+. ++.. ..+.+..-...+..-++....+.+ .++++. T Consensus 58 fg~p~~v~~~--------~~~~~~~~~~~~~~~~~pp~~~------~~La~~~~~~~~h~s~l~~k~n~l~~~~~Pnp~l 123 (376) T protein:vir:10 58 FDDPTPVMNR--------AEILDYVECWSNGEWFEPPVSF------AGLAKSFRASTHHSSALFFKANVLASTFRPHRWL 123 (376) T ss_pred cCCceeccCc--------chhhhhhhhhhcCceecCCCCH------HHHHHHHhhhHHhhhhHHHHhHHHHhccCCCCCC Confidence 2222222110 000112233344431 2211 111111001112222222222222 122211 Q ss_pred cCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEE Q lcl|NC_021324. 74 ISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTR 152 (480) Q Consensus 74 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~ 152 (480) . ...+.+++.+.+.+|.||+.+-.+. .|.+ .+..++|..+.+..+... T Consensus 124 T-------------------~~~f~~~v~d~ll~Gnay~~~~rn~------~G~~~~L~pl~~~~vr~~~d~~~------ 172 (376) T protein:vir:10 124 S-------------------RHAFERWALDFLTFGNGYLERRRNM------VGGTLRLEPALAKYVRRKADFNG------ 172 (376) T ss_pred C-------------------HHHHHHHHHHHHhcCCeEEEEEECC------CCCEEEEEEeCCcceEEEeeCCe------ Confidence 0 0123456678888999999876532 4543 566777776655443220 Q ss_pred EEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHH Q lcl|NC_021324. 153 AVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVT 232 (480) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~ 232 (480) +|....... ...|.++ -|+||++-....+.+|.|.+.. .+ T Consensus 173 ---~~~~~~~~~----~~~~~~~-----------------------------eViHir~~~~~~~~yGls~~~~----a~ 212 (376) T protein:vir:10 173 ---FVYVNGWQE----RHEFEPD-----------------------------SVFQLVRPDINQEVYGLPEYLS----SL 212 (376) T ss_pred ---EEEEEcCCe----EEEEccc-----------------------------cEEEecCCCCCCCcccccHHHH----HH Confidence 111111100 0011111 1456654333356778876543 33 Q ss_pred HHHHHHHHHHHHHHHHhcc---hh--hhhcCCCccccccccccchhhhhc-----chheec-C---CCCceEEEecccc- Q lcl|NC_021324. 233 DAASRTLMNLQSASQILGT---PL--RVISGVTTDELTNDGENTTLDIYY-----GRILTL-A---SEAAKISEFKAAE- 297 (480) Q Consensus 233 D~~~~~~s~~~~~~~~~~~---p~--l~i~G~~~~~~~~~~~~~~~~~~~-----~~~~~~-~---g~~~~~~~~~~~~- 297 (480) ..+....+-..-...+|.+ |- ++++|....++..+.-...|+-.. +.++.+ + .++.++..+.... T Consensus 213 ~si~l~~aa~~f~~~~f~NGa~pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~ 292 (376) T protein:vir:10 213 HSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAA 292 (376) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHH Confidence 4443333322222233333 33 333443333222111111121111 122222 2 2345676665432 Q ss_pred HHHHHHHHHHHHHHHHhhcCCChHHhccccccchH---HH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_021324. 298 LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS---AE--AIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE 372 (480) Q Consensus 298 ~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~S---g~--Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 372 (480) -..|++..+....+|+..-++|+.-+|....+.++ .. ++.+....|.=.+... +++.. .++.. T Consensus 293 d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~i--------eeln~----~L~~~ 360 (376) T protein:vir:10 293 KDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARF--------AELND----WLGEE 360 (376) T ss_pred HHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHH--------HHHHh----hcccc Confidence 22478887888899999999999888753322211 11 1111111111111111 11111 11111 Q ss_pred CcccceeeeEEecCCCCCCHHHHHHHHH Q lcl|NC_021324. 373 VTEEYTRLETVWRDPSTPTVAAKADAVS 400 (480) Q Consensus 373 ~~~~~~~~~v~f~~~~~~~~~~~ad~~~ 400 (480) -+.|.+.. ....|.-+ T Consensus 361 --------~~~F~~~~----Llr~d~ka 376 (376) T protein:vir:10 361 --------VVRFDDYE----IPPAPVAA 376 (376) T ss_pred --------ccccChhH----hhcccccC Confidence 14453211 11111111 No 276 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=77.95 E-value=0.12 Score=25.64 Aligned_cols=336 Identities=12% Similarity=0.058 Sum_probs=123.2 Q ss_pred HHHHHHHHHHhhccCcchhcCcccchhhhcceec--cchHHHHHHHHHhhhccCcccc----CCC-------chhHHHHH Q lcl|NC_021324. 19 LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQ--PGWVATYLRTLSDRLDIEGFRI----SED-------SEGLEELW 85 (480) Q Consensus 19 ~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~--~n~~~~iVd~~~~~l~~~~~~~----~~d-------~~~~~~l~ 85 (480) +.=+.++..+..+...- -.... ..+....+. ......+|+..++-+-.-++.+ ..+ +.....+. T Consensus 1 Mg~f~~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN--DTQRV-TAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred CCccccchhcccccccC--Cccee-eeeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccchHH Confidence 11111122222111000 00000 000000011 1122333333333222223221 111 01123455 Q ss_pred HHHHh--cC---HHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEe Q lcl|NC_021324. 86 NWWQA--ND---LDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTR 160 (480) Q Consensus 86 ~~~~~--n~---~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~ 160 (480) .++.. |. ...+...+..+.+.+|.||+++... +..|.++.. | + T Consensus 78 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~-----~~~g~~~~l----------~-p---------------- 125 (378) T protein:vir:94 78 EVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFD-----DNTGELLDL----------L-F---------------- 125 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEee-----CCCceEEEE----------E-e---------------- Confidence 66542 32 3466777888999999999864221 112222211 1 0 Q ss_pred cCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHH Q lcl|NC_021324. 161 DDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLM 240 (480) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s 240 (480) .+.+ . .|.++.+ +||++- .....|.|- +..+..+++..++ T Consensus 126 ~~~~-~----~~~~~di-----------------------------iH~~~~--~~~~~g~s~----l~~~~~~i~~~~~ 165 (378) T protein:vir:94 126 ADDK-K----EYKPEEL-----------------------------VRLTSP--FYINEDTSI----LDNALASIQTKLE 165 (378) T ss_pred cCCe-e----Eeeeeee-----------------------------EEecCc--CCccchhHH----HHHHHHHHHHHHh Confidence 0110 0 0112222 334321 111123222 3333344433322 Q ss_pred HHHHHHHHhcchhhhh--cCCCccccccccccchh----hh-----hcchheecCCCCceEEEeccccHHHHHHHHHHHH Q lcl|NC_021324. 241 NLQSASQILGTPLRVI--SGVTTDELTNDGENTTL----DI-----YYGRILTLASEAAKISEFKAAELRNFAEEMEVFR 309 (480) Q Consensus 241 ~~~~~~~~~~~p~l~i--~G~~~~~~~~~~~~~~~----~~-----~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~ 309 (480) . +.+-.++ .+. ............+ +- ..++++.++ ++.+|.+++......-++.++... T Consensus 166 ~--------~~~~gil~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-~g~~~~~l~~~~~~~~~~~~~~~~ 235 (378) T protein:vir:94 166 Q--------GKLRGLLKINAF-LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVD-NKTEIVELKKDYSVLNKDEIDLIK 235 (378) T ss_pred c--------ccccceeeeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceecC-CCceEEEccCChhhhhHHHHHHHH Confidence 1 1222222 121 1111111111111 10 123456665 446888776433222234557777 Q ss_pred HHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCC-----CC-cccce Q lcl|NC_021324. 310 KEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGR-----EV-TEEYT 378 (480) Q Consensus 310 ~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~-----~~-~~~~~ 378 (480) .+|+..-++|+.-++++. +....+. .+...|.-++..+.. +... .. ..... T Consensus 236 ~~Ia~~fgVP~~~l~~~~---se~~~~~---------------f~~~tL~P~~~~ie~~l~~~Ll~~~er~~g~~~~~~~ 297 (378) T protein:vir:94 236 SELLTGYFMNENILLGTA---SQEQQIY---------------FYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYE 297 (378) T ss_pred HHHHHHhCCCHHHhcCCh---HHHHHHH---------------HHHHHHHHHHHHHHHHHHhhcCChhHhhhhhhccccc Confidence 899999999998885432 1111111 222223322222211 1110 00 00112 Q ss_pred eeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCC--CcCC Q lcl|NC_021324. 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQA--DATP 456 (480) Q Consensus 379 ~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 456 (480) .+++.+......|..+.++.+.+++.+| +++.-.+++++|+.+-+- -.+. .-.....+ .... T Consensus 298 ~~~f~~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~p~~g--GD~~------------~~~~n~~~~~~~~~ 361 (378) T protein:vir:94 298 RIIVDNQLFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIEG--GDVY------------IANLNAVAVKNLSD 361 (378) T ss_pred ceeecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCC--CCee------------eecccccccccchh Confidence 3445555566678889999999998876 566666777777643320 0000 00000000 0000 Q ss_pred CCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 457 KPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 457 ~~~~~d~~~~~~~~~~~~~~ 476 (480) .+. . +....+..++.|+ T Consensus 362 ~~~-~--~~~~~~~~e~~n~ 378 (378) T protein:vir:94 362 LQG-S--RKDVTSTDETNNQ 378 (378) T ss_pred hcC-C--cCCCCCCCCCCCC Confidence 000 0 0000011111122 No 277 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=74.49 E-value=0.16 Score=24.98 Aligned_cols=458 Identities=10% Similarity=-0.010 Sum_probs=172.5 Q ss_pred CCC-----HHHHHHHHHHHHHHHH-----------HHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHH Q lcl|NC_021324. 1 MTT-----YHEHVERLQGLLARDL-----------PNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLS 64 (480) Q Consensus 1 ~~~-----~~~~i~~l~~~~~~~~-----------~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~ 64 (480) |.+ +.+-=+-|.++|..+. .|-+...+-|.+...-. .+...++.+.+.-...|.=+.. T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~------~~~~~r~nl~~sni~~i~P~iY 74 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSA------HDAETRWNLFSTNIQTQMASLY 74 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCC------CccccccchhhhhHHHHhhhhh Confidence 443 1111111222222222 22223333444322111 1111122222222222322222 Q ss_pred hhhccCccc----cCCCch----hHHHHHHHH------HhcCHHHHHHHHHHHHhhcCeeEEEEecCcccc--------C Q lcl|NC_021324. 65 DRLDIEGFR----ISEDSE----GLEELWNWW------QANDLDEESVLGHDDSLTFGRAYITVSHPDVES--------G 122 (480) Q Consensus 65 ~~l~~~~~~----~~~d~~----~~~~l~~~~------~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~--------~ 122 (480) ++. +++.. ...|.. ..+.+.+.+ +.++|+..+....++++.+|+|-++|.+-.... . T Consensus 75 ar~-P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~~~~~ 153 (663) T protein:vir:34 75 GQT-PKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGVDAIL 153 (663) T ss_pred cCC-CcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccccccC Confidence 221 11111 122221 223344433 345689999999999999999877776511000 0 Q ss_pred CC-----------------CceeEEEEEcccceEEEEeCCCCC-ceEEEEEE-E-------------------------E Q lcl|NC_021324. 123 DP-----------------AGIPLIRVESPLYMYAELDPRNTR-RVTRAVRL-Y-------------------------T 158 (480) Q Consensus 123 d~-----------------~g~~~~~~~~p~~~~~v~d~~~~~-~~~~~~~~-~-------------------------~ 158 (480) |+ ...++|-.+.=.++ ++|+.... .+.|..+. | . T Consensus 154 D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~df--l~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~~~~~ 231 (663) T protein:vir:34 154 DEATGAELAAAVPPTQRKAYECVETDYLHWQDV--LWSPARVWHEVRWLAFRNLLDMREFNARFDADGSRNLWASVPKVG 231 (663) T ss_pred CCccccchhcccccchhhcccceeeeeechhhc--ccchhhccccccceeeeccCCHHHHHHhhcCChhhhhhhhccCcC Confidence 00 01223333222221 11111000 11111100 0 0 Q ss_pred Eec---------CCceEEEEEEEeCCc--EEEEEecCCCccccccccc---cccccccccceEEeecccccCccCCCccc Q lcl|NC_021324. 159 TRD---------DVAVPDRATLYLPDE--TVPLRRNGGLNDQWVVDGD---VIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) Q Consensus 159 ~~~---------~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~---~~~~~~g~iPvv~~~n~~~~~~~~G~s~i 224 (480) ... ........++|.... ++++. .+.. .+....+ ..++.|+ ||...|++. ...+.+..+++ T Consensus 232 ~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~-eg~~--~~L~~~~p~lgl~~ffP-cPrpl~~~~-~~ds~ipvpd~ 306 (663) T protein:vir:34 232 KPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYV-EGYS--AVLDTQPDPLGLESFFP-CPKPLLANW-TTDKVVPRPDF 306 (663) T ss_pred CccccCCCCCcchhcCcceeEEEecCCcEEEEEE-cCcc--eecccCCCCCCCCCCCC-Cccccccee-cCCCeecCCcH Confidence 000 001334566776533 33332 2221 1111111 1123333 566555553 23466777777 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCccccccccccchhhhhcchheec--------CCCCce-EEEecc Q lcl|NC_021324. 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILTL--------ASEAAK-ISEFKA 295 (480) Q Consensus 225 ~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~--------~g~~~~-~~~~~~ 295 (480) .- -+.+++.+|.+ ++..|.+...- .+.|.-.....++-.....+...+.++.+ .|+-.+ +..++- T Consensus 307 ~~-y~~~~~E~n~~-t~Rin~l~d~i----kv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi 380 (663) T protein:vir:34 307 VL-AQDLYKEIDLV-STRITLLERAI----RVVGVYDKSSGLTIGRLLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPL 380 (663) T ss_pred HH-HHHHHHHHHHH-HHHHHHHHhhh----hhceeeccccchhHHHHHHHhhCCCceecchhhhhhhhcCccchhhcccc Confidence 63 56788888876 44444443321 22232111111111111112222222222 122112 222332 Q ss_pred c----cHHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_021324. 296 A----ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ---- 367 (480) Q Consensus 296 ~----~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~---- 367 (480) . .+......-..++..+..++|+.+-.=| .+..+-.+.|-..+-+.+..++..++.......++++++... T Consensus 381 ~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rg-a~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~ 459 (663) T protein:vir:34 381 EPVVAALTSLRDYRRELVDALHQVTGMADIMRG-ASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAE 459 (663) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhc-ccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2333333334555666666776654333 333334566777777888888888888888888888776432 Q ss_pred ---------HhCCCCc----------------ccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHH--HHHhCC Q lcl|NC_021324. 368 ---------IMGREVT----------------EEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQ--ARIDLG 420 (480) Q Consensus 368 ---------~~~~~~~----------------~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~--~~~~~~ 420 (480) +.+...+ .....++|.=..+..++..+.-...+++.++... +.... ++.+.| T Consensus 460 ~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~-~~qq~~pl~~q~p 538 (663) T protein:vir:34 460 HYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIAS-FMQGVAPLAQQVP 538 (663) T ss_pred hcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHH-HHHHHHHHHHhhh Confidence 2232222 1123345544455556665554555555443222 12222 233444 Q ss_pred CChhHHHHHHHHH------HHHHHHHHHHHhhhcccCC-CcCC---CCCCCCCCccCC-----CCCCCC-Ccc---cCC Q lcl|NC_021324. 421 YTATQREQMRDWD------KQETEDMIDTLYSTTKAQA-DATP---KPTVTETKTETQ-----TSPSGF-NRT---KTR 480 (480) Q Consensus 421 ~~~d~~~~~~~~~------~~~~~~~~~~~~~~~~~~~-~~~~---~~~~~d~~~~~~-----~~~~~~-~~~---~~~ 480 (480) -...-..+|.+.- ..+.+..++++.......+ .+++ .++..+.+-+.+ .+-..+ ..+ ++| T Consensus 539 ~~~p~l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~~~pa~~~~~~k~~~~q~k~q~~~aeAq~e~q~~~~~ 617 (663) T protein:vir:34 539 GSAPFLLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQQSPAPQQPDPKVVAQAMKGQQEMAKVQAEVQGDLLR 617 (663) T ss_pred hhHHHHHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCCCCcccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333333332221 1122233333333222111 0011 010000100000 000000 000 000 No 278 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=73.18 E-value=0.17 Score=24.75 Aligned_cols=289 Identities=12% Similarity=0.077 Sum_probs=101.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---CcchhcCcccchhhhcc-eeccchHHHHHHHHHhh----hccCcc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGT---RRLKTIGIGAPPELAYL-DVQPGWVATYLRTLSDR----LDIEGF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~---~~i~~~~~~~~~~~~~~-~~~~n~~~~iVd~~~~~----l~~~~~ 72 (480) ..++..++.. ..-++.++-++.|+ +++...+ +.+. +. ..+..-++....+. ..++++ T Consensus 25 ~~~p~~~~~~--------~~~~~~~~~~~~~~~~~pp~~~~~------la~l~~a-~~~h~s~i~~k~n~l~~~~~Pn~~ 89 (340) T protein:vir:98 25 FGEPVPVLDK--------RDILDYVECISNGKWYEPPVSFSG------LAKSLRS-AVHHSSPIYVKRNVLASTYIPHPL 89 (340) T ss_pred cCCceeecCc--------chhhhhhhhhhcCceecCCCCHHH------HHHHHHh-ccccchhhhhhhhHHhhccCCCCC Confidence 2222221110 00011112223332 1221111 1110 00 00111111111111 122221 Q ss_pred ccCCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceE Q lcl|NC_021324. 73 RISEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVT 151 (480) Q Consensus 73 ~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~ 151 (480) .. ...+..++.+.+.+|.||+.+-.+. .|++ .+..++|..+-...+.. T Consensus 90 lt-------------------~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L~pl~~~~vr~~~~~~------ 138 (340) T protein:vir:98 90 LS-------------------RQDFSRFALDYLVFGNAFLEQRHSV------TGQLIKLLTSPAKYTRRGVDDS------ 138 (340) T ss_pred CC-------------------HHHHHHHHHHHHhcCCeEEEEEECC------CCcEEEEEEeCCceEEEcccCc------ Confidence 10 1123456678888999999886532 3443 44455554332211110 Q ss_pred EEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHH Q lcl|NC_021324. 152 RAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKV 231 (480) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l 231 (480) ++|....++.. ..|.++ =|+||++-....+.+|.|.+...+ T Consensus 139 ---~~~~~~~~~~~----~~~~~~-----------------------------eViHir~~~~~~~~~Gls~~~~a~--- 179 (340) T protein:vir:98 139 ---VFWFVENFTQP----HEFAPD-----------------------------TVFHLLEPDINQEIYGLPEYLSAL--- 179 (340) T ss_pred ---EEEEEecCCeE----EEEccc-----------------------------cEEEEcCCCCCCCcccccHHHHHH--- Confidence 01110001000 001111 146665432335678888764433 Q ss_pred HHHHHHHHHHHHHHHHHhc---chh--hhhcCCCccccccccccchhhhhc-----chheec-CC---CCceEEEecccc Q lcl|NC_021324. 232 TDAASRTLMNLQSASQILG---TPL--RVISGVTTDELTNDGENTTLDIYY-----GRILTL-AS---EAAKISEFKAAE 297 (480) Q Consensus 232 ~D~~~~~~s~~~~~~~~~~---~p~--l~i~G~~~~~~~~~~~~~~~~~~~-----~~~~~~-~g---~~~~~~~~~~~~ 297 (480) ..+....+-..-...+|. .|- ++++|....+...+.-...|+-.. ++++.+ ++ ++.++..+.... T Consensus 180 -~si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~~g~~~~pls~~~ 258 (340) T protein:vir:98 180 -NSAWLNESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRDAMRNSKGLGNFKNLFFYSPNGKPDGIKIVPLSEVA 258 (340) T ss_pred -HHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCCh Confidence 333222222222223333 233 334443333222111111222111 123332 22 234566665332 Q ss_pred -HHHHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHhCCCC Q lcl|NC_021324. 298 -LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIA---MQIMGREV 373 (480) Q Consensus 298 -~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li---~~~~~~~~ 373 (480) -..|++..+....+|+..-++|+.-+|....+.++...+....... +...|.-++..+ ...++... T Consensus 259 ~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f----------~~~~l~Pl~~~iee~n~~L~~e~ 328 (340) T protein:vir:98 259 TKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVF----------VRNELSPLQDRFREVNDWLGMEV 328 (340) T ss_pred hHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHH----------HHHHHHHHHHHHHHHHhcccccc Confidence 2247888788889999999999998885322211111111111111 111111111111 11122110 Q ss_pred cccceeeeEEecCCCCCCHHHHHH Q lcl|NC_021324. 374 TEEYTRLETVWRDPSTPTVAAKAD 397 (480) Q Consensus 374 ~~~~~~~~v~f~~~~~~~~~~~ad 397 (480) +.|.+-...+ +| T Consensus 329 --------~rF~~~~l~~----~d 340 (340) T protein:vir:98 329 --------IRFKEYTLDN----PE 340 (340) T ss_pred --------cccCcccccc----CC Confidence 3342211110 11 No 279 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=70.52 E-value=0.21 Score=24.32 Aligned_cols=299 Identities=10% Similarity=0.056 Sum_probs=105.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhc----cCc--ccc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLD----IEG--FRI 74 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~----~~~--~~~ 74 (480) |.+-.. ... +..... .+ ..-.+..|.-.+--...+|+- .+| |.- T Consensus 1 ~~~~~~------------------~~~-----~~~~~~----~~---~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~~ep 50 (345) T protein:vir:37 1 MKTNVK------------------TDN-----KKGIVI----AP---INDRTFSLNEISASPALDYVGIGFDENYNCYLP 50 (345) T ss_pred CCCCcc------------------ccc-----hhhccc----Cc---ceeEEeecCCcccccchhhhhhhhcCCccccCC Confidence 110000 000 000000 00 000001111000001112210 000 000 Q ss_pred CCCchhHHHHHH---------------------HHHhcC-H-HHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EE Q lcl|NC_021324. 75 SEDSEGLEELWN---------------------WWQAND-L-DEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LI 130 (480) Q Consensus 75 ~~d~~~~~~l~~---------------------~~~~n~-~-~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~ 130 (480) +-+ ...|.+ .+.-|. + ...+.+++.+.+.+|.||+.+-.+. .|++ .+ T Consensus 51 p~~---~~~la~l~~~~~~h~~~i~~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~------~G~~~~L 121 (345) T protein:vir:37 51 PVN---RHALAKLPHQNAQHGGILHSRANMVSSLYEGGKALSRMDMRALCLNLIQFGDVGLLKVRNG------FGQVVRL 121 (345) T ss_pred CCC---HHHHHHHhhcccccccceeeechHHHhhccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcC------CCcEEEE Confidence 000 111111 111121 1 1235567788999999999876543 3443 56 Q ss_pred EEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEee Q lcl|NC_021324. 131 RVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLT 210 (480) Q Consensus 131 ~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~ 210 (480) ..++|..+.+..|... .+.++.+.. ...+.. ..|.++ -|+|++ T Consensus 122 ~pl~~~~vr~~~d~~~----~~~~~~~~~-~~~g~~---~~~~~~-----------------------------dVihir 164 (345) T protein:vir:37 122 VPLSSLYLRVRKDGGY----SYLMKKSLY-DTAQEI---YRYDAK-----------------------------DIIFIK 164 (345) T ss_pred EEEcCceeEEEEeCCe----eEEEEEeEe-cCCceE---EEEccc-----------------------------cEEEec Confidence 6667766554432211 111111110 000000 001111 145665 Q ss_pred cccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhc---chhhhh--cCCCccccccccccchhhhhc-----chh Q lcl|NC_021324. 211 NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILG---TPLRVI--SGVTTDELTNDGENTTLDIYY-----GRI 280 (480) Q Consensus 211 n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~---~p~l~i--~G~~~~~~~~~~~~~~~~~~~-----~~~ 280 (480) +.....+.+|.|.+... +..+....+-..-...+|. .|-.+| +|....++..+.-...|+... +.+ T Consensus 165 ~~~~~~~~~Gls~~~~a----~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~l~~e~~~~lk~~~~~~~g~~n~~~~ 240 (345) T protein:vir:37 165 LYDPMQQVYGSPDYVGG----IQSALLNSDATVFRRRYFSNGAHMGFILYSTDPDLTEEMEEEIARKISESKGVGNFRSM 240 (345) T ss_pred CCCCCCCcccccHHHHH----HHHHHHHHHHHHHHHHHHhccCCcceEEEecCCCCCHHHHHHHHHHHHHhcCcccccce Confidence 43334567888776432 3333322222222223333 343333 333222221111111122111 122 Q ss_pred eec-C---CCCceEEEecccc-HHHHHHHHHHHHHHHHhhcCCChHHhccccccchH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 281 LTL-A---SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSENPAS-AEAIIATDSRIVKMAERKGRIF 354 (480) Q Consensus 281 ~~~-~---g~~~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~S-g~Al~~~~~~l~~k~~~~~~~~ 354 (480) +.+ + .++.++..+.... -..|++..+....+|+.+-++|+..+|....+.++ |.+-+....-......=....+ T Consensus 241 ~i~~p~g~~~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l~P~~~~i 320 (345) T protein:vir:37 241 FVNIANGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQEII 320 (345) T ss_pred EEEcCCCcccceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHHHHHHHHH Confidence 222 2 2345666665432 22477777788899999999999998864322221 1111111111111111111222 Q ss_pred HHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHH Q lcl|NC_021324. 355 GGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAA 394 (480) Q Consensus 355 ~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~ 394 (480) ...+.++ .. .. ....+.|.+. ++++ T Consensus 321 e~~ln~~----~~-----~~---~~~~i~F~~~---~L~~ 345 (345) T protein:vir:37 321 AETINQD----PE-----IK---NLLKIKFREQ---NFAK 345 (345) T ss_pred HHHhhhh----cc-----CC---CcceEEecch---hhcC Confidence 2222211 00 11 1244667432 2222 No 280 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=66.96 E-value=0.26 Score=23.80 Aligned_cols=365 Identities=12% Similarity=0.065 Sum_probs=125.6 Q ss_pred HHHHHHHhhccC-cch--hcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCCchhHHHHHHHHHh--c-- Q lcl|NC_021324. 22 LLEAEAYRNGTR-RLK--TIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SEDSEGLEELWNWWQA--N-- 91 (480) Q Consensus 22 ~~~~~~YY~G~~-~i~--~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d~~~~~~l~~~~~~--n-- 91 (480) .-.++.+..++. .+. ...........+..+.+.-...+|+..++-+-.-++.+ ..+......+..++.. | T Consensus 1 Mgl~d~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~v~~~i~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~~~PN~~ 80 (395) T protein:vir:96 1 MGILDFFSFKKSGTLSDDDSGSTTSEKLTNVVLKEDALYKCVNYLARIISKSTFRIKAPEKLTENQKDWLYWINTKANPN 80 (395) T ss_pred CcchhhhcCCCCccccccccccchhhhcchhhhhhHHHHHHHHHHHHhhccceeEEEeCCccccccchHHHHHhhcCCCC Confidence 111111111111 000 00000000001111111222334444444333333432 1122223345555532 3 Q ss_pred -CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEecCCceEEEEE Q lcl|NC_021324. 92 -DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRDDVAVPDRAT 170 (480) Q Consensus 92 -~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (480) ....+...++.+.+.+|.||+++..+. ... +...+++.....+ .+++....+. ..+-. T Consensus 81 ~t~~~f~~~l~~~lll~Gna~~~~~~~~--------~~~-----~~~~~~~~~~~~~------~~~~~v~~~~--~~~~~ 139 (395) T protein:vir:96 81 QSASQFWVEVVQKLLVDGETLIFVIPGK--------GIY-----VADAFTQDKKLSG------NKFKVSRVQG--QTYEK 139 (395) T ss_pred CCHHHHHHHHHHHHhhcCceEEEEEcCC--------cee-----cCCcccccccccc------ceeeeeeecc--ceeee Confidence 234567778889999999998875431 110 1111111000000 0000000000 00111 Q ss_pred EEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccch--HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 171 LYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEIS--PELRKVTDAASRTLMNLQSASQI 248 (480) Q Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~--~~v~~l~D~~~~~~s~~~~~~~~ 248 (480) .+.++.+ +||+++......++.+-+. ..++.+.-+....-+...-.... T Consensus 140 ~~~~~dv-----------------------------ih~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 190 (395) T protein:vir:96 140 IFTFDQV-----------------------------IYLKNDNSDLMLKVESLWEEYGELLGHVINNQKIANQIRFTMTP 190 (395) T ss_pred EeccCce-----------------------------EEecccCCccccccccccchHHHHHHHHHHHHHHHHHHHHHhhh Confidence 1222333 3443322222222222111 11222221111111111111111 Q ss_pred hcchhhhhcCC---Cccccccccccchh-------hhhcchheecCCCCceEEEeccccH-------HHHHHHHHHHHHH Q lcl|NC_021324. 249 LGTPLRVISGV---TTDELTNDGENTTL-------DIYYGRILTLASEAAKISEFKAAEL-------RNFAEEMEVFRKE 311 (480) Q Consensus 249 ~~~p~l~i~G~---~~~~~~~~~~~~~~-------~~~~~~~~~~~g~~~~~~~~~~~~~-------~~~~~~l~~~~~~ 311 (480) +... ....|. ..+... +.....+ +...+.++.++ ++.++.+++.... ..+.+.....+.+ T Consensus 191 ~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~~l~-~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~e 267 (395) T protein:vir:96 191 PKDK-VRERAQENSDGGRQP-KSDKDFFKRTIEKIRTESVVGIPVT-ANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAE 267 (395) T ss_pred cccc-cccceeeccCchhhH-HHHHHHHHHHHHHhhcCCcceEEcc-CCceeEecccChhhhhhhhHHHHHHHHHHHHHH Confidence 1111 111111 111000 0000011 11222344454 3456666553321 1233444556789 Q ss_pred HHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCC Q lcl|NC_021324. 312 AASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPT 391 (480) Q Consensus 312 i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~ 391 (480) |+.+-++|++.++++..| .+...+.+....|.-.+.. |+..+. ..+....... ....+.|......+ T Consensus 268 Ia~~fgVPp~~l~~~~sn-~e~~~~~f~~~~L~P~~~~--------ie~~l~--~~Ll~~~e~~--~~~~f~~~~l~~~d 334 (395) T protein:vir:96 268 FAEMLGIPISLLHGDIAD-NQKNYELLLEGPIESLITN--------IVDGLE--YAIFDKSETL--EGSFIKVTGLKNYD 334 (395) T ss_pred HHHHhCCCHHHhcCCCcc-HHHHHHHHHHHHHHHHHHH--------HHHHHH--hhcCChhhhc--CceeEeecchhccC Confidence 999999999999764322 1221112111112211111 111111 0111111111 12345666777788 Q ss_pred HHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCC Q lcl|NC_021324. 392 VAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTETQT 469 (480) Q Consensus 392 ~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 469 (480) ..+.++.+.+++..| +++...+++.+|+.+-+-... +.+.. +.+..+..+ .++|.+.+.+. T Consensus 335 ~~~~~~~~~~~~~~G--~~T~NE~R~~~gl~pi~~~~g------------D~~~~--~~N~~~~~~-~gge~~~~~~~ 395 (395) T protein:vir:96 335 LFSISSQADKLISSG--FVFIDEVREEIGLPELPDGLG------------KVLYM--TKNYESVLE-RGGEVDEEVET 395 (395) T ss_pred HHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCCCCCC------------ceeee--cccceechh-ccCCCCCCCCC Confidence 999999999988876 566666777776643210000 00000 000000000 11111111111 No 281 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=63.97 E-value=0.31 Score=23.39 Aligned_cols=305 Identities=12% Similarity=0.089 Sum_probs=104.5 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccCcchhcCcc-----cchhhhcceeccchHHHHHHHHHhhhccCccccC Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG-----APPELAYLDVQPGWVATYLRTLSDRLDIEGFRIS 75 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~~~~~~~YY~G~~~i~~~~~~-----~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~ 75 (480) -.++..++..- .+ ......-...+||+ +++...+.. .+..-+-+...+|+... . ..+ T Consensus 41 fg~p~~~~~~~--~~-~~~~~~~~~~~~~~--~pi~~~~la~~~~~~~~h~~~~~~~~n~l~l---------~----~~P 102 (368) T protein:vir:79 41 FGDPVEVLDRR--EL-LDYVECMRMGQWYE--PPMPWDGLARSFRAAAHHSSAVYVKRNILVS---------T----FIP 102 (368) T ss_pred cCCceeecchh--hH-HHHHHHHhccchhc--cCcCHHHHHHHHhhccccchhhhhhcchhhh---------h----cCC Confidence 12222222110 00 00000000112332 222211100 00000001111222210 0 111 Q ss_pred CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEE Q lcl|NC_021324. 76 EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAV 154 (480) Q Consensus 76 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~ 154 (480) +..-.. ..+.+++.+.+.+|.||+.+..+ ..|++ .+..++|..+...-+. + T Consensus 103 n~~~t~-------------~~f~~l~~d~ll~Gnay~~~~r~------~~G~~~~L~~l~~~~v~~~~~~--~------- 154 (368) T protein:vir:79 103 HPLLSR-------------ATFERLVLDWQVFGNAYLERREN------VLGGTIRLDTPLAKYVRRGLDL--N------- 154 (368) T ss_pred CcCCCH-------------HHHHHHHHHHhhcCCeEEEEEEc------CCCCEEEEEEeCcccceeeccC--C------- Confidence 111000 11234667889999999987653 24554 4555555544321111 0 Q ss_pred EEEEEecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHH Q lcl|NC_021324. 155 RLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDA 234 (480) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~ 234 (480) ++|....+ +..+.|.. + -|+|+++-....+.+|.|.|.- .... T Consensus 155 ~~~~~~~~------------~~~~~~~~-------------------~--dIihir~~~~~~~~yGlsp~~~----a~~s 197 (368) T protein:vir:79 155 TYFFVQNW------------QQPYTFAA-------------------G--SVFHLQEPDINQEVYGLPEYLS----ALNA 197 (368) T ss_pred EEEEEecC------------CeEEEEcc-------------------c--cEEEecCCCCCCCcccccHHHH----HHHH Confidence 00100000 11111110 1 1466765433456789887643 3333 Q ss_pred HHHHHHHHHHHHHHh---cchhhhh--cCCCccccccccccchhhhh-----cchheecC----CCCceEEEecccc-HH Q lcl|NC_021324. 235 ASRTLMNLQSASQIL---GTPLRVI--SGVTTDELTNDGENTTLDIY-----YGRILTLA----SEAAKISEFKAAE-LR 299 (480) Q Consensus 235 ~~~~~s~~~~~~~~~---~~p~l~i--~G~~~~~~~~~~~~~~~~~~-----~~~~~~~~----g~~~~~~~~~~~~-~~ 299 (480) +....+-..-...+| +.|-.+| +|....++..+.-...|+-. .++++.+. .++.++..+.... -. T Consensus 198 i~l~~aa~~~~~~~~~NGa~~~gil~~~~~~l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~~d~ 277 (368) T protein:vir:79 198 TWLNESATLFRRRYYKNGSHAGFILYMTDAAQKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVAAKD 277 (368) T ss_pred HHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCHHHH Confidence 333222222222333 3344333 34332222111111112211 12333332 2345676665432 22 Q ss_pred HHHHHHHHHHHHHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccee Q lcl|NC_021324. 300 NFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTR 379 (480) Q Consensus 300 ~~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 379 (480) .|++..+....+|+..-++|+.-+|....+.+++..++-....+ +...|.-++..+..+... ...+ T Consensus 278 qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~f----------~~~~l~Pl~~~ie~ln~~-l~~e--- 343 (368) T protein:vir:79 278 EFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMVF----------ARNEVKPLQDRLLAINDW-IGDE--- 343 (368) T ss_pred HHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHH----------HHHHHHHHHHHHHHHHhc-cCcc--- Confidence 47788788889999999999998875332221111111111111 111111111111111110 0001 Q ss_pred eeEEecCCC--CCCHHHHHHHHHHHHhccccCCCH Q lcl|NC_021324. 380 LETVWRDPS--TPTVAAKADAVSKLYANGQGPIPK 412 (480) Q Consensus 380 ~~v~f~~~~--~~~~~~~ad~~~kl~~~~~~~~s~ 412 (480) .+.|.+.. ..+.+..|+. +.=|. T Consensus 344 -~~rF~~~~l~~~D~~a~a~~---------~~rsa 368 (368) T protein:vir:79 344 -VVRFAPYALGGHDQPAAAPG---------GQRSA 368 (368) T ss_pred -eeeechhHhhcccccccCCc---------ccccC Confidence 13452211 1111111110 00011 No 282 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=61.15 E-value=0.36 Score=23.02 Aligned_cols=297 Identities=10% Similarity=-0.002 Sum_probs=105.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHH-HHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCccccCCCch Q lcl|NC_021324. 1 MTTYHEHVERLQGLLARDLPN-LLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRISEDSE 79 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~~~~~~-~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~~~d~~ 79 (480) ..+++.++..- .+.+...- +....+|| .+++... .+.+.--...+...++....+.+.. .|... T Consensus 22 ~~~p~~~~~~~--~~~~~~~~~~~~~~~~~--~pP~~~~------~La~l~~~~~~h~~~L~~k~N~~~~-~f~~~---- 86 (337) T protein:vir:78 22 FSMPEAIDPTA--WMTDYTGVFYNPYGEYY--QPPIDRK------GLAKVARANAHHGAILMARRNMVAG-RFTNQ---- 86 (337) T ss_pred ecCcccccCcc--hhHhhhhhhhccCccee--cCCCCHH------HHHHHhhcchhhhhHHHhhhccccc-cCcCc---- Confidence 22222222100 00000000 00011222 1222111 0111100111222222222221110 11110 Q ss_pred hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCcee-EEEEEcccceEEEEeCCCCCceEEEEEEEE Q lcl|NC_021324. 80 GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIP-LIRVESPLYMYAELDPRNTRRVTRAVRLYT 158 (480) Q Consensus 80 ~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~-~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~ 158 (480) ...+..++.+.+.+|.||+.+-.+. .|++ .+..++|..+-..-| T Consensus 87 --------------~~~~~~~~~d~ll~GNay~~~~rn~------~G~~~~L~pl~~~~v~~~~d--------------- 131 (337) T protein:vir:78 87 --------------RATITAFVHNYLQFGDGGLLKLRNS------FGQVVGLHPLSSVYLRRRED--------------- 131 (337) T ss_pred --------------HHHHHHHHHHHHhhCCeEEEEEECC------CCcEEEEEEeCCceeEeeeC--------------- Confidence 1245566778889999999876532 4444 344454443321110 Q ss_pred EecCCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHH Q lcl|NC_021324. 159 TRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRT 238 (480) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~ 238 (480) +...++ ...+..+. |..=-|+|+++-....+.+|.|.+. ..+.++... T Consensus 132 -----~~~~~~--~~~~~~~~---------------------~~~~eIiHik~~~~~~~~~Gls~~~----~a~~si~l~ 179 (337) T protein:vir:78 132 -----GCFVYL--QQGKPNLI---------------------YRPDDVIWLAQYDPEQQVYGMPDYL----GGLQSALLN 179 (337) T ss_pred -----CeEEEE--EcCCceEE---------------------ECCccEEEECCCCCCCCcccccHHH----HHHHHHHHH Confidence 100000 00001011 1111246676543345678888654 333444433 Q ss_pred HHHHHHHHHHhc---chhhh--hcCCCccccccccccchhhhhc-----chheec-CC---CCceEEEecccc-HHHHHH Q lcl|NC_021324. 239 LMNLQSASQILG---TPLRV--ISGVTTDELTNDGENTTLDIYY-----GRILTL-AS---EAAKISEFKAAE-LRNFAE 303 (480) Q Consensus 239 ~s~~~~~~~~~~---~p~l~--i~G~~~~~~~~~~~~~~~~~~~-----~~~~~~-~g---~~~~~~~~~~~~-~~~~~~ 303 (480) .+-..-...+|. .|-.+ ++|....+...+.-...++... +.++.+ ++ ++.++..+.... -..|++ T Consensus 180 ~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n~~~~~v~~~~g~~~Gi~~~pis~~~~d~qfle 259 (337) T protein:vir:78 180 QDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMIANSKGVGNFRSMFVNIPDGKPDGIKLIPVGDIATKDEFAA 259 (337) T ss_pred HHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHhcCcccccceEEEcCCCCccceeEEEcCCChhHHHHHH Confidence 333333333443 33333 3443332221111111122111 122222 22 234666665432 224777 Q ss_pred HHHHHHHHHHhhcCCChHHhccccccchHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeee Q lcl|NC_021324. 304 EMEVFRKEAASITGLPPQYLSSSSENPASA--EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLE 381 (480) Q Consensus 304 ~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg--~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 381 (480) .-+....+|+..-++|+..+|....+..+| .+-+....-......=..+.+..++. ... ... ... T Consensus 260 ~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~~~ie~~~n--------~~l--l~~---~~~ 326 (337) T protein:vir:78 260 IKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATYARNEVLPLCELVQDAIN--------SAG--LPR---ALW 326 (337) T ss_pred HHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHHHHHHHHHHHHHHHHHHh--------hhc--CCh---hhc Confidence 777888899999999999988543332222 11111111111111111111111111 111 110 011 Q ss_pred EEecCCCCCCH Q lcl|NC_021324. 382 TVWRDPSTPTV 392 (480) Q Consensus 382 v~f~~~~~~~~ 392 (480) +.|..+...-+ T Consensus 327 ~~f~~~~~~~~ 337 (337) T protein:vir:78 327 VTFRETIGAAV 337 (337) T ss_pred eeccccccccC Confidence 33433322211 No 283 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=54.91 E-value=0.49 Score=22.27 Aligned_cols=414 Identities=9% Similarity=-0.011 Sum_probs=113.8 Q ss_pred CCCHHHHH-HHHHHHHHHH-H-HHHHHHHHHhhccCcchhcCcccc-----hhhhcceeccchHHHHHHHHHhhhc--cC Q lcl|NC_021324. 1 MTTYHEHV-ERLQGLLARD-L-PNLLEAEAYRNGTRRLKTIGIGAP-----PELAYLDVQPGWVATYLRTLSDRLD--IE 70 (480) Q Consensus 1 ~~~~~~~i-~~l~~~~~~~-~-~~~~~~~~YY~G~~~i~~~~~~~~-----~~~~~~~~~~n~~~~iVd~~~~~l~--~~ 70 (480) |.+..+.+ ...+..+... . .++.++..+++- +.+.... +.........| .+++..+..+++ .. T Consensus 23 ~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~y-----Y~g~~~~i~~~~~~~~~~~~~~~--~ki~~n~~~~ivd~~~ 95 (481) T protein:vir:10 23 VSDLAELLKEENLRNFISRHQTEQVPRLEMLESY-----YLNRNTDILAGERRLQKYGDKAD--HRAVHNYAKYVSRFIV 95 (481) T ss_pred eecchhhcCHHHHHHHHHHHHHHHHHHHHHHHHH-----hcCCCcccccCcccccccccccc--ceeecchHHHHHHHHH Confidence 66655543 3345554333 3 234555555543 2222111 11111111111 234444444332 23 Q ss_pred ccccCCCch---hHHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEE---EEeC Q lcl|NC_021324. 71 GFRISEDSE---GLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYA---ELDP 144 (480) Q Consensus 71 ~~~~~~d~~---~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~---v~d~ 144 (480) +|..+.... ..+..++.+..-.-+..+......+....-.| |...+.+.......| ++++ T Consensus 96 ~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~--------------G~~~~~~~~d~dg~~~i~~~~p 161 (481) T protein:vir:10 96 GYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIY--------------GRAYEIVYRDFEDRDTFKVLDP 161 (481) T ss_pred hhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhc--------------CeEEEEEEeCCCCeEEEEEEcc Confidence 555544432 22334443332111111222233333333221 222222222222211 2222 Q ss_pred CCCCceEEEEEEEEEecCCceEEEEEEEeC---C--cEEEEEecCCCccccccccccccccccccceEEeeccc--ccCc Q lcl|NC_021324. 145 RNTRRVTRAVRLYTTRDDVAVPDRATLYLP---D--ETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDP--RLGN 217 (480) Q Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~--~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~--~~~~ 217 (480) .. .+..|...........+.+|.. + .+.++..-+......+ ......|..+ -+.+|.- -+-- T Consensus 162 ~~------~~~v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~---~~~~~~~~~~--~~~~~~~g~vPvv 230 (481) T protein:vir:10 162 KS------TFVVYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYI---EIKGGTYHRV--EEVEHYYNDVPII 230 (481) T ss_pred cc------eEEEEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEE---EecCCceeec--ccccccCCceeEE Confidence 21 1111221111122222333321 1 1122111111110000 1011111111 0111110 0000 Q ss_pred cCCCccc-hHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCc--cccccccccchhhhhcchheecCCCCceEEEec Q lcl|NC_021324. 218 RYGRSEI-SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTT--DELTNDGENTTLDIYYGRILTLASEAAKISEFK 294 (480) Q Consensus 218 ~~G~s~i-~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~--~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 294 (480) .|-.... ..++..+++-++....-+++.. +.+-.....-. ................+..+.+.........-. T Consensus 231 ~~~n~~~g~~~~~~v~~lida~~~~~s~~~----~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (481) T protein:vir:10 231 EYLNDQFKQGDFENVIALIDLYDSAQSDTA----NYMTDLNDAMLAIIGNVDLDSEDAKAFRDANMIHLEPGTNANGSEG 306 (481) T ss_pred EeecCCCCCCchhhHHHHHHHHHHHHHHHH----HHHHHhcCceeEeecCcCCCccchhhhhhccceeccccccccCCCC Confidence 1111111 1335555555555444333322 22222211000 000000111111111222222211110000001 Q ss_pred cc-------cH--HHHHHHHHHHHHHHHhhcCCChHHhcccc---ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 295 AA-------EL--RNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) Q Consensus 295 ~~-------~~--~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 362 (480) .+ .. +.+.++++.+...| ..++... ....+|.+=...+.........+.......+...+ T Consensus 307 ~~~~~~l~~~~~~~~~~~~~~~l~~~i--------~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l 378 (481) T protein:vir:10 307 KAEVKYVYKQYDVAGVEAYKKRLQNDI--------HKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGL 378 (481) T ss_pred CcceeEEeecCCHHHHHHHHHHHHHHH--------HHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 12 12334444443332 2333221 12223332222333444455555555566666666 Q ss_pred HHHHHHhCCC-CcccceeeeE-EecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHH-----HHH Q lcl|NC_021324. 363 RIAMQIMGRE-VTEEYTRLET-VWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDW-----DKQ 435 (480) Q Consensus 363 ~li~~~~~~~-~~~~~~~~~v-~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~-----~~~ 435 (480) +-++++.-.- ........+. ...-...++.+....+. +++ +....|.+..+. .++.. -++ T Consensus 379 ~~~~~li~~~~~~~~~~~~~~~~i~v~f~~~~~~~~~~~---a~~---------~~kl~g~is~et-~~~~l~~i~d~~~ 445 (481) T protein:vir:10 379 MKRYKLLLNNVNLTGLKQHNYAELTITFTPNLPKSMMES---INA---------FNALSGGVSEST-RLSLLDFIDNPKE 445 (481) T ss_pred HHHHHHHHHHHhccCCCccccceeeEEeCCCCCcCHHHH---HHH---------HHHHhccCChHH-HHHhCCCCCCHHH Confidence 6555543221 1111111111 12222334433222111 111 112235443321 11111 112 Q ss_pred HHHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 436 ETEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) +.+........... ..+...-+... ...+++.+++| T Consensus 446 E~~ri~~E~~~~~~-~~~~~~~~~~~----~~~~~~dd~~g 481 (481) T protein:vir:10 446 ELEKMQEEEAQREK-QADKRGYGEAF----ENHLNVDDSNG 481 (481) T ss_pred HHHHHHHHHHHHHh-hhhhccCCccC----CCCCCCCCCCC Confidence 22222111111111 11111111111 11112222222 No 284 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=34.55 E-value=1.3 Score=19.99 Aligned_cols=338 Identities=11% Similarity=0.052 Sum_probs=117.4 Q ss_pred HHHHHHHHHHhhccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccCcccc---CCC--------chhHHHHHHH Q lcl|NC_021324. 19 LPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIEGFRI---SED--------SEGLEELWNW 87 (480) Q Consensus 19 ~~~~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~~~~~---~~d--------~~~~~~l~~~ 87 (480) +.-+.++..++.++-... .+......-.+..........+|+..++-+-.-++.+ ..+ +.....+..+ T Consensus 1 M~~f~k~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~l 79 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNND-TQRVTAWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEV 79 (378) T ss_pred CchhhhhhhhhhcccccC-CcceeeeeccchhhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccccccccchHHHH Confidence 222222222332211000 0000000000000111222333443333222223211 110 1122445666 Q ss_pred HHh--c---CHHHHHHHHHHHHhhcCeeEEEE-ecCccccCCCCceeEEEEEcccceEEEEeCCCCCceEEEEEEEEEec Q lcl|NC_021324. 88 WQA--N---DLDEESVLGHDDSLTFGRAYITV-SHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRLYTTRD 161 (480) Q Consensus 88 ~~~--n---~~~~~~~~~~~~~~~~G~a~~~v-~~~~~~~~d~~g~~~~~~~~p~~~~~v~d~~~~~~~~~~~~~~~~~~ 161 (480) +.. | .-..+...+..+.+.+|.||++. .. +..|.+... +| . T Consensus 80 L~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~------~~~g~~~~~------------------------~~---~ 126 (378) T protein:vir:85 80 LNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFD------SETGELLDL------------------------LF---A 126 (378) T ss_pred HhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeec------CCCceEEEE------------------------Ee---c Confidence 542 3 23355666778888999999752 21 112222110 00 0 Q ss_pred CCceEEEEEEEeCCcEEEEEecCCCccccccccccccccccccceEEeecccccCccCCCccchHHHHHHHHHHHHHHHH Q lcl|NC_021324. 162 DVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEISPELRKVTDAASRTLMN 241 (480) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~ 241 (480) +.+ ..|.++.+++++ +. +....+...+..+.++++..+ T Consensus 127 ~~~-----~~~~~~dvih~~-----------------------------~~------~~~~~~~~~~~~a~~~~~~~~-- 164 (378) T protein:vir:85 127 NDK-----KEYKPEELVRLV-----------------------------SP------FYINEDTSILDNALASIQTKL-- 164 (378) T ss_pred CCC-----EEEcccceEEEe-----------------------------cC------cCccchhhHHHHHHHHHHHHH-- Confidence 111 012222322222 10 000000011222223222211 Q ss_pred HHHHHHHhcchhhhh--cCCCccccccccccchhh---------hhcchheecCCCCceEEEeccccHHHHHHHHHHHHH Q lcl|NC_021324. 242 LQSASQILGTPLRVI--SGVTTDELTNDGENTTLD---------IYYGRILTLASEAAKISEFKAAELRNFAEEMEVFRK 310 (480) Q Consensus 242 ~~~~~~~~~~p~l~i--~G~~~~~~~~~~~~~~~~---------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~ 310 (480) . .+.+--++ .+. ......+.....++ ...+.++.+++ +.++.+++......-.+.++.... T Consensus 165 -~-----~~~~~g~l~~~~~-l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~~l~~~~~~~~~~~~~~~~~ 236 (378) T protein:vir:85 165 -E-----QGKLRGLLKINAF-LDIDNTQEYREKALATIKNMQEGSSYNGLTPVDN-KTEIVELKKDYSVLNKDEIELIKS 236 (378) T ss_pred -h-----cCCcceEEEeCCc-CCHHHHHHHHHHHHHHHHHhhcccccccceecCC-CceEEeccCChhhhhHHHHHHHHH Confidence 1 11221122 121 11111111111111 01234556654 468877754321112345677778 Q ss_pred HHHhhcCCChHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HhCCC-----C-ccccee Q lcl|NC_021324. 311 EAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQ-----IMGRE-----V-TEEYTR 379 (480) Q Consensus 311 ~i~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~-----~~~~~-----~-~~~~~~ 379 (480) +|+..-++|+..++++. +....+. .+...|.-++..+.. +.... . ...... T Consensus 237 ~Ia~~fgVPp~~l~~s~---~e~~~~~---------------f~~~tL~P~~~~ie~~l~~kLl~~~er~~~~~~~~~~~ 298 (378) T protein:vir:85 237 ELLTGYFMNENILLGTA---TQEQQIY---------------FYNSTIIPLLIQLEKELTYKLISTNRRRVVKGNLYYER 298 (378) T ss_pred HHHHHhCCCHHHhcCCc---hHHHHHH---------------HHHHHHHHHHHHHHHHHHhhcCChhhhhhhhhccccce Confidence 99999999998886432 1111111 222222222222111 11100 0 001112 Q ss_pred eeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHhhhcccCCCcCCCCC Q lcl|NC_021324. 380 LETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT 459 (480) Q Consensus 380 ~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~e~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 459 (480) +.+.+......+..+.++.+.+++..| +++.-.+++++|+.+-+ .-. .+.... +..+..... T Consensus 299 ~~f~~~~l~~~d~~~~~~~~~~~~~~G--~~T~NE~R~~lgl~p~~--gGD------------~~~~~~--N~~~~~~~~ 360 (378) T protein:vir:85 299 IIVDNQLFKFATLKELIDLYHENINGP--IFTQNQLLVKMGEQPIE--GGD------------IYIANL--NAVAVKNLS 360 (378) T ss_pred eeecchhhhhcCHHHHHHHHHHHHhCC--CcCHHHHHHHhCCCCCC--CCC------------eEeecc--cccccccch Confidence 333344455678888999999988876 56666677777764321 000 000000 000000000 Q ss_pred CCC-CCccCCCCCCCCCc Q lcl|NC_021324. 460 VTE-TKTETQTSPSGFNR 476 (480) Q Consensus 460 ~~d-~~~~~~~~~~~~~~ 476 (480) ... .+.+..+..++.|. T Consensus 361 ~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 361 DLQGSRKDVASTDETNNQ 378 (378) T ss_pred hhcCccCCCCCCCCCCCC Confidence 000 00011111112222 No 285 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=32.05 E-value=1.5 Score=19.69 Aligned_cols=433 Identities=12% Similarity=0.064 Sum_probs=141.0 Q ss_pred CCCHHHHHHHHHHHHH-H-HHHHHHHHHHHhh-----ccCcchhcCcccchhhhcceeccchHHHHHHHHHhhhccC-cc Q lcl|NC_021324. 1 MTTYHEHVERLQGLLA-R-DLPNLLEAEAYRN-----GTRRLKTIGIGAPPELAYLDVQPGWVATYLRTLSDRLDIE-GF 72 (480) Q Consensus 1 ~~~~~~~i~~l~~~~~-~-~~~~~~~~~~YY~-----G~~~i~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~-~~ 72 (480) ||.--|.+-.=..... . +.| -.+++.|+. |...|... -++.+.+-+. =|..++-.++. +. T Consensus 63 ~t~~~D~~~~g~~~~~~~~~~p-r~R~qiY~~~eeM~~~p~Ia~A--------lniHVtaALg---gde~TGd~vfI~p~ 130 (569) T protein:vir:10 63 SGMAGDGLVDGSRFIFDEVQLP-EDRLQRYPLLEEMAVYSTIATA--------LNIHITHALS---FDKKTGQTFSIVPV 130 (569) T ss_pred cchhhhhHHHHHHHHhhhccCc-hhHHHHHHHHHHHhcCchhhhh--------hhhhhheeec---ccccccceEEEEee Confidence 5655554322222211 1 222 223333332 22222110 0111111100 01111111111 11 Q ss_pred c--cCCCchhHHHHHHHHHhc---CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCCceeEEEEEcccceEEEE-e-CC Q lcl|NC_021324. 73 R--ISEDSEGLEELWNWWQAN---DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAEL-D-PR 145 (480) Q Consensus 73 ~--~~~d~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~g~~~~~~~~p~~~~~v~-d-~~ 145 (480) . .+.+.+..+.+..-.+.+ =+......++..+..||++|..+|..+. .|.. ..+-.....|-+ - -. T Consensus 131 ~~~~~a~~daakai~~el~~dl~~~iNr~~~~lA~~~~aFGdsYaRiY~~~~-----~GV~--dl~~s~yt~PsfIqpFE 203 (569) T protein:vir:10 131 HNGNDSDYDAAQALCGELMNDIGRTINKEVAGWAFIMSVFGVAYVRPYAKEG-----IGIT--SFECSYYTLPSFIKEFE 203 (569) T ss_pred cCCCCCcchHHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhheeeeccCC-----ceeE--EEEecccccccccchhh Confidence 0 012223332322222211 2446678899999999999999996432 2222 222111111211 0 11 Q ss_pred CCCceEEEEEEEEEecCCceEEEEEEEeCCcEEEEEec-----CCCcc--ccccccccccccccccceEEeecccccCcc Q lcl|NC_021324. 146 NTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRN-----GGLND--QWVVDGDVIKHGLGVVPVVPLTNDPRLGNR 218 (480) Q Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~--~~~~~~~~~~~~~g~iPvv~~~n~~~~~~~ 218 (480) ......++.-.| ..+..++... -++....++..+ ..... .+....-.....-.+.|+. ... T Consensus 204 ~g~~tvGF~~~~-~~~~~~ti~~---l~p~qm~rmKmPrm~~i~q~~~v~~g~~~~~L~~d~~~~~Pi~--------psn 271 (569) T protein:vir:10 204 VSGNLAGFSGDY-LKDASGKMVF---ADPWAIIPMKIPYWRPKSNLMPVHTGHKAYSLLDNPEERTPIE--------TQN 271 (569) T ss_pred hcCceEEeeccc-CCccccceee---echhhhhhhcccceeeccccchhhhhhhheeeccccccccccc--------chh Confidence 233445554444 2233332222 222222221111 00000 0000000001111233332 233 Q ss_pred CCCccch---HHHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc--chhhhhc---------c-h---- Q lcl|NC_021324. 219 YGRSEIS---PELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGEN--TTLDIYY---------G-R---- 279 (480) Q Consensus 219 ~G~s~i~---~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~G~~~~~~~~~~~~--~~~~~~~---------~-~---- 279 (480) +|.|=|. +.-..|+-++..+.+...+..-..++--+-+.|+++..-.+-... ..+|... | . T Consensus 272 ~GgSFL~~ae~pf~~l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~ 351 (569) T protein:vir:10 272 YGTSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNMPTV 351 (569) T ss_pred hhhHHHHHHHhHHHHHHHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCcccccc Confidence 5555332 222334445555555555544444443333344433321111000 0111110 0 1 Q ss_pred ---heecCCCC-----ceEEEeccccHHHHHHHHHHHHHHHHhhcCCChHHhcccc---ccchHHHHHHHHHHHHHHHHH Q lcl|NC_021324. 280 ---ILTLASEA-----AKISEFKAAELRNFAEEMEVFRKEAASITGLPPQYLSSSS---ENPASAEAIIATDSRIVKMAE 348 (480) Q Consensus 280 ---~~~~~g~~-----~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~g~~~---~~~~Sg~Al~~~~~~l~~k~~ 348 (480) +++.=|+. .+.++.+.. +- -++.+=-.+++++..-|+....+|... +.-.-|-+++..-+. -.++. T Consensus 352 ~~H~LPv~gekq~~~tvDt~~~~A~-~~-gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frtSaQa-a~RS~ 428 (569) T protein:vir:10 352 TNTLLPIMGDGKGQMTIDTQTIQAD-IN-GIEDILTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQA-AMRAS 428 (569) T ss_pred ceeeeeeecCccccccccccccccC-cc-cHHHHHHHHHHHHhhhccchhHhhHHHHhcccccccHHHHHHHHH-HHHHH Confidence 11111111 112221111 11 122233334677787888777666421 111234344432221 12223 Q ss_pred HHHHHHHHHHHHHHHHHHHHh-CCCCcccceeeeEEecCCCC-------CCHHHHHHHHHHHHhc-------cccCCCHH Q lcl|NC_021324. 349 RKGRIFGGAWERAMRIAMQIM-GREVTEEYTRLETVWRDPST-------PTVAAKADAVSKLYAN-------GQGPIPKE 413 (480) Q Consensus 349 ~~~~~~~~~l~~~~~li~~~~-~~~~~~~~~~~~v~f~~~~~-------~~~~~~ad~~~kl~~~-------~~~~~s~e 413 (480) ..+......+.+++.+=+.++ |.....+....+|.|....+ .+..+.++.++-+++. ..+..+.+ T Consensus 429 ~iRqa~~e~in~iidiH~~fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~ 508 (569) T protein:vir:10 429 WIQQGVEEFIQRAIDIHLAFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDA 508 (569) T ss_pred HHHHHHHHHHHHHHHHHhhhhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHH Confidence 333334444455544434443 44455666789999965543 2223333332222221 11111222 Q ss_pred HHH----HhCCCChhHHHHHHHHHHHH---HHHHHHHHhhhcccCCCcCCCCCCCCCCccCCCCCCCCCc Q lcl|NC_021324. 414 QAR----IDLGYTATQREQMRDWDKQE---TEDMIDTLYSTTKAQADATPKPTVTETKTETQTSPSGFNR 476 (480) Q Consensus 414 ~~~----~~~~~~~d~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 476 (480) ... ..+++..+..+.+.+.-+++ ++-.+++.-...++ +-.+-. ...=++.+ +|. T Consensus 509 ~m~y~l~d~~~~De~~~e~l~ae~~akp~DEe~~~~~~~~~~~~---~~~~~~-----~~~~~~~~-~~~ 569 (569) T protein:vir:10 509 FKRYLFSDVLEIDEKISEALVNELKAKSEDDDHLMDSIIKTPPQ---ELAQIL-----ESVFKEGN-DND 569 (569) T ss_pred HHHHHHHHHhhcchhHHHHHHhhcCCCcchhHHHHHHHhcCChH---HHHHHH-----HHHhhccC-CCC Confidence 111 11233333222222221111 11122221111100 000000 00000000 000 No 286 >protein:vir:105154 Length: 525 # NCBI annotation: conserved phage-related protein # Family: family:all:6660 # MgeID: mge:1466 # MgeName: C-St # Cross-refs: genbank:acc:YP_398597;genbank:gi:80159853;genbank:GeneID:3772992 Probab=20.11 E-value=2.8 Score=18.11 Aligned_cols=442 Identities=14% Similarity=0.139 Sum_probs=138.2 Q ss_pred CCCHHH--H----HHHHHHHHHHHHHHHHHH-HHHhhccC-cchhcCcccchhhhcceecc----chHHHHHHHHHhhhc Q lcl|NC_021324. 1 MTTYHE--H----VERLQGLLARDLPNLLEA-EAYRNGTR-RLKTIGIGAPPELAYLDVQP----GWVATYLRTLSDRLD 68 (480) Q Consensus 1 ~~~~~~--~----i~~l~~~~~~~~~~~~~~-~~YY~G~~-~i~~~~~~~~~~~~~~~~~~----n~~~~iVd~~~~~l~ 68 (480) -||++. + ++.++.++.+....|+.. ..|-+|-- ++-+-++-..-+....+... -|...|++-..=|.. T Consensus 11 ~~t~~k~~~~~e~~~~~~n~~~~~y~ty~~~~~~f~~gfv~~~~~ng~i~~v~~~~l~~~f~npd~~~~~i~~l~~y~yi 90 (525) T protein:vir:10 11 STTIEKQSLQIEQLQEHINELERQYNTYDDVVDAFIDGFVMDLCNNGKIKTVNLDTLQLWFNNPDKYINNIVNLLTYYYI 90 (525) T ss_pred ccchhhhhhhHHHHHHHHhhhhhhcchhhhHHHHHHHHHHHHhhcCCceeeeeHHHHHhhhcChHHHHHHHHHHHHHhhh Confidence 233221 1 222222222222223221 23334420 00000000000001111111 244455553222221 Q ss_pred cCc----------------ccc---CCCchhH---HHHHHHHHhc-CHHHHHHHHHHHHhhcCeeEEEEecCccccCCCC Q lcl|NC_021324. 69 IEG----------------FRI---SEDSEGL---EELWNWWQAN-DLDEESVLGHDDSLTFGRAYITVSHPDVESGDPA 125 (480) Q Consensus 69 ~~~----------------~~~---~~d~~~~---~~l~~~~~~n-~~~~~~~~~~~~~~~~G~a~~~v~~~~~~~~d~~ 125 (480) ..| |.+ ..+.+.. ..|...+... .-..+-+.+.+.....|. .+-.|-|... T Consensus 91 ~~~~v~ql~~li~~lp~l~y~i~~~~~~k~~~~~~s~~n~~l~k~i~hk~ltrdll~q~a~~gt-lig~wlg~~~----- 164 (525) T protein:vir:10 91 IDGNVFQLYDLIFSLPPLDYQIKVLKRDKDYKEDLSTINLYLEKKIQHKQLTRDLLVQLAHSGT-LIGTWLGSKR----- 164 (525) T ss_pred hcchHHHHHHHHHhcCCcceeehhhhhccchhhHHHHHHHHHHHhHHHHHHHHHHHHHhhccCc-eeEeeecCCC----- Confidence 111 111 0111112 2222222110 111222333333333342 3444544322 Q ss_pred ceeEEEEEcc-cceEEEEeCCCCCceEEEEEE--EEEecCCceEEEEEEEeCCcEEEEEecCCCcccccccccccccccc Q lcl|NC_021324. 126 GIPLIRVESP-LYMYAELDPRNTRRVTRAVRL--YTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGDVIKHGLG 202 (480) Q Consensus 126 g~~~~~~~~p-~~~~~v~d~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 202 (480) .|.+.+++- ..+||-. +......+.+-. +....+......++-..| .....+...+..+...+.....|- T Consensus 165 -~py~~vf~~~kyvfp~~--r~~g~~v~vid~~~f~~~~~~~r~~~~~~lsp----~i~~~~y~~~~~~~~~~~~~~r~i 237 (525) T protein:vir:10 165 -EPYFNVFNNLKYVFPYG--RAKGKMVAVIDLQWFDEMSELERKLTFENLSP----LITENKYKKWKEYNGENEDALRYI 237 (525) T ss_pred -Ccchhhhhhhhhhcccc--ccCCceEEEEehHHhhhhhHHHHHHHHHhhch----hhhhhhhhHHhhcccccchhheee Confidence 233333332 2334432 222233333311 111111000000000000 000000000000111111112233 Q ss_pred ccce---EEee-----cccccCccCCCccchHHHHHHHHHHHHHHHHHHHHHHHhcchhhhhc--CCCccccccccccch Q lcl|NC_021324. 203 VVPV---VPLT-----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVIS--GVTTDELTNDGENTT 272 (480) Q Consensus 203 ~iPv---v~~~-----n~~~~~~~~G~s~i~~~v~~l~D~~~~~~s~~~~~~~~~~~p~l~i~--G~~~~~~~~~~~~~~ 272 (480) .+|+ ++.+ .++.++-+|+...+. +|.-+.+ |-++-- .+......|..+++ |...... .-..... T Consensus 238 ~LP~e~t~~lr~~tl~rnqrlG~s~vtp~l~-dI~hk~k-lrd~Eq---sIA~kii~a~avLk~gg~~gn~m-k~p~~~k 311 (525) T protein:vir:10 238 MLPISKTLVARIHTLSRNQRLGIPYGTQTLF-DIQHKQK-LRDLEQ---SIADKIIKAMAVLKFRGKDDNDS-KVKESAK 311 (525) T ss_pred ecccceeEEeeecccccCcccCcchhhhHHH-HHHHHHH-HHHHHH---HHHHHhhhhheeeeeccccCccc-cCchHHH Confidence 3343 2221 234444444443332 1222222 111111 11233334444443 4322111 0000000 Q ss_pred hhhhcchheecCC------C-----CceEEEeccc-------cHHH-HHHHHHHHHHHHHhhcCCChHHhccccccchHH Q lcl|NC_021324. 273 LDIYYGRILTLAS------E-----AAKISEFKAA-------ELRN-FAEEMEVFRKEAASITGLPPQYLSSSSENPASA 333 (480) Q Consensus 273 ~~~~~~~~~~~~g------~-----~~~~~~~~~~-------~~~~-~~~~l~~~~~~i~~~~~~p~~~~g~~~~~~~Sg 333 (480) -++-.+.--.++. + -+.|..+.-+ .+.+ -.++++ ..|....|++..-..|+..|-+|+ T Consensus 312 qkil~gVk~aleK~~kdK~Gi~vi~~Pdfa~~efp~ik~~~~glDg~K~d~I~---~DI~~A~GlS~sL~nGdggNyAta 388 (525) T protein:vir:10 312 RKVLAGVKRALEKGVKDKNGIACIAMPDFATFEFPEIKNGDKTLDPKKYDSID---NDITNATGISQVLTNGTKGNYASA 388 (525) T ss_pred HHHHHHHHHHHhcccccccCeEEEeccceeecccccccCcccCCCchhhhhhh---hhhhhhhccceeeecCCCCceeee Confidence 0111110000110 0 0233322211 1111 223444 455566666655555566665443 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCH Q lcl|NC_021324. 334 -EAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPK 412 (480) Q Consensus 334 -~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~f~~~~~~~~~~~ad~~~kl~~~~~~~~s~ 412 (480) ..|. -.-+|..-+.+...++.++++.+++ +... ..+.-..+.+-.|.+..+..|.+.++...| + |. T Consensus 389 slnld----~fykkigVm~e~Iee~y~kL~d~Vl---~~~k---~~nyifnydkd~pi~~kkk~d~LIkL~d~g--~-s~ 455 (525) T protein:vir:10 389 KLNLD----VFYKKIGVMLEIIEEIYNQLIDIIL---GEEK---GCNYIFQYNKDTPIEREKKLDTLIKLEAQG--Y-SA 455 (525) T ss_pred eeeHH----HHHHHHHHHHHHHHHHHHHHHhhhc---Cccc---CcceEEecCCCchhhhhhhhhhhhhhhccc--h-hh Confidence 3343 2334555555666666666666554 2211 122223455666778888899999998865 3 45 Q ss_pred HHHHHhCCCChhH-HHH-HHHHHHHHHHHHHHHHhhhcccCCCcC-CCCCCCCCCccCCCCCCCCCcccCC Q lcl|NC_021324. 413 EQARIDLGYTATQ-REQ-MRDWDKQETEDMIDTLYSTTKAQADAT-PKPTVTETKTETQTSPSGFNRTKTR 480 (480) Q Consensus 413 e~~~~~~~~~~d~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~~~~~~~~~~~~~~~~~ 480 (480) ...+..+|.+.++ +++ +-+..+.+ +.+.++.......-.+ ..-..+-|+-+...+....=.++.| T Consensus 456 k~vldl~gis~e~y~E~s~yEtE~lk---l~EKi~pp~~~~v~SGk~~n~iG~P~~dd~~~~dati~s~~~ 523 (525) T protein:vir:10 456 KYVLDILGISSEEYFEESIYEIEKLK---LREKIMPPLNTNVLSGKDGNDIGSPKLDDSDSSDATIESKER 523 (525) T ss_pred hhhhhhhccCcchHHHHHHHHHHHHH---HhhhccccccceeeeccccccccCCccCCCcchhhhhhhhhc Confidence 5566677765443 222 11111111 2222222222111111 0011111221111111111233445 Done!