Query         lcl|NC_021301.1_cdsid_YP_008051129.1 [gene=3] [protein=portal protein] [protein_id=YP_008051129.1] [location=2240..3610]
Match_columns 456
No_of_seqs    141 out of 497
Neff          9.9
Searched_HMMs 1612
Date          Thu Nov 7 17:26:48 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_3 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_3_vs_rec_db.hhr

 No Hit                              Prob  E-value P-value Score SS   Cols Query HMM Template HMM
  1 protein:vir:105819 Length: 456   100.0 4E-113  3E-116  636.6 52.3 456  1-456     1-456 (456)
  2 protein:vir:102602 Length: 456   100.0 4E-113  3E-116  636.6 52.3 456  1-456     1-456 (456)
  3 protein:vir:7987 Length: 456 #   100.0 1E-110  6E-114  623.6 52.5 456  1-456     1-456 (456)
  4 protein:vir:104082 Length: 485   100.0 5.2E-90 3.2E-93 510.1 50.9 432  1-456     9-467 (485)
  5 protein:vir:2500 Length: 501 #   100.0 7.1E-90 4.4E-93 509.3 48.5 439  1-456     23-481 (501)
  6 protein:vir:7768 Length: 484 #   100.0 3.7E-89 2.3E-92 505.4 49.6 431  1-456     8-468 (484)
  7 protein:vir:4223 Length: 486 #   100.0 1.5E-88 9.1E-92 502.1 50.1 429  1-456     8-467 (486)
  8 protein:vir:2427 Length: 485 #   100.0 1.3E-88 8.1E-92 502.4 49.6 432  1-456     1-472 (485)
  9 protein:vir:78227 Length: 480    100.0 2.8E-88 1.7E-91 500.6 49.3 431  3-456     1-454 (480)
 10 protein:vir:78537 Length: 480    100.0 4.2E-88 2.6E-91 499.6 49.3 432  3-456     1-463 (480)
 11 protein:vir:2341 Length: 488 #   100.0 1.2E-87 7.1E-91 497.2 49.6 434  1-456     1-480 (488)
 12 protein:vir:99916 Length: 504    100.0 4.1E-87 2.6E-90 494.2 48.0 432  1-456     1-488 (504)
 13 protein:vir:8184 Length: 474 #   100.0 3.3E-87 2E-90   494.7 47.0 432  1-456     12-474 (474)
 14 protein:vir:80680 Length: 441    100.0 1.9E-85 1.2E-88 485.0 48.9 419  2-453     1-441 (441)
 15 protein:vir:99072 Length: 479    100.0 1.8E-83 1.1E-86 474.2 47.3 426  1-456     9-455 (479)
 16 protein:vir:9568 Length: 410 #   100.0 6.3E-83 3.9E-86 471.3 39.9 386  17-444    1-410 (410)
 17 protein:vir:94742 Length: 409    100.0 1.4E-82 8.9E-86 469.3 41.3 385  5-426     1-409 (409)
18 protein:vir:9751 Length: 422 # 100.0 4.4E-82 2.7E-85 466.6 43.2 398 1-439 1-422 (422) 19 protein:vir:98444 Length: 434 100.0 1.3E-81 7.9E-85 464.1 45.3 407 39-456 1-425 (434) 20 protein:vir:1634 Length: 409 # 100.0 8.6E-82 5.3E-85 465.0 41.1 385 5-426 1-409 (409) 21 protein:vir:3964 Length: 453 # 100.0 2.1E-80 1.3E-83 457.4 46.2 426 1-456 11-453 (453) 22 protein:vir:97171 Length: 512 100.0 3E-80 1.9E-83 456.5 45.3 440 1-456 31-505 (512) 23 protein:vir:99522 Length: 470 100.0 3.2E-79 2E-82 450.9 47.3 426 1-456 19-462 (470) 24 protein:vir:9306 Length: 511 # 100.0 1.7E-78 1.1E-81 446.9 46.2 439 1-456 37-504 (511) 25 protein:vir:733 Length: 453 # 100.0 1.9E-78 1.2E-81 446.7 45.4 429 1-456 11-450 (453) 26 protein:vir:103951 Length: 511 100.0 2.7E-78 1.7E-81 445.8 46.2 438 1-456 37-499 (511) 27 protein:vir:94805 Length: 492 100.0 3.8E-78 2.4E-81 445.0 46.7 433 1-456 38-489 (492) 28 protein:vir:99781 Length: 511 100.0 3.9E-78 2.4E-81 445.0 46.7 437 1-456 37-504 (511) 29 protein:vir:78805 Length: 511 100.0 3.4E-78 2.1E-81 445.3 46.2 438 1-456 37-504 (511) 30 protein:vir:96366 Length: 511 100.0 3.4E-78 2.1E-81 445.3 46.2 438 1-456 37-504 (511) 31 protein:vir:97336 Length: 492 100.0 5.9E-78 3.6E-81 444.0 46.9 433 1-456 38-481 (492) 32 protein:vir:95806 Length: 440 100.0 2.1E-78 1.3E-81 446.4 44.3 425 12-456 1-436 (440) 33 protein:vir:9871 Length: 429 # 100.0 6.1E-78 3.8E-81 443.9 46.5 415 4-456 1-425 (429) 34 protein:vir:106571 Length: 499 100.0 5.2E-78 3.3E-81 444.3 45.8 427 1-456 13-476 (499) 35 protein:vir:96494 Length: 501 100.0 5.1E-78 3.2E-81 444.3 45.4 432 1-456 30-491 (501) 36 protein:vir:96266 Length: 474 100.0 1.1E-77 6.6E-81 442.6 47.1 432 1-456 23-472 (474) 37 protein:vir:95899 Length: 474 100.0 1.1E-77 6.6E-81 442.6 47.1 432 1-456 23-472 (474) 38 protein:vir:4898 Length: 502 # 100.0 6.5E-78 4E-81 443.8 45.8 431 1-456 37-499 (502) 39 protein:vir:3609 Length: 452 # 100.0 8.7E-78 5.4E-81 443.1 46.0 418 1-456 11-443 (452) 40 protein:vir:1236 Length: 483 # 100.0 
1.3E-77 8.3E-81 442.0 46.6 433 1-456 31-472 (483) 41 protein:vir:102950 Length: 471 100.0 8.8E-78 5.5E-81 443.0 45.4 439 1-456 1-470 (471) 42 protein:vir:96240 Length: 511 100.0 1.6E-77 9.6E-81 441.7 46.2 438 1-456 37-499 (511) 43 protein:vir:2732 Length: 501 # 100.0 1.7E-77 1E-80 441.5 46.0 431 1-456 36-489 (501) 44 protein:vir:102330 Length: 451 100.0 1.6E-77 1E-80 441.6 45.8 424 4-455 1-451 (451) 45 protein:vir:105292 Length: 478 100.0 5.4E-77 3.3E-80 438.7 46.7 438 1-456 22-476 (478) 46 protein:vir:93747 Length: 472 100.0 5.6E-77 3.5E-80 438.6 46.6 433 1-456 18-461 (472) 47 protein:vir:96839 Length: 474 100.0 6.2E-77 3.8E-80 438.4 46.8 436 1-456 22-466 (474) 48 protein:vir:5961 Length: 503 # 100.0 1E-76 6.4E-80 437.2 45.7 436 1-456 25-491 (503) 49 protein:vir:105461 Length: 470 100.0 2.2E-76 1.4E-79 435.4 46.0 439 1-456 1-470 (470) 50 protein:vir:107112 Length: 478 100.0 4.3E-76 2.7E-79 433.7 46.7 437 1-456 20-477 (478) 51 protein:vir:94546 Length: 506 100.0 2.7E-76 1.7E-79 434.8 44.7 434 1-456 19-501 (506) 52 protein:vir:106639 Length: 481 100.0 5.3E-76 3.3E-79 433.3 46.2 433 1-456 26-475 (481) 53 protein:vir:94498 Length: 474 100.0 6.9E-76 4.3E-79 432.6 46.8 432 1-456 21-471 (474) 54 protein:vir:97447 Length: 474 100.0 6.9E-76 4.3E-79 432.6 46.8 432 1-456 21-471 (474) 55 protein:vir:96179 Length: 468 100.0 3.2E-75 2E-78 429.0 46.9 435 1-455 23-468 (468) 56 protein:vir:95113 Length: 474 100.0 3.1E-75 1.9E-78 429.1 46.1 431 1-456 24-471 (474) 57 protein:vir:94101 Length: 474 100.0 9.3E-75 5.8E-78 426.5 46.0 424 1-456 12-468 (474) 58 protein:vir:105889 Length: 474 100.0 9.3E-75 5.8E-78 426.5 46.0 424 1-456 12-468 (474) 59 protein:vir:9922 Length: 489 # 100.0 3.6E-74 2.2E-77 423.2 46.7 439 1-456 9-488 (489) 60 protein:vir:79043 Length: 479 100.0 3.8E-74 2.4E-77 423.1 46.0 439 1-456 16-475 (479) 61 protein:vir:78083 Length: 537 100.0 1.7E-72 1E-75 414.1 44.4 443 1-456 8-495 (537) 62 protein:vir:38 Length: 496 # N 100.0 3.9E-56 2.4E-59 324.4 41.6 444 1-453 17-496 
(496) 63 protein:vir:80959 Length: 499 100.0 2.4E-53 1.5E-56 309.1 40.7 450 1-456 1-498 (499) 64 protein:vir:79703 Length: 505 100.0 1.5E-45 9.4E-49 266.3 45.3 442 1-452 1-505 (505) 65 protein:vir:1587 Length: 508 # 100.0 5.2E-46 3.2E-49 268.8 40.3 442 1-455 1-508 (508) 66 protein:vir:101494 Length: 527 100.0 1.1E-46 6.9E-50 272.5 33.1 443 1-456 1-506 (527) 67 protein:vir:102239 Length: 527 100.0 1.4E-46 8.5E-50 272.0 33.0 443 1-456 1-506 (527) 68 protein:vir:3028 Length: 500 # 100.0 5E-44 3.1E-47 258.0 41.2 446 1-455 18-500 (500) 69 protein:vir:9815 Length: 500 # 100.0 5E-44 3.1E-47 258.0 41.2 446 1-455 18-500 (500) 70 protein:vir:4782 Length: 522 # 100.0 2.5E-40 1.5E-43 237.7 43.2 449 1-456 1-521 (522) 71 protein:vir:7430 Length: 563 # 100.0 1.3E-41 7.8E-45 244.8 30.4 443 1-456 9-542 (563) 72 protein:vir:78907 Length: 518 100.0 3.8E-39 2.3E-42 231.2 38.9 440 4-454 1-518 (518) 73 protein:vir:98883 Length: 517 100.0 1.4E-38 8.8E-42 228.1 40.0 444 1-455 1-517 (517) 74 protein:vir:97265 Length: 513 100.0 7.1E-29 4.4E-32 174.9 35.0 426 1-456 1-494 (513) 75 protein:vir:94956 Length: 452 100.0 9.6E-28 6E-31 168.7 36.9 420 1-456 1-451 (452) 76 protein:vir:95149 Length: 501 99.9 9.4E-25 5.8E-28 152.3 36.1 432 1-456 1-495 (501) 77 protein:vir:80453 Length: 535 99.9 5.3E-25 3.3E-28 153.7 34.2 435 1-456 28-532 (535) 78 protein:vir:78393 Length: 489 99.9 2.3E-24 1.4E-27 150.2 35.3 438 1-456 1-489 (489) 79 protein:vir:95014 Length: 491 99.9 2.2E-23 1.4E-26 144.8 35.7 432 1-452 2-491 (491) 80 protein:vir:96783 Length: 488 99.9 1.9E-20 1.2E-23 128.7 34.1 417 1-442 14-488 (488) 81 protein:vir:80040 Length: 461 99.8 7.2E-20 4.5E-23 125.5 26.4 419 1-454 1-461 (461) 82 protein:vir:3420 Length: 533 # 99.8 8E-17 5E-20 108.8 35.9 432 1-456 1-532 (533) 83 protein:vir:79538 Length: 502 99.8 2E-16 1.2E-19 106.6 37.0 424 7-456 1-500 (502) 84 protein:vir:93630 Length: 776 99.7 3E-17 1.9E-20 111.2 31.9 435 1-456 38-654 (776) 85 protein:vir:10321 Length: 495 99.7 1E-16 6.2E-20 108.3 34.6 428 
2-456 1-495 (495) 86 protein:vir:96738 Length: 505 99.7 2.4E-16 1.5E-19 106.2 36.3 432 1-456 1-502 (505) 87 protein:vir:389 Length: 530 # 99.7 4.6E-17 2.8E-20 110.2 31.6 422 1-456 1-528 (530) 88 protein:vir:5249 Length: 437 # 99.7 1.1E-17 6.9E-21 113.5 26.1 394 21-456 1-437 (437) 89 protein:vir:6382 Length: 553 # 99.7 1.7E-15 1.1E-18 101.5 36.5 440 1-455 12-553 (553) 90 protein:vir:108295 Length: 711 99.7 8.6E-16 5.3E-19 103.2 33.8 436 1-456 23-661 (711) 91 protein:vir:95542 Length: 548 99.7 1.7E-15 1E-18 101.6 33.1 425 7-456 1-498 (548) 92 protein:vir:107742 Length: 537 99.7 4.7E-17 2.9E-20 110.1 24.2 404 1-456 28-534 (537) 93 protein:vir:817 Length: 714 # 99.6 1.4E-14 8.5E-18 96.6 34.5 438 1-456 8-647 (714) 94 protein:vir:10117 Length: 714 99.6 1.4E-14 8.5E-18 96.6 34.5 438 1-456 8-647 (714) 95 protein:vir:9950 Length: 714 # 99.6 1.4E-14 8.5E-18 96.6 34.5 438 1-456 8-647 (714) 96 protein:vir:2764 Length: 714 # 99.6 1.4E-14 8.5E-18 96.6 34.5 438 1-456 8-647 (714) 97 protein:vir:3296 Length: 714 # 99.6 1.4E-14 8.5E-18 96.6 34.5 438 1-456 8-647 (714) 98 protein:vir:94049 Length: 532 99.6 2.8E-16 1.7E-19 105.8 25.0 412 1-456 23-509 (532) 99 protein:vir:79647 Length: 435 99.6 3.3E-16 2E-19 105.5 24.0 386 1-456 5-434 (435) 100 protein:vir:104338 Length: 422 99.6 4.2E-16 2.6E-19 104.9 24.3 377 21-453 1-422 (422) 101 protein:vir:107662 Length: 427 99.6 6.1E-16 3.8E-19 104.0 25.1 384 1-453 1-427 (427) 102 protein:vir:99563 Length: 862 99.6 4.8E-16 3E-19 104.6 22.0 403 1-456 68-564 (862) 103 protein:vir:105619 Length: 772 99.6 2.7E-14 1.7E-17 95.0 31.5 437 1-456 11-650 (772) 104 protein:vir:96068 Length: 765 99.6 2.5E-15 1.5E-18 100.7 24.8 396 1-456 71-545 (765) 105 protein:vir:104437 Length: 714 99.6 1E-13 6.3E-17 91.8 32.9 434 1-456 1-647 (714) 106 protein:vir:77597 Length: 725 99.6 3.8E-14 2.3E-17 94.2 29.5 441 1-456 1-624 (725) 107 protein:vir:105429 Length: 708 99.6 8.9E-14 5.5E-17 92.1 31.0 446 1-456 1-643 (708) 108 protein:vir:8846 Length: 705 # 99.5 1.2E-12 7.1E-16 
86.0 35.9 429 1-456 7-618 (705) 109 protein:vir:172 Length: 708 # 99.5 1.9E-12 1.2E-15 84.9 33.1 447 1-456 1-636 (708) 110 protein:vir:105520 Length: 706 99.5 2.4E-12 1.5E-15 84.3 32.2 445 1-456 1-635 (706) 111 protein:vir:100920 Length: 725 99.5 2E-12 1.2E-15 84.7 30.2 435 1-456 1-624 (725) 112 protein:vir:9263 Length: 725 # 99.4 6.3E-12 3.9E-15 82.0 32.1 438 1-456 1-624 (725) 113 protein:vir:80165 Length: 651 99.4 2.2E-11 1.4E-14 79.0 35.2 437 1-456 3-619 (651) 114 protein:vir:3648 Length: 695 # 99.3 7E-12 4.3E-15 81.7 25.8 401 1-456 77-551 (695) 115 protein:vir:3520 Length: 720 # 99.3 1.1E-10 6.7E-14 75.2 32.0 442 1-456 1-634 (720) 116 protein:vir:101541 Length: 694 99.3 1E-11 6.3E-15 80.8 26.1 409 1-456 59-550 (694) 117 protein:vir:78589 Length: 695 99.3 1.2E-11 7.8E-15 80.4 25.9 409 1-456 60-551 (695) 118 protein:vir:106716 Length: 698 99.3 6.8E-12 4.2E-15 81.8 24.1 408 1-456 60-550 (698) 119 protein:vir:1380 Length: 422 # 99.3 2.5E-11 1.6E-14 78.7 27.1 394 7-456 1-422 (422) 120 protein:vir:95449 Length: 584 99.3 1.3E-10 8.4E-14 74.7 30.9 431 1-456 1-584 (584) 121 protein:vir:102118 Length: 409 99.2 7.7E-11 4.8E-14 76.0 27.1 393 8-456 1-409 (409) 122 protein:vir:6240 Length: 457 # 99.2 3.7E-11 2.3E-14 77.8 25.3 408 7-456 1-438 (457) 123 protein:vir:105782 Length: 449 99.2 1.6E-11 9.9E-15 79.8 23.0 399 1-456 1-445 (449) 124 protein:vir:7407 Length: 392 # 99.2 1.2E-10 7.4E-14 75.0 27.3 379 9-452 1-392 (392) 125 protein:vir:3843 Length: 397 # 99.2 3.6E-11 2.2E-14 77.8 24.3 378 7-456 1-395 (397) 126 protein:vir:63755 Length: 547 99.2 1.7E-10 1E-13 74.2 27.2 404 7-456 1-513 (547) 127 protein:vir:101648 Length: 518 99.2 1E-10 6.4E-14 75.3 25.9 394 1-456 9-424 (518) 128 protein:vir:3153 Length: 467 # 99.2 1.4E-10 8.7E-14 74.6 26.5 375 48-456 1-464 (467) 129 protein:vir:1266 Length: 416 # 99.2 2E-10 1.2E-13 73.8 27.2 385 8-454 1-416 (416) 130 protein:vir:102080 Length: 429 99.2 2.7E-10 1.7E-13 73.0 27.8 393 7-456 1-427 (429) 131 protein:vir:1023 Length: 392 # 99.2 
1.8E-10 1.1E-13 74.0 26.6 380 5-456 1-389 (392) 132 protein:vir:3989 Length: 392 # 99.2 1.8E-10 1.1E-13 74.0 26.6 380 5-456 1-389 (392) 133 protein:vir:105002 Length: 432 99.2 3.6E-10 2.3E-13 72.3 28.2 396 7-456 1-430 (432) 134 protein:vir:102855 Length: 432 99.2 3.6E-10 2.3E-13 72.3 28.2 396 7-456 1-430 (432) 135 protein:vir:107605 Length: 432 99.2 3.6E-10 2.3E-13 72.3 28.2 396 7-456 1-430 (432) 136 protein:vir:8418 Length: 409 # 99.2 3.8E-10 2.3E-13 72.2 28.1 386 7-456 1-409 (409) 137 protein:vir:1326 Length: 457 # 99.2 1.6E-10 9.9E-14 74.3 25.8 406 7-456 1-439 (457) 138 protein:vir:4952 Length: 386 # 99.2 2.1E-10 1.3E-13 73.7 25.7 379 7-453 1-386 (386) 139 protein:vir:80644 Length: 551 99.1 4.6E-10 2.9E-13 71.8 26.8 404 3-456 1-517 (551) 140 protein:vir:7853 Length: 518 # 99.1 1.7E-10 1.1E-13 74.1 24.3 390 1-456 7-424 (518) 141 protein:vir:100150 Length: 437 99.1 8.4E-10 5.2E-13 70.3 27.8 391 13-456 1-437 (437) 142 protein:vir:4454 Length: 414 # 99.1 6.3E-10 3.9E-13 71.0 26.7 387 7-456 1-409 (414) 143 protein:vir:4598 Length: 416 # 99.1 1.4E-09 8.6E-13 69.2 28.2 389 7-456 1-414 (416) 144 protein:vir:81095 Length: 416 99.1 1.4E-09 8.6E-13 69.2 28.2 389 7-456 1-414 (416) 145 protein:vir:81152 Length: 411 99.1 1.1E-09 7.1E-13 69.6 27.7 383 7-456 1-411 (411) 146 protein:vir:10362 Length: 432 99.1 2.1E-09 1.3E-12 68.2 27.6 392 1-456 1-432 (432) 147 protein:vir:81072 Length: 432 99.1 2.2E-09 1.3E-12 68.1 27.3 394 1-456 1-429 (432) 148 protein:vir:93610 Length: 454 99.0 4.1E-09 2.5E-12 66.6 28.4 391 9-456 1-434 (454) 149 protein:vir:9408 Length: 441 # 99.0 3.7E-09 2.3E-12 66.8 27.1 397 1-456 1-439 (441) 150 protein:vir:79984 Length: 441 99.0 3.7E-09 2.3E-12 66.8 27.1 397 1-456 1-439 (441) 151 protein:vir:98396 Length: 441 99.0 5E-09 3.1E-12 66.1 28.7 398 1-456 1-439 (441) 152 protein:vir:5737 Length: 419 # 99.0 5E-09 3.1E-12 66.1 28.3 386 7-456 1-418 (419) 153 protein:vir:483 Length: 413 # 99.0 5.1E-09 3.2E-12 66.0 28.1 387 8-456 1-410 (413) 154 protein:vir:79772 
Length: 648 99.0 5.6E-09 3.5E-12 65.8 31.4 399 1-456 49-493 (648) 155 protein:vir:105064 Length: 421 99.0 2E-09 1.2E-12 68.3 24.9 387 1-456 1-414 (421) 156 protein:vir:94426 Length: 409 99.0 4.6E-09 2.8E-12 66.3 26.7 393 1-456 1-408 (409) 157 protein:vir:9359 Length: 348 # 99.0 1.8E-09 1.1E-12 68.6 24.2 333 37-456 1-347 (348) 158 protein:vir:189 Length: 424 # 99.0 6.4E-09 4E-12 65.5 26.2 393 1-455 1-424 (424) 159 protein:vir:4854 Length: 386 # 99.0 7.9E-09 4.9E-12 65.0 26.6 377 7-456 1-386 (386) 160 protein:vir:1884 Length: 424 # 99.0 4.2E-09 2.6E-12 66.5 25.1 389 1-455 8-424 (424) 161 protein:vir:97060 Length: 432 99.0 9.2E-09 5.7E-12 64.6 27.2 396 1-456 1-432 (432) 162 protein:vir:960 Length: 413 # 98.9 1E-08 6.4E-12 64.4 25.8 387 1-456 1-413 (413) 163 protein:vir:4828 Length: 382 # 98.9 1.5E-09 9E-13 69.0 21.0 371 7-453 1-382 (382) 164 protein:vir:4995 Length: 384 # 98.9 5.8E-09 3.6E-12 65.8 23.5 373 7-456 1-383 (384) 165 protein:vir:4509 Length: 424 # 98.9 1.5E-08 9.4E-12 63.4 25.6 386 9-456 1-422 (424) 166 protein:vir:93943 Length: 409 98.9 2.4E-08 1.5E-11 62.4 28.2 392 1-456 1-408 (409) 167 protein:vir:102727 Length: 945 98.9 2.6E-08 1.6E-11 62.1 31.2 404 1-456 47-532 (945) 168 protein:vir:80796 Length: 574 98.8 2.8E-08 1.8E-11 62.0 30.6 406 1-456 11-526 (574) 169 protein:vir:99232 Length: 526 98.8 2.9E-08 1.8E-11 61.9 30.3 378 1-456 37-463 (526) 170 protein:vir:99853 Length: 488 98.8 1.6E-08 1E-11 63.3 23.9 387 10-456 1-446 (488) 171 protein:vir:94599 Length: 641 98.7 6.7E-08 4.1E-11 59.9 35.6 442 1-456 20-603 (641) 172 protein:vir:96980 Length: 409 98.7 6.7E-08 4.2E-11 59.9 26.6 388 1-456 1-408 (409) 173 protein:vir:4337 Length: 434 # 98.7 6.7E-08 4.2E-11 59.9 28.5 394 1-456 1-430 (434) 174 protein:vir:2683 Length: 412 # 98.7 7.7E-08 4.7E-11 59.6 29.1 390 1-456 1-411 (412) 175 protein:vir:95821 Length: 763 98.7 8E-08 4.9E-11 59.5 30.1 427 1-456 1-659 (763) 176 protein:vir:3139 Length: 599 # 98.7 1E-07 6.3E-11 58.9 29.7 437 1-456 1-599 (599) 177 
protein:vir:101647 Length: 460 98.7 1E-07 6.4E-11 58.9 26.5 400 6-456 1-460 (460) 178 protein:vir:4156 Length: 542 # 98.7 1.1E-07 6.6E-11 58.8 23.6 393 1-456 9-466 (542) 179 protein:vir:80333 Length: 419 98.7 1.3E-07 8.3E-11 58.3 26.3 378 1-456 1-404 (419) 180 protein:vir:103860 Length: 528 98.6 1.5E-07 9.1E-11 58.0 30.7 388 1-456 18-444 (528) 181 protein:vir:107880 Length: 491 98.6 1.5E-07 9.2E-11 58.0 28.8 386 1-456 1-453 (491) 182 protein:vir:81218 Length: 423 98.6 1.6E-07 9.9E-11 57.9 25.1 391 7-456 1-420 (423) 183 protein:vir:100691 Length: 535 98.6 1.7E-07 1.1E-10 57.7 29.5 395 1-456 53-516 (535) 184 protein:vir:100882 Length: 383 98.6 1.7E-07 1.1E-10 57.7 25.6 368 1-456 1-382 (383) 185 protein:vir:9641 Length: 395 # 98.6 1.5E-07 9.1E-11 58.1 23.1 376 1-455 1-395 (395) 186 protein:vir:1431 Length: 419 # 98.6 1.9E-07 1.1E-10 57.5 26.7 383 11-456 1-416 (419) 187 protein:vir:6210 Length: 394 # 98.6 1.9E-07 1.2E-10 57.5 25.3 377 7-456 1-391 (394) 188 protein:vir:79233 Length: 526 98.6 1.9E-07 1.2E-10 57.4 32.0 388 1-456 18-444 (526) 189 protein:vir:79063 Length: 491 98.6 2.2E-07 1.3E-10 57.1 26.8 392 1-456 1-459 (491) 190 protein:vir:3868 Length: 417 # 98.6 2.4E-07 1.5E-10 56.8 25.0 369 28-456 1-412 (417) 191 protein:vir:99452 Length: 651 98.6 1.3E-07 8.1E-11 58.3 21.2 430 1-456 1-538 (651) 192 protein:vir:96579 Length: 576 98.6 2.9E-07 1.8E-10 56.4 28.0 367 1-456 54-520 (576) 193 protein:vir:100650 Length: 395 98.6 2.5E-07 1.5E-10 56.8 22.6 365 7-455 1-395 (395) 194 protein:vir:9507 Length: 395 # 98.6 2.5E-07 1.5E-10 56.8 22.6 365 7-455 1-395 (395) 195 protein:vir:101289 Length: 395 98.6 2.5E-07 1.5E-10 56.8 22.6 365 7-455 1-395 (395) 196 protein:vir:4089 Length: 395 # 98.5 3.3E-07 2E-10 56.1 24.9 374 7-456 1-392 (395) 197 protein:vir:95965 Length: 385 98.5 1.9E-07 1.2E-10 57.5 21.5 362 1-455 1-385 (385) 198 protein:vir:100187 Length: 385 98.5 3.8E-07 2.3E-10 55.8 27.0 368 1-455 1-385 (385) 199 protein:vir:104259 Length: 403 98.5 4E-07 2.5E-10 55.7 24.7 374 
7-455 1-403 (403) 200 protein:vir:78310 Length: 376 98.5 4.1E-07 2.6E-10 55.6 25.6 358 1-454 1-376 (376) 201 protein:vir:94666 Length: 723 98.5 4.8E-07 3E-10 55.2 27.6 371 1-456 1-429 (723) 202 protein:vir:98643 Length: 395 98.5 3.6E-07 2.2E-10 55.9 21.6 371 7-455 1-395 (395) 203 protein:vir:95378 Length: 406 98.5 5.4E-07 3.4E-10 54.9 24.2 380 7-456 1-405 (406) 204 protein:vir:9702 Length: 406 # 98.5 6E-07 3.7E-10 54.7 24.4 377 1-456 1-403 (406) 205 protein:vir:99312 Length: 563 98.4 6E-07 3.7E-10 54.7 30.2 401 1-456 25-524 (563) 206 protein:vir:95599 Length: 563 98.4 6E-07 3.7E-10 54.7 30.2 401 1-456 25-524 (563) 207 protein:vir:100249 Length: 431 98.4 7.2E-07 4.5E-10 54.3 26.3 388 7-456 1-426 (431) 208 protein:vir:1082 Length: 359 # 98.4 7.3E-07 4.6E-10 54.2 24.3 348 1-426 1-359 (359) 209 protein:vir:94709 Length: 522 98.4 9.1E-07 5.6E-10 53.7 36.6 432 1-456 1-514 (522) 210 protein:vir:7321 Length: 556 # 98.3 1.2E-06 7.5E-10 53.0 38.1 433 1-456 1-531 (556) 211 protein:vir:95315 Length: 559 98.3 1.6E-06 1E-09 52.3 38.6 433 1-456 1-531 (559) 212 protein:vir:3361 Length: 535 # 98.3 1.8E-06 1.1E-09 52.1 37.9 427 1-456 1-517 (535) 213 protein:vir:4194 Length: 540 # 98.3 1.9E-06 1.2E-09 51.9 28.7 392 1-456 6-462 (540) 214 protein:vir:103765 Length: 549 98.2 2.5E-06 1.5E-09 51.3 38.6 431 1-456 1-539 (549) 215 protein:vir:80134 Length: 403 98.2 3.5E-06 2.1E-09 50.5 25.3 372 1-456 1-402 (403) 216 protein:vir:102668 Length: 547 98.1 5.4E-06 3.3E-09 49.5 37.2 434 1-456 1-544 (547) 217 protein:vir:77981 Length: 448 98.0 6.4E-06 3.9E-09 49.1 24.3 404 1-456 1-440 (448) 218 protein:vir:108215 Length: 469 98.0 7.7E-06 4.8E-09 48.6 28.4 411 1-456 1-457 (469) 219 protein:vir:1986 Length: 512 # 98.0 7.9E-06 4.9E-09 48.6 29.9 376 1-456 37-470 (512) 220 protein:vir:103219 Length: 201 98.0 6.2E-08 3.8E-11 60.1 7.8 182 255-453 1-201 (201) 221 protein:vir:98506 Length: 555 98.0 7.9E-06 4.9E-09 48.5 40.6 435 1-456 1-540 (555) 222 protein:vir:107822 Length: 555 98.0 7.9E-06 4.9E-09 48.5 
40.6 435 1-456 1-540 (555) 223 protein:vir:107404 Length: 555 98.0 7.9E-06 4.9E-09 48.5 40.6 435 1-456 1-540 (555) 224 protein:vir:104500 Length: 537 98.0 8.6E-06 5.4E-09 48.3 21.9 418 1-456 23-525 (537) 225 protein:vir:1538 Length: 535 # 97.9 1.1E-05 6.8E-09 47.8 38.2 430 1-456 1-518 (535) 226 protein:vir:94572 Length: 535 97.9 1.2E-05 7.3E-09 47.6 35.8 427 1-456 1-518 (535) 227 protein:vir:78641 Length: 278 97.9 1.3E-05 8.1E-09 47.4 25.7 265 71-392 1-278 (278) 228 protein:vir:79511 Length: 448 97.8 1.6E-05 1E-08 46.9 25.6 395 1-456 19-440 (448) 229 protein:vir:2198 Length: 536 # 97.8 1.7E-05 1.1E-08 46.7 37.4 436 1-456 1-525 (536) 230 protein:vir:10447 Length: 536 97.8 1.9E-05 1.2E-08 46.5 37.0 436 1-456 1-525 (536) 231 protein:vir:98816 Length: 446 97.8 2E-05 1.2E-08 46.3 25.0 402 1-429 6-446 (446) 232 protein:vir:8883 Length: 543 # 97.8 2E-05 1.3E-08 46.3 33.8 436 1-456 1-520 (543) 233 protein:vir:6896 Length: 523 # 97.7 2.5E-05 1.6E-08 45.8 19.6 420 1-454 29-523 (523) 234 protein:vir:94869 Length: 378 97.7 2.8E-05 1.7E-08 45.6 23.0 348 21-456 1-378 (378) 235 protein:vir:103458 Length: 524 97.6 3.4E-05 2.1E-08 45.1 19.5 415 1-454 29-524 (524) 236 protein:vir:7208 Length: 524 # 97.6 3.6E-05 2.2E-08 45.0 19.5 415 1-454 29-524 (524) 237 protein:vir:95254 Length: 488 97.6 4.3E-05 2.7E-08 44.5 28.2 422 1-456 8-482 (488) 238 protein:vir:104892 Length: 558 97.5 5.3E-05 3.3E-08 44.0 23.1 418 1-456 23-526 (558) 239 protein:vir:8100 Length: 466 # 97.4 7.3E-05 4.5E-08 43.3 25.5 395 7-455 1-466 (466) 240 protein:vir:94002 Length: 378 97.4 8.5E-05 5.2E-08 42.9 21.6 345 21-456 1-378 (378) 241 protein:vir:99672 Length: 532 97.3 8.9E-05 5.5E-08 42.8 35.6 430 1-456 1-521 (532) 242 protein:vir:8317 Length: 409 # 97.3 9.1E-05 5.6E-08 42.7 25.4 358 7-447 1-409 (409) 243 protein:vir:1661 Length: 378 # 97.3 0.0001 6.2E-08 42.5 23.5 347 21-456 1-378 (378) 244 protein:vir:1785 Length: 555 # 97.2 0.00013 7.9E-08 41.9 35.0 421 1-456 1-535 (555) 245 protein:vir:81017 Length: 521 97.2 
0.00015 9.3E-08 41.5 23.7 398 1-454 46-521 (521) 246 protein:vir:103177 Length: 533 97.1 0.00016 9.9E-08 41.4 23.3 422 1-456 19-514 (533) 247 protein:vir:93867 Length: 378 97.1 0.00016 9.9E-08 41.4 21.3 347 21-456 1-378 (378) 248 protein:vir:6596 Length: 521 # 97.1 0.00018 1.1E-07 41.2 23.7 406 1-454 41-521 (521) 249 protein:vir:100039 Length: 522 97.0 0.0002 1.3E-07 40.8 37.4 424 4-456 1-508 (522) 250 protein:vir:106282 Length: 521 97.0 0.0002 1.3E-07 40.8 20.6 417 1-454 31-521 (521) 251 protein:vir:108049 Length: 524 97.0 0.00022 1.3E-07 40.7 19.9 398 1-454 44-524 (524) 252 protein:vir:78161 Length: 355 97.0 0.00023 1.4E-07 40.6 22.6 294 116-456 1-326 (355) 253 protein:vir:105641 Length: 516 96.9 0.00029 1.8E-07 40.0 33.8 421 1-456 1-510 (516) 254 protein:vir:7017 Length: 515 # 96.7 0.00043 2.6E-07 39.1 34.7 420 1-456 1-513 (515) 255 protein:vir:98265 Length: 524 96.6 0.00048 3E-07 38.8 22.2 400 1-454 51-524 (524) 256 protein:vir:858 Length: 378 # 96.2 0.00091 5.7E-07 37.2 23.0 347 21-456 1-378 (378) 257 protein:vir:101806 Length: 516 96.1 0.001 6.4E-07 37.0 22.2 413 1-454 27-516 (516) 258 protein:vir:101189 Length: 516 96.1 0.001 6.4E-07 37.0 22.2 413 1-454 27-516 (516) 259 protein:vir:5665 Length: 511 # 95.7 0.0016 9.8E-07 36.0 20.8 416 1-454 21-511 (511) 260 protein:vir:100598 Length: 516 95.7 0.0016 9.9E-07 35.9 22.1 417 1-456 27-516 (516) 261 protein:vir:96988 Length: 516 95.2 0.0026 1.6E-06 34.7 33.1 417 1-456 5-512 (516) 262 protein:vir:103330 Length: 517 94.9 0.0033 2E-06 34.2 35.8 423 1-456 1-517 (517) 263 protein:vir:106999 Length: 564 94.7 0.0037 2.3E-06 33.9 22.2 426 1-456 20-534 (564) 264 protein:vir:267 Length: 348 # 94.5 0.0042 2.6E-06 33.6 25.0 297 1-402 27-348 (348) 265 protein:vir:5691 Length: 344 # 94.2 0.0051 3.2E-06 33.1 19.9 294 1-397 1-344 (344) 266 protein:vir:78696 Length: 542 93.9 0.0059 3.7E-06 32.8 37.7 419 1-456 1-520 (542) 267 protein:vir:5839 Length: 533 # 93.5 0.0073 4.5E-06 32.3 25.7 399 1-456 1-480 (533) 268 protein:vir:103971 
Length: 376 93.1 0.0088 5.5E-06 31.8 23.2 301 1-399 56-376 (376) 269 protein:vir:78191 Length: 351 91.8 0.014 8.9E-06 30.7 24.3 302 1-399 31-351 (351) 270 protein:vir:98567 Length: 340 91.6 0.015 9.4E-06 30.5 19.3 293 1-396 1-340 (340) 271 protein:vir:2013 Length: 344 # 89.7 0.025 1.5E-05 29.4 22.3 294 1-397 1-344 (344) 272 protein:vir:1150 Length: 350 # 89.6 0.025 1.6E-05 29.3 21.8 292 1-395 34-350 (350) 273 protein:vir:79207 Length: 351 88.9 0.03 1.8E-05 29.0 24.1 302 1-399 31-351 (351) 274 protein:vir:6322 Length: 510 # 88.7 0.031 1.9E-05 28.9 34.8 419 1-456 1-502 (510) 275 protein:vir:78749 Length: 337 86.6 0.044 2.8E-05 28.0 23.0 296 1-395 1-337 (337) 276 protein:vir:345 Length: 663 # 86.2 0.047 2.9E-05 27.9 29.1 423 1-456 1-592 (663) 277 protein:vir:6058 Length: 344 # 85.6 0.052 3.2E-05 27.6 24.0 294 1-397 1-344 (344) 278 protein:vir:78942 Length: 510 80.3 0.096 6E-05 26.2 36.7 419 1-456 1-506 (510) 279 protein:vir:80211 Length: 514 78.2 0.12 7.2E-05 25.7 34.9 432 1-456 1-510 (514) 280 protein:vir:3780 Length: 345 # 76.0 0.14 8.7E-05 25.3 25.0 310 1-397 1-345 (345) 281 protein:vir:100328 Length: 346 72.7 0.18 0.00011 24.7 22.2 299 1-397 24-346 (346) 282 protein:vir:98853 Length: 219 69.4 0.22 0.00014 24.2 12.3 198 154-396 1-219 (219) 283 protein:vir:79150 Length: 368 67.8 0.25 0.00015 23.9 20.0 305 1-408 39-368 (368) 284 protein:vir:3743 Length: 345 # 64.5 0.3 0.00019 23.5 24.2 303 1-393 23-345 (345) 285 protein:vir:4698 Length: 251 # 45.0 0.79 0.00049 21.2 18.6 241 1-307 1-251 (251) 286 protein:vir:105889 Length: 474 23.5 2.3 0.0014 18.6 21.5 404 1-456 15-474 (474) 287 protein:vir:94101 Length: 474 23.5 2.3 0.0014 18.6 21.5 404 1-456 15-474 (474) 288 protein:vir:101418 Length: 569 21.6 2.6 0.0016 18.3 23.0 432 1-456 63-567 (569) No 1 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=4.4e-113 
Score=636.58 Aligned_cols=456 Identities=98% Similarity=1.485 Sum_probs=437.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) ||+.||++++++|+++|..+++|++++++||+|+|+|++++++.+++++..++|+++|||++|||+.++|++|+||++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 99999999999999999999999999999999999999999999999999889999999999999999999999999988 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 160 (456) +.|.+....++++|++|+|+.++.++++++++|||||+++|.|++|.+++++++|++++++||+..++++.+++++|.+. T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~ 160 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDL 160 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEec Confidence 77888888899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHH Q lcl|NC_021301. 
161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) Q Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~ 240 (456) ++...+..+|.++++..+...+..............++.|++....+|++++||||+++|++|+|+|+++++|||+||++ T Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~~~~ 240 (456) T protein:vir:10 161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) T ss_pred CCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHHHHH Confidence 99999999999998888887777766666666667788899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHH Q lcl|NC_021301. 241 ELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLS 320 (456) Q Consensus 241 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~ 320 (456) +|++.++.+++++|+++++|...+.+..++.|+++.....++...+.+|..++++++++++++++++|+++++.++++|+ T Consensus 241 ~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~ 320 (456) T protein:vir:10 241 ELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQANDFTPMLSAIKEHIRQLS 320 (456) T ss_pred HHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEecccChhHHHHHHHHHHHHHH Confidence 99999999999999999999988888889999999888889999999999999999999999999999999999999999 Q ss_pred hhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH Q lcl|NC_021301. 
321 SATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA 400 (456) Q Consensus 321 ~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad 400 (456) ++++||++.||++++|+||+||++++.+|++||+++++.|+++|++++++++++.|..+..+++++|+++.|+|.++.|| T Consensus 321 ~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~~~~~~~~ad 400 (456) T protein:vir:10 321 SATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYS 400 (456) T ss_pred hccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999888889999999999999999999 Q ss_pred HHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 401 ~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +++||+++|++|++++++++||++++++++|++|+++|.+.+++...+.|+++|+| T Consensus 401 a~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=4.4e-113 Score=636.58 Aligned_cols=456 Identities=98% Similarity=1.485 Sum_probs=437.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) ||+.||++++++|+++|..+++|++++++||+|+|+|++++++.+++++..++|+++|||++|||+.++|++|+||++.+ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ivd~~~~~l~~~~~~~~~ 80 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 99999999999999999999999999999999999999999999999999889999999999999999999999999988 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 160 (456) +.|.+....++++|++|+|+.++.++++++++|||||+++|.|++|.+++++++|++++++||+..++++.+++++|.+. T Consensus 81 ~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~i~~~~~~ 160 (456) T protein:vir:10 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRAAMRWWRDL 160 (456) T ss_pred CCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEEEEEEEEec Confidence 77888888899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHH Q lcl|NC_021301. 
161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) Q Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~ 240 (456) ++...+..+|.++++..+...+..............++.|++....+|++++||||+++|++|+|+|+++++|||+||++ T Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~g~gd~e~vi~liDa~~~~ 240 (456) T protein:vir:10 161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) T ss_pred CCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecCCCCCchhhhhHHHHHHHHHH Confidence 99999999999998888887777766666666667788899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHH Q lcl|NC_021301. 241 ELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLS 320 (456) Q Consensus 241 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~ 320 (456) +|++.++.+++++|+++++|...+.+..++.|+++.....++...+.+|..++++++++++++++++|+++++.++++|+ T Consensus 241 ~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~ 320 (456) T protein:vir:10 241 ELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQANDFTPMLSAIKEHIRQLS 320 (456) T ss_pred HHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEecccChhHHHHHHHHHHHHHH Confidence 99999999999999999999988888889999999888889999999999999999999999999999999999999999 Q ss_pred hhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH Q lcl|NC_021301. 
321 SATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA 400 (456) Q Consensus 321 ~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad 400 (456) ++++||++.||++++|+||+||++++.+|++||+++++.|+++|++++++++++.|..+..+++++|+++.|+|.++.|| T Consensus 321 ~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~~~~~~v~w~~~~~~~~~~~ad 400 (456) T protein:vir:10 321 SATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYS 400 (456) T ss_pred hccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999888889999999999999999999 Q ss_pred HHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 401 ~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +++||+++|++|++++++++||++++++++|++|+++|.+.+++...+.|+++|+| T Consensus 401 a~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=1e-110 Score=623.60 Aligned_cols=456 Identities=99% Similarity=1.484 Sum_probs=438.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |++.||++++++|+++|..+++|++++++||+|+|+|++++++.+++++..++++++||+++|||+.++|++|+||++.. 
T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ivd~~~~~l~~~g~~~~~ 80 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHHHHHHHHhhhccCCeecCC Confidence 99999999999999999999999999999999999999999999999999888999999999999999999999999988 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 160 (456) +.|.+..+.++++|++|+|+.++.++++++++|||||+++|++++|.+++++++|++++++||+...+++.+++++|.+. T Consensus 81 ~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~~~ 160 (456) T protein:vir:79 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) T ss_pred CCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCceEEEEEEEEec Confidence 77888888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHH Q lcl|NC_021301. 
161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) Q Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~ 240 (456) ++...+..+|.++.++.+....+..............+.|.+....+|++++|||++++|++|+|+|+++++|||+||++ T Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N~~~~gd~e~v~~liD~~~~~ 240 (456) T protein:vir:79 161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) T ss_pred CCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecCCCCCchhhhhHHHHHHHHHH Confidence 99999999999999999988887777777777777788888899999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHH Q lcl|NC_021301. 241 ELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLS 320 (456) Q Consensus 241 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~ 320 (456) +|++.++++++++|+++++|.....+..+++|+.+.....+....+.+|..+++++++|++++++++|+++++.++++|+ T Consensus 241 ~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~ 320 (456) T protein:vir:79 241 ELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLS 320 (456) T ss_pred HHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccCCCCcceeeecccChHHHHHHHHHHHHHHH Confidence 99999999999999999999988888889999998888889999999999999999999999999999999999999999 Q ss_pred hhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH Q lcl|NC_021301. 
321 SATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA 400 (456) Q Consensus 321 ~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad 400 (456) +.+++|++.||++++|+||+||++++.+|++||+++++.|+++|++++++++++.|..+..+++++|+++.|+|.++.|| T Consensus 321 ~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~~~~~i~v~w~~~~~~s~~~~ad 400 (456) T protein:vir:79 321 SATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYS 400 (456) T ss_pred hhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccceEEeCCCCCcCHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999888889999999999999999999 Q ss_pred HHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 401 ~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +++||+++|++|++++++.+|+++++++++|++|+++|.+.+++..++.+++|||+ T Consensus 401 a~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 401 AASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=5.2e-90 Score=510.07 Aligned_cols=432 Identities=18% Similarity=0.191 Sum_probs=355.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -.++++..+++.|+++|..+++|++++++||+|+|+++++++..++.++.. 
++++|||++|||+.++||+++||+..+ T Consensus 9 ~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~--~~~~n~~~~ivd~~~~~l~~~g~~~~~ 86 (485) T protein:vir:10 9 EEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSL--LAHVGYPRLYVDSIAERQAVEGFRFGD 86 (485) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhh--hhhcCcHHHHHHHHHhhhcccceecCC Confidence 347889999999999999999999999999999999999999998887743 577899999999999999999998653 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC--------CCceEEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD--------DGTATITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~--------dg~~~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) +.+....++++|++|+|+.++.++++++++|||||++||+++ ++.++|++++|++++++||+..++...+ T Consensus 87 --~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~~~~~~ 164 (485) T protein:vir:10 87 --ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPRIGRVSKA 164 (485) T ss_pred --CchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCCCCceeEE Confidence 445677899999999999999999999999999999999985 4678899999999999999988776666 Q ss_pred EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC------CCCCCc Q lcl|NC_021301. 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN------PDGMGE 226 (456) Q Consensus 153 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n------~~g~s~ 226 (456) +++++...++...+..+|+++.++.|.. 
.++.|......+|.+++||||+|.| ++|+|+ T Consensus 165 ~~~~~~~~~~~~~~~~~y~~~~~~~~~~---------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~ 229 (485) T protein:vir:10 165 IRVAYDAEGNEIQAATLYTPNDIFGWYR---------------VENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSE 229 (485) T ss_pred EEEEEeeCCCeEEEEEEEeCCeEEEEEE---------------cCCceEEeccccCCCCcccEEEeccccccCCCCCccc Confidence 5555555556667788999999888753 2344655666788999999988766 368999 Q ss_pred HhH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc-ccccchhhhhhhhhhhccceeccC-CCceeEeeccc Q lcl|NC_021301. 227 VEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV-DENGNAIDYASIFEAAPGALWELP-PGVDIWESQTN 303 (456) Q Consensus 227 ~~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~~~ 303 (456) |++ +++|||+||+++|++.++++++++|+++++|.+.+.+.. ++.|. ..+....+++|..+ ++++|+|++.+ T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~-----~~~~~~~~~i~~~~~~d~k~~q~~~~ 304 (485) T protein:vir:10 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQ-----TLFDAYLARILAFEDAEGKIQQFSAA 304 (485) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccc-----hhhhhcccceeccCCCCceEEeeccc Confidence 985 899999999999999999999999999999987654432 22232 23556677888765 78999999999 Q ss_pred chHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----C Q lcl|NC_021301. 304 DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE----S 378 (456) Q Consensus 304 ~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~----~ 378 (456) ++++|+++++.++++|++++++|++.||+.+.| +||+||++++.+|++||+++++.|+.+|++++++++++.+. . 
T Consensus 305 ~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~~~~~~~ 384 (485) T protein:vir:10 305 ELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMKGGDVPP 384 (485) T ss_pred chHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCcc Confidence 999999999999999999999999999987777 59999999999999999999999999999999999887653 2 Q ss_pred cccceeEEecCCCCcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhHHHHHHHHHHHHHHH---HHhhhhhhhcccc Q lcl|NC_021301. 379 VEDTVDVSFESPDRVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQIT---LFAGNSVQRPQED 453 (456) Q Consensus 379 ~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~---~~~~~~~~~~~~d 453 (456) +...++++|+++.|+|.++.+|+++||+++| ++|++|+++++||+++++++++..+.++... .+..-....+..+ T Consensus 385 ~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~ 464 (485) T protein:vir:10 385 DMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLIGTMVDPNPTVP 464 (485) T ss_pred cceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCC Confidence 3457899999999999999999999999876 8999999999999998876554322222111 1111111111111 Q ss_pred cCC Q lcl|NC_021301. 454 GSR 456 (456) Q Consensus 454 ~~~ 456 (456) +.. T Consensus 465 ~~~ 467 (485) T protein:vir:10 465 GSP 467 (485) T ss_pred CCC Confidence 111 No 5 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=7.1e-90 Score=509.34 Aligned_cols=439 Identities=16% Similarity=0.176 Sum_probs=358.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |+++...++++.|+.+|..+++|++++++||+|+|+++.++++.++.++..++++++|||++|||++++||+++||+... T Consensus 23 ~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ivd~~a~~l~~~gf~~~d 102 (501) T protein:vir:25 23 MSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSLVRDSFAQNLSVVGYRNAL 102 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHHHHHHHHhhhcccceecCC Confidence 78888888999999999999999999999999999999999999999999888899999999999999999999998753 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEE-eCCCCceEEEEEEEEEe Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV-DPLQPWRIRSAMRWWRD 159 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~-d~~~~~~~~~~~~~~~~ 159 (456) .+..+.++++|+.|+|+.++.++++++++|||||++||.+++| ++|+++||++++++| |+..++++.+++++|.. T Consensus 103 ---~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~-~~i~~~sp~~~~~iy~D~~~~~~~~~ai~~~~~ 178 (501) T protein:vir:25 103 ---AKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG-PVFRTRSPRQILAVYADPSVDAWPQYALETWVA 178 (501) T ss_pred ---ccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-CeEEEeccccEEEEEecCCCCcceeEEEEEEee Confidence 3345678999999999999999999999999999999999888 689999999999999 56777788999998876 Q ss_pred cC--CceEEEEEEcCCeEEEEEEeeeeccccc--ceeeccCC----CceeecccccccCceeEEEEccC-----CCCCCc Q lcl|NC_021301. 160 LD--AESDFAIVWSGDGWQKFARPCFVQSSSR--RRLVTRIS----DSWVPVGDAVVTGSPPPVVVYQN-----PDGMGE 226 (456) Q Consensus 160 ~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~~~~~~~~~~pvv~~~n-----~~g~s~ 226 (456) .+ +...+.++|++..+|.+........... .+...... 
.........+|.+++||||+|.| ++|+|+ T Consensus 179 ~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~~~~~~~g~sd 258 (501) T protein:vir:25 179 QKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNGRDADDMIVGE 258 (501) T ss_pred ccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccceeeEeccCccccCccccch Confidence 54 3456678899888877754332211111 11111111 11122234567888899988865 568999 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccC-CCceeEeecccch Q lcl|NC_021301. 227 VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQTNDF 305 (456) Q Consensus 227 ~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~~~~~ 305 (456) |+++++|+|+||+++|++.++++++++|++|++|++...+ ..+....+++|..+ +++++++++++++ T Consensus 259 ie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~------------~~~~~~~~~i~~~~~~~~~~~q~~~~~~ 326 (501) T protein:vir:25 259 VAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKA------------EVLKASALRVWTFEDPEVKAQAFPPASV 326 (501) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCcc------------chhhhcccceeccCCCCceEEEecccCh Confidence 9999999999999999999999999999999999865322 23455667788765 6899999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---ccc Q lcl|NC_021301. 306 TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV---EDT 382 (456) Q Consensus 306 ~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~---~~~ 382 (456) ++|+++++.++++|++.+++|++.||+.++|+||+||++++.+|.++++++++.|+++|++++++++++.|... 
..+ T Consensus 327 ~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~~~ 406 (501) T protein:vir:25 327 EPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAADSG 406 (501) T ss_pred HHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999988543 357 Q ss_pred eeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHH-hCCCChhHHHHHHHHHHHHHHHHHhhhhhh-hcccccCC Q lcl|NC_021301. 383 VDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRN-ILNYNADQIKQDDLDRAREQITLFAGNSVQ-RPQEDGSR 456 (456) Q Consensus 383 i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~-~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~-~~~~d~~~ 456 (456) ++++|+++.|+|.++.||+++||+++|+ |.+|++. ++|++++++++++.++.+++.......... .+...++. T Consensus 407 i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 481 (501) T protein:vir:25 407 AEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPPP 481 (501) T ss_pred eeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCCC Confidence 8999999999999999999999999886 6677765 568998887776655555544433332211 11111111 No 6 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=3.7e-89 Score=505.43 Aligned_cols=431 Identities=17% Similarity=0.174 Sum_probs=353.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -++.||+++++.|++++..+.+|++++.+||+|+|+++++++..+++++.. ++++|||++|||++++||+++||+... 
T Consensus 8 ~~~~~~~~~~~~l~~~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~--~~~~n~~~~ivd~~~~~l~~~g~~~~~ 85 (484) T protein:vir:77 8 QENVDPEKAREEMLNLFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQKL--LAHVGYPRLYIDAIAARQELEGFRLGG 85 (484) T ss_pred cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhHHhh--hhhcCcHHHHHHHHHhhhccCceecCC Confidence 455667899999999999999999999999999999999998888887643 578999999999999999999999753 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc--------eEEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT--------ATITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~--------~~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) +.+..+.++++|++|+|+.++.++++++++||+||++||.+++|. ++|++++|++++++||+..+ ++.+ T Consensus 86 --~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~~~-~~~~ 162 (484) T protein:vir:77 86 --ADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPRTR-QVMR 162 (484) T ss_pred --cchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCCCC-ceEE Confidence 445577899999999999999999999999999999999998875 57999999999999998865 4666 Q ss_pred EEEEEEec-CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCC------CCCC Q lcl|NC_021301. 153 AMRWWRDL-DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP------DGMG 225 (456) Q Consensus 153 ~~~~~~~~-d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~------~g~s 225 (456) ++++|... ++...+.++|+++.++.+.. ..+.|......+|++++||||+|.|. 
+|+| T Consensus 163 a~~~~~~~~~~~~~~~~~y~~~~~~~~~~---------------~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~~G~s 227 (484) T protein:vir:77 163 AIRAIEDEEGNEVIGATLYLPNNTVIWNR---------------EDGQWVQVANVAHNLEMVPVIPIPNRTRLSDLYGTT 227 (484) T ss_pred EEEEEEeecCCcEEEEEEEecCeEEEEEe---------------cCCceEeeccccCCCCCcceEEeccccccCccCCcc Confidence 66666654 45567788999988877753 23456666677889999999888663 6899 Q ss_pred cHhH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccc-cccchhhhhhhhhhhccceeccC-CCceeEeecc Q lcl|NC_021301. 226 EVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVD-ENGNAIDYASIFEAAPGALWELP-PGVDIWESQT 302 (456) Q Consensus 226 ~~~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~~ 302 (456) +|++ +++|+|+||+++|++.++++++++|+++++|.+...+..+ +.| ...+....+++|..+ +++++++++. T Consensus 228 ~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~q~~~ 302 (484) T protein:vir:77 228 EITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETG-----QTLFDAYLARILAFEDHESKAQQFSA 302 (484) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhccccccc-----chhhhhhhhhhcccCCCCceeEeecC Confidence 9985 8999999999999999999999999999999876544322 222 234566677787765 6799999999 Q ss_pred cchHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--- Q lcl|NC_021301. 303 NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--- 378 (456) Q Consensus 303 ~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~--- 378 (456) +++++|+++++.++++|+++++||++.||+.+.| +||+||++++.+|++||+++++.|+++|++++++++++.+.. 
T Consensus 303 ~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~~~~~ 382 (484) T protein:vir:77 303 AELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMNGGDIP 382 (484) T ss_pred CChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcc Confidence 9999999999999999999999999999988777 599999999999999999999999999999999999887532 Q ss_pred -cccceeEEecCCCCcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhHHHHHHHHHHHHHHHH---Hhhhhhhhccc Q lcl|NC_021301. 379 -VEDTVDVSFESPDRVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITL---FAGNSVQRPQE 452 (456) Q Consensus 379 -~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~---~~~~~~~~~~~ 452 (456) +...++++|+++.++|.++.+|+++||+++| ++|++|+++++||+++++++++..+.++.... +.......++. T Consensus 383 ~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~ 462 (484) T protein:vir:77 383 PEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSG 462 (484) T ss_pred cccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHhhhccccccC Confidence 3357899999999999999999999999876 89999999999999987776543332222211 11100100111 Q ss_pred --ccCC Q lcl|NC_021301. 453 --DGSR 456 (456) Q Consensus 453 --d~~~ 456 (456) ++.. T Consensus 463 ~~~~~~ 468 (484) T protein:vir:77 463 GGNPDN 468 (484) T ss_pred CCCCCC Confidence 1111 No 7 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=1.5e-88 Score=502.12 Aligned_cols=429 Identities=18% Similarity=0.202 Sum_probs=349.9 Q ss_pred CC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 
1 MT-ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~-~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) |+ ++++.++++.|+++|..+.+|++++.+||+|+|+|++++...+++++.. ++++|||++|||++++||.++||+.. T Consensus 8 ~~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~--~~v~n~~~~iVd~~~~~l~~~g~~~~ 85 (486) T protein:vir:42 8 MEEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTVPREMQQL--LAHVGYPRLYVDSVAERQAVEGFRLG 85 (486) T ss_pred CCCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhcccccchhHhhh--hhccchHHHHHHHHHhhhcccceecC Confidence 32 5567889999999999999999999999999999999988888877643 57899999999999999999999865 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC--------CCceEEEEEccceeEEEEeCCCCceEE Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD--------DGTATITADSPETMVVSVDPLQPWRIR 151 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~--------dg~~~i~~~~p~~~~~~~d~~~~~~~~ 151 (456) . +++....++++|++|+|+.++.++++++++|||||++||+++ ++.+++++++|++++++||+..++ +. T Consensus 86 ~--~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~~~~-~~ 162 (486) T protein:vir:42 86 D--ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPRINR-VS 162 (486) T ss_pred C--CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCCCCC-eE Confidence 3 444567799999999999999999999999999999999875 556799999999999999988765 66 Q ss_pred EEEEEEEecC-CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCC------CCC Q lcl|NC_021301. 152 SAMRWWRDLD-AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP------DGM 224 (456) Q Consensus 152 ~~~~~~~~~d-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~------~g~ 224 (456) +++++|++.+ +...+..+|+++.+++|.. .++.|......+|.++.||||+|.|. +|. 
T Consensus 163 ~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~---------------~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~ 227 (486) T protein:vir:42 163 KAIRVAYDKEGNEIQAATLYTPMETIGWFR---------------ADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGT 227 (486) T ss_pred EEEEEEEecCCCeEEEEEEEcCCcEEEEEe---------------cCCcEEeecceecCCCCceEEEeccccccCCCCCc Confidence 6676666544 4456688999998888753 23446666667788889998877653 689 Q ss_pred CcHhH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccc-cccchhhhhhhhhhhccceeccC-CCceeEeec Q lcl|NC_021301. 225 GEVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVD-ENGNAIDYASIFEAAPGALWELP-PGVDIWESQ 301 (456) Q Consensus 225 s~~~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~ 301 (456) |+|++ |++|||+||+++|++.++++++++|+++++|.+...+..+ +.+. ..+....+++|..+ ++++++|++ T Consensus 228 s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~q~~ 302 (486) T protein:vir:42 228 SEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQ-----TLFDAYLARILAFEDAEGKIQQFS 302 (486) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCcccccccccccc-----chhhhhhchhcccCCCCceEEeec Confidence 99995 8899999999999999999999999999999876554322 2222 34566677777664 789999999 Q ss_pred ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--- Q lcl|NC_021301. 302 TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE--- 377 (456) Q Consensus 302 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~--- 377 (456) ++++++|+++++.++++|++++++|++.||+.+.| +||+||++++.+|++||+++++.|+++|++++++++++.+. 
T Consensus 303 ~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~~~~ 382 (486) T protein:vir:42 303 AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKGGDV 382 (486) T ss_pred ccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc Confidence 99999999999999999999999999999987777 59999999999999999999999999999999999988653 Q ss_pred -CcccceeEEecCCCCcCHHHHHHHHHHHHhc--CCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh----h-h Q lcl|NC_021301. 378 -SVEDTVDVSFESPDRVTLGEKYAAASLAKAA--GESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV----Q-R 449 (456) Q Consensus 378 -~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~--g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~----~-~ 449 (456) .+..+++++|+++.|+|.++.||+++||+++ |++|++|+++++||+++++++++ ++++|......... . . T Consensus 383 ~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~--~~~~e~~~~~~~~~~~~~~~~ 460 (486) T protein:vir:42 383 PPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMR--RWDEEEAAMGLGLLGTMVDAD 460 (486) T ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHH--HHHHHHHHHHHHHHHHhhcCC Confidence 2346799999999999999999999999986 78999999999999988766443 33322221111110 0 0 Q ss_pred cccccCC Q lcl|NC_021301. 450 PQEDGSR 456 (456) Q Consensus 450 ~~~d~~~ 456 (456) +..+|.- T Consensus 461 ~~~~~~~ 467 (486) T protein:vir:42 461 PTVPGSP 467 (486) T ss_pred CCCCCCC Confidence 0000000 No 8 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=1.3e-88 Score=502.39 Aligned_cols=432 Identities=18% Similarity=0.190 Sum_probs=352.3 Q ss_pred CC--------CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 
1 MT--------ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~--------~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |+ .+.+..+++.|+++|..+.+|++++++||+|+|+|+++++..+++++. .++++||+++|||+.++||+ T Consensus 1 ~~~~i~~~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~--~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:24 1 MTAPLPGQEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQS--LLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCCCCCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcccchhhhh--hhhccchHHHHHHHHhhhhc Confidence 22 234567889999999999999999999999999999999988887764 46889999999999999999 Q ss_pred cCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC--------CceEEEEEccceeEEEEeC Q lcl|NC_021301. 73 PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD--------GTATITADSPETMVVSVDP 144 (456) Q Consensus 73 ~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d--------g~~~i~~~~p~~~~~~~d~ 144 (456) ++||++.. +.+..+.++++|++|+|+.++.++++++++|||||++||.+++ |.++|+++||++++++||+ T Consensus 79 ~~g~~~~~--~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~ 156 (485) T protein:vir:24 79 VEGFRLGD--ADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDP 156 (485) T ss_pred cCceecCC--CchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeC Confidence 99998753 4456778999999999999999999999999999999999875 5578999999999999999 Q ss_pred CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC---- Q lcl|NC_021301. 145 LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN---- 220 (456) Q Consensus 145 ~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n---- 220 (456) ..++...++.+++.+.++...+..+|+++.++.+.. 
.++.|......+|.++.||||+|.| T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~---------------~~~~~~~~~~~~h~~g~vPvv~f~n~~~~ 221 (485) T protein:vir:24 157 RIGRPAKAIRVAYDAEGNEIQAATLYTPNETFGWFR---------------AEGEWVEWFSDPHGLGAVPVVPLPNRTRL 221 (485) T ss_pred CcCceeEEEEEEEeecCCeEEEEEEEcCCcEEEEEe---------------cCCceEeecccccCCCcccEEEeccCccc Confidence 888766666666665566677788999998888753 2345666667788899999988866 Q ss_pred --CCCCCcHhH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc-ccccchhhhhhhhhhhccceeccC-CCc Q lcl|NC_021301. 221 --PDGMGEVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV-DENGNAIDYASIFEAAPGALWELP-PGV 295 (456) Q Consensus 221 --~~g~s~~~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-~d~ 295 (456) ++|.|+|++ +++|||+||+++|+++++++++++|++|++|.+...+.. ++.+ ...+....+++|..+ +++ T Consensus 222 ~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~-----~~~~~~~~~~i~~~~~~~~ 296 (485) T protein:vir:24 222 SDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETG-----QTLFDAYLARILAFEDAEG 296 (485) T ss_pred CCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccc-----cchhhhcccceeccCCCCc Confidence 378999985 899999999999999999999999999999987654432 2222 234566778888775 689 Q ss_pred eeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021301. 
296 DIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI 374 (456) Q Consensus 296 ~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 374 (456) ++++++.+++++|+++++.++++|++++++|+..||+.+.| +||+||++++.+|++||+++++.|+++|++++++++++ T Consensus 297 ~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~ 376 (485) T protein:vir:24 297 KIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRL 376 (485) T ss_pred eEEeecccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999987777 69999999999999999999999999999999999887 Q ss_pred cCC----CcccceeEEecCCCCcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhHHHHHHHHHHHHHHH--HHhhhh Q lcl|NC_021301. 375 EGE----SVEDTVDVSFESPDRVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQIT--LFAGNS 446 (456) Q Consensus 375 ~~~----~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~--~~~~~~ 446 (456) .+. .+...++++|+++.|+|.++.+|+++||+++| ++|++|+++++||+++++++++..+.++... ...+.. T Consensus 377 ~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~~~~~~~~~~~~ 456 (485) T protein:vir:24 377 MKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEEAAMGLGLLGTM 456 (485) T ss_pred hcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHHhhhhhhHHHhh Confidence 542 23457899999999999999999999999875 8999999999999998877654322222111 111111 Q ss_pred hhhcc-cccC-----C Q lcl|NC_021301. 447 VQRPQ-EDGS-----R 456 (456) Q Consensus 447 ~~~~~-~d~~-----~ 456 (456) ..... .++. . 
T Consensus 457 ~~~~~~~~~~~~~~e~ 472 (485) T protein:vir:24 457 VDADPTVPGSPNPTPA 472 (485) T ss_pred cccCCCCCCCCCCCCC Confidence 11111 1110 1 No 9 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=2.8e-88 Score=500.57 Aligned_cols=431 Identities=20% Similarity=0.214 Sum_probs=353.9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC Q lcl|NC_021301. 3 ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) Q Consensus 3 ~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~ 82 (456) -.|+.++|+.|+++|..+++|+.++++||+|+|++++.++..+++++. +++++||+++|||+.++||.++||+... T Consensus 1 ~~t~~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~--~~~~~n~~~~ivd~~~~~l~~~g~~~~~-- 76 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE-- 76 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHhh--hhhhcchHHHHHHHHHhhhccCceecCC-- Confidence 457888999999999999999999999999999999999888877664 4788999999999999999999998653 Q ss_pred cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEee------CCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
83 DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR------RDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 83 d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~------d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) |.+..+.++++|+.|+|+.++.++++++++|||||++||+ |++|.+++.+++|.+++++||+...+++.+++++ T Consensus 77 d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~~~~~~~i~~ 156 (480) T protein:vir:78 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) T ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEE Confidence 5566788999999999999999999999999999999997 4578899999999999999999988889999999 Q ss_pred EEecC--CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeec-ccccccCceeEEEEccC------CCCCCcH Q lcl|NC_021301. 157 WRDLD--AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPV-GDAVVTGSPPPVVVYQN------PDGMGEV 227 (456) Q Consensus 157 ~~~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~pvv~~~n------~~g~s~~ 227 (456) |...+ +...+..+|+++.++.|...... ...|... ...+|.+++|||++|.| ++|+|+| T Consensus 157 ~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~s~i 224 (480) T protein:vir:78 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGL------------NDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) T ss_pred EEeecCCCceEEEEEEeCCeEEEEEecCCC------------ccccccccccccCCCCCcceEEeecccccCCccCcccc Confidence 86544 55677889999998887643211 1112221 23568888899887755 4789999 Q ss_pred hH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceecc-CCCceeEeecccch Q lcl|NC_021301. 228 EP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWEL-PPGVDIWESQTNDF 305 (456) Q Consensus 228 ~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~d~~~~~~~~~~~ 305 (456) ++ +++|+|+||+++|+++++++++++|+++++|.+...+..+..+. .+....+.+|.. 
++++++++++.+++ T Consensus 225 ~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (480) T protein:vir:78 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT------TLDIYYGRILTLASEAAKISEFKAAEL 298 (480) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccc------hhhhhhhhhccCCCCCceEEecCccCH Confidence 86 89999999999999999999999999999998765443322222 244455666654 57799999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---Cccc Q lcl|NC_021301. 306 TPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE---SVED 381 (456) Q Consensus 306 ~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~---~~~~ 381 (456) ++|+++++.++++|++++++|+..||+.+.| +||+||++++.+|+.||+++++.|+++|++++++++++.|. .+.. T Consensus 299 ~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~ 378 (480) T protein:vir:78 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) T ss_pred HHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccce Confidence 9999999999999999999999999988777 59999999999999999999999999999999999999874 2345 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .++++|+++.++|.++.+++++|++++| ++|++|+++.+||+++++++++.++.++..+...... ...+.++.. 
T Consensus 379 ~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~~~~~~-~~~~~~~~~ 454 (480) T protein:vir:78 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLY-STTKAQADA 454 (480) T ss_pred eeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhh-ccccccCCC Confidence 7899999999999999999999999876 7899999999999999888766443333322221111 100111111 No 10 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=4.2e-88 Score=499.61 Aligned_cols=432 Identities=20% Similarity=0.213 Sum_probs=353.1 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC Q lcl|NC_021301. 3 ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) Q Consensus 3 ~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~ 82 (456) -.|+.++|+.|+++|..+++|++++++||+|+|+++++++..+++++. +++++||+++|||+.++||+++||+... T Consensus 1 ~~t~~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~--~~~~~n~~~~ivd~~~~~l~~~g~~~~~-- 76 (480) T protein:vir:78 1 MTTYHEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAY--LDVQPGWVATYLRTLSDRLDIEGFRISE-- 76 (480) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccccchhhhh--hhhhcchHHHHHHHHHhhhccCceecCC-- Confidence 457888999999999999999999999999999999999888887763 4688999999999999999999998653 Q ss_pred cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEee------CCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
83 DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR------RDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 83 d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~------d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) |.+..+.++++|++|+|+.++.++++++++||+||++||+ +++|.+++.+++|++++++||+...+++.+++++ T Consensus 77 d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~~~~~~~i~~ 156 (480) T protein:vir:78 77 DSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNTRRVTRAVRL 156 (480) T ss_pred CchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCccceEEEEEE Confidence 4556788999999999999999999999999999999996 4688899999999999999999988889999998 Q ss_pred EEecC--CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceee-cccccccCceeEEEEccC------CCCCCcH Q lcl|NC_021301. 157 WRDLD--AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVP-VGDAVVTGSPPPVVVYQN------PDGMGEV 227 (456) Q Consensus 157 ~~~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~pvv~~~n------~~g~s~~ 227 (456) |...+ +...+..+|+++.++.|...... ...|.. ....+|.++.||||+|.| ++|.|+| T Consensus 157 ~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~g~vPvv~f~n~~~~~~~~G~sdi 224 (480) T protein:vir:78 157 YTTRDDVAVPDRATLYLPDETVPLRRNGGL------------NDQWVVDGDVIKHGLGVVPVVPLTNDPRLGNRYGRSEI 224 (480) T ss_pred EEeecCCcceEEEEEEeCCeEEEEEecCCC------------cccccccccccccCCCCcceEEeecccccCCccCccch Confidence 86544 44567889999998887643211 111222 233568888889887755 3689999 Q ss_pred hH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceecc-CCCceeEeecccch Q lcl|NC_021301. 228 EP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWEL-PPGVDIWESQTNDF 305 (456) Q Consensus 228 ~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~d~~~~~~~~~~~ 305 (456) ++ |++|+|+||+++|++.++++++++|+++++|.+.+.+..+..+. .+....+.++.. 
++++++++++.+++ T Consensus 225 ~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 298 (480) T protein:vir:78 225 SPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT------TLDIYYGRILTLASEAAKISEFKAAEL 298 (480) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccc------hhhhhhhhhccCCCCCceEEecCccCH Confidence 86 89999999999999999999999999999998765544333222 234445555554 46789999999999 Q ss_pred HHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---ccc Q lcl|NC_021301. 306 TPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES---VED 381 (456) Q Consensus 306 ~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~---~~~ 381 (456) ++|+++++.++++|++++++|++.||+.+.| +||+||++++.+|++||+++++.|+++|++++++++++.|.. +.. T Consensus 299 ~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~~ 378 (480) T protein:vir:78 299 RNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT 378 (480) T ss_pred HHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccce Confidence 9999999999999999999999999987777 599999999999999999999999999999999999988743 345 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhh---hhhhhcc----- Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITLFAG---NSVQRPQ----- 451 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~---~~~~~~~----- 451 (456) .++++|+++.++|.++.+++++||+++| ++|++|+++++||+++++++++..+.+++.+.... 
.....++ T Consensus 379 ~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (480) T protein:vir:78 379 RLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKP 458 (480) T ss_pred eeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCC Confidence 7999999999999999999999999876 78999999999999998877654333333222211 1111100 Q ss_pred cccCC Q lcl|NC_021301. 452 EDGSR 456 (456) Q Consensus 452 ~d~~~ 456 (456) ..|+. T Consensus 459 ~~~~~ 463 (480) T protein:vir:78 459 TVTET 463 (480) T ss_pred CCCCC Confidence 01111 No 11 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=1.2e-87 Score=497.21 Aligned_cols=434 Identities=17% Similarity=0.167 Sum_probs=351.7 Q ss_pred CCC---CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MTA---STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~~---~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |.+ .++.+|++.|+++|..+++|++++++||+|+|+|+++++..+++++ ++++++|||++|||+++++|+.+||+ T Consensus 1 ~~~~~~~d~~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~--~~~~~~n~~~~ivd~~a~~l~~~Gf~ 78 (488) T protein:vir:23 1 MAETESIDPEKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAVPLDMR--KYLAHVGYPRTYVDAIAERQELEGFR 78 (488) T ss_pred CCcccCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCcccchhhh--hhhhhcchHHHHHHHHHHhhhcccee Confidence 543 4478899999999999999999999999999999999998888776 34788999999999999999999987 Q ss_pred cCCC--------CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC--------CCCceEEEEEccceeEEE Q lcl|NC_021301. 
78 VGGS--------ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR--------DDGTATITADSPETMVVS 141 (456) Q Consensus 78 ~~~~--------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d--------~dg~~~i~~~~p~~~~~~ 141 (456) +..+ .|++....++++|++|+|+.++.++++++++|||||++||++ +++.++|++++|++++++ T Consensus 79 ~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~ 158 (488) T protein:vir:23 79 IPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPTALYAE 158 (488) T ss_pred ccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccceeEEE Confidence 6432 456677889999999999999999999999999999999874 566789999999999999 Q ss_pred EeCCCCceEEEEEEEEEe-cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC Q lcl|NC_021301. 142 VDPLQPWRIRSAMRWWRD-LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN 220 (456) Q Consensus 142 ~d~~~~~~~~~~~~~~~~-~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n 220 (456) ||+..++...+ +++|+. .++...+..+|+++.++.|.. .++.|......+|.+++||||+|.| T Consensus 159 ~d~~~~~~~~~-~~~~~~~~~~~~~~~~~y~~~~~~~~~~---------------~~~~~~~~~~~~h~~g~vPvv~f~n 222 (488) T protein:vir:23 159 VDPRTRKVLYA-IRAIYGADGNEIVSATLYLPDTTMTWLR---------------AEGEWEAPTSTPHGLEMVPVIPISN 222 (488) T ss_pred EecCCCceEEE-EEEEEecCCCcEEEEEEEecCcEEEEEe---------------cCCceEeccccccCCCCcceEEecc Confidence 99987765444 555544 445566788999999888753 2344656666778889999988755 Q ss_pred C------CCCCcHhH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCC Q lcl|NC_021301. 221 P------DGMGEVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPP 293 (456) Q Consensus 221 ~------~g~s~~~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (456) . +|+|+|++ +++|+|+||+++|+++++++++++|+++|+|++...+..+.. 
.....+....+++|.+++ T Consensus 223 ~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~----~~~~~~~~~~~~v~~~~~ 298 (488) T protein:vir:23 223 RTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAE----TGQRMFDAYMARILAFEG 298 (488) T ss_pred ccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccc----ccchhhhhhhhhhccCCC Confidence 2 68999985 899999999999999999999999999999987654332211 122456677788888754 Q ss_pred --CceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 294 --GVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK 370 (456) Q Consensus 294 --d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l 370 (456) ++++++++.+++++|+++++.++++|++.+++|++.||+.+.| +||+||++++.+|++||+++++.|+++|++++++ T Consensus 299 g~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l 378 (488) T protein:vir:23 299 GEGAHAEQFSAAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRL 378 (488) T ss_pred CCCceeEecCCCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4789999999999999999999999999999999999987777 5999999999999999999999999999999999 Q ss_pred HHHhcCCC----cccceeEEecCCCCcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHh- Q lcl|NC_021301. 371 ALQIEGES----VEDTVDVSFESPDRVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITLFA- 443 (456) Q Consensus 371 ~~~~~~~~----~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~- 443 (456) ++++.+.. +..+++++|+++.|+|.++.+|+++||+++| ++|++|+++++||+++++++++..+.+++..... 
T Consensus 379 ~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~ 458 (488) T protein:vir:23 379 AYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGL 458 (488) T ss_pred HHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHH Confidence 99886632 3357999999999999999999999999876 7999999999999988777655432222221111 Q ss_pred -hhhhhhcccc--------cCC Q lcl|NC_021301. 444 -GNSVQRPQED--------GSR 456 (456) Q Consensus 444 -~~~~~~~~~d--------~~~ 456 (456) ........++ |+. T Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~ 480 (488) T protein:vir:23 459 IGSLYGASTPEGKPGEAPVGEP 480 (488) T ss_pred HHHHhccCCCcccCCCCCCCCC Confidence 1111111111 111 No 12 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=4.1e-87 Score=494.17 Aligned_cols=432 Identities=15% Similarity=0.157 Sum_probs=346.6 Q ss_pred CC----------------CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHH Q lcl|NC_021301. 1 MT----------------ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVR 64 (456) Q Consensus 1 ~~----------------~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iV 64 (456) || ++++.++++.|+.+|..+.+|++++.+||+|+|+++++++..+++++.. ++++||+++|| T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p~~~~~~--~~v~n~~~~iV 78 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIPPEYLRT--ATVLGWSAKAV 78 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHhccccchhccccccHHHHHH--hhccCcHHHHH Confidence 22 2334568999999999999999999999999999999999999988743 58899999999 Q ss_pred HHHHhhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc--eEEEEEccceeEEEE Q lcl|NC_021301. 
65 DSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT--ATITADSPETMVVSV 142 (456) Q Consensus 65 d~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~--~~i~~~~p~~~~~~~ 142 (456) |++++++.++||+... +.+....++++|+.|+|+.++.++++++++|||||++||.+++|+ ++|+++||++++++| T Consensus 79 d~~a~rl~~~Gf~~~d--~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iy 156 (504) T protein:vir:99 79 DTLARRCNLESFVWPD--GDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEW 156 (504) T ss_pred HHHHhhhccceeeCCC--CChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEE Confidence 9999999999998753 334467799999999999999999999999999999999999886 468999999999999 Q ss_pred eCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC-- Q lcl|NC_021301. 143 DPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN-- 220 (456) Q Consensus 143 d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n-- 220 (456) |+..++...++.+++.+.+|+.....+|+++.++.+... ..+.|..... +|.+++ |||+|.| T Consensus 157 D~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~--------------~~~~~~~~~~-~~~~gv-PvV~~~n~~ 220 (504) T protein:vir:99 157 NSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMD--------------DDGDWHADVR-THKLGV-PVEVLPYKP 220 (504) T ss_pred eCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEc--------------CCceeeeccc-cCCCCc-ceEEecccc Confidence 998876565555555677777888889999988877532 1233444433 555674 5666654 Q ss_pred ----CCCCCcHh-HHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCC-- Q lcl|NC_021301. 221 ----PDGMGEVE-PHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPP-- 293 (456) Q Consensus 221 ----~~g~s~~~-~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 293 (456) ++|.|+|+ ++++|+|+||++++++.++++|+++|++|++|+..+++. +++|++. 
..+....+++|.+++ T Consensus 221 ~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~-~~d~~~~---~~~~~~~~~i~~~~~~~ 296 (504) T protein:vir:99 221 REDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFR-NKDGSMK---PAWQIALARVFALPDDE 296 (504) T ss_pred cCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccc-ccccccc---chhhhhhhhhhcCCCcc Confidence 47899986 799999999999999999999999999999998765543 3334332 345555666666543 Q ss_pred --------CceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccc--ccCcHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 294 --------GVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPD--SANQSAEGAHNIEKGFLFKCEDRLSIAKIG 363 (456) Q Consensus 294 --------d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~--~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~ 363 (456) ++++++++++++++|+++++.++++|+++|+||++.||.. .+|+||+||++++.+|.++++++++.|+++ T Consensus 297 ~~~~~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~ 376 (504) T protein:vir:99 297 DEPDAARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPA 376 (504) T ss_pred ccccccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4789999999999999999999999999999999999854 356899999999999999999999999999 Q ss_pred HHHHHHHHHHhcCCC-----cccceeEEecCCCCcCHHHHHHHHHHHHhcCCC---cHHHHHHhCCCChhHHHHHHHHHH Q lcl|NC_021301. 364 LEAILVKALQIEGES-----VEDTVDVSFESPDRVTLGEKYAAASLAKAAGES---WASIRRNILNYNADQIKQDDLDRA 435 (456) Q Consensus 364 l~~~~~l~~~~~~~~-----~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~---s~~t~~~~~~~~~~~~~~~e~~~~ 435 (456) |++++++++++.+.. ...+++++|+++.++|.++.||+++||+++|.. ..+++++++|++++++++++.++. 
T Consensus 377 l~~~~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~ 456 (504) T protein:vir:99 377 FRRSMIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTPQQAKRALAERR 456 (504) T ss_pred HHHHHHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCHHHHHHHHHHHH Confidence 999999999886532 336789999999999999999999999998852 357889999999999888766655 Q ss_pred HHHHHHHhhhhhhhc----ccc-----c--CC Q lcl|NC_021301. 436 REQITLFAGNSVQRP----QED-----G--SR 456 (456) Q Consensus 436 ~ee~~~~~~~~~~~~----~~d-----~--~~ 456 (456) +++............ ..+ . +. T Consensus 457 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e~ 488 (504) T protein:vir:99 457 RASSVSIIEALNRRQQEAATAGEDQDQGAGEP 488 (504) T ss_pred HHhhHHHHHHHhcccCCCCCCCCCCCcCCCCC Confidence 544333222211111 000 0 00 No 13 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=3.3e-87 Score=494.72 Aligned_cols=432 Identities=14% Similarity=0.115 Sum_probs=356.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |+. ++.++++.|+++|..+.+|++++++||+|+|+++++++.+|++++.. ++++|||+++||++++++..+||+..+ T Consensus 12 l~~-~~~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p~~~r~~--~~v~nw~~~~Vd~~a~rl~~~Gf~~~d 88 (474) T protein:vir:81 12 LSN-DENALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIPPQYFNL--GLVLGWTGKAVDALARRCNLEGFVWPD 88 (474) T ss_pred CCh-hHHHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhccccccHHHHHH--HhhcChHHHHHHHHHhhhcccceECCC Confidence 443 46789999999999999999999999999999999999999999854 578999999999999999999999754 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc--eEEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 
81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT--ATITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~--~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) + .+....++++|+.|+|+..+.+++++|++|||||++|+.+++|. ++|+++||++++++|||..++...++.++.+ T Consensus 89 ~--~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~~~~~~al~~~~~ 166 (474) T protein:vir:81 89 G--DLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRRRGLNNLLSIIDK 166 (474) T ss_pred C--CccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCCCcceeeeEEEEE Confidence 3 22345689999999999999999999999999999999987775 6799999999999999998876666666677 Q ss_pred ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCC------CCCCcH-hHHH Q lcl|NC_021301. 159 DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP------DGMGEV-EPHI 231 (456) Q Consensus 159 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~------~g~s~~-~~v~ 231 (456) +.+|+....++|+++.++.+... .....|.... .+|.+|+ |+|++.|. +|+|+| ++++ T Consensus 167 ~~~g~~~~~~ly~~~~~~~~~~~-------------~~~~~w~~~~-~~~~~gv-PvV~~~n~~~~~~~~G~s~i~e~v~ 231 (474) T protein:vir:81 167 DKEGKVLSLALYLDNETVTAQRD-------------KATLKWQVDR-DEHVYGV-PAQVLPYKPAPKRPFGQSRITKPMM 231 (474) T ss_pred cCCCcEEEEEEEeCCcEEEEEEc-------------Cccceeeecc-CCCCCCc-ceEEecccccccCcCCccccchhHH Confidence 88888888999999998877532 1223344443 3455565 56666543 899998 5899 Q ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCC----------ceeEeec Q lcl|NC_021301. 232 DIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPG----------VDIWESQ 301 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d----------~~~~~~~ 301 (456) +|+|++|++++++.++++|+++|++|++|++...+. +++|++. 
..++...+++|.++++ ++++||+ T Consensus 232 ~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~-d~d~~~~---~~~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~ 307 (474) T protein:vir:81 232 GLQDAGVRELARREGHMDVFSYPEFWLLGADESALK-NADGTIK---SVWEARLGRIKGLPDDADADIPQLARADVKQFP 307 (474) T ss_pred HHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcc-ccccccc---chhhhhHHHHhcCCCcccccccccccccccccC Confidence 999999999999999999999999999999876543 3344332 3566667777776654 5789999 Q ss_pred ccchHHHHHHHHHHHHHHHhhcCCChhhhccc-ccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC- Q lcl|NC_021301. 302 TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPD-SAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES- 378 (456) Q Consensus 302 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~-~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~- 378 (456) ++++++|++.++.++.+|+++|+||+++||.. ++| +||+||++++.+|..|++++++.|+.+|++++++++++.|.. T Consensus 308 ~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~ 387 (474) T protein:vir:81 308 AASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVA 387 (474) T ss_pred CCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 99999999999999999999999999999854 466 699999999999999999999999999999999999998742 Q ss_pred ------cccceeEEecCCCCcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhc Q lcl|NC_021301. 379 ------VEDTVDVSFESPDRVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRP 450 (456) Q Consensus 379 ------~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~ 450 (456) .+++++++|.++..++.++.||+++||+++| +++++++++++|++++++++++.++.+++............ 
T Consensus 388 ~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~ 467 (474) T protein:vir:81 388 IDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTLQALIDRS 467 (474) T ss_pred ccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcC Confidence 2357899999999999999999999999986 56789999999999999988776665555443333322222 Q ss_pred cc-ccCC Q lcl|NC_021301. 451 QE-DGSR 456 (456) Q Consensus 451 ~~-d~~~ 456 (456) ++ ..++ T Consensus 468 ~~~~~aq 474 (474) T protein:vir:81 468 NNGATAQ 474 (474) T ss_pred CCCCCCC Confidence 22 2222 No 14 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=1.9e-85 Score=485.05 Aligned_cols=419 Identities=17% Similarity=0.135 Sum_probs=346.6 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCC Q lcl|NC_021301. 2 TASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS 81 (456) Q Consensus 2 ~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~ 81 (456) -++++.+|++.|+.+|..+++|++++.+||+|+|+++..++..++.++ ++++++|||++|||+.++|+.++||+... T Consensus 1 ~~~~~~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~--~~k~~~n~~~~ivd~~~~~l~~~g~~~~d- 77 (441) T protein:vir:80 1 MNSDELALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQ--RVQTVVSWPGIAVDALEERLDWLGWTNGD- 77 (441) T ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcccchhhh--hhhhhcchHHHHHHHHHhhhccccccCCC- Confidence 556677789999999999999999999999999999999988888766 45789999999999999999999997532 Q ss_pred CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecC Q lcl|NC_021301. 
82 ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLD 161 (456) Q Consensus 82 ~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d 161 (456) .+.++++|+.|+|+.++.+++++++++||||++||.|++|.+++++++|++++++||+..++...++++++. .+ T Consensus 78 -----~~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~~~~~~-~~ 151 (441) T protein:vir:80 78 -----GYGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGSRLDAGLVVQQT-CD 151 (441) T ss_pred -----hHHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCCceeEEEEEEEE-ec Confidence 246899999999999999999999999999999999999999999999999999999988776666655554 45 Q ss_pred CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCC------CCCCcHhH-HHHHH Q lcl|NC_021301. 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNP------DGMGEVEP-HIDII 234 (456) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~------~g~s~~~~-v~~li 234 (456) +...+..+|+++.++.|... ..+.|......+|.++.||||++.|. +|+|+|++ +++|| T Consensus 152 ~~~~~~~vy~~~~~~~~~~~--------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~l~~~v~~li 217 (441) T protein:vir:80 152 PEVVEAELLLPDVIVQVERR--------------GSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDGRSEITRSIRAYT 217 (441) T ss_pred CceEEEEEEecCeEEEEEEc--------------CCcceeeccccccCCCceeEEEeeccccCCccCCcccchhhHHHHH Confidence 55667789999888877532 22334455567788888998887653 68999975 89999 Q ss_pred HHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCC-----ceeEeecccchHHHH Q lcl|NC_021301. 
235 NRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPG-----VDIWESQTNDFTPML 309 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-----~~~~~~~~~~~~~~~ 309 (456) |+||+++|++.++++++++|+++++|.+.+.+..+ .+....+++|.++++ +++++++.+++++|+ T Consensus 218 Da~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~----------~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (441) T protein:vir:80 218 DEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQP----------GWVLSMASVWAVDKDDDGDTPNVGSFPVNSPTPYS 287 (441) T ss_pred HHHHHHHHHHHHHHHhhcCceeeeecCCccccccc----------hhhhcccccccCCCCCCCCcceeEecCccchHHHH Confidence 99999999999999999999999999865433211 123344566665433 678999999999999 Q ss_pred HHHHHHHHHHHhhcCCChhhhcccccCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----ccce Q lcl|NC_021301. 310 SAIKEHIRQLSSATKTPLPMLMPDSANQ-SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-----EDTV 383 (456) Q Consensus 310 ~~l~~~~~~i~~~~~~p~~~~~~~~~N~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-----~~~i 383 (456) ++++.++++|++++++|++.||+.+.|+ ||+||++++.+|.++|+++++.|+++|++++++++++.+... ...+ T Consensus 288 ~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i 367 (441) T protein:vir:80 288 DQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDV 367 (441) T ss_pred HHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceee Confidence 9999999999999999999999877774 999999999999999999999999999999999998876432 2578 Q ss_pred eEEecCCCCcCHHHHHHHHHHHHhcCCC--cHHHHHHhCCCChhHHHHHHHHHHHHH--HHHHhhhhhhhcccc Q lcl|NC_021301. 
384 DVSFESPDRVTLGEKYAAASLAKAAGES--WASIRRNILNYNADQIKQDDLDRAREQ--ITLFAGNSVQRPQED 453 (456) Q Consensus 384 ~v~f~~~~~~~~~e~ad~~~kl~~~g~~--s~~t~~~~~~~~~~~~~~~e~~~~~ee--~~~~~~~~~~~~~~d 453 (456) +++|+++.|+|.++.||+++||+++|++ |++|+++.+|++++++++++.++.+++ +..+.+....++++= T Consensus 368 ~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~e~~~~~~e~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 368 GLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDVQVEAVMRHRAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred eEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHHHHHHHHHHHHHHHHHHHHHhhhhhcccccC Confidence 9999999999999999999999999975 678999999999998887765543332 233333333333333 No 15 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=1.8e-83 Score=474.24 Aligned_cols=426 Identities=14% Similarity=0.152 Sum_probs=332.0 Q ss_pred CCCCCHHHHH-HHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchh-hhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTPAEWL-PVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA-WRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~~~~-~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~-~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) |++....+++ ..|+++|..+.+|++++++||+|+|+|++++.+.++. .+...+++++|||++|||+.++|++++||+. T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~~l~~~gf~~ 88 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQQLIVDGYRK 88 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHhhcccccccC Confidence 6655554444 4788999999999999999999999999887666554 3445667789999999999999999999986 Q ss_pred CCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEee-----CCCCceEEEEEccceeEEEEeCCCCceEEEE Q lcl|NC_021301. 
79 GGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR-----RDDGTATITADSPETMVVSVDPLQPWRIRSA 153 (456) Q Consensus 79 ~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~-----d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~ 153 (456) . |.+....++++|+.|+|+.++.++++++++|||||++||+ |++|.+++++++|++++++||+........ T Consensus 89 ~---d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~iydd~~~~~~~~- 164 (479) T protein:vir:99 89 T---GTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPRDAFAIWEDPYWDEWPK- 164 (479) T ss_pred C---CchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechhheEEEecCCcccceee- Confidence 5 3345667899999999999999999999999999999996 567889999999999999997765443222 Q ss_pred EEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC-----CCCCCcHh Q lcl|NC_021301. 154 MRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN-----PDGMGEVE 228 (456) Q Consensus 154 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n-----~~g~s~~~ 228 (456) ++...++ .....+|+...++.+.. ..+.|......+|.+++|||++|.| ++|+|+|+ T Consensus 165 --~~~~~~~-~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~g~sd~e 226 (479) T protein:vir:99 165 --YLLERQP-NGQYWWWTEEDYSIFEF---------------KQGKFIYRETVSHDYGHIPFVRYVNVMDLRGVCYGDVE 226 (479) T ss_pred --EEEeecC-ceeEEEEecceEEEEEe---------------cCCceeeccccccCCCCcceEEeecCCCcCcCCcchhH Confidence 2222222 22345566655544432 2345666666788889899888755 47999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhc-cceeccCCCceeEeecccchHH Q lcl|NC_021301. 229 PHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAP-GALWELPPGVDIWESQTNDFTP 307 (456) Q Consensus 229 ~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~ 307 (456) ++++|||+||+++|++.++.+++++|++|++|........ +. ...+.... 
+.++..++++++++++++++++ T Consensus 227 ~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~---~~----~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~ 299 (479) T protein:vir:99 227 PLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGAN---AD----QEKMRFAQESMLISQNEKASFGAIPAAPLDG 299 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccc---cc----hhccccccccceeecCCCceEEEecccchHH Confidence 9999999999999999999999999999999976432211 11 11122222 3344567889999999999999 Q ss_pred HHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---cccee Q lcl|NC_021301. 308 MLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV---EDTVD 384 (456) Q Consensus 308 ~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~---~~~i~ 384 (456) |+++++.++++|+++++||++.||. ++|+||+||++++.+|+++|+.+++.|+.+|++++++++++.|... ...++ T Consensus 300 ~~~~l~~~i~~i~~~t~~p~~~~g~-~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~~~~~~~i~ 378 (479) T protein:vir:99 300 LLNAYKESLLEFLALAQLPPHIAGQ-IVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTEEATDLDFT 378 (479) T ss_pred HHHHHHHHHHHHhccCCCCHHHccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccceeee Confidence 9999999999999999999999974 6789999999999999999999999999999999999999988643 35789 Q ss_pred EEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhC-CCChhHHHHHHHHHH-HHHHHHHhhhhhhhcc---cccCC Q lcl|NC_021301. 385 VSFESPDRVTLGEKYAAASLAKAAGESWASIRRNIL-NYNADQIKQDDLDRA-REQITLFAGNSVQRPQ---EDGSR 456 (456) Q Consensus 385 v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~-~~~~~~~~~~e~~~~-~ee~~~~~~~~~~~~~---~d~~~ 456 (456) ++|+++.++|.++.+|+++||+++|++|++|+++++ |++++++++++.++. +++...........+. ++|+. 
T Consensus 379 ~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 455 (479) T protein:vir:99 379 ITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNGPDPAEQRGGP 455 (479) T ss_pred EEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCCC Confidence 999999999999999999999999999999999988 678777666543322 2233333333322111 11111 No 16 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=6.3e-83 Score=471.25 Aligned_cols=386 Identities=14% Similarity=0.093 Sum_probs=325.9 Q ss_pred HHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh Q lcl|NC_021301. 17 IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD 96 (456) Q Consensus 17 ~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~ 96 (456) +..+.+|++++++||+|+|++++++++.|++++..+ ++++|||++|||++++++..+||+.. | ..++++|+. T Consensus 1 l~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~-~~v~nw~~~~Vds~a~rl~~~Gf~~~---d----~~l~~i~~~ 72 (410) T protein:vir:95 1 MNLYQSRVNLRYKHYAMQHYEAPTGITIPAHIRAKY-QAVLGWAAKGVDSLADRLIFRAFAND---D----FNVTEIFDR 72 (410) T ss_pred CCcchhhHHHHHHHhcCCCCccccchhccHHHHhHH-HhhcchhHHHHHHhHhhhccccccCC---C----chHHHHHhh Confidence 666788999999999999999999999999998755 57899999999999999999999742 2 248999999 Q ss_pred cChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEE-EecCCceEEEEEEcCCeE Q lcl|NC_021301. 
97 NRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWW-RDLDAESDFAIVWSGDGW 175 (456) Q Consensus 97 n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~-~~~d~~~~~~~~~~~~~~ 175 (456) |+|+..+.+++++|++|||||++|+++++|.++|+++||++++++|||.+++ +.++++++ .+.++......+|+++.+ T Consensus 73 N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~-~~~al~~~~~~~~~~~~~~~~~~~~~~ 151 (410) T protein:vir:95 73 NNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGL-LVEGYAVLARDDYNRPTLEAYFEPNAT 151 (410) T ss_pred cChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCc-eEEEEEEEEecCCCeEEEEEEEeCCcE Confidence 9999999999999999999999999999999999999999999999998765 55555554 455567778889999988 Q ss_pred EEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC------CCCCCcH-hHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 176 QKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN------PDGMGEV-EPHIDIINRINRAELQLLSTM 248 (456) Q Consensus 176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n------~~g~s~~-~~v~~liDa~~~~~s~~~~~~ 248 (456) +.+...+ +.| ..+|.+++||+|+|.| ++|+|+| +++++|+|++|++++++.+++ T Consensus 152 ~~~~~~~---------------~~~----~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~~l~da~~r~~~~~~~~~ 212 (410) T protein:vir:95 152 HFIPKDG---------------EPY----SVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGMYYQKYAKRTLERADITA 212 (410) T ss_pred EEEeeCC---------------ccc----cccCCCCCcceEEecccccCCccCCccccchhHHHHHHHHHHHHHHHHHHH Confidence 8775311 112 2467889999998865 4789998 579999999999999999999 Q ss_pred HHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCC-----ceeEeecccchHHHHHHHHHHHHHHHhhc Q lcl|NC_021301. 249 AIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPG-----VDIWESQTNDFTPMLSAIKEHIRQLSSAT 323 (456) Q Consensus 249 ~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-----~~~~~~~~~~~~~~~~~l~~~~~~i~~~~ 323 (456) +|+++|++|++|++.. |.+. 
..++...+++|.++.+ ++++||+++++++|+++++.++++|+++| T Consensus 213 e~~a~pqr~i~G~d~d-------~~~~---~~~~~~~~~i~~~~~~~~~~~~~v~q~~~~~l~~~~~~l~~l~~~~a~~s 282 (410) T protein:vir:95 213 EFYSWPQKYILGLDPD-------AEPM---EKWKATVSSLLTISSSDKGVKPSVGQFTTASMSPFTEQLRTAAAGFAGEM 282 (410) T ss_pred HHhcchhheeeccCCC-------CCcC---chhhhhhhhheeccCCCCCCcceEEecCCCChHHHHHHHHHHHHHHhhhc Confidence 9999999999997542 2222 3466677788887643 78999999999999999999999999999 Q ss_pred CCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-----cccceeEEec---CCCCcC Q lcl|NC_021301. 324 KTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-----VEDTVDVSFE---SPDRVT 394 (456) Q Consensus 324 ~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~-----~~~~i~v~f~---~~~~~~ 394 (456) +||++.||+.+.| +||+||++++.+|.+|++++++.|+.+|++++++++++.+.. ...++++.|. ++..++ T Consensus 283 ~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W~p~~d~~~~s 362 (410) T protein:vir:95 283 GLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKWEPLFEADANT 362 (410) T ss_pred CCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEeeecCCcchhh Confidence 9999999998888 599999999999999999999999999999999999887643 2356899999 567779 Q ss_pred HHHHHHHHHHHHhc--CCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhh Q lcl|NC_021301. 395 LGEKYAAASLAKAA--GESWASIRRNILNYNADQIKQDDLDRAREQITLFAG 444 (456) Q Consensus 395 ~~e~ad~~~kl~~~--g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~ 444 (456) .++.||+++||.++ |+++++++++.+||+++++.+.. 
.++..+-++ T Consensus 363 ~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~~~~~~~----~~e~~~~g~ 410 (410) T protein:vir:95 363 MTMIGDGVVKLNQALPGYINAETIRDLTGIAGDMSAKPV----VSEGGSNGE 410 (410) T ss_pred HHHHHHHHHHHHHhccCCccHHHHHHhcCCChHHHHHHH----HHHHHhCCC Confidence 99999999999998 79999999999999988655422 222222222 No 17 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=1.4e-82 Score=469.27 Aligned_cols=385 Identities=12% Similarity=0.067 Sum_probs=328.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcc Q lcl|NC_021301. 5 TPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) Q Consensus 5 t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~ 84 (456) =+.++|++|.+++..+.+|+.++++||+|+|+++++++..|++++..+ ++++|||++|||++++++..+||+.. | T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~-~~v~nw~~~iVds~a~rl~~~Gf~~~---d- 75 (409) T protein:vir:94 1 MTEKGIGYLRFKLSVHKRRAEMRYDQYAMKYVDRFKGITIPQALSQQY-RSILGWCAKGVDSLADRLVFREFEND---D- 75 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHhcccCchhhcChhhhHHHHHHH-hhhcchhHHHHHHhHhhcccCcccCC---c- Confidence 345689999999999999999999999999999999999999887754 57889999999999999999999742 2 Q ss_pred cHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEE-ecCCc Q lcl|NC_021301. 85 DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWR-DLDAE 163 (456) Q Consensus 85 ~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~-~~d~~ 163 (456) ..++++|+.|+|+..+.++++.|++|||||++||++++|.++|+++||++++++|||..++ +.++++++. 
+..++ T Consensus 76 ---~~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~~~~-~~~a~~~~~~d~~~~ 151 (409) T protein:vir:94 76 ---FTVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPITGL-LTEGYAVLERDENNN 151 (409) T ss_pred ---hHHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecCCCc-eeeeEEEEEecCCCc Confidence 3589999999999999999999999999999999999999999999999999999998765 666676664 34566 Q ss_pred eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC------CCCCCcH-hHHHHHHHH Q lcl|NC_021301. 164 SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN------PDGMGEV-EPHIDIINR 236 (456) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n------~~g~s~~-~~v~~liDa 236 (456) .....+|+++.++.+... .+.|.. .+|.+++||+|+|.| ++|+|+| +++++|+|+ T Consensus 152 ~~~~~~~~~~~~~~~~~~---------------~~~~~~---~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~~l~da 213 (409) T protein:vir:94 152 VVLEAHFLPDRTDYYYRD---------------SRNNIS---IANPTGHPLLVPIIHRPDAVRPFGRSRITRSGMYWQSN 213 (409) T ss_pred eEEEEEEecCcEEEEEec---------------CceeEe---eeCCCCCcceEEeccccccccccCccccchhHHHHHHH Confidence 677788999888876431 222322 356788899988876 4789999 579999999 Q ss_pred HHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccC-----CCceeEeecccchHHHHHH Q lcl|NC_021301. 237 INRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELP-----PGVDIWESQTNDFTPMLSA 311 (456) Q Consensus 237 ~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~d~~~~~~~~~~~~~~~~~ 311 (456) +|++++++.++++|+++|++|++|++.+ +++. 
..++...+++|.++ ++++++|++++++++|+++ T Consensus 214 ~~r~~~~~~~~~e~~a~pqr~i~G~d~d-------~~~~---~~~~~~~~~i~~~~~d~dg~~~~v~q~~~~~l~~~~~~ 283 (409) T protein:vir:94 214 AKRTLERADVTAEFYSFPQKYVTGLSDD-------AEPM---ETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQ 283 (409) T ss_pred HHHHHHHHHHHHHHhcChhheeEecCCC-------Cccc---chhhhhHHHhhcCCCCCCCCCceEEecCCCChhHHHHH Confidence 9999999999999999999999997532 2222 34666777888775 3468999999999999999 Q ss_pred HHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-----cccceeE Q lcl|NC_021301. 312 IKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-----VEDTVDV 385 (456) Q Consensus 312 l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~-----~~~~i~v 385 (456) ++.++++|+++|+||++.||+.++| +||+||++++.+|..+++++++.|+++|++++++++++.+.. +..++++ T Consensus 284 l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~~~~~~~~~~~v 363 (409) T protein:vir:94 284 LRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAPYLREQFRKTKP 363 (409) T ss_pred HHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccccccceE Confidence 9999999999999999999998888 599999999999999999999999999999999999887642 2257899 Q ss_pred EecCCC---CcCHHHHHHHHHHHHhcC--CCcHHHHHHhCCCChhH Q lcl|NC_021301. 386 SFESPD---RVTLGEKYAAASLAKAAG--ESWASIRRNILNYNADQ 426 (456) Q Consensus 386 ~f~~~~---~~~~~e~ad~~~kl~~~g--~~s~~t~~~~~~~~~~~ 426 (456) +|.|.. 
.++.++.||+++||+++| +.+.+++++.+||++++ T Consensus 364 ~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~d 409 (409) T protein:vir:94 364 KWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGGE 409 (409) T ss_pred EeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCCC Confidence 999654 445688999999999998 55679999999999876 No 18 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=4.4e-82 Score=466.61 Aligned_cols=398 Identities=14% Similarity=0.087 Sum_probs=326.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |+ ...++.|.+++..+.+|++++++||+|+|+++++++.++++++..+ ++++|||+++||++++++..+||+.. T Consensus 1 m~----~~~i~~L~~~~~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~-~~v~nw~~~~Vd~~a~rl~~~Gf~~~- 74 (422) T protein:vir:97 1 MN----YMGMGYLRRKLALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMY-RSVLEWTAKGVDSLADRIIFREFTND- 74 (422) T ss_pred CC----hHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHH-HhhcchhHHHHHHHHhccccceeeCC- Confidence 43 3367788888999999999999999999999999999999998765 57789999999999999999999853 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC-CCceEEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 
81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD-DGTATITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~-dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) | ..++++|+.|+|+..+.+++++|++|||||++|++++ +|.|+|+++||++++++|||.+++...++.++..+ T Consensus 75 --d----~~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~~~~~~~a~~~~~~~ 148 (422) T protein:vir:97 75 --D----FNAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPTTFLLTEGYAILESD 148 (422) T ss_pred --c----hhHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCCCCcceeeEEEEEec Confidence 2 2489999999999999999999999999999999986 78899999999999999999987765555555455 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC------CCCCCcH-hHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN------PDGMGEV-EPHID 232 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n------~~g~s~~-~~v~~ 232 (456) .++......+|++..++.+.. ++.+ . ..+|.+++||+|++.| ++|+|+| +++++ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~----------------~~~~--~-~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~ 209 (422) T protein:vir:97 149 SNGNPTLEAYFTDKDIWYYPK----------------KGKP--Y-NIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMY 209 (422) T ss_pred CCCcEEEEEEEcCceEEEEcC----------------CCcc--c-cccCCCCCcceEEecccCCCccccCccccchhHHH Confidence 556555444444444433321 1111 1 2467788899988876 3799999 68999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCC-----ceeEeecccchHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPG-----VDIWESQTNDFTP 307 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-----~~~~~~~~~~~~~ 307 (456) |+|+||++++++.++++|+++|++|++|++.. |.+. 
..++...+++|.++++ ++++||+++++++ T Consensus 210 l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d-------~~~~---~~~~~~~~~i~~~~~de~~~~~~v~q~~~~~l~~ 279 (422) T protein:vir:97 210 HQKAAKRTLERAEVTAEFYSFPQKYVLGMDPD-------AKPM---EKWRATVSTLLEISKDEDGDKPTVGQFTTASMAP 279 (422) T ss_pred HHHHHHHHHHHHHHHHHHhcchhhhhcccCcc-------cccC---chhhhhhhhhhccCCCCCCCcceeeecCCCChhH Confidence 99999999999999999999999999998542 2222 3456666788887643 6899999999999 Q ss_pred HHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----cc Q lcl|NC_021301. 308 MLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-----ED 381 (456) Q Consensus 308 ~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-----~~ 381 (456) |+++++.++++|+++|+||++.||+.+.| +||+||++++.+|.+|++++++.|+.+|++++++++++.+... .. T Consensus 280 ~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~ 359 (422) T protein:vir:97 280 FMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEFPYLRNQFM 359 (422) T ss_pred HHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccchhhc Confidence 99999999999999999999999999888 5999999999999999999999999999999999999877432 24 Q ss_pred ceeEEecCCCCcC---HHHHHHHHHHHHhc--CCCcHHHHHHhCCCChhHHHHHHHHHHHHHH Q lcl|NC_021301. 382 TVDVSFESPDRVT---LGEKYAAASLAKAA--GESWASIRRNILNYNADQIKQDDLDRAREQI 439 (456) Q Consensus 382 ~i~v~f~~~~~~~---~~e~ad~~~kl~~~--g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~ 439 (456) +++++|+++.|.+ .++.||+++||+++ |+.+.+++++.+||++.+++..+.++.+.+. 
T Consensus 360 ~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 360 DTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGVKGADKPIPAITEVTTDG 422 (422) T ss_pred cceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCCCchhHHHHHHHhhhccC Confidence 6899999776666 78899999999999 6889999999999987654433333322211 No 19 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=1.3e-81 Score=464.09 Aligned_cols=407 Identities=21% Similarity=0.262 Sum_probs=325.2 Q ss_pred ccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEE Q lcl|NC_021301. 39 ELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYL 118 (456) Q Consensus 39 ~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~ 118 (456) +++++++++++..++++++|||++|||++++++.++||+.. |.+....++++|++|+|+.++.++++++++|||||+ T Consensus 1 ~l~~~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~gf~~~---d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~ 77 (434) T protein:vir:98 1 MLPKNAEQAFLDFQRKARTNFCGLIANASVHRLLALGVTGP---DGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYM 77 (434) T ss_pred CCCCCccHHHHHhhhhhhccchHHHHHHHHhhhccCceecC---CCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEE Confidence 88999999999998888999999999999999999999863 445678899999999999999999999999999999 Q ss_pred EEeeCCCC-------ceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccce Q lcl|NC_021301. 119 TCWRRDDG-------TATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR 191 (456) Q Consensus 119 ~v~~d~dg-------~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (456) +||.++++ .++|+++||++++++||+..++ +.+++++|...........+|+.+..+.+.......... 
T Consensus 78 ~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~-~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 153 (434) T protein:vir:98 78 LVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGE-PLVGLKVWHNDIDGFGYARVFFDDTSFPYRTRERTGARL--- 153 (434) T ss_pred EEecCCCcccccCCceeEEEEeccceeEEEEeCCCCc-eEEEEEEEEeccCCceEEEEEEeCcEEEEEEeecccccc--- Confidence 99987654 4679999999999999998776 666777776555445556667666655554332221110 Q ss_pred eeccCCCcee----ecccccccCceeEEEEc-cC----CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC Q lcl|NC_021301. 192 LVTRISDSWV----PVGDAVVTGSPPPVVVY-QN----PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG 262 (456) Q Consensus 192 ~~~~~~~~~~----~~~~~~~~~~~~pvv~~-~n----~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~ 262 (456) ......|+ .....+|+++.||||+| +| .+|+|+|+++++|||+||+++|++.+.++++++|++|++|.. T Consensus 154 --~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~ 231 (434) T protein:vir:98 154 --PWGPDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHK 231 (434) T ss_pred --ccccccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC Confidence 11111121 12334577777887766 45 479999999999999999999999999999999999999988 Q ss_pred CcccccccccchhhhhhhhhhhccceeccC-CCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHH Q lcl|NC_021301. 263 HGLPKVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEG 341 (456) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~A 341 (456) ...+. 
++.+..+.....+....+++|..+ ++++++|++++++++|+++|+.++++|++++++|++.||+.++|+||+| T Consensus 232 ~~~~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~n~Sg~A 310 (434) T protein:vir:98 232 FAKRT-DPATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPTYLYATDLVNISADT 310 (434) T ss_pred ccccc-ccccccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCHHHhccccCChHHHH Confidence 76554 334455555556667778888765 5799999999999999999999999999999999999998888999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_021301. 342 AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNIL 420 (456) Q Consensus 342 l~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~ 420 (456) |++++.+|++||+++++.|+++|++++++++++.|... ..+++++|+++.|+|.++.||+++||+++|+ |.+++++++ T Consensus 311 l~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~~~~~~~~v~w~~~~~~s~~~~ada~~kl~~~g~-~~e~~~~~l 389 (434) T protein:vir:98 311 IGALDILHVAKVREHIASFSEGLESVLALAAAQAGVPEDYTEAEVRWANPAHVTMAVKADAATKLKSIGY-PLDVIAEEL 389 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCChhheeeeEEecCCCCCCHHHHHHHHHHHHhcCC-cHHHHHHhC Confidence 99999999999999999999999999999999998654 4579999999999999999999999998886 889999999 Q ss_pred CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 421 NYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 421 ~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) |++++++++++.++.++...............-|+. 
T Consensus 390 g~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~ 425 (434) T protein:vir:98 390 DESPARVRRIVAGAASQALLAASLLPAPGAPSAGNV 425 (434) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Confidence 999998887766554433322111111001111111 No 20 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=8.6e-82 Score=465.02 Aligned_cols=385 Identities=12% Similarity=0.073 Sum_probs=328.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcc Q lcl|NC_021301. 5 TPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) Q Consensus 5 t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~ 84 (456) =+.++|++|.+++..+.+|+.++.+||+|+|+++++++..|++++..+ ++++|||++|||++++++..+||+.. | T Consensus 1 ~~~~~i~~L~~~~~~~~~r~~~~~~yY~g~~~~~~~~~~~p~~~~~~~-~~v~nw~~~iVds~a~rl~~~Gf~~~---d- 75 (409) T protein:vir:16 1 MTEKGIGYLRFKLSVHKRRAEMRYEQYAMKHVDRFKGITIPQALSQQY-RSILGWCAKGVDSLADRLVFREFEND---D- 75 (409) T ss_pred CCHHHHHHHHHHHHHHhHHHHHHHHHHhccCchhhcchhhhHHHHHHH-hhhcChhHHHHHHhHhhcccccccCc---c- Confidence 235589999999999999999999999999999999999999988754 57889999999999999999999742 2 Q ss_pred cHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEE-EecCCc Q lcl|NC_021301. 85 DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWW-RDLDAE 163 (456) Q Consensus 85 ~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~-~~~d~~ 163 (456) ..++++|+.|+|+..+.++++.|++|||||++|+++++|.++|+++||++++++|||..++.. +++++| .+..+. 
T Consensus 76 ---~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~~~~~~-~a~~~~~~d~~~~ 151 (409) T protein:vir:16 76 ---FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPITGLLT-EGYAVLERDENNN 151 (409) T ss_pred ---hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeecccccce-eeeEEEEecCCCc Confidence 358999999999999999999999999999999999999999999999999999999877644 445544 455677 Q ss_pred eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC------CCCCCcH-hHHHHHHHH Q lcl|NC_021301. 164 SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN------PDGMGEV-EPHIDIINR 236 (456) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n------~~g~s~~-~~v~~liDa 236 (456) ....++|+++.++.+... .+.|. ..+|++|+||+|+|.| ++|.|+| +++++|+|+ T Consensus 152 ~~~~~~~~~~~~~~~~~~---------------~~~~~---~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~~l~da 213 (409) T protein:vir:16 152 VVLEAHFLPDRTDYYYRD---------------SRNNI---SIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGMYWQSN 213 (409) T ss_pred eEEEEEEecCcEEEEEec---------------Ccccc---ceecCCCCcceEEecccccccccCCccccchhHHHHHHH Confidence 777889999888776431 12222 2357788899988865 3799998 579999999 Q ss_pred HHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCC-----CceeEeecccchHHHHHH Q lcl|NC_021301. 237 INRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPP-----GVDIWESQTNDFTPMLSA 311 (456) Q Consensus 237 ~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----d~~~~~~~~~~~~~~~~~ 311 (456) +|++++++.++++|+++|++|++|++.+ +.+. 
..++...+++|.+++ +++++||+++++++|+++ T Consensus 214 ~~r~~~~~~~~~e~~a~pqr~i~G~d~d-------~~~~---~~~~~~~~~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~ 283 (409) T protein:vir:16 214 AKRTLERADVTAEFYSFPQKYVTGLSDD-------AEPM---ETWKATVSSMLQFTKDEDGDKPTLGQFTQPSMSPFTEQ 283 (409) T ss_pred HHHHHHHHHHHHHHhcChhheeEecCCC-------CCcc---chhhhhhhHhhccCCCCCCCCceEEecCCCChhHHHHH Confidence 9999999999999999999999998532 2222 346677788887753 368999999999999999 Q ss_pred HHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-----ccceeE Q lcl|NC_021301. 312 IKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-----EDTVDV 385 (456) Q Consensus 312 l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-----~~~i~v 385 (456) ++.++++||++++||++.||+.+.| +||+||++++.+|..+++++++.|+.+|++++++++++.+... ..++++ T Consensus 284 l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~~~~~~~~~~~v 363 (409) T protein:vir:16 284 LRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVPYLREQFSKTKP 363 (409) T ss_pred HHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhccceE Confidence 9999999999999999999999999 5999999999999999999999999999999999999876532 256899 Q ss_pred EecCCC---CcCHHHHHHHHHHHHhcCCC--cHHHHHHhCCCChhH Q lcl|NC_021301. 386 SFESPD---RVTLGEKYAAASLAKAAGES--WASIRRNILNYNADQ 426 (456) Q Consensus 386 ~f~~~~---~~~~~e~ad~~~kl~~~g~~--s~~t~~~~~~~~~~~ 426 (456) +|.++. .++.++.||+++||+++|.. 
..++.++.+||++++ T Consensus 364 ~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~d 409 (409) T protein:vir:16 364 KWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGAE 409 (409) T ss_pred EecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCCC Confidence 999765 45589999999999999743 468899999999876 No 21 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=2.1e-80 Score=457.37 Aligned_cols=426 Identities=11% Similarity=0.074 Sum_probs=332.4 Q ss_pred CCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTP--AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~--~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) |..+++ .++|..|+.+|..+.+|++++++||+|+|+|+..+.+. ..+.++|+++||+++||++.++||+|+|+++ T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~---~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~ 87 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKD---LWKPDNRLTVNFTKYIVDTFTGYFNGIPVKK 87 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCcc---ccCccceeecchHHHHHHHHhhhhcccCcee Confidence 777776 78999999999999999999999999999998766432 2234678999999999999999999999998 Q ss_pred CCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 79 GGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 79 ~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) ..+ +++..+.++++|+.|+|+.++.++++.++++|+||++||.|++|.+++++++|++++++||+..++.+.++++++. 
T Consensus 88 ~~~-d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~~~ 166 (453) T protein:vir:39 88 SHS-DKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFMVYDDTIKQEPLFAVRYGY 166 (453) T ss_pred ccC-ChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEecCCCCCeEEEEEEEEE Confidence 754 4556788999999999999999999999999999999999999999999999999999999988888999998886 Q ss_pred ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc-CCCCCCcHhHHHHHHHHH Q lcl|NC_021301. 159 DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHIDIINRI 237 (456) Q Consensus 159 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-n~~g~s~~~~v~~liDa~ 237 (456) .. +...+..+|+++.++.|.. ..+.|......+|+++.||||++. |++|+|+|+++++|||+| T Consensus 167 ~~-~~~~~~~~yt~~~i~~~~~---------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~ 230 (453) T protein:vir:39 167 DD-DYKLYGEVYTKETTYALNG---------------TMGFYNMTEQAPNPFDDLPVVEFYFNEERMSIFESVISLVNAF 230 (453) T ss_pred eC-CeEEEEEEEeCCeEEEEEe---------------cCCceeeecccccCCCceeEEEecCCCCCCcchhhhHHHHHHH Confidence 54 4567788999999888742 233455556677888888888775 689999999999999999 Q ss_pred HHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHH Q lcl|NC_021301. 238 NRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIR 317 (456) Q Consensus 238 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~ 317 (456) |+++|++++.++++++|+++++|.+.+..... .......+....+.....++++++.+. +.+.+.+.+.++.+.. 
T Consensus 231 ~~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~lt~-~~~~~~~~~~~~~l~~ 305 (453) T protein:vir:39 231 NKAISEKANDVDYFSDQYLTFLGAAVEEEDLK----NIRSNRVINYYGESSEAKNVDVKFLEK-PDSDSQTENLLDRLTK 305 (453) T ss_pred HHHHHHHHHHHHHhhCceeeeecCCCCchhhh----hhhhcceeeecCCCCCCCCCceeEEee-cCCHHHHHHHHHHHHH Confidence 99999999999999999999999764322111 111111111111111112233444333 3456777777778888 Q ss_pred HHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CcccceeEEecCCCCc Q lcl|NC_021301. 318 QLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE----SVEDTVDVSFESPDRV 393 (456) Q Consensus 318 ~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~----~~~~~i~v~f~~~~~~ 393 (456) .|+..+++|...++.. +|+||+||++++++|.+||.++++.|+.+|++++++++++.+. .+..+++++|+++.|. T Consensus 306 ~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~p~ 384 (453) T protein:vir:39 306 LIFQTTMVANISDESF-GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDIEYTFTRNEPK 384 (453) T ss_pred HHHHHhCCcccccccc-cCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCCCc Confidence 8888888887766543 6899999999999999999999999999999999999887653 2345789999999999 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh-hhccc---------ccCC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV-QRPQE---------DGSR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~-~~~~~---------d~~~ 456 (456) |.++.|++++|+ +|++|++|+++++|++++ .+.|++|+++|......... ..+.. ++.. 
T Consensus 385 ~~~~~a~~~~kl--~g~is~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 385 DIKEQAETANIL--MGITSQETALSVISVIPD--VQAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred CHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCCC Confidence 999999999987 489999999999999876 34456666655443322111 01111 1111 No 22 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=3e-80 Score=456.53 Aligned_cols=440 Identities=14% Similarity=0.060 Sum_probs=325.6 Q ss_pred CCCCCH-----HHHHHHHHHHH-HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTASTP-----AEWLPVLTKRI-DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~~t~-----~~~~~~l~~~~-~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) |++... .+.+..++.+| ..+.+|++++++||+|+|+|...+...++..+. ++|+++||+++||++.++||+|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~-~~ki~~n~~k~Ivd~~~~yl~g~ 109 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGN 109 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccC-cceeecchHHHHHHHHhhhhccc Confidence 433221 23455666665 566899999999999999998766655555444 56899999999999999999999 Q ss_pred CeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 
75 GITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 75 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) |++++.+ +++..+.++++|+.|+|+.++.++++.+++||+||+++|.|++|.+++.+++|++++|+||+....++.+++ T Consensus 110 p~~~~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~~~~~iyd~~~~~~~~~~v 188 (512) T protein:vir:97 110 PIQCQDD-DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGV 188 (512) T ss_pred CceeccC-ChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 9998754 556778899999999999999999999999999999999999999999999999999999998888899999 Q ss_pred EEEEecC------CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc-CCCCCCcH Q lcl|NC_021301. 155 RWWRDLD------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEV 227 (456) Q Consensus 155 ~~~~~~d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-n~~g~s~~ 227 (456) ++|.... +...+..+|+++.++.|.......... .+......+|+++.+||+++. |++|+|+| T Consensus 189 r~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~nn~~~~gd~ 258 (512) T protein:vir:97 189 RYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL----------TPRENGFESHSFERMPITEFSNNERRKGDY 258 (512) T ss_pred EEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccc----------cccccccccccCcccceEeecCCCCCCCch Confidence 9986432 224556799999988876432111110 111223346777777877765 68999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc---ccccchhhhhhhhhhhccceeccCCCceeEeec-cc Q lcl|NC_021301. 228 EPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV---DENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TN 303 (456) Q Consensus 228 ~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~ 303 (456) +++++|||+||+++|++++.++++++|+++++|.....+.. ...+..+..........+.....+.++++..+. +. 
T Consensus 259 e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~ 338 (512) T protein:vir:97 259 EKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIYKQY 338 (512) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhhhhhhhcccccccccchhhcccccCCCCCcceEEEeecC Confidence 99999999999999999999999999999999975432211 111111111111111112222234455554443 34 Q ss_pred chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------ Q lcl|NC_021301. 304 DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE------ 377 (456) Q Consensus 304 ~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~------ 377 (456) +.+++...++.+...|+..+++|+..++..++|+||+||++++.+|.++|..+++.|+++|++++++++++.+. T Consensus 339 ~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~ 418 (512) T protein:vir:97 339 DVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDA 418 (512) T ss_pred CHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccc Confidence 56667777777788888888899888887778999999999999999999999999999999999998876331 Q ss_pred -CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhh------c Q lcl|NC_021301. 378 -SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQR------P 450 (456) Q Consensus 378 -~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~------~ 450 (456) .+...++++|+++.|.|.++.++++++|. |++|.+|+++++|++++ .+.|++|+++|........... + T Consensus 419 ~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~ 494 (512) T protein:vir:97 419 NKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRD 494 (512) T ss_pred ccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhcccCCCCC Confidence 22346899999999999999999999885 89999999999999865 3344555555433322211111 1 Q ss_pred -----ccccCC Q lcl|NC_021301. 
451 -----QEDGSR 456 (456) Q Consensus 451 -----~~d~~~ 456 (456) +++.+. T Consensus 495 ~~~~~~~~~~~ 505 (512) T protein:vir:97 495 INDDEQDDDTK 505 (512) T ss_pred CCCCCCCCCcc Confidence 111111 No 23 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=3.2e-79 Score=450.91 Aligned_cols=426 Identities=12% Similarity=0.013 Sum_probs=329.1 Q ss_pred CC---CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MT---ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~---~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |. .-|++++.+.+.+++..+++||+++++||+|+|+|+..++.. ...++|+++||+++||++.++||+|+|++ T Consensus 19 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~----~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~ 94 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEKE----TGADNRIVVNSAKYVVDVYNGYFCGIEPK 94 (470) T ss_pred eCCCCCcCHHHHHHHHHHHHHhhHHHHHHHHHHhccccccccCcccc----cCCcceeecchHHHHHHHHhhhhccCCee Confidence 44 334555555444444567799999999999999987654331 23467899999999999999999999999 Q ss_pred cCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEE Q lcl|NC_021301. 
78 VGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWW 157 (456) Q Consensus 78 ~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~ 157 (456) +....|+...+.++++|+.|+|+.++.++++.+++||+||+++|.+++|.+++++++|++++|+||+...+++.+++++| T Consensus 95 ~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~vr~~ 174 (470) T protein:vir:99 95 LALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAFIIYDDTVQRQPLAFVHYQ 174 (470) T ss_pred EeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeEEEEcCCCCcceEEEEEEE Confidence 88776777778899999999999999999999999999999999999999999999999999999998888899999998 Q ss_pred EecCCc--eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHH Q lcl|NC_021301. 158 RDLDAE--SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDII 234 (456) Q Consensus 158 ~~~d~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~li 234 (456) ...++. ..+..+|+++.++.+..... ...+......+|+++.+||+++ +|++|+|+|+++++|| T Consensus 175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~li 241 (470) T protein:vir:99 175 IDNSNNWTDAYGVIQYADKFYKFKGYDI-------------EEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSIKTLI 241 (470) T ss_pred EEecCCeeEEEEEEEecCeEEEEEeccc-------------ccccccccccccCCCccceEeecCCCCCCcchHhHHHHH Confidence 766543 44577899988887753211 1112223345677777887776 5688999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec-----cCCCceeEeec-ccchHHH Q lcl|NC_021301. 235 NRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE-----LPPGVDIWESQ-TNDFTPM 308 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~d~~~~~~~-~~~~~~~ 308 (456) |+||+++|++++..+++++|+++++|+..+. ++.|.+..... ...++. .+.+++++++. 
+.+.+.+ T Consensus 242 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~---~~~g~~~~~~~-----~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 313 (470) T protein:vir:99 242 NALDKVISQKANQVEYFDNAYMYMIGFKLPE---DDEGNPKFDFK-----NNRVLYVSQLDPDTNPQIGFIAKPDADQMQ 313 (470) T ss_pred HHHHHHHHHHHHHHHHhcCceeeeecCCccc---ccccchhhhhh-----hcceeeecCCCCCCCCcceEEeecCChHHH Confidence 9999999999999999999999999986543 23333332211 122222 23456666664 4466778 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----Ccccce Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-----SVEDTV 383 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-----~~~~~i 383 (456) ...++.+.+.|+..+++|+..++..++|+||+||++++.+|.+||+++++.|+.+|++++++++++.+. .+...+ T Consensus 314 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i 393 (470) T protein:vir:99 314 ENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSEL 393 (470) T ss_pred HHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccc Confidence 888888999999999999888888788999999999999999999999999999999999998876442 233578 Q ss_pred eEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhh-hhhhhcccccCC Q lcl|NC_021301. 384 DVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAG-NSVQRPQEDGSR 456 (456) Q Consensus 384 ~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~-~~~~~~~~d~~~ 456 (456) +++|+++.|.|.++.+++++++. 
|++|.+|+++++|+++ ++.|++|+++|.....+ ........|..+ T Consensus 394 ~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~vd---~~~E~eri~~E~~~~~~~~~~~~~~~d~~~ 462 (470) T protein:vir:99 394 DFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDIE---PDAEMKQIAKEKADAIKQTQQLSMPIDILK 462 (470) T ss_pred eEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCC---HHHHHHHHHHHHHHHHHHHHhhcCCCCcCC Confidence 99999999999999999999885 8999999999999974 22345555544322111 111111112221 No 24 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=1.7e-78 Score=446.92 Aligned_cols=439 Identities=14% Similarity=0.059 Sum_probs=323.1 Q ss_pred CCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRI-DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~-~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) -...++++ |..++.+| ..+++|++++++||+|+|+++..+...++..+. ++|+++||+++||++.++||+|+|++++ T Consensus 37 ~~~~~~~~-i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~-~~ki~~n~~k~Iv~~~~~yl~g~p~~~~ 114 (511) T protein:vir:93 37 DLLQNVNE-VSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQYQ 114 (511) T ss_pred hhhccHHH-HHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccC-cceeecchHHHHHHHHhhhhcccCeeec Confidence 22233443 55566665 567899999999999999998766655554443 5689999999999999999999999987 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) .+ +++..+.++++|+.|+|+.++.++++.+++||+||+++|.|++|.+++++++|++++|+||+....++.+++++|.. 
T Consensus 115 ~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~ 193 (511) T protein:vir:93 115 DD-DKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (511) T ss_pred cC-ChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 54 55677889999999999999999999999999999999999999999999999999999999887788999999864 Q ss_pred cC------CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHH Q lcl|NC_021301. 160 LD------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHID 232 (456) Q Consensus 160 ~d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~ 232 (456) .. +...+..+|+++.++.|.......... .+......+|+++.+||+.+ +|++|+|+|+++++ T Consensus 194 ~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~nn~~g~gd~e~v~~ 263 (511) T protein:vir:93 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL----------TPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (511) T ss_pred eeccccccceEEEEEEEeCCcEEEEEecCCCcccc----------ccccccccccCCCccceEEecCCCCCCCchhhHHH Confidence 32 234567899999998876432211111 11122334566777777666 56899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc-ccccc-chhhhhhhhhhhccceeccCCCceeEeec-ccchHHHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPK-VDENG-NAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPML 309 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~ 309 (456) |||+||.++|++++..+++++|+++++|....... ..... .............+......+++++..+. +.+.+++. 
T Consensus 264 liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 343 (511) T protein:vir:93 264 LIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTE 343 (511) T ss_pred HHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHH Confidence 99999999999999999999999999996542211 00000 00000000000111111233445554443 34567777 Q ss_pred HHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------Ccccc Q lcl|NC_021301. 310 SAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-------SVEDT 382 (456) Q Consensus 310 ~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-------~~~~~ 382 (456) ..++.+...|+..+++|+..++..++|+||+||+++++++.++|.++++.|+++|++++++++++.+. .+... T Consensus 344 ~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~ 423 (511) T protein:vir:93 344 AYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNT 423 (511) T ss_pred HHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Confidence 77888888888888899888877778999999999999999999999999999999999998865331 12346 Q ss_pred eeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhh----hhh-------cc Q lcl|NC_021301. 383 VDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNS----VQR-------PQ 451 (456) Q Consensus 383 i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~----~~~-------~~ 451 (456) ++++|+++.|.|.++.++++.+|. |++|.+|+++.+|++++. +.|++|+++|........ ... .+ T Consensus 424 i~~~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~l~~v~d~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 499 (511) T protein:vir:93 424 VRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQDP--ELEVKKIEEDEKESIKKAQKGIYKDPRDINDDEQ 499 (511) T ss_pred ceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCCC Confidence 899999999999999999999884 899999999999998752 334555554433222111 111 11 Q ss_pred cccCC Q lcl|NC_021301. 
452 EDGSR 456 (456) Q Consensus 452 ~d~~~ 456 (456) ++++. T Consensus 500 ~~~~~ 504 (511) T protein:vir:93 500 DDDTK 504 (511) T ss_pred CCccc Confidence 12222 No 25 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=1.9e-78 Score=446.67 Aligned_cols=429 Identities=11% Similarity=0.045 Sum_probs=326.2 Q ss_pred CCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTAST--PAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t--~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) |+.+. -.+.|..++.+|..+++||+++.+||+|+|+|+......+ ...++|+++||+++||++.++||+|+|+++ T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~---~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~ 87 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKAKDS---WKPDNRLTNNFAKYIVDTFVGYFNGIPIKK 87 (453) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCCCCc---cCccceeecchHHHHHHHhhhhhcccCcee Confidence 33221 1246888899999999999999999999999987554322 234668999999999999999999999998 Q ss_pred CCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 79 GGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 79 ~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) ..+ ++...+.++++|+.|+|+..+.++++++++||+||+++|.+++|.+++++++|++++++||+..++.+.++++++. 
T Consensus 88 ~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i~~~~ 166 (453) T protein:vir:73 88 THD-DKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDSIKQKPLFAVYYGF 166 (453) T ss_pred ecC-ChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCCCCceeEEEEEEEE Confidence 754 4556778999999999999999999999999999999999999999999999999999999988888899999887 Q ss_pred ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHHHH Q lcl|NC_021301. 159 DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRI 237 (456) Q Consensus 159 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liDa~ 237 (456) +.++ ..+..+|+++.++.|... .+.|......+|.++.||||++ +|++|+|+|+++++|||+| T Consensus 167 ~~~~-~~~~~vyt~~~i~~~~~~---------------~~~~~~~~~~~~~~g~vPvv~~~n~~~g~s~~~~v~~liDa~ 230 (453) T protein:vir:73 167 DEEG-NLSGTVYTLLETISITGK---------------AGEVKFGESTYNVYSDLPIVEYNFNEERQSIFEPVHSLINSY 230 (453) T ss_pred ecCc-eEEEEEEeCCeEEEEEec---------------CCceEEccceeccCCceeEEEecCCCCCCcchhhHHHHHHHH Confidence 7665 456899999998887531 2234445556677888888776 5689999999999999999 Q ss_pred HHHHHHHHHHHHHhhchhhhhhcCCCcccccccc-cchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHHHHH Q lcl|NC_021301. 238 NRAELQLLSTMAIQAFRQRALKSAGHGLPKVDEN-GNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAIKEH 315 (456) Q Consensus 238 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~~ 315 (456) |+++|++++.++++++|+++++|+.......... +... ........+.....++++++..+. 
+.+.+++...++.+ T Consensus 231 ~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l 308 (453) T protein:vir:73 231 NKVTSEKANDVEYFSDQYLVFLGAEVDEEDAKNIKDNRL--INFFDKNSNGQGTNAAKVDVKFLDKPDSDVQTENLLNRL 308 (453) T ss_pred HHHHHHHHHHHHHhccceeeeecCCCCchhhhccccccc--ccccccccccccccccCceeEEeeecCCHHHHHHHHHHH Confidence 9999999999999999999999986543222111 1111 011111122223334444444332 33456677777777 Q ss_pred HHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC----cccceeEEecCCC Q lcl|NC_021301. 316 IRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES----VEDTVDVSFESPD 391 (456) Q Consensus 316 ~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~----~~~~i~v~f~~~~ 391 (456) ...|+..+++|...++.. +|+||+||++++.+|.+||+++++.|+.+|++++++++++.+.. +..+++++|+++. T Consensus 309 ~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v~f~~~~ 387 (453) T protein:vir:73 309 ERSIFQFTMAANISDENF-GNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEYTFTRNE 387 (453) T ss_pred HHHHHHHhCCcccCcccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEEeCCCC Confidence 777777788887666553 78999999999999999999999999999999999998875432 3357899999999 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHH-hhhh-hhhcccccCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLF-AGNS-VQRPQEDGSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~-~~~~-~~~~~~d~~~ 456 (456) |.|.++.|++++|+. |++|.+|+++.+|++++. +.|.+|+++|.... .... 
....+++-.+ T Consensus 388 p~~~~~~a~~~~k~~--giis~et~~~~~~~~~d~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 450 (453) T protein:vir:73 388 PKDIKEQAETANILK--GITSEETALSVISVIPDV--QAEMEKIKKKKLLQLSLTRTSNLVRMKQMR 450 (453) T ss_pred CCCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHHhccCCcchhhh Confidence 999999999999986 899999999999998652 23344444332211 1111 1112223333 No 26 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=2.7e-78 Score=445.80 Aligned_cols=438 Identities=14% Similarity=0.057 Sum_probs=324.4 Q ss_pred CCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKR-IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~-~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) -+.++++. |..++.+ ...+++|++++++||+|+|+++..+...++..+. ++|+++||+++||++.++||+|+|+++. T Consensus 37 ~~~~~~~~-i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~-~~ki~~n~~k~Iv~~~~~yl~g~p~~~~ 114 (511) T protein:vir:10 37 DLLQNVNE-VSKCIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQYQ 114 (511) T ss_pred hcccCHHH-HHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccC-cceeecchHHHHHHHHhhhhcccCceee Confidence 34445554 4455554 4667899999999999999998766655555444 5689999999999999999999999987 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) .+ +++..+.++++|+.|+|+.++.++++.+++||+||+++|.|++|.+++++++|++++|+||+....++.+++++|.. 
T Consensus 115 ~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~ 193 (511) T protein:vir:10 115 DD-DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (511) T ss_pred cC-chHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 54 55677889999999999999999999999999999999999999999999999999999999887889999999875 Q ss_pred cC------CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc-CCCCCCcHhHHHH Q lcl|NC_021301. 160 LD------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHID 232 (456) Q Consensus 160 ~d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-n~~g~s~~~~v~~ 232 (456) .. +...+..+|+++.++.|......... ..+......+|+++.+||+++. |++|+|+|+++++ T Consensus 194 ~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~----------~~~~~~~~~~~~~~~vPvv~f~nn~~g~gd~e~v~~ 263 (511) T protein:vir:10 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------LTPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (511) T ss_pred eecccCccceEEEEEEEeCCcEEEEEecCCCccc----------ccccccccccccCcceeEEEecCCCCCCCchhhhHH Confidence 32 23456779999999887643211100 0111223346777777777765 5789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc-cc--cccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPK-VD--ENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPM 308 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~ 308 (456) |||+||.++|++++..+++++|+++++|....... .. ..+..+.. .......+.....++++++..+. 
+.+.+++ T Consensus 264 liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~ 342 (511) T protein:vir:10 264 LIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFL-EPTVYADSEGRETEGSVDGGYIYKQYDVQGT 342 (511) T ss_pred HHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccchhccceec-ccccccccccccCCCCcceeEEeecCCHHHH Confidence 99999999999999999999999999996432111 00 01111100 00000111112233445554443 3456677 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------Cccc Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-------SVED 381 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-------~~~~ 381 (456) ...++.+...|+..+++|+..++..++|+||+||+++++++.++|.++++.|+++|++++++++++.+. .+.. T Consensus 343 e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~~ 422 (511) T protein:vir:10 343 EAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 422 (511) T ss_pred HHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCcccccccc Confidence 777777788888888899888877778999999999999999999999999999999999998876431 2234 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh------hhcccccC Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV------QRPQEDGS 455 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~------~~~~~d~~ 455 (456) +++++|+++.|+|.++.++++++|. |++|++|+++.+|++++ .+.|++|+++|......... ..+..+++ T Consensus 423 ~i~i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:10 423 TVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred eeeEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCC Confidence 6899999999999999999999985 89999999999999875 33455555555332221111 11111111 Q ss_pred C Q lcl|NC_021301. 
456 R 456 (456) Q Consensus 456 ~ 456 (456) . T Consensus 499 ~ 499 (511) T protein:vir:10 499 Q 499 (511) T ss_pred C Confidence 1 No 27 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=3.8e-78 Score=445.02 Aligned_cols=433 Identities=11% Similarity=0.037 Sum_probs=316.1 Q ss_pred CCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccc----hhhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTA--STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTS----AAWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~--~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~----~~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) ... ++..++|.+|+.+|..+++|++++.+||+|+|+|...++... ....+.++|+++||+++|||+.++||+|+ T Consensus 38 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~ 117 (492) T protein:vir:94 38 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 117 (492) T ss_pred cCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHHhhhccc Confidence 222 334568899999999999999999999999999876654321 11223466899999999999999999999 Q ss_pred CeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 
75 GITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 75 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) |++++.+ |++..+.++++| .|+|+..+.++++++++||+||+++|.|++|.+++++++|++++++||+.....+.+++ T Consensus 118 p~~~~~~-d~~~~~~l~~~~-~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~i 195 (492) T protein:vir:94 118 PIAFKHT-DDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI 195 (492) T ss_pred CceeccC-chHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 9998754 455666677766 48899999999999999999999999999999999999999999999988778899999 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) ++|...+. ....+|++..++.|........ .......+.+... ..+|.++.|||+++ +|++|.|+|+++++| T Consensus 196 r~~~~~~~--~~~~~y~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~g~vPvv~~~nn~~~~sd~e~v~~l 268 (492) T protein:vir:94 196 RMYKLENE--TKVEYWDKVTVNYYVYENGSLI----PDYSNNLENSKTH-FSTGSWGKIPFIPFKNNDLEISDIFMYKTL 268 (492) T ss_pred EEEeeccc--eeEEEEecCeEEEEEEecCeee----ecccccccccccc-ccccCCCccceEEecCCCCCCCchHHHHHH Confidence 99876543 3468899988888754321110 0111112223333 34566666676665 578999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEee-cccchHHHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWES-QTNDFTPMLSAI 312 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~-~~~~~~~~~~~l 312 (456) ||+||+++|++++..+++++|+++++|++..... + . ...+. 
...++..+.++++..+ .+.+.+++...+ T Consensus 269 iDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~--~---~---~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 338 (492) T protein:vir:94 269 IDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELP--E---F---KRLLR--YYGAIKVSDNGGVDTIQVEVPVENSKKYL 338 (492) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcccch--h---h---HHHHh--hccceecCCCCcceeEeccCCHHHHHHHH Confidence 9999999999999999999999999997643211 1 1 11111 1223334444443322 122334444444 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-cceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVE-DTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~-~~i~v~f~~~~ 391 (456) +.+...|+..+++|...++..++|+||+||++++.+|..||+++++.|+.+|++++++++++.+...+ ..++++|++++ T Consensus 339 ~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~ 418 (492) T protein:vir:94 339 DELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNK 418 (492) T ss_pred HHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCC Confidence 44444555555566555555556899999999999999999999999999999999999998886543 57999999999 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh--hcccc--------cCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ--RPQED--------GSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~--~~~~d--------~~~ 456 (456) |.|.++.+++++++. |++|++|+++++|++++ .+.|++|+++|.......... 
...++ +++ T Consensus 419 p~~~~e~~~~~~kl~--giiS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 489 (492) T protein:vir:94 419 VANTELQVQTAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADSAQQQERSNNK 489 (492) T ss_pred CCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCccccCCccc Confidence 999999999999985 89999999999999875 344555555543322221111 11111 111 No 28 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=3.9e-78 Score=444.98 Aligned_cols=437 Identities=14% Similarity=0.076 Sum_probs=322.0 Q ss_pred CCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRI-DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~-~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) -...++++ |..++.+| ..+++||+++++||+|+|+++......++..+. ++|+++||+++||++.++||+|+|+++. T Consensus 37 ~~~~~~~~-i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~-~~ki~~n~~k~Iv~~~~~yl~g~p~~~~ 114 (511) T protein:vir:99 37 DLLQNVNE-VSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQYQ 114 (511) T ss_pred hhhccHHH-HHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcccccccC-cceeecchHHHHHHHHHhhhcccCceee Confidence 23334444 44555555 567899999999999999998766555554444 5689999999999999999999999987 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) .+ +++..+.++++|+.|+|+.++.++++.++++|+||+++|.|++|.+++++++|++++|+||+....++.+++++|.. 
T Consensus 115 ~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~ 193 (511) T protein:vir:99 115 DD-DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (511) T ss_pred cC-chHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 54 55677889999999999999999999999999999999999999999999999999999999887789999999864 Q ss_pred cC------CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHH Q lcl|NC_021301. 160 LD------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHID 232 (456) Q Consensus 160 ~d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~ 232 (456) .. +...+..+|+++.++.|.......... ..+ ......|+++.+||+++ +|++|+|+|+++++ T Consensus 194 ~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~---------~~~-~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~ 263 (511) T protein:vir:99 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL---------TPR-ENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (511) T ss_pred eecccCccceEEEEEEEeCCcEEEEEecCCccccc---------ccc-ccccccCCCCccceEEecCCCCCCCchhhhHH Confidence 31 234567799999998886532211110 111 12334566676777666 56899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc-cc---cccchhhhhhhhhhhccceeccCCCceeEeec-ccchHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPK-VD---ENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTP 307 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~ 307 (456) |||+||.++|++++..+++++|+++++|....... .. ..+........... +.......++++..+. 
+.+.++ T Consensus 264 liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~d~~~l~~~~~~~~ 341 (511) T protein:vir:99 264 LIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYAD--SEGRETEGSVDGGYIYKQYDVQG 341 (511) T ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhcccccccceecccccccc--cccccCCCCcceeEEeecCCHHH Confidence 99999999999999999999999999996532211 00 00100000000111 1111223344444332 335667 Q ss_pred HHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------Ccc Q lcl|NC_021301. 308 MLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-------SVE 380 (456) Q Consensus 308 ~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-------~~~ 380 (456) +...++.+...|+..+++|+..++..++|+||+||++++.++.+||.++++.|+++|++++++++++.+. .+. T Consensus 342 ~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~ 421 (511) T protein:vir:99 342 TEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDF 421 (511) T ss_pred HHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccc Confidence 7777788888888888899888877778999999999999999999999999999999999998876432 123 Q ss_pred cceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhh----c------ Q lcl|NC_021301. 381 DTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQR----P------ 450 (456) Q Consensus 381 ~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~----~------ 450 (456) ..++++|+++.|.|.++.+++++++. |++|++|+++++|++++ .+.|++|+++|........... + T Consensus 422 ~~i~i~f~~~~p~n~~e~~~~~~kl~--GiiS~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 497 (511) T protein:vir:99 422 NTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKNMYQDPRNINDD 497 (511) T ss_pred ccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhcccccCCCCCCC Confidence 46899999999999999999999885 89999999999999875 3345555555433222111111 1 Q ss_pred -ccccCC Q lcl|NC_021301. 
451 -QEDGSR 456 (456) Q Consensus 451 -~~d~~~ 456 (456) +++.+. T Consensus 498 ~~~~~~~ 504 (511) T protein:vir:99 498 EQDDSTK 504 (511) T ss_pred CCCCCCc Confidence 111111 No 29 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=3.4e-78 Score=445.33 Aligned_cols=438 Identities=14% Similarity=0.081 Sum_probs=322.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -..++++++.+.+.++...+++|++++++||+|+|+++......++..+. ++|+++||+++||++.++||+|+|+++.. T Consensus 37 ~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~-~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:78 37 DLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccC-cceeecchHHHHHHHHhhhhcccCceeec Confidence 23334554444444444577899999999999999988666555554443 56899999999999999999999999875 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 160 (456) + +++..+.++++|+.|+++.++.++++.+++||+||+++|.|++|.+++++++|++++|+||+....++.+++++|... 
T Consensus 116 ~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~ 194 (511) T protein:vir:78 116 D-DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTK 194 (511) T ss_pred C-chHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEee Confidence 4 555778899999999999999999999999999999999999999999999999999999998877899999998643 Q ss_pred C------CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 161 D------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 161 d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) . +...+..+|+++.++.|.......... .+......+|+++.|||+++ +|++|+|+|+++++| T Consensus 195 ~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~l 264 (511) T protein:vir:78 195 PIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKL----------TPRENSFESHSFERMPITEFSNNERRKGDYEKVITL 264 (511) T ss_pred eccccccceEEEEEEEeCCcEEEEEecCCCcccc----------cccccccccCcCcccceEEecCCCCCCCchhhhHHH Confidence 2 223467799999988876432111110 11122334567777777776 567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc-cc--cccchhhhhh-hhhhhccceeccCCCceeEeec-ccchHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPK-VD--ENGNAIDYAS-IFEAAPGALWELPPGVDIWESQ-TNDFTPM 308 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~-~~--~~~~~~~~~~-~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~ 308 (456) ||+||.++|++++..+++++|+++++|....... .. ..+..+.... ......+. ....++++..+. 
+.+.+++ T Consensus 265 iDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~~~ 342 (511) T protein:vir:78 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGR--ETEGSVDGGYIYKQYDVQGT 342 (511) T ss_pred HHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccc--cCCCCcceeEEeecCCHHHH Confidence 9999999999999999999999999996432211 00 0111110000 00001111 122334443332 3356677 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------Cccc Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-------SVED 381 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-------~~~~ 381 (456) ...++.+...|+..+++|+..++..++|+||+||++++.++.++|..+++.|+.+|++++++++++.+. .+.. T Consensus 343 e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~ 422 (511) T protein:vir:78 343 EAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 422 (511) T ss_pred HHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc Confidence 777777788888888899888877778999999999999999999999999999999999998876432 2234 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh----hccc--c-- Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ----RPQE--D-- 453 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~----~~~~--d-- 453 (456) +++++|+++.|.|.++.++++++|. |++|++|+++++|++++ .+.|++|+++|.......... .+.. + T Consensus 423 ~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:78 423 TVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 6899999999999999999999985 89999999999999875 344555565554332221111 1111 1 Q ss_pred ---cCC Q lcl|NC_021301. 
454 ---GSR 456 (456) Q Consensus 454 ---~~~ 456 (456) .++ T Consensus 499 ~~~~~~ 504 (511) T protein:vir:78 499 QDDDTK 504 (511) T ss_pred CCCCcc Confidence 111 No 30 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=3.4e-78 Score=445.33 Aligned_cols=438 Identities=14% Similarity=0.081 Sum_probs=322.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -..++++++.+.+.++...+++|++++++||+|+|+++......++..+. ++|+++||+++||++.++||+|+|+++.. T Consensus 37 ~~~~~~~~i~~~i~~~~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~-~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~ 115 (511) T protein:vir:96 37 DLLQNVNEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQYQD 115 (511) T ss_pred hhhcCHHHHHHHHHHHHHhhhHHHHHHHHHhhccCccccccCcccccccC-cceeecchHHHHHHHHhhhhcccCceeec Confidence 23334554444444444577899999999999999988666555554443 56899999999999999999999999875 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 160 (456) + +++..+.++++|+.|+++.++.++++.+++||+||+++|.|++|.+++++++|++++|+||+....++.+++++|... 
T Consensus 116 ~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~vr~~~~~ 194 (511) T protein:vir:96 116 D-DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAGVRYLRTK 194 (511) T ss_pred C-chHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEee Confidence 4 555778899999999999999999999999999999999999999999999999999999998877899999998643 Q ss_pred C------CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 161 D------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 161 d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) . +...+..+|+++.++.|.......... .+......+|+++.|||+++ +|++|+|+|+++++| T Consensus 195 ~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~----------~~~~~~~~~~~~g~vPvv~~~n~~~g~gd~e~v~~l 264 (511) T protein:vir:96 195 PIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKL----------TPRENSFESHSFERMPITEFSNNERRKGDYEKVITL 264 (511) T ss_pred eccccccceEEEEEEEeCCcEEEEEecCCCcccc----------cccccccccCcCcccceEEecCCCCCCCchhhhHHH Confidence 2 223467799999988876432111110 11122334567777777776 567899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc-cc--cccchhhhhh-hhhhhccceeccCCCceeEeec-ccchHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPK-VD--ENGNAIDYAS-IFEAAPGALWELPPGVDIWESQ-TNDFTPM 308 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~-~~--~~~~~~~~~~-~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~ 308 (456) ||+||.++|++++..+++++|+++++|....... .. ..+..+.... ......+. ....++++..+. 
+.+.+++ T Consensus 265 iDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~l~~~~~~~~~ 342 (511) T protein:vir:96 265 IDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVYVDAEGR--ETEGSVDGGYIYKQYDVQGT 342 (511) T ss_pred HHHHHHHHHHHHHHHHHhhcchhheecCccCCchhhcccccccceeccccceeccccc--cCCCCcceeEEeecCCHHHH Confidence 9999999999999999999999999996432211 00 0111110000 00001111 122334443332 3356677 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------Cccc Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-------SVED 381 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-------~~~~ 381 (456) ...++.+...|+..+++|+..++..++|+||+||++++.++.++|..+++.|+.+|++++++++++.+. .+.. T Consensus 343 e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~~ 422 (511) T protein:vir:96 343 EAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDFN 422 (511) T ss_pred HHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccc Confidence 777777788888888899888877778999999999999999999999999999999999998876432 2234 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh----hccc--c-- Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ----RPQE--D-- 453 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~----~~~~--d-- 453 (456) +++++|+++.|.|.++.++++++|. |++|++|+++++|++++ .+.|++|+++|.......... .+.. + T Consensus 423 ~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d--~~~El~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:96 423 TVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 6899999999999999999999985 89999999999999875 344555565554332221111 1111 1 Q ss_pred ---cCC Q lcl|NC_021301. 
454 ---GSR 456 (456) Q Consensus 454 ---~~~ 456 (456) .++ T Consensus 499 ~~~~~~ 504 (511) T protein:vir:96 499 QDDDTK 504 (511) T ss_pred CCCCcc Confidence 111 No 31 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=5.9e-78 Score=443.99 Aligned_cols=433 Identities=12% Similarity=0.039 Sum_probs=321.0 Q ss_pred CCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccch----hhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTA--STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSA----AWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~--~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~----~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) ++. +...++|++|+.+|..+++|++++.+||+|+|+|...+++... ...+.++|+++||+++||++.++||+|+ T Consensus 38 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~ 117 (492) T protein:vir:97 38 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 117 (492) T ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHHHhhhhccc Confidence 332 2335678999999999999999999999999999766543321 1223466899999999999999999999 Q ss_pred CeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 
75 GITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 75 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) |+++..+ |++..+.++++| .|+++..+.++++++++||+||+++|.+++|.+++++++|++++++||+....++.+++ T Consensus 118 p~~~~~~-d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~v 195 (492) T protein:vir:97 118 PIAFKHT-DDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI 195 (492) T ss_pred CceeccC-chHHHHHHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 9998654 455666677766 58999999999999999999999999999999999999999999999987777899999 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) ++|...+. ....+|+++.++.|....... ........+.+.. ...+|.++.|||+++ +|++|+|+|+++++| T Consensus 196 r~~~~~~~--~~~~~y~~~~v~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~l 268 (492) T protein:vir:97 196 RMYKLENE--TKVEYWDKVTVNYYVYENGSL----IPDYSNNLENSKT-HFSTGSWGKIPFIPFKNNDLEISDIFMYKTL 268 (492) T ss_pred EEEeeccc--eeEEEEecCeEEEEEEecCee----eeccccccccccc-ccccCCCCCcceEEecCCCCCCCchHhHHHH Confidence 99986553 356789998888875432111 0011111222333 334566666777666 568899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAI 312 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l 312 (456) ||+||+++|++++..+++++|+++++|+..... .+ .. ..+ ....++..+.++++..+. 
+.+.+++...+ T Consensus 269 iDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~--~~---~~---~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 338 (492) T protein:vir:97 269 IDAYNRRLSDLSNTFKDSNELTYVLKNYDDQEL--PE---FK---RLL--RYYGAIKVSDNGGVDTIQVEVPVENSKKYL 338 (492) T ss_pred HHHHHHHHHHHHHHHHHhccceeeeecCCcccc--hh---HH---HHH--hhccceecCCCCcceeEeccCCHHHHHHHH Confidence 999999999999999999999999999764321 11 11 111 112233344444443331 23445555556 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-EDTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~i~v~f~~~~ 391 (456) +.+...|+..+++|+..++..++|+||+||++++.+|..||+++++.|+.+|++++++++++.+... ..+++++|+++. T Consensus 339 ~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~ 418 (492) T protein:vir:97 339 DELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNK 418 (492) T ss_pred HHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccceeeEEecCCC Confidence 6666666666777776666666789999999999999999999999999999999999999888654 467999999999 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh--hcccccCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ--RPQEDGSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~--~~~~d~~~ 456 (456) |.|.++.+++++|+. |++|++|+++++|++++ .+.|++|+++|.+........ 
..+.+..+ T Consensus 419 p~~~~e~a~~~~kl~--G~iS~et~l~~l~~v~d--~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~ 481 (492) T protein:vir:97 419 VANTELQVQTAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQTEYNKQLPNLDDGGADSAQ 481 (492) T ss_pred CCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCCc Confidence 999999999999984 89999999999999876 334555555544322221111 11111111 No 32 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=2.1e-78 Score=446.39 Aligned_cols=425 Identities=15% Similarity=0.051 Sum_probs=326.7 Q ss_pred HHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCC--CcccHHHH Q lcl|NC_021301. 12 VLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS--ADSDLALR 89 (456) Q Consensus 12 ~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~--~d~~~~~~ 89 (456) .|..++..+++||+++++||+|+|++...+....+.. ..++|+++||+++||++.++||+|+|+++... .+++..+. T Consensus 1 ~~~~~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~-~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~~ 79 (440) T protein:vir:95 1 MLAAFLGSQKQRLAILASYAQGDNFSILSGHRRLDDE-KADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLST 79 (440) T ss_pred ChhhHHHHHHHHHHHHHHHhccCCccccccccccccc-CCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHHH Confidence 7777788899999999999999999875554433333 34668999999999999999999999887543 34455667 Q ss_pred HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEE Q lcl|NC_021301. 
90 ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIV 169 (456) Q Consensus 90 l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~ 169 (456) ++++|++|+|+.++.++++++++||+||+++|.|++|.++++.++|++++|+||+....++.+++++|...+ ..+..+ T Consensus 80 l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~~~--~~~~~v 157 (440) T protein:vir:95 80 IKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIYAD--KVNMTV 157 (440) T ss_pred HHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEecC--ceEEEE Confidence 899999999999999999999999999999999999999999999999999999988888999999987655 346789 Q ss_pred EcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc-CCCCCCcHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 170 WSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHIDIINRINRAELQLLSTM 248 (456) Q Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-n~~g~s~~~~v~~liDa~~~~~s~~~~~~ 248 (456) |+++.+++|..... ..+.+......+|.++.+|||++. |++|+|+|+++++|||+||+++|++++.+ T Consensus 158 yt~~~~~~~~~~~~------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~lida~~~~~s~~~~~~ 225 (440) T protein:vir:95 158 YTKDKVITYKPYSN------------NSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYESEISLIDAYDAGQSDTANYM 225 (440) T ss_pred EeCCeEEEEEEecC------------CccceeecceeeccCceeeEEEeeCCCCCCCchhhhHHHHHHHHHHHHHHHHHH Confidence 99999988764321 122344455567888888888764 57899999999999999999999999999 Q ss_pred HHhhchhhhhhcCCCcccccccccchhhhhhhhhhh-ccceeccCCCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_021301. 249 AIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAA-PGALWELPPGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTP 326 (456) Q Consensus 249 ~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p 326 (456) +++++|+++++|...+....++.+........+... .......+.++++..+. 
+.+.+++...++.+...|+..+++| T Consensus 226 ~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p 305 (440) T protein:vir:95 226 SDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIP 305 (440) T ss_pred HHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCc Confidence 999999999999755443333333322111111110 00111123344443332 3456778888888889999999999 Q ss_pred hhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-----CCcccceeEEecCCCCcCHHHHHHH Q lcl|NC_021301. 327 LPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG-----ESVEDTVDVSFESPDRVTLGEKYAA 401 (456) Q Consensus 327 ~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~-----~~~~~~i~v~f~~~~~~~~~e~ad~ 401 (456) +..++..++|+||+||++++.+|.+||+++++.|+++|++++++++.+.+ ..+...++++|+++.|+|.++.||+ T Consensus 306 ~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~v~i~f~~~~p~~~~~~ad~ 385 (440) T protein:vir:95 306 NLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEANKLTFTFHPNIPQDVWTEIKA 385 (440) T ss_pred ccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccccceEEeCCCCCCCHHHHHHH Confidence 98888878899999999999999999999999999999999999887643 2244578999999999999999999 Q ss_pred HHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh-hcccccCC Q lcl|NC_021301. 402 ASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ-RPQEDGSR 456 (456) Q Consensus 402 ~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~-~~~~d~~~ 456 (456) ++|+ +|++|.+|+++++|+++++. |++|+++|.+.......+ .+..++.. 
T Consensus 386 ~~kl--~g~iS~et~~~~l~~~d~~~---E~~ri~~E~~~~~~~~~~~~~~~~~~~ 436 (440) T protein:vir:95 386 YIEA--GGEISQETLMENASFTDYKT---EHSRILKQGGSSDLEIGQIVGDADVGQ 436 (440) T ss_pred HHHH--hccCcHHHHHHhCCCCCcHH---HHHHHHHHHHHhhhhHHhhccCCCCCC Confidence 9998 48999999999999875422 344444443332222221 22233333 No 33 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=6.1e-78 Score=443.91 Aligned_cols=415 Identities=10% Similarity=0.007 Sum_probs=324.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCc Q lcl|NC_021301. 4 STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD 83 (456) Q Consensus 4 ~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d 83 (456) .|++ +|..|+.+|..+.+||+++++||+|+|+|+....+. . ...++|+++||+++||++.++||+|+|++++.+ + T Consensus 1 l~~~-~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~--~-~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~-~ 75 (429) T protein:vir:98 1 MTKD-LLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKE--Q-YKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHE-N 75 (429) T ss_pred CCHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccccccc--c-CCCcceeecchHHHHHHHHhhhhcccCceeecC-C Confidence 5555 577788999999999999999999999987655432 2 234668999999999999999999999998754 4 Q ss_pred ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCc Q lcl|NC_021301. 
84 SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAE 163 (456) Q Consensus 84 ~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~ 163 (456) +.....++++|+.|+|+.++.++++++++||+||+++|.+++|.+++++++|++++++||+....++.+++++|...++ T Consensus 76 ~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~- 154 (429) T protein:vir:98 76 KQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYNKGG- 154 (429) T ss_pred hHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEecCc- Confidence 4567789999999999999999999999999999999999999999999999999999998887889999999876554 Q ss_pred eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHHHHHHHHH Q lcl|NC_021301. 164 SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRINRAEL 242 (456) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liDa~~~~~s 242 (456) .....+|+.+.++.|.. ..+.+......+|+++.|||+++ +|++|+|+|+++++|+|+||+++| T Consensus 155 ~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liD~~d~~~s 219 (429) T protein:vir:98 155 VLEGSYSDASNITYFKD---------------GEKGIEIGESEPHPFDGVPMIEYVENEERQSLLASVVTLINAFNKAIS 219 (429) T ss_pred eEEEEEEeCceEEEEEe---------------cCCceEecccccccCCccceEEecCCCCCCCcHHHHHHHHHHHHHHHH Confidence 45566777776665532 11223444556777788888776 568999999999999999999999 Q ss_pred HHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccC----CCceeEeec-ccchHHHHHHHHHHHH Q lcl|NC_021301. 243 QLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELP----PGVDIWESQ-TNDFTPMLSAIKEHIR 317 (456) Q Consensus 243 ~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~d~~~~~~~-~~~~~~~~~~l~~~~~ 317 (456) ++++.++++++|+++++|........ .. ...+.++..+ .+++++.+. +.+.+.+...++.+.. 
T Consensus 220 ~~~~~~~~~~~p~~~i~g~~~~~~~~-------~~-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~ 287 (429) T protein:vir:98 220 EKANDVEYFADAYLKILGAELDDETL-------KS-----LRDTRIINLKDTDAQQLTVEFLQKPDADATQEHLLDRLEN 287 (429) T ss_pred HHHHHHHHhcCceeeeecCCCCcchh-------hh-----HhhCceeeccCCCCCCcceeEEeecCCHHHHHHHHHHHHH Confidence 99999999999999999976532111 00 1111222221 223343332 3456778888888888 Q ss_pred HHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CcccceeEEecCCCCc Q lcl|NC_021301. 318 QLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE----SVEDTVDVSFESPDRV 393 (456) Q Consensus 318 ~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~----~~~~~i~v~f~~~~~~ 393 (456) .|+..+++|...++.. +|+||+||++++.+|.+|+.++++.|+.+|++++++++++.+. .+..+++++|+++.|. T Consensus 288 ~i~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~v~f~~~~p~ 366 (429) T protein:vir:98 288 LIFRTAMVANISDESF-GTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIKYKFTRNLPA 366 (429) T ss_pred HHHHHhCccccCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccceEEeCCCCCc Confidence 8999999998777654 7899999999999999999999999999999999999887553 2335689999999999 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) |.++.|++++|+ +|++|++|+++++|++++ .+.|++|+++|.....+......+.+.+. 
T Consensus 367 ~~~~~a~~~~kl--~g~is~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 425 (429) T protein:vir:98 367 NLLEESQIAGNL--AGIVSEETQVGVLSIVEN--PQKEIERKNSDKSTLISRQAGGLNGQNTT 425 (429) T ss_pred CHHHHHHHHHHH--hccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhhcCCCCC Confidence 999999999997 489999999999999876 34456666665544333222222222222 No 34 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=5.2e-78 Score=444.26 Aligned_cols=427 Identities=14% Similarity=0.040 Sum_probs=328.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) ++..|++ +|++++.+|..+.+|++++.+||+|+|+|+..+.+. ....++|+++||+++||++.++||+|+|+++.. T Consensus 13 ~~~~~~~-~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~---~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~ 88 (499) T protein:vir:10 13 VNEPNIE-AINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFDN---ATVEAANVMVNHAKYITDMNVGFMTGNPVKYVA 88 (499) T ss_pred hhcCCHH-HHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcCc---CCCCcceeecchHHHHHHHHhhhhcccCceeec Confidence 5555544 688899999999999999999999999997654332 223467899999999999999999999999875 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc-----------------eEEEEEccceeEEEEe Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT-----------------ATITADSPETMVVSVD 143 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~-----------------~~i~~~~p~~~~~~~d 143 (456) + +++..+.++++|+.|+|+.++.++++.+++||+||+++|.+++|. 
+++..++|++++++|+ T Consensus 89 ~-~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~ 167 (499) T protein:vir:10 89 E-KGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCD 167 (499) T ss_pred C-ChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceEEEEEcccceEEEec Confidence 4 455678899999999999999999999999999999999999874 5689999999999999 Q ss_pred CCCCceEEEEEEEEEecC----CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc Q lcl|NC_021301. 144 PLQPWRIRSAMRWWRDLD----AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ 219 (456) Q Consensus 144 ~~~~~~~~~~~~~~~~~d----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~ 219 (456) +...+.+.+++++|...+ +...+..+|+++.++.|....... ..+.+......+|+++.||||+|. T Consensus 168 d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~----------~~~~~~~~~~~~~~~g~vPvv~~~ 237 (499) T protein:vir:10 168 DTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTME----------VSANDPIVYDGENLFGAVPIIEFR 237 (499) T ss_pred CCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCcc----------ccCcceecccccCCCCccceEEec Confidence 988888999999887653 234567899999998886432111 111223344566788888888775 Q ss_pred -CCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec--cCCCce Q lcl|NC_021301. 220 -NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE--LPPGVD 296 (456) Q Consensus 220 -n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~d~~ 296 (456) |++|+|+|+++++|||+||+++|++++.++++++|+++++|+..+... ... . ....+.++. 
.+.+++ T Consensus 238 n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~-----~~~---~--~~~~~~~~~~~~~~~~d 307 (499) T protein:vir:10 238 NNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDK-----DDI---Q--RLKRGAIEAPPREEGAD 307 (499) T ss_pred CCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccc-----chh---h--hhhhcceeccCCCCCCc Confidence 578999999999999999999999999999999999999997654221 111 1 112233333 345566 Q ss_pred eEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021301. 297 IWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE 375 (456) Q Consensus 297 ~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 375 (456) +..+. +.+.+++...++.+...|+..+++|...++..++|+||+||++++.++.+||.++++.|+++|++++++++.+. T Consensus 308 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~ 387 (499) T protein:vir:10 308 IEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIV 387 (499) T ss_pred ceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65553 34567777778888888888888887777666789999999999999999999999999999999999998875 Q ss_pred CC----CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHh----h--- Q lcl|NC_021301. 376 GE----SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFA----G--- 444 (456) Q Consensus 376 ~~----~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~----~--- 444 (456) +. .+...++++|+++.|.|.++.++++++| +|++|.+|+++++|++++. +.|++|+++|..... . 
T Consensus 388 ~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~--~~E~~ri~~E~~~~~~~~~~~~~ 463 (499) T protein:vir:10 388 NIKGANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKYTYSWLPDVDNP--QDVIDEMNQQDAETIKKNQEALR 463 (499) T ss_pred hccCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHHhhhc Confidence 43 2345789999999999999999999998 4899999999999997662 334455544432211 0 Q ss_pred -hhhhhcccccCC Q lcl|NC_021301. 445 -NSVQRPQEDGSR 456 (456) Q Consensus 445 -~~~~~~~~d~~~ 456 (456) .......+++.. T Consensus 464 ~~~~~~~~~~~~~ 476 (499) T protein:vir:10 464 GQDPDRLELEDKQ 476 (499) T ss_pred cCCCCCCCCCCCC Confidence 000000011110 No 35 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=5.1e-78 Score=444.32 Aligned_cols=432 Identities=17% Similarity=0.124 Sum_probs=320.0 Q ss_pred CC-----CCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccC-cccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 1 MT-----ASTPAEWLPVLTKRIDD-GMSRVRLLARYSNGDA-PLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~-----~~t~~~~~~~l~~~~~~-~~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) +. +....+++..++.+|.. +.+||+++.+||+|+| .+...+.. + .....++|+++||+++||++.++||+| T Consensus 30 ~~~~~~~~~~~~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~-~-~~~~~~~ri~~n~~k~Ivd~~~~yl~g 107 (501) T protein:vir:96 30 ADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR-K-DNEMADKRAVHNYGRMISKFKTGYLAG 107 (501) T ss_pred ccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCcccc-C-ccccccceeecchHHHHHHHHhhhhcc Confidence 11 11223578889988864 5689999999999985 45433322 2 233446789999999999999999999 Q ss_pred CCeecCCCCc---ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceE Q lcl|NC_021301. 
74 NGITVGGSAD---SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRI 150 (456) Q Consensus 74 ~~~~~~~~~d---~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~ 150 (456) +|+++.+..+ +.....++++|+.|+|+.++.++++++++||+||+++|++++|.+++++++|++++|+||+....++ T Consensus 108 ~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v~d~~~~~~~ 187 (501) T protein:vir:96 108 NPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLEDNS 187 (501) T ss_pred cCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEEEEcCCCCCce Confidence 9999866432 2345668899999999999999999999999999999999999999999999999999999887889 Q ss_pred EEEEEEEEecC--CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcH Q lcl|NC_021301. 151 RSAMRWWRDLD--AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEV 227 (456) Q Consensus 151 ~~~~~~~~~~d--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~ 227 (456) .+++++|...+ +...+..+|+++.++.|... +.+......+|.++.|||+++ +|++|+|+| T Consensus 188 ~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~----------------~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~ 251 (501) T protein:vir:96 188 IAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDAS----------------DDFNEISVTTHAFGTVPITEYLNNIDGIGDY 251 (501) T ss_pred EEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeC----------------CCceeccccccCCCccceEEecCCccCCCch Confidence 99999987543 55677889999999887531 122233345677777777766 678999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhh-hhccceeccCCCceeEeec-ccch Q lcl|NC_021301. 228 EPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFE-AAPGALWELPPGVDIWESQ-TNDF 305 (456) Q Consensus 228 ~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~~~-~~~~ 305 (456) +++++|||+||+++|++++.++++++|+++++|...... ++.+........+. ...+.......++++..+. +.+. 
T Consensus 252 e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 329 (501) T protein:vir:96 252 ETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPK--GMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDV 329 (501) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCc--ccchhhhhhcCeeeecccccccccccCcceeeEeccCCH Confidence 999999999999999999999999999999999764322 11111111111111 1112222233344443332 2344 Q ss_pred HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------Cc Q lcl|NC_021301. 306 TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE------SV 379 (456) Q Consensus 306 ~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~------~~ 379 (456) +++...++.+...|+..+++|+..++..++|+||+||++++.+|.+||..+++.|+.+|++++++++++.+. .+ T Consensus 330 ~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d 409 (501) T protein:vir:96 330 SGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFD 409 (501) T ss_pred HHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 556666667777777778888888887778999999999999999999999999999999999998876432 23 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhh---------hhhhhc Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAG---------NSVQRP 450 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~---------~~~~~~ 450 (456) ...++++|+++.|.|.++.|++++|+. |++|++|+++++|++++. +.|++|+++|.+.+.. ...... T Consensus 410 ~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~iS~et~~~~l~~v~D~--~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 485 (501) T protein:vir:96 410 ESLLKITFTPNLPKSLNEQVSILTGLG--GQVSQETALSLSGLVESP--NEELDKINKEMSEIDFKGYSNDFNEHVGKYT 485 (501) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCH--HHHHHHHHHHHHHhhccccccchhhcccccC Confidence 346899999999999999999999985 899999999999998763 3455555544432211 111111 Q ss_pred ccccCC Q lcl|NC_021301. 
451 QEDGSR 456 (456) Q Consensus 451 ~~d~~~ 456 (456) ++.+++ T Consensus 486 ~~~~e~ 491 (501) T protein:vir:96 486 DEVKET 491 (501) T ss_pred CcCCCC Confidence 111111 No 36 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=1.1e-77 Score=442.57 Aligned_cols=432 Identities=9% Similarity=0.038 Sum_probs=328.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccch----hhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSA----AWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~----~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) -...+..++|+.|+.+|..+.+|+.++++||+|+|+|..++..... .....++|+++||+++||++.++||+|+|+ T Consensus 23 ~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~ 102 (474) T protein:vir:96 23 PKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPV 102 (474) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCc Confidence 3334556799999999999999999999999999999876543221 122346689999999999999999999999 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) +++.+ +++..+.+++++ .|+++..+.++++.++++|+||+++|.+++|.+++.+++|++++|+||+.....+.+++++ T Consensus 103 ~~~~~-~~~~~~~l~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~ 180 (474) T protein:vir:96 103 TYAHD-DDKVLDVIHQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFIRI 180 (474) T ss_pred eeccC-ChHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEE Confidence 98754 444556666665 5889999999999999999999999999999999999999999999998887889999999 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liD 235 (456) |.... ..+..+|+++.++.|....... .........+......+|+++.+|++++ +|++|.|+|+++++||| T Consensus 181 ~~~~~--~~~~~vy~~~~i~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liD 253 (474) T protein:vir:96 181 FTFNG--ETKVEYWTAETVTYYVYENGGL-----IPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVD 253 (474) T ss_pred EeecC--eeEEEEEeCCeEEEEEEcCCce-----eeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHH Confidence 87533 4567899999988875422111 1111112223333445667777777776 56899999999999999 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAIKE 314 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~ 314 (456) +||.++|++++.++++++|+++++|++.... +..... .....++.++.++++..+. +.+.+.+...++. 
T Consensus 254 a~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~-----~~~~~~-----~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:96 254 AIDKRLSDVQNMFDESVELIYILRGYEGEDL-----SEFMEG-----LKYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHhhcchhhhcCCCcccc-----cchhhh-----hhccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 9999999999999999999999999764321 111111 1122344445555554442 3456777777888 Q ss_pred HHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccceeEEecCCCCc Q lcl|NC_021301. 315 HIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-VEDTVDVSFESPDRV 393 (456) Q Consensus 315 ~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~-~~~~i~v~f~~~~~~ 393 (456) +..+|+..+++|+..+++.++|+||+||+++++++.+||.++++.|+++|++++++++++.|.. +...++++|+++.|. T Consensus 324 l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~ 403 (474) T protein:vir:96 324 MRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMV 403 (474) T ss_pred HHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCcc Confidence 8888888888998777777789999999999999999999999999999999999999988754 446799999999999 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh--hhccccc---------CC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV--QRPQEDG---------SR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~--~~~~~d~---------~~ 456 (456) |.++.|+++++ +|++|++|+++++|++++ .+.|++|+++|......... 
.....++ ++ T Consensus 404 ~~~e~a~~~~~---~giiS~et~~~~lp~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 472 (474) T protein:vir:96 404 NDLEQSQIGAQ---SQYLSKETLVRHHPWVDD--PKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQ 472 (474) T ss_pred CHHHHHHHHHH---cCCCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccc Confidence 99999998764 599999999999999876 34456666555332221111 1111111 11 No 37 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=1.1e-77 Score=442.57 Aligned_cols=432 Identities=9% Similarity=0.038 Sum_probs=328.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccch----hhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSA----AWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~----~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) -...+..++|+.|+.+|..+.+|+.++++||+|+|+|..++..... .....++|+++||+++||++.++||+|+|+ T Consensus 23 ~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~ 102 (474) T protein:vir:95 23 PKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPV 102 (474) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhhcccCc Confidence 3334556799999999999999999999999999999876543221 122346689999999999999999999999 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) +++.+ +++..+.+++++ .|+++..+.++++.++++|+||+++|.+++|.+++.+++|++++|+||+.....+.+++++ T Consensus 103 ~~~~~-~~~~~~~l~~~~-~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~ir~ 180 (474) T protein:vir:95 103 TYAHD-DDKVLDVIHQVL-DTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFIRI 180 (474) T ss_pred eeccC-ChHHHHHHHHHH-hccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEEEE Confidence 98754 444556666665 5889999999999999999999999999999999999999999999998887889999999 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liD 235 (456) |.... ..+..+|+++.++.|....... .........+......+|+++.+|++++ +|++|.|+|+++++||| T Consensus 181 ~~~~~--~~~~~vy~~~~i~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d~e~v~~liD 253 (474) T protein:vir:95 181 FTFNG--ETKVEYWTAETVTYYVYENGGL-----IPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSDIWMYKSFVD 253 (474) T ss_pred EeecC--eeEEEEEeCCeEEEEEEcCCce-----eeccccccccccCcccccCCCccceEEecCCCCCCCchHHHHHHHH Confidence 87533 4567899999988875422111 1111112223333445667777777776 56899999999999999 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAIKE 314 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~ 314 (456) +||.++|++++.++++++|+++++|++.... +..... .....++.++.++++..+. +.+.+.+...++. 
T Consensus 254 a~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~-----~~~~~~-----~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:95 254 AIDKRLSDVQNMFDESVELIYILRGYEGEDL-----SEFMEG-----LKYYKAINVSSDGGVETIQVEVPVASTKEYLDM 323 (474) T ss_pred HHHHHHHHHHHHHHHhhcchhhhcCCCcccc-----cchhhh-----hhccceeeccCCCceeEEeccCCHHHHHHHHHH Confidence 9999999999999999999999999764321 111111 1122344445555554442 3456777777888 Q ss_pred HHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccceeEEecCCCCc Q lcl|NC_021301. 315 HIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-VEDTVDVSFESPDRV 393 (456) Q Consensus 315 ~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~-~~~~i~v~f~~~~~~ 393 (456) +..+|+..+++|+..+++.++|+||+||+++++++.+||.++++.|+++|++++++++++.|.. +...++++|+++.|. T Consensus 324 l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~d~~~i~i~f~~~~p~ 403 (474) T protein:vir:95 324 MRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKLDAKEIEITFNFNVMV 403 (474) T ss_pred HHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCcc Confidence 8888888888998777777789999999999999999999999999999999999999988754 446799999999999 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh--hhccccc---------CC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV--QRPQEDG---------SR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~--~~~~~d~---------~~ 456 (456) |.++.|+++++ +|++|++|+++++|++++ .+.|++|+++|......... 
.....++ ++ T Consensus 404 ~~~e~a~~~~~---~giiS~et~~~~lp~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 472 (474) T protein:vir:95 404 NDLEQSQIGAQ---SQYLSKETLVRHHPWVDD--PKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQ 472 (474) T ss_pred CHHHHHHHHHH---cCCCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccc Confidence 99999998764 599999999999999876 34456666555332221111 1111111 11 No 38 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=6.5e-78 Score=443.75 Aligned_cols=431 Identities=17% Similarity=0.124 Sum_probs=322.0 Q ss_pred CCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccC-cccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRID-DGMSRVRLLARYSNGDA-PLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~-~~~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) .+.+. -+++..++.+|. .+.+|++++.+||+|+| .|...+.. ......++|+++||+++||++.++||+|+|+++ T Consensus 37 ~~~~~-~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~--~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~ 113 (502) T protein:vir:48 37 LMVNN-WELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR--KDNEMADKRAVHNYGRMISKFKTGYLAGNPIRV 113 (502) T ss_pred hcccc-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc--cccccccceeecchHHHHHHHHhhhhcccCeeE Confidence 22222 356888888886 45789999999999975 55543332 223345678999999999999999999999988 Q ss_pred CCCCcc---cHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 
79 GGSADS---DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 79 ~~~~d~---~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) +...++ ...+.++++|+.|+|+.++.++++.+++||+||+++|.+++|.+++++++|++++|+||+....++.++++ T Consensus 114 ~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ir 193 (502) T protein:vir:48 114 EYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLEDNSIAAVR 193 (502) T ss_pred ecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEEEE Confidence 765332 23456889999999999999999999999999999999999999999999999999999887778999999 Q ss_pred EEEec--CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHH Q lcl|NC_021301. 156 WWRDL--DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHID 232 (456) Q Consensus 156 ~~~~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~ 232 (456) +|... ++...+..+|+++.++.+.. .+.+......+|.++.+||+++ +|++|+|+|+++++ T Consensus 194 ~~~~~~~~~~~~~~~iyt~~~i~~~~~----------------~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~ 257 (502) T protein:vir:48 194 YYNRGTLQNAKDVVEIYTNQHIYTLDA----------------SDSFNEISVTPHAFGTVPITEFLNNADGIGDYETELY 257 (502) T ss_pred EEEEeecCCcEEEEEEEeCCeEEEEEe----------------CCceeeccceecCCCccceEEecCCCCCCCchhhhHH Confidence 98643 34456778999999887742 1223333445566666666665 68899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhh-hccceeccCCCceeEeec-ccchHHHHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEA-APGALWELPPGVDIWESQ-TNDFTPMLS 310 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~d~~~~~~~-~~~~~~~~~ 310 (456) |||+||+++|++++..+++++|+++++|..... .++.+........+.. ..+.....++++++..+. +.+.+.+.. 
T Consensus 258 liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~ 335 (502) T protein:vir:48 258 LIDLYDSAESDTANHMSDMADAILAIYGDLALP--QGMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEA 335 (502) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeeecCcccc--cccchhhhhhcceeeccccccccccccCcceeEeeecCCHHHHHH Confidence 999999999999999999999999999975432 1222222211111111 111122233444544432 345566777 Q ss_pred HHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------Cccccee Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE------SVEDTVD 384 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~------~~~~~i~ 384 (456) .++.+..+|+..+++|+..++..++|+||+||++++.+|.+|+..+++.|+++|++++++++++.+. .+..+++ T Consensus 336 ~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~ 415 (502) T protein:vir:48 336 YKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESRLK 415 (502) T ss_pred HHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccce Confidence 7888888888889999888888788999999999999999999999999999999999999876442 2334689 Q ss_pred EEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhh-------cccc---- Q lcl|NC_021301. 385 VSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQR-------PQED---- 453 (456) Q Consensus 385 v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~-------~~~d---- 453 (456) ++|++++|+|.++.|++++|+. |++|++|+++++|++++ ++.|++|+++|.+......... ...| T Consensus 416 i~f~~~~p~d~~e~a~~~~kl~--g~iS~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~d~~~e 491 (502) T protein:vir:48 416 ITFTPNLPKSLYEQVSILNDLG--GQVSQETALSLSGLVEN--PTEELDKINEESSKIDFKGYPSYFYDNVGKYTDEVKE 491 (502) T ss_pred EEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCC--HHHHHHHHHHHHHhhhhhcccccccccccccCCCccC Confidence 9999999999999999999984 89999999999999876 3345666665543321111110 0011 Q ss_pred -----cCC Q lcl|NC_021301. 
454 -----GSR 456 (456) Q Consensus 454 -----~~~ 456 (456) ++. T Consensus 492 ~~~~~~~~ 499 (502) T protein:vir:48 492 THTDDFER 499 (502) T ss_pred CCCcCcCC Confidence 111 No 39 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=8.7e-78 Score=443.06 Aligned_cols=418 Identities=13% Similarity=0.095 Sum_probs=325.7 Q ss_pred CCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTP--AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~--~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) ++.+.+ .+.|..|+.+|..+++|++++++||+|+|+|+..+.+.+ .+.++|+++||+++||++.++||+|+|+++ T Consensus 11 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~---~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~ 87 (452) T protein:vir:36 11 FSKDEPITVEVVTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAKDS---WKPDNRLAVNFTKYIVDTFTGYFNGIPVKK 87 (452) T ss_pred cCCccCCCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCccccc---cCccceeecchHHHHHHHHhhhhcccCcee Confidence 222221 357888999999999999999999999999976654322 234668999999999999999999999998 Q ss_pred CCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 79 GGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 79 ~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) ..+ +++..+.++++|+.|+|+.++.++++.++++|+||+++|.|++|.+++.+++|++++|+||+.....+.+++++|. 
T Consensus 88 ~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~ 166 (452) T protein:vir:36 88 SHS-DKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVYDDTVKQEPLFAVRYGV 166 (452) T ss_pred ecC-ChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 754 5566788999999999999999999999999999999999999999999999999999999988888999999998 Q ss_pred ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHHHH Q lcl|NC_021301. 159 DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRI 237 (456) Q Consensus 159 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liDa~ 237 (456) +.++ ..+..+|+++.++.+... .+.+......+|+++.|||+++ +|++|+|+|+++++|||+| T Consensus 167 ~~~~-~~~~~vyt~~~i~~~~~~---------------~~~~~~~~~~~~~~g~iPvv~~~n~~~g~sd~e~v~~liDa~ 230 (452) T protein:vir:36 167 DEDK-KLQGEVYTLLETIKISGE---------------NDEISFGEGTYNPYPDLPVVEFYFNEERMSIFESVISLVNAF 230 (452) T ss_pred ecCc-eEEEEEEecCeEEEEEEc---------------CCceEEecceeccCCcccEEEecCCCCCCcchHHHHHHHHHH Confidence 7665 556789999998887531 1233344455677777787776 5689999999999999999 Q ss_pred HHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceecc-------CCCceeEeecccchHHHHH Q lcl|NC_021301. 238 NRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWEL-------PPGVDIWESQTNDFTPMLS 310 (456) Q Consensus 238 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~d~~~~~~~~~~~~~~~~ 310 (456) |+++|++++.++++++|+++++|........ +. . ..+.++.. ++++++... +.+.+++.. 
T Consensus 231 d~~~s~~~~~~~~~~~p~~~~~g~~~~~~~~---~~-~--------~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~ 297 (452) T protein:vir:36 231 NKAISEKANDVDYFSDQYLTFLGAAVEEEDL---KN-I--------RSNRVINYYADGEGKNVDVKFLEK-PDSDSQTEN 297 (452) T ss_pred HHHHHHHHHHHHHhcCceeEeecCCcCchhh---hh-h--------hhcceEEecCCCCccCCcceeEee-cCCHHHHHH Confidence 9999999999999999999999976432111 00 0 01112221 123333332 335677778 Q ss_pred HHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CcccceeEE Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE----SVEDTVDVS 386 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~----~~~~~i~v~ 386 (456) .++.+...|+..+++|...++.. +|+||+||++++++|.+||.++++.|+.+|++++++++++.+. .+..++++. T Consensus 298 ~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i~ 376 (452) T protein:vir:36 298 LLDRLTKLIFQTTMVANISDESF-GSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEYT 376 (452) T ss_pred HHHHHHHHHHHHhCccccCcccc-cCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceEE Confidence 88888888888899998776654 7899999999999999999999999999999999999876543 234578999 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhh-hhhhcccccCC Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGN-SVQRPQEDGSR 456 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~-~~~~~~~d~~~ 456 (456) |+++.|.|.++.|++++|+ +|++|.+|+++.+|++++ .+.|++|+++|....... ....+..++.. 
T Consensus 377 f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 443 (452) T protein:vir:36 377 FTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPD--VQAEMEKIKKEEASTAIFDKDKQPSEKGTD 443 (452) T ss_pred eCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhccCCCCccc Confidence 9999999999999999987 489999999999999865 334555655554332221 11122222222 No 40 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=1.3e-77 Score=442.04 Aligned_cols=433 Identities=11% Similarity=0.036 Sum_probs=326.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccc----hhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTS----AAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~----~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) -..++..++|..|+.+|..+++|++++++||+|+|+|...++... ....+.++|+++||+++||++.++||+|+|+ T Consensus 31 ~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~G~p~ 110 (483) T protein:vir:12 31 NKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPI 110 (483) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccccccccccccccccchHHHHHHHHhhhhcccCc Confidence 222334568999999999999999999999999999876554321 1223346689999999999999999999999 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) ++..+ |++..+.++++|+ |+++..+.++++.+++||+||+++|.|++|.+++++++|++++++||+....++.+++++ T Consensus 111 ~~~~~-d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir~ 188 (483) T protein:vir:12 111 AFKHT-DDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIRM 188 (483) T ss_pred eeccC-ChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEE Confidence 98654 4556666777664 789999999999999999999999999999999999999999999998877889999999 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liD 235 (456) |...+. ....+|+++.++.|........ .......+.+.. ...+|.++.+||+++ +|++|+|+|+++++||| T Consensus 189 ~~~~~~--~~~~~y~~~~v~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~liD 261 (483) T protein:vir:12 189 YKLENE--TKVEYWDKVTVNYYVYENGSLI----PDYSNNLENSKT-HFSTGSWGKIPFIPFKNNDLEISDIFMYKTLID 261 (483) T ss_pred EEeecc--eEEEEEecCeEEEEEEeCCeee----eccccccccccc-ccccCCCCccceEEecCCCCCCCchhhHHHHHH Confidence 976543 3568999988887753221110 011111122222 335566777777766 56899999999999999 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAIKE 314 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~ 314 (456) +||.++|++++.++++++|+++++|.+.... +... ..+ ....++..+.++++..+. +.+.+++...++. 
T Consensus 262 a~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~-----~~~~---~~~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 331 (483) T protein:vir:12 262 AYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFK---RLL--RYYGAIKVSDNGGVDTIQVEVPVENSKKYLDE 331 (483) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCcccc-----hhHH---Hhh--hhccccccCCCCcceEEeecCCHHHHHHHHHH Confidence 9999999999999999999999999754321 1111 111 122234444555544332 3456777777777 Q ss_pred HHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCCCc Q lcl|NC_021301. 315 HIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-EDTVDVSFESPDRV 393 (456) Q Consensus 315 ~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~i~v~f~~~~~~ 393 (456) +...|+..+++|..+++..++|+||+||++++.+|..||.++++.|+.+|++++++++++.+... ...++++|+++.|. T Consensus 332 l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~p~ 411 (483) T protein:vir:12 332 LYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNKVA 411 (483) T ss_pred HHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccceeeEEeCCCCCC Confidence 77888888888887777777899999999999999999999999999999999999998888654 45799999999999 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhh--hhhcccccCC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNS--VQRPQEDGSR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~--~~~~~~d~~~ 456 (456) |.++.|++++++. |++|++|+++.+|++++ .+.|++|+++|........ 
......|+++ T Consensus 412 ~~~~~a~~~~kl~--GiiS~et~~~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~ 472 (483) T protein:vir:12 412 NTELQVQTAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQ 472 (483) T ss_pred CHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccccccCCcc Confidence 9999999999984 89999999999999866 3345555555543222211 1111122222 No 41 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=8.8e-78 Score=443.02 Aligned_cols=439 Identities=10% Similarity=0.047 Sum_probs=338.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccch------------hhhhhhhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSA------------AWRSFQREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~------------~~~~~~~k~~~n~~~~iVd~~a 68 (456) |+.+.-.++|..++.+|..+++++.++++||+|+|+|++.++.... .....++|+++||+++||++.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 9999999999999999999999999999999999999865433211 1223467899999999999999 Q ss_pred hhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC-CCceEEEEEccceeEEEEeCCCC Q lcl|NC_021301. 69 DRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD-DGTATITADSPETMVVSVDPLQP 147 (456) Q Consensus 69 ~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~-dg~~~i~~~~p~~~~~~~d~~~~ 147 (456) +|++|+|++++.+ +++..+.++.++ .|+|+.++.++++.++++|+||+++|.++ +|.+++.+++|++++|+||+... 
T Consensus 81 ~yl~G~p~~~~~~-~~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~ 158 (471) T protein:vir:10 81 AYALTYPPTFDVD-DKKVNDMIVDVL-GDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLD 158 (471) T ss_pred hhhcccCceeccC-ChHHHHHHHHHH-hcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCC Confidence 9999999998754 444555565555 58999999999999999999999999984 69999999999999999998877 Q ss_pred ceEEEEEEEEEecC----CceEEEEEEcCCeEEEEEEeeeeccccc-----ceeeccCCCceeecccccccCceeEEEEc Q lcl|NC_021301. 148 WRIRSAMRWWRDLD----AESDFAIVWSGDGWQKFARPCFVQSSSR-----RRLVTRISDSWVPVGDAVVTGSPPPVVVY 218 (456) Q Consensus 148 ~~~~~~~~~~~~~d----~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~pvv~~ 218 (456) .++.+++++|...+ ....+..+|++++++.|........... ........+.+.......|+++.+|||++ T Consensus 159 ~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~ 238 (471) T protein:vir:10 159 KKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPF 238 (471) T ss_pred CceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEe Confidence 78999999986532 3455678999999988865332211111 11112223344455566788888888877 Q ss_pred -cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec------c Q lcl|NC_021301. 219 -QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE------L 291 (456) Q Consensus 219 -~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~ 291 (456) +|.+|.|+|+++++|||+||.++|++++..+++++|+++++|++.... +...... . ..+.+.. . 
T Consensus 239 ~n~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~-----~~~~~~~---~-~~~~i~~~~~~~~~ 309 (471) T protein:vir:10 239 KNNEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDK-----QEFLEDL---K-RYKMIKMDNDGMGD 309 (471) T ss_pred ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccc-----chhHHHh---h-cCCeEEecCCCCcc Confidence 467899999999999999999999999999999999999999754321 1111111 0 1111111 1 Q ss_pred CCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 292 PPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKA 371 (456) Q Consensus 292 ~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~ 371 (456) +.++++...+ .+.+++...++.+.+.|+..+++|+..++.. +|+||+||++++.++.+||.++++.|+++|+++++++ T Consensus 310 ~~~~~~l~~~-~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li 387 (471) T protein:vir:10 310 QSGVTTIAID-IPTEARNLILERTKKQIFISGQGVNPETDKL-GNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMI 387 (471) T ss_pred CccceEEeec-CChHHHHHHHHHHHHHHHHHhCCcCCCcccc-cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2234444432 3567788888888899999999998777654 7899999999999999999999999999999999999 Q ss_pred HHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh--h Q lcl|NC_021301. 372 LQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ--R 449 (456) Q Consensus 372 ~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~--~ 449 (456) +++.+..+..+++++|++++|.|.++.+++++++ +|++|.+|+++++|++++ ++.|++|+++|.+...+...+ . 
T Consensus 388 ~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et~~~~~p~v~D--~~~E~eri~~E~~~~~~~~~~~~~ 463 (471) T protein:vir:10 388 LKHLGLSDKLKIKQTWTRNSINNDTEMAQVVSTL--ATITSRENVAKSNPIVED--WQDELRLQKAEQEGRSEKLYDMEE 463 (471) T ss_pred HHHhccCCCceeEEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhcccccCC Confidence 9998888888899999999999999999999987 489999999999999876 445677777765544332222 1 Q ss_pred cccccCC Q lcl|NC_021301. 450 PQEDGSR 456 (456) Q Consensus 450 ~~~d~~~ 456 (456) ..+|.+. T Consensus 464 ~~~~~e~ 470 (471) T protein:vir:10 464 VEHESEV 470 (471) T ss_pred CCCcccc Confidence 1222222 No 42 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=1.6e-77 Score=441.68 Aligned_cols=438 Identities=14% Similarity=0.061 Sum_probs=323.9 Q ss_pred CCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRI-DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~-~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) -+.+++++ |..++.+| ..+++|++++++||+|+|++...+...++..+. ++|+++||+++||++.++||+|+|+++. T Consensus 37 ~~~~~~~~-i~~~i~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~-~~ki~~n~~k~Iv~~~~~yl~g~p~~~~ 114 (511) T protein:vir:96 37 DLLQNVNE-VSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMA-DNRVAHDYASYISDFINGYFLGNPIQYQ 114 (511) T ss_pred hhhccHHH-HHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcCcccccC-cceeecchHHHHHHHHHhhhccCCceee Confidence 22334444 55555555 567899999999999999998766655555444 5689999999999999999999999987 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 
80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) .+ +++..+.++++|+.|+|+.++.++++.+++||+||+++|.|++|.+++++++|++++|+||+....++.+++++|.. T Consensus 115 ~~-~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~ 193 (511) T protein:vir:96 115 DD-DKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRT 193 (511) T ss_pred cC-chHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEEEEEEEe Confidence 54 45577889999999999999999999999999999999999999999999999999999999887889999999875 Q ss_pred cC------CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc-CCCCCCcHhHHHH Q lcl|NC_021301. 160 LD------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ-NPDGMGEVEPHID 232 (456) Q Consensus 160 ~d------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~-n~~g~s~~~~v~~ 232 (456) .+ +...+..+|+++.++.|......... ..+......+|+++.+||+++. |++|+|+|+++++ T Consensus 194 ~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~----------~~~~~~~~~~~~~~~vPvv~~~nn~~g~gd~e~v~~ 263 (511) T protein:vir:96 194 KPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK----------LTPRENGFESHSFERMPITEFSNNERRKGDYEKVIT 263 (511) T ss_pred eeccccccceEEEEEEEeCCcEEEEEecCCCccc----------ccccccccccccCCceeeEEecCCCCCCCchhhhHH Confidence 32 22345679999999887543211100 0111223345677777777765 6889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc-c--ccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPK-V--DENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPM 308 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~ 308 (456) |||+||.++|++++..+++++|+++++|....... . ...+..+....... ..+......+++++..+. 
+.+.+++ T Consensus 264 liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~ 342 (511) T protein:vir:96 264 LIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKEANVLFLEPTVY-ADSEGRETEGSVDGGYIYKQYDVQGT 342 (511) T ss_pred HHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcccccccceecccccc-cccccccCCCCcceeEEeecCCHHHH Confidence 99999999999999999999999999996432111 0 01111110000001 111111233445555443 3456777 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-------Cccc Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-------SVED 381 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-------~~~~ 381 (456) ...++.+...|+..+++|+..++..++|+||+||+++++++.++|.++++.|+.+|++++++++++.+. .+.. T Consensus 343 e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~ 422 (511) T protein:vir:96 343 EAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFN 422 (511) T ss_pred HHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccc Confidence 777778888888888899888877778999999999999999999999999999999999998765331 2234 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhh----h--hhcccccC Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNS----V--QRPQEDGS 455 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~----~--~~~~~d~~ 455 (456) .++++|+++.|.|.++.+++++++ +|++|++|+++.+|++++ .+.|++|+++|........ . ..+..+++ T Consensus 423 ~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:96 423 TVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred cceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 789999999999999999999987 489999999999999876 3345555555543222211 1 11111122 Q ss_pred C Q lcl|NC_021301. 
456 R 456 (456) Q Consensus 456 ~ 456 (456) . T Consensus 499 ~ 499 (511) T protein:vir:96 499 Q 499 (511) T ss_pred C Confidence 2 No 43 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=1.7e-77 Score=441.48 Aligned_cols=431 Identities=16% Similarity=0.143 Sum_probs=317.6 Q ss_pred CCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHhcccCc-ccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRID-DGMSRVRLLARYSNGDAP-LPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~-~~~~r~~~~~~YY~g~~~-i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) .+..+ -++|..++.+|. .+.+|++++.+||+|+|. +...+. ..+ ....++|+++||+++||++.++||+|+|+++ T Consensus 36 ~~~~~-~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~-~~~-~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~ 112 (501) T protein:vir:27 36 LMVNN-WELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGR-RKD-REMADKRAVHNYGRMISKFKTGYLAGNPIRV 112 (501) T ss_pred ccccc-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCc-cCc-cccccceeccchHHHHHHHHhhhhcccCeeE Confidence 22222 246888888885 557899999999999864 433322 222 2344678999999999999999999999998 Q ss_pred CCCCc---ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 
79 GGSAD---SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 79 ~~~~d---~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) ..+.+ +...+.++++|+.|+|+.++.++++.+++||+||+++|.+++|++++++++|++++|+||+....++.++++ T Consensus 113 ~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v~d~~~~~~~~~~ir 192 (501) T protein:vir:27 113 EYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFVIYDNSLEDNSIAAVR 192 (501) T ss_pred ecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEEEecCCCCCceEEEEE Confidence 76533 223456788999999999999999999999999999999999999999999999999999988888999999 Q ss_pred EEEec--CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHH Q lcl|NC_021301. 156 WWRDL--DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHID 232 (456) Q Consensus 156 ~~~~~--d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~ 232 (456) +|... ++...+..+|+++.++.|... +.+......+|+++.+||+++ +|++|+|+|+++++ T Consensus 193 ~~~~~~~~~~~~~~~vyt~~~v~~~~~~----------------~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~ 256 (501) T protein:vir:27 193 YYNRGTLQNAKDVVEIYTNEHIYTLDAS----------------DDFNEISVTTHAFGTVPITEFLNNVDGIGDYETELY 256 (501) T ss_pred EEEeeecCCcEEEEEEEeCCeEEEEEeC----------------CceeeccccccCCCcccEEEecCCCCCCCchhhhHH Confidence 98753 355677889999998877531 122333445677777777765 67899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhh-hhccceeccCCCceeEeec-ccchHHHHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFE-AAPGALWELPPGVDIWESQ-TNDFTPMLS 310 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~~~-~~~~~~~~~ 310 (456) |||+||+++|++++..+++++|+++++|...... ++.+........+. ...+.....+.+++++.+. +.+.+++.. 
T Consensus 257 liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 334 (501) T protein:vir:27 257 LIDLYDSAESDTANHMSDMADAILAIYGDLALPK--GMQASDMKRTRLMQLKPPKSADGKEGTVKAEYLTKSYDVSGAEA 334 (501) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeeeecCccCCc--ccchhhhhhcCceeecccccccCCCCCcceeeeeccCCHHHHHH Confidence 9999999999999999999999999999754321 11111111111111 1112222233445554432 334555666 Q ss_pred HHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------Cccccee Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE------SVEDTVD 384 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~------~~~~~i~ 384 (456) .++.+...|+..+++|+..++..++|+||+||++++.+|.+||..+++.|+.+|++++++++++.+. .+...++ T Consensus 335 ~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~ 414 (501) T protein:vir:27 335 YKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDESLLK 414 (501) T ss_pred HHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccce Confidence 6677777777778888877777778999999999999999999999999999999999998876432 2334689 Q ss_pred EEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHH-----Hhhhhhhhccc--ccCC Q lcl|NC_021301. 385 VSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITL-----FAGNSVQRPQE--DGSR 456 (456) Q Consensus 385 v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~-----~~~~~~~~~~~--d~~~ 456 (456) ++|+++.|.|.++.|++++|+. |++|++|+++++|++++ .+.|++|+++|... ........... 
|.++ T Consensus 415 v~f~~~~p~n~~e~ad~~~kl~--g~iS~et~l~~l~~v~D--~~~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~~~ 489 (501) T protein:vir:27 415 ITFTPNLPKSLNEQVSILTGLG--GQVSQETALSLSGLVES--PNEELDKINKEVSEIDFKGYSNDFNEHVGKYTDEVK 489 (501) T ss_pred EEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCC--HHHHHHHHHHHHHhhhHhhhcCccccccccccCCCC Confidence 9999999999999999999874 89999999999999865 23345555444321 11111110000 1111 No 44 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=1.6e-77 Score=441.59 Aligned_cols=424 Identities=9% Similarity=0.011 Sum_probs=322.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchh----hhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 4 STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA----WRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 4 ~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~----~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) .|++ .|.+|+.+|..+++|+.++++||+|+|+|+..+...... ....++|+++||+++||++.++||+|+|+++. T Consensus 1 l~~~-~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~ 79 (451) T protein:vir:10 1 MELE-KIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLFD 79 (451) T ss_pred CCHH-HHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheecccceee Confidence 5665 466788899999999999999999999998765433222 23456789999999999999999999999987 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC--------CceEEEEEccceeEEEEeCCCCceEE Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD--------GTATITADSPETMVVSVDPLQPWRIR 151 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d--------g~~~i~~~~p~~~~~~~d~~~~~~~~ 151 (456) .+.+. ...++++.|..|+++.++.++++.++++|+||+++|.+++ |.+++.+++|++++|+||+....++. 
T Consensus 80 ~~~~~-~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~~~~ 158 (451) T protein:vir:10 80 IDNNK-ELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIERELE 158 (451) T ss_pred cCCcH-HHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCCceE Confidence 55444 3445566666799999999999999999999999999986 78899999999999999988888899 Q ss_pred EEEEEEEecCCc--------eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCC Q lcl|NC_021301. 152 SAMRWWRDLDAE--------SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPD 222 (456) Q Consensus 152 ~~~~~~~~~d~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~ 222 (456) +++++|...+.. ..+..+|+++.++.|..... ...+........+|+++++||+++ +|++ T Consensus 159 ~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~g~vPvv~~~nn~~ 227 (451) T protein:vir:10 159 AVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGV-----------SCCGSQIEHITVQHRFNSVPFVEFSNNIK 227 (451) T ss_pred EEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEeccc-----------CccccccccccccCCCCeeeEEEeccCCC Confidence 999998754421 24567889888887753211 112223344556788888888877 4788 Q ss_pred CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec------cCCCce Q lcl|NC_021301. 223 GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE------LPPGVD 296 (456) Q Consensus 223 g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~d~~ 296 (456) |.|+|+++++|||+||.++|++++..+++++|+++++|++..... ..... +.. .+.+.. 
.+++++ T Consensus 228 ~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~-----~~~~~---~~~-~~~i~~~~~~~~~~~~~~ 298 (451) T protein:vir:10 228 KQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTS-----EFLKE---LKR-YKTIKTETDSEGDSGGLK 298 (451) T ss_pred CCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccch-----hhHHH---Hhh-CCeEEecCcCCccCCcce Confidence 999999999999999999999999999999999999997653221 11111 111 111111 123344 Q ss_pred eEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021301. 297 IWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG 376 (456) Q Consensus 297 ~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 376 (456) +... +.+.+++.+.++.+...|+..+++|+..+... +|+||+||++++.+|.+||+++++.|+++|++++++++++.| T Consensus 299 ~l~~-~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~ 376 (451) T protein:vir:10 299 TMQI-EIPTEARKIILEILKKQIYESGQGLQQDTENF-GNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG 376 (451) T ss_pred EEee-cCCHHHHHHHHHHHHHHHHHHhCccccccccc-ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 4332 33567777778888888888888887666543 689999999999999999999999999999999999999999 Q ss_pred CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccC Q lcl|NC_021301. 377 ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGS 455 (456) Q Consensus 377 ~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~ 455 (456) ..+..+++++|+++.|.|.++.+++++++. |++|++|++..+|++++. ++++++++++.+.......+.-+.-+. 
T Consensus 377 ~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v~d~--~~e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 377 VTDYKKIQQTYTRNMMSNDLEDADIATKSV--GIIPTKIILRHHPWVDDV--EEAEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred CCCccceeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCCH--HHHHHHHHHHHHHHHHHHHhhcCCCCC Confidence 888889999999999999999999999985 899999999999998763 334444443333222222221111111 No 45 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=5.4e-77 Score=438.73 Aligned_cols=438 Identities=11% Similarity=0.036 Sum_probs=330.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchh----hhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA----WRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~----~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) =...+..++|.+++.+|..+.++++++++||+|+|++...+.+.... ....++|+++||+++||++.++||+|+|+ T Consensus 22 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~ 101 (478) T protein:vir:10 22 PKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVANPV 101 (478) T ss_pred hccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCe Confidence 12235577999999999999999999999999999987665433221 22346689999999999999999999999 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) ++..+ +++..+.++++++ |+++.++.+++++++++|+||+++|.|++|.+++++++|++++|+||+.....+.+++++ T Consensus 102 ~~~~~-~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~p~~~~~i~d~~~~~~~~~~v~~ 179 (478) T protein:vir:10 102 TFGVD-NDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFIRV 179 (478) T ss_pred eeecC-ChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEEEE Confidence 98754 4445667787775 889999999999999999999999999999999999999999999998877789999999 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liD 235 (456) |...+ .....+|++++++.|.............. ......+......+|.++.+||+++ +|++|+|+|+++++||| T Consensus 180 ~~~~~--~~~~~~y~~~~i~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g~sd~~~v~~liD 256 (478) T protein:vir:10 180 YELDG--AERVEYWTKDDVTYYELKEGQLIPDFYRS-DDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTIID 256 (478) T ss_pred EEecC--ceEEEEEeCCeEEEEEEcCCeeecccccc-ccccccceecccccccCCccceEEeccCCCCCCcHHHHHHHHH Confidence 87543 44678999998887754322211111111 1122223344455677777787776 67899999999999999 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec-cCCCceeEeec-ccchHHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE-LPPGVDIWESQ-TNDFTPMLSAIK 313 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~d~~~~~~~-~~~~~~~~~~l~ 313 (456) +||.++|++++.++++++|+++++|++.... +...... .. .+.+.. .+.++++..+. 
+.+.+++...++ T Consensus 257 a~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~-----~~~~~~~---~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 327 (478) T protein:vir:10 257 ALDKRLSDTQNTFDESVELIYILKGYEGEDM-----KDFMHNL---KY-YKAISVAGESGSGVDTIKVEVPIDSVKEYTK 327 (478) T ss_pred HHHHHHHHHHHHHHHhhCceeeeecCCcccc-----chhhhhh---hh-cceEEecCCCCCcceEEeecCChHHHHHHHH Confidence 9999999999999999999999999865322 1111111 11 112222 12234443332 345677777788 Q ss_pred HHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccceeEEecCCCC Q lcl|NC_021301. 314 EHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-VEDTVDVSFESPDR 392 (456) Q Consensus 314 ~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~-~~~~i~v~f~~~~~ 392 (456) .+...|+..+++|+..++..++|+||+||++++++|.+||+++++.|+++|++++++++++.|.. +..+++++|+++.| T Consensus 328 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~~~~~~~i~i~f~~~~p 407 (478) T protein:vir:10 328 MLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVKVQDIEITFNFNVM 407 (478) T ss_pred HHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEecCCCC Confidence 88888888888998888777789999999999999999999999999999999999999888753 44579999999999 Q ss_pred cCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh---------hhcccccCC Q lcl|NC_021301. 393 VTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV---------QRPQEDGSR 456 (456) Q Consensus 393 ~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~---------~~~~~d~~~ 456 (456) +|.++.|++++++ +|++|++|+++++|++++ .+.|++|+++|.+...+... 
.+++++.++ T Consensus 408 ~d~~e~a~~~~kl--~g~iS~et~~~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (478) T protein:vir:10 408 VNELENSQIAMNS--TGLLSKETILSNHAWVED--PVAEMERIEQENIELNQQLPDIEEGLNGEQQRQSENNQ 476 (478) T ss_pred CCHHHHHHHHHHH--hCCCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCCCCCCCCCC Confidence 9999999999987 589999999999999876 33456666655433222111 111112222 No 46 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=5.6e-77 Score=438.62 Aligned_cols=433 Identities=11% Similarity=0.035 Sum_probs=322.0 Q ss_pred CCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccc----hhhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTA--STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTS----AAWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~--~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~----~~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) |.. +...++|..++.+|..+++|++++++||+|+|+|...++... ....+.++|+++||+++|||+.++||+|+ T Consensus 18 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~~l~g~ 97 (472) T protein:vir:93 18 TNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGK 97 (472) T ss_pred ecCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhccccccccccccccccchHHHHHHHHhhhhccc Confidence 443 334678999999999999999999999999999876543221 12223466889999999999999999999 Q ss_pred CeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 
75 GITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 75 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) |+++..+ |++..+.++++| .|+|+..+.++++.+++||+||+++|.|++|.+++.+++|++++++||+....++.+++ T Consensus 98 ~~~~~~~-d~~~~~~l~~~~-~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~i 175 (472) T protein:vir:93 98 PIAFKHT-DDEVVKRIDEVL-GNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI 175 (472) T ss_pred CeeeccC-ChHHHHHHHHHH-hccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 9998654 455566666666 58999999999999999999999999999999999999999999999987777899999 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) ++|...+.. ...+|++..++.|........ .......+.+.. ....|.++.||||++ +|++|+|+|+++++| T Consensus 176 r~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~vPvv~~~nn~~g~s~~e~v~~l 248 (472) T protein:vir:93 176 RMYKLENET--KVEYWDKVTVNYYVYENGSLI----PDYSNNLENSKT-HFSTGSWGKIPFIPFKNNDLEISDIFMYKTL 248 (472) T ss_pred EEEEeecce--eEEEEecCeEEEEEEecCeee----eccccccccccc-ccccCCCCCcceEEecCCCCCCCchhhhHHH Confidence 998765543 457888888877753221110 011111122332 334566777777776 568999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAI 312 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l 312 (456) ||+||+++|++++..+++++|+++++|++.... +... ..+. ...++..+.++++..+. 
+.+.+++...+ T Consensus 249 iDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~-----~~~~---~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 318 (472) T protein:vir:93 249 IDAYNRRLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFK---RLLR--YYGAIKVSDNGGVDTIQVEVPVENSKKYL 318 (472) T ss_pred HHHHHHHHHHHHHHHHHhcCceeEeecCCcccc-----hhhH---HHHh--hccccccCCCCcceeEeecCCHHHHHHHH Confidence 999999999999999999999999999754321 1111 1111 12234444444443331 23455666666 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-EDTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~i~v~f~~~~ 391 (456) +.+...|+..+++|...++..++|+||+||++++.+|..||+++++.|+.+|++++++++++.|... ...++++|+++. T Consensus 319 ~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~i~v~f~~~~ 398 (472) T protein:vir:93 319 DELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKGEHKDVDISFNYNK 398 (472) T ss_pred HHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeCCCC Confidence 6666677777778877776667799999999999999999999999999999999999999888654 457999999999 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh--hcccccCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ--RPQEDGSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~--~~~~d~~~ 456 (456) |.|.++.+++++|+. |++|++|+++++|++++ .+.|++|+++|.......... ....+++. 
T Consensus 399 p~~~~~~~~~~~k~~--giis~et~l~~l~~~~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~ 461 (472) T protein:vir:93 399 VANTELQVQTAQQSM--GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADGAQ 461 (472) T ss_pred CCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhccCcCcccCCCCC Confidence 999999999999874 89999999999999865 334555555443222221111 11112221 No 47 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=6.2e-77 Score=438.39 Aligned_cols=436 Identities=11% Similarity=0.040 Sum_probs=337.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchh----hhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA----WRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~----~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) -...+..+++..++.+|..+.+|++++.+||+|+|+|...+.+.... ....++|+++||+++||++.++||+|+|+ T Consensus 22 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~ 101 (474) T protein:vir:96 22 PKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLKPDWRMFTNYHQNLVDQKVAYAVANPV 101 (474) T ss_pred hccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccccccccccchhcccchHHHHHHhhhhhhcccCc Confidence 34455678999999999999999999999999999998776543322 22456789999999999999999999999 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) +++.+ +++..+.++++++ |+++..+.++++.++++|+||+++|.|++|++++.+++|++++|+||+.....+.+++++ T Consensus 102 ~~~~~-d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~vr~ 179 (474) T protein:vir:96 102 TFSSD-DDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPAEQAIPIWTNKERDTLKAFIRY 179 (474) T ss_pred eeecC-chHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEE Confidence 98754 5567788888875 778999999999999999999999999999999999999999999998877789999999 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liD 235 (456) |...+ .....+|+++.++.|................ ....+......+|+++.+||+++ +|++|+|+|+++++||| T Consensus 180 ~~~~~--~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liD 256 (474) T protein:vir:96 180 YRLDG--AERVEYWTDSDVTYYEYQDGILIPDYYHGEE-HIQSHYYVGNKRVSWGRVPFIPFKNNPQEMSDLFMYKTIID 256 (474) T ss_pred EeecC--ceEEEEEeCCeEEEEEecCCceeeccccccc-cccccccccccccCCCceeEEEeccCCCCCCcHHHHHHHHH Confidence 87543 3457889999888876432221111111111 11222334455677777787766 67899999999999999 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccC-CCceeEeec-ccchHHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIK 313 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~ 313 (456) +||.++|++++..+++++|+++++|+++... +... . ....+.++.++ .++++..+. 
+.+.+++...++ T Consensus 257 a~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~-----~~~~---~--~~~~~~~i~~~~~~~~~~~l~~~~~~~~~~~~~~ 326 (474) T protein:vir:96 257 AMDKRLSDTQNTFDESTELIYILKGYEGQDL-----DEFM---R--NLKYYKAINVDGDGSGVDTIQIEVPVQSSKEYLD 326 (474) T ss_pred HHHHHHHHHHHHHHHhccceeeeecCCcccc-----cchh---h--hhhcCceEEecCCCCceeEEeecCChHHHHHHHH Confidence 9999999999999999999999999765321 1111 1 11123344433 445555553 446788888889 Q ss_pred HHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc-ccceeEEecCCCC Q lcl|NC_021301. 314 EHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV-EDTVDVSFESPDR 392 (456) Q Consensus 314 ~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~-~~~i~v~f~~~~~ 392 (456) .+..+|+..+++|+..++..++|+||+||++++.++.+||.++++.|+++|++++++++++.|... ...++++|+++.| T Consensus 327 ~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~~~~~~i~i~f~~~~p 406 (474) T protein:vir:96 327 MLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLNIKVQDVEITFNFNVM 406 (474) T ss_pred HHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCC Confidence 999999999999998888778899999999999999999999999999999999999998887543 4578999999999 Q ss_pred cCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh-hcccccCC Q lcl|NC_021301. 393 VTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ-RPQEDGSR 456 (456) Q Consensus 393 ~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~-~~~~d~~~ 456 (456) .|.++.++++. ++|++|++|++.++|++++ ++.|++|+++|.....+.... ..+++|.. 
T Consensus 407 ~~~~e~~~~~~---~ag~iS~et~~~~~~~v~d--~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~ 466 (474) T protein:vir:96 407 VNELEQSQIGV---QSQYLSKETVVTNHPWVDD--PVAELERIEQDNIDFNKQLPPLEGDANGRA 466 (474) T ss_pred cCHHHHHHHHH---hcCCCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhccccccccccccc Confidence 99999998754 5699999999999999866 344666666554333322221 22222222 No 48 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=1e-76 Score=437.18 Aligned_cols=436 Identities=12% Similarity=0.083 Sum_probs=315.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhh-------hhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAW-------RSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~-------~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) -......+++++++.+| +.++++++++||.|+|+|...++...... ...++|+++||+++||++.++|++| T Consensus 25 ~~~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~~~ivd~~~~yl~g 102 (503) T protein:vir:59 25 EIAEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRTSHAWHKLFVDQKTQYLVG 102 (503) T ss_pred hccchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccccccccccccceeecchHHHHHHHHHhhhhc Confidence 12233345788888887 56889999999999999987665433221 2345688999999999999999999 Q ss_pred CCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEE Q lcl|NC_021301. 
74 NGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSA 153 (456) Q Consensus 74 ~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~ 153 (456) +|++++.+ |+ ....+++.|..|+|+.++.++++.++++|+||+++|.|++|++++++++|++++|+||+....++.++ T Consensus 103 ~~~~~~~~-d~-~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~ 180 (503) T protein:vir:59 103 EPVTFTSD-NK-TLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFA 180 (503) T ss_pred CCeeeccC-cH-HHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCCCceEEE Confidence 99998654 33 34445556667999999999999999999999999999999999999999999999999888889999 Q ss_pred EEEEEecCCc---eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhH Q lcl|NC_021301. 154 MRWWRDLDAE---SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEP 229 (456) Q Consensus 154 ~~~~~~~d~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~ 229 (456) +++|...++. ..+..+|+++.++.|....... .............+.......|.++.+||+++ +|++|.|+|++ T Consensus 181 ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~sd~~~ 259 (503) T protein:vir:59 181 LRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVY-QMDYSYGENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMVSDLKF 259 (503) T ss_pred EEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcc-cccccccccccccceeecceeccCCccceEEecCCCCCCcchhh Confidence 9998765432 4567799999988876432111 11111111111222334445677777777666 67899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCc--eeE--eecccch Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGV--DIW--ESQTNDF 305 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~--~~~--~~~~~~~ 305 (456) +++|||+||+++|++++..+++++|+++++|.+..... ..... .....++..+.++ ++. +++.... 
T Consensus 260 ~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~-----~~~~~-----~~~~~~~~~~~~~~~~~l~~~~~~~~~ 329 (503) T protein:vir:59 260 YKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPK-----EFTAN-----LRYHSVIKVSGDGGVDTLRAEIPVDSA 329 (503) T ss_pred hHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccc-----hhhhh-----hhcccceeccCCCcceeEeccCCHHHH Confidence 99999999999999999999999999999997653211 11111 1122333444444 432 3333344 Q ss_pred HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC------Cc Q lcl|NC_021301. 306 TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE------SV 379 (456) Q Consensus 306 ~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~------~~ 379 (456) +++++.++..++.++ ++|...++..++|+||+||++++.++.++|+++++.|+.+|++++++++++.+. .. T Consensus 330 ~~~~~~l~~~i~~~s---~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~ 406 (503) T protein:vir:59 330 AKELERIQDELYKSA---QAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNP 406 (503) T ss_pred HHHHHHHHHHHHHHh---cccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Confidence 555555555555554 555444444456899999999999999999999999999999999998775432 12 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcc-------- Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQ-------- 451 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~-------- 451 (456) ..+++++|+++.|+|.++.++++++|+++|++|++|+++++|++++ ++.|++|+++|.+..........+ T Consensus 407 ~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 484 (503) T protein:vir:59 407 DKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQD--PEEELARIEEEMNQYAEMQGNLLDDEGGDDDL 484 (503) T ss_pred ccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhhccccCccCCCCCC Confidence 3468999999999999999999999999999999999999999865 344566666555433322111111 Q ss_pred --cccCC Q lcl|NC_021301. 
452 --EDGSR 456 (456) Q Consensus 452 --~d~~~ 456 (456) ++++. T Consensus 485 ~~~~~~~ 491 (503) T protein:vir:59 485 EEDDPNA 491 (503) T ss_pred CcCCCCC Confidence 11111 No 49 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=2.2e-76 Score=435.35 Aligned_cols=439 Identities=10% Similarity=0.034 Sum_probs=331.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcc--------cchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRN--------TSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~--------~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |.-++-.++++.++.+|.++.+|+.++++||+|+|+|...+.. .+...+..++|+++||+++||++.++||+ T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 9999999999999999999999999999999999998765432 12234456789999999999999999999 Q ss_pred cCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 73 PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 73 ~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) |+|++++.+ +++..+.++++++. 
++...+.++++.++++|+||+++|.|++|++++.+++|.+++|+||+....++.+ T Consensus 81 G~p~~~~~~-d~~~~~~l~~~~~~-~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~a 158 (470) T protein:vir:10 81 SVFPDIDVG-KDADNKKIIDVLGD-DRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNKLLG 158 (470) T ss_pred ccceeeecC-chHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEE Confidence 999998754 45567889999874 6778888999999999999999999999999999999999999999988888999 Q ss_pred EEEEEEecC--Cc--eEEEEEEcCCeEEEEEEeeeecccccce-e-----eccCCCceeecccccccCceeEEEEc-cCC Q lcl|NC_021301. 153 AMRWWRDLD--AE--SDFAIVWSGDGWQKFARPCFVQSSSRRR-L-----VTRISDSWVPVGDAVVTGSPPPVVVY-QNP 221 (456) Q Consensus 153 ~~~~~~~~d--~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-----~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~ 221 (456) ++++|...+ +. .....+|+++.++.|............. . ....... .......|+++.+||+++ +|+ T Consensus 159 ~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~vPvv~~~nn~ 237 (470) T protein:vir:10 159 ILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYET-GQSNTLKHNFGRVPFIEFSKNK 237 (470) T ss_pred EEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceecccccccccccccccccc-ccccccccCCCeeeEEEeecCC Confidence 999987543 22 3457889999988886433211111100 0 0001111 112334566777777665 688 Q ss_pred CCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceecc----CCCcee Q lcl|NC_021301. 222 DGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWEL----PPGVDI 297 (456) Q Consensus 222 ~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~d~~~ 297 (456) +|+|+|+++++|||+||.++|++++.++++++|+++++|+..... +...... . ..+.+... 
+.++++ T Consensus 238 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~-----~~~~~~~---~-~~~~i~~~~~~~~~~~~~ 308 (470) T protein:vir:10 238 YRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADL-----HQFMNDL---R-KYKSIKINNTGNGDNSGV 308 (470) T ss_pred CCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCcccc-----chhhhhh---h-hcCeEeccCCCCCcCcee Confidence 999999999999999999999999999999999999999764321 1111111 1 11222221 112333 Q ss_pred Eeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021301. 298 WESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG 376 (456) Q Consensus 298 ~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 376 (456) ..+. +.+.+++...++.|.+.|+..+++|+..+..+ +|+||+||++++.+|.+||+++++.|+++|++++++++++.+ T Consensus 309 ~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~ 387 (470) T protein:vir:10 309 DKLQIDIPVEARDDALKITRKNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLN 387 (470) T ss_pred EEEeecCChHHHHHHHHHHHHHHHHHhCCCCCCcccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 3332 34567777888888888888888887776554 789999999999999999999999999999999999988766 Q ss_pred C--CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh-----h Q lcl|NC_021301. 377 E--SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ-----R 449 (456) Q Consensus 377 ~--~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~-----~ 449 (456) . .+...++++|+++.|.|.++.|++++++ +|++|.+|+++++|++++ .+.|++|+++|.........+ . 
T Consensus 388 ~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~~~p~v~D--~~~E~eri~~E~~e~~~~~~~~~~~~~ 463 (470) T protein:vir:10 388 FSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDD--WQQELKDLAKDKEENDPYSNQADELNG 463 (470) T ss_pred ccCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhccccccCC Confidence 4 3446799999999999999999999987 489999999999999876 344666666554433332222 1 Q ss_pred cccccCC Q lcl|NC_021301. 450 PQEDGSR 456 (456) Q Consensus 450 ~~~d~~~ 456 (456) .+.|.++ T Consensus 464 ~~~dde~ 470 (470) T protein:vir:10 464 KGVNDEQ 470 (470) T ss_pred CCCCCCC Confidence 2223333 No 50 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=4.3e-76 Score=433.75 Aligned_cols=437 Identities=11% Similarity=0.050 Sum_probs=331.2 Q ss_pred CCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccch----hhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTA--STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSA----AWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~--~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~----~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) |.+ .+..++|+.|+.+|..+.+|++++.+||+|+|+|...+.+... .....++|+++||+++||++.++||+|+ T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~ivd~~~~yl~g~ 99 (478) T protein:vir:10 20 IKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKPDWRMYTNYHQNLVDQKVAYAVAN 99 (478) T ss_pred hhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhcccccccccccceeccchHHHHHHHHhhhhccc Confidence 333 3677899999999999999999999999999998766543322 1233466899999999999999999999 Q ss_pred CeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 
75 GITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 75 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) |+++..+ +++..+.++++|+ |+|+..+.++++.++++|++|+++|.|++|++++.+++|++++|+||+....++.+++ T Consensus 100 p~~~~~~-~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~~i 177 (478) T protein:vir:10 100 PVTFGVD-NDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQAFI 177 (478) T ss_pred CceeecC-ChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 9998754 4556677888875 8999999999999999999999999999999999999999999999987777789999 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) ++|...+ .....+|+++.++.|.......... ..........+......+|.++.+||+++ +|++|.|+|+++++| T Consensus 178 r~~~~~~--~~~~~~y~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~l 254 (478) T protein:vir:10 178 RVYELDG--AERVEYWTKDDVTFYELKEGQLIPD-FYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQEVSDLFMYKTI 254 (478) T ss_pred EEEeeeC--ceEEEEEeCCcEEEEEecCCeeecc-ccccccccccceecccccccCCcceEEEeccCCCCCCcHHHHHHH Confidence 8886443 3467899999988775422111111 11111122234445556788888888776 578899999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec-cC--CCceeEeecccchHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE-LP--PGVDIWESQTNDFTPMLS 310 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~d~~~~~~~~~~~~~~~~ 310 (456) ||+||+++|++++.++++++|+++++|++.... +..... +. ..+.++. .+ +++++... 
+.+.+++.+ T Consensus 255 iDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~-----~~~~~~---~~-~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~ 324 (478) T protein:vir:10 255 IDALDKRLSDTQNTFDESVELIYILKGYEGEDM-----KDFMHN---LK-YYKAISVAGESGSGVDTIKV-EVPIDSVKE 324 (478) T ss_pred HHHHHHHHHHHHHHHHHhhCcceeeecCCcccc-----cchhhh---hh-hCceeEecCCCCCcceEEee-cCCHHHHHH Confidence 999999999999999999999999999765321 111111 11 1122322 22 33444333 345677778 Q ss_pred HHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-cceeEEecC Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVE-DTVDVSFES 389 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~-~~i~v~f~~ 389 (456) .++.+...|+..+++|...++..++|+||+||++++.+|.+||.++++.|+.+|++++++++++.|...+ .+++++|++ T Consensus 325 ~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~d~~~i~i~f~~ 404 (478) T protein:vir:10 325 YTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRLDVRVQDIEITFNF 404 (478) T ss_pred HHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccceEEeCC Confidence 8888888888888999888777778999999999999999999999999999999999999988875443 579999999 Q ss_pred CCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh---------hhc-ccccCC Q lcl|NC_021301. 390 PDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV---------QRP-QEDGSR 456 (456) Q Consensus 390 ~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~---------~~~-~~d~~~ 456 (456) +.|.|.++.+++++++ +|++|++|+++.+|++++ .+.|++|+++|......... +.+ .+|++. 
T Consensus 405 ~~p~~~~e~~~~~~~~--~g~iS~et~i~~~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~~ 477 (478) T protein:vir:10 405 NVMVNELENSQIAMNS--TGLLSKETILGNHSWVQD--PVAEMERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQS 477 (478) T ss_pred CCCCCHHHHHHHHHHH--hCCCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhccccCCCCcccccccCcCCCC Confidence 9999999999999987 489999999999999866 33445555544333222111 101 111111 No 51 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=2.7e-76 Score=434.84 Aligned_cols=434 Identities=15% Similarity=0.056 Sum_probs=319.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) .-..|++++.+.+.+++..+++|++++++||+|+|++..............++|+++||+++||++.++||+|+|+++.. T Consensus 19 ~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~ 98 (506) T protein:vir:94 19 LENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVKL 98 (506) T ss_pred hhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcccCceeec Confidence 34466777777655556778899999999999999765333333333344577899999999999999999999999876 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 160 (456) + ++...+.++++|+.|+++.++.++++.++++|+||+++|.|++|.+++.+++|++++|+||+.....+.+++++|... 
T Consensus 99 ~-d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~ 177 (506) T protein:vir:94 99 P-DDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVIYSTDVDPKPIMAVRYHQIE 177 (506) T ss_pred C-cchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEecCCCCCceEEEEEEEeee Confidence 5 445678899999999999999999999999999999999999999999999999999999988777799999988643 Q ss_pred C--Cc-----eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHH Q lcl|NC_021301. 161 D--AE-----SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHID 232 (456) Q Consensus 161 d--~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~ 232 (456) + +. ..+..+|+...++.+.. ....|.......|+++.+||+++ +|++|.|+|+++++ T Consensus 178 ~~~~~~~~~~~~~~~~yt~~~~~~~~~---------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~sd~e~~~~ 242 (506) T protein:vir:94 178 LVDDNQVSTINYVPETWTADTYTLYNP---------------TPIMGKMQVDTTKPITTFPVVEFKNSNFRLGDFENVLP 242 (506) T ss_pred eccCCceeEEEEEEEEEeCceEEEecc---------------ccCccceeccccccCCccceEEecCCCCCCCchhhhHH Confidence 2 22 22344566666655531 11223334445577777777766 56889999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc----------ccccch------hhhhhhhhhhccceec------ Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV----------DENGNA------IDYASIFEAAPGALWE------ 290 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~----------~~~~~~------~~~~~~~~~~~~~~~~------ 290 (456) |||+||+++|++++..+++++|+++++|........ +..+.. ...... ...+.++. 
T Consensus 243 liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~ 320 (506) T protein:vir:94 243 LIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKE--MKDANMLLLKSGMT 320 (506) T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhh--hhhcCeeeeccccc Confidence 999999999999999999999999999965321110 000100 000000 00111111 Q ss_pred ---cCCCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 291 ---LPPGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA 366 (456) Q Consensus 291 ---~~~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 366 (456) ...+++++.+. +.+.+++...++.+...|+..+++|+..++..++|+||+||++++.++.+||.++++.|+++|++ T Consensus 321 ~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~ 400 (506) T protein:vir:94 321 VNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYA 400 (506) T ss_pred ccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12234444442 34677888888888888999999998777777789999999999999999999999999999999 Q ss_pred HHHHHHHhcCC------CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHH Q lcl|NC_021301. 367 ILVKALQIEGE------SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQIT 440 (456) Q Consensus 367 ~~~l~~~~~~~------~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~ 440 (456) ++++++++.+. .+..+++++|+++.|.|.++.|++++|+. |++|++|+++++|++++. +.|++|+++|.. T Consensus 401 ~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~lp~v~d~--~~E~~ri~~E~~ 476 (506) T protein:vir:94 401 RYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAG--ATLPQKYLYQQLPGVTNP--QDIVDMMKEQSA 476 (506) T ss_pred HHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCCH--HHHHHHHHHHHH Confidence 99998876432 23346899999999999999999999884 899999999999998763 345555555433 Q ss_pred HHhhhh--hhhcccccC-------C Q lcl|NC_021301. 
441 LFAGNS--VQRPQEDGS-------R 456 (456) Q Consensus 441 ~~~~~~--~~~~~~d~~-------~ 456 (456) ...... .....++++ + T Consensus 477 ~~~~~~~~~~~~~~~~~~~~~~~~~ 501 (506) T protein:vir:94 477 NGDYSFDQNGVISNDGQTNTTATQT 501 (506) T ss_pred HHhhcchhhcCCCcccCcccccccc Confidence 211111 001111111 1 No 52 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=5.3e-76 Score=433.28 Aligned_cols=433 Identities=15% Similarity=0.101 Sum_probs=323.9 Q ss_pred CCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHhcccCcccccCc-ccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRI-DDGMSRVRLLARYSNGDAPLPELTR-NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~-~~~~~r~~~~~~YY~g~~~i~~~~~-~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) ...+...+.++.++.+| ..+.+|++++.+||+|+|.....+. +........++|+++||+++||++.++||+|+|+++ T Consensus 26 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~ 105 (481) T protein:vir:10 26 LAELLKEENLRNFISRHQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHRAVHNYAKYVSRFIVGYLTGNPITI 105 (481) T ss_pred chhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccCccccccccccccceeecchHHHHHHHHHhhhccCCceE Confidence 11111234577777776 5778999999999999986543322 222222345678999999999999999999999988 Q ss_pred CCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 79 GGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 79 ~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) +.+ +++..+.++++|++|+|+.++.++++.++++|+||+++|.+++|.+++++++|++++++||+....++.+++++|. 
T Consensus 106 ~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~~~ 184 (481) T protein:vir:10 106 THQ-DNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPKSTFVVYDQTLDKKVVAGVRYFE 184 (481) T ss_pred ecC-ChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 654 5667888999999999999999999999999999999999999999999999999999999988788999999887 Q ss_pred ecCCc---eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHH Q lcl|NC_021301. 159 DLDAE---SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDII 234 (456) Q Consensus 159 ~~d~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~li 234 (456) ..++. ..+..+|+++.++.+.. .++.|......+|.++.+||+++ +|++|+|+|+++++|| T Consensus 185 ~~~~~~~~~~~~~~y~~~~i~~~~~---------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~li 249 (481) T protein:vir:10 185 KQDKDKVPVQHVEVYTTDKIYYIEI---------------KGGTYHRVEEVEHYYNDVPIIEYLNDQFKQGDFENVIALI 249 (481) T ss_pred EeeCCCceEEEEEEEecCeEEEEEe---------------cCCceeecccccccCCceeEEEeecCCCCCCchhhHHHHH Confidence 54432 35667899999887743 22345555566777788888776 4689999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec-cCCCceeEeec-ccchHHHHHHH Q lcl|NC_021301. 235 NRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE-LPPGVDIWESQ-TNDFTPMLSAI 312 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~d~~~~~~~-~~~~~~~~~~l 312 (456) |+||+++|++++..+++++|+++++|..... ++.+........+....+.... .+.++++..+. 
+.+.+.+.+.+ T Consensus 250 da~~~~~s~~~~~~~~~~~~~~~~~g~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 326 (481) T protein:vir:10 250 DLYDSAQSDTANYMTDLNDAMLAIIGNVDLD---SEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYK 326 (481) T ss_pred HHHHHHHHHHHHHHHHhcCceeEeecCcCCC---ccchhhhhhccceeccccccccCCCCCcceeEEeecCCHHHHHHHH Confidence 9999999999999999999999999864322 2222222211111111111111 12233333322 23456777778 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----CcccceeEEe Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-----SVEDTVDVSF 387 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-----~~~~~i~v~f 387 (456) +.+...|+..+++|+..++..++|+||+||++++.+|.+||+++++.|+.+|++++++++++.+. .+...++++| T Consensus 327 ~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~f 406 (481) T protein:vir:10 327 KRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQHNYAELTITF 406 (481) T ss_pred HHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccceeeEEe Confidence 88888888889999888887778999999999999999999999999999999999999877543 2335689999 Q ss_pred cCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh----hhcccccCC Q lcl|NC_021301. 388 ESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV----QRPQEDGSR 456 (456) Q Consensus 388 ~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~----~~~~~d~~~ 456 (456) +++.|+|.++.|++++++. |++|++|+++++|++++ .+.|++|+++|......... 
..+.++++- T Consensus 407 ~~~~~~~~~~~a~~~~kl~--g~is~et~~~~l~~i~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 475 (481) T protein:vir:10 407 TPNLPKSMMESINAFNALS--GGVSESTRLSLLDFIDN--PKEELEKMQEEEAQREKQADKRGYGEAFENHLN 475 (481) T ss_pred CCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhhhhccCCccCCCCCC Confidence 9999999999999999885 89999999999999876 33455555544332222111 111111111 No 53 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=6.9e-76 Score=432.63 Aligned_cols=432 Identities=10% Similarity=0.046 Sum_probs=327.3 Q ss_pred CCCC--CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCccc----chhhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTAS--TPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNT----SAAWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~~--t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~----~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) |.++ +..++|..|+.+|..+.+|++++.+||+|+|+|+...+.. .......++|+++||+++||++.++||+|+ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~ 100 (474) T protein:vir:94 21 LKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASK 100 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcC Confidence 2222 3447899999999999999999999999999987654321 223345577899999999999999999999 Q ss_pred CeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 
75 GITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 75 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) |+++..+ +++..+.++.+ .+|+|+..+.++++.++++|+||+++|.|++|.+++.+++|++++|+||+....++.+++ T Consensus 101 p~~~~~~-d~~~~~~l~~~-~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i 178 (474) T protein:vir:94 101 PVTYSCE-DENVLKVIHDV-LDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFI 178 (474) T ss_pred CceeccC-cHHHHHHHHHH-HhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEE Confidence 9998654 44445555554 568999999999999999999999999999999999999999999999998888899999 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) ++|...+ .....+|+++.++.|........ ........+......+|.++.|||+++ +|++|+|+|+++++| T Consensus 179 r~~~~~~--~~~~~~yt~~~~~~y~~~~~~~~-----~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~l 251 (474) T protein:vir:94 179 RYYKFNN--EEKVEFWTDTTVTYYVLENGGLI-----PDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSI 251 (474) T ss_pred EEEEecC--eEEEEEEeCCeEEEEEEcCCccc-----cccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHH Confidence 9987544 45678999999888764221110 011111222333445677778887776 568999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAI 312 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l 312 (456) ||+||+++|++++.++++++|+++++|+++... +.... . .....++..+.++++..+. 
+.+.+++.+.+ T Consensus 252 iDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~-----~~~~~---~--~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~ 321 (474) T protein:vir:94 252 IDAIDKRLSDAQNMFDESVELIYILKGYEGEDL-----EEFMR---G--LKYYKAINVDGDGGVETIQVEVPVSSTKEYI 321 (474) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcccc-----hhhhh---h--hhccceeeccCCCceeEEeecCCHHHHHHHH Confidence 999999999999999999999999999764321 11111 1 1123344445555543332 34567777777 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-cceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVE-DTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~-~~i~v~f~~~~ 391 (456) +.+...|+..+++|+..++..++|+||+||++++.++.+||.++++.|+++|++++++++++.|...+ ..++++|+++. T Consensus 322 ~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~ 401 (474) T protein:vir:94 322 DLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNR 401 (474) T ss_pred HHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCc Confidence 88888888888888877776778999999999999999999999999999999999999998886544 57999999999 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh--hcccc--------cCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ--RPQED--------GSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~--~~~~d--------~~~ 456 (456) |.|.++.|+++++ +|++|++|++.++|++++ .+.|++|+++|.......... 
...++ +++ T Consensus 402 p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:94 402 MMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNK 471 (474) T ss_pred ccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCccc Confidence 9999999998765 489999999999999876 334566666554332222111 11111 111 No 54 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=6.9e-76 Score=432.63 Aligned_cols=432 Identities=10% Similarity=0.046 Sum_probs=327.3 Q ss_pred CCCC--CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCccc----chhhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTAS--TPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNT----SAAWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~~--t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~----~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) |.++ +..++|..|+.+|..+.+|++++.+||+|+|+|+...+.. .......++|+++||+++||++.++||+|+ T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~g~ 100 (474) T protein:vir:97 21 LKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRITTNFHQNLVDQKVSYVASK 100 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCcceeecchHHHHHHHHHhhhhcC Confidence 2222 3447899999999999999999999999999987654321 223345577899999999999999999999 Q ss_pred CeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 
75 GITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 75 ~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) |+++..+ +++..+.++.+ .+|+|+..+.++++.++++|+||+++|.|++|.+++.+++|++++|+||+....++.+++ T Consensus 101 p~~~~~~-d~~~~~~l~~~-~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i 178 (474) T protein:vir:97 101 PVTYSCE-DENVLKVIHDV-LDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFI 178 (474) T ss_pred CceeccC-cHHHHHHHHHH-HhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEEE Confidence 9998654 44445555554 568999999999999999999999999999999999999999999999998888899999 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~l 233 (456) ++|...+ .....+|+++.++.|........ ........+......+|.++.|||+++ +|++|+|+|+++++| T Consensus 179 r~~~~~~--~~~~~~yt~~~~~~y~~~~~~~~-----~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~sd~e~v~~l 251 (474) T protein:vir:97 179 RYYKFNN--EEKVEFWTDTTVTYYVLENGGLI-----PDYYYGANHVQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSI 251 (474) T ss_pred EEEEecC--eEEEEEEeCCeEEEEEEcCCccc-----cccccCcCcccccccccCCCccceEEecCCcCCCCcHHHHHHH Confidence 9987544 45678999999888764221110 011111222333445677778887776 568999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAI 312 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l 312 (456) ||+||+++|++++.++++++|+++++|+++... +.... . .....++..+.++++..+. 
+.+.+++.+.+ T Consensus 252 iDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~-----~~~~~---~--~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~ 321 (474) T protein:vir:97 252 IDAIDKRLSDAQNMFDESVELIYILKGYEGEDL-----EEFMR---G--LKYYKAINVDGDGGVETIQVEVPVSSTKEYI 321 (474) T ss_pred HHHHHHHHHHHHHHHHHhcCceeeeecCCcccc-----hhhhh---h--hhccceeeccCCCceeEEeecCCHHHHHHHH Confidence 999999999999999999999999999764321 11111 1 1123344445555543332 34567777777 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-cceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVE-DTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~-~~i~v~f~~~~ 391 (456) +.+...|+..+++|+..++..++|+||+||++++.++.+||.++++.|+++|++++++++++.|...+ ..++++|+++. T Consensus 322 ~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~d~~~i~v~f~~~~ 401 (474) T protein:vir:97 322 DLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKTDVKDIEISFNFNR 401 (474) T ss_pred HHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCc Confidence 88888888888888877776778999999999999999999999999999999999999998886544 57999999999 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh--hcccc--------cCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ--RPQED--------GSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~--~~~~d--------~~~ 456 (456) |.|.++.|+++++ +|++|++|++.++|++++ .+.|++|+++|.......... 
...++ +++ T Consensus 402 p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (474) T protein:vir:97 402 MMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDDGGADGAQQQEGSNNK 471 (474) T ss_pred ccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccCCCCCCCcccCCCCccc Confidence 9999999998765 489999999999999876 334566666554332222111 11111 111 No 55 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=3.2e-75 Score=428.96 Aligned_cols=435 Identities=10% Similarity=0.011 Sum_probs=331.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchh----hhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA----WRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~----~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) ... +-.+++..|+.+|..+.+|++++.+||+|+|+|+...+..... ....++|+++||+++||++.++||+|+|+ T Consensus 23 ~~~-~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~g~p~ 101 (468) T protein:vir:96 23 QYE-TQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWRMYTNYHQNLVDQKVAYAVANPV 101 (468) T ss_pred ccc-CcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccccccchHHHHHHHHHhhhccCCc Confidence 222 2345788999999999999999999999999998765543222 22346789999999999999999999999 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) +++.+ +++..+.++++|+ |+++..+.++++.+++||++|+++|.|++|.+++.+++|++++|+||+.....+.+++++ T Consensus 102 ~~~~~-d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~ir~ 179 (468) T protein:vir:96 102 TYGTE-DEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPAEQAIPIWTNKERDELKAFIRL 179 (468) T ss_pred eeccC-ChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEEEE Confidence 98754 5556788888886 789999999999999999999999999999999999999999999998877789999998 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liD 235 (456) |...+. ....+|+++.++.|............. .......+......+|.++.+|++++ +|++|.|+|+++++||| T Consensus 180 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~sd~e~v~~liD 256 (468) T protein:vir:96 180 YELDGG--ERVEYWTANDVTFYELKDGQLIPDYYQ-GEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEVSDLFMYKTIID 256 (468) T ss_pred EEecCc--eEEEEEeCCeEEEEEEcCCceeecccc-cccccccceeeccccccCCcccEEEecCCCCCCCchHHHHHHHH Confidence 875443 456889999888876432211111111 11122233445556678888888776 56889999999999999 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec-cCCCceeEeec-ccchHHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE-LPPGVDIWESQ-TNDFTPMLSAIK 313 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~d~~~~~~~-~~~~~~~~~~l~ 313 (456) +||.++|++++..+++++|+++++|+..... +.... ... ..+.+.. .+.++++..+. 
+.+.+++...++ T Consensus 257 a~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~-----~~~~~---~~~-~~~~i~~~~d~~~~~~~l~~~~~~~~~~~~~~ 327 (468) T protein:vir:96 257 AMDKRLSDTQNTFDEATELIYVLKGYEGEDL-----EEFMY---NLK-YYKAINVDGDGSGGVDTIQIDVPVQSAKEYLD 327 (468) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCcccc-----chhhh---hhh-cCceEEecCCCCCcceEEeecCChHHHHHHHH Confidence 9999999999999999999999999765321 11111 111 1122222 22334444332 345677888888 Q ss_pred HHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccceeEEecCCCC Q lcl|NC_021301. 314 EHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-VEDTVDVSFESPDR 392 (456) Q Consensus 314 ~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~-~~~~i~v~f~~~~~ 392 (456) .+..+|+..+++|+..++..++|+||+||+++++++.+||+++++.|+++|++++++++++.|.. +...++++|++++| T Consensus 328 ~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~i~f~~~~p 407 (468) T protein:vir:96 328 MLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLSIKVQDVEITFNFNVM 407 (468) T ss_pred HHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEecCCCC Confidence 88888888899998887777789999999999999999999999999999999999999988865 44579999999999 Q ss_pred cCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhh---hhhhhcccccC Q lcl|NC_021301. 393 VTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAG---NSVQRPQEDGS 455 (456) Q Consensus 393 ~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~---~~~~~~~~d~~ 455 (456) .|.++.|++++ ++|++|++|+++++|++++ .+.|++|+++|...... 
......+.+.+ T Consensus 408 ~d~~e~a~~~~---~~g~iS~et~i~~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 408 VNELEQSQIGV---NSQYLSKETVVTNHPWVDD--PVAEMERIDQEELALPSIEEGLNGKENNEPT 468 (468) T ss_pred cCHHHHHHHHH---hcCCCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHhhccCCCCCCCCC Confidence 99999999775 4599999999999999866 34466666665443322 22222222333 No 56 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=3.1e-75 Score=429.10 Aligned_cols=431 Identities=9% Similarity=0.040 Sum_probs=329.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCccc----chhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNT----SAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~----~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) +...++ ++|+.++.+|..+.+|++++.+||+|+|+|....+.. ....+..++|+++||+++||++.++||+|+|+ T Consensus 24 ~~~~~~-~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~ 102 (474) T protein:vir:95 24 QFETQE-EMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRITTNFHQNLVDQKVSYVASKPV 102 (474) T ss_pred ccCChH-HHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccccccccccccceeccchHHHHHHHHHhhhccCCc Confidence 443334 5899999999999999999999999999987654432 22334456789999999999999999999999 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) ++..+ |++..+.+++++ +|+++..+.++++.++++|+||+++|.+++|++++.+++|.+++|+||+.....+.+++++ T Consensus 103 ~~~~~-d~~~~~~l~~~~-~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i~~ 180 (474) T protein:vir:95 103 TYSCE-DESVLKIIHDVL-DTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFIRY 180 (474) T ss_pred eeccC-chHHHHHHHHHH-hccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEE Confidence 98754 444455555554 5889999999999999999999999999999999999999999999998877789999999 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIIN 235 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liD 235 (456) |...+ .....+|++++++.|......... .......+.......|.++.|||+++ +|++|+|+|+++++||| T Consensus 181 ~~~~~--~~~~~~y~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liD 253 (474) T protein:vir:95 181 YKFNN--EEKVEFWTDTTVTYYVLENGGLIP-----DYYYGANHIQSHFSNGNWGRVPFIAFKNNPEEVSDIWMYKSLID 253 (474) T ss_pred EEEcC--eeEEEEEeCCeEEEEEEcCCcccc-----ccccCcccccccccccCCCccceEeecCCCCCCCcHHHHHHHHH Confidence 87544 346789999998887542211100 01111122223345567777787776 56899999999999999 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAIKE 314 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~ 314 (456) +||+++|++++..+++++|+++++|+...... .... ......++..++++++..+. +.+.+++...++. 
T Consensus 254 a~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-----~~~~-----~~~~~~~i~~~~~~~~~~l~~~~~~~~~~~~~~~ 323 (474) T protein:vir:95 254 AIDKRLSDAQNMFDESVELIYILKGYEGQDLE-----EFMR-----GLKYYKAINVDGDGGVETIQVEVPVSSTKEYIDL 323 (474) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCcccch-----hhhh-----hhhccceeeccCCCceeEEeecCCHHHHHHHHHH Confidence 99999999999999999999999998653211 1111 11123344455555554432 3567888889999 Q ss_pred HHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-cccceeEEecCCCCc Q lcl|NC_021301. 315 HIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES-VEDTVDVSFESPDRV 393 (456) Q Consensus 315 ~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~-~~~~i~v~f~~~~~~ 393 (456) +..+|+..+++|+..++..++|+||+||++++.++..||+++++.|+++|++++++++++.|.. +...++++|+++.|. T Consensus 324 l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~d~~~i~v~f~~~~p~ 403 (474) T protein:vir:95 324 MRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKMDVKDIEISFNFNRMM 403 (474) T ss_pred HHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccceeeEEeccCCCc Confidence 9999999999998887777789999999999999999999999999999999999999988754 456799999999999 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh----------hhcccccCC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV----------QRPQEDGSR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~----------~~~~~d~~~ 456 (456) |.++.|+++++ +|++|++|++..+|++++ .+.|++|+++|......... 
...++++.| T Consensus 404 d~~e~a~~~~~---~g~iS~et~i~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~ 471 (474) T protein:vir:95 404 NDAEQSQIIAQ---SQYLSRETLVKSSPLVDD--YKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDK 471 (474) T ss_pred CHHHHHHHHHh---cCCCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccC Confidence 99999998765 599999999999999876 33456666655432222111 111112222 No 57 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=9.3e-75 Score=426.46 Aligned_cols=424 Identities=13% Similarity=0.033 Sum_probs=323.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc---ccCcc-----------cchhhhhhhhhhccChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP---ELTRN-----------TSAAWRSFQREARTNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~---~~~~~-----------~~~~~~~~~~k~~~n~~~~iVd~ 66 (456) -...|+ ++|+.++++|...++|+.++.+||+|.++.. ..++. ......+.++|+++||+++||++ T Consensus 12 ~~~~~~-e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~ 90 (474) T protein:vir:94 12 AQGILP-KHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDT 90 (474) T ss_pred ccCCCH-HHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHh Confidence 122233 5799999999999999999999999976532 21110 01111234668999999999999 Q ss_pred HHhhhccCCeecCCCC----cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEE Q lcl|NC_021301. 67 VADRIIPNGITVGGSA----DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) Q Consensus 67 ~a~~l~~~~~~~~~~~----d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~ 142 (456) .++||+|+|++++... 
++.....++++|+.|+|+.++.++++++++||+||+++|.+++|++++.+++|++++|+| T Consensus 91 ~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~ 170 (474) T protein:vir:94 91 RVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFVG 170 (474) T ss_pred HhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEEE Confidence 9999999999986543 334456688999999999999999999999999999999999999999999999999999 Q ss_pred eCCCCceEEEEEEEEEecCCc----eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc Q lcl|NC_021301. 143 DPLQPWRIRSAMRWWRDLDAE----SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY 218 (456) Q Consensus 143 d~~~~~~~~~~~~~~~~~d~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~ 218 (456) |+.. .+.+++++|...++. ..+..+|++..++.|... ..+.+......+|+++.||||++ T Consensus 171 d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~--------------~~~~~~~~~~~~~~~g~vPvv~~ 234 (474) T protein:vir:94 171 DNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGE--------------GIDALQEVGRYEHLFDYNPLFGV 234 (474) T ss_pred cCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeec--------------CCCcccccccccCCCCccceEEe Confidence 8654 356777777654322 235668888888777532 12334455566788888887766 Q ss_pred -cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee Q lcl|NC_021301. 219 -QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI 297 (456) Q Consensus 219 -~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 297 (456) +|++|+|+|+++++|||+||.++|++++..+++++|+++++|+..... ... .. 
...+.++..+.++++ T Consensus 235 ~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~----~~~------~~-~~~~~i~~~~~~~~~ 303 (474) T protein:vir:94 235 PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEE----MIQ------ET-QKSGAFELFDKDMDV 303 (474) T ss_pred cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCch----hhh------hh-hhcceeEecCCCCce Confidence 678999999999999999999999999999999999999999754321 111 11 123556666666666 Q ss_pred Eeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021301. 298 WESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG 376 (456) Q Consensus 298 ~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 376 (456) ..+. +.+.+++.+.++.+...|+..+++|+.+++..++|+||+||++++.+|.+||.++++.|+.+|++++++++++.+ T Consensus 304 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 383 (474) T protein:vir:94 304 KYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALK 383 (474) T ss_pred eEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5553 346788888888899999999999998888777899999999999999999999999999999999999887643 Q ss_pred C-------CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh- Q lcl|NC_021301. 377 E-------SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ- 448 (456) Q Consensus 377 ~-------~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~- 448 (456) . .+..++++.|+++.|.|.++.|++++++. 
|++|++|+++++|++++ .+.|++|+++|.........+ T Consensus 384 ~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d--~~~E~eri~~E~~e~~~~~~~~ 459 (474) T protein:vir:94 384 RKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDD--VDYELDEMEKESLEFNDKLPDI 459 (474) T ss_pred hccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccc Confidence 2 12246899999999999999999999985 89999999999999876 344566665554333322211 Q ss_pred -hcccccCC Q lcl|NC_021301. 449 -RPQEDGSR 456 (456) Q Consensus 449 -~~~~d~~~ 456 (456) ..+.++.. T Consensus 460 ~~~~~~~~~ 468 (474) T protein:vir:94 460 DEGDANDKS 468 (474) T ss_pred cCCCcCCCC Confidence 11111111 No 58 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=9.3e-75 Score=426.46 Aligned_cols=424 Identities=13% Similarity=0.033 Sum_probs=323.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc---ccCcc-----------cchhhhhhhhhhccChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP---ELTRN-----------TSAAWRSFQREARTNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~---~~~~~-----------~~~~~~~~~~k~~~n~~~~iVd~ 66 (456) -...|+ ++|+.++++|...++|+.++.+||+|.++.. ..++. ......+.++|+++||+++||++ T Consensus 12 ~~~~~~-e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~ 90 (474) T protein:vir:10 12 AQGILP-KHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVDT 90 (474) T ss_pred ccCCCH-HHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHHh Confidence 122233 5799999999999999999999999976532 21110 01111234668999999999999 Q ss_pred HHhhhccCCeecCCCC----cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEE Q lcl|NC_021301. 
67 VADRIIPNGITVGGSA----DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) Q Consensus 67 ~a~~l~~~~~~~~~~~----d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~ 142 (456) .++||+|+|++++... ++.....++++|+.|+|+.++.++++++++||+||+++|.+++|++++.+++|++++|+| T Consensus 91 ~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~i~p~~~~~v~ 170 (474) T protein:vir:10 91 RVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKNIDPYNVIFVG 170 (474) T ss_pred HhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEEEcccceEEEE Confidence 9999999999986543 334456688999999999999999999999999999999999999999999999999999 Q ss_pred eCCCCceEEEEEEEEEecCCc----eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc Q lcl|NC_021301. 143 DPLQPWRIRSAMRWWRDLDAE----SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY 218 (456) Q Consensus 143 d~~~~~~~~~~~~~~~~~d~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~ 218 (456) |+.. .+.+++++|...++. ..+..+|++..++.|... ..+.+......+|+++.||||++ T Consensus 171 d~~~--~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~--------------~~~~~~~~~~~~~~~g~vPvv~~ 234 (474) T protein:vir:10 171 DNIL--EPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGE--------------GIDALQEVGRYEHLFDYNPLFGV 234 (474) T ss_pred cCCC--ceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeec--------------CCCcccccccccCCCCccceEEe Confidence 8654 356777777654322 235668888888777532 12334455566788888887766 Q ss_pred -cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee Q lcl|NC_021301. 219 -QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI 297 (456) Q Consensus 219 -~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 297 (456) +|++|+|+|+++++|||+||.++|++++..+++++|+++++|+..... ... .. 
...+.++..+.++++ T Consensus 235 ~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~~----~~~------~~-~~~~~i~~~~~~~~~ 303 (474) T protein:vir:10 235 PNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSEE----MIQ------ET-QKSGAFELFDKDMDV 303 (474) T ss_pred cCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCch----hhh------hh-hhcceeEecCCCCce Confidence 678999999999999999999999999999999999999999754321 111 11 123556666666666 Q ss_pred Eeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021301. 298 WESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG 376 (456) Q Consensus 298 ~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 376 (456) ..+. +.+.+++.+.++.+...|+..+++|+.+++..++|+||+||++++.+|.+||.++++.|+.+|++++++++++.+ T Consensus 304 ~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~ 383 (474) T protein:vir:10 304 KYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALK 383 (474) T ss_pred eEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5553 346788888888899999999999998888777899999999999999999999999999999999999887643 Q ss_pred C-------CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh- Q lcl|NC_021301. 377 E-------SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ- 448 (456) Q Consensus 377 ~-------~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~- 448 (456) . .+..++++.|+++.|.|.++.|++++++. 
|++|++|+++++|++++ .+.|++|+++|.........+ T Consensus 384 ~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~--g~iS~et~~~~l~~v~d--~~~E~eri~~E~~e~~~~~~~~ 459 (474) T protein:vir:10 384 RKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK--GQVSERTRLGQSQLVDD--VDYELDEMEKESLEFNDKLPDI 459 (474) T ss_pred hccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccc Confidence 2 12246899999999999999999999985 89999999999999876 344566665554333322211 Q ss_pred -hcccccCC Q lcl|NC_021301. 449 -RPQEDGSR 456 (456) Q Consensus 449 -~~~~d~~~ 456 (456) ..+.++.. T Consensus 460 ~~~~~~~~~ 468 (474) T protein:vir:10 460 DEGDANDKS 468 (474) T ss_pred cCCCcCCCC Confidence 11111111 No 59 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=3.6e-74 Score=423.25 Aligned_cols=439 Identities=13% Similarity=0.010 Sum_probs=325.3 Q ss_pred CC---CCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MT---ASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~---~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) +. ..|++++.+.+.+++..+.+|++++++||+|+|++...+.+..+ ...++|+++||+++||++.++||+|+|++ T Consensus 9 ~~~~~~~~~~~~~~~i~~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~--~~~~~ki~~n~~~~iv~~~~~~l~g~~~~ 86 (489) T protein:vir:99 9 IDYESKLWIDQLKNYISRFKAEQLERLKELKRYYLGDNNIKYRPAKTDK--YAADNRIASDFAKYITVFEQGYMLGVPVE 86 (489) T ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccc--cCCcceeecchHHHHHHHHhhhhccCCce Confidence 22 23556666655555567889999999999999999877654332 23456899999999999999999999999 Q ss_pred cCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEee----CCCCceEEEEEccceeEEEEeCCCCceEEEE Q lcl|NC_021301. 
78 VGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR----RDDGTATITADSPETMVVSVDPLQPWRIRSA 153 (456) Q Consensus 78 ~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~----d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~ 153 (456) ++.+ +++..+.++++|+.|+|+..+.++++.++++|+||+++|. |++|++++.+++|++++++||+.....+.++ T Consensus 87 ~~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~ 165 (489) T protein:vir:99 87 YKNE-NKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFVIYDDTYQRNSLMA 165 (489) T ss_pred eecC-ChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEEEEcCCCCCceEEE Confidence 8754 5557788999999999999999999999999999999985 5678899999999999999998877788899 Q ss_pred EEEEEecC---CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhH Q lcl|NC_021301. 154 MRWWRDLD---AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEP 229 (456) Q Consensus 154 ~~~~~~~d---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~ 229 (456) +++|...+ +...+..+|+++.+++|..... ..+.+......+|+++.|||+++ +|++|+|+|++ T Consensus 166 i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~------------~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~~~~ 233 (489) T protein:vir:99 166 VHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNL------------ETKGMRLKDYEGHFFKGVPVNEYANNEERTGAYES 233 (489) T ss_pred EEEEEEecCCCceEEEEEEEeCCcEEEEEecCC------------CcccceecccccccCCceeEEEeecCCCCCCchhh Confidence 98886543 3356788999999888754211 11122334456677888888877 46789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh--h----hhhhhhhccceeccCC-------Cce Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID--Y----ASIFEAAPGALWELPP-------GVD 296 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~--~----~~~~~~~~~~~~~~~~-------d~~ 296 (456) +++|||+||.++|++++.++++++|+++++|.........+...... . 
........+.++..++ +++ T Consensus 234 v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (489) T protein:vir:99 234 VLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVKPQ 313 (489) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeeeeccccCccccccc Confidence 99999999999999999999999999999997543221111000000 0 0000111122222211 223 Q ss_pred eEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021301. 297 IWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE 375 (456) Q Consensus 297 ~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 375 (456) +..+. ..+.+.+...++.+...|+..+++|+..+...++|+||+||+++++++.+||.++++.|+.+|++++++++.+. T Consensus 314 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~ 393 (489) T protein:vir:99 314 AYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIW 393 (489) T ss_pred eeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33332 33556777777888888888888988777666789999999999999999999999999999999999998764 Q ss_pred CC---C-----cccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_021301. 376 GE---S-----VEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSV 447 (456) Q Consensus 376 ~~---~-----~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~ 447 (456) +. . ....++++|+++.|.|.++.+++++|+. |++|++|+++++++..++..+.|++|+++|.+....... 
T Consensus 394 ~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--giis~et~~~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~ 471 (489) T protein:vir:99 394 AIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GIVSDQTIFEILNTVTGVDAEAELKRLKEEADKKQSLPE 471 (489) T ss_pred hhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhcCCCCchhHHHHHHHHHHHHHHHhcccc Confidence 42 1 1235899999999999999999999985 899999999999887655566678787776544332111 Q ss_pred h--------hcccccCC Q lcl|NC_021301. 448 Q--------RPQEDGSR 456 (456) Q Consensus 448 ~--------~~~~d~~~ 456 (456) . ..++..++ T Consensus 472 ~~~~~~~~~~~~~~~~~ 488 (489) T protein:vir:99 472 PRLVGDASGQEEPTAEK 488 (489) T ss_pred ccccCCCCCCcCCCCCC Confidence 1 11111112 No 60 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=3.8e-74 Score=423.11 Aligned_cols=439 Identities=12% Similarity=0.062 Sum_probs=323.8 Q ss_pred CCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHhcccCcccccCcccch------hhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDD-GMSRVRLLARYSNGDAPLPELTRNTSA------AWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~------~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) ...++..++.+.+.+...+ +.++++++++||+|+|+|+..+..... ...+.++|+++||+++||++.++||+| T Consensus 16 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~g 95 (479) T protein:vir:79 16 LKKESTINLVKVIEHYILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAINNYHKLLVDQKVGYSVG 95 (479) T ss_pred cccCChhHHHHHHHHHHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceeecchHHHHHHHHHhhhhc Confidence 2223334444443333332 568899999999999999876543321 122356789999999999999999999 Q ss_pred CCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEE Q lcl|NC_021301. 
74 NGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSA 153 (456) Q Consensus 74 ~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~ 153 (456) +|++++.+. +....+++.|..|+|+..+.++++.++++|++|+++|.+++|.+++++++|++++|+||+....++.++ T Consensus 96 ~p~~~~~~~--~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~ 173 (479) T protein:vir:79 96 NPIVFNADD--DNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEEAIPIWDSKRQRELVAF 173 (479) T ss_pred CCceeccCC--HHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccceeEEEEeCCCCCceEEE Confidence 999987543 345667778888999999999999999999999999999999999999999999999998877789999 Q ss_pred EEEEEecC---CceEEEEEEcCCeEEEEEEeeeecccc----cceeeccCCCceeecccccccCceeEEEEc-cCCCCCC Q lcl|NC_021301. 154 MRWWRDLD---AESDFAIVWSGDGWQKFARPCFVQSSS----RRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMG 225 (456) Q Consensus 154 ~~~~~~~d---~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s 225 (456) +++|...+ +...+..+|+++.++.|.......... ...........+......+|+++.+||+++ +|++|.| T Consensus 174 ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~s 253 (479) T protein:vir:79 174 IRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCVS 253 (479) T ss_pred EEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcccEEEecCCCCCCc Confidence 99987543 234567899999988875432111000 000111112223344556778888887776 5789999 Q ss_pred cHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccc Q lcl|NC_021301. 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TND 304 (456) Q Consensus 226 ~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~ 304 (456) +|+++++|||+||.++|++++..+++++|+++++|.+.... ++. .. ... .+.++..++++++..+. 
+.+ T Consensus 254 d~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~--~~~---~~---~~~--~~~~i~~~~~~~~~~l~~~~~ 323 (479) T protein:vir:79 254 DLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSL--QEF---ID---NIR--YYKSIKVDGGGGVDKLEINIP 323 (479) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcccc--ccc---hh---hhh--hccceecCCCCcceEEeccCC Confidence 99999999999999999999999999999999999754321 111 11 111 22333444444443332 345 Q ss_pred hHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-----Cc Q lcl|NC_021301. 305 FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE-----SV 379 (456) Q Consensus 305 ~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~-----~~ 379 (456) .+.+...++.+...|+..+++|+..++. ++|+||+||++++.++.++|.++++.|+.+|++++++++++.+. .+ T Consensus 324 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 402 (479) T protein:vir:79 324 VEAKKELLDRLEKNIIIFGQGVNPESQN-TGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYD 402 (479) T ss_pred HHHHHHHHHHHHHHHHHHhCcccccccc-ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc Confidence 6778888888888888888999887765 47899999999999999999999999999999999998876442 23 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ...++++|+++.|.|.++.|++++++. 
|++|.+|+++++|++++ .+.|++|+++|.+...+.....++.+..- T Consensus 403 ~~~i~i~f~~~~p~~~~~~a~~~~kl~--g~iS~et~l~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~ 475 (479) T protein:vir:79 403 YKTVQITFNHSMIINEAEKIDMAAKST--GIVSDETIVSNHPWVED--VNDELERLKKQEDTQKEYDDLIPNNQDGV 475 (479) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhccCcccCCC Confidence 457899999999999999999999874 89999999999999876 34456666666544333333222221111 No 61 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=1.7e-72 Score=414.10 Aligned_cols=443 Identities=9% Similarity=-0.012 Sum_probs=297.1 Q ss_pred CCCCCHHHHHHHHHHHH--HHHHHHHHHHHHHhcccCcccccCcccch-------hhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRI--DDGMSRVRLLARYSNGDAPLPELTRNTSA-------AWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~--~~~~~r~~~~~~YY~g~~~i~~~~~~~~~-------~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) |..+...+++...+..| .++++++.++++||+|+|+|...+..... .....++|+++||+++||++.++|| T Consensus 8 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki~~nf~k~Ivd~~~~yl 87 (537) T protein:vir:78 8 KPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKISHGFFTELVDQLAQYL 87 (537) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhcccccccccccccccccccccccccchHHHHHHHHhhhh Confidence 44444555665544443 46789999999999999999866543211 1123567899999999999999999 Q ss_pred ccCCeecCCCCc--ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCce Q lcl|NC_021301. 72 IPNGITVGGSAD--SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWR 149 (456) Q Consensus 72 ~~~~~~~~~~~d--~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~ 149 (456) +|+|++++...+ .+....++++ ..|+++.++.++++.+++||+||+++|.+++|.+++..++|+++||+||+.. 
+ T Consensus 88 ~G~Pv~~~~~d~~~~e~~~~l~~~-~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~~~pv~d~~~--~ 164 (537) T protein:vir:78 88 LSNGVEVKVKDEDNTQLDEILQEY-FDEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLTLIPVFDDYG--V 164 (537) T ss_pred cccCceeecCcchhHHHHHHHHHH-hhccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEccceeEEEEcCCC--C Confidence 999999875432 2233334444 4589999999999999999999999999999999999999999999999764 4 Q ss_pred EEEEEEEEEec-------C-CceEEEEEEcCCeEEEEEEeeeeccccccee-----------------eccCCCceeecc Q lcl|NC_021301. 150 IRSAMRWWRDL-------D-AESDFAIVWSGDGWQKFARPCFVQSSSRRRL-----------------VTRISDSWVPVG 204 (456) Q Consensus 150 ~~~~~~~~~~~-------d-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~ 204 (456) +.+++++|... + ....+..+|+++.++.|.............. ............ T Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 244 (537) T protein:vir:78 165 LKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQ 244 (537) T ss_pred ceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccccccccccccccc Confidence 55666665431 1 2345678999999998864322111100000 000001111223 Q ss_pred cccccCceeEEEEc-cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhh Q lcl|NC_021301. 205 DAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEA 283 (456) Q Consensus 205 ~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~ 283 (456) ..+|+++.+||+.| +|++|+|+|+++++|||+||.++|++++..+++++|+++++|++... .+.... .+. 
T Consensus 245 ~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~-----~~~~~~---~l~- 315 (537) T protein:vir:78 245 VLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDS-----TDKLRQ---NIK- 315 (537) T ss_pred ccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCcc-----chhHHH---HHh- Confidence 34466666666654 68899999999999999999999999999999999999999975431 111111 111 Q ss_pred hccceeccCCCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 284 APGALWELPPGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKI 362 (456) Q Consensus 284 ~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~ 362 (456) ..+.+...+.++++..+. +.+.++....+++|.+.|+..+.+|+.... .++|+||+||++++++|.+||..+++.|++ T Consensus 316 ~~~~i~v~~d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~-~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~ 394 (537) T protein:vir:78 316 AKKMIGVNGDNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAV-GDGNVTNVVIKSRYTLLAMKARKMETSLRK 394 (537) T ss_pred hcCceeecCCCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccc-cccCCcHHHHHHHHhhHHHHHHHHHHHHHH Confidence 123333332333332221 223344444444444444444555543332 356899999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCC-----CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhH-HHHHHHHHHH Q lcl|NC_021301. 363 GLEAILVKALQIEGE-----SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQ-IKQDDLDRAR 436 (456) Q Consensus 363 ~l~~~~~l~~~~~~~-----~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~-~~~~e~~~~~ 436 (456) +|++++++++.+.+. .+...++++|++++|.|.++.|++++++.++|++|++|+++++|++++. .++++.++.. 
T Consensus 395 ~l~~~~~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~ 474 (537) T protein:vir:78 395 VLRWCADMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETLKLIAEELD 474 (537) T ss_pred HHHHHHHHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHHH Confidence 999999998765432 2445789999999999999999999999999999999999999998763 1111111111 Q ss_pred HHHHHHhhhhhhhccccc-CC Q lcl|NC_021301. 437 EQITLFAGNSVQRPQEDG-SR 456 (456) Q Consensus 437 ee~~~~~~~~~~~~~~d~-~~ 456 (456) +..+.......++..+.+ .. T Consensus 475 ~~~~~~~~~~~~~~~~~~~~~ 495 (537) T protein:vir:78 475 LDYNELKDALAEQDAQSLDVS 495 (537) T ss_pred hhhhhhhhhhhhhcccccCcC Confidence 111111111111110000 00 No 62 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=3.9e-56 Score=324.37 Aligned_cols=444 Identities=10% Similarity=0.059 Sum_probs=288.1 Q ss_pred CCCCCHHHHHHHH-HHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVL-TKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l-~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) -...+..+++... +..+..+..|+.+.++||+|+|++...............+++++|||+.||+..++||+++|++++ T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~i~~~~a~~l~~~p~~i~ 96 (496) T protein:vir:38 17 GLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKVTAKYMSKLLFNEKVKIN 96 (496) T ss_pred ccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHHHHHHHhhhhhCCcceEe Confidence 0112222222211 111345567888999999999987654332222222234568899999999999999999998876 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 
80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) .+ ++..++.++++++.|+|...+.+++..++++|.+|+++|.|++|.+++.+++|.+++|++++..+....+++..+. T Consensus 97 ~~-d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~~~~~~~~~~f~~~~~- 174 (496) T protein:vir:38 97 ID-DKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECVIANSFH- 174 (496) T ss_pred eC-ChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccceEEEEecCCcEEEEEEEEEEE- Confidence 54 4567788899999999999999999999999999999999999999999999999999988765433344444333 Q ss_pred cCCceEE-EEEEc-CCeEEEEEEeeeeccc---cccee-eccCCCceeecccccccCceeEEEEccC----------CCC Q lcl|NC_021301. 160 LDAESDF-AIVWS-GDGWQKFARPCFVQSS---SRRRL-VTRISDSWVPVGDAVVTGSPPPVVVYQN----------PDG 223 (456) Q Consensus 160 ~d~~~~~-~~~~~-~~~~~~~~~~~~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~pvv~~~n----------~~g 223 (456) .+++... ...|. .+..+......+.... .+... ........... ...+.+..||++++.| +.| T Consensus 175 ~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~-~~~~~~~~~~f~~~~~~~~N~~~~~~p~G 253 (496) T protein:vir:38 175 KNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPV-VPLPDFTRPTFIYIKPNIANNKNLTSPLG 253 (496) T ss_pred eCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccc-eeecCCCcceEEEecCCcccccccCCcCC Confidence 3332211 11121 1111111111111110 01000 00000111111 1123456788887643 469 Q ss_pred CCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhh----hhcCCCcccccccccchhhhhhhhhhhccceeccCC--Ccee Q lcl|NC_021301. 224 MGEVEPHIDIINRINRAELQLLSTMAIQAFRQRA----LKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPP--GVDI 297 (456) Q Consensus 224 ~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~----i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--d~~~ 297 (456) .|+|+.++++||+||.++|++.+..+....++.+ +... .+..|...............+..... 
...+ T Consensus 254 ~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~------~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i 327 (496) T protein:vir:38 254 ISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTA------VNLDGSTTQYFDSTDEAFFLYQGDQDDNGKAI 327 (496) T ss_pred CchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhcc------CCCCCccccCCCCccceEEEeecCCCcccccc Confidence 9999999999999999999999887653222221 1111 11111111100000000011111111 1234 Q ss_pred Eeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh- Q lcl|NC_021301. 298 WESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI- 374 (456) Q Consensus 298 ~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~- 374 (456) .+++. ...+++++.++.+++.|+..+|+|+..||... +++||.+++++++.+.+++..+++.|+.+|+++++.++.+ T Consensus 328 ~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~ 407 (496) T protein:vir:38 328 KDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVG 407 (496) T ss_pred eeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44543 35689999999999999999999999998654 4468999999999999999999999999999998887643 Q ss_pred ------cC-CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhh-hh Q lcl|NC_021301. 375 ------EG-ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAG-NS 446 (456) Q Consensus 375 ------~~-~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~-~~ 446 (456) .| ......+++.|+++.|.|..++++++.+++++|++|.+|++..+++..++.++.|++|+++|...... .. 
T Consensus 408 ~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d~ea~~el~ri~~E~~~~~~~~d 487 (496) T protein:vir:38 408 KFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITEAEADEWAEMLAKEKQAEMPNND 487 (496) T ss_pred HHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhhccCcccc Confidence 22 23445789999999999999999999999999999999999887655555566678888877653321 11 Q ss_pred --hhhcccc Q lcl|NC_021301. 447 --VQRPQED 453 (456) Q Consensus 447 --~~~~~~d 453 (456) ....+++ T Consensus 488 ~~~~~~~~e 496 (496) T protein:vir:38 488 MNGIFGEEE 496 (496) T ss_pred ccCCCCCCC Confidence 1111222 No 63 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=2.4e-53 Score=309.09 Aligned_cols=450 Identities=10% Similarity=0.072 Sum_probs=288.8 Q ss_pred CCCCCHHHHHHHHHHH------------------HHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKR------------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLM 62 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~------------------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ 62 (456) |... --++++.++++ +.....++.+.++||+|+|+...............++++++|+++. T Consensus 1 m~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~ 79 (499) T protein:vir:80 1 MINQ-IIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKV 79 (499) T ss_pred ChhH-HHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHH Confidence 2211 11223332222 3445577888999999999765432222122223356788999999 Q ss_pred HHHHHHhhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEE Q lcl|NC_021301. 
63 VRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) Q Consensus 63 iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~ 142 (456) ||+++|+||+++|++++.+ ++..++.+++++++|+|...+.+++..|+.+|.+|+.+|.|++|++++..++|.+++|+| T Consensus 80 iv~~~a~~l~~ep~~i~~~-d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~ 158 (499) T protein:vir:80 80 TAKYMSKLLFNEKVKINID-DETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLS 158 (499) T ss_pred HHHHHHHhhhCCcceEeeC-CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEE Confidence 9999999999999887654 456778899999999999999999999999999999999999999999999999999987 Q ss_pred eCCCCceEEEEEEEEEecCCceEEEE--EEcCCeEEEEEEee--eeccc---cccee-eccCCCceeecccccccCceeE Q lcl|NC_021301. 143 DPLQPWRIRSAMRWWRDLDAESDFAI--VWSGDGWQKFARPC--FVQSS---SRRRL-VTRISDSWVPVGDAVVTGSPPP 214 (456) Q Consensus 143 d~~~~~~~~~~~~~~~~~d~~~~~~~--~~~~~~~~~~~~~~--~~~~~---~~~~~-~~~~~~~~~~~~~~~~~~~~~p 214 (456) .+..+...++++..+...+....... .|.......|+... +.... .+... ........ ......+.+..|| T Consensus 159 ~d~~~~~~~~f~~~~~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~-~~~~~~~~~~~p~ 237 (499) T protein:vir:80 159 NDSENVDECLIANSFHKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDI-EPVVPLPSLTRPT 237 (499) T ss_pred ecCCCeEEEEEEEEEeecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCc-CCceeecCCCccc Confidence 76644444444444433221111111 12222222232221 11111 11100 00000000 1111123467788 Q ss_pred EEEccC----------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhh Q lcl|NC_021301. 215 VVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAA 284 (456) Q Consensus 215 vv~~~n----------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~ 284 (456) ++++.| |.|.|+|+.++++||+||+++|++.+..+....++.+-..+-. ...+..|............ 
T Consensus 238 f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~--~~~~~~g~~~~~~~~~~~~ 315 (499) T protein:vir:80 238 FIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVK--TAVNLDGSTTQYFDSTDEA 315 (499) T ss_pred eEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhh--ccCCCCCCcccCCCcccce Confidence 888744 4589999999999999999999999887764433332111000 0011112111100100111 Q ss_pred ccceeccCCC--ceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 285 PGALWELPPG--VDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) Q Consensus 285 ~~~~~~~~~d--~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f 360 (456) ...+...+.+ ..+..++. -..+++.+.++.+++.|...+|+++..||... ++.||.+++++++.+.+++..+++.| T Consensus 316 ~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~~~~ 395 (499) T protein:vir:80 316 FFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLI 395 (499) T ss_pred eeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 1111112222 23444543 35688999999999999999999999998654 34689999999999999999999999 Q ss_pred HHHHHHHHHHHHHh-------cC-CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHH Q lcl|NC_021301. 
361 KIGLEAILVKALQI-------EG-ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDL 432 (456) Q Consensus 361 ~~~l~~~~~l~~~~-------~~-~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~ 432 (456) ..+|+++++.++.+ .+ ......+++.|++..+.|..++++.+.+++++|++|.+|++..+.+.+++.++.++ T Consensus 396 ~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~ea~~el 475 (499) T protein:vir:80 396 EQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEAEADEWA 475 (499) T ss_pred HHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChHHHHHHH Confidence 99999999887643 22 22446799999999999999999999999999999999998877554444456677 Q ss_pred HHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 433 DRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 433 ~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +++++|....... .+...-.|.- T Consensus 476 ~~i~~E~~~~~~~-~d~~g~~ge~ 498 (499) T protein:vir:80 476 EMLAKEKQAEIPN-NDMTGIFGEE 498 (499) T ss_pred HHHHHHhhcCCCC-CCccccCCCC Confidence 7777665432110 0001111111 No 64 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=1.5e-45 Score=266.30 Aligned_cols=442 Identities=12% Similarity=0.064 Sum_probs=283.6 Q ss_pred CCCCC-HHHHHHHHHHHH------------------HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHH Q lcl|NC_021301. 1 MTAST-PAEWLPVLTKRI------------------DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGL 61 (456) Q Consensus 1 ~~~~t-~~~~~~~l~~~~------------------~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~ 61 (456) |.--+ -..++.++..+. .....++++.++||+|+++..+... 
.......++++++|+++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~--~~~~~~~~~~~slnl~~ 78 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKN--SYGDTQKHELQSVNVTK 78 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccc--cCCCccccceeecchHH Confidence 22111 112222211110 2345667778899999988543211 11122234467789999 Q ss_pred HHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEE Q lcl|NC_021301. 62 MVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVS 141 (456) Q Consensus 62 ~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~ 141 (456) .||+.+|++++++|.+++.+ ++..++.|++++++|+|.....+++..++..|..++.+|.| .|.++|..++|.+++|+ T Consensus 79 ~i~~~~A~ll~~e~~~i~~~-d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-~~~~~i~~v~ad~~~P~ 156 (505) T protein:vir:79 79 LASAKLASLIFNEQCQVTVS-DETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-SGKIKLAWATADQVYPL 156 (505) T ss_pred HHHHHHHhhhcCCCceeecC-ChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-CCceEEEEEcCCeeEEE Confidence 99999999999999777654 45678889999999999999999999999999999999998 57899999999999998 Q ss_pred EeCCCCceEEEEEEEEEecCC-ceEEEE---EEcC-CeEEEEEEeeeec---ccccceeeccCCCceee--cccccccCc Q lcl|NC_021301. 142 VDPLQPWRIRSAMRWWRDLDA-ESDFAI---VWSG-DGWQKFARPCFVQ---SSSRRRLVTRISDSWVP--VGDAVVTGS 211 (456) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~d~-~~~~~~---~~~~-~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~--~~~~~~~~~ 211 (456) +.+..+....+++..|+..++ ...+++ .|+. ++.+.-....+.. ...+.......-..|.. .......+. 
T Consensus 157 ~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~ 236 (505) T protein:vir:79 157 QADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLK 236 (505) T ss_pred EEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCC Confidence 655555555555555544332 222222 1221 1111111111111 11111100000000110 111113456 Q ss_pred eeEEEEccC----------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhh----hhcCCCcccccccccchhhh Q lcl|NC_021301. 212 PPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRA----LKSAGHGLPKVDENGNAIDY 277 (456) Q Consensus 212 ~~pvv~~~n----------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~----i~g~~~~~~~~~~~~~~~~~ 277 (456) +|+++++.| |.|.|+|+.++++||++|.++|++.+..+....++.+ +.-...........+.+ T Consensus 237 ~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~--- 313 (505) T protein:vir:79 237 HPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETHPP--- 313 (505) T ss_pred cceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCccccccccc--- Confidence 777777633 4699999999999999999999999988764333222 21111100000000110 Q ss_pred hhhhhhh---ccceeccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhccccc-CcHHHHHHHHHHHHHHH Q lcl|NC_021301. 278 ASIFEAA---PGALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFK 352 (456) Q Consensus 278 ~~~~~~~---~~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k 352 (456) .+... ...+...+.+..+..++.. ..+.+.+.++.++++|...+|+++..||.+.. 
..||.+++...+.+.++ T Consensus 314 --~fd~~~~~y~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t 391 (505) T protein:vir:79 314 --MFDPDETVYQAMYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQT 391 (505) T ss_pred --CCCccceeeeeccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHH Confidence 01111 0111122233445555543 46889999999999999999999999986543 35899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc--------------CCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHH Q lcl|NC_021301. 353 CEDRLSIAKIGLEAILVKALQIE--------------GESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRN 418 (456) Q Consensus 353 ~~~~~~~f~~~l~~~~~l~~~~~--------------~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~ 418 (456) +..+++.|..+|+++++.++.+. +....+.++|.|.+.++.|..++++...+++++|++|.++++. T Consensus 392 ~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~ 471 (505) T protein:vir:79 392 RSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFLM 471 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHH Confidence 99999999999999999887542 1233467899999999999999999999999999999999987 Q ss_pred hC-CCChhHHHHHHHHHHHHHHHHHhhhhhhhccc Q lcl|NC_021301. 
419 IL-NYNADQIKQDDLDRAREQITLFAGNSVQRPQE 452 (456) Q Consensus 419 ~~-~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~ 452 (456) .+ |+++ +.++.|++|+++|.........+--.+ T Consensus 472 ~~~~~~e-eea~~el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 472 RNYGLDE-EEADEWLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred hcCCCCh-HHHHHHHHHHHHhccccCCCchhccCC Confidence 76 5554 446667888888754322222222222 No 65 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=5.2e-46 Score=268.84 Aligned_cols=442 Identities=9% Similarity=0.074 Sum_probs=283.6 Q ss_pred CCCCC-HHHHHHHHHHH------------------HHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHH Q lcl|NC_021301. 1 MTAST-PAEWLPVLTKR------------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGL 61 (456) Q Consensus 1 ~~~~t-~~~~~~~l~~~------------------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~ 61 (456) |.-=+ -.+++.+.+.+ =....+|+++.++||+|+++..+... . +..+....++++|+++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~-~-~~~~~~~~~~sln~~~ 78 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQA-S-DGIKKKRLKNTINMAK 78 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCccccccc-C-CCCccccceeecchHH Confidence 21100 11111111111 12355788899999999998653221 1 1122223357789999 Q ss_pred HHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEE Q lcl|NC_021301. 62 MVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVS 141 (456) Q Consensus 62 ~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~ 141 (456) .||+..|+++++++.+++.+.+...+..|++++++|+|.....+++..++.+|.+|+.+|.|. 
|.++|..++|.+++|+ T Consensus 79 ~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~i~~v~ad~~~P~ 157 (508) T protein:vir:15 79 TAARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG-NHIKIAWVRADQFYPL 157 (508) T ss_pred HHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC-CeeEEEEEcCCeeEEE Confidence 999999999999997776555555566789999999999999999999999999999999984 5799999999999997 Q ss_pred EeCCCCceEEEEEEEEEec-CCceEEEEE---Ec--CCeEEEEEEeeeeccc---ccceeeccCCCceee--cccccccC Q lcl|NC_021301. 142 VDPLQPWRIRSAMRWWRDL-DAESDFAIV---WS--GDGWQKFARPCFVQSS---SRRRLVTRISDSWVP--VGDAVVTG 210 (456) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~-d~~~~~~~~---~~--~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~--~~~~~~~~ 210 (456) ..+..+..-.+++..+... +.+..++++ |+ .++.+......+.... .+.......-..|.. .....+.. T Consensus 158 ~~d~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~ 237 (508) T protein:vir:15 158 QSNTNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGL 237 (508) T ss_pred EEcCCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCC Confidence 5444443334444444322 223333332 11 2112222211222111 111110000000110 01111345 Q ss_pred ceeEEEEccC----------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhh Q lcl|NC_021301. 211 SPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASI 280 (456) Q Consensus 211 ~~~pvv~~~n----------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~ 280 (456) ..||++++.| +.|.|+|+.+++++|++|.+.|++++..+....++.+-..+ ...++.+.+. 
T Consensus 238 ~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~----l~~d~~~~~~----- 308 (508) T protein:vir:15 238 QRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGM----LRFDDEHKPT----- 308 (508) T ss_pred CcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHH----hcCCCCCccc----- Confidence 5688877643 46999999999999999999999998886433333331111 1112222221 Q ss_pred hhhhccce--ec--cCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 281 FEAAPGAL--WE--LPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCE 354 (456) Q Consensus 281 ~~~~~~~~--~~--~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~ 354 (456) +....... +. .+.+..+.+++.. ..+.|.+.++.+++.|...+|+++..||....+ .||.+++...+.+.+++. T Consensus 309 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~ 388 (508) T protein:vir:15 309 FDTEQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRS 388 (508) T ss_pred cCCCCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHH Confidence 11111100 11 1222345566543 568899999999999999999999999865433 589999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcC----------------CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHH Q lcl|NC_021301. 355 DRLSIAKIGLEAILVKALQIEG----------------ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRN 418 (456) Q Consensus 355 ~~~~~f~~~l~~~~~l~~~~~~----------------~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~ 418 (456) .+++.|+.+|++++++++.+.. ...+..++|.|.+.+++|..++++...+++++|++|.++++. 
T Consensus 389 ~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~ 468 (508) T protein:vir:15 389 SYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQ 468 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHH Confidence 9999999999999988765421 123457899999999999999999999999999999999976 Q ss_pred hC-CCChhHHHHHHHHHHHHHHHHHhhhhhh---hcccccC Q lcl|NC_021301. 419 IL-NYNADQIKQDDLDRAREQITLFAGNSVQ---RPQEDGS 455 (456) Q Consensus 419 ~~-~~~~~~~~~~e~~~~~ee~~~~~~~~~~---~~~~d~~ 455 (456) .+ |+++++ ++.+++|+++|.......... ....||+ T Consensus 469 ~~~g~~dee-a~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 469 RNYGMTDEQ-AAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred hcCCCChHH-HHHHHHHHHHhccccCccccccccCCCCCCC Confidence 65 666654 566777888775433322222 2223444 No 66 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=1.1e-46 Score=272.51 Aligned_cols=443 Identities=13% Similarity=0.082 Sum_probs=298.7 Q ss_pred CCCCC-------H-----HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTAST-------P-----AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t-------~-----~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a 68 (456) |--+- | ..+=+.+-..-..|+.+|+.|.+||.|.+.-......-.+. ...+.+.++..++||+... 
T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~--~~~r~~~~ps~~~~~~~~~ 78 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDE--GDQRPIYVPNGEKLIEAKM 78 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccc--cccceeeehhhHHhhCCcc Confidence 10000 0 00000122233567889999999999987544332222111 1233577888889998877 Q ss_pred hhhccCCeecCC-CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCe-EEEEEeeCCC---CceEEEEEccceeEEEEe Q lcl|NC_021301. 69 DRIIPNGITVGG-SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGE-SYLTCWRRDD---GTATITADSPETMVVSVD 143 (456) Q Consensus 69 ~~l~~~~~~~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~-a~~~v~~d~d---g~~~i~~~~p~~~~~~~d 143 (456) .|+ +.|..+.. ..+++....+..+++.|++..++.+..+++.+.|+ +|.++|-.++ +++++..+||.+.|++.| T Consensus 79 ~~~-~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed 157 (527) T protein:vir:10 79 RFL-GQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYED 157 (527) T ss_pred eee-ccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeec Confidence 665 55555432 23556667778888899999999999999999999 5777775433 579999999999999999 Q ss_pred CCCCceEEEE--EEEEEecCCceEEEE-------EE--------cCCeEEEEEEeeeec--ccccc-----eeeccCCCc Q lcl|NC_021301. 144 PLQPWRIRSA--MRWWRDLDAESDFAI-------VW--------SGDGWQKFARPCFVQ--SSSRR-----RLVTRISDS 199 (456) Q Consensus 144 ~~~~~~~~~~--~~~~~~~d~~~~~~~-------~~--------~~~~~~~~~~~~~~~--~~~~~-----~~~~~~~~~ 199 (456) |.....+..+ +..|..++....-.. .| ...+-+.|....|.. ++... ......-.. 
T Consensus 158 ~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~ 237 (527) T protein:vir:10 158 PRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLST 237 (527) T ss_pred CCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcC Confidence 9877766655 333554443321100 01 111122222222211 11000 000000011 Q ss_pred eeecccccccCceeEEEEccC------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc Q lcl|NC_021301. 200 WVPVGDAVVTGSPPPVVVYQN------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN 273 (456) Q Consensus 200 ~~~~~~~~~~~~~~pvv~~~n------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~ 273 (456) .......++.++++|||+++| .+|+|+++++++++|++|+++|+.+.++++.+.|+.++.|+.+. +..|+ T Consensus 238 ~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~v----d~~G~ 313 (527) T protein:vir:10 238 LTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPR----DSRGN 313 (527) T ss_pred ceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccc----cccCC Confidence 233445677889999998877 38999999999999999999999999999999999999988643 22233 Q ss_pred hhhhhhhhhhhccceeccCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhc--ccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 274 AIDYASIFEAAPGALWELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLM--PDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~--~~~~N~Sg~Al~~~~~~l~ 350 (456) . ..+..++|.+|.++.++++..++. .++++|.++++.+...|+.++++|...|| ..+.++||.||+..+++|. 
T Consensus 314 ~----~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLl 389 (527) T protein:vir:10 314 M----VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAIL 389 (527) T ss_pred c----CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHH Confidence 2 346678899999999999998876 57899999999999999999999999999 3455689999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHH-H----HHHhc-----CCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILV-K----ALQIE-----GESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNIL 420 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~-l----~~~~~-----~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~ 420 (456) +++++++..++-..+|..+ + +-+.. +......++++|.+++|+|.++.++.+++|+++|++|++||+++| T Consensus 390 ar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L 469 (527) T protein:vir:10 390 SSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEEL 469 (527) T ss_pred HHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHH Confidence 9999999888888776432 1 11112 222335789999999999999999999999999999999998887 Q ss_pred ---CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 421 ---NYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 421 ---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) |+..+ .+.|.+++.++...++-+.+.+...=|.. 
T Consensus 470 ~~~~g~eD--~E~E~~~I~~era~~a~a~a~A~~~~~a~ 506 (527) T protein:vir:10 470 SKIMGFEL--TEEDFKQATEDKKTQGIAQAEAADPFGAQ 506 (527) T ss_pred HhccCCCC--hHHHHHHHHHHHHHHhHHhhhhcCchhhh Confidence 44322 23334444443333333222222222222 No 67 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=1.4e-46 Score=272.02 Aligned_cols=443 Identities=14% Similarity=0.083 Sum_probs=298.5 Q ss_pred CCCCC-------H-----HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTAST-------P-----AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t-------~-----~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a 68 (456) |--+- | ..+=+.+-..-..|+.+|+.|.+||.|.+.-......-.+. ...+.+.++..++||+... T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~--~~~r~~~~ps~~~~~~~~~ 78 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDE--GDQRPIYVPNGEKLIEAKM 78 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccc--cccceeeehhhHHhhCCcc Confidence 10000 0 00000122233567889999999999987544332222111 1233577888889998877 Q ss_pred hhhccCCeecCC-CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCe-EEEEEeeCCC---CceEEEEEccceeEEEEe Q lcl|NC_021301. 69 DRIIPNGITVGG-SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGE-SYLTCWRRDD---GTATITADSPETMVVSVD 143 (456) Q Consensus 69 ~~l~~~~~~~~~-~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~-a~~~v~~d~d---g~~~i~~~~p~~~~~~~d 143 (456) .|+ +.|..+.. 
..+++....+..+++.|++..++.+..+++.+.|+ +|.++|-.++ +++++..+||.+.|++.| T Consensus 79 ~~~-~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed 157 (527) T protein:vir:10 79 RFL-GQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYED 157 (527) T ss_pred eee-ccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeec Confidence 665 55555432 23556667778888899999999999999999999 5777775433 579999999999999999 Q ss_pred CCCCceEEEE--EEEEEecCCceEEEE-------EE--------cCCeEEEEEEeeeec--ccccc-----eeeccCCCc Q lcl|NC_021301. 144 PLQPWRIRSA--MRWWRDLDAESDFAI-------VW--------SGDGWQKFARPCFVQ--SSSRR-----RLVTRISDS 199 (456) Q Consensus 144 ~~~~~~~~~~--~~~~~~~d~~~~~~~-------~~--------~~~~~~~~~~~~~~~--~~~~~-----~~~~~~~~~ 199 (456) |.....+..+ +..|..++....-.. .| ...+-+.|....|.. ++... ......-.. T Consensus 158 ~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~~e~p~~~~~~~~~~~ 237 (527) T protein:vir:10 158 PRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDRPESPLEPDDIKKLST 237 (527) T ss_pred CCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccccccccchhhhhhhcC Confidence 9877766655 333554443321100 01 111122222222211 11000 000000011 Q ss_pred eeecccccccCceeEEEEccC------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc Q lcl|NC_021301. 200 WVPVGDAVVTGSPPPVVVYQN------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN 273 (456) Q Consensus 200 ~~~~~~~~~~~~~~pvv~~~n------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~ 273 (456) .......++.++++|||+++| .+|+|+++++++++|++|+++|+.+.++++.+.|+.++.|+.+. 
+..|+ T Consensus 238 ~~~l~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~v----d~~G~ 313 (527) T protein:vir:10 238 LTEEEPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPR----DSRGN 313 (527) T ss_pred ceeeecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccc----cccCC Confidence 233445677889999998877 38999999999999999999999999999999999999988643 22233 Q ss_pred hhhhhhhhhhhccceeccCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhc--ccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 274 AIDYASIFEAAPGALWELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLM--PDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 274 ~~~~~~~~~~~~~~~~~~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~--~~~~N~Sg~Al~~~~~~l~ 350 (456) . ..+..++|.+|.++.++++..++. .++++|.++++.+...|+.++++|...|| ..+.++||.||+..+++|. T Consensus 314 ~----~~~~VgPG~iweL~e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLl 389 (527) T protein:vir:10 314 M----VPWTISPLGMVEHGQNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAIL 389 (527) T ss_pred c----CccccCCceeEecCCCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHH Confidence 2 346678899999999999998876 57899999999999999999999999999 3455689999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHH-H----HHHhc-----CCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILV-K----ALQIE-----GESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNIL 420 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~-l----~~~~~-----~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~ 420 (456) +++++++..++-..+|..+ + +-+.. 
+......++++|.+++|+|.++.++.+++|+++|++|++||+++| T Consensus 390 ar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L 469 (527) T protein:vir:10 390 SSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEEL 469 (527) T ss_pred HHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHH Confidence 9999999888888776432 1 11112 222335789999999999999999999999999999999998887 Q ss_pred ---CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 421 ---NYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 421 ---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) |+..+ .+.|.+++.++...++-+.+.+...=|.. T Consensus 470 ~~~~g~eD--~E~E~~~I~~era~~a~a~a~a~~~~~a~ 506 (527) T protein:vir:10 470 SKIMGFEL--TEEDFRQATEDKKTQGIAQAEAADPFGAQ 506 (527) T ss_pred HhccCCCc--hHHHHHHHHHHHHHHhHHhhhhcCchhhh Confidence 44322 22333344443333332222222222221 No 68 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=5e-44 Score=257.98 Aligned_cols=446 Identities=10% Similarity=0.070 Sum_probs=283.1 Q ss_pred CCCCCHHHHHHH-HHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPV-LTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~-l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) |...+-..+... -+..-.+...|+++..+||+|+++-..... ..+.. ...++.++|+++.||+.+|+++++++.++. 
T Consensus 18 ~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~-~~~~~-~~~~~~slnl~~~i~~~~A~lv~~e~~~i~ 95 (500) T protein:vir:30 18 MTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLN-TDGET-KKRDLNHLPIARTAAKKIASLVFNEQAEIK 95 (500) T ss_pred hhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCccccc-CCCCc-ccCceeecchHHHHHHHHhhhhcCCcceEe Confidence 211221111110 000013456788889999999965332111 11111 223467789999999999999999997765 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEE-E Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWW-R 158 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~-~ 158 (456) .+ ++..++.+++++++|+|.....+++..++..|..|+.+|.|. +.++|..++|.+++|+..+..+....+++.++ . T Consensus 96 ~~-d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~ 173 (500) T protein:vir:30 96 VD-DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVK 173 (500) T ss_pred cC-ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEee Confidence 54 456788899999999999999999999999999999999984 67999999999999987666665555554443 3 Q ss_pred ecCCceEEEEE-----EcCCeEEEEEEeeeecc---cccceeeccCCCceee--cccccccCceeEEEEccC-------- Q lcl|NC_021301. 159 DLDAESDFAIV-----WSGDGWQKFARPCFVQS---SSRRRLVTRISDSWVP--VGDAVVTGSPPPVVVYQN-------- 220 (456) Q Consensus 159 ~~d~~~~~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~~~~pvv~~~n-------- 220 (456) ..+++..+++. |.++..+......+... ..+.... 
....|.- ..........|+++++.+ T Consensus 174 ~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~--l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~ 251 (500) T protein:vir:30 174 TINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVP--LSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDI 251 (500) T ss_pred eecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccc--cccccCCcCcceEeccCCCccEEEecCCccccccC Confidence 34444443332 12222222221122111 1111100 0001110 111123445677776532 Q ss_pred --CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhcc---ceecc-CCC Q lcl|NC_021301. 221 --PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPG---ALWEL-PPG 294 (456) Q Consensus 221 --~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~d 294 (456) |.|.|+|+.+++++|++|.+.|++.+..+....++.+-..+-.... ....|... ....+..... .+-.. +.+ T Consensus 252 ~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~-~~~~g~~~-~~~~~d~~~~~~~~~~~~~~~~ 329 (500) T protein:vir:30 252 NSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTV-RTTDGDVV-PRPRFESDQNVYIRMGGRDLDS 329 (500) T ss_pred CCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccC-CCCCcccc-CCcccCCCcceEEEcCCCCCcC Confidence 4699999999999999999999999887753332222111100000 00011111 0011111110 01111 122 Q ss_pred ceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 295 VDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 295 ~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) ..+.++++. 
..+.+.+.++.++++|...+|+++..||.+.++ .||.+++++++.+.+++..+++.|+.+|++++++++ T Consensus 330 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il 409 (500) T protein:vir:30 330 SAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIF 409 (500) T ss_pred cceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 335555433 468899999999999999999999999865443 589999999999999999999999999999999887 Q ss_pred Hhc-------C-CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHH-hCCCChhHHHHHHHHHHHHHHHHHh Q lcl|NC_021301. 373 QIE-------G-ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRN-ILNYNADQIKQDDLDRAREQITLFA 443 (456) Q Consensus 373 ~~~-------~-~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~-~~~~~~~~~~~~e~~~~~ee~~~~~ 443 (456) .+. + ....+.+.+.|.+..++|..++++...+++++|++|.++++. ..|+++++ ++.+++++++|..... T Consensus 410 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eee-a~~~l~~i~~E~~~~~ 488 (500) T protein:vir:30 410 EIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEK-AQEIAAEINTGIVDEI 488 (500) T ss_pred HHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhccccC Confidence 442 1 234567899999999999999999999999999999999875 45888765 4445677777654333 Q ss_pred hhhhhhcccccC Q lcl|NC_021301. 444 GNSVQRPQEDGS 455 (456) Q Consensus 444 ~~~~~~~~~d~~ 455 (456) +.......-=|+ T Consensus 489 ~~~~~~~~~~g~ 500 (500) T protein:vir:30 489 NQQRTDTHLYGE 500 (500) T ss_pred CCCCccccccCC Confidence 333333333344 No 69 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=5e-44 Score=257.98 Aligned_cols=446 Identities=10% Similarity=0.070 Sum_probs=283.1 Q ss_pred CCCCCHHHHHHH-HHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 
1 MTASTPAEWLPV-LTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~-l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) |...+-..+... -+..-.+...|+++..+||+|+++-..... ..+.. ...++.++|+++.||+.+|+++++++.++. T Consensus 18 ~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~-~~~~~-~~~~~~slnl~~~i~~~~A~lv~~e~~~i~ 95 (500) T protein:vir:98 18 MTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLN-TDGET-KKRDLNHLPIARTAAKKIASLVFNEQAEIK 95 (500) T ss_pred hhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCccccc-CCCCc-ccCceeecchHHHHHHHHhhhhcCCcceEe Confidence 211221111110 000013456788889999999965332111 11111 223467789999999999999999997765 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEE-E Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWW-R 158 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~-~ 158 (456) .+ ++..++.+++++++|+|.....+++..++..|..|+.+|.|. +.++|..++|.+++|+..+..+....+++.++ . T Consensus 96 ~~-d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~ad~~~P~~~d~~~~~~~a~~~~~~~ 173 (500) T protein:vir:98 96 VD-DDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQAPVFLPLQSNTQDVSSAAVVIKSVK 173 (500) T ss_pred cC-ChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeeEEEEEcCCCeEEEEEEEEEee Confidence 54 456788899999999999999999999999999999999984 67999999999999987666665555554443 3 Q ss_pred ecCCceEEEEE-----EcCCeEEEEEEeeeecc---cccceeeccCCCceee--cccccccCceeEEEEccC-------- Q lcl|NC_021301. 159 DLDAESDFAIV-----WSGDGWQKFARPCFVQS---SSRRRLVTRISDSWVP--VGDAVVTGSPPPVVVYQN-------- 220 (456) Q Consensus 159 ~~d~~~~~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~~~~pvv~~~n-------- 220 (456) ..+++..+++. |.++..+......+... ..+.... 
....|.- ..........|+++++.+ T Consensus 174 ~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~--l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~~ 251 (500) T protein:vir:98 174 TINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVP--LSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKDI 251 (500) T ss_pred eecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccc--cccccCCcCcceEeccCCCccEEEecCCccccccC Confidence 34444443332 12222222221122111 1111100 0001110 111123445677776532 Q ss_pred --CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhcc---ceecc-CCC Q lcl|NC_021301. 221 --PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPG---ALWEL-PPG 294 (456) Q Consensus 221 --~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~-~~d 294 (456) |.|.|+|+.+++++|++|.+.|++.+..+....++.+-..+-.... ....|... ....+..... .+-.. +.+ T Consensus 252 ~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~-~~~~g~~~-~~~~~d~~~~~~~~~~~~~~~~ 329 (500) T protein:vir:98 252 NSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTV-RTTDGDVV-PRPRFESDQNVYIRMGGRDLDS 329 (500) T ss_pred CCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccC-CCCCcccc-CCcccCCCcceEEEcCCCCCcC Confidence 4699999999999999999999999887753332222111100000 00011111 0011111110 01111 122 Q ss_pred ceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 295 VDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 295 ~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) ..+.++++. 
..+.+.+.++.++++|...+|+++..||.+.++ .||.+++++++.+.+++..+++.|+.+|++++++++ T Consensus 330 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~al~~lv~~il 409 (500) T protein:vir:98 330 SAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQSLKELVISIF 409 (500) T ss_pred cceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 335555433 468899999999999999999999999865443 589999999999999999999999999999999887 Q ss_pred Hhc-------C-CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHH-hCCCChhHHHHHHHHHHHHHHHHHh Q lcl|NC_021301. 373 QIE-------G-ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRN-ILNYNADQIKQDDLDRAREQITLFA 443 (456) Q Consensus 373 ~~~-------~-~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~-~~~~~~~~~~~~e~~~~~ee~~~~~ 443 (456) .+. + ....+.+.+.|.+..++|..++++...+++++|++|.++++. ..|+++++ ++.+++++++|..... T Consensus 410 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eee-a~~~l~~i~~E~~~~~ 488 (500) T protein:vir:98 410 EIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEK-AQEIAAEINTGIVDEI 488 (500) T ss_pred HHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHH-HHHHHHHHHHhccccC Confidence 442 1 234567899999999999999999999999999999999875 45888765 4445677777654333 Q ss_pred hhhhhhcccccC Q lcl|NC_021301. 444 GNSVQRPQEDGS 455 (456) Q Consensus 444 ~~~~~~~~~d~~ 455 (456) +.......-=|+ T Consensus 489 ~~~~~~~~~~g~ 500 (500) T protein:vir:98 489 NQQRTDTHLYGE 500 (500) T ss_pred CCCCccccccCC Confidence 333333333344 No 70 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=2.5e-40 Score=237.71 Aligned_cols=449 Identities=9% Similarity=0.048 Sum_probs=280.3 Q ss_pred CCCC-CHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHH Q lcl|NC_021301. 
1 MTAS-TPAEWLPVLTKR-----------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLM 62 (456) Q Consensus 1 ~~~~-t~~~~~~~l~~~-----------------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ 62 (456) |.-= .-.+|+.++..+ +.++..|+++.+.||+|+++-.. ...........++.++|+++. T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~--~~~~~~~~~~~~~~slnl~~~ 78 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQ--YKNTDGDIKSRPMNHLPIART 78 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCccccc--ccccCcchhcccceecchHHH Confidence 2211 122333333322 34556788888999999865331 111111112234677899999 Q ss_pred HHHHHHhhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEE Q lcl|NC_021301. 63 VRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) Q Consensus 63 iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~ 142 (456) ||+.+|+++++++.+++.+ |+..++.+++++++|+|.....+++..++..|..++.+|.| .|.++|..++|..++|+. T Consensus 79 i~~~~A~lv~~e~~~i~v~-d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~i~~v~ad~~~P~~ 156 (522) T protein:vir:47 79 ASKKIASLVYNEQATITTK-NEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-GDKVRVAFIQAPVFFPLE 156 (522) T ss_pred HHHHHhhhhcCCcceeecC-ChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-CCceEEEEEcCCceEEEE Confidence 9999999999999877654 45678889999999999999999999999999988888887 578999999999999985 Q ss_pred eCCCCceEEEEEE-EEEecCCceEEEEEEcC----------------CeEEEEEEeeeecc---cccceeeccCCCcee- Q lcl|NC_021301. 143 DPLQPWRIRSAMR-WWRDLDAESDFAIVWSG----------------DGWQKFARPCFVQS---SSRRRLVTRISDSWV- 201 (456) Q Consensus 143 d~~~~~~~~~~~~-~~~~~d~~~~~~~~~~~----------------~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~- 201 (456) .+..+....+++. .+...++...++++..- ...++-....+... ..+.......-+.|. 
T Consensus 157 ~~~~~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~ 236 (522) T protein:vir:47 157 SNTQDVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKN 236 (522) T ss_pred EcCCceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccC Confidence 4444332222222 23333343433332110 00111111111111 111111000000111 Q ss_pred -ecccccccCceeEEEEccC----------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc Q lcl|NC_021301. 202 -PVGDAVVTGSPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE 270 (456) Q Consensus 202 -~~~~~~~~~~~~pvv~~~n----------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 270 (456) ......+.+.+|+++++.| |.|.|+|+..++++|++|.+.|++.+..+....++.+-..+-.. ..+. T Consensus 237 l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~--~~~~ 314 (522) T protein:vir:47 237 LEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQR--QYQR 314 (522) T ss_pred CCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhcc--CCCC Confidence 1111223456788877643 46999999999999999999999988777644433321111000 0000 Q ss_pred ccchhhhhhhhhhhcc---cee-ccCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHH Q lcl|NC_021301. 271 NGNAIDYASIFEAAPG---ALW-ELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHN 344 (456) Q Consensus 271 ~~~~~~~~~~~~~~~~---~~~-~~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~ 344 (456) .+........+..... .+- ..+...++..+++ --.+.|.+.++.+++.|....|+++..||.+... .+|.+++. 
T Consensus 315 ~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s 394 (522) T protein:vir:47 315 PDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVS 394 (522) T ss_pred CCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHH Confidence 0111000111110000 010 1112234555543 3467899999999999999999999999865443 48999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--------CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHH Q lcl|NC_021301. 345 IEKGFLFKCEDRLSIAKIGLEAILVKALQIEG--------ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIR 416 (456) Q Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~--------~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~ 416 (456) +.+.+.+++..+++.|..+|+++++.++.+.. ....+.++|.|.+.++.|..++++...+++++|++|.+++ T Consensus 395 ~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e~~ 474 (522) T protein:vir:47 395 ENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKRA 474 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHH Confidence 99999999999999999999999998875431 2345679999999999999999999999999999999998 Q ss_pred HHhC-CCChhHHHHHHHHHHHHHHHHHhh-------hhhhhcccccCC Q lcl|NC_021301. 417 RNIL-NYNADQIKQDDLDRAREQITLFAG-------NSVQRPQEDGSR 456 (456) Q Consensus 417 ~~~~-~~~~~~~~~~e~~~~~ee~~~~~~-------~~~~~~~~d~~~ 456 (456) +..+ |+++++ ++.+++|+++|...... 
...++.+.-+++ T Consensus 475 i~~~~g~~eee-a~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~ 521 (522) T protein:vir:47 475 IGKTLNISGVE-AEKELNAINSELLPMNDAELAIYGMHDQNEEKADDK 521 (522) T ss_pred HHhcCCCChHH-HHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCC Confidence 6655 777665 56678888877543211 111111111222 No 71 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=1.3e-41 Score=244.82 Aligned_cols=443 Identities=13% Similarity=0.066 Sum_probs=282.0 Q ss_pred CCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTAST--PAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t--~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) |.+.- |.-...++-.....|..+|+.|.+||.|++--....+... . .+-+..++++++|++...|| |+|+.+ T Consensus 9 ~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il~G~----d-r~~~~~ps~r~~V~~~~~~L-g~~~~~ 82 (563) T protein:vir:74 9 DPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVLRGD----D-SVPILMPSGRKIVEAVHRFL-GVGFDY 82 (563) T ss_pred CCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhcCCC----c-eeeeccchHHHHHHHHHHhc-CCCcEE Confidence 43332 2222333445567789999999999999986432211111 1 11233567889999966555 999887 Q ss_pred CCCCc---c----cHHHHHHHHHHhcChhHHHHHHHHHHhhCCe-EEEEEeeC---CCCceEEEEEccceeEEEEeCCCC Q lcl|NC_021301. 79 GGSAD---S----DLALRARRIWRDNRMDSVCKQWVKYGLDFGE-SYLTCWRR---DDGTATITADSPETMVVSVDPLQP 147 (456) Q Consensus 79 ~~~~d---~----~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~-a~~~v~~d---~dg~~~i~~~~p~~~~~~~d~~~~ 147 (456) ..... + ..+..|.++++++++..++.++.+++.+.|+ +|.++|-. ..+++++..++|.+.|++-||... 
T Consensus 83 ~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~fp~~dpd~v 162 (563) T protein:vir:74 83 LVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIFLIEDGSTV 162 (563) T ss_pred ecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceeeeccCCCCc Confidence 44322 2 1235568889999999999999999999999 56667643 335899999999999997777654 Q ss_pred ceEE--EEEEEEEecCCceE-------EEEEEcCCeEEEEEEee-eeccccccee--------e-ccCCCcee-----ec Q lcl|NC_021301. 148 WRIR--SAMRWWRDLDAESD-------FAIVWSGDGWQKFARPC-FVQSSSRRRL--------V-TRISDSWV-----PV 203 (456) Q Consensus 148 ~~~~--~~~~~~~~~d~~~~-------~~~~~~~~~~~~~~~~~-~~~~~~~~~~--------~-~~~~~~~~-----~~ 203 (456) .-.. .++.-|..++...+ +...+.+++.+....-+ ..-+..+.|. . ....+.+. .+ T Consensus 163 ~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~~~~~~~~~~~~~d~e~ 242 (563) T protein:vir:74 163 VGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQARRKEQVRSAQHDEEE 242 (563) T ss_pred ccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhhhcccchhhhhhhhchh Confidence 3322 22223332222111 01111122221111100 0001111000 0 01111111 12 Q ss_pred ccccccCceeEEEEccC------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhh Q lcl|NC_021301. 204 GDAVVTGSPPPVVVYQN------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDY 277 (456) Q Consensus 204 ~~~~~~~~~~pvv~~~n------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~ 277 (456) ...+..++++|++.++| .+|+|++++++++++++|+++++.+.++.+.+.|+.++.|..+.+. ..| . 
T Consensus 243 ~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~~p~d~---~~g----~ 315 (563) T protein:vir:74 243 EELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNASAPVDP---NTG----E 315 (563) T ss_pred hhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecccccccc---ccc----c Confidence 22356778899988877 3899999999999999999999999999999999999887553221 122 2 Q ss_pred hhhhhhhccceeccCCC---ceeEeecc-cchHHHHHHHHHHHH-HHHhhcCCChhhhc--ccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 278 ASIFEAAPGALWELPPG---VDIWESQT-NDFTPMLSAIKEHIR-QLSSATKTPLPMLM--PDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 278 ~~~~~~~~~~~~~~~~d---~~~~~~~~-~~~~~~~~~l~~~~~-~i~~~~~~p~~~~~--~~~~N~Sg~Al~~~~~~l~ 350 (456) ...+..++|.+|.++.+ ..+-.++. .+++++..+++.+.. -++.++++|...|| ..+..+||.||+..+.+|. T Consensus 316 ~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SGiALeL~L~PL~ 395 (563) T protein:vir:74 316 LTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESGISLELQLKPLL 395 (563) T ss_pred ccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccchhhhhhhhhHHH Confidence 33467789999998866 44556664 457888888887776 57888999999999 4455679999999999999 Q ss_pred HHHHHHHHHHHHHHHH----HHHHHH-H-----hcCCC-------c---ccceeEEecCCCCcCHHHHHHHHHHHHhcCC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEA----ILVKAL-Q-----IEGES-------V---EDTVDVSFESPDRVTLGEKYAAASLAKAAGE 410 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~----~~~l~~-~-----~~~~~-------~---~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~ 410 (456) ++|++++..+..++++ .+++.+ . +.|.. 
+ ...+.|+|.++.|.|.++.++.++.|+++|+ T Consensus 396 a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv~~~~tl~~aGi 475 (563) T protein:vir:74 396 AANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQVTQDTLLLQQAHL 475 (563) T ss_pred HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHHHHHHHHHHHcCc Confidence 9999999988777776 333333 1 12221 1 1236789999999999999999999999999 Q ss_pred CcHHHHHHhC---CCChh----HHHHHHHHHHHHHHHHHhhhh----hhhccccc----------CC Q lcl|NC_021301. 411 SWASIRRNIL---NYNAD----QIKQDDLDRAREQITLFAGNS----VQRPQEDG----------SR 456 (456) Q Consensus 411 ~s~~t~~~~~---~~~~~----~~~~~e~~~~~ee~~~~~~~~----~~~~~~d~----------~~ 456 (456) +|++||.++| ||... +..+++.+++.+-..+.+.++ .++..+-| +- T Consensus 476 iSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~~g~~~~~~dd~g~p 542 (563) T protein:vir:74 476 ILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDNGGAGEQQFDDQGNP 542 (563) T ss_pred hhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceecccCCCCcccccccCCc Confidence 9999998777 77532 233344444444221111111 11111111 11 No 72 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=3.8e-39 Score=231.25 Aligned_cols=440 Identities=10% Similarity=0.012 Sum_probs=260.2 Q ss_pred CCHHHHHHHHHHHHH------HHHHHHHHHHHHhcccCc----cc----ccCcccchhhhhhhhhhccChHHHHHHHHHh Q lcl|NC_021301. 4 STPAEWLPVLTKRID------DGMSRVRLLARYSNGDAP----LP----ELTRNTSAAWRSFQREARTNWGLMVRDSVAD 69 (456) Q Consensus 4 ~t~~~~~~~l~~~~~------~~~~r~~~~~~YY~g~~~----i~----~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~ 69 (456) +--=..++..++-|- ....+..++.++|.+... +. ......++. 
...+++++|+++.||+.+|+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~--~~~~~~~~~l~~~i~~~~A~ 78 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPT--VHDKLMNSGTGNEIVVVAAE 78 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCc--cccccccCChHHHHHHHHHH Confidence 111111222222221 011122222222222210 00 000011111 12346789999999999999 Q ss_pred hhccCCeecCCC-----CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeC Q lcl|NC_021301. 70 RIIPNGITVGGS-----ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDP 144 (456) Q Consensus 70 ~l~~~~~~~~~~-----~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~ 144 (456) ++++++.+++.+ .++..++.+++++++|+|...+.+++..++..|..++.+|.+ +|+++|..++|..++|+|++ T Consensus 79 ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~~i~~v~ad~~~P~~~~ 157 (518) T protein:vir:78 79 YISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-NGRPSISVHSSSQFWIDFKN 157 (518) T ss_pred hhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-CCeeEEEEEcCCeeEEEeec Confidence 999998655321 234456778999999999999999999999999999888886 58899999999999999976 Q ss_pred CCCceEEEEEEEEE--ecCCceEEEEE-EcC-C-----------eEEEEEEeeee-cccccceee---ccCCCcee---- Q lcl|NC_021301. 145 LQPWRIRSAMRWWR--DLDAESDFAIV-WSG-D-----------GWQKFARPCFV-QSSSRRRLV---TRISDSWV---- 201 (456) Q Consensus 145 ~~~~~~~~~~~~~~--~~d~~~~~~~~-~~~-~-----------~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~---- 201 (456) .. +..++.+.. ..++...+..+ |.. + ..+.+...... ......... ......+. 
T Consensus 158 g~---~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~ 234 (518) T protein:vir:78 158 NE---PFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDI 234 (518) T ss_pred Cc---EEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccC Confidence 43 233333222 22222222211 110 0 11111110000 000000000 00000000 Q ss_pred -ecccccccCceeEEEEccC----------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc Q lcl|NC_021301. 202 -PVGDAVVTGSPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE 270 (456) Q Consensus 202 -~~~~~~~~~~~~pvv~~~n----------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 270 (456) .....+++...|++.+++| |.|.|+|+.+++++|++|.+.|++.+..+....++.+-..+ ...+. T Consensus 235 ~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~----l~~~~ 310 (518) T protein:vir:78 235 QLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERM----FRKKV 310 (518) T ss_pred ccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhH----hccCC Confidence 0111223344444444433 34999999999999999999999999887643333332111 01111 Q ss_pred ccchhhhhhhhhhhccceec----cCCCce----eEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHH Q lcl|NC_021301. 271 NGNAIDYASIFEAAPGALWE----LPPGVD----IWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEG 341 (456) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~----~~~d~~----~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~A 341 (456) .+........+......... 
.+.+.+ +.++++ -..+.|.+.++.+++.|...+|+++..||.+.+..||.+ T Consensus 311 ~~~~~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATe 390 (518) T protein:vir:78 311 NKSTDKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATE 390 (518) T ss_pred CCCCCccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHH Confidence 11100011111111111111 111121 444443 346889999999999999999999999987766789999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----------CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCC Q lcl|NC_021301. 342 AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG----------ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGES 411 (456) Q Consensus 342 l~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~----------~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~ 411 (456) ++...+.+.+++..++..++.+|+++++.++.+.. ..++..+.|.|.+.+++|..++++...+++++|++ T Consensus 391 i~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGim 470 (518) T protein:vir:78 391 IWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAM 470 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCC Confidence 99999999999999999999999999988775432 12345688999999999999999999999999999 Q ss_pred cHHHHHHhC--CCChhHHHHHHHHHHHHHHHHHhhhhh----hhccccc Q lcl|NC_021301. 412 WASIRRNIL--NYNADQIKQDDLDRAREQITLFAGNSV----QRPQEDG 454 (456) Q Consensus 412 s~~t~~~~~--~~~~~~~~~~e~~~~~ee~~~~~~~~~----~~~~~d~ 454 (456) |.+++++++ +++++ .++.|.+|+++|...+..... 
....+-| T Consensus 471 S~e~~i~~~~~~~~de-ea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 471 SVEEKVKLIHPKWEDE-EIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred CHHHHHHHhCCCCCHH-HHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 999988754 56554 456688888888654322111 1222333 No 73 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=1.4e-38 Score=228.09 Aligned_cols=444 Identities=10% Similarity=0.063 Sum_probs=275.5 Q ss_pred CCCCC-HHHHHHHHHHH-----------------HHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHH Q lcl|NC_021301. 1 MTAST-PAEWLPVLTKR-----------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLM 62 (456) Q Consensus 1 ~~~~t-~~~~~~~l~~~-----------------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~ 62 (456) |.--+ -..|++++..+ =.....|+.+.++||+|+++-.+ ....+......++.++|+++. T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~--~~~~~~~~~~~~~~sl~~~~~ 78 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVE--YINSQGKIQERDYMTLNLRKL 78 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccc--cccccccccccceeecCcHHH Confidence 21111 12222222111 12345677888999999987432 111122223345678899999 Q ss_pred HHHHHHhhhccCCeecCCCC----------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEE Q lcl|NC_021301. 63 VRDSVADRIIPNGITVGGSA----------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITA 132 (456) Q Consensus 63 iVd~~a~~l~~~~~~~~~~~----------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~ 132 (456) |+..+|+++++++.++..+. 
+...++.+++++++|+|.....+++..++..|.+++.+|.| .|.++|.+ T Consensus 79 i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~I~~ 157 (517) T protein:vir:98 79 SADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVD-NGEIEFSW 157 (517) T ss_pred HHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEe-CCeeEEEE Confidence 99999999999975543321 12245678899999999999999999999999999999988 46799999 Q ss_pred EccceeEEEEeCCCCceEEEEEEE--EEecCCceEEEEE---EcCC------eEEEEEEeeeecc---cccceeeccCCC Q lcl|NC_021301. 133 DSPETMVVSVDPLQPWRIRSAMRW--WRDLDAESDFAIV---WSGD------GWQKFARPCFVQS---SSRRRLVTRISD 198 (456) Q Consensus 133 ~~p~~~~~~~d~~~~~~~~~~~~~--~~~~d~~~~~~~~---~~~~------~~~~~~~~~~~~~---~~~~~~~~~~~~ 198 (456) ++|..++|+-.+..+ ...+++.+ +...+++..++++ +..+ +.|+.....+... ..+.... ... T Consensus 158 v~ad~~~Pl~~~~~~-v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~--L~~ 234 (517) T protein:vir:98 158 ALANAFYPLRSNSNG-ISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIP--LEE 234 (517) T ss_pred EcCCeeEEEEecCCC-eEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCcccccccc--ccc Confidence 999999996444433 34343322 2233333333322 2111 1222222222111 1111110 001 Q ss_pred cee--ecccccccCceeEEEEccC----------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccc Q lcl|NC_021301. 199 SWV--PVGDAVVTGSPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLP 266 (456) Q Consensus 199 ~~~--~~~~~~~~~~~~pvv~~~n----------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~ 266 (456) .|. ...........|+++++.| +.|.|+|+..++++|++|.+.+++....+....++.+-..+ . 
T Consensus 235 ~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~----l 310 (517) T protein:vir:98 235 LYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVM----L 310 (517) T ss_pred cccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhh----h Confidence 111 1111123345577777633 56999999999999999999999988777643332221111 1 Q ss_pred ccccccchhhhhhhhhhhccc--eeccC-CCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHH Q lcl|NC_021301. 267 KVDENGNAIDYASIFEAAPGA--LWELP-PGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEG 341 (456) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~--~~~~~-~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~A 341 (456) ..+.++........+...... .+..+ .+..+..+++. -.+.|.+.++.+++.|...+|+++..||.+... .+|.+ T Consensus 311 ~~~~~~~g~~~~~~~d~~~~~y~~~~~~~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATE 390 (517) T protein:vir:98 311 RTVPDESGMPPPQVFDPDVNVYKSIRMGTDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATE 390 (517) T ss_pred ccccCCCCcccCCCCCcccceeeeccCCCCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHH Confidence 111111100000111111100 01111 22234444432 357899999999999999999999999866544 37899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc------C--CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcH Q lcl|NC_021301. 342 AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE------G--ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWA 413 (456) Q Consensus 342 l~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~------~--~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~ 413 (456) ++...+.+.+++..+++.+..+|++++++++.+. + ....+.+.|.|.+.+++|..++++...+++++|++|. 
T Consensus 391 i~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~ 470 (517) T protein:vir:98 391 IVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPT 470 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCH Confidence 9999999999999999999999999998876432 1 2345678999999999999999999999999999999 Q ss_pred HHHHHh-CCCChhHHHHHHHHHHHHHHHHHhhhhhhhccc-----ccC Q lcl|NC_021301. 414 SIRRNI-LNYNADQIKQDDLDRAREQITLFAGNSVQRPQE-----DGS 455 (456) Q Consensus 414 ~t~~~~-~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~-----d~~ 455 (456) ++++.. .|+++++ ++.++.++++|......-...+++. |++ T Consensus 471 ~~~i~~~~g~~eee-A~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 471 VEAIQRIFKVPKKT-AEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred HHHHHHhCCCChHH-HHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 998654 5988665 5667888887765443222222222 222 No 74 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.96 E-value=7.1e-29 Score=174.90 Aligned_cols=426 Identities=12% Similarity=0.081 Sum_probs=272.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc-----ccC---cccchhhhhhhhhhc-cChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP-----ELT---RNTSAAWRSFQREAR-TNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~-----~~~---~~~~~~~~~~~~k~~-~n~~~~iVd~~a~~l 71 (456) |+.-+|+. 
+..--..|....++++++++-|.|...++ ++| .+..+.++..-.+.+ .||++.+|+.+++++ T Consensus 1 m~~~~~~~-v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~G~v 79 (513) T protein:vir:97 1 MADKDPKS-PATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLSGKP 79 (513) T ss_pred CCCCCCCC-CCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhhhh Confidence 88888876 66666778888999999999998864333 233 343344444333333 699999999999999 Q ss_pred ccCCeecCCCCcccHHHHHHH-HHH-----hcChhHHHHHHHHHHhhCCeEEEEEeeCCCC------------------c Q lcl|NC_021301. 72 IPNGITVGGSADSDLALRARR-IWR-----DNRMDSVCKQWVKYGLDFGESYLTCWRRDDG------------------T 127 (456) Q Consensus 72 ~~~~~~~~~~~d~~~~~~l~~-~~~-----~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg------------------~ 127 (456) |.++++.... ....+.+ +++ -++++.+++.+++.++.+|+|+++|-....+ . T Consensus 80 f~k~p~~~~~----~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~r 155 (513) T protein:vir:97 80 FSEPIKLNED----VPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLR 155 (513) T ss_pred hhcCcccCcC----chHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccC Confidence 9999876421 2233433 222 2679999999999999999999999654332 3 Q ss_pred eEEEEEccceeEEEE-eCCCCceEEEEEEE---EEecCCce----EEEEEEcCCeEEEEEEeeeecccccceeeccCCCc Q lcl|NC_021301. 128 ATITADSPETMVVSV-DPLQPWRIRSAMRW---WRDLDAES----DFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDS 199 (456) Q Consensus 128 ~~i~~~~p~~~~~~~-d~~~~~~~~~~~~~---~~~~d~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (456) |.+..++|++++-.- +...++..+-.+++ +.+.||-. ..+.+++++.+..++...... ...+. 
T Consensus 156 Py~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~---------~~~~e 226 (513) T protein:vir:97 156 PYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSN---------AQKEE 226 (513) T ss_pred ceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCC---------ccccc Confidence 789999999986442 33344333333333 22334311 122344444432222111000 01123 Q ss_pred eeecccccccCceeEEEEccC---C--CCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch Q lcl|NC_021301. 200 WVPVGDAVVTGSPPPVVVYQN---P--DGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA 274 (456) Q Consensus 200 ~~~~~~~~~~~~~~pvv~~~n---~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~ 274 (456) |.+.....+.++++|++++.. . .+.+-|.++-.|..+.-+..|++..++.+.++|++++.|.+.. +++ T Consensus 227 ~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~------~~~- 299 (513) T protein:vir:97 227 WALADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGE------DSD- 299 (513) T ss_pred eEEecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcC------CCC- Confidence 555555667788999988743 2 2445577777887788889999999999999999999997542 121 Q ss_pred hhhhhhhhhhccceeccCC-CceeE--eecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHH Q lcl|NC_021301. 275 IDYASIFEAAPGALWELPP-GVDIW--ESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLF 351 (456) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~-d~~~~--~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~ 351 (456) .+..+.++.|..+. 
+++++ +.+.+.++...+.++.+..+|....- ..+...++|.||++.+.......+ T Consensus 300 -----~i~iG~~~~~~lpe~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga---~ll~~~~~~~Ta~a~~~~~~~~~S 371 (513) T protein:vir:97 300 -----PVVVGPNKVLYNPDPAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGA---EFLKRKTGGQTATARALDSAEATS 371 (513) T ss_pred -----ceEeeccccccCCCCCCcceeeccCchhHHHHHHHHHHHHHHHHHHHH---HhhccCCccccHHHHHHHHHHHHH Confidence 12345667777764 56666 44556677788888988888866442 334444567899999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCC-CCcC-HHHHHHHHHHHHhcCCCcHHHHHHhC---CCC--- Q lcl|NC_021301. 352 KCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESP-DRVT-LGEKYAAASLAKAAGESWASIRRNIL---NYN--- 423 (456) Q Consensus 352 k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~-~~~~-~~e~ad~~~kl~~~g~~s~~t~~~~~---~~~--- 423 (456) ....+...+..+++++++++..+.|...+ ..+|.-++. .... ..+.++++.++.++|.+|++|.++.| |+. T Consensus 372 ~L~~~a~~le~al~~~l~~~a~wlg~~~~-~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d 450 (513) T protein:vir:97 372 DLSAMTGLFEDALAQALDITADWLRLGPN-GGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPED 450 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCCCC-ccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCcc Confidence 99999999999999999999999885432 223333332 2222 24678888999999999999987654 453 Q ss_pred --hhHHHHHHHHHHHHHHHHHh---hhhhhhccccc------CC Q lcl|NC_021301. 424 --ADQIKQDDLDRAREQITLFA---GNSVQRPQEDG------SR 456 (456) Q Consensus 424 --~~~~~~~e~~~~~ee~~~~~---~~~~~~~~~d~------~~ 456 (456) ++++.+.+.+++.++..... 
..+.+.+++.| ++ T Consensus 451 ~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 494 (513) T protein:vir:97 451 FDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGEGEG 494 (513) T ss_pred CCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCCCCC Confidence 23223333444443321100 01111222111 22 No 75 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.95 E-value=9.6e-28 Score=168.69 Aligned_cols=420 Identities=11% Similarity=0.005 Sum_probs=264.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc-----ccC---cccchhhhhhhhhhc-cChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP-----ELT---RNTSAAWRSFQREAR-TNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~-----~~~---~~~~~~~~~~~~k~~-~n~~~~iVd~~a~~l 71 (456) |.=+++- ..|....++++++++-|.|...++ ++| .+..+.++..-.+.+ .|+++.+|+..++++ T Consensus 1 m~V~~~h-------p~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~v 73 (452) T protein:vir:94 1 MPIETKH-------PEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMV 73 (452) T ss_pred CCCCCcC-------HHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchh Confidence 6544433 345667777888888888865433 233 233333333222333 799999999999999 Q ss_pred ccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCC-ceEEEEEccceeEEEEeCCCCceE Q lcl|NC_021301. 72 IPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDG-TATITADSPETMVVSVDPLQPWRI 150 (456) Q Consensus 72 ~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg-~~~i~~~~p~~~~~~~d~~~~~~~ 150 (456) |.+++++..++ ....+..--.-++++.+.+.+++.++.+|+|+++|-.+..| +|.+..++|++++-.--+..+... 
T Consensus 74 f~k~p~~~~p~---~l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii~W~~~~~g~l~ 150 (452) T protein:vir:94 74 LDQPPVITHPD---AMSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENILNWEEDEDGRLL 150 (452) T ss_pred hcCCceecccH---HHHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhcCccccccCCee Confidence 99998875432 22223222345789999999999999999999999776555 699999999998753323344433 Q ss_pred EEEEEEEEe-cCC-------ceEEEEEEc-CCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC- Q lcl|NC_021301. 151 RSAMRWWRD-LDA-------ESDFAIVWS-GDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN- 220 (456) Q Consensus 151 ~~~~~~~~~-~d~-------~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n- 220 (456) ...++.... .++ ....+.++. .++.|...+.. ....+.+.. ..+.......+.++++|+|++.. T Consensus 151 ~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~--~~~~~~~~~----~~~~~~~~~~~~l~~IP~v~~~~~ 224 (452) T protein:vir:94 151 MVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHE--TQDGKVWEL----AKTSTIQNVGVTMDYIPFFCITPS 224 (452) T ss_pred EEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEE--ccCCceeee----ccceeecCCCcccceeEEEEEcCC Confidence 333333211 111 111122222 22333322211 001111100 11222233456788899988743 Q ss_pred --C--CCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCC-Cc Q lcl|NC_021301. 221 --P--DGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPP-GV 295 (456) Q Consensus 221 --~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~ 295 (456) . .+.+-+.++..+.-+..+..|+...++.+.++|+.++.|.+... .+..+.+..|..+. 
++ T Consensus 225 ~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~--------------~i~iG~~~~~~lpe~~~ 290 (452) T protein:vir:94 225 GLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS--------------TMHIGSTKAWVIPEVAA 290 (452) T ss_pred CCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC--------------ceEecccccccCCCCCC Confidence 2 24555778888888888999999999999999999999875321 12345667777774 76 Q ss_pred eeE--eecccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 296 DIW--ESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 296 ~~~--~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) +++ +.+.+.++.+.+.|+.+.+++..... ..+- ...++.|++|.......-.+....+-..+..++.+++++++ T Consensus 291 ~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga---~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a 367 (452) T protein:vir:94 291 KVGFLEFTGQGLQSLEKALSEKQAQLASLSA---RLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIM 367 (452) T ss_pred cceEEccCchhHHHHHHHHHHHHHHHHHHHH---HhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH Confidence 665 44555677788888888888765433 2222 22345688887766666566666677778888999999999 Q ss_pred HhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_021301. 373 QIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNIL---NYNADQIKQDDLDRAREQITLFAGNSVQR 449 (456) Q Consensus 373 ~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~ 449 (456) .+.|......+++.-......-..+.++++.++.++|.+|++|.++.| |+.+.+ .|.+++.+|....+...... T Consensus 368 ~w~g~~~~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~---~e~~~i~~E~~~~~~~~~~~ 444 (452) T protein:vir:94 368 DMESMGGTLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPP---GESMGVIPDPPAPEPSPSNT 444 (452) T ss_pred HHcCCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCc---cCHHHHHHHhhccCcccCCC Confidence 999875433344332222333345788888899999999999998766 654322 23345556655555556668 Q ss_pred cccccCC Q lcl|NC_021301. 
450 PQEDGSR 456 (456) Q Consensus 450 ~~~d~~~ 456 (456) |..+||| T Consensus 445 ~~~~~~~ 451 (452) T protein:vir:94 445 PPNPSSK 451 (452) T ss_pred CCCCccC Confidence 8889999 No 76 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.93 E-value=9.4e-25 Score=152.30 Aligned_cols=432 Identities=12% Similarity=0.029 Sum_probs=258.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcc-----cccCcc--------cchhhhhhhhhhc-cChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPL-----PELTRN--------TSAAWRSFQREAR-TNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i-----~~~~~~--------~~~~~~~~~~k~~-~n~~~~iVd~ 66 (456) |.+ ++.--..|....++++++++-+.|...+ .++|+- ....++..-.+.+ .|+++.+|+. T Consensus 1 m~~------V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~~ 74 (501) T protein:vir:95 1 MPN------VSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLFG 74 (501) T ss_pred CCC------CCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHHH Confidence 652 2222233677788889999999997644 344431 1122333222333 6999999999 Q ss_pred HHhhhccCCeecCCCCcccHHHHHHHHHHh-----cChhHHHHHHHHHHhhCCeEEEEEeeCCCC--------------- Q lcl|NC_021301. 
67 VADRIIPNGITVGGSADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCWRRDDG--------------- 126 (456) Q Consensus 67 ~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg--------------- 126 (456) .++++|.+++++..+ ..+..++.+ ++++.+.+.+++.++.+|+|+++|-....+ T Consensus 75 l~G~vf~k~p~~~~p------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~ 148 (501) T protein:vir:95 75 LVGQVFMRDPVVKVP------ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRI 148 (501) T ss_pred HhhhhhcCCcceeCc------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccC Confidence 999999998876422 234444443 589999999999999999999999654322 Q ss_pred ceEEEEEccceeEEEE-eCCCCceEEEEEEE---EEecCCc-----eEEEEEE-c-CCeEEEEEEeeeeccccc-ceeec Q lcl|NC_021301. 127 TATITADSPETMVVSV-DPLQPWRIRSAMRW---WRDLDAE-----SDFAIVW-S-GDGWQKFARPCFVQSSSR-RRLVT 194 (456) Q Consensus 127 ~~~i~~~~p~~~~~~~-d~~~~~~~~~~~~~---~~~~d~~-----~~~~~~~-~-~~~~~~~~~~~~~~~~~~-~~~~~ 194 (456) .|.+..++|++++-.- +...++..+-.+++ +.+.++. ...+.+. . .++.+.+..+........ ..... T Consensus 149 rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~ 228 (501) T protein:vir:95 149 RPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIP 228 (501) T ss_pred CcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceec Confidence 3889999999986443 22333322222222 1222221 1112222 2 234444443332222111 01111 Q ss_pred ----cCCCceeecccccccCceeEEEEccC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcc Q lcl|NC_021301. 195 ----RISDSWVPVGDAVVTGSPPPVVVYQN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGL 265 (456) Q Consensus 195 ----~~~~~~~~~~~~~~~~~~~pvv~~~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~ 265 (456) .....|.+.....+.++++|+|++.. .+ +.+-+.++-.+.-+.-+.-|+...++.+.++|++|++|.+... 
T Consensus 229 ~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~~ 308 (501) T protein:vir:95 229 KGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEEW 308 (501) T ss_pred CCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcccc Confidence 11123444444567889999998743 22 2333555555555555667888888999999999999976532 Q ss_pred cccccccchhhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHH Q lcl|NC_021301. 266 PKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNI 345 (456) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~ 345 (456) .. .++. ..+..+.+..|.++++++++.+......-..+.|+.+.+++....- ..+.....|.||+|.+.. T Consensus 309 ~~---~~~~----~~i~~G~~~~~~lP~~~~~~~ie~~~~~i~~~~l~~l~~~m~~~Ga---~ll~~~~~~~Ta~~~~~~ 378 (501) T protein:vir:95 309 VT---NVLK----GSVNFGSRGGIPLPVGADAKLLQASENTMLKEAMDTKERQMVALGA---KLVEQKEVQRTATEAELE 378 (501) T ss_pred cc---cCCC----CceeecccccccCCCCCceeEEecChhhHHHHHHHHHHHHHHHHHH---hhccCCccchhHHHHHHH Confidence 11 1110 1234455677888888888776543323235667777777766432 233334456799999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCC-CC-cCHHHHHHHHHHHHhcCCCcHHHHHHhC--- Q lcl|NC_021301. 346 EKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESP-DR-VTLGEKYAAASLAKAAGESWASIRRNIL--- 420 (456) Q Consensus 346 ~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~-~~-~~~~e~ad~~~kl~~~g~~s~~t~~~~~--- 420 (456) .....+....+-..+..++.+++++++.+.|..+. ..+|..++. .. 
.-..+.++++.++.++|.+|++|.++.| T Consensus 379 ~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~-~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~ 457 (501) T protein:vir:95 379 AASEGSTLSSATKNVSAAFEWALKWAARWVGQADS-GVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLRKA 457 (501) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC-ceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHHhC Confidence 88888888888899999999999999999986533 233433333 22 2235678889999999999999996654 Q ss_pred CCChhHHHHHHHHHHHHHHH---HHhhhhhhhcccccCC Q lcl|NC_021301. 421 NYNADQIKQDDLDRAREQIT---LFAGNSVQRPQEDGSR 456 (456) Q Consensus 421 ~~~~~~~~~~e~~~~~ee~~---~~~~~~~~~~~~d~~~ 456 (456) ++.+.+. ..+.++++++.+ .....+......+|-. T Consensus 458 ~v~~~~~-~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~ 495 (501) T protein:vir:95 458 GVATEDD-SKAKEKIAKDTAEAMALATPANVPGDGSGGD 495 (501) T ss_pred CCCChhH-HHHHHHHHhhhcCcccccccCCCCCCCcccc Confidence 6654332 233444444332 2222222222233333 No 77 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.93 E-value=5.3e-25 Score=153.66 Aligned_cols=435 Identities=12% Similarity=0.042 Sum_probs=251.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc-----ccCcc--------cchhhhhhhhhhc-cChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP-----ELTRN--------TSAAWRSFQREAR-TNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~-----~~~~~--------~~~~~~~~~~k~~-~n~~~~iVd~ 66 (456) +-++.|+ ++.--..|....++++++++-|.|...++ ++|+- ....++..-.+.+ .|+++.+|+. 
T Consensus 28 ~~~~m~d--V~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~ 105 (535) T protein:vir:80 28 LGPSLPN--VGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDG 105 (535) T ss_pred CCCCCCC--CCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHH Confidence 3333332 44455557888899999999999975443 34431 1112333323333 7999999999 Q ss_pred HHhhhccCCeecCCCCcccHHHHHHHHHHh-----cChhHHHHHHHHHHhhCCeEEEEEeeCCCC-------------ce Q lcl|NC_021301. 67 VADRIIPNGITVGGSADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCWRRDDG-------------TA 128 (456) Q Consensus 67 ~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg-------------~~ 128 (456) .++++|.+++++..+ ..+..++.+ ++++.+++.+++.++.+|+|+++|-....+ .| T Consensus 106 l~G~vfrk~p~~~~p------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rP 179 (535) T protein:vir:80 106 MMGQVFSRDPIRQLP------PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRP 179 (535) T ss_pred HhchhhcCCcceecc------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCc Confidence 999999988765322 334445443 578999999999999999999999654444 38 Q ss_pred EEEEEccceeEEEEeC-CCCceEEEEEEE---EEecCCc-----eEEEEEEcC--CeEEEEEEeeeecccccceeeccCC Q lcl|NC_021301. 129 TITADSPETMVVSVDP-LQPWRIRSAMRW---WRDLDAE-----SDFAIVWSG--DGWQKFARPCFVQSSSRRRLVTRIS 197 (456) Q Consensus 129 ~i~~~~p~~~~~~~d~-~~~~~~~~~~~~---~~~~d~~-----~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (456) .+..++|++++-.-.. ..++..+-.+++ +...++. ...+.+... ++.|....+..... ....... 
T Consensus 180 y~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~----~~~~~~~ 255 (535) T protein:vir:80 180 TITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQ----EEMYYSY 255 (535) T ss_pred EEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecC----Ccccccc Confidence 8999999998654322 233322222222 1122211 112222222 23343322111000 0001111 Q ss_pred CceeecccccccCceeEEEEcc---CCC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccccc Q lcl|NC_021301. 198 DSWVPVGDAVVTGSPPPVVVYQ---NPD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENG 272 (456) Q Consensus 198 ~~~~~~~~~~~~~~~~pvv~~~---n~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~ 272 (456) ..+.+.....+.++++|++++. |.+ +.+-|.++..+.-+.-+.-|+...++.+.++|++|+.|.+.......-.+ T Consensus 256 ~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~~~~~~~~~ 335 (535) T protein:vir:80 256 SKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKDWVEDVFKD 335 (535) T ss_pred ceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchhhhhcCCCC Confidence 1233334455778999999874 323 33446677777777778888999999999999999999764321110011 Q ss_pred chhhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHH Q lcl|NC_021301. 273 NAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFK 352 (456) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k 352 (456) .....+.+..|.++.+++++.+.........+.++.+..++..... ..+...++|.++.+.+.......+. 
T Consensus 336 ------~~i~iG~~~~~~lP~~~~~~~~e~~~~~~a~~~l~~~e~qM~~lGa---~ll~~~~~~~Ta~~a~~~~~~~~S~ 406 (535) T protein:vir:80 336 ------FKVHLGSRAIIPLPQGATAGILQITPNSVPFEAMTHKESQMIAMGA---NLLVKSGGNRTFGEAQQEEASEQSI 406 (535) T ss_pred ------cceEecCcccccCCCCCCcceeeeccchhHHHHHHHHHHHHHHHHH---HhhccCcccccHHHHHHHHHHHhHH Confidence 1123455667777777766544322222234667777777766432 2233334455444445555555666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCC-cccceeEEecCC-CCcC-HHHHHHHHHHHHhcCCCcHHHHHHhC---CCChhH Q lcl|NC_021301. 353 CEDRLSIAKIGLEAILVKALQIEGES-VEDTVDVSFESP-DRVT-LGEKYAAASLAKAAGESWASIRRNIL---NYNADQ 426 (456) Q Consensus 353 ~~~~~~~f~~~l~~~~~l~~~~~~~~-~~~~i~v~f~~~-~~~~-~~e~ad~~~kl~~~g~~s~~t~~~~~---~~~~~~ 426 (456) ...+-..+..++.+++++++.+.|.. ++..+++.-++. .... ..+.++++.++.++|.+|++|.++.| |+...+ T Consensus 407 L~~~a~~le~al~~aL~~~A~w~G~~~~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L~r~gvl~~~ 486 (535) T protein:vir:80 407 LSACTKNVSMAFRKALRWANQFQTGIVNDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGLRRAGVASED 486 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCccCCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc Confidence 77777888899999999999988853 444444443332 2222 34678888899999999999987665 665332 Q ss_pred H-HHHHHHHHHHHHHHHhhh-----------hhhhcccccC----C Q lcl|NC_021301. 427 I-KQDDLDRAREQITLFAGN-----------SVQRPQEDGS----R 456 (456) Q Consensus 427 ~-~~~e~~~~~ee~~~~~~~-----------~~~~~~~d~~----~ 456 (456) . .++|..+++.|....... 
....+-.||+ | T Consensus 487 ~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~~ 532 (535) T protein:vir:80 487 DAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGNQ 532 (535) T ss_pred cchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCcccccc Confidence 2 233455565553221111 1111111121 1 No 78 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.92 E-value=2.3e-24 Score=150.18 Aligned_cols=438 Identities=8% Similarity=-0.041 Sum_probs=254.3 Q ss_pred CCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHhcccCc-------ccccCcccch-hhhhhhhhhc-cChHHHHHHHHHhh Q lcl|NC_021301. 1 MTAST-PAEWLPVLTKRIDDGMSRVRLLARYSNGDAP-------LPELTRNTSA-AWRSFQREAR-TNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t-~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~-------i~~~~~~~~~-~~~~~~~k~~-~n~~~~iVd~~a~~ 70 (456) |-+.. -..=++.--..|....++++++++-|.|... ++..++...+ .++..-.+.+ .|+++.+|+..+++ T Consensus 1 ~~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~ 80 (489) T protein:vir:78 1 MLTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGS 80 (489) T ss_pred CccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhch Confidence 43333 2222444445567888899999999999542 2222222111 1222222233 69999999999999 Q ss_pred hccCCeecCCCCcccHHHHHHHHHHh-----cChhHHHHHHHHHHhhCCeEEEEEeeCCCC------------ceEEEEE Q lcl|NC_021301. 
71 IIPNGITVGGSADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCWRRDDG------------TATITAD 133 (456) Q Consensus 71 l~~~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg------------~~~i~~~ 133 (456) +|.+++++..+ ..+..++.+ ++++.+.+.+++.++.+|+|+++|-.+..+ +|.+..+ T Consensus 81 vfrk~p~~~~p------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~ 154 (489) T protein:vir:78 81 VMRKEPEINIP------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFY 154 (489) T ss_pred hhcCCcceecc------HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEe Confidence 99998776422 234444443 678999999999999999999999876655 5889999 Q ss_pred ccceeEEEE-eCCCCce-EEEEEEEEE----ec-CC----ceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceee Q lcl|NC_021301. 134 SPETMVVSV-DPLQPWR-IRSAMRWWR----DL-DA----ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVP 202 (456) Q Consensus 134 ~p~~~~~~~-d~~~~~~-~~~~~~~~~----~~-d~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (456) +|++++-.- +...++. +..++.... +. ++ ....+.++..+....|....+.....+.... ...... T Consensus 155 ~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~---~~~~~~ 231 (489) T protein:vir:78 155 TTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQE---DVVEIY 231 (489) T ss_pred chhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccc---eeeEEe Confidence 999986542 2233332 222222221 11 11 1223344444322222222222111111000 000111 Q ss_pred cccccccCceeEEEEccC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhh Q lcl|NC_021301. 203 VGDAVVTGSPPPVVVYQN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDY 277 (456) Q Consensus 203 ~~~~~~~~~~~pvv~~~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~ 277 (456) .....+.++++|++++.. 
.+ +.+-|.++-.|.-+.-+.-|+...++.+.++|++++.|.+.........+++ T Consensus 232 ~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~~~--- 308 (489) T protein:vir:78 232 PDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEANP--- 308 (489) T ss_pred ccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCcccccccCc--- Confidence 123346778899988742 22 3344666666666666788889999999999999999875432222222222 Q ss_pred hhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 278 ASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRL 357 (456) Q Consensus 278 ~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~ 357 (456) ..+..+.+..|.++.+++++.+......--.+.|+.+.+++..+.- ..+. ..+|-||++.+.....-.+....+- T Consensus 309 -~~i~~g~~~~~~lp~~~~~~~ie~~~~~~~r~~l~~le~qm~~lGa---~l~~-~~~~~Ta~~~~~~~~~~~S~L~~~a 383 (489) T protein:vir:78 309 -NGIKFGSRRGHNLGYGGSAQLIQAGENNLARQNMLDKEQQAIQIGA---QLIT-PTQQITAQSARIQRGADTSVMATIA 383 (489) T ss_pred -cceeeCCcccccCCCCCCcceeccCcchHHHHHHHHHHHHHHHHhh---hhcc-CCcchhHHHHHHHHHHhhHHHHHHH Confidence 1233445566777777766554333222234556666666654321 1222 2346788888888888888888888 Q ss_pred HHHHHHHHHHHHHHHHhcCCCcccceeEEecCC--CCcCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHHH Q lcl|NC_021301. 358 SIAKIGLEAILVKALQIEGESVEDTVDVSFESP--DRVTLGEKYAAASLAKAAGESWASIRRNIL---NYNADQIKQDDL 432 (456) Q Consensus 358 ~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~--~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~ 432 (456) ..+..++.+++++++.+.|..++..+++.-++. ...-..+.++++.++.++|.+|++|.++.| |+.+.+.++ +. 
T Consensus 384 ~~~e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~d~~~e~-~~ 462 (489) T protein:vir:78 384 RNVSQAYTDALRWVAVMLGKPEDTEVEFRLNMDFFLEPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDWTDAD-IK 462 (489) T ss_pred HHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccCcccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCccHHH-HH Confidence 999999999999999999987665555433331 112225568888899999999999987654 555433222 33 Q ss_pred HHHHHHHHHH---hhhhhhhcccccCC Q lcl|NC_021301. 433 DRAREQITLF---AGNSVQRPQEDGSR 456 (456) Q Consensus 433 ~~~~ee~~~~---~~~~~~~~~~d~~~ 456 (456) ++++++-... .+...++..++.+| T Consensus 463 ~ei~~~~~~~~~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 463 DAVADQPLPVATEVQGEIPQSAQQQEK 489 (489) T ss_pred HHHhhcCCCcccCCcccCCCCcccccC Confidence 4455442211 11112222223333 No 79 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.91 E-value=2.2e-23 Score=144.77 Aligned_cols=432 Identities=10% Similarity=-0.042 Sum_probs=250.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC-------cccccCcccch-hhhhhhhhhc-cChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDA-------PLPELTRNTSA-AWRSFQREAR-TNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~-------~i~~~~~~~~~-~~~~~~~k~~-~n~~~~iVd~~a~~l 71 (456) +|++--..=++.--..|....++++++++-|.|.. .++..++...+ .++..-.+.+ .|+++.+|+..++++ T Consensus 2 ~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~v 81 (491) T protein:vir:95 2 LTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSV 81 (491) T ss_pred cccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhchh Confidence 23322222244444556778888999999999843 12222222111 1332222333 699999999999999 Q ss_pred ccCCeecCCCCcccHHHHHHHHHHh-----cChhHHHHHHHHHHhhCCeEEEEEeeCCCC------------ceEEEEEc Q lcl|NC_021301. 
72 IPNGITVGGSADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCWRRDDG------------TATITADS 134 (456) Q Consensus 72 ~~~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg------------~~~i~~~~ 134 (456) |.+++++..+ ..+..++.+ ++++.+.+.+++.++.+|+|+++|-.+..+ +|.+..++ T Consensus 82 frk~p~~~~p------~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~ 155 (491) T protein:vir:95 82 MRKEPEINIP------KELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYT 155 (491) T ss_pred hcCCceeecc------HHHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEec Confidence 9999876422 224444443 678999999999999999999999776554 48899999 Q ss_pred cceeEEEE-eCCCCceEEEEEEE-EE----ecCC-----ceEEEEEEcC--CeEEEEEEeeeecccccceeeccCCCcee Q lcl|NC_021301. 135 PETMVVSV-DPLQPWRIRSAMRW-WR----DLDA-----ESDFAIVWSG--DGWQKFARPCFVQSSSRRRLVTRISDSWV 201 (456) Q Consensus 135 p~~~~~~~-d~~~~~~~~~~~~~-~~----~~d~-----~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (456) |++++-.- +...++..+-.+++ .. +.++ ....+.++.. ++.|....+..... +... ....+. T Consensus 156 ~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~--g~~~---~~~~~~ 230 (491) T protein:vir:95 156 TENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAE--GGAQ---EEVVEI 230 (491) T ss_pred hhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCC--Ccce---eeeeee Confidence 99986542 22333323322222 21 1111 1122233322 22222222111111 1000 001111 Q ss_pred ecccccccCceeEEEEccC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh Q lcl|NC_021301. 202 PVGDAVVTGSPPPVVVYQN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID 276 (456) Q Consensus 202 ~~~~~~~~~~~~pvv~~~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~ 276 (456) ......+.++++|++++.. 
.+ +.+-|.++-.|.-+.-+.-|+...++.+.++|++++.|.+.........+++ T Consensus 231 ~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~~~-- 308 (491) T protein:vir:95 231 YPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEANP-- 308 (491) T ss_pred eecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhccCc-- Confidence 2223346778899988742 22 3334666666666666788888888999999999999965432221111211 Q ss_pred hhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 277 YASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDR 356 (456) Q Consensus 277 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~ 356 (456) ..+..+.+..+.++.+++++.+......--.+.|+.+.+++... | ...+. ..+|.||++.+.....-.+....+ T Consensus 309 --~~i~~g~~~~~~lP~~~~~~~ie~~~~~~~~~~l~~~e~qm~~~-G--a~l~~-~~~~~Ta~~~~~~~~~~~S~L~~~ 382 (491) T protein:vir:95 309 --NGIKFGSRCGHNLGYGGSAQLIQAGENNLARQNMLDKEQQAIQI-G--AQLIT-PSQQITAESARIQRGADTSVMATI 382 (491) T ss_pred --ceeEecCcCCcCCCCCCccceeecCcchHHHHHHHHHHHHHHHH-H--HHhcc-CCcchhHHHHHHHHHHhhHHHHHH Confidence 12334455566777777766554332222244455555554442 1 12222 234678898888888888888888 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCcccceeEEecCC--CCcCHHHHHHHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHH Q lcl|NC_021301. 357 LSIAKIGLEAILVKALQIEGESVEDTVDVSFESP--DRVTLGEKYAAASLAKAAGESWASIRRNIL---NYNADQIKQDD 431 (456) Q Consensus 357 ~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~--~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e 431 (456) -..+..++.+++++++.+.|..++..+++.-++. ...-..+.++++.++.++|.+|++|.++.| ++.+.. 
.+.+ T Consensus 383 a~~~e~al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~~~~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~-~e~~ 461 (491) T protein:vir:95 383 ARNVSQAYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQPMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDWT-DEDI 461 (491) T ss_pred HHHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccccccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc-HHHH Confidence 8999999999999999999977665555443332 122235678889999999999999987654 454332 2334 Q ss_pred HHHHHHHHHH---------Hhhhhhhhccc Q lcl|NC_021301. 432 LDRAREQITL---------FAGNSVQRPQE 452 (456) Q Consensus 432 ~~~~~ee~~~---------~~~~~~~~~~~ 452 (456) .++++++... ...++++..+| T Consensus 462 ~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 462 LNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHHhcCCCCCccccccccchhhhhhccC Confidence 4555544211 11233443344 No 80 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.85 E-value=1.9e-20 Score=128.71 Aligned_cols=417 Identities=12% Similarity=0.024 Sum_probs=228.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCc-----ccch-----------hhhhh-hhhhc-cChHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTR-----NTSA-----------AWRSF-QREAR-TNWGLM 62 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~-----~~~~-----------~~~~~-~~k~~-~n~~~~ 62 (456) |.-+|+.-....+..+|..-+.-...-.+ ..|+.-++..+. ...+ .+... .++++ .|+++. T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d~g~~~~k-~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n~~~~ 92 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLDCVMDNIK-RKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVNIVNP 92 (488) T ss_pred ecccccCHHHHHHhhhhhHhhhhhhHHHH-HhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCchhHH Confidence 77677655555555555322111111011 122222222111 1110 11110 01233 599999 Q ss_pred HHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh-----cChhHHHHHHHHHHhhCCeEEEEEeeCCCC----------- Q lcl|NC_021301. 
63 VRDSVADRIIPNGITVGGSADSDLALRARRIWRD-----NRMDSVCKQWVKYGLDFGESYLTCWRRDDG----------- 126 (456) Q Consensus 63 iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~-----n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg----------- 126 (456) .++..++++|.+++++..+... .+..++.+ ++++.+.+.+++.++.+|+|+++|-..+.+ T Consensus 93 tl~~l~G~vfrk~p~~~~~~~~----~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~ 168 (488) T protein:vir:96 93 TMNAITGAVMRREPEFDTMDNP----VLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKK 168 (488) T ss_pred HHHHhcchhhccCceeccCCcH----HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcC Confidence 9999999999999887644222 24444443 678999999999999999999999876544 Q ss_pred ceEEEEEccceeEEEE-eCCCCceEEEEEEE---EEecCCce-----EEEEEEcCCeEEEEEEeeeecccccceeeccCC Q lcl|NC_021301. 127 TATITADSPETMVVSV-DPLQPWRIRSAMRW---WRDLDAES-----DFAIVWSGDGWQKFARPCFVQSSSRRRLVTRIS 197 (456) Q Consensus 127 ~~~i~~~~p~~~~~~~-d~~~~~~~~~~~~~---~~~~d~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (456) +|.+..++|++++-.- +...++..+-.+++ +.+.|+.. .+..+-..++.|...+... .... T Consensus 169 rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~----------~~~~ 238 (488) T protein:vir:96 169 LPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTD----------DEYS 238 (488) T ss_pred CcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEec----------CCcc Confidence 3889999999987543 33334333333332 22334321 1111111222222222111 1112 Q ss_pred CceeecccccccCceeEEEEccC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccccc Q lcl|NC_021301. 198 DSWVPVGDAVVTGSPPPVVVYQN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENG 272 (456) Q Consensus 198 ~~~~~~~~~~~~~~~~pvv~~~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~ 272 (456) ..+.+.....+.++++|++++.. .+ +.+-+.++-.|.-+.-+..|+...++.+..+|++++. ..+..+...... 
T Consensus 239 ~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~-~~~~~~~~~~~~ 317 (488) T protein:vir:96 239 DEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVD-MGDMNKTMASEM 317 (488) T ss_pred cceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeec-cCCCCccccccc Confidence 23444444566788999998842 22 3344666666666666777888888877778877653 322211111111 Q ss_pred chhhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHH Q lcl|NC_021301. 273 NAIDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFK 352 (456) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k 352 (456) .++ ....+ .......+.+++..++.....-..+.|+.+..++..... ..+. ..+|-||++.+.....-.+. T Consensus 318 ~~~----g~~~~-~~~~~~~~~g~~~~~e~~~~~l~~~~l~~l~~qm~~~Ga---~l~~-~~~~~Ta~~~~~~~~~~~S~ 388 (488) T protein:vir:96 318 NPL----GFTLA-GRMPYYVKNGDVKVIQAQFSPETENKVEKLFEQAVKVGA---SLFT-QQSNETATGAAIRSGSSTAS 388 (488) T ss_pred ccc----eeeec-ccccccccCCceeecCCchhHHHHHHHHHHHHHHHHHhH---hhcc-CCCcchHHHHHHHHHHhhHH Confidence 111 00010 011112234444433322212235667777777755321 1222 23456888888887787888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCc----ccceeEEecCC-CCc-CHHHHHHHHHHHHhcCCCcHHHHHHhC---CCC Q lcl|NC_021301. 353 CEDRLSIAKIGLEAILVKALQIEGESV----EDTVDVSFESP-DRV-TLGEKYAAASLAKAAGESWASIRRNIL---NYN 423 (456) Q Consensus 353 ~~~~~~~f~~~l~~~~~l~~~~~~~~~----~~~i~v~f~~~-~~~-~~~e~ad~~~kl~~~g~~s~~t~~~~~---~~~ 423 (456) ...+-..+..++.+++++++.+.|... +...++.-++. ... -..+.++++.++..+|.+|++|.++.+ |+. 
T Consensus 389 L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~gvl 468 (488) T protein:vir:96 389 MATLGNNVEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRARVV 468 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCcC Confidence 888889999999999999999988643 22344433332 122 235678899999999999999987654 554 Q ss_pred -hhHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 424 -ADQIKQDDLDRAREQITLF 442 (456) Q Consensus 424 -~~~~~~~e~~~~~ee~~~~ 442 (456) ++...+.+.++++++--.+ T Consensus 469 ~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 469 RGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred CccCCHHHHHHHHhhcCCCC Confidence 3322333455555432222 No 81 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.80 E-value=7.2e-20 Score=125.53 Aligned_cols=419 Identities=10% Similarity=0.094 Sum_probs=213.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhccc-Cccccc-----Ccc-cchhhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGD-APLPEL-----TRN-TSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~-~~i~~~-----~~~-~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) |..-..+.... +. .... .+...+..+=.|. ++.... +.. ....+.... ..+.+++++||..++.++. T Consensus 1 ~~~~~~a~~~~-~~-~~a~--~~~~~~~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY--~~~~l~r~iVd~~a~d~~r 74 (461) T protein:vir:80 1 MYSIDKAKQAK-ID-SKIV--NRNDFMVGHGKANSRDKLTRQTPGNGQKLDLKACENLY--ASNSIAMNIVDIISEDMVR 74 (461) T ss_pred Cccchhhhhhh-hh-hhhh--hhhHHHhhcCCcchhhhhhccccCcccccCHHHHHHHH--HhCCccchhhccchHHhhc Confidence 65554443111 11 1110 1111111111111 111100 110 111122222 2467889999999999999 Q ss_pred CCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCC---ceEEEEEcccee--EEEEeCCCCc Q lcl|NC_021301. 
74 NGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDG---TATITADSPETM--VVSVDPLQPW 148 (456) Q Consensus 74 ~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg---~~~i~~~~p~~~--~~~~d~~~~~ 148 (456) +|+.+.+. +.+..+.+.+.|++-++...+.++++++..||.|++++...+.. ......+.|... +...++.... T Consensus 75 ~g~~i~~~-~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~ 153 (461) T protein:vir:80 75 AGWSLKTD-NKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNREQADLSTAIDPKTIKSIPYINTFNTQ 153 (461) T ss_pred CCeeeecC-CHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCccccCccCCcccccccceeEEEecccc Confidence 99998764 55567778889988889999999999999999999988653221 111222222221 1111111111 Q ss_pred eEEEEEEEEEec----CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEE--c-cCC Q lcl|NC_021301. 149 RIRSAMRWWRDL----DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVV--Y-QNP 221 (456) Q Consensus 149 ~~~~~~~~~~~~----d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~--~-~n~ 221 (456) .+.. .....++ .|++.+..+-.......+. . ..... ......|..+++.+.. + ... T Consensus 154 ~i~~-~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~-------------~--~~~~~-~~~~~iH~SRii~~~~~~~~~~~ 216 (461) T protein:vir:80 154 KVTQ-LYLNQDMFSEHFGEVEFFEVNRVSQLGEEI-------------L--SGTTA-STSEQIHRSRIIHEQGLRFEGET 216 (461) T ss_pred ccch-hhhcccCcCcccccceEEEEeccccccccc-------------c--ccccC-ccceEEccccEEEecCCCCCccc Confidence 1110 0011111 1222211111000000000 0 00000 0001123333333211 1 223 Q ss_pred CCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec Q lcl|NC_021301. 222 DGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ 301 (456) Q Consensus 222 ~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~ 301 (456) +|.|.++++.+.+.+++++.-.....+.....+...+.+... 
-..+..+.............+ +..++.+.++.+++ T Consensus 217 ~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~--~~~~~~~~~~~~~~~~~~~~g-~~~~d~~e~~e~~~ 293 (461) T protein:vir:80 217 KGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDA--LNKDDKANLTAMLDFMFRTEA-LAIIKGDEQLTKES 293 (461) T ss_pred cCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHh--hhchHHHHHHHHHHHhcCCce-EEEEcCCcceEEEe Confidence 699999999999999998887655444333333222222211 011111222222333333334 44556666776665 Q ss_pred ccchHHHHHHHHHHHHHHHhhcCCChhhh-ccc-ccCcHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcC-- Q lcl|NC_021301. 302 TNDFTPMLSAIKEHIRQLSSATKTPLPML-MPD-SANQSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILVKALQIEG-- 376 (456) Q Consensus 302 ~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-~~~-~~N~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l~~~~~l~~~~~~-- 376 (456) +++++..+.++...++|++.++||..-| |.. ..|+||+.-. ......++.+| ..+.+.+++++.+++...+ T Consensus 294 -~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~---~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~ 369 (461) T protein:vir:80 294 -TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDV---MNYYARVSSIQENRLRPQLEYLTRLLMWASDDC 369 (461) T ss_pred -cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 4577888899999999999999999765 432 3456777533 23344555555 4578889999888765432 Q ss_pred ----CCcccceeEEecCCCCcCHHHHHH-------HHHHHHhcCCCcHHHHHHhC----CCChhH-HH--HHHHHHHHHH Q lcl|NC_021301. 377 ----ESVEDTVDVSFESPDRVTLGEKYA-------AASLAKAAGESWASIRRNIL----NYNADQ-IK--QDDLDRAREQ 438 (456) Q Consensus 377 ----~~~~~~i~v~f~~~~~~~~~e~ad-------~~~kl~~~g~~s~~t~~~~~----~~~~~~-~~--~~e~~~~~ee 438 (456) .++.+.+++.|++....+.+|.|+ ++.++.++|++|.+.+++.+ +.++.. .. 
..+.+.++ T Consensus 370 ~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~-- 447 (461) T protein:vir:80 370 GPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLA-- 447 (461) T ss_pred ccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhh-- Confidence 234467899999999999988754 57777788999987776533 333221 00 01111111 Q ss_pred HHHHhhhhhhhccccc Q lcl|NC_021301. 439 ITLFAGNSVQRPQEDG 454 (456) Q Consensus 439 ~~~~~~~~~~~~~~d~ 454 (456) ........+.+.+| T Consensus 448 --~~~~~~~~~e~~~g 461 (461) T protein:vir:80 448 --KLVYDAYAKKNADG 461 (461) T ss_pred --hhccccccccCCCC Confidence 12222333345555 No 82 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.76 E-value=8e-17 Score=108.82 Aligned_cols=432 Identities=12% Similarity=0.031 Sum_probs=231.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC--c--ccc-cCc--ccchh-------hhhhhhhh--ccChHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDA--P--LPE-LTR--NTSAA-------WRSFQREA--RTNWGLMVR 64 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~--~--i~~-~~~--~~~~~-------~~~~~~k~--~~n~~~~iV 64 (456) |..-.+.-+.. .. +.........||.|-. . +.. .+. ..... ++..-+-+ ..+|++-+| T Consensus 1 ~~~p~~~~~~~-----~~-~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av 74 (533) T protein:vir:34 1 MKTPTIPTLLG-----PD-GMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRGNARADDLVRNNGYAANAI 74 (533) T ss_pred CCCchhhhhhc-----cc-ccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 54444332211 11 1122245567776632 1 111 111 11111 11111222 356999999 Q ss_pred HHHHhhhccCCeecCCCC-----------cccHHHHHHHHHH---h-----------cChhHHHHHHHHHHhhCCeEEEE Q lcl|NC_021301. 
65 DSVADRIIPNGITVGGSA-----------DSDLALRARRIWR---D-----------NRMDSVCKQWVKYGLDFGESYLT 119 (456) Q Consensus 65 d~~a~~l~~~~~~~~~~~-----------d~~~~~~l~~~~~---~-----------n~~~~~~~~~~~~a~~~G~a~~~ 119 (456) +..+++++|.||+..... +.+..+.+...|. + .+|...+..+++..++.|.+|+. T Consensus 75 ~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~ 154 (533) T protein:vir:34 75 QLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQ 154 (533) T ss_pred HHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEE Confidence 999999999998875422 2222333433332 2 24677888899999999999998 Q ss_pred EeeCCCC----ceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeecc Q lcl|NC_021301. 120 CWRRDDG----TATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTR 195 (456) Q Consensus 120 v~~d~dg----~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (456) ...+..+ ..++..++|..+---++...+..+..+|.+ +..|...-+.++.... .......+.... T Consensus 155 ~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~--d~~Gr~~aY~i~~~~~---------~~~~~~~~~~~~ 223 (533) T protein:vir:34 155 ATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQI--NDSGAALGYYVSEDGY---------PGWMPQKWTWIP 223 (533) T ss_pred eeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEE--CCCCCeEEEEEeecCC---------CCccccccceee Confidence 7655443 357899999987644443334455565543 4455555444432110 000000000000 Q ss_pred CCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHH---HHHHHHHHHHHHHHhhchhhhhhcCCCccccc---- Q lcl|NC_021301. 196 ISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINR---INRAELQLLSTMAIQAFRQRALKSAGHGLPKV---- 268 (456) Q Consensus 196 ~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa---~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~---- 268 (456) ... -.+-..+.|.+.. .+.....|.|.|.+++..+.. |..+..........++ .+|+.-.+..... 
T Consensus 224 ~~~-~v~a~~VlH~f~~---~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a---~fi~~~~~~~~~~~~~~ 296 (533) T protein:vir:34 224 REL-PGGRASFIHVFEP---VEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYA---ATIESELDTQSAMDFIL 296 (533) T ss_pred eee-ccChhHeeeeccc---cCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhhe---eeeecCCCccccccccc Confidence 000 0000011111110 122345799999998765444 4444333333332222 2333111100000 Q ss_pred ----ccccchh----------hhhhhhhhhccceeccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhccc Q lcl|NC_021301. 269 ----DENGNAI----------DYASIFEAAPGALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPD 333 (456) Q Consensus 269 ----~~~~~~~----------~~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~ 333 (456) +.....+ .........+|.+..+.++.++..+++. +..+|.+.++.+++.|++..|+|-+.+.++ T Consensus 297 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D 376 (533) T protein:vir:34 297 GANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRN 376 (533) T ss_pred CCCcccccccccccchhhhhccCcceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhh Confidence 0000000 0011123567888899999988877655 346788888999999999999999999888 Q ss_pred ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH---HhcCCC--------ccc-----ceeEEecCC--CCcC Q lcl|NC_021301. 334 SANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA-ILVKAL---QIEGES--------VED-----TVDVSFESP--DRVT 394 (456) Q Consensus 334 ~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~~l~~---~~~~~~--------~~~-----~i~v~f~~~--~~~~ 394 (456) .++.|..+.++.+......++..|..|...+-+ +++..+ .+.|.- +.. 
-+.+.|..+ ...| T Consensus 377 ~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iD 456 (533) T protein:vir:34 377 YAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAID 456 (533) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccC Confidence 888899999999999999999988887765533 222211 123321 111 135788776 4678 Q ss_pred HHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH-HHHHHH-HHHHHHhhhhh---------h---hcccccCC Q lcl|NC_021301. 395 LGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD-DLDRAR-EQITLFAGNSV---------Q---RPQEDGSR 456 (456) Q Consensus 395 ~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~-e~~~~~-ee~~~~~~~~~---------~---~~~~d~~~ 456 (456) ....+++..+++.+|+.|.+.+....|.+++++.+. ..++.. ++......... + .++++++. T Consensus 457 P~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~~~~~~~~~~~~~~ 532 (533) T protein:vir:34 457 GLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGLRQSTEEEKSDSRA 532 (533) T ss_pred hHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCCCCCCCCCcccCCC Confidence 999999999999999999999988899998765432 111111 11111100000 0 11111111 No 83 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.75 E-value=2e-16 Score=106.64 Aligned_cols=424 Identities=13% Similarity=0.064 Sum_probs=229.8 Q ss_pred HHHHHHHHHH----HHHHHHHHHHHHHHhcccCcccc---cCc-ccc--------hhhhhhhhhh--ccChHHHHHHHHH Q lcl|NC_021301. 7 AEWLPVLTKR----IDDGMSRVRLLARYSNGDAPLPE---LTR-NTS--------AAWRSFQREA--RTNWGLMVRDSVA 68 (456) Q Consensus 7 ~~~~~~l~~~----~~~~~~r~~~~~~YY~g~~~i~~---~~~-~~~--------~~~~~~~~k~--~~n~~~~iVd~~a 68 (456) +.|+++++.- ...++.+.....+-|+|-..-.. .+. 
..+ ..++..-+-+ ..+|++.+|+..+ T Consensus 1 mn~~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 80 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFDKLE 80 (502) T ss_pred CchHhhHHhhcChHHHHHHHhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 3445544433 33333333333344666322110 011 111 1111111222 2469999999999 Q ss_pred hhhccC-CeecCCC---C----cccHHHHHHHHHH---h-------cChhHHHHHHHHHHhhCCeEEEEEeeCCCCc--- Q lcl|NC_021301. 69 DRIIPN-GITVGGS---A----DSDLALRARRIWR---D-------NRMDSVCKQWVKYGLDFGESYLTCWRRDDGT--- 127 (456) Q Consensus 69 ~~l~~~-~~~~~~~---~----d~~~~~~l~~~~~---~-------n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~--- 127 (456) ++++|. |+++... . +.+..+.+.+.|. + .+|...+..+++..++.|.+|+.+..++.+. T Consensus 81 ~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~~~~ 160 (502) T protein:vir:79 81 ERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINSLTP 160 (502) T ss_pred HhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCccCC Confidence 999996 6653221 1 1233444444444 2 4688888999999999999999886655432 Q ss_pred -----eEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceee Q lcl|NC_021301. 128 -----ATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVP 202 (456) Q Consensus 128 -----~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (456) .++..++|..+-.-++ .+..+..+|.+ +..|....+.++...- .......... 
++ T Consensus 161 g~~~~l~lq~iepd~l~~~~~--~~~~i~~GVe~--d~~Gr~~aY~i~~~hP--------gd~~~~~~~r--------vp 220 (502) T protein:vir:79 161 SAGVHFWLEALEPDFIPMTSD--ESNRLNQGVFV--DDWGRPEKYLVYKSRP--------VSGRQMETKE--------VD 220 (502) T ss_pred CcccceEEEEecchhcCCCCC--CCCeeEeeeEE--CCCCceEEEEEeecCC--------CCCcccceeE--------ec Confidence 4799999998742222 23445555543 4555555443332110 0000000000 00 Q ss_pred cccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhh Q lcl|NC_021301. 203 VGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFE 282 (456) Q Consensus 203 ~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~ 282 (456) -..+.|.+.. .+.....|.|.|.+++..+..++....-........+.--.+|+.-.++.......+... ...... T Consensus 221 A~~vlH~f~~---~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~-~~~~~~ 296 (502) T protein:vir:79 221 AERMLHLKFV---RRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKGDGQSYEPDGNGSKE-NERELT 296 (502) T ss_pred hhheEEeecc---cCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCcccccccCCCCC-cccccc Confidence 0111111110 112335699999998876655554433322333333322234443222211111111111 112233 Q ss_pred hhcccee-ccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 283 AAPGALW-ELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) Q Consensus 283 ~~~~~~~-~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f 360 (456) ..+|.++ .++++.++..+++. 
+..+|...++...+.|++..|+|-+.+.++.+ .|..++++.+......+...|..| T Consensus 297 l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s-~nySs~R~~~~e~~r~~~~~q~~~ 375 (502) T protein:vir:79 297 IQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN-GTYSAQRQELVESTDGYLILQDWF 375 (502) T ss_pred ccCCccccccCCCceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-chHHHHHHHHHHHHHHHHHHHHHH Confidence 4567664 57888888777654 44678888999999999999999999988864 488899999999999999988888 Q ss_pred HHHHHH-HHHHHH---HhcCCC------c-ccceeEEecCC--CCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHH Q lcl|NC_021301. 361 KIGLEA-ILVKAL---QIEGES------V-EDTVDVSFESP--DRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQI 427 (456) Q Consensus 361 ~~~l~~-~~~l~~---~~~~~~------~-~~~i~v~f~~~--~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~ 427 (456) ...+-+ +++..+ .+.|.- + ..-+.+.|..+ ...|....+++..+++.+|+.|++......|.+++++ T Consensus 376 ~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v 455 (502) T protein:vir:79 376 IGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAGGRNPDDV 455 (502) T ss_pred HHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHH Confidence 765544 333222 123321 1 11246778766 4568999999999999999999999888899998765 Q ss_pred HHHHHHHHHHHHHHHhhhhhh------------h----c-ccccCC Q lcl|NC_021301. 428 KQDDLDRAREQITLFAGNSVQ------------R----P-QEDGSR 456 (456) Q Consensus 428 ~~~e~~~~~ee~~~~~~~~~~------------~----~-~~d~~~ 456 (456) -+..+ +..+.++..+-.... . 
+ +.++++ T Consensus 456 ~~q~a-~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~ 500 (502) T protein:vir:79 456 KRRRK-AEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQS 500 (502) T ss_pred HHHHH-HHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 43211 111111111111000 0 0 001111 No 84 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.75 E-value=3e-17 Score=111.18 Aligned_cols=435 Identities=13% Similarity=0.042 Sum_probs=199.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMS-------RVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~-------r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~~iVd~~a~~l~ 72 (456) ++.....+++.+|...+..... ...+-.+||.|.+ .+......++.. ...++.|.++.+|+...++.. T Consensus 38 ~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~~~i~~v~g~~~ 113 (776) T protein:vir:93 38 LDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQ----WSQDEIDELKERGQAPTVYNVISQSVNWIIGSEK 113 (776) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCC----CCHHHHHHHHhcCCceEEecchHHHHHHHHHHHH Confidence 4444444566666665433322 2234468999985 222111112111 124778999999999999887 Q ss_pred cCC--eecC--CCCcccHHHH----HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC--CCc-eEEEEEccceeEEE Q lcl|NC_021301. 73 PNG--ITVG--GSADSDLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD--DGT-ATITADSPETMVVS 141 (456) Q Consensus 73 ~~~--~~~~--~~~d~~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~--dg~-~~i~~~~p~~~~~~ 141 (456) .+. +.+. ...|.+..+. +..++..|+++..++.+..+++++|.+|+-|+.+. ++. 
+++.+++|.+++ T Consensus 114 ~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~~~~p~~i~-- 191 (776) T protein:vir:93 114 RGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAGAESWRNIL-- 191 (776) T ss_pred hCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEeeccChhhee-- Confidence 653 2222 2233333333 34566779999999999999999999998887654 344 455677888855 Q ss_pred EeCCCCc------eEEEEEEEEEec----------------------------CCceEEEE--------------EE--- Q lcl|NC_021301. 142 VDPLQPW------RIRSAMRWWRDL----------------------------DAESDFAI--------------VW--- 170 (456) Q Consensus 142 ~d~~~~~------~~~~~~~~~~~~----------------------------d~~~~~~~--------------~~--- 170 (456) ||+.... +.+. .+.|.+. ++...... .| T Consensus 192 ~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (776) T protein:vir:93 192 WDSTYRRLDMDDCRYIF-RVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAGAVAY 270 (776) T ss_pred eccccccCCHHHHhhhh-hhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccccccccccc Confidence 5553321 1111 1111100 00000000 00 Q ss_pred cCCeEEEEEEeeeec-----------c-cc-----------------c------------ceeeccCCCceeeccccccc Q lcl|NC_021301. 171 SGDGWQKFARPCFVQ-----------S-SS-----------------R------------RRLVTRISDSWVPVGDAVVT 209 (456) Q Consensus 171 ~~~~~~~~~~~~~~~-----------~-~~-----------------~------------~~~~~~~~~~~~~~~~~~~~ 209 (456) ..+.+ +....++.. . .. + ++.. ..++..+..+..+.. T Consensus 271 ~~~~v-~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~-~~g~~~l~~~~~p~~ 348 (776) T protein:vir:93 271 ARKRV-RMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAI-MTTRDLMWAGPSPYR 348 (776) T ss_pred CCCeE-EEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEE-EecchhhhccCCCCC Confidence 00010 000000000 0 00 0 0000 011111122222333 Q ss_pred CceeEEEEc------cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhh Q lcl|NC_021301. 
210 GSPPPVVVY------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEA 283 (456) Q Consensus 210 ~~~~pvv~~------~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~ 283 (456) ++.+|+|++ ..-+|.|.+..+++.++.+|..+|.+...+- +.+..+-.|.... . +....-.. T Consensus 349 ~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav~~---~-------d~~~~~~~ 416 (776) T protein:vir:93 349 HNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAVDD---I-------DEFRREAA 416 (776) T ss_pred CCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc--CCceeeccccccc---h-------HHHHHhcc Confidence 345555543 2235789999999999999999998766542 2222222222110 0 00101112 Q ss_pred hccceeccCCCc--eeE-eecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 284 APGALWELPPGV--DIW-ESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIA 360 (456) Q Consensus 284 ~~~~~~~~~~d~--~~~-~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f 360 (456) .++.++..++++ .+. +....-...+...+......|-.+||+.+..+|..+++.||+|+......-........+.| T Consensus 417 rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~ 496 (776) T protein:vir:93 417 RPDAVMTVKNGKLGAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNL 496 (776) T ss_pred cCCceeeeCCccccccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Confidence 345555554433 221 11222346677888888888999999999999987766899999887777666676776777 Q ss_pred HHHHHHHHHHHHHh----cCCCcccce--------eEEecCC-----------------CCcC---HHHHHHHHHHHHhc Q lcl|NC_021301. 361 KIGLEAILVKALQI----EGESVEDTV--------DVSFESP-----------------DRVT---LGEKYAAASLAKAA 408 (456) Q Consensus 361 ~~~l~~~~~l~~~~----~~~~~~~~i--------~v~f~~~-----------------~~~~---~~e~ad~~~kl~~~ 408 (456) ..++++++++++.+ .+..-...| -|.+++. -|.+ ..+..+.+..+.+. 
T Consensus 497 ~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~ 576 (776) T protein:vir:93 497 RLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGK 576 (776) T ss_pred HHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhh Confidence 77777666655432 110000000 0111111 1111 22233333333321 Q ss_pred CCC--cH---HHHHHhCCCCh-hHH-HHHH---------------HHHHHHHHHHHhhhhhh--------hcccccCC Q lcl|NC_021301. 409 GES--WA---SIRRNILNYNA-DQI-KQDD---------------LDRAREQITLFAGNSVQ--------RPQEDGSR 456 (456) Q Consensus 409 g~~--s~---~t~~~~~~~~~-~~~-~~~e---------------~~~~~ee~~~~~~~~~~--------~~~~d~~~ 456 (456) .-. .. ..+++..++-. +++ ++++ .+...++.......... ..+....+ T Consensus 577 ~~p~~~~~~~~~~~e~~d~p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~qa~a~~ 654 (776) T protein:vir:93 577 MPPEIALTMLDLLVENMDIPNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQAKARK 654 (776) T ss_pred cChhhHHHHHHHHHHhcCccchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhhHHHHH Confidence 000 00 11122222210 011 0000 00000000000000000 00001111 No 85 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=99.74 E-value=1e-16 Score=108.30 Aligned_cols=428 Identities=13% Similarity=0.106 Sum_probs=230.1 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCc---ccccCcccch--------hhhhhhhhh--ccChHHHHHHHHH Q lcl|NC_021301. 2 TASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAP---LPELTRNTSA--------AWRSFQREA--RTNWGLMVRDSVA 68 (456) Q Consensus 2 ~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~---i~~~~~~~~~--------~~~~~~~k~--~~n~~~~iVd~~a 68 (456) -+.||.-++- +...... ....+-|+|-.. 
....+...++ .++..-+-+ ..+|++-+|+..+ T Consensus 1 m~~~~~~~~a-~~~~~~~-----~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~~ 74 (495) T protein:vir:10 1 MNMTPSGYQS-LASGLLV-----PVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATWV 74 (495) T ss_pred CCcccccccc-cchhhhh-----HHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 1222222211 1111111 111223444111 1111111111 111111222 2569999999999 Q ss_pred hhhccCCeecCCC-CcccHHHHHHHHH---Hh-------cChhHHHHHHHHHHhhCCeEEEEEeeC--CCC---ceEEEE Q lcl|NC_021301. 69 DRIIPNGITVGGS-ADSDLALRARRIW---RD-------NRMDSVCKQWVKYGLDFGESYLTCWRR--DDG---TATITA 132 (456) Q Consensus 69 ~~l~~~~~~~~~~-~d~~~~~~l~~~~---~~-------n~~~~~~~~~~~~a~~~G~a~~~v~~d--~dg---~~~i~~ 132 (456) ++++|.||+.... .+.+..+.+.+.| .+ .+|...+..+++..++.|.||+.+... .+| ..++.. T Consensus 75 ~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lql 154 (495) T protein:vir:10 75 AAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQI 154 (495) T ss_pred HhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEEE Confidence 9999999987543 3434444444444 33 357788899999999999999876543 333 258999 Q ss_pred EccceeE-EEEeC--CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeeccccccc Q lcl|NC_021301. 133 DSPETMV-VSVDP--LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVT 209 (456) Q Consensus 133 ~~p~~~~-~~~d~--~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (456) ++|..+- +.-+. ..+..+..+|.+ +..|....+.++...-...+. .... ..-..++-..+.|. 
T Consensus 155 iepd~l~~~~~~~~~~~g~~i~~GIe~--d~~Gr~vaY~i~~~hpgd~~~-------~~~~-----~~~~rvpA~~vlH~ 220 (495) T protein:vir:10 155 IEPDMLASDIPDETLPSGGYVKGGIRF--SNGGKRKAYCFYRNHPAESSL-------IGDP-----VDTVWIKAEHVLHV 220 (495) T ss_pred echhhcCCCCCCCCCCCCCEEEeceEE--CCCCceEEEEEeecCCCcccc-------cccc-----cceeeechhheEec Confidence 9999975 33222 233456666653 455655544443221100000 0000 00011122223344 Q ss_pred CceeEEEEccCCCCCCcHhHHHHH--HHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc---ccchhhhhhhhhhh Q lcl|NC_021301. 210 GSPPPVVVYQNPDGMGEVEPHIDI--INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE---NGNAIDYASIFEAA 284 (456) Q Consensus 210 ~~~~pvv~~~n~~g~s~~~~v~~l--iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~---~~~~~~~~~~~~~~ 284 (456) +. .+.....|.|.+.+++.| ++.|..+..........+ -.+|+.-.++....+. .+..-......... T Consensus 221 f~----~r~gQ~RGis~la~i~~l~~l~~y~dael~~a~i~A~~---~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 293 (495) T protein:vir:10 221 TV----LTVRSDAGAPWFQLLLRLNELDQYEDAELVRKKTAALF---AAFIQEATADSTGGPTIGQPKRSKGGKRITGLN 293 (495) T ss_pred cc----cCCCcccCcchhHHHHHHHHhhHHHHHHHHHHHHhhhh---eeeeecCCCccccccccCccccccCcccceecC Confidence 31 133556788988877664 344444444333332222 1233322211111110 01100111123356 Q ss_pred ccceeccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_021301. 285 PGALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLS-IAKI 362 (456) Q Consensus 285 ~~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~-~f~~ 362 (456) +|.+..+.++.++..+++. +..+|.+.++.+++.|++..|+|-+.+.++.++.|..++++.+......++..|. .|.. 
T Consensus 294 pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~ 373 (495) T protein:vir:10 294 PGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEFRRLCQQVQHHMIIH 373 (495) T ss_pred CceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7888889999988877765 4567888899999999999999999998888888888999999999888887665 4555 Q ss_pred HH-HHHHHHHH---HhcCC---Ccccc-----eeEEecCC--CCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH Q lcl|NC_021301. 363 GL-EAILVKAL---QIEGE---SVEDT-----VDVSFESP--DRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK 428 (456) Q Consensus 363 ~l-~~~~~l~~---~~~~~---~~~~~-----i~v~f~~~--~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~ 428 (456) .+ +-+++..+ .+.|. +++.. +.+.|..+ ...|....+++..+++.+|+.|.+......|.+++++- T Consensus 374 ~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~ 453 (495) T protein:vir:10 374 QFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAERGYDMEELF 453 (495) T ss_pred HHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHH Confidence 44 33433222 12332 11111 46778776 45699999999999999999999999888999988654 Q ss_pred HHH-HHHH-HHHHHHHhh---------hh---hhhcccccCC Q lcl|NC_021301. 429 QDD-LDRA-REQITLFAG---------NS---VQRPQEDGSR 456 (456) Q Consensus 429 ~~e-~~~~-~ee~~~~~~---------~~---~~~~~~d~~~ 456 (456) +.. .++. .++....-. 
.+ ...+.++.++ T Consensus 454 ~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 454 DMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 321 1111 111111100 00 1111112222 No 86 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.74 E-value=2.4e-16 Score=106.24 Aligned_cols=432 Identities=11% Similarity=0.070 Sum_probs=236.2 Q ss_pred CCCCC-HHHHHHHHHHH-HHHHHHHHHHHHHHhcccCcccc------cCc-cc-chh-------hhhhhhhh--ccChHH Q lcl|NC_021301. 1 MTAST-PAEWLPVLTKR-IDDGMSRVRLLARYSNGDAPLPE------LTR-NT-SAA-------WRSFQREA--RTNWGL 61 (456) Q Consensus 1 ~~~~t-~~~~~~~l~~~-~~~~~~r~~~~~~YY~g~~~i~~------~~~-~~-~~~-------~~~~~~k~--~~n~~~ 61 (456) |--.- -..++++++.- ...+......-.+.|+|-..-.. .+. .. ..+ ++..-+-+ .++|++ T Consensus 1 ~~r~~~~~~~~dr~i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~ 80 (505) T protein:vir:96 1 MKRAEKKPSLAQRMVNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAK 80 (505) T ss_pred CCCCccccchhhcccchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHH Confidence 32111 11233333321 11122222333456665321110 111 11 111 12111222 356999 Q ss_pred HHHHHHHhhhcc-CCeecCCC-------CcccHHHHHHHHHHh------------cChhHHHHHHHHHHhhCCeEEEEEe Q lcl|NC_021301. 62 MVRDSVADRIIP-NGITVGGS-------ADSDLALRARRIWRD------------NRMDSVCKQWVKYGLDFGESYLTCW 121 (456) Q Consensus 62 ~iVd~~a~~l~~-~~~~~~~~-------~d~~~~~~l~~~~~~------------n~~~~~~~~~~~~a~~~G~a~~~v~ 121 (456) -+|+..+.+++| .|++.... .+++..+.+...|.. .+|...+..+++..++.|.||+... 
T Consensus 81 ~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~ 160 (505) T protein:vir:96 81 RFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREH 160 (505) T ss_pred HHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEe Confidence 999999999999 68776432 234445555444432 2367778889999999999999876 Q ss_pred eCCCCc--eEEEEEccceeEEEEeC--CCCceEEEEEEEEEecCCceEEEEEEcCC--eEEEEEEeeeecccccceeecc Q lcl|NC_021301. 122 RRDDGT--ATITADSPETMVVSVDP--LQPWRIRSAMRWWRDLDAESDFAIVWSGD--GWQKFARPCFVQSSSRRRLVTR 195 (456) Q Consensus 122 ~d~dg~--~~i~~~~p~~~~~~~d~--~~~~~~~~~~~~~~~~d~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 195 (456) ....+. .++..++|..+-.-++. ..+..+..+|.+ +..|....+.++..+ ..+... .. T Consensus 161 ~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GIe~--d~~Gr~~aY~i~~~hPgd~~~~~--------------~~ 224 (505) T protein:vir:96 161 RGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSIEL--DAWERPVAYHLLVNHPGDNSYCY--------------HY 224 (505) T ss_pred ecCCCCcceEEEEechhhcCCCCCcccCCcCeEEeceEE--CCCCceEEEEEeecCCCcccccc--------------cc Confidence 654432 58999999987533321 223445556643 555666554444321 000000 00 Q ss_pred CCCce--eecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcc--cccccc Q lcl|NC_021301. 196 ISDSW--VPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGL--PKVDEN 271 (456) Q Consensus 196 ~~~~~--~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~--~~~~~~ 271 (456) ....+ ++-..+-|.+.. .+.....|.|.|.+++..+..++....-........+.--.+|+.-.+.. ...+.. 
T Consensus 225 ~~~~~~rvpa~~vlH~f~~---~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~ 301 (505) T protein:vir:96 225 AGQTYERVPADEIIHTFVP---WRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQPPEDDQ 301 (505) T ss_pred ccccccccCHhHhhhhhcc---cCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCcccccc Confidence 00001 011111122110 12234579999999877655544333332233333332223444322111 111222 Q ss_pred cchhhhhhhhhhhccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 272 GNAIDYASIFEAAPGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) +.. .....+|.+..++++.++..+++.. ..+|.+..+.+++.|++..|+|-+.+.++.++.|..|.++.+.... T Consensus 302 ~~~-----~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~~~e~~ 376 (505) T protein:vir:96 302 GEI-----VEEVEAGTYQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSGELDER 376 (505) T ss_pred Ccc-----ccccCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHH Confidence 222 2345578888999999988887653 5678888999999999999999999988888888999999999999 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHH---HhcCC-C-----cccceeEEecCCC--CcCHHHHHHHHHHHHhcCCCcHHHHHH Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEA-ILVKAL---QIEGE-S-----VEDTVDVSFESPD--RVTLGEKYAAASLAKAAGESWASIRRN 418 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~-~~~l~~---~~~~~-~-----~~~~i~v~f~~~~--~~~~~e~ad~~~kl~~~g~~s~~t~~~ 418 (456) ..++..|..|...+-+ +++..+ .+.|. . .+.-+.+.|..+- ..|....+++..+++.+|+.|.+.... 
T Consensus 377 r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a 456 (505) T protein:vir:96 377 DLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIR 456 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHH Confidence 9999998888765443 333222 12232 1 1112467787764 568999999999999999999999888 Q ss_pred hCCCChhHHHHH-HHHHHH-HHHHHHhhh------hhhhcccccCC Q lcl|NC_021301. 419 ILNYNADQIKQD-DLDRAR-EQITLFAGN------SVQRPQEDGSR 456 (456) Q Consensus 419 ~~~~~~~~~~~~-e~~~~~-ee~~~~~~~------~~~~~~~d~~~ 456 (456) ..|.+++++-+. ..++.. ++....... ....++++.+- T Consensus 457 ~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~ 502 (505) T protein:vir:96 457 AAGDDPEDVFDEIAWEEQLMRDKGVNPTPPEQESKDATTDEEDDSA 502 (505) T ss_pred HcCCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCC Confidence 899998765432 111111 121111000 00011111111 No 87 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.74 E-value=4.6e-17 Score=110.16 Aligned_cols=422 Identities=12% Similarity=0.079 Sum_probs=229.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCc----ccc-cCc--ccc-------hhhhhhhhhh--ccChHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAP----LPE-LTR--NTS-------AAWRSFQREA--RTNWGLMVR 64 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~----i~~-~~~--~~~-------~~~~~~~~k~--~~n~~~~iV 64 (456) |.. |. .+. .. .+.-......||.|-.. +.. .+. ... 
..++..-+-+ ..+|++.+| T Consensus 1 ~~~--~~----~~~--~~-~~~~~~~~~~~~~~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rNn~~a~~av 71 (530) T protein:vir:38 1 MKI--PS----LVG--PD-GKTSLREYAGYHGGGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRNNGYAANAV 71 (530) T ss_pred Ccc--ce----eec--Cc-cccchHHHhhhhcccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhcChHHHHHH Confidence 211 00 000 00 11113445567765321 111 111 111 1112111222 356999999 Q ss_pred HHHHhhhccCCeecCCC-----------CcccHHHHHHHHHH---h-----------cChhHHHHHHHHHHhhCCeEEEE Q lcl|NC_021301. 65 DSVADRIIPNGITVGGS-----------ADSDLALRARRIWR---D-----------NRMDSVCKQWVKYGLDFGESYLT 119 (456) Q Consensus 65 d~~a~~l~~~~~~~~~~-----------~d~~~~~~l~~~~~---~-----------n~~~~~~~~~~~~a~~~G~a~~~ 119 (456) +..+.+++|.||+.... .+.+..+.+.+.|. + .+|...+..+.+..++.|.||+. T Consensus 72 ~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~ 151 (530) T protein:vir:38 72 QLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQ 151 (530) T ss_pred HHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEEE Confidence 99999999999876432 12223344444443 2 24678888899999999999998 Q ss_pred EeeCCC-C---ceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeecc Q lcl|NC_021301. 120 CWRRDD-G---TATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTR 195 (456) Q Consensus 120 v~~d~d-g---~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (456) ...+.+ | ..++..++|..+---++...+..+..+|.+ +..|....+.++...- ..... 
T Consensus 152 ~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~--d~~Gr~~aY~i~~~~~---------~~~~~------- 213 (530) T protein:vir:38 152 ATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKI--NDSGAALGYYVSDDGY---------PGWMA------- 213 (530) T ss_pred eeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEE--CCCCceEEEEEeeccC---------CCccc------- Confidence 765443 3 257999999987544433334456666643 4455554443332100 00000 Q ss_pred CCCceeecccccccCceeEEEEc------cCCCCCCcHhHHHHHHHHHH---HHHHHHHHHHHHhhchhhhhhcCCCccc Q lcl|NC_021301. 196 ISDSWVPVGDAVVTGSPPPVVVY------QNPDGMGEVEPHIDIINRIN---RAELQLLSTMAIQAFRQRALKSAGHGLP 266 (456) Q Consensus 196 ~~~~~~~~~~~~~~~~~~pvv~~------~n~~g~s~~~~v~~liDa~~---~~~s~~~~~~~~~~~~~~~i~g~~~~~~ 266 (456) ..|..... ....+.+-|+|+ ....|.|.|.+++..+..++ .+..........++ .+|+...+... T Consensus 214 --~~~~~~~~-~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a---~fi~~~~~~~~ 287 (530) T protein:vir:38 214 --QNWTYIPR-ELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYA---ATIESELDTQS 287 (530) T ss_pred --cccceeee-eeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhhe---eeeeccCCccc Confidence 00000000 000111123333 33468899999876554444 33333333222221 23332111100 Q ss_pred cc--------c-c-------ccch--hhhhhhhhhhccceeccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCCh Q lcl|NC_021301. 267 KV--------D-E-------NGNA--IDYASIFEAAPGALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPL 327 (456) Q Consensus 267 ~~--------~-~-------~~~~--~~~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~ 327 (456) .. + . .+.. ..........+|.+..+.++.++..+++. 
+..+|.+.++.+++.|++..|+|- T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~y 367 (530) T protein:vir:38 288 AMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSY 367 (530) T ss_pred cccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCH Confidence 00 0 0 0000 00111234567888889999888877655 346788888999999999999999 Q ss_pred hhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH---HhcCCC--------ccc-----ceeEEecCC Q lcl|NC_021301. 328 PMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA-ILVKAL---QIEGES--------VED-----TVDVSFESP 390 (456) Q Consensus 328 ~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~~l~~---~~~~~~--------~~~-----~i~v~f~~~ 390 (456) +.+.++.++.|..+.+..+......+...|..|...+-+ +++.-+ .+.|.- ++. -..+.|..+ T Consensus 368 e~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p 447 (530) T protein:vir:38 368 EQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGS 447 (530) T ss_pred HHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecC Confidence 999888888888899999999999999988888765433 322211 122311 111 135778766 Q ss_pred --CCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH---HHHHHHHHHHHHhh-----------hhhhhccccc Q lcl|NC_021301. 391 --DRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD---DLDRAREQITLFAG-----------NSVQRPQEDG 454 (456) Q Consensus 391 --~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~---e~~~~~ee~~~~~~-----------~~~~~~~~d~ 454 (456) ...|....+++...++.+|+.|.+.+....|.+++++.+. |.+++++ ...... ...+..++|| T Consensus 448 ~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~e~~~~~~-~Gl~~~~~~~~~~~~~~~~~~~~~~d~ 526 (530) T protein:vir:38 448 GRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRGDDYQEIFAQQVRESMERRA-AGLNPPAWAAAAFEAGVKKSNEEEQDG 526 (530) T ss_pred CccccChHHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHHHHH-cCCCCCCCcccccCCCCCCCCCCCCCC Confidence 4668899999999999999999999888899998765432 1122211 111000 1111112233 Q ss_pred CC Q lcl|NC_021301. 
455 SR 456 (456) Q Consensus 455 ~~ 456 (456) .+ T Consensus 527 ~~ 528 (530) T protein:vir:38 527 AR 528 (530) T ss_pred CC Confidence 33 No 88 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.72 E-value=1.1e-17 Score=113.53 Aligned_cols=394 Identities=10% Similarity=-0.003 Sum_probs=196.7 Q ss_pred HHHHHHHHHHhccc---Cccc----ccCcc-cchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC-cccHHHHHH Q lcl|NC_021301. 21 MSRVRLLARYSNGD---APLP----ELTRN-TSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA-DSDLALRAR 91 (456) Q Consensus 21 ~~r~~~~~~YY~g~---~~i~----~~~~~-~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~-d~~~~~~l~ 91 (456) ....+-|...--|- ++-. ..+.. ....+.... ..+.+++++||..+.-++.+|+.+.++. +.+..+.++ T Consensus 1 ~~~~D~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~l~a~Y--~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~~~~~ 78 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQEQTYYSPSLSLTDDLVQLEALW--RDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQLDLFT 78 (437) T ss_pred CchhhhhHhHHhcCCCccccceeecCccccccHHHHHHHH--HhCchhhHHhhcchHHhhcCCceEecCCCCHHHHHHHH Confidence 22222222222110 0000 00000 011122221 2467889999999999999999986642 334446788 Q ss_pred HHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---------Cce-EEEEEccceeEEEE-eCCCCceEEEEEEEEEec Q lcl|NC_021301. 92 RIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---------GTA-TITADSPETMVVSV-DPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 92 ~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---------g~~-~i~~~~p~~~~~~~-d~~~~~~~~~~~~~~~~~ 160 (456) +.|++=++...+.++.+++..||.|++++-.+.. |.+ .+.++++.++.|.. ...++... . 
T Consensus 79 ~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~---------~ 149 (437) T protein:vir:52 79 KFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSP---------N 149 (437) T ss_pred HHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhcccccccccccccc---------c Confidence 8888878899999999999999999999877542 222 24555555444321 11111000 0 Q ss_pred CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHH Q lcl|NC_021301. 161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRA 240 (456) Q Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~ 240 (456) .|++.+..+-.....+. +......++.+.++-..-.+-+|.|.++.+.+-+.+++++ T Consensus 150 fg~p~~y~v~~~~~~~~-----------------------iH~SRii~~~~~~~~~~~~~~~G~s~le~~~~~i~~~~~~ 206 (437) T protein:vir:52 150 FGRYSEYSILGGSQSIT-----------------------VHHSRLIILNANDAPLSDNDIWGVSDLEKIIDVLKRFDSA 206 (437) T ss_pred cCcceEEEEecCCccee-----------------------EccceeEEecCccCCCccccccCCchHHHHHHHHHHHHHH Confidence 12222211111000000 0000111111111001123457999999988888888877 Q ss_pred HHHHHHHHHHhhchhhhhhcCCCcccccccc-cch----hhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHH Q lcl|NC_021301. 241 ELQLLSTMAIQAFRQRALKSAGHGLPKVDEN-GNA----IDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEH 315 (456) Q Consensus 241 ~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~ 315 (456) .-...........+..-+.|+.. .+... ... .......... +.+..++.+.++.+++ .++++..+.+... 
T Consensus 207 ~~~~~~l~~~~~~~v~k~~~l~~---~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~d~~~~~e~~~-~~~sgl~~~l~~~ 281 (437) T protein:vir:52 207 SVNVGDLIFESKIDIFKIAGLSD---KIAAGMENEVASVISAVQEIKSA-TNSLLLDAENEYDRKE-LTFTGLKDLLTEF 281 (437) T ss_pred HHHHHHHHHHcCCCceecchHHH---HhcCCcHHHHHHHHHHHHHhcCC-CceEEEcCCcceEEEe-cCcCCHHHHHHHH Confidence 66544433333322222222211 11111 111 1111122222 2344555666676664 3466777888888 Q ss_pred HHHHHhhcCCChhhhccc-ccC-cHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCC Q lcl|NC_021301. 316 IRQLSSATKTPLPMLMPD-SAN-QSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILVKALQIEGESVEDTVDVSFESPDR 392 (456) Q Consensus 316 ~~~i~~~~~~p~~~~~~~-~~N-~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~ 392 (456) ..+|++.++||...|.+. .+. +||+.-... ....++.+| ..+.+.+++++.+++....-..+.++++.|++-.. T Consensus 282 ~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~---yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~~~~~f~pL~~ 358 (437) T protein:vir:52 282 RNAVAGAADMPVTILFGQSVSGLASGDEDIQN---YHEAIRRLQETRLRPIFEIIDPLICNELFGGLPADWWFEFVPLTT 358 (437) T ss_pred HHHHHHHhcCchhhhcCcCcccccccHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcceEEeCCcCC Confidence 999999999999776433 222 355543333 334455555 45788899999887655433344568899999888 Q ss_pred cCHHHHHH-------HHHHHHhcCCCcHHHHHHhC---CC---Chh-HHHHHHH-HHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 393 VTLGEKYA-------AASLAKAAGESWASIRRNIL---NY---NAD-QIKQDDL-DRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 393 ~~~~e~ad-------~~~kl~~~g~~s~~t~~~~~---~~---~~~-~~~~~e~-~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .+.++.|+ +..++.++|+++...+++.| |. 
.++ +++.++- +...+............+++.+.+ T Consensus 359 ~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 359 VKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANISAEHIEELKNADEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred cCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCccccccccCCCCCCCccCCCCCCCCCCCCCCCCC Confidence 88776654 46677788988876666543 11 111 1111100 000000000001111111112222 No 89 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.71 E-value=1.7e-15 Score=101.51 Aligned_cols=440 Identities=11% Similarity=0.018 Sum_probs=226.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH-HHHHHhcccCccc-ccCcccchhhhhhhhhh--ccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVR-LLARYSNGDAPLP-ELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~-~~~~YY~g~~~i~-~~~~~~~~~~~~~~~k~--~~n~~~~iVd~~a~~l~~~~~ 76 (456) .++-.+..-...-...|.. ..+.. ....|.-...... .+. .....++..-+-+ ..+|++-+|+..+.+++|.|| T Consensus 12 ~a~~~~~~~~~~~~~~y~g-A~~~~r~~~~w~~~~~s~~~~~~-~~~~~lr~RaRdL~rNn~~a~~av~~~~~nvVG~Gi 89 (553) T protein:vir:63 12 VTSGRPEQSASLGGGGLEG-ASRLSRETVSWNPSLRSPDALIN-PLKRIADARGRDMADNDGFTNGAVGYQRDSIVGAQY 89 (553) T ss_pred cccccchhhhhhhcccccc-cccCCCcccccccCCCChHHHHH-HHHHHHHHHHHHHHhcChHHHHHHHHHHHhhccCCc Confidence 3333332111000001100 00000 0111100000000 000 0011122212222 256999999999999999998 Q ss_pred ecCCCC------------cccHHHHH---HHHHHh-----------cChhHHHHHHHHHHhhCCeEEEEEeeCCC-C--- Q lcl|NC_021301. 77 TVGGSA------------DSDLALRA---RRIWRD-----------NRMDSVCKQWVKYGLDFGESYLTCWRRDD-G--- 126 (456) Q Consensus 77 ~~~~~~------------d~~~~~~l---~~~~~~-----------n~~~~~~~~~~~~a~~~G~a~~~v~~d~d-g--- 126 (456) +..... +.+..+.+ |+.|.+ .+|...+..+++..++.|.+|+....... 
| T Consensus 90 ~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~~~~~ 169 (553) T protein:vir:63 90 RLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEWDRAANRPY 169 (553) T ss_pred eeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeeeccCCCCcc Confidence 864322 12222333 333332 24677888899999999999987654433 2 Q ss_pred ceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCC--eEEEEEEeeeecccccceeeccCCCceeecc Q lcl|NC_021301. 127 TATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGD--GWQKFARPCFVQSSSRRRLVTRISDSWVPVG 204 (456) Q Consensus 127 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (456) .+++..++|..+-.-++...+..+..+|.+ +.+|....+.++..+ ..+.... ....+....... -++-. T Consensus 170 ~~~lq~ie~drl~~~~~~~~~~~i~~GVE~--d~~Gr~vaY~i~~~hPgd~~~~~~------~~~~~~r~~~~~-~v~a~ 240 (553) T protein:vir:63 170 ATCFQMVSTDRLSNPYQQLDTPTLRRGVQY--DKRGRPQGYWIQVAHPGDLYQMAP------DMYKWKFVQQSK-PWGRR 240 (553) T ss_pred cceEEEechhhcCCCCCCCCCCeeEeeeEE--CCCCceEEEEeeccCCCccccccc------cccceeeecccc-ccChh Confidence 257899999987655544445556666643 455665554444321 1110000 000000000000 00001 Q ss_pred cccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc------ccccc----- Q lcl|NC_021301. 205 DAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV------DENGN----- 273 (456) Q Consensus 205 ~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~------~~~~~----- 273 (456) .+.|.+.. .+.....|.|.|.+++..+..++....-........+.--.+|+--.+..... ...+. 
T Consensus 241 ~vlH~f~~---~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (553) T protein:vir:63 241 QVIHILEP---REPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIF 317 (553) T ss_pred Hheecccc---cCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccc Confidence 11111110 12234579999999877554444333222222222222112333111100000 00000 Q ss_pred ---------hhhhhhhhhhhccceeccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHH Q lcl|NC_021301. 274 ---------AIDYASIFEAAPGALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAH 343 (456) Q Consensus 274 ---------~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~ 343 (456) ...........+|.+..+.++.++..+++. +..+|....+.+++.|++..|+|-+.+.++.++.|..+.+ T Consensus 318 ~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R 397 (553) T protein:vir:63 318 GKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQ 397 (553) T ss_pred cccccccccccccccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHH Confidence 000112234457888889999888877665 4567888889999999999999999998888888889999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHH---HhcCCC-------cc---------cceeEEecCCC--CcCHHHHHHH Q lcl|NC_021301. 344 NIEKGFLFKCEDRLSIAKIGLEA-ILVKAL---QIEGES-------VE---------DTVDVSFESPD--RVTLGEKYAA 401 (456) Q Consensus 344 ~~~~~l~~k~~~~~~~f~~~l~~-~~~l~~---~~~~~~-------~~---------~~i~v~f~~~~--~~~~~e~ad~ 401 (456) +.+......+...|..|...+-+ +++.-+ .+.|.- +. 
.-+.+.|..+- ..|....+++ T Consensus 398 ~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A 477 (553) T protein:vir:63 398 AGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQA 477 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHH Confidence 99999999999888888766544 333222 122311 00 11357787774 4688999999 Q ss_pred HHHHHhcCCCcHHHHHHhCCCChhHHHHH---HHHHHHHHHHHHhhhhh--------------------hhcccccC Q lcl|NC_021301. 402 ASLAKAAGESWASIRRNILNYNADQIKQD---DLDRAREQITLFAGNSV--------------------QRPQEDGS 455 (456) Q Consensus 402 ~~kl~~~g~~s~~t~~~~~~~~~~~~~~~---e~~~~~ee~~~~~~~~~--------------------~~~~~d~~ 455 (456) ...++.+|+.|.+.+....|.+++++-+. |.++.++ ......... ..++++|+ T Consensus 478 ~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~~~~-~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 478 AVMRIDAGLSTYEREIARLGGDFRKSFAQRAREDALLKK-YGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHHHHH-cCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 99999999999999988889998765432 1122211 111100000 01111111 No 90 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.70 E-value=8.6e-16 Score=103.17 Aligned_cols=436 Identities=11% Similarity=0.006 Sum_probs=207.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHH-------HHHHHHHHhcccCcccccCcccchhhhhhh-hhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMS-------RVRLLARYSNGDAPLPELTRNTSAAWRSFQ-REARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~-------r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~-~k~~~n~~~~iVd~~a~~l~ 72 (456) -+.++..+++.++...+..... ....=.+||.|.+ .+......++... ..++.|-++.+|+..+++-. 
T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g~~~ 98 (711) T protein:vir:10 23 KNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQ----WPSQVRTERELEQRPCLVNNVLPTFVDQVLGDQR 98 (711) T ss_pred cCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCC----CCHHHHHHHHhcCCCcEEEcchHHHHHHHhhhHh Confidence 4555556677777776543332 2334468999975 2222222222211 24678999999999999987 Q ss_pred cCCeec--CC------------------------CCcccHHHHHH----HHHHhcChhHHHHHHHHHHhhCCeEEEEEee Q lcl|NC_021301. 73 PNGITV--GG------------------------SADSDLALRAR----RIWRDNRMDSVCKQWVKYGLDFGESYLTCWR 122 (456) Q Consensus 73 ~~~~~~--~~------------------------~~d~~~~~~l~----~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~ 122 (456) -+.+.+ .. ..|.+..+.+. .++..|+.+.....+..+++++|.+|+-++. T Consensus 99 ~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ev~~ 178 (711) T protein:vir:10 99 QNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYLRVRS 178 (711) T ss_pred hCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceEEEEe Confidence 654322 11 12333333333 4556788999999999999999999986653 Q ss_pred C---C---CCceEEEEE-ccceeEEEEeCCCC------ceEEEEEEEEEecC--------C------------------- Q lcl|NC_021301. 123 R---D---DGTATITAD-SPETMVVSVDPLQP------WRIRSAMRWWRDLD--------A------------------- 162 (456) Q Consensus 123 d---~---dg~~~i~~~-~p~~~~~~~d~~~~------~~~~~~~~~~~~~d--------~------------------- 162 (456) | . +|++++..+ +|.++ +|||... .+.+ +.+.|.+.+ . T Consensus 179 d~~~~d~~~~e~~i~~v~~p~~v--~~Dp~a~~~D~sDar~~-~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~~~~ 255 (711) T protein:vir:10 179 DYLADDSFEQDLIIEAIQNQFSV--TIDPDAKKRDRSDMNWC-LIDDTMSKEKFKALYPDATAEPVYEDSVADYDTWFTE 255 (711) T ss_pred cccCCCCCCCCeEEeeecChhhe--eeCccccccChhhhcce-eeeecCCHHHHHHhCCchhhhhhhcccccccCcccCc Confidence 3 2 478888777 69885 5666432 1222 222332111 0 Q ss_pred c-eEEEEEEcCCeEEEEEEeeeec------cc----------cc------------ceeeccCCCceeecccccccCcee Q lcl|NC_021301. 
163 E-SDFAIVWSGDGWQKFARPCFVQ------SS----------SR------------RRLVTRISDSWVPVGDAVVTGSPP 213 (456) Q Consensus 163 ~-~~~~~~~~~~~~~~~~~~~~~~------~~----------~~------------~~~~~~~~~~~~~~~~~~~~~~~~ 213 (456) . .....+|..... .+....... .. .+ +.......+..+.+...+..++.+ T Consensus 256 ~~vrv~E~~~r~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G~~~L~~~~p~~~~~~ 334 (711) T protein:vir:10 256 KSVRVSEYFTREPV-IREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITGANVLEGPVEIPSTTI 334 (711) T ss_pred ceeeEEEEEeeeee-eeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEecceeecCCCCCCCCcc Confidence 0 000111111000 000000000 00 00 000000111222233333344555 Q ss_pred EEEEc-cC-------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhh-hcCCCcccccccccchhhhhhhhhhh Q lcl|NC_021301. 214 PVVVY-QN-------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRAL-KSAGHGLPKVDENGNAIDYASIFEAA 284 (456) Q Consensus 214 pvv~~-~n-------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i-~g~~~~~~~~~~~~~~~~~~~~~~~~ 284 (456) |+|++ .. ..+.|.+..+++.++.+|...|.+...+...+.+..++ .|.-.+ .+ .....-... T Consensus 335 P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai~~---~~------~~~~e~~~~ 405 (711) T protein:vir:10 335 PVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNVEG---RE------DEWEQANTK 405 (711) T ss_pred cEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCcccCC---hH------HHHHhcccc Confidence 65543 11 12457788999999999999999888776665543332 222110 00 011111133 Q ss_pred ccceeccCCCc----eeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 285 PGALWELPPGV----DIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSI 359 (456) Q Consensus 285 ~~~~~~~~~d~----~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~ 359 (456) ++.++..+++. .+...+.. ....+...+......|-.+||+++..+|..+++.||+|+......-.......... 
T Consensus 406 ~~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn 485 (711) T protein:vir:10 406 NFSLLTYIPQYQGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDN 485 (711) T ss_pred CCCeeEecccccCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHH Confidence 44444443332 34333332 24557777787888888899999999998877789999998887766666666666 Q ss_pred HHHHHHHHHHHHHHh-------------cCCCcccceeEEecC-----------------------------CCCcCHHH Q lcl|NC_021301. 360 AKIGLEAILVKALQI-------------EGESVEDTVDVSFES-----------------------------PDRVTLGE 397 (456) Q Consensus 360 f~~~l~~~~~l~~~~-------------~~~~~~~~i~v~f~~-----------------------------~~~~~~~e 397 (456) |..+.+++.++++.+ .|...... .+.+++ ..+.-..+ T Consensus 486 ~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~-~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~ 564 (711) T protein:vir:10 486 LTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETED-FVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIE 564 (711) T ss_pred HHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcc-eEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHH Confidence 776766666555432 11100000 011221 12222223 Q ss_pred HHHHHHHHHhcCCCcH------HHHHHhCCCC-hhHHH------------------H---HHHHHHHHHH--HH------ Q lcl|NC_021301. 398 KYAAASLAKAAGESWA------SIRRNILNYN-ADQIK------------------Q---DDLDRAREQI--TL------ 441 (456) Q Consensus 398 ~ad~~~kl~~~g~~s~------~t~~~~~~~~-~~~~~------------------~---~e~~~~~ee~--~~------ 441 (456) .+..+..+.+ .++. ..+++.+++. .+++. + ...+..+++. .. T Consensus 565 ~~~~l~ql~~--~~p~~~~~~~~~il~~~d~p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q 642 (711) T protein:vir:10 565 AAEAMIQFAQ--AVPSAAAVMADLIAQNMDWPGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQ 642 (711) T ss_pred HHHHHHHHHh--hcchhhhHHHHHHHHhcCCCCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333333332 1111 1122222221 00000 0 0000000000 00 Q ss_pred ----HhhhhhhhcccccCC Q lcl|NC_021301. 
442 ----FAGNSVQRPQEDGSR 456 (456) Q Consensus 442 ----~~~~~~~~~~~d~~~ 456 (456) .++....+.+.+..| T Consensus 643 ~~~~qa~ae~~~Aqae~~q 661 (711) T protein:vir:10 643 ADMAQAEADTAQAQADMLK 661 (711) T ss_pred HHHHHHHHHHHHHHHHHHH Confidence 000000111111111 No 91 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.68 E-value=1.7e-15 Score=101.59 Aligned_cols=425 Identities=12% Similarity=0.042 Sum_probs=227.5 Q ss_pred HHHHHHHHHHHH----HHHHHHHHHHHHhcccCccc---c--cCcccchh-------hhhhhhhh--ccChHHHHHHHHH Q lcl|NC_021301. 7 AEWLPVLTKRID----DGMSRVRLLARYSNGDAPLP---E--LTRNTSAA-------WRSFQREA--RTNWGLMVRDSVA 68 (456) Q Consensus 7 ~~~~~~l~~~~~----~~~~r~~~~~~YY~g~~~i~---~--~~~~~~~~-------~~~~~~k~--~~n~~~~iVd~~a 68 (456) +.|+++++.-.. .++.+-....+-|+|-..-. . .+.....+ ++..-+-+ .++|++-+|+..+ T Consensus 1 Mn~iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~ 80 (548) T protein:vir:95 1 MNLIDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDRLE 80 (548) T ss_pred CchHHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHH Confidence 456665555442 22222223334466632110 0 01111111 11111222 2569999999999 Q ss_pred hhhccC-CeecCC---CCcccHH----HH---HHHHHHh-------cChhHHHHHHHHHHhhCCeEEEEEeeCCCCc--- Q lcl|NC_021301. 69 DRIIPN-GITVGG---SADSDLA----LR---ARRIWRD-------NRMDSVCKQWVKYGLDFGESYLTCWRRDDGT--- 127 (456) Q Consensus 69 ~~l~~~-~~~~~~---~~d~~~~----~~---l~~~~~~-------n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~--- 127 (456) ++++|. |+.+.. ..|.+.. +. +|+-|.. .+|...+..+++..++.|.+|+....+..+. 
T Consensus 81 ~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~~~ 160 (548) T protein:vir:95 81 ERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNYTF 160 (548) T ss_pred HhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccccC Confidence 999984 433221 1222222 22 3333432 3477888999999999999998776544321 Q ss_pred -----eEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCC--eEEEEEEeeeecccccceeeccCCCce Q lcl|NC_021301. 128 -----ATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGD--GWQKFARPCFVQSSSRRRLVTRISDSW 200 (456) Q Consensus 128 -----~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (456) .++..++|..+-.-++.. ...+...|.+ +..|....+.++..+ ..+... ....| T Consensus 161 g~~~~~~lqliepd~l~~~~~~~-~~~i~~GIE~--D~~Grp~aY~i~~~hPgd~~~~~----------------~~~~~ 221 (548) T protein:vir:95 161 ATSVPFALELLEPDYLPFSYNNL-SKGIVQGIER--DTWRRKRAYHLLKDHPGNLQTLG----------------GSLAV 221 (548) T ss_pred CcccceEEEEechhhcCCCCCCC-CCceeeeeEE--CCCCceEEEEEeecCCCcccccc----------------cccce Confidence 479999999874333322 2345555543 455655544444321 100000 00001 Q ss_pred e--ecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhh Q lcl|NC_021301. 201 V--PVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYA 278 (456) Q Consensus 201 ~--~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~ 278 (456) . +-..+-|.+.. .+.....|.|.|.+++..+..++....-........+.--.+|+.-.++...... ..-... 
T Consensus 222 ~rvpA~~VlHif~~---~r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~--~~~~~~ 296 (548) T protein:vir:95 222 KRVEAERIIHIAYR---KRIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEP--GKDRKN 296 (548) T ss_pred eeechhHheecccc---cCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCC--Cccccc Confidence 1 11111222110 1223457999999987655444433332222233322222334322221111111 111112 Q ss_pred hhhhhhcccee-ccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 279 SIFEAAPGALW-ELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDR 356 (456) Q Consensus 279 ~~~~~~~~~~~-~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~ 356 (456) ......+|.++ .+.++.++..+++. +..+|...++.+++.|++..|+|-+.+.++.+ .|..+.++.+......+... T Consensus 297 ~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s-~nYSS~R~~l~e~~r~~~~~ 375 (548) T protein:vir:95 297 RTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD-GTYSAQRQELVEGWLGYDLL 375 (548) T ss_pred ccccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-hhHHHHHHHHHHHHHHHHHH Confidence 22344567764 57777787777654 34678888999999999999999999988875 58889999999999999988 Q ss_pred HHHHHHHHHH-HHHHHH---HhcCC---C---c-ccceeEEecCC--CCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCC Q lcl|NC_021301. 357 LSIAKIGLEA-ILVKAL---QIEGE---S---V-EDTVDVSFESP--DRVTLGEKYAAASLAKAAGESWASIRRNILNYN 423 (456) Q Consensus 357 ~~~f~~~l~~-~~~l~~---~~~~~---~---~-~~~i~v~f~~~--~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~ 423 (456) |..|...+-+ +++..+ .+.|. 
+ + ..-+.+.|..+ ...|....+++...++.+|+.|.+.+....|.+ T Consensus 376 q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~G~D 455 (548) T protein:vir:95 376 QHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFADEAEVARARGRD 455 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhCCC Confidence 8887766544 443322 12332 1 1 11257889776 356999999999999999999999988889999 Q ss_pred hhHHHHH-HHHH-HHHHHHHHhhhh--------hhhcccccCC Q lcl|NC_021301. 424 ADQIKQD-DLDR-AREQITLFAGNS--------VQRPQEDGSR 456 (456) Q Consensus 424 ~~~~~~~-e~~~-~~ee~~~~~~~~--------~~~~~~d~~~ 456 (456) ++++.+. ..++ ..++........ ...+.+..+| T Consensus 456 ~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~ 498 (548) T protein:vir:95 456 PRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQK 498 (548) T ss_pred HHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhh Confidence 8865432 1111 111111110000 0111111111 No 92 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.68 E-value=4.7e-17 Score=110.11 Aligned_cols=404 Identities=11% Similarity=-0.012 Sum_probs=201.3 Q ss_pred CCCCCHHHHHHHHHHHHH------------------------------------HH--HHHHHHHHHHhcccCcccccCc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRID------------------------------------DG--MSRVRLLARYSNGDAPLPELTR 42 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~------------------------------------~~--~~r~~~~~~YY~g~~~i~~~~~ 42 (456) ++..++.-....-...|. .. ..+-.-+..||-... +. . 
T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~- 103 (537) T protein:vir:10 28 FGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTFSAYANPNLSEGLVLWYAQQA-FI--G- 103 (537) T ss_pred CcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhhhhhccccccchhhhhccccC-Cc--c- Confidence 222111111110000000 00 000011112222221 00 0 Q ss_pred ccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCc----ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEE Q lcl|NC_021301. 43 NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD----SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYL 118 (456) Q Consensus 43 ~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d----~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~ 118 (456) ..+-... ..+.+++++||..+.-++-+|+.+.+..+ .+..+.+.+.|++-++...+.++.+.+..||.+++ T Consensus 104 ---~~l~a~Y--~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i 178 (537) T protein:vir:10 104 ---HQMCALI--ATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIA 178 (537) T ss_pred ---HHHHHHH--HhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEE Confidence 1122221 24678999999999999999988866432 23445677778888888999999999999999988 Q ss_pred EEeeCC-CCce----------------EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEe Q lcl|NC_021301. 119 TCWRRD-DGTA----------------TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARP 181 (456) Q Consensus 119 ~v~~d~-dg~~----------------~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 181 (456) ++..+. |+.. .+.+++|.++.+.........+. ....|++.+..+ ....++. T Consensus 179 ~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~------sp~fg~P~~y~v--~g~~iH~--- 247 (537) T protein:vir:10 179 LFKVDSPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPV------SMHFYEPTYWLI--NGKKYHR--- 247 (537) T ss_pred EEeecCcCCcccccccccccccccceeEEEEechhhcccccchhhhccCC------ccccCCceeeee--cCeEecc--- Confidence 876542 3221 13334444433321000000000 000011111111 0000000 Q ss_pred eeecccccceeeccCCCceeecccccccCcee-EE--EEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhh Q lcl|NC_021301. 
182 CFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP-PV--VVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRAL 258 (456) Q Consensus 182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-pv--v~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i 258 (456) ....++.+.+ |- -...+-+|.|.++.+.+-+.+++++.-...........+...+ T Consensus 248 ----------------------SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~ 305 (537) T protein:vir:10 248 ----------------------SHLAIYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKV 305 (537) T ss_pred ----------------------eeEEEecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee Confidence 0001111111 10 0112346999999988888888877776554444444333333 Q ss_pred hcCCCccccccccc--chhhhhhhhhhhccceeccCCC-ceeEeecccchHHHHHHHHHHHHHHHhhcCCChhh-hccc- Q lcl|NC_021301. 259 KSAGHGLPKVDENG--NAIDYASIFEAAPGALWELPPG-VDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPM-LMPD- 333 (456) Q Consensus 259 ~g~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~d-~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~-~~~~- 333 (456) .|.. ...++.+ ..+..........+ ++.++.+ .++.+++ +++++..+.+....+.|+++++||..- ||.. T Consensus 306 ~~~~---~l~~~~~~~~r~~~~~~~r~n~g-~~~id~e~e~~e~~~-~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp 380 (537) T protein:vir:10 306 DAAQ---VLANKQQFDETMSWWTATRDNYQ-VRVVDKDNEDVVQID-TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVP 380 (537) T ss_pred chHH---hhcCHHHHHHHHHHHHhhcCCcc-eeEecCCCceeEEEe-ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCc Confidence 3321 1111111 11111122222223 3455554 5555554 457778888888999999999999875 4543 Q ss_pred -ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH-------HHHHH Q lcl|NC_021301. 334 -SANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA-------AASLA 405 (456) Q Consensus 334 -~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad-------~~~kl 405 (456) .-|+||+.-...+... |+.+|..+.+.+++++.+++...... 
+..+++.|++-...+.+|.|+ +..++ T Consensus 381 ~GlnatGe~D~~~yyd~---I~~~Qe~l~p~l~~l~~ll~~~~~~~-~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~ 456 (537) T protein:vir:10 381 TGFNSTGDYEEASYHEE---CESTQDDMRPLIDRHHQLVCRSHLRK-RIRVKVEFPPMDAPKESERADTFLKKMQAAKLA 456 (537) T ss_pred cccccchhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhcCCC-CcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHH Confidence 2356777655544444 34444457899999998887655443 456889999998899888665 57788 Q ss_pred HhcCCCcHHHHHHhCCCChh----------HHHHHHHHHHHHHHHHHhhhhhhhcc------------------cccCC Q lcl|NC_021301. 406 KAAGESWASIRRNILNYNAD----------QIKQDDLDRAREQITLFAGNSVQRPQ------------------EDGSR 456 (456) Q Consensus 406 ~~~g~~s~~t~~~~~~~~~~----------~~~~~e~~~~~ee~~~~~~~~~~~~~------------------~d~~~ 456 (456) .++|+++...+++.|+..++ +.+..|.....++.. ........++ +.|.+ T Consensus 457 ~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 534 (537) T protein:vir:10 457 FEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAEDIDVDDEGK-PVRIIEDQPAPSEMFGATSSGESANDPRDSGAA 534 (537) T ss_pred HHcCCCCHHHHHHHHhccCccccccccCCCChhhhhcccCCccCC-cCCCCCCCCCccccCCCCccccccCCCccCccc Confidence 88899999888777643211 011111111111111 1111111111 11111 No 93 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.65 E-value=1.4e-14 Score=96.57 Aligned_cols=438 Identities=13% Similarity=0.064 Sum_probs=198.2 Q ss_pred CCCCCHH----HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTASTPA----EWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t~~----~~~~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~~iVd~~a 68 (456) |.+--+. ++.+++...+... +....+-.+||.|.+ .+......++.. 
...++.|-++.+|+..+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:81 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ----LPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 3332211 2333333333222 222335568999975 222222222111 12477899999999999 Q ss_pred hhhccCCe--ecCC-CCc-c--cHHHH----HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---CceEEEEEcc Q lcl|NC_021301. 69 DRIIPNGI--TVGG-SAD-S--DLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTATITADSP 135 (456) Q Consensus 69 ~~l~~~~~--~~~~-~~d-~--~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~~i~~~~p 135 (456) ++-.-+.. .+.. ..+ . +..+. +..++..|+.+...+.+..+++++|.+|+-++.+.| +.+++..++| T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p 163 (714) T protein:vir:81 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSR 163 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecch Confidence 99876653 2222 111 1 23333 345666789999999999999999999998887643 5588999999 Q ss_pred ceeEEEEeCCCC------ceEEEEEEEEEecCC------------------------------ceE-------------- Q lcl|NC_021301. 136 ETMVVSVDPLQP------WRIRSAMRWWRDLDA------------------------------ESD-------------- 165 (456) Q Consensus 136 ~~~~~~~d~~~~------~~~~~~~~~~~~~d~------------------------------~~~-------------- 165 (456) .+++ |||... .+.+ +++.|.+.+. ... T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~-~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:81 164 NEVF--WDWLSREADLSDCRWL-MRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred hhee--eccccccCChhhccce-eeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 9955 665321 1222 2222221100 000 Q ss_pred -EEEEEcCC-eEEEEEEeeee--------ccccccee-----------------------------ec-cCCCceeeccc Q lcl|NC_021301. 
166 -FAIVWSGD-GWQKFARPCFV--------QSSSRRRL-----------------------------VT-RISDSWVPVGD 205 (456) Q Consensus 166 -~~~~~~~~-~~~~~~~~~~~--------~~~~~~~~-----------------------------~~-~~~~~~~~~~~ 205 (456) ....+..+ .-+.....++. ....+... +. -.+......+. T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~ 320 (714) T protein:vir:81 241 QQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRP 320 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCC Confidence 00000000 00010011100 00000000 00 00111111122 Q ss_pred ccccCceeEEEEc-cC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhh Q lcl|NC_021301. 206 AVVTGSPPPVVVY-QN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYAS 279 (456) Q Consensus 206 ~~~~~~~~pvv~~-~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~ 279 (456) .+...+.+|+|++ .. .. ..|-+..+++.++.+|...|.+...+ .+....+..|.. +..++ .... T Consensus 321 ~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~---~~~d~-----~~~e 390 (714) T protein:vir:81 321 CSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT---QLSDN-----DLME 390 (714) T ss_pred CCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc---cccHH-----HHHH Confidence 2222333444432 11 11 23667889999999999999865543 233222222211 11110 0001 Q ss_pred hhhhhccceeccCCC--------ceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 280 IFEAAPGALWELPPG--------VDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 280 ~~~~~~~~~~~~~~d--------~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) . .+.++.+...+++ .++...+. .-...+...+......|-.+||+.+..+|..+++.||+|+......-. 
T Consensus 391 ~-~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~ 469 (714) T protein:vir:81 391 Q-IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGA 469 (714) T ss_pred h-ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHH Confidence 1 1223333333332 22222222 235667788888888898999999999998776679999888777655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----hc---------CCCcc--------------------------cceeEEecCCC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQ----IE---------GESVE--------------------------DTVDVSFESPD 391 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~----~~---------~~~~~--------------------------~~i~v~f~~~~ 391 (456) .........+..+.+++.++++. .- |..+. ++|.+.=.+.. T Consensus 470 ~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~ 549 (714) T protein:vir:81 470 TTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQT 549 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCc Confidence 55555555566666665554432 21 11010 01111112222 Q ss_pred CcCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCh-hHHHH-H---------------HHHHHHHHHHH-------- Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAA-----GESWASIRRNILNYNA-DQIKQ-D---------------DLDRAREQITL-------- 441 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~-----g~~s~~t~~~~~~~~~-~~~~~-~---------------e~~~~~ee~~~-------- 441 (456) |....+.++.+..+.++ +.+...++++.+.+.. +++.+ + |.+.++.+... T Consensus 550 ~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:81 550 PAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred hHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 22234555555555542 1111233444444310 11100 0 00000000000 Q ss_pred ---HhhhhhhhcccccCC Q lcl|NC_021301. 
442 ---FAGNSVQRPQEDGSR 456 (456) Q Consensus 442 ---~~~~~~~~~~~d~~~ 456 (456) ..+...+..+.+..+ T Consensus 630 q~~~~~a~~~k~eae~~~ 647 (714) T protein:vir:81 630 QMREMAGRVAKLEADAAR 647 (714) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 001111111112222 No 94 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.65 E-value=1.4e-14 Score=96.57 Aligned_cols=438 Identities=13% Similarity=0.064 Sum_probs=198.2 Q ss_pred CCCCCHH----HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTASTPA----EWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t~~----~~~~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~~iVd~~a 68 (456) |.+--+. ++.+++...+... +....+-.+||.|.+ .+......++.. ...++.|-++.+|+..+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:10 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ----LPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 3332211 2333333333222 222335568999975 222222222111 12477899999999999 Q ss_pred hhhccCCe--ecCC-CCc-c--cHHHH----HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---CceEEEEEcc Q lcl|NC_021301. 69 DRIIPNGI--TVGG-SAD-S--DLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTATITADSP 135 (456) Q Consensus 69 ~~l~~~~~--~~~~-~~d-~--~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~~i~~~~p 135 (456) ++-.-+.. .+.. ..+ . +..+. 
+..++..|+.+...+.+..+++++|.+|+-++.+.| +.+++..++| T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p 163 (714) T protein:vir:10 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSR 163 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecch Confidence 99876653 2222 111 1 23333 345666789999999999999999999998887643 5588999999 Q ss_pred ceeEEEEeCCCC------ceEEEEEEEEEecCC------------------------------ceE-------------- Q lcl|NC_021301. 136 ETMVVSVDPLQP------WRIRSAMRWWRDLDA------------------------------ESD-------------- 165 (456) Q Consensus 136 ~~~~~~~d~~~~------~~~~~~~~~~~~~d~------------------------------~~~-------------- 165 (456) .+++ |||... .+.+ +++.|.+.+. ... T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~-~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:10 164 NEVF--WDWLSREADLSDCRWL-MRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred hhee--eccccccCChhhccce-eeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 9955 665321 1222 2222221100 000 Q ss_pred -EEEEEcCC-eEEEEEEeeee--------ccccccee-----------------------------ec-cCCCceeeccc Q lcl|NC_021301. 166 -FAIVWSGD-GWQKFARPCFV--------QSSSRRRL-----------------------------VT-RISDSWVPVGD 205 (456) Q Consensus 166 -~~~~~~~~-~~~~~~~~~~~--------~~~~~~~~-----------------------------~~-~~~~~~~~~~~ 205 (456) ....+..+ .-+.....++. ....+... +. -.+......+. T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~ 320 (714) T protein:vir:10 241 QQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRP 320 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCC Confidence 00000000 00010011100 00000000 00 00111111122 Q ss_pred ccccCceeEEEEc-cC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhh Q lcl|NC_021301. 
206 AVVTGSPPPVVVY-QN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYAS 279 (456) Q Consensus 206 ~~~~~~~~pvv~~-~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~ 279 (456) .+...+.+|+|++ .. .. ..|-+..+++.++.+|...|.+...+ .+....+..|.. +..++ .... T Consensus 321 ~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~---~~~d~-----~~~e 390 (714) T protein:vir:10 321 CSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT---QLSDN-----DLME 390 (714) T ss_pred CCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc---cccHH-----HHHH Confidence 2222333444432 11 11 23667889999999999999865543 233222222211 11110 0001 Q ss_pred hhhhhccceeccCCC--------ceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 280 IFEAAPGALWELPPG--------VDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 280 ~~~~~~~~~~~~~~d--------~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) . .+.++.+...+++ .++...+. .-...+...+......|-.+||+.+..+|..+++.||+|+......-. T Consensus 391 ~-~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~ 469 (714) T protein:vir:10 391 Q-IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGA 469 (714) T ss_pred h-ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHH Confidence 1 1223333333332 22222222 235667788888888898999999999998776679999888777655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----hc---------CCCcc--------------------------cceeEEecCCC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQ----IE---------GESVE--------------------------DTVDVSFESPD 391 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~----~~---------~~~~~--------------------------~~i~v~f~~~~ 391 (456) .........+..+.+++.++++. .- |..+. ++|.+.=.+.. 
T Consensus 470 ~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~ 549 (714) T protein:vir:10 470 TTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQT 549 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCc Confidence 55555555566666665554432 21 11010 01111112222 Q ss_pred CcCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCh-hHHHH-H---------------HHHHHHHHHHH-------- Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAA-----GESWASIRRNILNYNA-DQIKQ-D---------------DLDRAREQITL-------- 441 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~-----g~~s~~t~~~~~~~~~-~~~~~-~---------------e~~~~~ee~~~-------- 441 (456) |....+.++.+..+.++ +.+...++++.+.+.. +++.+ + |.+.++.+... T Consensus 550 ~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:10 550 PAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred hHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 22234555555555542 1111233444444310 11100 0 00000000000 Q ss_pred ---HhhhhhhhcccccCC Q lcl|NC_021301. 442 ---FAGNSVQRPQEDGSR 456 (456) Q Consensus 442 ---~~~~~~~~~~~d~~~ 456 (456) ..+...+..+.+..+ T Consensus 630 q~~~~~a~~~k~eae~~~ 647 (714) T protein:vir:10 630 QMREMAGRVAKLEADAAR 647 (714) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 001111111112222 No 95 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.65 E-value=1.4e-14 Score=96.57 Aligned_cols=438 Identities=13% Similarity=0.064 Sum_probs=198.2 Q ss_pred CCCCCHH----HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHHHHHHHHH Q lcl|NC_021301. 
1 MTASTPA----EWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t~~----~~~~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~~iVd~~a 68 (456) |.+--+. ++.+++...+... +....+-.+||.|.+ .+......++.. ...++.|-++.+|+..+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:99 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ----LPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 3332211 2333333333222 222335568999975 222222222111 12477899999999999 Q ss_pred hhhccCCe--ecCC-CCc-c--cHHHH----HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---CceEEEEEcc Q lcl|NC_021301. 69 DRIIPNGI--TVGG-SAD-S--DLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTATITADSP 135 (456) Q Consensus 69 ~~l~~~~~--~~~~-~~d-~--~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~~i~~~~p 135 (456) ++-.-+.. .+.. ..+ . +..+. +..++..|+.+...+.+..+++++|.+|+-++.+.| +.+++..++| T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p 163 (714) T protein:vir:99 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSR 163 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecch Confidence 99876653 2222 111 1 23333 345666789999999999999999999998887643 5588999999 Q ss_pred ceeEEEEeCCCC------ceEEEEEEEEEecCC------------------------------ceE-------------- Q lcl|NC_021301. 136 ETMVVSVDPLQP------WRIRSAMRWWRDLDA------------------------------ESD-------------- 165 (456) Q Consensus 136 ~~~~~~~d~~~~------~~~~~~~~~~~~~d~------------------------------~~~-------------- 165 (456) .+++ |||... .+.+ +++.|.+.+. ... 
T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~-~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:99 164 NEVF--WDWLSREADLSDCRWL-MRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred hhee--eccccccCChhhccce-eeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 9955 665321 1222 2222221100 000 Q ss_pred -EEEEEcCC-eEEEEEEeeee--------ccccccee-----------------------------ec-cCCCceeeccc Q lcl|NC_021301. 166 -FAIVWSGD-GWQKFARPCFV--------QSSSRRRL-----------------------------VT-RISDSWVPVGD 205 (456) Q Consensus 166 -~~~~~~~~-~~~~~~~~~~~--------~~~~~~~~-----------------------------~~-~~~~~~~~~~~ 205 (456) ....+..+ .-+.....++. ....+... +. -.+......+. T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~ 320 (714) T protein:vir:99 241 QQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRP 320 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCC Confidence 00000000 00010011100 00000000 00 00111111122 Q ss_pred ccccCceeEEEEc-cC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhh Q lcl|NC_021301. 206 AVVTGSPPPVVVY-QN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYAS 279 (456) Q Consensus 206 ~~~~~~~~pvv~~-~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~ 279 (456) .+...+.+|+|++ .. .. ..|-+..+++.++.+|...|.+...+ .+....+..|.. +..++ .... T Consensus 321 ~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~---~~~d~-----~~~e 390 (714) T protein:vir:99 321 CSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT---QLSDN-----DLME 390 (714) T ss_pred CCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc---cccHH-----HHHH Confidence 2222333444432 11 11 23667889999999999999865543 233222222211 11110 0001 Q ss_pred hhhhhccceeccCCC--------ceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 
280 IFEAAPGALWELPPG--------VDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 280 ~~~~~~~~~~~~~~d--------~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) . .+.++.+...+++ .++...+. .-...+...+......|-.+||+.+..+|..+++.||+|+......-. T Consensus 391 ~-~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~ 469 (714) T protein:vir:99 391 Q-IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGA 469 (714) T ss_pred h-ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHH Confidence 1 1223333333332 22222222 235667788888888898999999999998776679999888777655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----hc---------CCCcc--------------------------cceeEEecCCC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQ----IE---------GESVE--------------------------DTVDVSFESPD 391 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~----~~---------~~~~~--------------------------~~i~v~f~~~~ 391 (456) .........+..+.+++.++++. .- |..+. ++|.+.=.+.. T Consensus 470 ~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~ 549 (714) T protein:vir:99 470 TTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQT 549 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCc Confidence 55555555566666665554432 21 11010 01111112222 Q ss_pred CcCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCh-hHHHH-H---------------HHHHHHHHHHH-------- Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAA-----GESWASIRRNILNYNA-DQIKQ-D---------------DLDRAREQITL-------- 441 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~-----g~~s~~t~~~~~~~~~-~~~~~-~---------------e~~~~~ee~~~-------- 441 (456) |....+.++.+..+.++ +.+...++++.+.+.. +++.+ + |.+.++.+... 
T Consensus 550 ~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:99 550 PAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred hHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 22234555555555542 1111233444444310 11100 0 00000000000 Q ss_pred ---HhhhhhhhcccccCC Q lcl|NC_021301. 442 ---FAGNSVQRPQEDGSR 456 (456) Q Consensus 442 ---~~~~~~~~~~~d~~~ 456 (456) ..+...+..+.+..+ T Consensus 630 q~~~~~a~~~k~eae~~~ 647 (714) T protein:vir:99 630 QMREMAGRVAKLEADAAR 647 (714) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 001111111112222 No 96 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.65 E-value=1.4e-14 Score=96.57 Aligned_cols=438 Identities=13% Similarity=0.064 Sum_probs=198.2 Q ss_pred CCCCCHH----HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTASTPA----EWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t~~----~~~~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~~iVd~~a 68 (456) |.+--+. ++.+++...+... +....+-.+||.|.+ .+......++.. ...++.|-++.+|+..+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:27 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ----LPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 3332211 2333333333222 222335568999975 222222222111 12477899999999999 Q ss_pred hhhccCCe--ecCC-CCc-c--cHHHH----HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---CceEEEEEcc Q lcl|NC_021301. 
69 DRIIPNGI--TVGG-SAD-S--DLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTATITADSP 135 (456) Q Consensus 69 ~~l~~~~~--~~~~-~~d-~--~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~~i~~~~p 135 (456) ++-.-+.. .+.. ..+ . +..+. +..++..|+.+...+.+..+++++|.+|+-++.+.| +.+++..++| T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p 163 (714) T protein:vir:27 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSR 163 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecch Confidence 99876653 2222 111 1 23333 345666789999999999999999999998887643 5588999999 Q ss_pred ceeEEEEeCCCC------ceEEEEEEEEEecCC------------------------------ceE-------------- Q lcl|NC_021301. 136 ETMVVSVDPLQP------WRIRSAMRWWRDLDA------------------------------ESD-------------- 165 (456) Q Consensus 136 ~~~~~~~d~~~~------~~~~~~~~~~~~~d~------------------------------~~~-------------- 165 (456) .+++ |||... .+.+ +++.|.+.+. ... T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~-~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:27 164 NEVF--WDWLSREADLSDCRWL-MRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred hhee--eccccccCChhhccce-eeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 9955 665321 1222 2222221100 000 Q ss_pred -EEEEEcCC-eEEEEEEeeee--------ccccccee-----------------------------ec-cCCCceeeccc Q lcl|NC_021301. 166 -FAIVWSGD-GWQKFARPCFV--------QSSSRRRL-----------------------------VT-RISDSWVPVGD 205 (456) Q Consensus 166 -~~~~~~~~-~~~~~~~~~~~--------~~~~~~~~-----------------------------~~-~~~~~~~~~~~ 205 (456) ....+..+ .-+.....++. ....+... +. -.+......+. 
T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~ 320 (714) T protein:vir:27 241 QQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRP 320 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCC Confidence 00000000 00010011100 00000000 00 00111111122 Q ss_pred ccccCceeEEEEc-cC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhh Q lcl|NC_021301. 206 AVVTGSPPPVVVY-QN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYAS 279 (456) Q Consensus 206 ~~~~~~~~pvv~~-~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~ 279 (456) .+...+.+|+|++ .. .. ..|-+..+++.++.+|...|.+...+ .+....+..|.. +..++ .... T Consensus 321 ~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~---~~~d~-----~~~e 390 (714) T protein:vir:27 321 CSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT---QLSDN-----DLME 390 (714) T ss_pred CCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc---cccHH-----HHHH Confidence 2222333444432 11 11 23667889999999999999865543 233222222211 11110 0001 Q ss_pred hhhhhccceeccCCC--------ceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 280 IFEAAPGALWELPPG--------VDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 280 ~~~~~~~~~~~~~~d--------~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) . .+.++.+...+++ .++...+. .-...+...+......|-.+||+.+..+|..+++.||+|+......-. 
T Consensus 391 ~-~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~ 469 (714) T protein:vir:27 391 Q-IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGA 469 (714) T ss_pred h-ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHH Confidence 1 1223333333332 22222222 235667788888888898999999999998776679999888777655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----hc---------CCCcc--------------------------cceeEEecCCC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQ----IE---------GESVE--------------------------DTVDVSFESPD 391 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~----~~---------~~~~~--------------------------~~i~v~f~~~~ 391 (456) .........+..+.+++.++++. .- |..+. ++|.+.=.+.. T Consensus 470 ~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~ 549 (714) T protein:vir:27 470 TTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQT 549 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCc Confidence 55555555566666665554432 21 11010 01111112222 Q ss_pred CcCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCh-hHHHH-H---------------HHHHHHHHHHH-------- Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAA-----GESWASIRRNILNYNA-DQIKQ-D---------------DLDRAREQITL-------- 441 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~-----g~~s~~t~~~~~~~~~-~~~~~-~---------------e~~~~~ee~~~-------- 441 (456) |....+.++.+..+.++ +.+...++++.+.+.. +++.+ + |.+.++.+... T Consensus 550 ~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:27 550 PAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred hHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 22234555555555542 1111233444444310 11100 0 00000000000 Q ss_pred ---HhhhhhhhcccccCC Q lcl|NC_021301. 
442 ---FAGNSVQRPQEDGSR 456 (456) Q Consensus 442 ---~~~~~~~~~~~d~~~ 456 (456) ..+...+..+.+..+ T Consensus 630 q~~~~~a~~~k~eae~~~ 647 (714) T protein:vir:27 630 QMREMAGRVAKLEADAAR 647 (714) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 001111111112222 No 97 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.65 E-value=1.4e-14 Score=96.57 Aligned_cols=438 Identities=13% Similarity=0.064 Sum_probs=198.2 Q ss_pred CCCCCHH----HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTASTPA----EWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t~~----~~~~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~~iVd~~a 68 (456) |.+--+. ++.+++...+... +....+-.+||.|.+ .+......++.. ...++.|-++.+|+..+ T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~~~v~~v~ 83 (714) T protein:vir:32 8 MATKNDNGATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQ----LPPEVLQVLKDRGQPMTIHNLIAPTVDGVL 83 (714) T ss_pred ccCCCCcchhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHhcCCCcEEeccHHHHHHHHH Confidence 3332211 2333333333222 222335568999975 222222222111 12477899999999999 Q ss_pred hhhccCCe--ecCC-CCc-c--cHHHH----HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---CceEEEEEcc Q lcl|NC_021301. 69 DRIIPNGI--TVGG-SAD-S--DLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTATITADSP 135 (456) Q Consensus 69 ~~l~~~~~--~~~~-~~d-~--~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~~i~~~~p 135 (456) ++-.-+.. .+.. ..+ . +..+. 
+..++..|+.+...+.+..+++++|.+|+-++.+.| +.+++..++| T Consensus 84 g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p 163 (714) T protein:vir:32 84 GMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSR 163 (714) T ss_pred hHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecch Confidence 99876653 2222 111 1 23333 345666789999999999999999999998887643 5588999999 Q ss_pred ceeEEEEeCCCC------ceEEEEEEEEEecCC------------------------------ceE-------------- Q lcl|NC_021301. 136 ETMVVSVDPLQP------WRIRSAMRWWRDLDA------------------------------ESD-------------- 165 (456) Q Consensus 136 ~~~~~~~d~~~~------~~~~~~~~~~~~~d~------------------------------~~~-------------- 165 (456) .+++ |||... .+.+ +++.|.+.+. ... T Consensus 164 ~~v~--~Dp~a~~~D~sDar~~-~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 240 (714) T protein:vir:32 164 NEVF--WDWLSREADLSDCRWL-MRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDR 240 (714) T ss_pred hhee--eccccccCChhhccce-eeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhccccc Confidence 9955 665321 1222 2222221100 000 Q ss_pred -EEEEEcCC-eEEEEEEeeee--------ccccccee-----------------------------ec-cCCCceeeccc Q lcl|NC_021301. 166 -FAIVWSGD-GWQKFARPCFV--------QSSSRRRL-----------------------------VT-RISDSWVPVGD 205 (456) Q Consensus 166 -~~~~~~~~-~~~~~~~~~~~--------~~~~~~~~-----------------------------~~-~~~~~~~~~~~ 205 (456) ....+..+ .-+.....++. ....+... +. -.+......+. T Consensus 241 ~~~~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~ 320 (714) T protein:vir:32 241 QQNEWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRP 320 (714) T ss_pred cccccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCC Confidence 00000000 00010011100 00000000 00 00111111122 Q ss_pred ccccCceeEEEEc-cC---CC--CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhh Q lcl|NC_021301. 
206 AVVTGSPPPVVVY-QN---PD--GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYAS 279 (456) Q Consensus 206 ~~~~~~~~pvv~~-~n---~~--g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~ 279 (456) .+...+.+|+|++ .. .. ..|-+..+++.++.+|...|.+...+ .+....+..|.. +..++ .... T Consensus 321 ~p~p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~---~~~d~-----~~~e 390 (714) T protein:vir:32 321 CSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDAT---QLSDN-----DLME 390 (714) T ss_pred CCCCCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcc---cccHH-----HHHH Confidence 2222333444432 11 11 23667889999999999999865543 233222222211 11110 0001 Q ss_pred hhhhhccceeccCCC--------ceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 280 IFEAAPGALWELPPG--------VDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 280 ~~~~~~~~~~~~~~d--------~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) . .+.++.+...+++ .++...+. .-...+...+......|-.+||+.+..+|..+++.||+|+......-. T Consensus 391 ~-~arp~~vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~ 469 (714) T protein:vir:32 391 Q-IERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGA 469 (714) T ss_pred h-ccCCCCceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHH Confidence 1 1223333333332 22222222 235667788888888898999999999998776679999888777655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----hc---------CCCcc--------------------------cceeEEecCCC Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQ----IE---------GESVE--------------------------DTVDVSFESPD 391 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~----~~---------~~~~~--------------------------~~i~v~f~~~~ 391 (456) .........+..+.+++.++++. .- |..+. ++|.+.=.+.. 
T Consensus 470 ~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~ 549 (714) T protein:vir:32 470 TTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQT 549 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCc Confidence 55555555566666665554432 21 11010 01111112222 Q ss_pred CcCHHHHHHHHHHHHhc-----CCCcHHHHHHhCCCCh-hHHHH-H---------------HHHHHHHHHHH-------- Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAA-----GESWASIRRNILNYNA-DQIKQ-D---------------DLDRAREQITL-------- 441 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~-----g~~s~~t~~~~~~~~~-~~~~~-~---------------e~~~~~ee~~~-------- 441 (456) |....+.++.+..+.++ +.+...++++.+.+.. +++.+ + |.+.++.+... T Consensus 550 ~t~r~~~~~~l~~l~~~~~p~~~~~~~~~~l~~~d~p~~~el~~~ir~~~~~~~~~~~~~~e~q~~~~~~q~~~~~q~~l 629 (714) T protein:vir:32 550 PAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQQQALQQQQAEL 629 (714) T ss_pred hHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCCCHHHHHHHHHHHcCCCCCccccchhhHHHHHHHHHHHHHHHHH Confidence 22234555555555542 1111233444444310 11100 0 00000000000 Q ss_pred ---HhhhhhhhcccccCC Q lcl|NC_021301. 442 ---FAGNSVQRPQEDGSR 456 (456) Q Consensus 442 ---~~~~~~~~~~~d~~~ 456 (456) ..+...+..+.+..+ T Consensus 630 q~~~~~a~~~k~eae~~~ 647 (714) T protein:vir:32 630 QMREMAGRVAKLEADAAR 647 (714) T ss_pred HHHHHHHHHHHHHHHHHH Confidence 001111111112222 No 98 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.64 E-value=2.8e-16 Score=105.84 Aligned_cols=412 Identities=12% Similarity=0.032 Sum_probs=198.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH-----HHH-------HHhcccCccc---c-cCcccchhhhhhhhhhccChHHHHH Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVR-----LLA-------RYSNGDAPLP---E-LTRNTSAAWRSFQREARTNWGLMVR 64 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~-----~~~-------~YY~g~~~i~---~-~~~~~~~~~~~~~~k~~~n~~~~iV 64 (456) |.+.--..-...++..|...-+.+. -+. .+.-|+.... . .+...+ .+........+.+++++| T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~~-~~~l~a~Y~~~~l~r~~V 101 (532) T protein:vir:94 23 VDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSWP-GFPTLALLAQLPEYRTMH 101 (532) T ss_pred hhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCcccccccccccccccccc-hHHHHHHHHcCchhhhhh Confidence 3333333223334444332211110 000 1111111000 0 000111 111111112356779999 Q ss_pred HHHHhhhccCCeecCCCCc----ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce------------ Q lcl|NC_021301. 65 DSVADRIIPNGITVGGSAD----SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA------------ 128 (456) Q Consensus 65 d~~a~~l~~~~~~~~~~~d----~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~------------ 128 (456) |..+.-++-+++++.++.+ .+..+.+...|++-++...+.++.+++..||.|++++..+.+|.. T Consensus 102 d~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~~~p~~l~~~ 181 (532) T protein:vir:94 102 ETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPADAPLLLSPS 181 (532) T ss_pred ccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCcccccccccccccc Confidence 9999999999988865432 233445666677667888999999999999999988766543310 Q ss_pred --------EEEEEccceeEEEE-eCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCc Q lcl|NC_021301. 129 --------TITADSPETMVVSV-DPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDS 199 (456) Q Consensus 129 --------~i~~~~p~~~~~~~-d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (456) .+.+++|.++.|.. +..++.. ...|++.+..+.... .++. 
T Consensus 182 ~I~~g~~~~l~vld~~~v~p~~~~~~dp~s---------p~fg~P~~y~v~~g~-~iH~--------------------- 230 (532) T protein:vir:94 182 FVQRGCLIGFATIEPMWLSPNAYNATDPTL---------PSFYKPDSWIATSGK-KIHS--------------------- 230 (532) T ss_pred ccccceeeEEEeechheecccccccccccc---------cccCCceeEEEccCe-eecc--------------------- Confidence 13334444433321 1011100 001122211111000 0000 Q ss_pred eeecccccccCce-eEE-E-EccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-h Q lcl|NC_021301. 200 WVPVGDAVVTGSP-PPV-V-VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-I 275 (456) Q Consensus 200 ~~~~~~~~~~~~~-~pv-v-~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~ 275 (456) ....++.+. +|- . +..+-+|.|.++++.+-+..++++.-........ +.+.+++- +.......+.... . T Consensus 231 ----SRli~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~--~~~~v~k~-~~a~~ls~~~~~~~~ 303 (532) T protein:vir:94 231 ----SRIHTVVGRPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQ--FSMTNLAT-DMAQLLAPGGAQSLD 303 (532) T ss_pred ----ceEEEecCCCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHh--cCCceeee-chHHhhcchhHHHHH Confidence 000111110 110 0 1123368999998888888888776654443333 33333321 1111111111111 1 Q ss_pred hhhh---hhhhhccceeccC-CCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhh-cccc--cCcHHHHHHHHHHH Q lcl|NC_021301. 276 DYAS---IFEAAPGALWELP-PGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPML-MPDS--ANQSAEGAHNIEKG 348 (456) Q Consensus 276 ~~~~---~~~~~~~~~~~~~-~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-~~~~--~N~Sg~Al~~~~~~ 348 (456) .... ......+. +.++ .+.++.+++ .++++..+.++...+.|++.++||..-| |... -|++|+.-..-|. 
T Consensus 304 ~r~~~~~~~~~n~g~-~~id~~~e~~e~~~-~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yy- 380 (532) T protein:vir:94 304 ARLQLFNLYRDNRNI-GALDKGTEEIQQTN-TPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWY- 380 (532) T ss_pred HHHHHHHhhcCCccc-eEEcCCCceeEEEe-cccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHH- Confidence 1122 12222233 4444 345666665 4567788888999999999999998754 5432 2456765444443 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH-------HHHHHHhcCCCcHHHHHHhC Q lcl|NC_021301. 349 FLFKCEDRL-SIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA-------AASLAKAAGESWASIRRNIL 420 (456) Q Consensus 349 l~~k~~~~~-~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad-------~~~kl~~~g~~s~~t~~~~~ 420 (456) ..|+.+| ..+.+.+++++.+++.......+.++.+.|++-...+.+|.|+ +..++.++|+++.+.+++.+ T Consensus 381 --d~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l 458 (532) T protein:vir:94 381 --DFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPGLAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRL 458 (532) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHH Confidence 4444444 4567888998888765443333456889999988888887654 55677788999998888777 Q ss_pred CCChh------HHHHH---HHHHHHHHHHHHhhhh------hhhcccccCC Q lcl|NC_021301. 421 NYNAD------QIKQD---DLDRAREQITLFAGNS------VQRPQEDGSR 456 (456) Q Consensus 421 ~~~~~------~~~~~---e~~~~~ee~~~~~~~~------~~~~~~d~~~ 456 (456) +..+. ..... +.+.+.++........ 
...+..+++- T Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (532) T protein:vir:94 459 AADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSED 509 (532) T ss_pred hcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCC Confidence 54332 01111 1111111111110000 0011111110 No 99 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.63 E-value=3.3e-16 Score=105.49 Aligned_cols=386 Identities=11% Similarity=0.021 Sum_probs=192.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccC--cccchhhhhhhhh-hccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELT--RNTSAAWRSFQRE-ARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~--~~~~~~~~~~~~k-~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |+.... ...+-+-|...+.+.-....-. ....-.+..+... ..+.+++++||..+.-++.+|+. T Consensus 5 m~~~~~-------------~~~~~D~~~~~~~~~~g~~~~~~~~~~~~~~~~l~~~Y~~~~l~~~~Vd~~aed~~r~g~~ 71 (435) T protein:vir:79 5 MSDKVK-------------AITKEDGYNEIFGSKDGTFRPNAFYMQRAAFKALSQFYEEDGMARRIVDVIPEEMVTPGFK 71 (435) T ss_pred cccccc-------------cchhhcchhhhhcccccccccCcccCCcCCHHHHHHHHhcCchhhhhhccchHHhhcCCce Confidence 554421 1111122222233322211000 0000011122222 24678899999999999999999 Q ss_pred cCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc----------e-EEEEEccceeEEEEeCCC Q lcl|NC_021301. 78 VGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT----------A-TITADSPETMVVSVDPLQ 146 (456) Q Consensus 78 ~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~----------~-~i~~~~p~~~~~~~d~~~ 146 (456) +.++.+. +.+...|++=++...+.++.+.+..||.|++++-...+.. 
+ .+.+++|.++.|..-..+ T Consensus 72 i~g~~~~---~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~d 148 (435) T protein:vir:79 72 VDGVKNE---KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETN 148 (435) T ss_pred ecCCChH---HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchhhccC Confidence 8765433 4566677776788899999999999999988886532211 1 233334433322110000 Q ss_pred CceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEE-------E--E Q lcl|NC_021301. 147 PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPV-------V--V 217 (456) Q Consensus 147 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pv-------v--~ 217 (456) +.. ...|++.. |.+.. ..+.. ....|..+++.+ . + T Consensus 149 p~s---------p~fg~P~~---------y~v~~--------------~~~~~----~~~iH~SRli~~~g~~~p~~~~~ 192 (435) T protein:vir:79 149 ARS---------VRYGEPKL---------YKISP--------------GGDIP----EFFVHYSRICIIDGERVSNEKRR 192 (435) T ss_pred Ccc---------cccCcceE---------EEEec--------------CCCCC----ceEEcceeEEEecCCcchhhhcc Confidence 000 00011111 11100 00000 001122222211 1 1 Q ss_pred ccCCCCCCcH-hHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhh---hhhhhhhccceeccCC Q lcl|NC_021301. 218 YQNPDGMGEV-EPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDY---ASIFEAAPGALWELPP 293 (456) Q Consensus 218 ~~n~~g~s~~-~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 293 (456) ..|.+|.|.+ +.+.+-+.+++++.................+.++.......+........ ........+.+...+. 
T Consensus 193 ~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~ 272 (435) T protein:vir:79 193 QNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDAT 272 (435) T ss_pred ccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecC Confidence 2456788866 67778777777776655444433333333333321110000111111111 1122222445555556 Q ss_pred CceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhh-ccccc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 294 GVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPML-MPDSA--NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK 370 (456) Q Consensus 294 d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-~~~~~--N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l 370 (456) +.++.+++ .++++..+.++....+|++.+++|..-| |...+ |+||+.-..-|...+... .+..+.+.+++++.+ T Consensus 273 ~e~~e~~~-~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~--Qe~~l~p~l~~l~~l 349 (435) T protein:vir:79 273 DEEYEVLN-SDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRK--RVEDYKPILEFLLPF 349 (435) T ss_pred CcceEEEe-cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHH--HHHHHHHHHHHHHHH Confidence 66666665 4567788889999999999999998665 54332 466766544444433322 235678888888888 Q ss_pred HHHhcCCCcccceeEEecCCCCcCHHHHHH-------HHHHHHhcCCCcHHHHHHhC-------CCChhHHHHHHHHHHH Q lcl|NC_021301. 371 ALQIEGESVEDTVDVSFESPDRVTLGEKYA-------AASLAKAAGESWASIRRNIL-------NYNADQIKQDDLDRAR 436 (456) Q Consensus 371 ~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad-------~~~kl~~~g~~s~~t~~~~~-------~~~~~~~~~~e~~~~~ 436 (456) ++.- .++.+.|+|-...+.+|.|+ ++.++.++|+++.+.+++.+ |+..+.... T Consensus 350 i~~s------~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~------- 416 (435) T protein:vir:79 350 MISE------TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIE------- 416 (435) T ss_pred hhcC------CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCccccc------- Confidence 6532 35678899999888877654 45566677888876665443 222111111 Q ss_pred HHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 
437 EQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 437 ee~~~~~~~~~~~~~~d~~~ 456 (456) +....+.....++|+|.- T Consensus 417 --~~~~~d~~~~~~~e~g~~ 434 (435) T protein:vir:79 417 --LPEPEDLDPEPGQEGGLN 434 (435) T ss_pred --CCccccCCCCCCCCCCCC Confidence 111111122223333333 No 100 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.63 E-value=4.2e-16 Score=104.89 Aligned_cols=377 Identities=13% Similarity=0.060 Sum_probs=190.2 Q ss_pred HHHHHHHHHHhcccCcc---c-ccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh Q lcl|NC_021301. 21 MSRVRLLARYSNGDAPL---P-ELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD 96 (456) Q Consensus 21 ~~r~~~~~~YY~g~~~i---~-~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~ 96 (456) ..+.+-|...+-|-++= . .........+... -..+.+++++||..+.-++.+|+.+.++.+. ..+..-|++ T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~~~~~~~~~l~a~--Y~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~---~~~~~~~~~ 75 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGSLQNQAPTILASL--YADNALVRRIIDTIPETALAAGFHIDGIDDE---PAFWSRWDD 75 (422) T ss_pred CccchhhHHHHcCCCCCccccCcccccCHHHHHHH--HHhChhhHHHHhhhhHHHhcCCccccCCCHH---HHHHHHHHH Confidence 22223333333332210 0 0000111222222 1246778999999999999999998765432 345566777 Q ss_pred cChhHHHHHHHHHHhhCCeEEEEEeeCCCC----------ce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceE Q lcl|NC_021301. 97 NRMDSVCKQWVKYGLDFGESYLTCWRRDDG----------TA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESD 165 (456) Q Consensus 97 n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg----------~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~ 165 (456) -++...+.++.+.+..||.|++++-..... .+ .+.++++.++.|..-..++.. ...|++. 
T Consensus 76 l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s---------~~fg~P~ 146 (422) T protein:vir:10 76 LEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRN---------ARFGEPL 146 (422) T ss_pred hhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccc---------cccCcce Confidence 678899999999999999999988763221 11 234444444332210000000 0012222 Q ss_pred EEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEE---------EEccCCCCCCcHhH-HHHHHH Q lcl|NC_021301. 166 FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPV---------VVYQNPDGMGEVEP-HIDIIN 235 (456) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pv---------v~~~n~~g~s~~~~-v~~liD 235 (456) ...+-... .+. ....|..+++.+ -+..+.+|.|.+++ +.+-+. T Consensus 147 ~y~v~~~~-----------------------~~~----~~~iH~SRli~~~g~~~p~~~~~~~~~~G~S~l~~~~~~~i~ 199 (422) T protein:vir:10 147 TYRITTNE-----------------------SDM----FYDVHYSRIHIIDGERIPNVMRRQNDGWGRSVLSSDILDSIK 199 (422) T ss_pred EEEEecCC-----------------------CCc----ceeeccceeEEeCCCCchhhhcccCCcccchhHHHHHHHHHH Confidence 11111000 000 001122222211 12244578888886 567777 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhh---hhhhccceeccCCCceeEeecccchHHHHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASI---FEAAPGALWELPPGVDIWESQTNDFTPMLSAI 312 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l 312 (456) +++++.-...............+.|...-....+........... 
.....+.+...+.+.++.+++ .++++..+.+ T Consensus 200 ~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~-~~lsgl~~~~ 278 (422) T protein:vir:10 200 DYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLN-SDIGGIDAFL 278 (422) T ss_pred HHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEe-cccCChHHHH Confidence 777666654443333333333333221100001111111111121 122234444555556665554 4566788888 Q ss_pred HHHHHHHHhhcCCChhhh-ccccc--CcHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcCCCcccceeEEec Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPML-MPDSA--NQSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILVKALQIEGESVEDTVDVSFE 388 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~-~~~~~--N~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~ 388 (456) ......|++.++||..-| |...+ |+||+.-..-+.. .|+.+| ..+.+.+++++.+++.- .++.+.|+ T Consensus 279 ~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd---~i~~~Qe~~l~p~l~~l~~~i~~s------~~~~~~f~ 349 (422) T protein:vir:10 279 DKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHK---LVDRKRNAELLPILEFLIPFIVNA------EEWSVEFN 349 (422) T ss_pred HHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhccc------CCcEEEeC Confidence 999999999999998755 44322 4566654433433 333344 56788899998887631 35778899 Q ss_pred CCCCcCHHHHH-------HHHHHHHhcCCCcHHHHHHhCCC----C--hhHHHHHHHHHHHHHHHHHhhhhhhhcccc Q lcl|NC_021301. 389 SPDRVTLGEKY-------AAASLAKAAGESWASIRRNILNY----N--ADQIKQDDLDRAREQITLFAGNSVQRPQED 453 (456) Q Consensus 389 ~~~~~~~~e~a-------d~~~kl~~~g~~s~~t~~~~~~~----~--~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d 453 (456) |-...+.+|.| ++..++.++|+++.+.+++.|.- . .+.+..++.+..+. 
.......|++| T Consensus 350 pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~d 422 (422) T protein:vir:10 350 PLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDGSVETEVTISET-----SNDPLEVPTDD 422 (422) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCCCCccccchhhc-----CCCCCCCCCCC Confidence 98888888654 55666777898888777655411 0 11111111111110 11222334444 No 101 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.63 E-value=6.1e-16 Score=103.99 Aligned_cols=384 Identities=12% Similarity=0.030 Sum_probs=186.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccC-cccch--hhhhhhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELT-RNTSA--AWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~-~~~~~--~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |.-=+- +-|....-|..+=...+ ..... .+-.. -..+.+++++||..+.-++.+++. T Consensus 1 ~~~~~~------------------d~~~~~~~~~~~~~~~~~~~~~~~~~l~a~--Y~~~~l~~~~Vd~~aed~~r~g~~ 60 (427) T protein:vir:10 1 MKIVKH------------------DGYNDIFNGGADGSPKPFFMSDASYHVGSF--YNDNATAKRIVDVIPEEMVTAGFK 60 (427) T ss_pred CCcccc------------------chHHHHhhcCCCCcccCccccCchHHHHHH--HHcCchhhhhhccchHHhhcCCcc Confidence 111111 11111111211000000 00111 12222 124677899999999999999999 Q ss_pred cCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc-----------eEEEEEccceeEEEEeCCC Q lcl|NC_021301. 78 VGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT-----------ATITADSPETMVVSVDPLQ 146 (456) Q Consensus 78 ~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~-----------~~i~~~~p~~~~~~~d~~~ 146 (456) +.++.+. +.+...|++-++...+.++.+.+..||.|++++-.+.+.. 
..+.++++.++.|..-..+ T Consensus 61 i~g~~~~---~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~d 137 (427) T protein:vir:10 61 MSGVKDE---KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTN 137 (427) T ss_pred ccCccHH---HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccC Confidence 8765432 4566777777788999999999999999999886643221 1133333333322110000 Q ss_pred CceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEE---------E Q lcl|NC_021301. 147 PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVV---------V 217 (456) Q Consensus 147 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv---------~ 217 (456) +.. ...|++.+..+-... +. .....|..+++.+. + T Consensus 138 p~s---------~~fg~P~~y~v~~~~------------------------~~---~~~~iH~SRli~~~g~~~p~~~~~ 181 (427) T protein:vir:10 138 ARS---------PRYGEPEIYKVSPGD------------------------NM---QPYLIHHSRVFIADGERVAQQARK 181 (427) T ss_pred ccc---------cccCcceEEEEecCC------------------------CC---cceEEccccEEEecCCCchhhhcc Confidence 000 001122211110000 00 00011222222111 1 Q ss_pred ccCCCCCCcHhH-HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhh---hhhhccceeccCC Q lcl|NC_021301. 218 YQNPDGMGEVEP-HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASI---FEAAPGALWELPP 293 (456) Q Consensus 218 ~~n~~g~s~~~~-v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 293 (456) ..+.+|.|.+.. +.+-+.+++++.-...............+.|+..-....+........... .....+.+...+. 
T Consensus 182 ~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~ 261 (427) T protein:vir:10 182 QNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE 261 (427) T ss_pred cCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecC Confidence 235578887764 667666777666544333333333333333321111111111111222221 2222344555555 Q ss_pred CceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhh-ccccc--CcHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_021301. 294 GVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPML-MPDSA--NQSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILV 369 (456) Q Consensus 294 d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-~~~~~--N~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l~~~~~ 369 (456) +.++.+++ .++++..+.+.....+||+.++||..-| |...+ |+||+.-..-|... |+.+| ..+.+.+++++. T Consensus 262 ~e~~e~~~-~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~---i~~~Qe~~l~p~l~~l~~ 337 (427) T protein:vir:10 262 TEEYDVLN-SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKL---VDRKREEDYRPLLEFLLP 337 (427) T ss_pred CCceeEEe-cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHH---HHHHHHHHHHHHHHHHHH Confidence 56665554 4567788889999999999999998755 54332 56677544444433 33333 567888999888 Q ss_pred HHHHhcCCCcccceeEEecCCCCcCHHHHH-------HHHHHHHhcCCCcHHHHHHhC----CCChh-HHHHHHHHHHHH Q lcl|NC_021301. 370 KALQIEGESVEDTVDVSFESPDRVTLGEKY-------AAASLAKAAGESWASIRRNIL----NYNAD-QIKQDDLDRARE 437 (456) Q Consensus 370 l~~~~~~~~~~~~i~v~f~~~~~~~~~e~a-------d~~~kl~~~g~~s~~t~~~~~----~~~~~-~~~~~e~~~~~e 437 (456) +++.- .++.+.|+|-...+.+|.| ++..++.++|+++.+.+++.| +.+.- .......+...+ T Consensus 338 ~i~~s------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~ 411 (427) T protein:vir:10 338 FIVDE------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEE 411 (427) T ss_pred HhhcC------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccch Confidence 86632 3578899999888888765 556666778888877766543 11100 000000111111 Q ss_pred HHHHHhhhhhhhcccc Q lcl|NC_021301. 
438 QITLFAGNSVQRPQED 453 (456) Q Consensus 438 e~~~~~~~~~~~~~~d 453 (456) +.+..........++| T Consensus 412 ~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 412 TTEPEPGLGEKLEDEN 427 (427) T ss_pred hcCCCCCCCCCCCCCC Confidence 1111111111111222 No 102 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.60 E-value=4.8e-16 Score=104.59 Aligned_cols=403 Identities=12% Similarity=0.029 Sum_probs=192.6 Q ss_pred CCCCC-HHHHHHHHHHHHHHHHHHH--------HHHH-------------HHhcccCcccccCcccchhhhhhhhhhccC Q lcl|NC_021301. 1 MTAST-PAEWLPVLTKRIDDGMSRV--------RLLA-------------RYSNGDAPLPELTRNTSAAWRSFQREARTN 58 (456) Q Consensus 1 ~~~~t-~~~~~~~l~~~~~~~~~r~--------~~~~-------------~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n 58 (456) |...- ..+.++.....-....+.. +-+. .|+-.+.-..+........+........+. T Consensus 68 ~~~~~~~~~~~~~~~a~~~a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~alY~~~~ 147 (862) T protein:vir:99 68 ISDSVNAKSVSGKNFAMDSAVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACALIAQHW 147 (862) T ss_pred ccccccchhhhhhhhcchhhcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHHHHhCc Confidence 11111 1111111111111100000 0011 111111000000000011111111123467 Q ss_pred hHHHHHHHHHhhhccCCeecCCCCc-----ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC-CCCce---- Q lcl|NC_021301. 59 WGLMVRDSVADRIIPNGITVGGSAD-----SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR-DDGTA---- 128 (456) Q Consensus 59 ~~~~iVd~~a~~l~~~~~~~~~~~d-----~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d-~dg~~---- 128 (456) ++++|||..+.-++-+++.+.+..| .+..+.+.+.|++-++...+.++.+++..||.+++++..+ .|+.. 
T Consensus 148 larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqP 227 (862) T protein:vir:99 148 LVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKP 227 (862) T ss_pred hhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcC Confidence 8899999999999999998876432 3345667888888888899999999999999987776543 23221 Q ss_pred ------------EEEEEccceeEEEEeCCCCceEEEEEEEEEecC----CceEEEEEEcCCeEEEEEEeeeeccccccee Q lcl|NC_021301. 129 ------------TITADSPETMVVSVDPLQPWRIRSAMRWWRDLD----AESDFAIVWSGDGWQKFARPCFVQSSSRRRL 192 (456) Q Consensus 129 ------------~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 192 (456) .|.+++|.++.+.- ...+..++. +++....+ .. T Consensus 228 Ln~e~I~kG~lkgl~vlDp~w~~p~~----------v~~~~~Dp~sp~yGkP~~y~I--~g------------------- 276 (862) T protein:vir:99 228 FNPDGITPGSYRGISQIDPYWMMPML----------TAESTADPSSQFFYEPEFWII--SG------------------- 276 (862) T ss_pred cCcccccccceeEEEEechhhhcccc----------cccccccccccccCCceeeee--cC------------------- Confidence 13333333333210 000001100 11111110 00 Q ss_pred eccCCCceeecccccccCceeEE-------E--EccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCC Q lcl|NC_021301. 193 VTRISDSWVPVGDAVVTGSPPPV-------V--VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGH 263 (456) Q Consensus 193 ~~~~~~~~~~~~~~~~~~~~~pv-------v--~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~ 263 (456) ...|..+++.+ . ...|-+|.|.++.+.+.+.+++++.........-+......+.+.. T Consensus 277 ------------~~IH~SRliif~g~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~- 343 (862) T protein:vir:99 277 ------------QKYHRSHLIIARGPQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAK- 343 (862) T ss_pred ------------eeeccceeEEecCCCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHh- Confidence 00111111111 0 1233479999999888888888776654443333332222222221 Q ss_pred cccccccccchhhhh---hhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhh-hccc--ccCc Q lcl|NC_021301. 
264 GLPKVDENGNAIDYA---SIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPM-LMPD--SANQ 337 (456) Q Consensus 264 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~-~~~~--~~N~ 337 (456) ...++. ...... .......| +..++.+.++.+++ .++++..+.+.....+||+.++||..- ||.. .-|+ T Consensus 344 --~l~~ed-~l~~r~~~~~~~rdN~G-i~liD~eEe~e~ls-~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnA 418 (862) T protein:vir:99 344 --AIANED-KFIQRLMFWVRYRDNHA-VKVLGTDETMEQFD-TSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNS 418 (862) T ss_pred --hhccHH-HHHHHHHHHHhccCcce-eEEecCCCceeEEe-cccCChHHHHHHHHHHHHhhhCCCceeecccCcccccC Confidence 111111 111112 22222223 45566666666554 446677788888899999999999874 5543 2356 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH-------HHHHHHhcCC Q lcl|NC_021301. 338 SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA-------AASLAKAAGE 410 (456) Q Consensus 338 Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad-------~~~kl~~~g~ 410 (456) ||+.-..-|...+.-.+ +..+.+.|++++.++....|. +.++++.|++-...+.+|.|+ ++.++.++|+ T Consensus 419 TGE~D~~nYyD~I~s~Q--E~~L~P~LerL~~li~~~lg~--~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGv 494 (862) T protein:vir:99 419 TGEFETISYHEELESIQ--EHVYMPFLQRHYLISRLSLGI--QHEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGV 494 (862) T ss_pred chHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhcCC--CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC Confidence 77754443443333222 356788888888776544443 346889999998889888764 4667788899 Q ss_pred CcHHHHHHhC------CCC---hhHHHHH---HHHHHH---HH---------HHHHhhhhhhhcccccCC Q lcl|NC_021301. 411 SWASIRRNIL------NYN---ADQIKQD---DLDRAR---EQ---------ITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 411 ~s~~t~~~~~------~~~---~~~~~~~---e~~~~~---ee---------~~~~~~~~~~~~~~d~~~ 456 (456) ++...++..| |+. +++++.. ..+... +. .+..++......+-+++. 
T Consensus 495 ispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~e~d~~~ 564 (862) T protein:vir:99 495 ISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKETQAGAAVTTAEGDQPN 564 (862) T ss_pred CCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCcccccccccccccccCCccccCCccc Confidence 9987777643 332 2222110 000000 00 000000010111111111 No 103 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.60 E-value=2.7e-14 Score=94.96 Aligned_cols=437 Identities=12% Similarity=0.011 Sum_probs=190.5 Q ss_pred CC--------CCCHHHHHHHHHHH--HHHHHHHHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHHHHHHHHHh Q lcl|NC_021301. 1 MT--------ASTPAEWLPVLTKR--IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGLMVRDSVAD 69 (456) Q Consensus 1 ~~--------~~t~~~~~~~l~~~--~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~~iVd~~a~ 69 (456) |. .-+...+...+... +...+..-.+-.+||+|.+ .+......++.. ...++.|.++.+|+..++ T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~r~~a~~d~~fy~G~Q----W~~~~~~~l~~~g~p~~~~N~i~~~v~~v~g 86 (772) T protein:vir:10 11 LNGLPPAGDTPLTVDEYADINYEIEDQPAWRAVADKEMDYADGNQ----LDTELLRRQQALGIPPAVEDLIGPALLSLQG 86 (772) T ss_pred hccCCcccccccCHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCC----CCHHHHHHHHhcCCCcEEEcchHHHHHHHHH Confidence 22 22223333222211 1222233345578999975 222222222211 124778999999999999 Q ss_pred hhccCCee--cCC---CCcccHHHHH----HHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---CceEEEEEccce Q lcl|NC_021301. 70 RIIPNGIT--VGG---SADSDLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTATITADSPET 137 (456) Q Consensus 70 ~l~~~~~~--~~~---~~d~~~~~~l----~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~~i~~~~p~~ 137 (456) +..-+... +.. 
..|.+..+.+ ..++..|+++..+..+..+++++|.+|+-++.+.| +.+++..++|.+ T Consensus 87 ~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i~~v~p~~ 166 (772) T protein:vir:10 87 YEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPYRCRPIRRDE 166 (772) T ss_pred HHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCeEEEeeCccc Confidence 99776533 222 1233333333 44666799999999999999999999998887654 357899999999 Q ss_pred eEEEEeCCCCceEE----EEEEEEEecCC--------------------------------ce----E------------ Q lcl|NC_021301. 138 MVVSVDPLQPWRIR----SAMRWWRDLDA--------------------------------ES----D------------ 165 (456) Q Consensus 138 ~~~~~d~~~~~~~~----~~~~~~~~~d~--------------------------------~~----~------------ 165 (456) + +||+.....+. .++..|.+.+. .. . T Consensus 167 v--~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (772) T protein:vir:10 167 I--HWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTV 244 (772) T ss_pred c--eecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhhcccc Confidence 4 57775432111 11111111000 00 0 Q ss_pred -EEEEEcCC-eEEEEEEeeeec--------ccccce-----------------------------eeccC-CCceeeccc Q lcl|NC_021301. 166 -FAIVWSGD-GWQKFARPCFVQ--------SSSRRR-----------------------------LVTRI-SDSWVPVGD 205 (456) Q Consensus 166 -~~~~~~~~-~~~~~~~~~~~~--------~~~~~~-----------------------------~~~~~-~~~~~~~~~ 205 (456) ....|... .-++....++.. ...+.. ..... +...+..+. T Consensus 245 ~~~~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~ 324 (772) T protein:vir:10 245 QEDHWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCLHDGP 324 (772) T ss_pred ccccccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecceeeccCC Confidence 00000000 000000000000 000000 00001 111111122 Q ss_pred ccccCceeEEEEc-c-----CCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccc-ccccccchhhhh Q lcl|NC_021301. 
206 AVVTGSPPPVVVY-Q-----NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLP-KVDENGNAIDYA 278 (456) Q Consensus 206 ~~~~~~~~pvv~~-~-----n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~-~~~~~~~~~~~~ 278 (456) .+...+.+|+|++ . .....|-+..+++.++.+|...|.+...+.- .+ ++ +..+.. ..+ ... . T Consensus 325 ~p~~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~--~~--~~--~~~gav~~~d--~~~---~ 393 (772) T protein:vir:10 325 TPYTHRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSV--AR--VE--RTKGAVAMTD--AQF---R 393 (772) T ss_pred CCCCCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhc--cc--cc--ccCCCccchh--HHH---H Confidence 2223333444432 1 1123367888999999999999987664422 22 11 111111 111 000 0 Q ss_pred hhhhhhccceeccCC------CceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHH Q lcl|NC_021301. 279 SIFEAAPGALWELPP------GVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLF 351 (456) Q Consensus 279 ~~~~~~~~~~~~~~~------d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~ 351 (456) . -.+.++.+...++ +.++...+.. -...+...+......|-.+||+.+.++|..++..||+|+......-.. T Consensus 394 e-~~arp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~ 472 (772) T protein:vir:10 394 R-QIARPDADIVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQSNQ 472 (772) T ss_pred H-hccCCCCeEEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHH Confidence 1 1112233333332 2333222222 246788888888889999999999999977666799998877666555 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh----cCC--------Ccc----cce---eEEe----------cC-----------CC Q lcl|NC_021301. 352 KCEDRLSIAKIGLEAILVKALQI----EGE--------SVE----DTV---DVSF----------ES-----------PD 391 (456) Q Consensus 352 k~~~~~~~f~~~l~~~~~l~~~~----~~~--------~~~----~~i---~v~f----------~~-----------~~ 391 (456) ........+..+.+++.++++.+ -+. .+. ..+ ...+ ++ .. 
T Consensus 473 ~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~ 552 (772) T protein:vir:10 473 SIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDV 552 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEeecc Confidence 55555555666666655544432 110 000 000 0000 00 11 Q ss_pred CcCH---HHHHHHHHHHHhcCCCcH--H----HHHHhCCC-----------------ChhHHHHHHHHHHHHHHH-HHhh Q lcl|NC_021301. 392 RVTL---GEKYAAASLAKAAGESWA--S----IRRNILNY-----------------NADQIKQDDLDRAREQIT-LFAG 444 (456) Q Consensus 392 ~~~~---~e~ad~~~kl~~~g~~s~--~----t~~~~~~~-----------------~~~~~~~~e~~~~~ee~~-~~~~ 444 (456) |... .+.++.+.++... +.+. . .+++...+ ++++..+...+.++++.. ...+ T Consensus 553 p~~~t~r~~~~~~m~ql~~~-~~P~~~~~~~~~~le~~D~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~qq~~~~~~~e 631 (772) T protein:vir:10 553 PSTNSYRGQQLNAMSEAVKS-MPPQYQAAVLPFLVSLMDVPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQDALAKAGND 631 (772) T ss_pred ccchHHHHHHHHHHHHHHhc-cChhHHHHHHHHHHhhcCCCChHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHH Confidence 1111 2333444443322 1111 0 11122222 111110000000000000 0000 Q ss_pred h-----h--hhhcccccCC Q lcl|NC_021301. 445 N-----S--VQRPQEDGSR 456 (456) Q Consensus 445 ~-----~--~~~~~~d~~~ 456 (456) . . ....+.+..+ T Consensus 632 l~~~q~~a~~~~~~A~a~~ 650 (772) T protein:vir:10 632 IKLRELEIKERKADSEISG 650 (772) T ss_pred HHHHHHHHHHHHHHHHHHH Confidence 0 0 0000000101 No 104 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.59 E-value=2.5e-15 Score=100.66 Aligned_cols=396 Identities=11% Similarity=0.041 Sum_probs=200.4 Q ss_pred CCCCCHHHHHHHHHHHHHH---HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDD---GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~---~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |++-.+.-.++.+...... ...+ ..+..||....-+ + .++-... ..+.+++++||..+.-++.+++. T Consensus 71 ~ds~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~f~---g----yql~alY--~~~~l~rkiVd~pAeDa~R~g~~ 140 (765) T protein:vir:96 71 MDSAYGDGPTPAAKAAAGGQNPYVVP-TMLQDWYNSQGFI---G----YQACAII--SQHWLVDKACSMSGEDAARNGWE 140 (765) T ss_pred ccccccccccchHHHhhhccCccchh-hHHHhhhcccCCc---c----HHHHHHH--HhCchhhhhhhcchHHhhcCCce Confidence 5443333333333322211 1111 1123344332211 1 1122221 24678999999999999999999 Q ss_pred cCCCCc---ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC-CCceE----------------EEEEccce Q lcl|NC_021301. 78 VGGSAD---SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD-DGTAT----------------ITADSPET 137 (456) Q Consensus 78 ~~~~~d---~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~-dg~~~----------------i~~~~p~~ 137 (456) +.++.+ .+..+.+.+.|++-++...+.++.+++..||.+|+++-.+. |+... |.+++|.+ T Consensus 141 I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~ 220 (765) T protein:vir:96 141 LKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKGISQIDPYW 220 (765) T ss_pred eecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeEEEEechhh Confidence 876533 22345577777777888999999999999999988776542 22211 22223322 Q ss_pred eEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEE- Q lcl|NC_021301. 138 MVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVV- 216 (456) Q Consensus 138 ~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv- 216 (456) +.+.... .+..++ ..-..|-+ ..|.+ .+...|..+++.+. 
T Consensus 221 ~~~~~v~----------e~~~Dp----~sp~fg~P-~~y~i------------------------~g~~IH~SRli~~~g 261 (765) T protein:vir:96 221 AMPQLTA----------ESTADP----SAEHFYEP-DFWII------------------------SGKKYHRSHLVVVRG 261 (765) T ss_pred cccccch----------hccccc----cccccCcc-eeeee------------------------cCceeccceEEEecC Confidence 2221000 000000 00001111 11111 00111222222210 Q ss_pred --------EccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhh---hhhc Q lcl|NC_021301. 217 --------VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIF---EAAP 285 (456) Q Consensus 217 --------~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~---~~~~ 285 (456) ...|-+|.|.++.+.+-+.+++++.-.......-.....+.+.+.. ...++.+ .......+ .... T Consensus 262 ~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~---~l~~~~~-l~~r~~~~~~~r~n~ 337 (765) T protein:vir:96 262 PQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEK---AIANEDA-FNARLAFWIANRDNH 337 (765) T ss_pred CCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHh---hhccHHH-HHHHHHHHHHhcCCc Confidence 1123469999999988888888777554443333333222222221 1111111 11112222 2222 Q ss_pred cceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhh-ccc--ccCcHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_021301. 286 GALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPML-MPD--SANQSAEGAHNIEKGFLFKCEDRL-SIAK 361 (456) Q Consensus 286 ~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-~~~--~~N~Sg~Al~~~~~~l~~k~~~~~-~~f~ 361 (456) | ++.++.+.++.+++ .++++..+.+......|++.++||..-| |.. .-|+||+.-..-|.. .|+.+| ..+. 
T Consensus 338 g-~~~id~ee~~e~~s-~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD---~I~s~Qe~~l~ 412 (765) T protein:vir:96 338 G-VKVIGIDETMEQFD-TNLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHE---ELESIQEHIFD 412 (765) T ss_pred e-eEEecCCcceeEEe-cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHH---HHHHHHHHHHH Confidence 3 44566677776665 3577788889999999999999998554 533 236788754333333 333333 5678 Q ss_pred HHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH-------HHHHHHhcCCCcHHHHHHhCC------CC---hh Q lcl|NC_021301. 362 IGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA-------AASLAKAAGESWASIRRNILN------YN---AD 425 (456) Q Consensus 362 ~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad-------~~~kl~~~g~~s~~t~~~~~~------~~---~~ 425 (456) +.|++++.+++...+.. ..+++.|++-...+.+|.|+ +.+++.++|+++...+++.+. +. .+ T Consensus 413 p~le~L~~li~~s~~i~--~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~ 490 (765) T protein:vir:96 413 PLLERHYLLLAKSESID--VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDD 490 (765) T ss_pred HHHHHHHHHHHHhcCCC--CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCcc Confidence 88999999887654433 36889999998888887654 577778889999888877653 22 11 Q ss_pred HHHHHH-HHHHH-HHHHHHhhhhhhh------------ccccc----------CC Q lcl|NC_021301. 426 QIKQDD-LDRAR-EQITLFAGNSVQR------------PQEDG----------SR 456 (456) Q Consensus 426 ~~~~~e-~~~~~-ee~~~~~~~~~~~------------~~~d~----------~~ 456 (456) +++... .+... ++........... 
+.+.+ ++ T Consensus 491 ~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~a~p~~~eg~~~~~~~~p~~~~ 545 (765) T protein:vir:96 491 QAETEPGMSPENLAELEKAGAQSAKAKGEAERAEAQAGAVEGAGDPVPAAPRGTK 545 (765) T ss_pred ccccccCCCccccccccCCCcccccccCccccccCCCCccCCCCcccccCCcccC Confidence 111100 00000 0000000000000 00000 00 No 105 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.58 E-value=1e-13 Score=91.83 Aligned_cols=434 Identities=13% Similarity=0.070 Sum_probs=192.5 Q ss_pred CCCCCHH-----------HHHHHHHHHHHHH-------HHHHHHHHHHhcccCcccccCcccchhhhhh-hhhhccChHH Q lcl|NC_021301. 1 MTASTPA-----------EWLPVLTKRIDDG-------MSRVRLLARYSNGDAPLPELTRNTSAAWRSF-QREARTNWGL 61 (456) Q Consensus 1 ~~~~t~~-----------~~~~~l~~~~~~~-------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~-~~k~~~n~~~ 61 (456) |..+++. ++...+...+... +....+-.+||.|.+ .+......++.. ...++.|-++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~r~~a~~d~~fy~G~Q----w~~~~~~~l~~~g~p~~~~N~i~ 76 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQLLSLCSDIDSQPLWRDAANKACAYYDGDQ----LAPEVIQVLKDRGQPMTIHNLIA 76 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHHHHHHHHHHhhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHhcCCCcEEeccHH Confidence 3333221 2222222222211 222334468999976 222222222111 1247789999 Q ss_pred HHHHHHHhhhccCCee--cCC-CCcc---cHHHH----HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---Cce Q lcl|NC_021301. 62 MVRDSVADRIIPNGIT--VGG-SADS---DLALR----ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTA 128 (456) Q Consensus 62 ~iVd~~a~~l~~~~~~--~~~-~~d~---~~~~~----l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~ 128 (456) .+|+..+++..-+... +.. +.+. +..+. 
+..++..|+.+..+..+..+++++|.+|+-++.+.| +.+ T Consensus 77 ~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~~i 156 (714) T protein:vir:10 77 PTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGPEF 156 (714) T ss_pred HHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCCCe Confidence 9999999999776533 221 1111 22333 345666789999999999999999999998877654 678 Q ss_pred EEEEEccceeEEEEeCCCCc------eEEEEEEEEEecC----------------------------------------- Q lcl|NC_021301. 129 TITADSPETMVVSVDPLQPW------RIRSAMRWWRDLD----------------------------------------- 161 (456) Q Consensus 129 ~i~~~~p~~~~~~~d~~~~~------~~~~~~~~~~~~d----------------------------------------- 161 (456) ++..++|.+++ |||.... +.+. ++.|.+.+ T Consensus 157 ~i~~v~p~~v~--~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (714) T protein:vir:10 157 KVSTVSRNEVF--WDWLSREADLSDCRWLM-RRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWE 233 (714) T ss_pred EEEecChhhee--eccccccCChhhhhhhh-hhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccch Confidence 99999999965 5553221 1111 11111100 Q ss_pred ----------------Cc-eEEEEEEcCCeEEEEEEeeeeccccccee-----------------------------ecc Q lcl|NC_021301. 162 ----------------AE-SDFAIVWSGDGWQKFARPCFVQSSSRRRL-----------------------------VTR 195 (456) Q Consensus 162 ----------------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------------~~~ 195 (456) .. .....+|........... ...+... ... T Consensus 234 ~~~~~~~~~~~~~~~~~~rV~v~E~w~k~~~~~~~~~----~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~ 309 (714) T protein:vir:10 234 EYQSWDRQQNEWLQRERRRVLLQVVYYRTFERLPVIE----LSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAW 309 (714) T ss_pred hhcccccccccccccCcceEEEEEEEEeEEEEEEeec----CCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEE Confidence 00 000111111110000000 0000000 000 Q ss_pred CCCc-eeecccccccCceeEEEEc-cC---C--CCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc Q lcl|NC_021301. 
196 ISDS-WVPVGDAVVTGSPPPVVVY-QN---P--DGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV 268 (456) Q Consensus 196 ~~~~-~~~~~~~~~~~~~~pvv~~-~n---~--~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~ 268 (456) ..+. .+..+..+...+..|+|++ .. . ...|.+..+++.++.+|...|.+...+ .+. .++...+. .+.. T Consensus 310 ~~g~~~L~~~~~p~p~~~fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~--~~~~~~ga-v~~~ 384 (714) T protein:vir:10 310 FVGPHFIVDRPCSAPQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAK--RVIMDEDA-TQLS 384 (714) T ss_pred EecchhhhcCCCCCCCCceeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHH--hCC--ceeecccc-cccc Confidence 0001 1111112222333344332 11 1 234678889999999999999865543 222 23322111 1110 Q ss_pred ccccchhhhhhhhhhhccceeccCCC--------ceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHH Q lcl|NC_021301. 269 DENGNAIDYASIFEAAPGALWELPPG--------VDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSA 339 (456) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~d--------~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg 339 (456) ++ . .... ...++.+...+++ .++...+.. -...+...+......|-.+||+.+..+|..+++.|| T Consensus 385 d~--~---~~e~-~~rp~~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SG 458 (714) T protein:vir:10 385 DN--D---LMEQ-LERPDGIIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSG 458 (714) T ss_pred HH--H---HHHh-ccCCCCeEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHH Confidence 00 0 0001 1122333333221 223333323 245677788888888889999999999987777899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC---------Cccc----ceeEEe--------------- Q lcl|NC_021301. 340 EGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI----EGE---------SVED----TVDVSF--------------- 387 (456) Q Consensus 340 ~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~----~~~---------~~~~----~i~v~f--------------- 387 (456) +|+......-..........|..+.+++.++++.+ -+. .+.. 
-+.+.+ T Consensus 459 vAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~ 538 (714) T protein:vir:10 459 VAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLN 538 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeee Confidence 99888776655555555555666666655544322 110 0000 011100 Q ss_pred -------cCCCCcCHHHHHHHHHHHHhcC-----CCcHHHHHHhCCCC-hhHHH-----------------------HHH Q lcl|NC_021301. 388 -------ESPDRVTLGEKYAAASLAKAAG-----ESWASIRRNILNYN-ADQIK-----------------------QDD 431 (456) Q Consensus 388 -------~~~~~~~~~e~ad~~~kl~~~g-----~~s~~t~~~~~~~~-~~~~~-----------------------~~e 431 (456) .+..+.-..+.++.+..+.++. .+....+++.+.+. .+++. +.. T Consensus 539 ~dv~i~~~p~~~s~r~~~~~~l~ql~~~~~p~~~~~~~~~~le~~d~p~~~ei~~~ir~~~~~~~~~~~~~~e~q~~q~~ 618 (714) T protein:vir:10 539 THIALAPVQQTPAFKAQLAQRMSEVIQGLPPQVQAVVLDLWVNLLDVPQKQEFVERIRAALGTPKSPDEMTPEEQEVAAQ 618 (714) T ss_pred EEEEEeeccCcHHHHHHHHHHHHHHHhhcCchhhhhHHHHHHHhcCCcCHHHHHHHHHHHcCCCCCccccCcchhHHHHH Confidence 1111122334444555444321 11111222322221 00000 000 Q ss_pred ---HHHHHHHHHH-HhhhhhhhcccccCC Q lcl|NC_021301. 432 ---LDRAREQITL-FAGNSVQRPQEDGSR 456 (456) Q Consensus 432 ---~~~~~ee~~~-~~~~~~~~~~~d~~~ 456 (456) ++..+.++.. .........+.+..+ T Consensus 619 ~~~~~~~q~~l~~~e~~a~~~k~eaea~~ 647 (714) T protein:vir:10 619 QQALQQQQAELQMREMAGRVAKLEADAAR 647 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000000 001111111111122 No 106 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.57 E-value=3.8e-14 Score=94.19 Aligned_cols=441 Identities=9% Similarity=-0.078 Sum_probs=190.6 Q ss_pred CCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~-------~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) |.++ ...+.++...+.. .+.....=.+||.|.+ .+......++. ..+.++|.++.+|+...++-.- T Consensus 1 m~d~--~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Q----w~~~~~~~l~~-q~rp~~N~i~~~i~~v~g~~~~ 73 (725) T protein:vir:77 1 MADN--ENRLESILSRFDADWTASDEARREAKNDLFFSRVSQ----WDDWLSQYTTL-QYRGQFDVVRPVVRKLVSEMRQ 73 (725) T ss_pred CCch--HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCC----CCHHHHHHHHh-cCCCccccHHHHHHHHHhhHHh Confidence 7664 3334444444322 2223334468999975 22222222222 2235679999999999998765 Q ss_pred CC--eecC--CCCcccHHHHHH----HHHHhcChhHHHHHHHHHHhhCCeEEEEEee---CCC---CceEEEEE----cc Q lcl|NC_021301. 74 NG--ITVG--GSADSDLALRAR----RIWRDNRMDSVCKQWVKYGLDFGESYLTCWR---RDD---GTATITAD----SP 135 (456) Q Consensus 74 ~~--~~~~--~~~d~~~~~~l~----~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~---d~d---g~~~i~~~----~p 135 (456) +. +.+. ...|.+..+.+. .+...|+.+...+.+..+++++|.+|+-|.. +++ +.++|... +| T Consensus 74 nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~~ 153 (725) T protein:vir:77 74 NPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSAC 153 (725) T ss_pred CCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccCh Confidence 53 2222 223333333333 4455688999999999999999999987643 222 33444433 34 Q ss_pred ceeEEEEeCCCCceE-----EEEEEEEEecC--------------------------------CceEEEEEEcCCeEEEE Q lcl|NC_021301. 136 ETMVVSVDPLQPWRI-----RSAMRWWRDLD--------------------------------AESDFAIVWSGDGWQKF 178 (456) Q Consensus 136 ~~~~~~~d~~~~~~~-----~~~~~~~~~~d--------------------------------~~~~~~~~~~~~~~~~~ 178 (456) .+ ++|||...+.- .+++..|.+.+ .......+|....+... 
T Consensus 154 ~~--v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~~~ 231 (725) T protein:vir:77 154 SH--VIWDSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKKET 231 (725) T ss_pred hh--ceeCchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEeeE Confidence 44 34666433210 01122222211 00001111111000000 Q ss_pred EEeeeecccccce-------------------------------e--eccCCCceeecccccccCceeEEEEc---c-C- Q lcl|NC_021301. 179 ARPCFVQSSSRRR-------------------------------L--VTRISDSWVPVGDAVVTGSPPPVVVY---Q-N- 220 (456) Q Consensus 179 ~~~~~~~~~~~~~-------------------------------~--~~~~~~~~~~~~~~~~~~~~~pvv~~---~-n- 220 (456) .. .......+.. . .....+.....+..+..++.+|+|++ . . T Consensus 232 ~~-~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~ 310 (725) T protein:vir:77 232 AF-IYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFV 310 (725) T ss_pred EE-EecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCcCCCCccceEEEeeeeecc Confidence 00 0000000000 0 00011222222222233333444432 1 1 Q ss_pred ---CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCC--- Q lcl|NC_021301. 221 ---PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPG--- 294 (456) Q Consensus 221 ---~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d--- 294 (456) +.+.|-+..+++.++.+|...|.+.......... ...+.. + ..+........................+ T Consensus 311 ~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~--~~~~~~-~--~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 385 (725) T protein:vir:77 311 EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKK--KPFFWP-E--QIAGFEHMYDGNDDYPYYLLNRTDENSGDLP 385 (725) T ss_pred CCcccccchhhhhhhHHHHHHHHHHHHHHHHHhcccc--ccccch-h--hhhHHHHHHHhccCCceecccccccCCCccc Confidence 2344778899999999999999866544332211 111110 0 0110000000000000000000111111 Q ss_pred -ceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 
295 -VDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 295 -~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) .++..++..+ ...+...+......|-.+||+.+..+|..+++.||+|+..........+......|..+.+++.++++ T Consensus 386 ~~~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL 465 (725) T protein:vir:77 386 TQPLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQ 465 (725) T ss_pred ccCccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2233333333 35677778888888889999999999988777899999998887777777777777777777766554 Q ss_pred Hh----c---------CCC-cc--------------------------cceeEEecCCCCcCHHHHHHHHHHHHhcCCC- Q lcl|NC_021301. 373 QI----E---------GES-VE--------------------------DTVDVSFESPDRVTLGEKYAAASLAKAAGES- 411 (456) Q Consensus 373 ~~----~---------~~~-~~--------------------------~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~- 411 (456) .+ - |.. .. +++.|.=.|..+.-..+.++.+..+.++.-. T Consensus 466 ~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~ 545 (725) T protein:vir:77 466 SIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQG 545 (725) T ss_pred HHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhcccc Confidence 32 1 110 00 1111111111111122344444454433110 Q ss_pred -cH--HHHHHhCCCChhHHHHHHHHHHHHHHHHHhh-------------------------------hhhhhcccccCC Q lcl|NC_021301. 412 -WA--SIRRNILNYNADQIKQDDLDRAREQITLFAG-------------------------------NSVQRPQEDGSR 456 (456) Q Consensus 412 -s~--~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~-------------------------------~~~~~~~~d~~~ 456 (456) +. .++...+...+.+..++.+++++.+...... 
......+.+..| T Consensus 546 ~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e~~q~q~~~~~~qa~~~k 624 (725) T protein:vir:77 546 TPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAELAK 624 (725) T ss_pred chhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHH Confidence 10 1111111111111111111111111110000 000011111111 No 107 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.56 E-value=8.9e-14 Score=92.13 Aligned_cols=446 Identities=11% Similarity=0.002 Sum_probs=194.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHH-------HHHHHHHh--cccCcccccCcccchhhhhh-----hhhhccChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSR-------VRLLARYS--NGDAPLPELTRNTSAAWRSF-----QREARTNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r-------~~~~~~YY--~g~~~i~~~~~~~~~~~~~~-----~~k~~~n~~~~iVd~ 66 (456) |+..+. +++.++...+.....+ ...=.+|| .|.+ .+......++.. ...++.|-++.+|+. T Consensus 1 m~~~~~-~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~Q----W~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~ 75 (708) T protein:vir:10 1 MAETLE-KKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQ----WEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) T ss_pred CchhHH-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----CCHHHHHHHHHhhhhcCCCceEEcchHHHHHH Confidence 777666 4667666665433221 11112355 5654 222211122211 124678999999999 Q ss_pred HHhhhccCCee--cC--C-CCcccHHHHH----HHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC---C-------CCc Q lcl|NC_021301. 67 VADRIIPNGIT--VG--G-SADSDLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR---D-------DGT 127 (456) Q Consensus 67 ~a~~l~~~~~~--~~--~-~~d~~~~~~l----~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d---~-------dg~ 127 (456) .+++-.-+... +. + .+|.+..+.+ ..++..|+.+..++.+..+++++|.+|+-+..| + .+. 
T Consensus 76 v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~~i 155 (708) T protein:vir:10 76 IIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRI 155 (708) T ss_pred HHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCcccc Confidence 99998766522 22 1 2233334333 345667899999999999999999999877543 1 122 Q ss_pred eEEEEEccceeEEEEeCCCCce-----EEEEEEEEEecC-----------------C-----------ceEEEEEEcCCe Q lcl|NC_021301. 128 ATITADSPETMVVSVDPLQPWR-----IRSAMRWWRDLD-----------------A-----------ESDFAIVWSGDG 174 (456) Q Consensus 128 ~~i~~~~p~~~~~~~d~~~~~~-----~~~~~~~~~~~d-----------------~-----------~~~~~~~~~~~~ 174 (456) +...+.+|.. -++||+...+. ...+++.|.+.+ + ...+...|.... T Consensus 156 ~i~~~~~p~~-~v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~~r~ 234 (708) T protein:vir:10 156 AIEPIYDPSR-SVWFDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYYEVR 234 (708) T ss_pred ceEEeecchh-hcccCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEeeeEE Confidence 2333444532 13466543210 011122221111 0 011111111000 Q ss_pred EEEEEEeeeecccccce---------------------------------eeccCCCceeecccccccCceeEEEEc-c- Q lcl|NC_021301. 175 WQKFARPCFVQSSSRRR---------------------------------LVTRISDSWVPVGDAVVTGSPPPVVVY-Q- 219 (456) Q Consensus 175 ~~~~~~~~~~~~~~~~~---------------------------------~~~~~~~~~~~~~~~~~~~~~~pvv~~-~- 219 (456) .......+......+.. ......+..+.+...+..++..|+|++ . T Consensus 235 ~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~~p~~~fP~vP~~g~ 314 (708) T protein:vir:10 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYGK 314 (708) T ss_pred EEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCCCCCCceeeEEEeee Confidence 00000000000000000 000011111122222233344455543 1 Q ss_pred -----C-CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch------hhhhhhhhhhccc Q lcl|NC_021301. 
220 -----N-PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA------IDYASIFEAAPGA 287 (456) Q Consensus 220 -----n-~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~------~~~~~~~~~~~~~ 287 (456) + +.+.|.+..+++.++.+|...|.+...+...... +++.+...-.....++... ...........|. T Consensus 315 r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~-~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~G~ 393 (708) T protein:vir:10 315 RWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQ-IPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKSGN 393 (708) T ss_pred eeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCc-ccccChhhhhhHHHHHhhccccchhhhccccccccccc Confidence 1 1224778889999999999999887766543322 2222111000000000000 0000001111121 Q ss_pred eeccCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 288 LWELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA 366 (456) Q Consensus 288 ~~~~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 366 (456) +.. .....+.++. .....+...+......|-.+||+.+..+|. .+|.||+|+......-..........+..+.++ T Consensus 394 ~~~--~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~-~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~ 470 (708) T protein:vir:10 394 IIA--GATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKR 470 (708) T ss_pred ccc--ccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 0011122222 234557777888888888899999999986 457899999988877777777777777777777 Q ss_pred HHHHHHHh-------------cCCC-------------c---------------ccceeEEecCCCCcCHHHHHHHHHHH Q lcl|NC_021301. 367 ILVKALQI-------------EGES-------------V---------------EDTVDVSFESPDRVTLGEKYAAASLA 405 (456) Q Consensus 367 ~~~l~~~~-------------~~~~-------------~---------------~~~i~v~f~~~~~~~~~e~ad~~~kl 405 (456) +.++++.+ .|.. 
+ .++|.+.=.|..+.-..+.++.+..+ T Consensus 471 ~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~ql 550 (708) T protein:vir:10 471 AGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHH Confidence 66655432 1100 0 00111111222223334556666666 Q ss_pred HhcCCCc-HHH------HHHhCCCCh-hHH-HHH--------------------HH--HHHH-HHHHHHh---h------ Q lcl|NC_021301. 406 KAAGESW-ASI------RRNILNYNA-DQI-KQD--------------------DL--DRAR-EQITLFA---G------ 444 (456) Q Consensus 406 ~~~g~~s-~~t------~~~~~~~~~-~~~-~~~--------------------e~--~~~~-ee~~~~~---~------ 444 (456) .++..+. ..+ +++.+.+.- +++ +++ .. +..+ ++.+... + T Consensus 551 l~~~~p~~~~~~~~~~~~l~~~D~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~~ 630 (708) T protein:vir:10 551 LSSMLPTDPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAA 630 (708) T ss_pred HHhcCCCchhhHHHHHHHHHhcCCcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5543221 111 112222110 000 000 00 0000 0000000 0 Q ss_pred -hhhhhcccccCC Q lcl|NC_021301. 445 -NSVQRPQEDGSR 456 (456) Q Consensus 445 -~~~~~~~~d~~~ 456 (456) ....+...+..+ T Consensus 631 qAe~~ka~a~a~~ 643 (708) T protein:vir:10 631 QAEAQKATNETAQ 643 (708) T ss_pred HHHHHHHHHHHHH Confidence 001111111111 No 108 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.55 E-value=1.2e-12 Score=86.04 Aligned_cols=429 Identities=11% Similarity=0.058 Sum_probs=190.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHH--------HHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMS--------RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~--------r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |...|+.+++..|..++..-.. ...+...||.|+..-. ..+ . ..+++.+.....|+....+|. T Consensus 7 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~----~~~---~--~s~~~~~~v~~~v~~~~~~l~ 77 (705) T protein:vir:88 7 IKPMDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN----ERP---G--KSGIVSRDVQETVDWIMPSLM 77 (705) T ss_pred cccCCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc----ccC---C--CCccccHHHHHHHHHHHHHHH Confidence 7777877777776666433222 2244457999985211 111 1 124666777778888877663 Q ss_pred ----c-C-CeecC--CCCcccHHHHHHH-----HHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC-------------- Q lcl|NC_021301. 73 ----P-N-GITVG--GSADSDLALRARR-----IWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD-------------- 125 (456) Q Consensus 73 ----~-~-~~~~~--~~~d~~~~~~l~~-----~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d-------------- 125 (456) + + .|.+. ...|.+..+.+.+ +.+.|+....+..++++++++|.+++.||-+.. T Consensus 78 ~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~~~~~~ 157 (705) T protein:vir:88 78 KVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFSGLSED 157 (705) T ss_pred HhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhccCChh Confidence 2 2 23332 2344444444433 244566667788899999999999887754221 Q ss_pred ----------------------------------CceEEEEEccceeEEEEeCCCCc--eEEEEEEEEEecC-------- Q lcl|NC_021301. 126 ----------------------------------GTATITADSPETMVVSVDPLQPW--RIRSAMRWWRDLD-------- 161 (456) Q Consensus 126 ----------------------------------g~~~i~~~~p~~~~~~~d~~~~~--~~~~~~~~~~~~d-------- 161 (456) |.+++..++|+++++-.+..+.. 
..++...+.+..+ T Consensus 158 ~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~~g~~~ 237 (705) T protein:vir:88 158 MVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRLLGVPE 237 (705) T ss_pred hhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHhhcCCh Confidence 66788899999977443221111 1222222111000 Q ss_pred --------CceE--------EEE-------------EEcCC--eEEEEEEeeeeccc-----ccce-eeccCCCceeecc Q lcl|NC_021301. 162 --------AESD--------FAI-------------VWSGD--GWQKFARPCFVQSS-----SRRR-LVTRISDSWVPVG 204 (456) Q Consensus 162 --------~~~~--------~~~-------------~~~~~--~~~~~~~~~~~~~~-----~~~~-~~~~~~~~~~~~~ 204 (456) .... ... .+.+. ..+.+.. ++...+ ...+ .....++. +. T Consensus 238 ~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E-~y~~~d~~~d~~~~~~~~~~~g~~-il-- 313 (705) T protein:vir:88 238 DVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASE-CYTLLDVDGDGISELRRILYVGDY-II-- 313 (705) T ss_pred hHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEE-eeeEecccCCcceeeEEEEEeCcc-cc-- Confidence 0000 000 00000 0011110 010000 0000 00011111 00 Q ss_pred cccccCceeEEEE------ccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhh Q lcl|NC_021301. 205 DAVVTGSPPPVVV------YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYA 278 (456) Q Consensus 205 ~~~~~~~~~pvv~------~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~ 278 (456) ...+.++ +|++. ..+.+|.|.++.++++++.+|...+.+..++...++|...+ ..+. + ... T Consensus 314 ~~~~~~~-~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~-~~g~----v-------~~~ 380 (705) T protein:vir:88 314 SNEPWDC-RPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVV-LDGQ----V-------NLE 380 (705) T ss_pred ccccCCC-CCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceec-cccc----c-------Ccc Confidence 1112222 33332 24557999999999999999999999988888878775443 1110 0 111 Q ss_pred hhhhhhccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhccc----ccCcHHHHHHHHHHHHHHHH Q lcl|NC_021301. 
279 SIFEAAPGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPD----SANQSAEGAHNIEKGFLFKC 353 (456) Q Consensus 279 ~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~----~~N~Sg~Al~~~~~~l~~k~ 353 (456) ..+...+|.++.......+..++..+ .......+..+...+-.+||+++...|.+ .+|.++.|+........... T Consensus 381 d~~~~~pg~vv~~~~~~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~~~~r~ 460 (705) T protein:vir:88 381 DLLTNEAAGIVRVKSMNSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTAAEQQI 460 (705) T ss_pred cccccCCCeeEEecCCCccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHHHHHHH Confidence 11223345454433333343343322 23344556667777778899999888743 23457777777777766777 Q ss_pred HHHHHHHHH-HHHHHHHHH----HHhcCCCc------------------ccceeEEecCCCCcCHHHHHHHHHHHHh--- Q lcl|NC_021301. 354 EDRLSIAKI-GLEAILVKA----LQIEGESV------------------EDTVDVSFESPDRVTLGEKYAAASLAKA--- 407 (456) Q Consensus 354 ~~~~~~f~~-~l~~~~~l~----~~~~~~~~------------------~~~i~v~f~~~~~~~~~e~ad~~~kl~~--- 407 (456) ....+.|.+ +++.+++++ ........ ...+.+.-. ....+..+....+..+.+ T Consensus 461 ~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~-~~~~~~eq~~a~l~~ll~~~q 539 (705) T protein:vir:88 461 DLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVG-IGNMNKDQQMLHLMRIWEMAQ 539 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeec-cccchHHHHHHHHHHHHHHHH Confidence 777777753 445444443 33222110 011122111 111121122211111111 Q ss_pred ----cC----CCcHH----HH---HHhCCCCh--------hHHHHHHHHHHHHHHHH-------HhhhhhhhcccccCC Q lcl|NC_021301. 408 ----AG----ESWAS----IR---RNILNYNA--------DQIKQDDLDRAREQITL-------FAGNSVQRPQEDGSR 456 (456) Q Consensus 408 ----~g----~~s~~----t~---~~~~~~~~--------~~~~~~e~~~~~ee~~~-------~~~~~~~~~~~d~~~ 456 (456) .+ .++.. +. .+.+++-. ...+....+...++... 
-.+....+.+.+..+ T Consensus 540 ~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~ 618 (705) T protein:vir:88 540 AVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALA 618 (705) T ss_pred HhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 11 11111 11 11112110 00000000000000000 011111111111111 No 109 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.50 E-value=1.9e-12 Score=84.88 Aligned_cols=447 Identities=11% Similarity=0.004 Sum_probs=187.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH----HHHH-----HHHHHHhcccCcccccCcccchhhhhh-----hhhhccChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG----MSRV-----RLLARYSNGDAPLPELTRNTSAAWRSF-----QREARTNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~----~~r~-----~~~~~YY~g~~~i~~~~~~~~~~~~~~-----~~k~~~n~~~~iVd~ 66 (456) |+..+.+ ++..+..++... .... +.-..||.|.+ .+......++.. ...++.|-++.+|+. T Consensus 1 ma~~~~~-~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Q----w~~~~~~~l~~~~q~~~rP~~~~N~i~~~i~~ 75 (708) T protein:vir:17 1 MAETLEK-KHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQ----WEGATAAGTKLDEQFEKYPKFEINKVATELNR 75 (708) T ss_pred CchhHHH-HHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCC----CCHHHHHHHHhhhhhcCCCceEEcchHHHHHH Confidence 8777664 566666554331 1111 22236899975 222222222211 124678999999999 Q ss_pred HHhhhccCC--eecCC---CCcccHHHHH----HHHHHhcChhHHHHHHHHHHhhCCeEEEEEee---CCC------Cce Q lcl|NC_021301. 67 VADRIIPNG--ITVGG---SADSDLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR---RDD------GTA 128 (456) Q Consensus 67 ~a~~l~~~~--~~~~~---~~d~~~~~~l----~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~---d~d------g~~ 128 (456) .+++-.-+. +.+.. .+|.+..+.+ ..+...|+.+..++.+..+++++|.+|+-+.. 
+++ ..+ T Consensus 76 v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i 155 (708) T protein:vir:17 76 IIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRI 155 (708) T ss_pred HHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCcccc Confidence 999876553 22221 2233334443 34555689999999999999999999886643 221 233 Q ss_pred EEEEE-cc-ceeEEEEeCCCCceE-----EEEEEEEEecC----------------------------CceEEEEEEcCC Q lcl|NC_021301. 129 TITAD-SP-ETMVVSVDPLQPWRI-----RSAMRWWRDLD----------------------------AESDFAIVWSGD 173 (456) Q Consensus 129 ~i~~~-~p-~~~~~~~d~~~~~~~-----~~~~~~~~~~d----------------------------~~~~~~~~~~~~ 173 (456) .+..+ +| .+++ |||...+.- ..+++.|.+.+ ....+...|... T Consensus 156 ~i~~~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r 233 (708) T protein:vir:17 156 AIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEV 233 (708) T ss_pred ceEeeccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEE Confidence 34333 33 4544 676542211 01112221110 001111111100 Q ss_pred eEEEEEEeeeeccccc-------------------------------ce--eeccCCCceeecccccccCceeEEEEc-c Q lcl|NC_021301. 174 GWQKFARPCFVQSSSR-------------------------------RR--LVTRISDSWVPVGDAVVTGSPPPVVVY-Q 219 (456) Q Consensus 174 ~~~~~~~~~~~~~~~~-------------------------------~~--~~~~~~~~~~~~~~~~~~~~~~pvv~~-~ 219 (456) ...............+ +. ......+..+.++..+..++..|+|++ . T Consensus 234 ~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g 313 (708) T protein:vir:17 234 RKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEHIPLIPVYG 313 (708) T ss_pred eeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCccceEEEec Confidence 0000000000000000 00 000011222222222233333444432 1 Q ss_pred ---CCCC----CCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh---hhhhhhhhcccee Q lcl|NC_021301. 
220 ---NPDG----MGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID---YASIFEAAPGALW 289 (456) Q Consensus 220 ---n~~g----~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~ 289 (456) ..+| .|-+..+++.++.+|...|.+...+......+ +|.+.+.-.....++..... ....+....+.+. T Consensus 314 ~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~-~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g 392 (708) T protein:vir:17 314 KRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQI-PIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYG 392 (708) T ss_pred ccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcc-eeechhhhhhhHHhhhhcccchhhhhhhhccCCccc Confidence 1122 36677899999999999998776654443321 22211100000000000000 0000011111111 Q ss_pred ccCCCc-eeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 290 ELPPGV-DIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI 367 (456) Q Consensus 290 ~~~~d~-~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~ 367 (456) ....++ ..+.++ ..-.+.+...+......|-.+||+.+.++|. .+|.||+|+......-..........+..+.+++ T Consensus 393 ~v~~~a~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~-~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~~ 471 (708) T protein:vir:17 393 NIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQM-PSNIAQETVNNLMNRADMASFIYLDNMAKSLKRA 471 (708) T ss_pred ccccccCCcccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 111122 1223567777888888888899999999886 4579999999877776666666666666666665 Q ss_pred HHHHHH----hc---------CC-Ccc---------------------------cceeEEecCCCCcCHHHHHHHHHHHH Q lcl|NC_021301. 368 LVKALQ----IE---------GE-SVE---------------------------DTVDVSFESPDRVTLGEKYAAASLAK 406 (456) Q Consensus 368 ~~l~~~----~~---------~~-~~~---------------------------~~i~v~f~~~~~~~~~e~ad~~~kl~ 406 (456) .++++. .- |. ... +++.+.=.+..+.-..+..+.++++. 
T Consensus 472 g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~qll 551 (708) T protein:vir:17 472 GEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNVL 551 (708) T ss_pred HHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHHHH Confidence 554432 21 10 000 00111101111111223444555554 Q ss_pred hcCCCc---HH----HHHHhCCCCh-hHH---------------------HHH--HHHHHHHH-HHH---Hhhhhhhhcc Q lcl|NC_021301. 407 AAGESW---AS----IRRNILNYNA-DQI---------------------KQD--DLDRAREQ-ITL---FAGNSVQRPQ 451 (456) Q Consensus 407 ~~g~~s---~~----t~~~~~~~~~-~~~---------------------~~~--e~~~~~ee-~~~---~~~~~~~~~~ 451 (456) ++..+. .. .+++.+.+.. +++ .++ +++..+++ .+. .++....+.+ T Consensus 552 ~~~~~~~~~~~~~~~l~l~~~D~p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~~~q 631 (708) T protein:vir:17 552 SSMLPADPMRPAIQGIILDNIDGEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVAAQ 631 (708) T ss_pred HhcCCccchhHHHHHHHHHhcCCCChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 432211 00 1112221110 000 000 00000000 000 0011101111 Q ss_pred cccCC Q lcl|NC_021301. 452 EDGSR 456 (456) Q Consensus 452 ~d~~~ 456 (456) .+..| T Consensus 632 Ae~~k 636 (708) T protein:vir:17 632 AEAQK 636 (708) T ss_pred HHHHH Confidence 11112 No 110 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.47 E-value=2.4e-12 Score=84.31 Aligned_cols=445 Identities=11% Similarity=0.009 Sum_probs=195.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHH-------HHHHHHHHHh--cccCcccccCcccchhhhhh-----hhhhccChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGM-------SRVRLLARYS--NGDAPLPELTRNTSAAWRSF-----QREARTNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~-------~r~~~~~~YY--~g~~~i~~~~~~~~~~~~~~-----~~k~~~n~~~~iVd~ 66 (456) |+.++ .+++.++...+.... .+...-.+|| .|.+ .+......++.. ...++.|.++.+|+. 
T Consensus 1 m~e~~-~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~Q----W~~~~~~~l~~~~q~~grP~~~~N~i~~~v~~ 75 (706) T protein:vir:10 1 MAESR-QKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQ----WEGATVAGTKLDEQFEKYPKFEINKVATELNR 75 (706) T ss_pred CCcch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCcc----CCHHHHHHHHhhhhhcCCCceEecchHHHHHH Confidence 88744 446666666654322 2222223555 5654 222222222211 125778999999999 Q ss_pred HHhhhccCC--eecC--CC-CcccHHHHH----HHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC---------CCce Q lcl|NC_021301. 67 VADRIIPNG--ITVG--GS-ADSDLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD---------DGTA 128 (456) Q Consensus 67 ~a~~l~~~~--~~~~--~~-~d~~~~~~l----~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~---------dg~~ 128 (456) .+++..-+. +.+. .+ .|.+..+.+ ..+...|+.+..+..+..+++++|.+|+-+..+- ++.+ T Consensus 76 v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~~i 155 (706) T protein:vir:10 76 IISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQRI 155 (706) T ss_pred HhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCccc Confidence 999987654 2222 11 233333333 3456678999999999999999999998775431 1233 Q ss_pred EEEE-EccceeEEEEeCCCCc----eE-EEEEEEEEecC------C----------------------ceEEEEEEcCCe Q lcl|NC_021301. 129 TITA-DSPETMVVSVDPLQPW----RI-RSAMRWWRDLD------A----------------------ESDFAIVWSGDG 174 (456) Q Consensus 129 ~i~~-~~p~~~~~~~d~~~~~----~~-~~~~~~~~~~d------~----------------------~~~~~~~~~~~~ 174 (456) .+.. .+|... ++|||...+ .. .++++.|.+.+ + .......|.... T Consensus 156 ~i~~v~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~~~ 234 (706) T protein:vir:10 156 AVEPIYDPARS-VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEVRK 234 (706) T ss_pred eeeeeccchhc-eecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccccc Confidence 4433 356642 456764321 11 12222222111 0 000001111110 Q ss_pred E----EEEEEee------eeccc---------ccc-------------eeeccCCCceeecccccccCceeEEEEc-cC- Q lcl|NC_021301. 
175 W----QKFARPC------FVQSS---------SRR-------------RLVTRISDSWVPVGDAVVTGSPPPVVVY-QN- 220 (456) Q Consensus 175 ~----~~~~~~~------~~~~~---------~~~-------------~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n- 220 (456) . ..|.... +.... ... .......+..+.+...+..++.+|+|++ .. T Consensus 235 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~P~vP~~g~r 314 (706) T protein:vir:10 235 ESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHIPLIPVYGKR 314 (706) T ss_pred eeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCccceEEEeecc Confidence 0 0000000 00000 000 0000011111112222233344455543 11 Q ss_pred ------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc-ccccch--h--h--hhhhhhhhccc Q lcl|NC_021301. 221 ------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV-DENGNA--I--D--YASIFEAAPGA 287 (456) Q Consensus 221 ------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~-~~~~~~--~--~--~~~~~~~~~~~ 287 (456) ....|.+..+++.++.+|..+|.+....-... ...-.|........ ..+... . . .........|. T Consensus 315 ~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~~~g~ 392 (706) T protein:vir:10 315 WFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDP--GQTPIVDMEQIRGLEQHWEGRNRKRPAFLPLRTVTDKTGN 392 (706) T ss_pred ccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcC--CcccccchhHHHHHHHHhhhcccccccchhcccccCCCCc Confidence 12347788899999999999998876543221 11111110000000 000000 0 0 00000011121 Q ss_pred eeccCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 288 LWELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA 366 (456) Q Consensus 288 ~~~~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 366 (456) +.. +...++.++. .-++.+...+......|-.+||+.+.++|. 
.+|.||+|+......-..........|..+.++ T Consensus 393 i~~--~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~-~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~~ 469 (706) T protein:vir:10 393 VVA--PANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQM-PSNVARETVNSLLNRSDMASFIYLDNMAKSLKR 469 (706) T ss_pred ccc--cccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCC-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 1112222222 123456677777888888999999999886 457999999998887777777777777788777 Q ss_pred HHHHHHHh-------------cCCCc-c---------------------------cceeEEecCCCCcCHHHHHHHHHHH Q lcl|NC_021301. 367 ILVKALQI-------------EGESV-E---------------------------DTVDVSFESPDRVTLGEKYAAASLA 405 (456) Q Consensus 367 ~~~l~~~~-------------~~~~~-~---------------------------~~i~v~f~~~~~~~~~e~ad~~~kl 405 (456) +.++++.+ .|... . +++.+.=.+..+.-..+..+.++.+ T Consensus 470 ~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~el 549 (706) T protein:vir:10 470 AGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQL 549 (706) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHHH Confidence 76655432 11000 0 0111111222333345566667666 Q ss_pred HhcCCCc-HHH------HHHhCCCC--hhHHHHH----------------HHHHH----H-H--HHHH-Hh--hhhhhhc Q lcl|NC_021301. 406 KAAGESW-ASI------RRNILNYN--ADQIKQD----------------DLDRA----R-E--QITL-FA--GNSVQRP 450 (456) Q Consensus 406 ~~~g~~s-~~t------~~~~~~~~--~~~~~~~----------------e~~~~----~-e--e~~~-~~--~~~~~~~ 450 (456) .+++.+- ..+ +++.+.+. ++-.+++ +.+.. + + +.+. +. +.-..+. T Consensus 550 ~~~~~p~~~~~~~l~~~~~~~~d~p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~~~ 629 (706) T protein:vir:10 550 LQGMLPQDPMRPALMGIIIDNMEGEGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMVVA 629 (706) T ss_pred HHhcCCcchhhHHHHHHHHhhcCccchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6643321 111 22222221 0000000 00000 0 0 0000 00 0000111 Q ss_pred ccccCC Q lcl|NC_021301. 
451 QEDGSR 456 (456) Q Consensus 451 ~~d~~~ 456 (456) +.+..| T Consensus 630 qA~~~k 635 (706) T protein:vir:10 630 QAEAQK 635 (706) T ss_pred HHHHHH Confidence 111222 No 111 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.45 E-value=2e-12 Score=84.71 Aligned_cols=435 Identities=9% Similarity=-0.048 Sum_probs=188.2 Q ss_pred CCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~-------~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) |..+ ...+.++...+.. .+.....=.+||.|.+ .+......++. ..+.++|-++.+|+...++-.- T Consensus 1 m~d~--~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Q----W~~~~~~~l~~-q~rp~~N~i~~~v~~v~g~e~~ 73 (725) T protein:vir:10 1 MADN--ENRLESILSRFDADWTASDEARREAKNDLFFSRVSQ----WDDWLSQYTTL-QYRGQFDVVRPVVRKLVSEMRQ 73 (725) T ss_pred CCch--HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHh-cCCCcccchHHHHHHHHhhHHh Confidence 7664 3344554444322 2223344468999975 22222222222 2235679999999999998765 Q ss_pred CC--eec--CCCCcccHHHHHH----HHHHhcChhHHHHHHHHHHhhCCeEEEEEe---eCCC---CceEEEEE----cc Q lcl|NC_021301. 74 NG--ITV--GGSADSDLALRAR----RIWRDNRMDSVCKQWVKYGLDFGESYLTCW---RRDD---GTATITAD----SP 135 (456) Q Consensus 74 ~~--~~~--~~~~d~~~~~~l~----~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~---~d~d---g~~~i~~~----~p 135 (456) +. +.+ ....|.+..+.+. .+...|+.+...+.+..+++++|.+|+-|. .++| +.+.|... 
++ T Consensus 74 nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~ 153 (725) T protein:vir:10 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSAC 153 (725) T ss_pred CCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccCH Confidence 43 222 1223334444433 445568899999999999999999998763 3333 33444433 34 Q ss_pred ceeEEEEeCCCCce-----EEEEEEEEEecC---------C-----------ceEEEEEE-cCCeEEEEEEeee------ Q lcl|NC_021301. 136 ETMVVSVDPLQPWR-----IRSAMRWWRDLD---------A-----------ESDFAIVW-SGDGWQKFARPCF------ 183 (456) Q Consensus 136 ~~~~~~~d~~~~~~-----~~~~~~~~~~~d---------~-----------~~~~~~~~-~~~~~~~~~~~~~------ 183 (456) .+++ |||...+. ..+++..|.+.+ + ...+..-| +++.+ +....++ T Consensus 154 ~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~v-rv~E~~~r~~~~~ 230 (725) T protein:vir:10 154 SHVI--WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTI-QIAEFYEVVEKKE 230 (725) T ss_pred hHcc--cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeE-EEEEEEEEEEEee Confidence 4444 66643211 011122222210 0 00000001 11111 0000000 Q ss_pred -----ecccccce-------------------------------ee--ccCCCceeecccccccCceeEEEEc-c---C- Q lcl|NC_021301. 184 -----VQSSSRRR-------------------------------LV--TRISDSWVPVGDAVVTGSPPPVVVY-Q---N- 220 (456) Q Consensus 184 -----~~~~~~~~-------------------------------~~--~~~~~~~~~~~~~~~~~~~~pvv~~-~---n- 220 (456) .....+.. .+ ....+....++..+..++.+|+|++ . . T Consensus 231 ~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~fP~vP~~g~r~~~ 310 (725) T protein:vir:10 231 TAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFV 310 (725) T ss_pred EEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCCCCCCceeEEEEEeeeecc Confidence 00000000 00 0011111112222233343444432 1 1 Q ss_pred ---CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceec------c Q lcl|NC_021301. 
221 ---PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE------L 291 (456) Q Consensus 221 ---~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~ 291 (456) +.+.|.+..+++.++.+|...|.+.......... . ..|.. + .++.. ...+....+..+. . T Consensus 311 ~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~-~-~~~~~-~--~i~~~------e~~~~~~~~~~~~~~~~~~~ 379 (725) T protein:vir:10 311 EDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKK-K-PFFWP-E--QIAGF------EHMYDGNDDYPYYLLNRTDE 379 (725) T ss_pred CCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCc-c-ccccH-h--hhhHH------HHHHhccCCceeeecccccc Confidence 2334888899999999999999866554322211 1 11100 0 00000 0000000000000 0 Q ss_pred CC----CceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 292 PP----GVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA 366 (456) Q Consensus 292 ~~----d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 366 (456) .. ...+...+. .-+..+...+......|-.+||+.+..+|..+++.||+|+.................+..+.++ T Consensus 380 ~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~ 459 (725) T protein:vir:10 380 NNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRR 459 (725) T ss_pred cCcccccccCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 111222222 2245677788888888889999999999987777899999988877777666666777777776 Q ss_pred HHHHHHHh----cCCC--------cc----------------------------cceeEEecCCCCcCHHHHHHHHHHHH Q lcl|NC_021301. 367 ILVKALQI----EGES--------VE----------------------------DTVDVSFESPDRVTLGEKYAAASLAK 406 (456) Q Consensus 367 ~~~l~~~~----~~~~--------~~----------------------------~~i~v~f~~~~~~~~~e~ad~~~kl~ 406 (456) +.++++.+ -+.. +. +++.|.=.|..+.-..+.++.+..|. 
T Consensus 460 ~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll 539 (725) T protein:vir:10 460 DGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELL 539 (725) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHH Confidence 65555432 1100 00 11111111111111224444444554 Q ss_pred hc-CCC-c--HHHHHHhCCCChhHHHHHHHHHHHHHH-------------------------HH----Hhh--hhhhhcc Q lcl|NC_021301. 407 AA-GES-W--ASIRRNILNYNADQIKQDDLDRAREQI-------------------------TL----FAG--NSVQRPQ 451 (456) Q Consensus 407 ~~-g~~-s--~~t~~~~~~~~~~~~~~~e~~~~~ee~-------------------------~~----~~~--~~~~~~~ 451 (456) ++ +-+ + ..++...+...+-+..++..++++.+. .. ..+ ......+ T Consensus 540 ~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e~~q~~~~~~~~q 619 (725) T protein:vir:10 540 GKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQ 619 (725) T ss_pred HhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHH Confidence 32 100 1 011111111111000000011111000 00 000 0000111 Q ss_pred cccCC Q lcl|NC_021301. 452 EDGSR 456 (456) Q Consensus 452 ~d~~~ 456 (456) .+..| T Consensus 620 ae~~k 624 (725) T protein:vir:10 620 AELAK 624 (725) T ss_pred HHHHH Confidence 12222 No 112 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.44 E-value=6.3e-12 Score=82.01 Aligned_cols=438 Identities=10% Similarity=-0.028 Sum_probs=183.8 Q ss_pred CCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~-------~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) |..+ . ..+.++...+.. .+....+=.+||.|.+ .+......++. 
..+.++|-++.+|+...++-.- T Consensus 1 m~d~-~-~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Q----w~~~~~~~l~~-q~rp~~N~i~~~i~~v~g~e~~ 73 (725) T protein:vir:92 1 MADN-E-NRLESILSRFDADWTASDEARREAKNDLFFSRISQ----WDDWLSQYTTL-QYRGQFDVVRPVVRKLVSEMRQ 73 (725) T ss_pred CCch-H-HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCC----CCHHHHHHHHh-cCCCcccchHHHHHHHHhhHHh Confidence 6654 2 234444444322 2223344468999975 22222222222 2235679999999999998754 Q ss_pred CC--eec--CCCCcccHHHHH----HHHHHhcChhHHHHHHHHHHhhCCeEEEEEee---CCC---CceEEEEE---ccc Q lcl|NC_021301. 74 NG--ITV--GGSADSDLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR---RDD---GTATITAD---SPE 136 (456) Q Consensus 74 ~~--~~~--~~~~d~~~~~~l----~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~---d~d---g~~~i~~~---~p~ 136 (456) +. +.+ ....|.+..+.+ ..+...|+.+...+.+..+++++|.+|+-|.. +++ +.++|... +|. T Consensus 74 nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~~ 153 (725) T protein:vir:92 74 NPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSAC 153 (725) T ss_pred CCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCCh Confidence 43 222 122333334433 34455688999999999999999999987643 222 34444433 233 Q ss_pred eeEEEEeCCCCceE-----EEEEEEEEecC---------C-----------ceEEEEEE-cCCeEEEEEEeee------- Q lcl|NC_021301. 137 TMVVSVDPLQPWRI-----RSAMRWWRDLD---------A-----------ESDFAIVW-SGDGWQKFARPCF------- 183 (456) Q Consensus 137 ~~~~~~d~~~~~~~-----~~~~~~~~~~d---------~-----------~~~~~~~~-~~~~~~~~~~~~~------- 183 (456) .. 
++|||...+.- ..+++.|.+.+ + ...+..-| +.+.+ +....++ T Consensus 154 ~~-V~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~v-rv~e~~~r~~~~~~ 231 (725) T protein:vir:92 154 SH-VIWDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTI-QIAEFYEVVEKKET 231 (725) T ss_pred hh-cccCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeE-EEEEEEEEEEEeee Confidence 21 34565433210 01112222111 0 00000000 11111 0000000 Q ss_pred ----ecccccce-------------------------------e--eccCCCceeecccccccCceeEEEEc-cC----- Q lcl|NC_021301. 184 ----VQSSSRRR-------------------------------L--VTRISDSWVPVGDAVVTGSPPPVVVY-QN----- 220 (456) Q Consensus 184 ----~~~~~~~~-------------------------------~--~~~~~~~~~~~~~~~~~~~~~pvv~~-~n----- 220 (456) .....+.. . .....+.....+..+..++.+|+|++ .. T Consensus 232 ~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~~~~~~~P~vP~~g~r~~~~ 311 (725) T protein:vir:92 232 AFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQLIAGEHIPIVPVFGEWGFVE 311 (725) T ss_pred EEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCCCCCCceeeEEEEeeeeccC Confidence 00000000 0 00011111222222233343444432 11 Q ss_pred --CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhh--------hhhhhccceec Q lcl|NC_021301. 221 --PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYAS--------IFEAAPGALWE 290 (456) Q Consensus 221 --~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 290 (456) +.+.|.+..+++.++.+|...|.+...+...... .++ +.. + .++.......... 
......|.+ T Consensus 312 g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~-~~~-~~~-~--~i~~~~~~~~~~~~~~~~~~~~~~~~~g~~-- 384 (725) T protein:vir:92 312 DKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKK-KPF-FWP-E--QIAGFEHMYDGNDDYPYYLLNRTDENNGEM-- 384 (725) T ss_pred CcccccceeccchhHHHHHHHHHHHHHHHHHhccCc-ccc-cch-h--hhhHHHHHHhccCccceeeccccccccccc-- Confidence 2344888899999999999999865544322211 111 100 0 0000000000000 000001111 Q ss_pred cCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 291 LPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) Q Consensus 291 ~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~ 369 (456) +...+...+. .-+..+...+......|-.+||+.+..+|..+++.||+|+......-..........|..+.+++.+ T Consensus 385 --~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~ 462 (725) T protein:vir:92 385 --PTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGE 462 (725) T ss_pred --cccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111222222 2245677788888888889999999999988777899999988777666666666666666666555 Q ss_pred HHHHh----cCCC--------cc----------------------------cceeEEecCCCCcCHHHHHHHHHHHHhc- Q lcl|NC_021301. 370 KALQI----EGES--------VE----------------------------DTVDVSFESPDRVTLGEKYAAASLAKAA- 408 (456) Q Consensus 370 l~~~~----~~~~--------~~----------------------------~~i~v~f~~~~~~~~~e~ad~~~kl~~~- 408 (456) +++.+ -+.. +. 
+++.|.=.|..+.-..+.+..+..|.++ T Consensus 463 ~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~ 542 (725) T protein:vir:92 463 IYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKT 542 (725) T ss_pred HHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhc Confidence 44432 1100 00 1111111111111122344444444432 Q ss_pred CCCcH---HHHHHhCCCChhHHHHHHHHHHHHHHHH-----------------------------H--hhhhhhhccccc Q lcl|NC_021301. 409 GESWA---SIRRNILNYNADQIKQDDLDRAREQITL-----------------------------F--AGNSVQRPQEDG 454 (456) Q Consensus 409 g~~s~---~t~~~~~~~~~~~~~~~e~~~~~ee~~~-----------------------------~--~~~~~~~~~~d~ 454 (456) +-+.. .+....+...+-+...+..++++.+... + .+......+.+. T Consensus 543 ~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e~~~~qa~~~~~qae~ 622 (725) T protein:vir:92 543 PQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPAMVQAQGVLLQGQAEL 622 (725) T ss_pred ccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 10000 0111111111000000011111100000 0 000000111122 Q ss_pred CC Q lcl|NC_021301. 455 SR 456 (456) Q Consensus 455 ~~ 456 (456) .| T Consensus 623 ~k 624 (725) T protein:vir:92 623 AK 624 (725) T ss_pred HH Confidence 22 No 113 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.42 E-value=2.2e-11 Score=78.98 Aligned_cols=437 Identities=11% Similarity=0.053 Sum_probs=186.5 Q ss_pred CCCCC----------HHHHHHHHHHHH---HHHHH----HHH----------HHHHHhcccCcccccCcccchhhhhhhh Q lcl|NC_021301. 1 MTAST----------PAEWLPVLTKRI---DDGMS----RVR----------LLARYSNGDAPLPELTRNTSAAWRSFQR 53 (456) Q Consensus 1 ~~~~t----------~~~~~~~l~~~~---~~~~~----r~~----------~~~~YY~g~~~i~~~~~~~~~~~~~~~~ 53 (456) |...| ++.+...|.+++ ...+. ++. +...||.|...-.. .+..... 
.+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~~~~----~~~~~~~-rs 77 (651) T protein:vir:80 3 LATTTTDKNRQTYDETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVLRSV----GDVNADW-RH 77 (651) T ss_pred ccccccchhhhhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhcccccccc----CCCCCCC-Cc Confidence 22211 223333333333 32221 221 34567766532111 1111111 23 Q ss_pred hhccChHHHHHHHHHhhhccC-----C-eecCCCCcccHHHH----HHHHH----HhcChhHHHHHHHHHHhhCCeEEEE Q lcl|NC_021301. 54 EARTNWGLMVRDSVADRIIPN-----G-ITVGGSADSDLALR----ARRIW----RDNRMDSVCKQWVKYGLDFGESYLT 119 (456) Q Consensus 54 k~~~n~~~~iVd~~a~~l~~~-----~-~~~~~~~d~~~~~~----l~~~~----~~n~~~~~~~~~~~~a~~~G~a~~~ 119 (456) +++.+..+..|+.....|... . |.+....+.+.... +..++ .+++|......+..+++++|.|++. T Consensus 78 ~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~k 157 (651) T protein:vir:80 78 KITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLA 157 (651) T ss_pred cccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEE Confidence 578899999999888877532 1 44333333332322 34444 3677888888999999999999887 Q ss_pred EeeCC-------------------------------CCceEEEEEccceeEEEEeCCCCc--eEEEEEEEE-EecC---- Q lcl|NC_021301. 120 CWRRD-------------------------------DGTATITADSPETMVVSVDPLQPW--RIRSAMRWW-RDLD---- 161 (456) Q Consensus 120 v~~d~-------------------------------dg~~~i~~~~p~~~~~~~d~~~~~--~~~~~~~~~-~~~d---- 161 (456) ||-+. .|.|++..++|+++++ |+..+. ....+++.+ +..+ T Consensus 158 v~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~--dp~a~~~~d~~~v~~~~~t~~~l~~l 235 (651) T protein:vir:80 158 LPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY--DPNVTDPNRGAFIRKLTKTKADILNL 235 (651) T ss_pred EeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeee--cCCCcCccccceeeeeeeeHHHHHHH Confidence 75321 2568899999999774 553321 111111221 1100 Q ss_pred ---Cce---------EEE--------------------EEEcC-C--eEEEEEEeeeecccccceeeccCCCceeec-cc Q lcl|NC_021301. 
162 ---AES---------DFA--------------------IVWSG-D--GWQKFARPCFVQSSSRRRLVTRISDSWVPV-GD 205 (456) Q Consensus 162 ---~~~---------~~~--------------------~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 205 (456) |.. ... ..+.+ . .+|.+...................+..+.. .. T Consensus 236 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~g~~il~~~~ 315 (651) T protein:vir:80 236 LSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIMGNEVLRFEQ 315 (651) T ss_pred HhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEcCcEEecccc Confidence 000 000 00000 0 111111111111000010111111111111 11 Q ss_pred ccccCceeEEEEc-----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhh Q lcl|NC_021301. 206 AVVTGSPPPVVVY-----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASI 280 (456) Q Consensus 206 ~~~~~~~~pvv~~-----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~ 280 (456) .++..++|.+++. ...+|+|.++.+++.+..+|.+...+...+...+.|...+.. + + .+ .... T Consensus 316 ~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~---d-------~-~~-~~~~ 383 (651) T protein:vir:80 316 NPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRS---D-------G-LL-QPED 383 (651) T ss_pred cCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecC---C-------c-cc-cHHH Confidence 1111223322221 235799999999999999999999998888888888655421 0 0 00 0111 Q ss_pred hhhhccceeccCCCceeEeec--ccchHHHHHHHHHHHHHHHhhcCCChhhhccc---ccCcHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 281 FEAAPGALWELPPGVDIWESQ--TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPD---SANQSAEGAHNIEKGFLFKCED 355 (456) Q Consensus 281 ~~~~~~~~~~~~~d~~~~~~~--~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~---~~N~Sg~Al~~~~~~l~~k~~~ 355 (456) +...+|.++..+...++..++ ..+.......+..+...+...+|++....|.. ..+.+|.+++.....+...... 
T Consensus 384 l~~~pg~vi~~~~~~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~ 463 (651) T protein:vir:80 384 VYTEPGKVFLVSDHGDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGNRLSG 463 (651) T ss_pred hhcCCCceEEecCCCCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHHHHHH Confidence 223345555444334444442 23344444556777777777888887666532 2233445555555555555444 Q ss_pred HHHHHHH-----HHHHHHHHHHHhcCCCcc------------------cceeEEe--cCCCCcCHHH---HHHHHHHHHh Q lcl|NC_021301. 356 RLSIAKI-----GLEAILVKALQIEGESVE------------------DTVDVSF--ESPDRVTLGE---KYAAASLAKA 407 (456) Q Consensus 356 ~~~~f~~-----~l~~~~~l~~~~~~~~~~------------------~~i~v~f--~~~~~~~~~e---~ad~~~kl~~ 407 (456) .-+.|.. -++++++++......... ..+.+.+ ...-+....+ .++.+..+.+ T Consensus 464 v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~~~~q 543 (651) T protein:vir:80 464 IHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRLTFIQ 543 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHHHHHH Confidence 4444443 334555555432211100 0111111 1111111122 2222222232 Q ss_pred c-C-C--CcH-----HH---HHHhCCCCh-h-------HHHHH--HHHHHHH-H---HHHHhh-hhhhhcccccCC Q lcl|NC_021301. 408 A-G-E--SWA-----SI---RRNILNYNA-D-------QIKQD--DLDRARE-Q---ITLFAG-NSVQRPQEDGSR 456 (456) Q Consensus 408 ~-g-~--~s~-----~t---~~~~~~~~~-~-------~~~~~--e~~~~~e-e---~~~~~~-~~~~~~~~d~~~ 456 (456) . + . +.. .. .++..|+.. + +.+.. +...+.+ + .+.... 
...+.....+.+ T Consensus 544 ~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~~ 619 (651) T protein:vir:80 544 AVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGTQ 619 (651) T ss_pred hhccCCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 2 1 111 12 234456521 1 11100 0000000 0 000000 111111112222 No 114 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=99.33 E-value=7e-12 Score=81.74 Aligned_cols=401 Identities=11% Similarity=0.053 Sum_probs=194.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -+..||.+.- -...-+.-..--++-+ .+|.|.. -..+........+.-.++++.+++..+.-+.+...+ T Consensus 77 ~~~~~~~~~~-~~~~~~~~~~~~~~~l-~~~~~~~---------F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~~~~~ 145 (695) T protein:vir:36 77 VSNYTPRERR-AASYALDFNGTSMDAL-SFVTSSG---------FPGFPTLVLLAQLPEYRAMHEVLADECIRTWGEAIG 145 (695) T ss_pred ccccCccccc-hhhhhhcccccccccc-hhhhccC---------cchHHHHHHHhhccchhhHHHHHHHHhhcccceecc Confidence 3333443210 0000000000000111 1222211 011122223344555677778877777554433221 Q ss_pred C-------------------CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc----e--------- Q lcl|NC_021301. 81 S-------------------ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT----A--------- 128 (456) Q Consensus 81 ~-------------------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~----~--------- 128 (456) . .|.+..+.|..-+++-+....+.++.+++-.||.+..++-.+.++. 
| T Consensus 146 ~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~~~I~ 225 (695) T protein:vir:36 146 GTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRPYTVP 225 (695) T ss_pred cchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCcccccccccccccccc Confidence 1 1225566777777777788889999999999999987776644331 1 Q ss_pred -----EEEEEccceeEEEEeC-CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceee Q lcl|NC_021301. 129 -----TITADSPETMVVSVDP-LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVP 202 (456) Q Consensus 129 -----~i~~~~p~~~~~~~d~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (456) -+.+++|.++.|-.-+ .++- ....+++++..+-. ..++..+ T Consensus 226 kGslKGl~ViDp~~vtP~~~n~~dP~---------spdfgkP~~y~V~G--~kIH~SR---------------------- 272 (695) T protein:vir:36 226 KGSFQGLRVVEPYWVTPNNYNSINPV---------ADDFYKPSTWWMIG--TEVHATR---------------------- 272 (695) T ss_pred CcceeeeEeecccccccchhhhccch---------hhccCCCceEEEec--eEEeeee---------------------- Confidence 1455555555542100 0000 00112222222211 0011100 Q ss_pred cccccccCcee-E--EEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccccc-ch---h Q lcl|NC_021301. 203 VGDAVVTGSPP-P--VVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENG-NA---I 275 (456) Q Consensus 203 ~~~~~~~~~~~-p--vv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~-~~---~ 275 (456) .+.+.+.| | +-+..|-+|.|....+.+-+++++++.-........ +....++ ++......+... +. + T Consensus 273 ---L~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~--~~v~~lk-~dla~aL~~g~~~~l~~R~ 346 (695) T protein:vir:36 273 ---LHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ--FSVSGIL-MDLAQALMPGANVDLSMRA 346 (695) T ss_pred ---EEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh--hhHHHHH-HHHHHhhcChhHHHHHHHH Confidence 01111111 1 011123468888888888888888776655444432 2222221 010000001111 11 1 Q ss_pred hhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc---cCcHHHHHHHHHHHHHHH Q lcl|NC_021301. 
276 DYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQSAEGAHNIEKGFLFK 352 (456) Q Consensus 276 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~N~Sg~Al~~~~~~l~~k 352 (456) ......+...|.+...+.+.++.+++ ++++++-+.+.+..+.+|+.++||..-|-+.+ =|+||++=..-|...+.- T Consensus 347 eli~~~Rsn~G~~llDk~~Eefeq~s-tslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s 425 (695) T protein:vir:36 347 ELINRYRDNRNILFLDKATEEFFQFN-TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRA 425 (695) T ss_pred HHHHHhcCccceEEEecCCcceEEEe-cccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHH Confidence 12223344455444433467777776 46888999999999999999999986654332 278999766666655443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhc-CCCcccceeEEecCCCCcCHHHHHHH-------HHHHHhcCCCcHHHHHHhC---- Q lcl|NC_021301. 353 CEDRLSIAKIGLEAILVKALQIE-GESVEDTVDVSFESPDRVTLGEKYAA-------ASLAKAAGESWASIRRNIL---- 420 (456) Q Consensus 353 ~~~~~~~f~~~l~~~~~l~~~~~-~~~~~~~i~v~f~~~~~~~~~e~ad~-------~~kl~~~g~~s~~t~~~~~---- 420 (456) ..+..+.+.+++++.++..-. |.. +..+.+.|+|-...+.+|.|++ .+.+..+|+++...++.++ T Consensus 426 --~Qe~~L~p~L~rl~~ii~rS~~G~i-dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~ 502 (695) T protein:vir:36 426 --YQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEP 502 (695) T ss_pred --HHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCC Confidence 346789999999988876543 443 3468899999988888887654 3445566877765555443 Q ss_pred --CCChhHHHHH-----HHHHHHHHHHHHhhhh--hhhccc-----ccCC Q lcl|NC_021301. 421 --NYNADQIKQD-----DLDRAREQITLFAGNS--VQRPQE-----DGSR 456 (456) Q Consensus 421 --~~~~~~~~~~-----e~~~~~ee~~~~~~~~--~~~~~~-----d~~~ 456 (456) +|.....+.. ..+.+.-. ....+.. .++..+ .|+. 
T Consensus 503 ~s~Y~~~~D~~d~p~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~ 551 (695) T protein:vir:36 503 DGPYAGKLDANDDPGVPADDDIDGV-LTYVQRLAEGGDTGAPGGARAGAT 551 (695) T ss_pred CcccccccccccCCCcCccchhhhh-HhhhcCcccccccCCCCccccccc Confidence 2210000000 00000000 0011111 111111 1111 No 115 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.32 E-value=1.1e-10 Score=75.22 Aligned_cols=442 Identities=12% Similarity=0.011 Sum_probs=182.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHH-----H--HHHHHHHhc--ccCcccccCcccch--h--hhh-hhhhhccChHHHHHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMS-----R--VRLLARYSN--GDAPLPELTRNTSA--A--WRS-FQREARTNWGLMVRDS 66 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~-----r--~~~~~~YY~--g~~~i~~~~~~~~~--~--~~~-~~~k~~~n~~~~iVd~ 66 (456) |+..+.+ ++..+...+..... | ...=.+||. |.+ .+..... + ++. -...+..|-++.+|+. T Consensus 1 ma~~~~~-~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~Q----W~~~~~~~~~~~l~~~~~P~~~~N~i~~~v~~ 75 (720) T protein:vir:35 1 MAETLQK-RHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQ----WEGATAAGSELGKHFEKYPKFEINKISTELNR 75 (720) T ss_pred CchHHHH-HHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCC----CCHHHHHHHHHHHhhCCCCeEEEccHHHHHHH Confidence 7666544 55665555433221 1 111235664 653 1111111 0 000 0113667999999999 Q ss_pred HHhhhccCC--eecC---CCCcccHHHHH----HHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC----C-----Cce Q lcl|NC_021301. 67 VADRIIPNG--ITVG---GSADSDLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----D-----GTA 128 (456) Q Consensus 67 ~a~~l~~~~--~~~~---~~~d~~~~~~l----~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~----d-----g~~ 128 (456) .+++-.-+. +.+. 
..+|.+..+.+ ..+...|+.+..++.+..+++++|.+|+-|..|- + +.+ T Consensus 76 v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~~i 155 (720) T protein:vir:35 76 IISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQRI 155 (720) T ss_pred HHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCccccee Confidence 999986543 2222 11233334433 3455678999999999999999999999886531 1 123 Q ss_pred EEEEE-cc-ceeEEEEeCCCCceE-----EEEEEEEEecC------C---------------------ce-EEEEEEcCC Q lcl|NC_021301. 129 TITAD-SP-ETMVVSVDPLQPWRI-----RSAMRWWRDLD------A---------------------ES-DFAIVWSGD 173 (456) Q Consensus 129 ~i~~~-~p-~~~~~~~d~~~~~~~-----~~~~~~~~~~d------~---------------------~~-~~~~~~~~~ 173 (456) ++..+ +| .+ +.|||...+.- ..++..|.+.+ + .. .....|... T Consensus 156 ~i~~v~~~~~~--v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~~~~ 233 (720) T protein:vir:35 156 CLEPIYDPARS--VWFDPDAKKYDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYYEVK 233 (720) T ss_pred eEecccCchhh--eeecccccccChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEeeEEE Confidence 33332 23 33 34555432210 11122221111 0 00 000011000 Q ss_pred eEEEEEEeeeecccccc-------------------------------eee--ccCCCceeecccccccCceeEEEEc-c Q lcl|NC_021301. 174 GWQKFARPCFVQSSSRR-------------------------------RLV--TRISDSWVPVGDAVVTGSPPPVVVY-Q 219 (456) Q Consensus 174 ~~~~~~~~~~~~~~~~~-------------------------------~~~--~~~~~~~~~~~~~~~~~~~~pvv~~-~ 219 (456) .... ..........+. +.+ ....+..+.++..+.+++.+|+|++ . 
T Consensus 234 ~~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~~g 312 (720) T protein:vir:35 234 KESV-DVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPVYG 312 (720) T ss_pred EEEE-EEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEEEe Confidence 0000 000000000000 000 0011111112212222333444432 1 Q ss_pred C---CC----CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc-ccccch--hh----hhhhhhhhc Q lcl|NC_021301. 220 N---PD----GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV-DENGNA--ID----YASIFEAAP 285 (456) Q Consensus 220 n---~~----g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~-~~~~~~--~~----~~~~~~~~~ 285 (456) . .+ ..|.+..+++.++.+|...|.+...+-. .+...-.|........ .++..+ .. ......... T Consensus 313 ~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~--~~~~~~~~a~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 390 (720) T protein:vir:35 313 KRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQ--DTGSIPIVGKSQIKTLEKYWANRNKNRPAFLPLNEIVDKQ 390 (720) T ss_pred eeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHc--CCccccccCcchHHHHHHHhhccccccccccccccccccC Confidence 1 12 2477788999999999999988776532 2222222221110000 000000 00 000011112 Q ss_pred cceeccCCCceeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 286 GALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGL 364 (456) Q Consensus 286 ~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l 364 (456) |.+..- ..+++..+.. -...+...+..-...|-.+||+.+..+|.. +|.||+|+......-..........+..+. 
T Consensus 391 G~~~~~--~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~-sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~ 467 (720) T protein:vir:35 391 GNIIAP--PTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMP-SNIAKETVNHLMHRSDMSSFIYLDNMAKSL 467 (720) T ss_pred cccccC--CCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222111 1233333322 235566777777788888999999999864 579999999876665555555666666666 Q ss_pred HHHHHHHHHh----c---------CC-C-cc--------------------------cceeEEecCCCCcCHHHHHHHHH Q lcl|NC_021301. 365 EAILVKALQI----E---------GE-S-VE--------------------------DTVDVSFESPDRVTLGEKYAAAS 403 (456) Q Consensus 365 ~~~~~l~~~~----~---------~~-~-~~--------------------------~~i~v~f~~~~~~~~~e~ad~~~ 403 (456) +++.++++.+ - |. . +. ++|.+.=.|..+.-..+.++.+. T Consensus 468 ~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~ 547 (720) T protein:vir:35 468 KRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLT 547 (720) T ss_pred HHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHH Confidence 6655554422 1 10 0 00 01111111111222233444444 Q ss_pred HHHhcCCCcH--------HHHHHhCCCChh-HH---------------------HHHHHHHHH--HHHHH---Hhhhhhh Q lcl|NC_021301. 404 LAKAAGESWA--------SIRRNILNYNAD-QI---------------------KQDDLDRAR--EQITL---FAGNSVQ 448 (456) Q Consensus 404 kl~~~g~~s~--------~t~~~~~~~~~~-~~---------------------~~~e~~~~~--ee~~~---~~~~~~~ 448 (456) .+..+ +++. ..+++.+.+... ++ .+...+.++ ++... ..+.... T Consensus 548 qll~~-~~p~~~~~~~~~~~ile~~d~p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~ 626 (720) T protein:vir:35 548 NLLAG-MLPQDPMRQVLQGIILDNMEGEGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLM 626 (720) T ss_pred HHHHh-cCCCchhHHHHHHHHHHhcCchhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHH Confidence 44332 2111 111222222100 00 000000000 00000 0000011 Q ss_pred hcccccCC Q lcl|NC_021301. 
449 RPQEDGSR 456 (456) Q Consensus 449 ~~~~d~~~ 456 (456) +.+....| T Consensus 627 qaqae~~k 634 (720) T protein:vir:35 627 QGQAEVQK 634 (720) T ss_pred HHHHHHHH Confidence 11111111 No 116 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=99.32 E-value=1e-11 Score=80.84 Aligned_cols=409 Identities=12% Similarity=0.055 Sum_probs=195.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHH----HHHHHHhcccCcccc--cC---cccchhhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRV----RLLARYSNGDAPLPE--LT---RNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~----~~~~~YY~g~~~i~~--~~---~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) -..-.| -..|.+.|..+..-| +.-..|--+.+.+.. ++ -..-..+........+.-.++++.+++..+ T Consensus 59 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~ 135 (694) T protein:vir:10 59 VAEPSP---SLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADEC 135 (694) T ss_pred cCCCCc---chhhhhhccccccCCCccccchhhhhhccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHh Confidence 001111 122333332222111 011112111111110 00 000011222233344555677788888877 Q ss_pred ccCCeecCCC-------------------CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc----e Q lcl|NC_021301. 72 IPNGITVGGS-------------------ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT----A 128 (456) Q Consensus 72 ~~~~~~~~~~-------------------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~----~ 128 (456) .-+.+...+. .|.+..+.|..-+++-+....+.++.+++-.||.+..++-.+.++. 
| T Consensus 136 ~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~P 215 (694) T protein:vir:10 136 IRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTP 215 (694) T ss_pred hcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCccccccc Confidence 6554332211 1224556677777777788889999999999999987776544331 1 Q ss_pred --------------EEEEEccceeEEEEeC-CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceee Q lcl|NC_021301. 129 --------------TITADSPETMVVSVDP-LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) Q Consensus 129 --------------~i~~~~p~~~~~~~d~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) -+.+++|.++.|-.-+ .++- ....+++.+..+-. ..++..+ T Consensus 216 L~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~---------spdfgkP~~y~V~G--~~IH~SR------------- 271 (694) T protein:vir:10 216 LVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPV---------ADDFYKPSTWWMIG--TEVHATR------------- 271 (694) T ss_pred cccccccccCcceeeeEeecccccccchhhhccch---------hhccCCCceEEEec--eEEeeee------------- Confidence 1455555555542100 0000 00112222222211 0011100 Q ss_pred ccCCCceeecccccccCcee-E--EEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc Q lcl|NC_021301. 194 TRISDSWVPVGDAVVTGSPP-P--VVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE 270 (456) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~-p--vv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 270 (456) .+.+.+.| | +-+..|-+|.|....+.+-+++++++.-........ +....++- +......+. T Consensus 272 ------------L~~f~g~plPd~LKp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~--~~v~~lk~-dla~~L~~g 336 (694) T protein:vir:10 272 ------------LHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ--FSVSGILM-DLAQALMPG 336 (694) T ss_pred ------------EEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh--hhhHHHHH-HHHHhhcCh Confidence 01111111 1 011123468888888888888888776655444432 22232210 100000011 Q ss_pred cc-ch---hhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc---cCcHHHHHH Q lcl|NC_021301. 
271 NG-NA---IDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQSAEGAH 343 (456) Q Consensus 271 ~~-~~---~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~N~Sg~Al~ 343 (456) .. +. +......+...|.+...+.+.++.+++ ++++++-+.+.+..+.+|+.++||..-|-+.+ =|+||++=. T Consensus 337 ~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~s-tslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~ 415 (694) T protein:vir:10 337 ANVDLSMRAELINRYRDNRNILFLDKATEEFFQFN-TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEI 415 (694) T ss_pred hHHHHHHHHHHHHHhcCccceEEEecCCcceEEEe-cccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhH Confidence 11 11 112223344455444433467777776 46888999999999999999999986654332 278999766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCcccceeEEecCCCCcCHHHHHHH-------HHHHHhcCCCcHHH Q lcl|NC_021301. 344 NIEKGFLFKCEDRLSIAKIGLEAILVKALQIE-GESVEDTVDVSFESPDRVTLGEKYAA-------ASLAKAAGESWASI 415 (456) Q Consensus 344 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~-~~~~~~~i~v~f~~~~~~~~~e~ad~-------~~kl~~~g~~s~~t 415 (456) .-|...+.- ..+..+.+.+++++.++..-. |.. +..+.+.|+|-...+.+|.|++ .+.+..+|+++... T Consensus 416 rnYYD~I~s--~Qe~~L~p~L~rl~~ii~rS~~G~i-dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~e 492 (694) T protein:vir:10 416 RVWYDYVRA--YQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQ 492 (694) T ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHH Confidence 666655443 346789999999988876543 443 3468899999988888887654 34455668777655 Q ss_pred HHHhC------CCChhHHHHH-----HHHHHHHHHHHHhhhh--hhhccc-----ccCC Q lcl|NC_021301. 416 RRNIL------NYNADQIKQD-----DLDRAREQITLFAGNS--VQRPQE-----DGSR 456 (456) Q Consensus 416 ~~~~~------~~~~~~~~~~-----e~~~~~ee~~~~~~~~--~~~~~~-----d~~~ 456 (456) ++.++ +|.....+.. ..+.+.-. ....+.. .++..+ .|+. 
T Consensus 493 vr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~ 550 (694) T protein:vir:10 493 VAARLNTEPDGPYAGKLDANDDPGVPADDDIDGV-LTYVQRLAEGGDTGAPGGARAGAT 550 (694) T ss_pred HHHHHhcCCCcccccccccccCCCcCccchhhhh-HhhhcCcccccccCCCCccccccc Confidence 55443 2210000000 00000000 0011111 111111 1111 No 117 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=99.31 E-value=1.2e-11 Score=80.36 Aligned_cols=409 Identities=12% Similarity=0.058 Sum_probs=194.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHH----HHHHHHhcccCccc--cc---CcccchhhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRV----RLLARYSNGDAPLP--EL---TRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~----~~~~~YY~g~~~i~--~~---~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) ...-+| --.|.+.|+.+..-+ +.-..|--+.+.+. .+ .-..-..+........+.-.++++.+++..+ T Consensus 60 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~ 136 (695) T protein:vir:78 60 VAEPSP---SLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADEC 136 (695) T ss_pred ccCCCc---ccccceeceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHh Confidence 111111 112222222211111 00011111111100 00 0000011222233344555677788888877 Q ss_pred ccCCeecCCC-------------------CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc----e Q lcl|NC_021301. 72 IPNGITVGGS-------------------ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT----A 128 (456) Q Consensus 72 ~~~~~~~~~~-------------------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~----~ 128 (456) .-+.+...+. .|.+..+.|..-+++-+....+.++.+++-.||.+..++-.+.++. 
| T Consensus 137 ~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~P 216 (695) T protein:vir:78 137 IRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTP 216 (695) T ss_pred hcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCccccccc Confidence 5554332211 1224556677777777788889999999999999987776644331 1 Q ss_pred --------------EEEEEccceeEEEEeC-CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceee Q lcl|NC_021301. 129 --------------TITADSPETMVVSVDP-LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) Q Consensus 129 --------------~i~~~~p~~~~~~~d~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) -+.+++|.++.|-.-+ .++- ....+++.+..+-. ..++..+ T Consensus 217 L~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~---------spdfgkP~~y~V~G--~kIH~SR------------- 272 (695) T protein:vir:78 217 LVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPV---------ADDFYKPSTWWMIG--TEVHATR------------- 272 (695) T ss_pred cccccccccCcceeeeEeecccccccchhhhccch---------hhccCCCceEEEec--eEEeeee------------- Confidence 1455555555542100 0000 00112222222211 0011100 Q ss_pred ccCCCceeecccccccCcee-E--EEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc Q lcl|NC_021301. 194 TRISDSWVPVGDAVVTGSPP-P--VVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE 270 (456) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~-p--vv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 270 (456) .+.+.+.| | +-+..|-+|.|....+.+-+++++++.-........ +...+++- +......+. T Consensus 273 ------------L~~f~g~plPd~LKp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~--~~v~~lk~-dla~~L~~g 337 (695) T protein:vir:78 273 ------------LHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ--FSVSGILM-DLAQALMPG 337 (695) T ss_pred ------------EEEecCCCchhhhhcccccCcccHHHHHHHHHHHHHHHHhHHHHHHHh--hhhHHHHH-HHHHhhcCh Confidence 01111111 1 011123468888888888888888776655444422 22233210 100000011 Q ss_pred cc-ch---hhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc---cCcHHHHHH Q lcl|NC_021301. 
271 NG-NA---IDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQSAEGAH 343 (456) Q Consensus 271 ~~-~~---~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~N~Sg~Al~ 343 (456) .. +. +......+...|.+...+.+.++.+++ ++++++-+.+.+..+.+|+.++||..-|-+.+ =|+||++=. T Consensus 338 ~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~s-tslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~ 416 (695) T protein:vir:78 338 ANVDLSMRAELINRYRDNRNILFLDKATEEFFQFN-TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEI 416 (695) T ss_pred hHHHHHHHHHHHHHhcCccceEEEecCCcceEEEe-cccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhH Confidence 11 11 112223344455444433467777776 46888999999999999999999986654332 278999766 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCcccceeEEecCCCCcCHHHHHHH-------HHHHHhcCCCcHHH Q lcl|NC_021301. 344 NIEKGFLFKCEDRLSIAKIGLEAILVKALQIE-GESVEDTVDVSFESPDRVTLGEKYAA-------ASLAKAAGESWASI 415 (456) Q Consensus 344 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~-~~~~~~~i~v~f~~~~~~~~~e~ad~-------~~kl~~~g~~s~~t 415 (456) .-|...+.- ..+..+.+.+++++.++..-. |.. +..+.+.|+|-...+.+|.|++ .+.+..+|+++... T Consensus 417 rnYYD~I~s--~Qe~~L~p~L~rl~~ii~rS~~G~i-dpdi~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~e 493 (695) T protein:vir:78 417 RVWYDYVRA--YQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQ 493 (695) T ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHH Confidence 666655443 346789999999988876543 443 3468899999988888887654 34455668777655 Q ss_pred HHHhC------CCChhHHHHH-----HHHHHHHHHHHHhhhh--hhhc-----ccccCC Q lcl|NC_021301. 416 RRNIL------NYNADQIKQD-----DLDRAREQITLFAGNS--VQRP-----QEDGSR 456 (456) Q Consensus 416 ~~~~~------~~~~~~~~~~-----e~~~~~ee~~~~~~~~--~~~~-----~~d~~~ 456 (456) ++.++ +|.....+.. ..+.+.-. ....+.. .++. ...|+. 
T Consensus 494 vr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~ 551 (695) T protein:vir:78 494 VAARLNTEPDGPYAGKLDANDDPGVPADDDIDGV-LTYVQRLAEGGDTGAPGGARAGAT 551 (695) T ss_pred HHHHHhcCCCcccccccccccCCCcCccchhhhh-HhhhcCcccccccCCCCCCCCCCC Confidence 55443 2210000000 00000000 0001111 1111 112222 No 118 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=99.30 E-value=6.8e-12 Score=81.81 Aligned_cols=408 Identities=12% Similarity=0.052 Sum_probs=196.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH----HHHHHhcccCccc--ccC---cccchhhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVR----LLARYSNGDAPLP--ELT---RNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~----~~~~YY~g~~~i~--~~~---~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) ...-+| --.|.+.|+.+..-+- .-..|--+.+.+. .++ -..-..+..+.....+.-.++++.+++..+ T Consensus 60 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~ 136 (698) T protein:vir:10 60 VAEPSP---SLRLARQFEVDVSNYTPRERRAASYALDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADEC 136 (698) T ss_pred ccCCCc---cccccccceeccccCCccccchhhhhhcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHh Confidence 111111 1223333322211110 0011111111110 000 000011222233344555677788888877 Q ss_pred ccCCeecCCC-------------------CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc----e Q lcl|NC_021301. 72 IPNGITVGGS-------------------ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT----A 128 (456) Q Consensus 72 ~~~~~~~~~~-------------------~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~----~ 128 (456) .-+.+...+. .|.+..+.|..-+++-+....+.++++++-.||.+..++-.+.++. 
| T Consensus 137 ~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~P 216 (698) T protein:vir:10 137 IRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTP 216 (698) T ss_pred hcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCccccccc Confidence 5554332211 1224556677777777788889999999999999877665544331 1 Q ss_pred E--------------EEEEccceeEEEE-eCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceee Q lcl|NC_021301. 129 T--------------ITADSPETMVVSV-DPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) Q Consensus 129 ~--------------i~~~~p~~~~~~~-d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) . +.+++|.++.|-. +..++- ....+++.+..+-.. .++.. T Consensus 217 L~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~dP~---------spdfgkP~~y~V~G~--~IH~S-------------- 271 (698) T protein:vir:10 217 LVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPV---------ADDFYKPSTWWMIGS--EVHAT-------------- 271 (698) T ss_pred cccccccccCccceeeeeecccccccchhhhccch---------hhccCCCceEEEecc--eecce-------------- Confidence 1 4444555444421 000000 001122222221110 00000 Q ss_pred ccCCCceeecccccccCce-eEE--EEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhh-hcCCCcccccc Q lcl|NC_021301. 194 TRISDSWVPVGDAVVTGSP-PPV--VVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRAL-KSAGHGLPKVD 269 (456) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~-~pv--v~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i-~g~~~~~~~~~ 269 (456) ..+.+.+. +|- -+..|-+|.|....+.+-+++++++.-........ +....+ +++. . .+. T Consensus 272 -----------RL~~~vg~pvpd~LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~--~~~~~l~~dla--~-aL~ 335 (698) T protein:vir:10 272 -----------RLHTIVSRPVGDMLKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ--FSVSGILMDLA--Q-ALT 335 (698) T ss_pred -----------eEEEecCCCchhhhcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHH--hhHHHHHHHHH--H-hcC Confidence 00111111 110 11234468898888888888888776655544432 222222 1111 0 001 Q ss_pred cccc--h---hhhhhhhhhhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc---cCcHHHH Q lcl|NC_021301. 
270 ENGN--A---IDYASIFEAAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQSAEG 341 (456) Q Consensus 270 ~~~~--~---~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~N~Sg~A 341 (456) ..+. . +......+...|.+...+.+.++.+++ ++++++-+.+.+..+++|+.++||..-|-+.+ =|+||++ T Consensus 336 ~g~~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~s-t~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~ 414 (698) T protein:vir:10 336 PGANVDLSMRAELINRYRDNRNILFLDKATEEFFQFN-TPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEG 414 (698) T ss_pred ChhhHHHHHHHHHHHHhcCccceEEEecCCcceEEEe-cCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchh Confidence 1111 1 122223344455444433467777776 56888999999999999999999986654332 2789997 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCcccceeEEecCCCCcCHHHHHHHH-------HHHHhcCCCcH Q lcl|NC_021301. 342 AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE-GESVEDTVDVSFESPDRVTLGEKYAAA-------SLAKAAGESWA 413 (456) Q Consensus 342 l~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~-~~~~~~~i~v~f~~~~~~~~~e~ad~~-------~kl~~~g~~s~ 413 (456) =..-|...+.- ..+..+.+.+++++.++..-. |.. +..|.+.|+|-...+.+|.|++- ..+...|+++. T Consensus 415 D~rnYYD~I~s--~Qe~~L~p~L~rl~~ii~rS~~G~i-dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~ 491 (698) T protein:vir:10 415 EIRVWYDYVRA--YQRNALQQLMNDVIVMIQLSLFGAV-DPSIKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRP 491 (698) T ss_pred hHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhcCCC-CCcceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCH Confidence 66666655443 346788999999988876543 443 34688999999988988877653 33445687776 Q ss_pred HHHHHhC------CCCh---hH-H------HHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 414 SIRRNIL------NYNA---DQ-I------KQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 414 ~t~~~~~------~~~~---~~-~------~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ..++.+| ||.. .+ . 
..++...-.-+....++.........|++ T Consensus 492 ~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 550 (698) T protein:vir:10 492 DQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPTAPGGAR 550 (698) T ss_pred HHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCccccccccccc Confidence 5555443 2321 00 0 01111111111111222222222333444 No 119 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=99.30 E-value=2.5e-11 Score=78.69 Aligned_cols=394 Identities=11% Similarity=-0.005 Sum_probs=181.1 Q ss_pred HHHHHHHHHHHHHHHH-HHHH---------HHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMS-RVRL---------LARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~-r~~~---------~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) +=+++.|..+...... +-.. ...|+.+. .+. ....+.. ..-+.++-...+|+.+++-+..-|+ T Consensus 1 MG~f~~lf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-g~~-~~~~v~~-----~~al~~~~v~~ci~~ia~~iA~lp~ 73 (422) T protein:vir:13 1 MGFLRGLFNKKNNNDEKRSNYDEDIGIDISDSNFWEKF-GIK-LNFSVRG-----KRALKENTVYVCTKIRAESIGKLSL 73 (422) T ss_pred CchhhhhhhccCCccchhhhhhhccccccCcchhhhhc-ccc-CCcccch-----hhhhccHHHHHHHHHHHHhhhhCce Confidence 2244444333211100 0000 00111110 000 0000000 0112234455678888888877787 Q ss_pred ecCCCCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceE Q lcl|NC_021301. 77 TVGGSADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRI 150 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~ 150 (456) .+....+......+.+++.. |. .......+....+.+|.||+++-++..|++ .+..++|..+.++.|+...... 
T Consensus 74 ~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~~~~~~ 153 (422) T protein:vir:13 74 KIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDDNFLSS 153 (422) T ss_pred EEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCcceec Confidence 76332222112234444432 32 235677788899999999999999988886 5899999999988875532211 Q ss_pred EEEEEE-EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhH Q lcl|NC_021301. 151 RSAMRW-WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEP 229 (456) Q Consensus 151 ~~~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~ 229 (456) ..-+.| +...+|.. . .+.++.+.++.. .. ......|.|.++. T Consensus 154 ~~~~~y~~~~~~g~~--~-~~~~~eiih~~~------------------------------~~----~~~~~~G~s~~~~ 196 (422) T protein:vir:13 154 LSKVWYVVTDKNGKE--H-KLLPDEMLHFIG------------------------------DI----TLDGLIGIKPLDY 196 (422) T ss_pred cceEEEEEEeCCCeE--E-EEcccceEEEcC------------------------------CC----CCCCcccccHHHH Confidence 111111 11222211 1 122222222210 00 0011246666665 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc-hhhhhhhh-h--hhccceeccCCCceeEeecccc- Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN-AIDYASIF-E--AAPGALWELPPGVDIWESQTND- 304 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-~~~~~~~~-~--~~~~~~~~~~~d~~~~~~~~~~- 304 (456) +...++....+..-......-.+.|..+++. ... ..++.-+ ........ . ...+.+..++.+.++.++.... 
T Consensus 197 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~ 273 (422) T protein:vir:13 197 LRCTIENGRATQEFINKFFKNGLSIKGIVQY-VGD--LDEKAKKIFKKEFESMSNGLENAHSISLLPFGYQFQPISLSMA 273 (422) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEe-CCC--CCHHHHHHHHHHHHHHhcCccccCCceecCCCceeeeccCChh Confidence 5444443332222111111112234444432 111 1111111 11111111 1 1234566778888888776332 Q ss_pred hHHHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCc Q lcl|NC_021301. 305 FTPMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEG--AHNIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESV 379 (456) Q Consensus 305 ~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~A--l~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~ 379 (456) -..+++..+....+|+.+-|+|+..+|... ++ .+.+. +.+....|.-.+. .+++.+.. ++.-..... T Consensus 274 d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~f~~~~l~P~~~--------~ie~~l~~~Ll~~~~~~~ 345 (422) T protein:vir:13 274 DAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKDFYVTTLQSSLT--------VYEQEIQDKLFSQYETLQ 345 (422) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHH--------HHHHHHHHhhCChhhhcC Confidence 233677778888999999999999997532 11 11111 1111112222221 22222211 111111122 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHH-HHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDD-LDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e-~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .+.+++.+....-.|..+.++++.+++++|+++.-.+++.+|+.|-+--..- ...--..++...+.. 
....+.|.| T Consensus 346 g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~l~~~~~~~-~~~g~~~g~ 422 (422) T protein:vir:13 346 DVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGDRLLVNGNMIPIEMAGEQY-KKGGEKGGK 422 (422) T ss_pred CceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchhhccccc-ccCCCcCCC Confidence 3334444445566788999999999999999999999999998764321100 000000112222222 233344444 No 120 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.29 E-value=1.3e-10 Score=74.69 Aligned_cols=431 Identities=13% Similarity=0.119 Sum_probs=202.0 Q ss_pred CCCCCHH--HHH------HHHHHHHHH-------HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHH Q lcl|NC_021301. 1 MTASTPA--EWL------PVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRD 65 (456) Q Consensus 1 ~~~~t~~--~~~------~~l~~~~~~-------~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd 65 (456) ||..+-+ -++ .++...+.. ...++..+++||.+...-. .....+.. .+++.+|.+.-+++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~~~----~~~~~~~~-r~~~~~~k~~~~~~ 75 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDTTT----TSNQGLPW-KNSTTLPKLCQIRD 75 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhhhh----hhhccccc-ccccchhHHHHHHH Confidence 6655422 222 444444422 2334577888988854311 11111122 23677888888888 Q ss_pred HHHhhhcc----CC--eec----CCCCcccHHHHHHHHH----HhcChhHHHHHHHHHHhhCCeEEEEEeeCCC------ Q lcl|NC_021301. 66 SVADRIIP----NG--ITV----GGSADSDLALRARRIW----RDNRMDSVCKQWVKYGLDFGESYLTCWRRDD------ 125 (456) Q Consensus 66 ~~a~~l~~----~~--~~~----~~~~d~~~~~~l~~~~----~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d------ 125 (456) .++.+|+. +- +.+ .++.+....+.+.... .+.+|...+..+.+++.++|.|+..++-... 
T Consensus 76 ~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~~~e~~e 155 (584) T protein:vir:95 76 NLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAKYKEMTD 155 (584) T ss_pred HHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeecceeeec Confidence 88887743 21 111 1122222244444444 5568999999999999999999998864322 Q ss_pred -------CceEEEEEccceeEEEEeCCCCc--eEEEEEEEEE-e-----------------------------------c Q lcl|NC_021301. 126 -------GTATITADSPETMVVSVDPLQPW--RIRSAMRWWR-D-----------------------------------L 160 (456) Q Consensus 126 -------g~~~i~~~~p~~~~~~~d~~~~~--~~~~~~~~~~-~-----------------------------------~ 160 (456) ..+++..++|.++| |||.-.. ..-..++.+. . . T Consensus 156 ~~~v~~~~~prieriSP~d~~--~Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~~~~~~~ 233 (584) T protein:vir:95 156 GTLVPDYIGPRLVRISPLDIV--FNPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHLGGYSVE 233 (584) T ss_pred cccccccccceEEeeChhhee--ecCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCCCCCccc Confidence 25899999999987 6765421 1111111111 0 0 Q ss_pred CCceEE-------EEEE--cCCeEEEEEEe---eeeccccc---c-eeeccCCCceeecccccccCceeEEEEc------ Q lcl|NC_021301. 161 DAESDF-------AIVW--SGDGWQKFARP---CFVQSSSR---R-RLVTRISDSWVPVGDAVVTGSPPPVVVY------ 218 (456) Q Consensus 161 d~~~~~-------~~~~--~~~~~~~~~~~---~~~~~~~~---~-~~~~~~~~~~~~~~~~~~~~~~~pvv~~------ 218 (456) +.++.. ..+| .....+..... .+...... . .......+...-....+...+.+|++.. T Consensus 234 ~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g~~iIR~~~np~~~~~~PF~~~~~~p~~ 313 (584) T protein:vir:95 234 DFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVVDRSTEVRNESIPTWFGSAPIYHVGWRFRP 313 (584) T ss_pred ccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEEeccEEEEeeecCCCCCCCCEEEEcceeee Confidence 000000 0011 11111110000 00000000 0 0111111112212222333444444332 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeE Q lcl|NC_021301. 
219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIW 298 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 298 (456) ..-+|.|....+.++++.+|.+.-.+.+....+..|.....+.. ......+|..+..+..+.+. T Consensus 314 ~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~----------------~~~~~~pg~~~~~~~~~~~q 377 (584) T protein:vir:95 314 DNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEV----------------EEFVWGPGAEIHLDQGGDVQ 377 (584) T ss_pred ccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeecccc----------------chhcccCCceeecCCCCCcc Confidence 35689999999999999999999888888888888733222210 11234456666555444443 Q ss_pred eecccchHHHH---HHHHHHHHHHHhhcCCChhhhccccc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_021301. 299 ESQTNDFTPML---SAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCEDRLSIAKIGL-EAILVKALQ 373 (456) Q Consensus 299 ~~~~~~~~~~~---~~l~~~~~~i~~~~~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l-~~~~~l~~~ 373 (456) .+.+ +.+++. ..+..+...+...+|+|...-|..+. +.++..+.+....+..-...+.+.|...+ ++++.++.+ T Consensus 378 ~~~p-~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~ll~~ 456 (584) T protein:vir:95 378 EIAK-NVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNAMLE 456 (584) T ss_pred eecC-chhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3322 223333 33556666778889999988876533 34555677777778777888888887776 777777765 Q ss_pred hcC---C-CcccceeEE------ecCCC--------------CcCHHHHHHHHHHHH---h--cC--C---CcHHH---- Q lcl|NC_021301. 374 IEG---E-SVEDTVDVS------FESPD--------------RVTLGEKYAAASLAK---A--AG--E---SWASI---- 415 (456) Q Consensus 374 ~~~---~-~~~~~i~v~------f~~~~--------------~~~~~e~ad~~~kl~---~--~g--~---~s~~t---- 415 (456) ... + .+.-.+... |.... ..-..+.++..+.+. + +| + ++... 
T Consensus 457 ~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l~~~ 536 (584) T protein:vir:95 457 TATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKALATF 536 (584) T ss_pred HHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHHHHH Confidence 411 0 111011100 11111 111122233222222 2 11 1 11111 Q ss_pred HHH---hCCCC---hh--HHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 416 RRN---ILNYN---AD--QIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 416 ~~~---~~~~~---~~--~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +.+ +.++. ++ ...+++.+-...+.......-.+.+.+ |.- T Consensus 537 ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~~~~~~~~~~-~~~ 584 (584) T protein:vir:95 537 VDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDLQLQAQMPAE-GAI 584 (584) T ss_pred HHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHHHHHHhhhhc-cCC Confidence 111 22221 11 111111111111111111111111111 111 No 121 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.24 E-value=7.7e-11 Score=76.03 Aligned_cols=393 Identities=11% Similarity=-0.011 Sum_probs=175.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCc---c Q lcl|NC_021301. 8 EWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD---S 84 (456) Q Consensus 8 ~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d---~ 84 (456) =+..+.-++........+..--.+-|... .+..+ ....-+.+.-...+|+.+++-+..-||.+-...+ . 
T Consensus 1 m~f~~~~~~~~~~~~~~~~~~~~~~g~~~---~~~~v-----~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~~~~ 72 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDDKKILEWLGINP---SETYV-----NGKSCLKQATVFGCIRILSDNISKLPIKIYQKKDGIKR 72 (409) T ss_pred CcccccccCcCCCCCCChHHHHHHhcCCc---Cccee-----chhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCeee Confidence 00000000000000000000000111000 00000 0001123344566788888887777776522111 1 Q ss_pred cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEE-EE Q lcl|NC_021301. 85 DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMR-WW 157 (456) Q Consensus 85 ~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~-~~ 157 (456) .....+..++.. |. .......+....+.+|.||+++-.+..|.+ .+..++|..+.++.++........-+. .+ T Consensus 73 ~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~ 152 (409) T protein:vir:10 73 VPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLNSENNVWYLY 152 (409) T ss_pred ccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccccccceEEEEE Confidence 111123344432 32 235567788899999999999999989886 588899999888876542211111111 12 Q ss_pred EecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHH Q lcl|NC_021301. 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRI 237 (456) Q Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~ 237 (456) ....|.. ..+..+.+.++.. .+. ....|.|.++.....++.. 
T Consensus 153 ~~~~g~~---~~~~~~evih~r~--------------------------~~~---------d~~~G~s~i~~~~~~i~~~ 194 (409) T protein:vir:10 153 TDDLGQR---HKFMSDEILHFKG--------------------------LTA---------DGLAGLSVIELLNHLIENG 194 (409) T ss_pred EeCCcee---EEeccccEEEecC--------------------------cCC---------CCcccccHHHHHHHHHHHH Confidence 2222211 1122223222210 000 1123666665544444333 Q ss_pred HHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc-hhhhhhh-hh--hhccceeccCCCceeEeecccc-hHHHHHHH Q lcl|NC_021301. 238 NRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN-AIDYASI-FE--AAPGALWELPPGVDIWESQTND-FTPMLSAI 312 (456) Q Consensus 238 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-~~~~~~~-~~--~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l 312 (456) ..+..-......-.+.|-.+++. ... ..++.-. ....... .. ...+.+..++.+.++.++...+ -..+++.. T Consensus 195 ~~~~~~~~~~f~ng~~~~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~ 271 (409) T protein:vir:10 195 KSSETYLNNFFKNGLQVKGLVQY-AGD--LNPEAEEVFKENFERMSSGLKNAHRIAMLPIGYKFEPISQKLVDAQFLENS 271 (409) T ss_pred HHHHHHHHHHHhccCCCcEEEEc-CCC--CCHHHHHHHHHHHHHHhccccccCCceecCCCceEEEccCChhhHHHHHHH Confidence 22222111111222234334332 111 1111111 1111111 11 1234566778888888775432 22367778 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCCcccceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI-EGESVEDTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~-~~~~~~~~i~v~f~~~~ 391 (456) +....+|+.+-|+|+..+|.... .++..++.....+...| -.-+-..+++.+..-+-. 
......+.+++.+...+ T Consensus 272 ~~~~~~Ia~~fgVPp~~lg~~~~-~~~~~~e~~~~~f~~~~---l~P~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll 347 (409) T protein:vir:10 272 QLTIRQIASVFGVKMHQLNDLDR-ATHSNITEQNREFYIDT---LQSILNMYELEINYKLFLISEIKNGFYSKFNVDTIL 347 (409) T ss_pred HHHHHHHHHHhCCCHHHcCCCCC-CccccHHHHHHHHHHHH---HHHHHHHHHHHHHHhhcCchhccCCcEEEEechhhh Confidence 88999999999999999974321 11111111111111111 011222222222211111 11122334555555556 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) -.|..+.++++.+++++|+++.--+++.+|+.|-+--..- .+....... +...++....|+| T Consensus 348 ~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~--~~~~n~~~~-~~~~~~~~kgGe~ 409 (409) T protein:vir:10 348 RADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVL--LINGNMIPV-KMAGEQYSKGGEK 409 (409) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCee--eeccCccch-hhccccccccCCC Confidence 7789999999999999999999888999998764321100 001111111 1112333456777 No 122 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=99.24 E-value=3.7e-11 Score=77.77 Aligned_cols=408 Identities=11% Similarity=-0.004 Sum_probs=174.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcc-cCcccccCcccchhhh-hhhhhhccChHHHHHHHHHhhhccCCeecCCCCc- Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNG-DAPLPELTRNTSAAWR-SFQREARTNWGLMVRDSVADRIIPNGITVGGSAD- 83 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g-~~~i~~~~~~~~~~~~-~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d- 83 (456) +-|+..|..+.......-.. .+.|.. ...+...+........ 
....-+.+.=...+|+.+++-+..-|+.+-...+ T Consensus 1 Mg~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~iA~lp~~~~~~~~~ 79 (457) T protein:vir:62 1 MGFWSALFGRGHSPALDAAE-GRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGG 79 (457) T ss_pred Cchhhhhhcccccccccccc-ccccccchhhhhhccccccCCceechHHhhccHHHHHHHHHHHHhHhhCceEEEEecCC Confidence 33343332221110000000 000000 0000000000000000 0000111222334677777777666776432111 Q ss_pred ---ccHHHHHHHHHHh-cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 84 ---SDLALRARRIWRD-NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 84 ---~~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) ......+..++.. |. ...+...+....+.+|.||+++-.+ +|.+ .+..++|..+.+..+............ T Consensus 80 ~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~ 158 (457) T protein:vir:62 80 TRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVLDPTKIHVHMVMVDGLRRKVFEA 158 (457) T ss_pred ccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEEEEeccCCccceeEEE Confidence 1111123333322 22 3456677888899999999988555 4554 688889999887665443322222222 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liD 235 (456) |....++.......|.++.++++... ...+ ...|.|-++.....+. 
T Consensus 159 y~~~~~g~~~~~~~~~~~eiih~r~~-------------------------~~~~---------~~~G~sp~~~~~~~i~ 204 (457) T protein:vir:62 159 YDIDADGNEVLLGWFTPRDVLHIPGM-------------------------MLPG---------DFVGCSPISYARESIG 204 (457) T ss_pred EEEccCCceeEEEeeCccceEEecCC-------------------------CCCC---------ceecccHHHHHHHHHH Confidence 22223333333333444554443210 0000 0136665554444333 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhh-hh--hhccceeccCCCceeEeecccch-HHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASI-FE--AAPGALWELPPGVDIWESQTNDF-TPMLS 310 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~-~~--~~~~~~~~~~~d~~~~~~~~~~~-~~~~~ 310 (456) ....+..-......-.+.|..+++-- . ...++....+ ..... .. ...+.+..++.+.++.++..... ..|++ T Consensus 205 ~~~~~~~~~~~~f~ng~~p~gil~~~-~--~ls~e~~~~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e 281 (457) T protein:vir:62 205 LALAAQKYGAHFFRNGAMPGAVVEVP-G--TMSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQ 281 (457) T ss_pred HHHHHHHHHHHHHhccCCcceEEEcC-C--CCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHH Confidence 33222221111122223344444321 1 1112211111 11111 11 11345677888889888754322 23778 Q ss_pred HHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecC Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFES 389 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~ 389 (456) ..+..+.+|+.+-++|+..+|....+ .++..++.....+...+ . .-+-..+++.+...+.-......+.+++.+.. 
T Consensus 282 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~~--l-~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ 358 (457) T protein:vir:62 282 TRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFS--L-RPWLERIEAGFNRLLFAETADRFRFVKFNLDE 358 (457) T ss_pred HHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHH--H-HHHHHHHHHHHHhhhcCccccCceEEEeechh Confidence 77888999999999999998753322 22222222222222211 1 11222233322211111111122334444455 Q ss_pred CCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH---HH----HHHHHH---HHHHHHhhhhhhhccccc---CC Q lcl|NC_021301. 390 PDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK---QD----DLDRAR---EQITLFAGNSVQRPQEDG---SR 456 (456) Q Consensus 390 ~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~---~~----e~~~~~---ee~~~~~~~~~~~~~~d~---~~ 456 (456) ..-.|..+.++++.+++++|+++..-+++.+|+.|-+.- +. ....+. +....-.+.+.+.+.++. ++ T Consensus 359 l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (457) T protein:vir:62 359 IKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEE 438 (457) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCCCC Confidence 566799999999999999999999999999988653211 00 000000 000000111111110000 00 No 123 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=99.24 E-value=1.6e-11 Score=79.78 Aligned_cols=399 Identities=12% Similarity=0.065 Sum_probs=164.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhc----c---cCcccccCcccch--hhhhhhhhhccC-hHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSN----G---DAPLPELTRNTSA--AWRSFQREARTN-WGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~----g---~~~i~~~~~~~~~--~~~~~~~k~~~n-~~~~iVd~~a~~ 70 (456) ||.. ++... .|.....|..+-++.|. | +++-.......+. 
.+..+......| .++.|||..++- T Consensus 1 ~~~~-----~~~~~-~~~~~~~~~~~~rd~l~~~~~glg~~r~~~~~~~g~~~~~~~~~l~~~Yr~~~ia~~iVd~~~d~ 74 (449) T protein:vir:10 1 MTDK-----LTLAV-NHALNDARMARARMGLMVPTMGLDNKRHSAWCEYGFPELVTYENLYSLYRRGGIAHGAVEKLVGK 74 (449) T ss_pred Cchh-----hHHHH-hhhcchhHHHHHHHHHHHHHhcCCcccchhhhhcCCcccCCHHHHHHHHhcCchhHHHHHhhhhh Confidence 5544 33333 33222222222222111 1 1111010001111 111222233445 467999999997 Q ss_pred hccCCeecCCCCcccHH---HHHHHHHH---hcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeC Q lcl|NC_021301. 71 IIPNGITVGGSADSDLA---LRARRIWR---DNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDP 144 (456) Q Consensus 71 l~~~~~~~~~~~d~~~~---~~l~~~~~---~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~ 144 (456) +.-+...+....+.+.. ..+...|+ .+++.....++.+++..+|.|++++..+ ||+.--..+.+. T Consensus 75 ~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l~~Pl~~~-------- 145 (449) T protein:vir:10 75 CWQTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIR-DEKDWNLPATKG-------- 145 (449) T ss_pred hhhcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEec-CCCCCCcccccC-------- Confidence 75443222211111111 11222222 2355667888999999999998887664 343211111111 Q ss_pred CCCceEEEEEEEEEec------CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc Q lcl|NC_021301. 145 LQPWRIRSAMRWWRDL------DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY 218 (456) Q Consensus 145 ~~~~~~~~~~~~~~~~------d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~ 218 (456) ..+..+..+|... +..+.. .-|.....|.+.... .+... -....|..+++.+.-. 
T Consensus 146 ---~~i~~i~v~~~~~i~~~~~~~dp~s-p~yg~P~~y~v~~~~--------------~g~~~-~~~~iH~SRl~~~~~~ 206 (449) T protein:vir:10 146 ---RGLQKVSVSWAGSLKVAEWDTGINS-KTYGQPKLWKYTERL--------------PNGSS-RRVDIHPDRVFILGDY 206 (449) T ss_pred ---cceeeEEeeccccCChhhhhcCCCC-CCCCCceEEEEeeec--------------cCCCc-cceeeccceeEeecCC Confidence 0111111111100 000000 111112222222110 00000 0112344454433211 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHH-----HHHHHHHhhc---hhhhhhcCCCcccc-ccccc-chhhhhhhhhhhccce Q lcl|NC_021301. 219 QNPDGMGEVEPHIDIINRINRAELQ-----LLSTMAIQAF---RQRALKSAGHGLPK-VDENG-NAIDYASIFEAAPGAL 288 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~-----~~~~~~~~~~---~~~~i~g~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~ 288 (456) . ..|.|.++++..-+-.++.+.-. +.+....... ...-+.|+...... ..+.. ........+..+.+.+ T Consensus 207 ~-~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 285 (449) T protein:vir:10 207 S-EDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNEVAGEINRGNDVL 285 (449) T ss_pred C-CCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHHHHHHHhccchhe Confidence 1 23666666543322122211100 0010000000 00011111100000 00000 0111222222222322 Q ss_pred eccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhh-ccccc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 289 WELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPML-MPDSA--NQSAEGAHNIEKGFLFKCEDRLSIAKIGLE 365 (456) Q Consensus 289 ~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~-~~~~~--N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~ 365 (456) .++.+.++.+++. ++.+..+.+....+.+|+.++||..-| |...+ |++++ ++. 
....|..+|..+.+.|+ T Consensus 286 -~i~~~~d~~~~~~-~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D-~~n----yyd~i~~~Q~~l~p~le 358 (449) T protein:vir:10 286 -MTTQGATVTPLVT-SVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTED-QKY----FNARCQSRRVDLSFEIE 358 (449) T ss_pred -eecCCcceEEEec-ccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccchh-HHH----HHHHHHHHHHhhhHHHH Confidence 3344455555543 355666778888899999999998655 44322 33333 333 33444455566899999 Q ss_pred HHHHHHHHhc-CCCcccceeEEecCCCCcCHHHHHH-------HHHHHHhcC---CCcHHHHHHhCCCChhHHHHHHHHH Q lcl|NC_021301. 366 AILVKALQIE-GESVEDTVDVSFESPDRVTLGEKYA-------AASLAKAAG---ESWASIRRNILNYNADQIKQDDLDR 434 (456) Q Consensus 366 ~~~~l~~~~~-~~~~~~~i~v~f~~~~~~~~~e~ad-------~~~kl~~~g---~~s~~t~~~~~~~~~~~~~~~e~~~ 434 (456) +++.+++... |.. +..+.+.|+|-...+.+|.|+ ++.++.++| +++.+-++..+|+.+...... T Consensus 359 ~l~~~l~~s~~g~~-~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~~~~~~~~~---- 433 (449) T protein:vir:10 359 DFCDKLIELKIIDA-VAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGYDNDDEEPL---- 433 (449) T ss_pred HHHHHHHHhhcCCC-CCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcccCCCCCCC---- Confidence 9998776543 333 347899999999999988755 444455454 667666667766654210000 Q ss_pred HHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 435 AREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 435 ~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .....++++++. T Consensus 434 ----------~~e~~de~~~~~ 445 (449) T protein:vir:10 434 ----------GEEDGDEEDKAT 445 (449) T ss_pred ----------CCCCCccccccC Confidence 000001111111 No 124 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=99.23 E-value=1.2e-10 Score=74.99 Aligned_cols=379 Identities=11% Similarity=0.010 Sum_probs=168.7 Q ss_pred HHHHHHHHHHHHHH--HHHHHHHHhcccC--cccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcc Q lcl|NC_021301. 
9 WLPVLTKRIDDGMS--RVRLLARYSNGDA--PLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) Q Consensus 9 ~~~~l~~~~~~~~~--r~~~~~~YY~g~~--~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~ 84 (456) ++--+-..+.+... .-.....+.-.-. .+...........-....-+.+.-...+|+.+++-+..-|+.+...... T Consensus 1 m~m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ia~lp~~~~~~~~~ 80 (392) T protein:vir:74 1 MILPILNFINQTNDPPEAGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKKNQ 80 (392) T ss_pred CcchhhhhhhcccCcccccccccccccCchhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHhhccCceeeccchhh Confidence 11111111100000 0000000000000 0000000000000000011223345568888888887778876532211 Q ss_pred cHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCc Q lcl|NC_021301. 85 DLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAE 163 (456) Q Consensus 85 ~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~ 163 (456) ..+.+=.....-......+...++.+|.||+++-++.+|++ .+..++|..+.+..++..+.. . +.+...++. T Consensus 81 ---~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~-~---y~~~~~~~~ 153 (392) T protein:vir:74 81 ---GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM-Y---YNITFDDPK 153 (392) T ss_pred ---hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE-E---EEEEecCCc Confidence 11111111112245566678899999999999999989886 588999999988876554321 1 111111221 Q ss_pred eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHH Q lcl|NC_021301. 164 SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQ 243 (456) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~ 243 (456) ......+..+.+.++.. +.. 
-....|.|-++.....++....+..- T Consensus 154 ~~~~~~~~~~evih~~~--------------------------~~~--------~~~~~G~s~i~~~~~~i~~~~~~~~~ 199 (392) T protein:vir:74 154 IEPILQAPQSDLIHMKL--------------------------LSI--------DGGKTGISPLYSLRRESKIQRASDRL 199 (392) T ss_pred cceeEEEcCccEEEecC--------------------------CCC--------CCccccccHHHHHHHHHHHHHHHHHH Confidence 11122233333333210 000 00124677666554444333322221 Q ss_pred HHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhh--hhccceeccCCCceeEeeccc-chHHHHHHHHHHHHHHH Q lcl|NC_021301. 244 LLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFE--AAPGALWELPPGVDIWESQTN-DFTPMLSAIKEHIRQLS 320 (456) Q Consensus 244 ~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~~i~ 320 (456) ......-.+.|..+++- .......++.... ....+. ...+.+..++.+.++.++... ....|++..+....+|+ T Consensus 200 ~~~~f~ng~~p~~il~~-~~~~~~~~~~~~~--~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 276 (392) T protein:vir:74 200 TISSLNSSLNVPGVLTV-KGGGLLSDKDKAS--RSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYA 276 (392) T ss_pred HHHHHhccCCCceEEEe-CCCCCchHHHHHH--HHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 11112222233333321 1111111111111 111111 123455677788898887632 23447888888999999 Q ss_pred hhcCCChhhhcccccCc-HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHH Q lcl|NC_021301. 321 SATKTPLPMLMPDSANQ-SAEGAHNIEK-GFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEK 398 (456) Q Consensus 321 ~~~~~p~~~~~~~~~N~-Sg~Al~~~~~-~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ 398 (456) .+-|+|+..+|....+. +.++.+..+. .|.-.+ ..+++.+...+ . ..+++.+..-.-.+..+. 
T Consensus 277 ~~fgVPp~~lg~~~~~~~~~e~~~~~~~~~l~p~~--------~~ie~~l~~~l---~----~~~~~~~~~~~~~d~~~~ 341 (392) T protein:vir:74 277 KVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYL--------RPAISELEYKL---S----DHISVNMRPAIDPLGDNY 341 (392) T ss_pred HHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHH--------HHHHHHHHHhc---c----chhcccchhhhcCCHHHH Confidence 99999999998654433 2333322111 111111 11111111111 0 112222222233566788 Q ss_pred HHHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHhhhhhhhccc Q lcl|NC_021301. 399 YAAASLAKAAGESWASIRRNIL---NYNADQIKQDDLDRAREQITLFAGNSVQRPQE 452 (456) Q Consensus 399 ad~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~ 452 (456) ++.+.++..+|+++...+++++ |+.+.++.+ .+......+.+.++|.. T Consensus 342 ~~~~~~l~~~g~~t~near~~~~~~g~~pne~r~------~enl~~~~~Gd~~~p~p 392 (392) T protein:vir:74 342 LSTISTATRWGALAENQATFVLQEAGYIPKDLPA------PENTNKKTTGQSNEPVP 392 (392) T ss_pred HHHHHHHHhCCCcCHHHHHHHHHhCCCCccccch------hcCCCCCCCCCCCCCCC Confidence 8899999999999998877664 887765542 12222222222222222 No 125 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=99.22 E-value=3.6e-11 Score=77.84 Aligned_cols=378 Identities=11% Similarity=0.013 Sum_probs=164.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccH Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~ 86 (456) +-+.+++.............+-.+..+... +..+.. ..-+.+.-...+|+.++.-+.+-|+.... .. 
T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~v~~-----~~al~~~~V~~~v~~ia~~ia~~p~~~~~----~~ 67 (397) T protein:vir:38 1 MPLLKLNKSHSQGFSLNDPDWVNFLTGGEA----QKYVSA-----DTALKNSDIFSLIMQLSGDLAMVRYTSES----DR 67 (397) T ss_pred CcchhhhhcccCcccCCchhhhhhhcCCcC----Cceech-----HHhhccHHHHHHHHHHHHHHhhCcccccc----cH Confidence 112222111000000000001111111100 000000 00122233445677777766666776432 11 Q ss_pred HHHHHHHHHh-c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecC Q lcl|NC_021301. 87 ALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLD 161 (456) Q Consensus 87 ~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d 161 (456) +..++.+ | ....+...+....+.+|.||+.+-.+.+|.+ .+..++|..+.+..+..... +...+ ..... T Consensus 68 ---~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~-~~y~~--~~~~~ 141 (397) T protein:vir:38 68 ---SQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDGSG-LIYNI--NFDEP 141 (397) T ss_pred ---HHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce-EEEEE--Eeccc Confidence 2223322 2 2346677888899999999999988888886 68899999998877654432 11111 00111 Q ss_pred CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHH Q lcl|NC_021301. 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAE 241 (456) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~ 241 (456) +.+. ...+..+.++++... .. .....|.|.+......++....+. T Consensus 142 ~~~~-~~~~~~~eiih~~~~--------------------------~~--------~~~~~G~s~i~~~~~~i~~~~~~~ 186 (397) T protein:vir:38 142 AIGY-MENVPAADVIHIRLL--------------------------SK--------NGGKTGISPLSALINEQQIKDASN 186 (397) T ss_pred cccc-eeEecCccEEEecCC--------------------------CC--------CCccccccHHHHHHHHHHHHHHHH Confidence 1111 112333333332110 00 001246776665554444333222 Q ss_pred HHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhh--hhccceeccCCCceeEeeccc-chHHHHHHHHHHHH Q lcl|NC_021301. 
242 LQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWELPPGVDIWESQTN-DFTPMLSAIKEHIR 317 (456) Q Consensus 242 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~d~~~~~~~~~-~~~~~~~~l~~~~~ 317 (456) .-......-.+.|..+++.-. . ..++..... ....... ...+..+.++.+.++.++... ....|++..+.... T Consensus 187 ~~~~~~f~ng~~~~~il~~~~-~--~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~ 263 (397) T protein:vir:38 187 ELTLKALKQSVTASAVLTIQK-G--GLLDAETRIARSKEISKQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRD 263 (397) T ss_pred HHHHHHHhccCCccEEEEeCC-C--CCHHHHHHHHHHHHHHhcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHH Confidence 222222222233444443211 1 111111111 1111111 123445667888888887643 23347888899999 Q ss_pred HHHhhcCCChhhhccccc-CcHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCH Q lcl|NC_021301. 318 QLSSATKTPLPMLMPDSA-NQSAEGAHNIEK-GFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTL 395 (456) Q Consensus 318 ~i~~~~~~p~~~~~~~~~-N~Sg~Al~~~~~-~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~ 395 (456) +|+++-|+|+..+|+..+ +.+.++.+..+. .|. -+-..+++.+..-+ .. ...+.+.| ..-.+. T Consensus 264 ~Ia~afgVp~~~lg~~~~~~~~~e~~~~~~~~~l~--------P~~~~ie~~ln~~l--~~---~~~~~~~~--~~~~d~ 328 (397) T protein:vir:38 264 QIAKVYGVPDSYLNGQGDQQSSITQISGQYAKSLN--------RYVQAIVGELNDKL--HA---NISANIRF--AIDAMG 328 (397) T ss_pred HHHHHhCCCHHHhCCCCCcccHHHHHHHHHHHHHH--------HHHHHHHHHHHHhc--cC---hhcccccc--cccCCH Confidence 999999999999987543 223332222111 111 11111221111111 11 11122222 234577 Q ss_pred HHHHHHHHHHHhcCCCcHHHHHHhCCCChhH---HHHHHHHHH-HHHHHHHhh--hhhhhcccccCC Q lcl|NC_021301. 396 GEKYAAASLAKAAGESWASIRRNILNYNADQ---IKQDDLDRA-REQITLFAG--NSVQRPQEDGSR 456 (456) Q Consensus 396 ~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~---~~~~e~~~~-~ee~~~~~~--~~~~~~~~d~~~ 456 (456) .+.++++.++.+.|+++..-+++.+|+.+-+ ....+..-. 
........+ ......++.++- T Consensus 329 ~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~ 395 (397) T protein:vir:38 329 DQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQEGGENDGNNSDERGSD 395 (397) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccccccccccccccccccCCCCCCCCCCCCCC Confidence 8889999999999999999999888875421 111010000 000000000 000011111111 No 126 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=99.21 E-value=1.7e-10 Score=74.19 Aligned_cols=404 Identities=11% Similarity=0.037 Sum_probs=166.5 Q ss_pred HHHHHHHHHHHH---------H---------HHHHHHHHHHHhcccC--------------cc-cccC-cccchhhhhhh Q lcl|NC_021301. 7 AEWLPVLTKRID---------D---------GMSRVRLLARYSNGDA--------------PL-PELT-RNTSAAWRSFQ 52 (456) Q Consensus 7 ~~~~~~l~~~~~---------~---------~~~r~~~~~~YY~g~~--------------~i-~~~~-~~~~~~~~~~~ 52 (456) +.+.+.|.+... . ..-....+.++-.++. .. ...+ ...+..++... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~l~ 80 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHGVL 80 (547) T ss_pred CchhhhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHHHH Confidence 111111111000 0 0000001111111111 00 0001 11222333322 Q ss_pred hh-hccChHHHHHHHHHhhhcc-----------CCe--ecCC------CCcccHHHHHHHHHHhc---------ChhHHH Q lcl|NC_021301. 53 RE-ARTNWGLMVRDSVADRIIP-----------NGI--TVGG------SADSDLALRARRIWRDN---------RMDSVC 103 (456) Q Consensus 53 ~k-~~~n~~~~iVd~~a~~l~~-----------~~~--~~~~------~~d~~~~~~l~~~~~~n---------~~~~~~ 103 (456) +. ....+.+.+|++.++.+.+ -|+ ++.. ..+......+.+++..- .+..+. 
T Consensus 81 ~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~f~ 160 (547) T protein:vir:63 81 KKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSSFV 160 (547) T ss_pred HHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHHHH Confidence 22 2345677777777765431 122 2211 11222223445554431 234566 Q ss_pred HHHHHHHhhCCeEEEEEeeCCCCceE-EEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEee Q lcl|NC_021301. 104 KQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPC 182 (456) Q Consensus 104 ~~~~~~a~~~G~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 182 (456) ..+..+.+.+|.+|+.+-.+.+|++. +..++|..+.++.++.. ......++|+...++... ..|..+.+.++... T Consensus 161 ~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g-~~~~~~~~y~~~~~~~~~--~~~~~~eiih~r~n- 236 (547) T protein:vir:63 161 KKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADG-KIPDNGNRFVQVIDQKIV--ATFNAREMAFAVRN- 236 (547) T ss_pred HHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCcc-ccccCceEEEEEcCCcEE--EEeccccEEEeccc- Confidence 77888999999999999889899864 88999999888765432 111111222222222111 11223333222110 Q ss_pred eecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhc---hhhhhh Q lcl|NC_021301. 183 FVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAF---RQRALK 259 (456) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~---~~~~i~ 259 (456) |..-.....+|.|-++.....+.....+. .....++.+ |.-+|. T Consensus 237 ------------------------------~~~~~~~~~~G~Spi~~~~~~i~~~~~a~---~~~~~~f~Ng~~p~giL~ 283 (547) T protein:vir:63 237 ------------------------------PRSDIYATGYGYPELEIALKQFIAHENTE---AFNDRFFSHGGTTRGILQ 283 (547) T ss_pred ------------------------------CCCCcccccccccHHHHHHHHHHHHHHHH---HHHHHHHHcCCCcceEEE Confidence 00000011246666665444444332222 222333332 332222 Q ss_pred cCCCcccccccccchhh-hh-hhhh--hhcccee-ccCCCceeEeecc--cchHHHHHHHHHHHHHHHhhcCCChhhhcc Q lcl|NC_021301. 
260 SAGHGLPKVDENGNAID-YA-SIFE--AAPGALW-ELPPGVDIWESQT--NDFTPMLSAIKEHIRQLSSATKTPLPMLMP 332 (456) Q Consensus 260 g~~~~~~~~~~~~~~~~-~~-~~~~--~~~~~~~-~~~~d~~~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~ 332 (456) +.......++....+. .. ..+. ...+.+. ....+.++.++.. .+++ |++..+..+..|+.+-++|++.+|. T Consensus 284 -~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~l~~~~~d~q-fle~~~~~~~~Ia~afgVPP~~lG~ 361 (547) T protein:vir:63 284 -IKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDME-FEKWLNYLINVISALYGIDPAEINI 361 (547) T ss_pred -ecCCCCCCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEEcCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHcCc Confidence 1111111111111111 11 1111 1223333 3356678877753 2333 8888889999999999999999984 Q ss_pred cccCc---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHH Q lcl|NC_021301. 333 DSANQ---------SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAAS 403 (456) Q Consensus 333 ~~~N~---------Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~ 403 (456) ...+. +..-++.....+...+ -.-+-..+++.+...+ .. .....+.+.|......+.++.+... T Consensus 362 ~~~~~~~~~~~~s~t~sn~e~~~~~~~~~t---L~P~~~~ie~~ln~~L--~~-~~~~~~~~~f~~~~~~~~~~~~~~~- 434 (547) T protein:vir:63 362 PNNGGATGSKGGSLNEGNSAEKNQASKNKG---LQPLLGFIEDFINKHI--VA-EFGDKYTFQFVGGDIKSELESVKIL- 434 (547) T ss_pred ccccccccccccccchhhHHHHHHHHHHHH---HHHHHHHHHHHHHhhc--cc-ccCCceEEEeeccccccHHHHHHHH- Confidence 32211 0001111111111100 0111111111111111 01 1123467788888888887777644 Q ss_pred HHHhcCCCcHHHHHHhCCCChh-HH-------------------HHHHHHHHHHHHHHHhhhhh----hh-ccc-ccCC Q lcl|NC_021301. 404 LAKAAGESWASIRRNILNYNAD-QI-------------------KQDDLDRAREQITLFAGNSV----QR-PQE-DGSR 456 (456) Q Consensus 404 kl~~~g~~s~~t~~~~~~~~~~-~~-------------------~~~e~~~~~ee~~~~~~~~~----~~-~~~-d~~~ 456 (456) ++..+|+++.--+++.+|+.|. +- +..+.++.++..+...+... .. +++ +++. 
T Consensus 435 ~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (547) T protein:vir:63 435 AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKD 513 (547) T ss_pred HHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCCCCCCCCCCCcc Confidence 5667899999888988887652 10 00011111111111111000 00 000 1111 No 127 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=99.20 E-value=1e-10 Score=75.33 Aligned_cols=394 Identities=10% Similarity=0.070 Sum_probs=172.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec-- Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV-- 78 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~-- 78 (456) .|+-|..+... .+...|.+.... ..+....-.+.. .....+.....+|+.+++-+..-|+.+ T Consensus 9 ~~~p~~~e~~~--------------~~~~~~~~~~~~-~~~~~~~~~~~~-~~a~~~~~V~acV~~IA~~iA~lpl~l~~ 72 (518) T protein:vir:10 9 LSAPAMAELSP--------------QMQDSYYYAPAV-GMQLERQFSLYG-GIYKNQPWVRTVIAKRAQALARLPVKCMF 72 (518) T ss_pred ecCchhhhhhh--------------hhhccccccccc-ceecccccchhh-HHHhhhHHHHHHHHHHHHhhccCceEEEE Confidence 22222222211 122222221100 000000000000 001123456778888888777666554 Q ss_pred -CCCCcc-cHHHHHHHHHHh-cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEE Q lcl|NC_021301. 79 -GGSADS-DLALRARRIWRD-NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIR 151 (456) Q Consensus 79 -~~~~d~-~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~ 151 (456) ..+... .....+..++.+ |. -..+...+....+.+|.||+++-++.+|++ .+..++|..+.+..+...... 
T Consensus 73 ~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~~~~~~-- 150 (518) T protein:vir:10 73 TSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY-- 150 (518) T ss_pred EcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcCCCCEE-- Confidence 111111 111223334433 22 235566788889999999999999999987 589999999998887654331 Q ss_pred EEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHH Q lcl|NC_021301. 152 SAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHI 231 (456) Q Consensus 152 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~ 231 (456) .+++...++.......|..+.+.++... +.. ....|.|-+.... T Consensus 151 --~y~~~~~~~~~~~~~~~~~~eViHir~~--------------------------s~d--------g~~~G~spi~~a~ 194 (518) T protein:vir:10 151 --EYYFQAGAGVGTQLVSFADDEVVPIRFF--------------------------NPD--------GLERGLSLMESLK 194 (518) T ss_pred --EEEEEecCCccceEEEecCCcEEEecCC--------------------------CCC--------cccccccHHHHHH Confidence 1122222222222223333343333100 000 0013566555433 Q ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhh-hhh--hhccceeccCCCceeEeecccchH- Q lcl|NC_021301. 232 DIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YAS-IFE--AAPGALWELPPGVDIWESQTNDFT- 306 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~-~~~--~~~~~~~~~~~d~~~~~~~~~~~~- 306 (456) ..+.....+..-......-.+.|..+++.- .. ..++....+. ... .+. 
...+.+..++.+.++.++.....+ T Consensus 195 ~~i~~~~a~~~~~~~~f~ng~~p~gil~~~-~~--ls~e~~~~~k~~~~~~~~G~~nag~v~vL~~G~~~~~l~~s~~D~ 271 (518) T protein:vir:10 195 STIFSEDSSRNATAAMWKNAGRPNLVLRHE-KR--LSEAAQQRLREQFDRAHSGSSNTGKTMVVEEGMEPIPLQLTAVEM 271 (518) T ss_pred HHHHHHHHHHHHHHHHHhcCCCccEEEecC-CC--CCHHHHHHHHHHHHHHhcCccccCcceEcCCCceEEEccCChhHH Confidence 332222221111111111122343444321 11 1111111111 111 111 123456678888888877543222 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEE Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVS 386 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~ 386 (456) .|++..+..+.+|+.+-++|+..+|... +++...++.....+...| -.-+-..+++.+...+. ......+.+++. T Consensus 272 q~le~r~~~~~eIa~afgVPp~~lg~~~-~~t~sn~eq~~~~f~~~t---L~P~l~~ie~~ln~~L~-~~~~~~~~~~fd 346 (518) T protein:vir:10 272 QFIEARQLNREEVCGVYDIAPPIVHILD-RATFSNISAQMRAFYRDT---MAIPIARIQSAMDKYVG-QYWVRKNRMKFD 346 (518) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhccCC-CCCchhHHHHHHHHHHHH---HHHHHHHHHHHHHHhhc-ccccCCceEEEe Confidence 3788888889999999999999997432 111111111111111111 01111222222211110 001112334444 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHH-------HHHHHHH-HHHHhhhhhhhcccccCC Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDD-------LDRAREQ-ITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e-------~~~~~ee-~~~~~~~~~~~~~~d~~~ 456 (456) ....+..|..+.++++.+++++|+++.--+++.+|+.+-+..-.. ...+... 
.....+.....+++.+++ T Consensus 347 ~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~ 424 (518) T protein:vir:10 347 IDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKRPAST 424 (518) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccceecccccccccCCCCCCCCCCCCcc Confidence 455567899999999999999999999889999988653211000 0000000 000000000000000100 No 128 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=99.20 E-value=1.4e-10 Score=74.60 Aligned_cols=375 Identities=10% Similarity=0.034 Sum_probs=164.0 Q ss_pred hhhhhhhhccChHHHHHHHHHhhhccCCeecCCC-------CcccHHHHHHHHHHh---c-----------ChhHHHHHH Q lcl|NC_021301. 48 WRSFQREARTNWGLMVRDSVADRIIPNGITVGGS-------ADSDLALRARRIWRD---N-----------RMDSVCKQW 106 (456) Q Consensus 48 ~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~-------~d~~~~~~l~~~~~~---n-----------~~~~~~~~~ 106 (456) ++.+.+ ...+...+|+..++.+.+-|+.+... ......+.+.+++.. | .+..+...+ T Consensus 1 l~~l~~--~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~ 78 (467) T protein:vir:31 1 MAELLE--HNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTA 78 (467) T ss_pred Chhhhh--cCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHH Confidence 444322 25788999999999999888765211 111122233333332 2 133556678 Q ss_pred HHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeec Q lcl|NC_021301. 107 VKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQ 185 (456) Q Consensus 107 ~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~ 185 (456) ..+.+.+|.||+.+-.+..|++ .+..++|..+.+..|... ++...++...+...|............... 
T Consensus 79 ~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 149 (467) T protein:vir:31 79 WTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERG---------FVQLLEEKEKYFGVAGDRYQTNGNGDLDPV 149 (467) T ss_pred HHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecce---------eEeecCCceeeEEeccccceeecccceeee Confidence 8889999999999988888886 488889988887765431 111122222222222211111110000000 Q ss_pred ccccceeeccCCCceeecccccccCceeE--EEEccC------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---ch Q lcl|NC_021301. 186 SSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVVYQN------PDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FR 254 (456) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~~~n------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~ 254 (456) ..... ....+. ...+|+ |+|+.. ..|.|.+.....-++... .-......++. .| T Consensus 150 --~~~~~-~~~~~~---------~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~---~~~~~~~~~f~ng~~p 214 (467) T protein:vir:31 150 --FVDAD-DGSTGT---------SVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDS---AAQDYNIDFFENDGVP 214 (467) T ss_pred --eeeec-cccccc---------eeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHH---HHHHHHHHHHhccCCC Confidence 00000 000000 000111 222221 246666554443333222 21112222322 23 Q ss_pred hhhhhcCCCcccccccccchh-hhhhhhh--------------hhccceeccCCCc-------eeEeecccc--hHHHHH Q lcl|NC_021301. 255 QRALKSAGHGLPKVDENGNAI-DYASIFE--------------AAPGALWELPPGV-------DIWESQTND--FTPMLS 310 (456) Q Consensus 255 ~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--------------~~~~~~~~~~~d~-------~~~~~~~~~--~~~~~~ 310 (456) ..++.-.+ +. ..++....+ ....... ...+....+..+. ++..++... 
-..|++ T Consensus 215 ~gil~~~~-~~-l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e 292 (467) T protein:vir:31 215 RIAIIVKG-AE-LTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLE 292 (467) T ss_pred ceEEEecC-cC-CCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHH Confidence 33332111 10 011111100 0000000 0011222222222 222222211 134778 Q ss_pred HHHHHHHHHHhhcCCChhhhcccc-cC--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhcCCCcccceeEE Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDS-AN--QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL-QIEGESVEDTVDVS 386 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~-~N--~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~-~~~~~~~~~~i~v~ 386 (456) ..+....+|+++-|+|+..+|... ++ ++.+++...+. ..+ -.-+-..|++.+...+ .-......+.+++. T Consensus 293 ~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f~---~~~---l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~ 366 (467) T protein:vir:31 293 FRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKEFA---EET---IQPKQHDFGELLYELVHKQGLDAPDWTIEFE 366 (467) T ss_pred HHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHHHH---HHH---HHHHHHHHHHHHHHhhcchhhccCCceEEEe Confidence 888889999999999999987432 12 12222222111 111 0111122222222111 11111234456777 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHH-HHHHHHHH--H-HHhhh----------------- Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDD-LDRAREQI--T-LFAGN----------------- 445 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e-~~~~~ee~--~-~~~~~----------------- 445 (456) +......+..+.+++..+++++|+++..-+++.+|+.|-...... ........ . .-.+. T Consensus 367 ~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (467) T protein:vir:31 367 LAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADEII 446 (467) T ss_pred cchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccCCcccccccccccCCCCcccCcCCCCCCCcccchH Confidence 778888999999999999999999999999999988652111000 00000000 0 00000 Q ss_pred -------hhhhcccccCC Q lcl|NC_021301. 
446 -------SVQRPQEDGSR 456 (456) Q Consensus 446 -------~~~~~~~d~~~ 456 (456) ..+++-|.|.. T Consensus 447 ~~~~~~~~~~~~~~~~~~ 464 (467) T protein:vir:31 447 DSYQADLETEQLIEIGAN 464 (467) T ss_pred hhhhhccccchhhhhccc Confidence 00011111111 No 129 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=99.20 E-value=2e-10 Score=73.75 Aligned_cols=385 Identities=13% Similarity=0.007 Sum_probs=182.2 Q ss_pred HHHHHHHHHHHHHH----HHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCc Q lcl|NC_021301. 8 EWLPVLTKRIDDGM----SRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD 83 (456) Q Consensus 8 ~~~~~l~~~~~~~~----~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d 83 (456) =+++++-++..... .-...+..+|-|..... +..+. ...-+.......+|+.+++-+..-|+.+-...+ T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~-----~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~ 73 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTAS--GERVS-----ESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTD 73 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCccccc--Cceec-----hhhhhccHHHHHHHHHHHHhhhhCceEEEEecC Confidence 12222222211100 01223344444332110 11110 011133455567888888888777876422111 Q ss_pred c----cHHHHHHHHHH-h-c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEE Q lcl|NC_021301. 84 S----DLALRARRIWR-D-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSA 153 (456) Q Consensus 84 ~----~~~~~l~~~~~-~-n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~ 153 (456) . .....+..++. + | ........++...+.+|.||+++-.+..|.+ .+..++|..+.++.++..+.. 
T Consensus 74 ~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~---- 149 (416) T protein:vir:12 74 GGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGML---- 149 (416) T ss_pred CccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEE---- Confidence 1 01111333332 2 2 2345667788899999999999999888886 488899999988876654421 Q ss_pred EEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHH Q lcl|NC_021301. 154 MRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDI 233 (456) Q Consensus 154 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~l 233 (456) .+....+|... .+..+.+.++.. . ..+...|.|.++..... T Consensus 150 -~~~~~~~g~~~---~~~~~eiih~~~-------------------------------~----~~~~~~G~s~i~~~~~~ 190 (416) T protein:vir:12 150 -WYQTVLNGKAI---ELYDYEVLHFKG-------------------------------L----STDGIHGKSPIGVVREH 190 (416) T ss_pred -EEEEecCCeEE---EecCccEEEecC-------------------------------c----CCCCcccccHHHHHHHH Confidence 11112233211 122333332210 0 00112466666655544 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhhhhccceeccCCCceeEeecccch-HHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFEAAPGALWELPPGVDIWESQTNDF-TPMLSA 311 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~ 311 (456) ++....+..-.....+-.+.|..+++- ... ..++....+ ....... ..+.+..++.+.++.++.-... .-+++. 
T Consensus 191 i~~~~~~~~~~~~~~~ng~~p~~il~~-~~~--~~~e~~~~~~~~~~~~~-~~~~~~vl~~g~~~~~l~~~~~d~q~~e~ 266 (416) T protein:vir:12 191 IGAQAAATKYNAKLYKNEATPRGILKV-PAF--LDEKPKENVRKEWKRVN-KVENIAIIDYGLEYQSISMPLQEAQFVES 266 (416) T ss_pred HHHHHHHHHHHHHHHhcCCCCceEEec-CCC--CCHHHHHHHHHHHHHHh-cCCCeeecCCCceEEEccCChhhHHHHHH Confidence 444332222222222223334444432 111 111111111 1111111 2355667788888887754322 237777 Q ss_pred HHHHHHHHHhhcCCChhhhcccc-cCc-HHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCcccceeEE Q lcl|NC_021301. 312 IKEHIRQLSSATKTPLPMLMPDS-ANQ-SAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDTVDVS 386 (456) Q Consensus 312 l~~~~~~i~~~~~~p~~~~~~~~-~N~-Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~i~v~ 386 (456) .+....+|+.+-|+|+..+|... ++- +.+... +....|.-.+.. +++.+.. ++.-......+.+++. T Consensus 267 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~--------ie~~l~~~l~~~~~~~~g~~i~fd 338 (416) T protein:vir:12 267 MKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIEYVRNTLQPWIVN--------FEQELNVKLFLDHDQKSGHYVKFN 338 (416) T ss_pred HHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHHHHHHHHHHHHHH--------HHHHHHHhhcCchhhcCCceEEee Confidence 88889999999999999997432 221 222221 222222222222 2221111 1100111123445555 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH--------HHHHHHHHHHHHhh--hhhhhccccc Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD--------DLDRAREQITLFAG--NSVQRPQEDG 454 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~--------e~~~~~ee~~~~~~--~~~~~~~~d~ 454 (456) +..-...|..+.++++.++..+|+++.--+++.+|+.|-+--.. 
..+...+....-.+ ......+.+| T Consensus 339 ~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 339 IDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccchhhccccccccCCCCCcCCC Confidence 55667789999999999999999999999999998876421100 00111111111010 1111223333 No 130 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=99.19 E-value=2.7e-10 Score=73.01 Aligned_cols=393 Identities=10% Similarity=0.014 Sum_probs=175.1 Q ss_pred HHHHHHHHHHHHHHHHH-------HHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSR-------VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r-------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) +-+++.+.....+.... ...+..+.-+... ...+.. ..-+.+.-...+|+.++.-+..-||.+- T Consensus 1 M~~~~~~f~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~-----~~al~~~~v~~~i~~ia~~ia~l~~~~~ 71 (429) T protein:vir:10 1 MDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKG-----KNALKVATVFACIKILSESVSKLPLKIY 71 (429) T ss_pred CchhhhhhcccccCcccccccCCChHHHHHHhcCCCC----cceech-----hhhhccHHHHHHHHHHHHhhccCceEEE Confidence 33443333211100000 0011111110000 000000 0012234455678888887777676642 Q ss_pred CCC-c---ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCce Q lcl|NC_021301. 80 GSA-D---SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWR 149 (456) Q Consensus 80 ~~~-d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~ 149 (456) ... + ......+..++.. | ........+....+.+|.||+++-.+..|++ .+..++|..+.+..|+..... 
T Consensus 72 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~~~~ 151 (429) T protein:vir:10 72 QEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVGLLN 151 (429) T ss_pred EecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCccccc Confidence 111 1 1111124444432 2 2345677788899999999999999999986 688999999888776432211 Q ss_pred EEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhH Q lcl|NC_021301. 150 IRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEP 229 (456) Q Consensus 150 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~ 229 (456) ...-..+.....| .. ..|.++.+.++.. .. ......|.|.++. T Consensus 152 ~~~~~~~~~~~~g--~~-~~~~~~evih~~~------------------------------~~----~~~~~~G~s~i~~ 194 (429) T protein:vir:10 152 SKTKMWYVVNTGG--QQ-RVLKPEEILHFKN------------------------------GI----TLDGLVGVPTMEY 194 (429) T ss_pred ccceEEEEEccCC--eE-EEEccccEEEecC------------------------------CC----CCCCcccccHHHH Confidence 1111111111111 11 1233333332210 00 0111246666665 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhh-hh--hhccceeccCCCceeEeecccch Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASI-FE--AAPGALWELPPGVDIWESQTNDF 305 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~-~~--~~~~~~~~~~~d~~~~~~~~~~~ 305 (456) +...++....+..-......-.+.|..+++. ... ..++....+ ..... .. ...+.+..++.+.++.++...+. 
T Consensus 195 ~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~ 271 (429) T protein:vir:10 195 LKSTLENSASADKFINNFYKQGLQVKGLVQY-VGD--LNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMS 271 (429) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEc-CCC--CCHHHHHHHHHHHHHHhccccccCceeecCCCceEEEccCChh Confidence 5444443332222111112222234444432 111 111111111 11111 11 12345667788888887753322 Q ss_pred -HHHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCc Q lcl|NC_021301. 306 -TPMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESV 379 (456) Q Consensus 306 -~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~ 379 (456) ..+++..+....+|+.+-|+|+..+|... ++ ++.+... +....|. -+-..+++.+.. ++.-..... T Consensus 272 d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~f~~~~l~--------P~~~~ie~~ln~kl~~~~~~~~ 343 (429) T protein:vir:10 272 DAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQFYTDTLQ--------ATLTMYEQEMTYKLFLDSELDK 343 (429) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHH--------HHHHHHHHHHHHhhcChhhcCC Confidence 23677778889999999999999997432 22 1222221 1111121 122222222221 110011112 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHH--HHHHHHhhhhhhhccc Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAR--EQITLFAGNSVQRPQE 452 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~--ee~~~~~~~~~~~~~~ 452 (456) .+.+++.+..-...|..+.++++.++..+|+++..-+++.+|+.|.+-.. .....+. .+...-.+....+... T Consensus 344 g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~ 423 (429) T protein:vir:10 344 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSK 423 (429) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccchhhccccccCCCCCCCCCCC Confidence 23344444455677999999999999999999998889999886542110 0000100 0000011111111111 Q ss_pred ccCC Q lcl|NC_021301. 
453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) +|+. T Consensus 424 ~~~e 427 (429) T protein:vir:10 424 EGNE 427 (429) T ss_pred CCCC Confidence 1222 No 131 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=99.19 E-value=1.8e-10 Score=73.97 Aligned_cols=380 Identities=10% Similarity=0.014 Sum_probs=167.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCc--ccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC Q lcl|NC_021301. 5 TPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTR--NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) Q Consensus 5 t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~--~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~ 82 (456) =.+-+++++.+.- ..+.......+.-...+....+. ......-....-+.+.-...+|+.+++-+..-|+++.... T Consensus 1 m~m~~f~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:10 1 MILPILNFINQTN--DPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred Ccchhhhhhhccc--ccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccch Confidence 0001111111100 00000000001000000000000 0000000000012234456688888888877788765322 Q ss_pred cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecC Q lcl|NC_021301. 83 DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLD 161 (456) Q Consensus 83 d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d 161 (456) +. ..+.+=.....-......+....+.+|.||+++-++.+|++ .+..++|..+.+..++..+.. . 
+.+...+ T Consensus 79 ~~---~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~-~---y~~~~~~ 151 (392) T protein:vir:10 79 NQ---GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM-Y---YNITFDD 151 (392) T ss_pred hh---hHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE-E---EEEEecC Confidence 11 11111111112245566788899999999999999999987 688999999888876544321 1 1111112 Q ss_pred CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHH Q lcl|NC_021301. 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAE 241 (456) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~ 241 (456) +.......|..+.++++.. ++. -....|.|-+......++....+. T Consensus 152 ~~~~~~~~~~~~eiih~~~--------------------------~~~--------~~~~~G~s~i~~~~~~i~~~~~~~ 197 (392) T protein:vir:10 152 PKIEPILQAPQSDLIHMKL--------------------------LSI--------DGGKTGISPLYSLRRESKIQRASD 197 (392) T ss_pred cccceeEEEccccEEEecC--------------------------CCC--------CCccccccHHHHHHHHHHHHHHHH Confidence 2111122233333333210 000 001246666665444443333222 Q ss_pred HHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHH Q lcl|NC_021301. 
242 LQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLS 320 (456) Q Consensus 242 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~ 320 (456) .-......-.+.|..+++ +.......++........-.-....+.+..++.+.++.++...+ ...|++..+....+|+ T Consensus 198 ~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 276 (392) T protein:vir:10 198 RLTISSLNSSLNVPGVLT-VKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYA 276 (392) T ss_pred HHHHHHHhccCCCceEEE-eCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 111111111223333332 11111111111111111101112234556678888988886432 2347888888999999 Q ss_pred hhcCCChhhhcccccCcHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHH Q lcl|NC_021301. 321 SATKTPLPMLMPDSANQSA-EGAHN-IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEK 398 (456) Q Consensus 321 ~~~~~p~~~~~~~~~N~Sg-~Al~~-~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ 398 (456) .+-|+|+..+|....+.|. ++.+. ....|.-.+...+..+...| .. .+++......-.+..+. T Consensus 277 ~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L-----------~~----~~~~d~~~~~~~d~~~~ 341 (392) T protein:vir:10 277 KVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKL-----------SD----HISVNMRPAIDPLGDNY 341 (392) T ss_pred HHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------cc----cccccchhhhccCHHHH Confidence 9999999999865443332 22221 11222222222222221111 10 11111122223466777 Q ss_pred HHHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 399 YAAASLAKAAGESWASIRRNIL---NYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 399 ad~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +..+.++..+|+++...+++++ |+.++++.+. +... 
.-+.-|+++ T Consensus 342 ~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~------e~l~-------~~~~Gd~~~ 389 (392) T protein:vir:10 342 LSTISTATRWGALAENQATFVLQEAGYIPKDLPAP------ENTN-------KKTTGQSNE 389 (392) T ss_pred HHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchh------cCCC-------CCCCCCCCC Confidence 8888899999999988777665 8887765421 1122 223334444 No 132 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=99.19 E-value=1.8e-10 Score=73.97 Aligned_cols=380 Identities=10% Similarity=0.014 Sum_probs=167.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCc--ccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC Q lcl|NC_021301. 5 TPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTR--NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) Q Consensus 5 t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~--~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~ 82 (456) =.+-+++++.+.- ..+.......+.-...+....+. ......-....-+.+.-...+|+.+++-+..-|+++.... T Consensus 1 m~m~~f~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~ 78 (392) T protein:vir:39 1 MILPILNFINQTN--DPPEVGSVQSYFPDGNDAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKINAEKKK 78 (392) T ss_pred Ccchhhhhhhccc--ccccccccccccccCchhhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCceeeccch Confidence 0001111111100 00000000001000000000000 0000000000012234456688888888877788765322 Q ss_pred cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecC Q lcl|NC_021301. 83 DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLD 161 (456) Q Consensus 83 d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d 161 (456) +. ..+.+=.....-......+....+.+|.||+++-++.+|++ .+..++|..+.+..++..+.. . 
+.+...+ T Consensus 79 ~~---~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~-~---y~~~~~~ 151 (392) T protein:vir:39 79 NQ---GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGM-Y---YNITFDD 151 (392) T ss_pred hh---hHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE-E---EEEEecC Confidence 11 11111111112245566788899999999999999999987 688999999888876544321 1 1111112 Q ss_pred CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHH Q lcl|NC_021301. 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAE 241 (456) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~ 241 (456) +.......|..+.++++.. ++. -....|.|-+......++....+. T Consensus 152 ~~~~~~~~~~~~eiih~~~--------------------------~~~--------~~~~~G~s~i~~~~~~i~~~~~~~ 197 (392) T protein:vir:39 152 PKIEPILQAPQSDLIHMKL--------------------------LSI--------DGGKTGISPLYSLRRESKIQRASD 197 (392) T ss_pred cccceeEEEccccEEEecC--------------------------CCC--------CCccccccHHHHHHHHHHHHHHHH Confidence 2111122233333333210 000 001246666665444443333222 Q ss_pred HHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHH Q lcl|NC_021301. 
242 LQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLS 320 (456) Q Consensus 242 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~ 320 (456) .-......-.+.|..+++ +.......++........-.-....+.+..++.+.++.++...+ ...|++..+....+|+ T Consensus 198 ~~~~~~f~ng~~p~gil~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia 276 (392) T protein:vir:39 198 RLTISSLNSSLNVPGVLT-VKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYA 276 (392) T ss_pred HHHHHHHhccCCCceEEE-eCCCCCchHHHHHHHHHHHhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 111111111223333332 11111111111111111101112234556678888988886432 2347888888999999 Q ss_pred hhcCCChhhhcccccCcHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHH Q lcl|NC_021301. 321 SATKTPLPMLMPDSANQSA-EGAHN-IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEK 398 (456) Q Consensus 321 ~~~~~p~~~~~~~~~N~Sg-~Al~~-~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ 398 (456) .+-|+|+..+|....+.|. ++.+. ....|.-.+...+..+...| .. .+++......-.+..+. T Consensus 277 ~~fgVpp~~lg~~~~~~~~~~~~~~f~~~~l~P~~~~ie~~l~~~L-----------~~----~~~~d~~~~~~~d~~~~ 341 (392) T protein:vir:39 277 KVYGLPDSYIGGQGDQQSSIQQISGMYASALNRYLRPAISELEYKL-----------SD----HISVNMRPAIDPLGDNY 341 (392) T ss_pred HHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------cc----cccccchhhhccCHHHH Confidence 9999999999865443332 22221 11222222222222221111 10 11111122223466777 Q ss_pred HHHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 399 YAAASLAKAAGESWASIRRNIL---NYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 399 ad~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +..+.++..+|+++...+++++ |+.++++.+. +... 
.-+.-|+++ T Consensus 342 ~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~e~r~~------e~l~-------~~~~Gd~~~ 389 (392) T protein:vir:39 342 LSTISTATRWGALAENQATFVLQEAGYIPKDLPAP------ENTN-------KKTTGQSNE 389 (392) T ss_pred HHHHHHHHhCCCcCHHHHHHHHHhcCCCccccchh------cCCC-------CCCCCCCCC Confidence 8888899999999988777665 8887765421 1122 223334444 No 133 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=99.19 E-value=3.6e-10 Score=72.33 Aligned_cols=396 Identities=10% Similarity=0.027 Sum_probs=176.6 Q ss_pred HHHHHHHHHHHH--HHHH-H-------HHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 7 AEWLPVLTKRID--DGMS-R-------VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 7 ~~~~~~l~~~~~--~~~~-r-------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) +-+++++..-+. .+.. . ...+-.+.-+... +..+.. ..-+.+.-...+|+.+++-+..-|| T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~-----~~al~~~~v~~~i~~ia~~ia~lp~ 71 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKG-----KNALKVATVFACIKILSESVSKLPL 71 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC----ccccch-----hhhhccHHHHHHHHHHHHhhccCce Confidence 334554433321 0000 0 0011111110000 000000 0112233345677888887777777 Q ss_pred ecCCC-Cc---ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_021301. 77 TVGGS-AD---SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQ 146 (456) Q Consensus 77 ~~~~~-~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~ 146 (456) .+... .+ ......+.+++.. | .-..+...+....+.+|.||+++..+..|++ .+..++|..+.+..|+.. 
T Consensus 72 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~ 151 (432) T protein:vir:10 72 KIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVG 151 (432) T ss_pred EEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcc Confidence 64211 11 1111224444432 2 2346677788899999999999999988986 588999999988876532 Q ss_pred CceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCc Q lcl|NC_021301. 147 PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGE 226 (456) Q Consensus 147 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~ 226 (456) .........|.....+ .. ..+.+++++++.. . . ......|.|. T Consensus 152 ~~~~~~~~~y~~~~~g--~~-~~~~~~eiih~r~------------------------------~-~---~~~~~~G~s~ 194 (432) T protein:vir:10 152 LLNSKTKMWYVVNTGG--QQ-RVLKPEEILHFKN------------------------------G-I---TLDGLVGVPT 194 (432) T ss_pred cccccceEEEEEecCC--eE-EEEccccEEEecC------------------------------C-C---CCCCcccccH Confidence 1111111111111111 11 1223333332210 0 0 0111246666 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-hhhhhh-hh--hhccceeccCCCceeEeecc Q lcl|NC_021301. 227 VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDYASI-FE--AAPGALWELPPGVDIWESQT 302 (456) Q Consensus 227 ~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~~~~-~~--~~~~~~~~~~~d~~~~~~~~ 302 (456) +......++....+..-......-.+.|..+++. ... ..++.... ...... .. ...+.+..++.+.++.++.. 
T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~ 271 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQY-VGD--LNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISL 271 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEc-CCC--CCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccC Confidence 6655444443332222111111222234444432 111 11111111 111111 11 12345667788888888764 Q ss_pred cch-HHHHHHHHHHHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCc Q lcl|NC_021301. 303 NDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESV 379 (456) Q Consensus 303 ~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~ 379 (456) ... ..+++..+....+|+.+-|+|+..+|... ++-| .++.....+... .-.-+-..+++.+.. ++.-..... T Consensus 272 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s--~~e~~~~~~~~~---~l~P~~~~ie~~ln~kLl~~~~~~~ 346 (432) T protein:vir:10 272 NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN--NIEQQQQQFYTD---TLQATLTMYEQEMTYKLFLDSELDK 346 (432) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHH---HHHHHHHHHHHHHHHhhcChhhcCC Confidence 322 23677788889999999999999997432 2211 111111111111 011122222222221 111011112 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHH--HHHHHhhhhhhhccc Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRARE--QITLFAGNSVQRPQE 452 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~e--e~~~~~~~~~~~~~~ 452 (456) ...+++.+..-+..|..+.++++.+++++|+++..-+++.+|+.|.+--. .....+.+ +...-++........ 
T Consensus 347 g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~ 426 (432) T protein:vir:10 347 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSK 426 (432) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCC Confidence 23344445556678999999999999999999998889999887642110 00001100 000001111111111 Q ss_pred ccCC Q lcl|NC_021301. 453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) +|+. T Consensus 427 ~~~~ 430 (432) T protein:vir:10 427 EGNE 430 (432) T ss_pred CCCC Confidence 2222 No 134 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=99.19 E-value=3.6e-10 Score=72.33 Aligned_cols=396 Identities=10% Similarity=0.027 Sum_probs=176.6 Q ss_pred HHHHHHHHHHHH--HHHH-H-------HHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 7 AEWLPVLTKRID--DGMS-R-------VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 7 ~~~~~~l~~~~~--~~~~-r-------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) +-+++++..-+. .+.. . ...+-.+.-+... +..+.. ..-+.+.-...+|+.+++-+..-|| T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~-----~~al~~~~v~~~i~~ia~~ia~lp~ 71 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKG-----KNALKVATVFACIKILSESVSKLPL 71 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC----ccccch-----hhhhccHHHHHHHHHHHHhhccCce Confidence 334554433321 0000 0 0011111110000 000000 0112233345677888887777777 Q ss_pred ecCCC-Cc---ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_021301. 
77 TVGGS-AD---SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQ 146 (456) Q Consensus 77 ~~~~~-~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~ 146 (456) .+... .+ ......+.+++.. | .-..+...+....+.+|.||+++..+..|++ .+..++|..+.+..|+.. T Consensus 72 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~ 151 (432) T protein:vir:10 72 KIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVG 151 (432) T ss_pred EEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcc Confidence 64211 11 1111224444432 2 2346677788899999999999999988986 588999999988876532 Q ss_pred CceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCc Q lcl|NC_021301. 147 PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGE 226 (456) Q Consensus 147 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~ 226 (456) .........|.....+ .. ..+.+++++++.. . . ......|.|. T Consensus 152 ~~~~~~~~~y~~~~~g--~~-~~~~~~eiih~r~------------------------------~-~---~~~~~~G~s~ 194 (432) T protein:vir:10 152 LLNSKTKMWYVVNTGG--QQ-RVLKPEEILHFKN------------------------------G-I---TLDGLVGVPT 194 (432) T ss_pred cccccceEEEEEecCC--eE-EEEccccEEEecC------------------------------C-C---CCCCcccccH Confidence 1111111111111111 11 1223333332210 0 0 0111246666 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-hhhhhh-hh--hhccceeccCCCceeEeecc Q lcl|NC_021301. 227 VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDYASI-FE--AAPGALWELPPGVDIWESQT 302 (456) Q Consensus 227 ~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~~~~-~~--~~~~~~~~~~~d~~~~~~~~ 302 (456) +......++....+..-......-.+.|..+++. ... ..++.... ...... .. ...+.+..++.+.++.++.. 
T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~ 271 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQY-VGD--LNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISL 271 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEc-CCC--CCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccC Confidence 6655444443332222111111222234444432 111 11111111 111111 11 12345667788888888764 Q ss_pred cch-HHHHHHHHHHHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCc Q lcl|NC_021301. 303 NDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESV 379 (456) Q Consensus 303 ~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~ 379 (456) ... ..+++..+....+|+.+-|+|+..+|... ++-| .++.....+... .-.-+-..+++.+.. ++.-..... T Consensus 272 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s--~~e~~~~~~~~~---~l~P~~~~ie~~ln~kLl~~~~~~~ 346 (432) T protein:vir:10 272 NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN--NIEQQQQQFYTD---TLQATLTMYEQEMTYKLFLDSELDK 346 (432) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHH---HHHHHHHHHHHHHHHhhcChhhcCC Confidence 322 23677788889999999999999997432 2211 111111111111 011122222222221 111011112 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHH--HHHHHhhhhhhhccc Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRARE--QITLFAGNSVQRPQE 452 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~e--e~~~~~~~~~~~~~~ 452 (456) ...+++.+..-+..|..+.++++.+++++|+++..-+++.+|+.|.+--. .....+.+ +...-++........ 
T Consensus 347 g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~ 426 (432) T protein:vir:10 347 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSK 426 (432) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCC Confidence 23344445556678999999999999999999998889999887642110 00001100 000001111111111 Q ss_pred ccCC Q lcl|NC_021301. 453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) +|+. T Consensus 427 ~~~~ 430 (432) T protein:vir:10 427 EGNE 430 (432) T ss_pred CCCC Confidence 2222 No 135 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=99.19 E-value=3.6e-10 Score=72.33 Aligned_cols=396 Identities=10% Similarity=0.027 Sum_probs=176.6 Q ss_pred HHHHHHHHHHHH--HHHH-H-------HHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 7 AEWLPVLTKRID--DGMS-R-------VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 7 ~~~~~~l~~~~~--~~~~-r-------~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) +-+++++..-+. .+.. . ...+-.+.-+... +..+.. ..-+.+.-...+|+.+++-+..-|| T Consensus 1 M~~~~r~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~----~~~v~~-----~~al~~~~v~~~i~~ia~~ia~lp~ 71 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPS----TISVKG-----KNALKVATVFACIKILSESVSKLPL 71 (432) T ss_pred CChHHHHHHhcCccccCcccccccCCchHHHHHHhCCCcC----ccccch-----hhhhccHHHHHHHHHHHHhhccCce Confidence 334554433321 0000 0 0011111110000 000000 0112233345677888887777777 Q ss_pred ecCCC-Cc---ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_021301. 
77 TVGGS-AD---SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQ 146 (456) Q Consensus 77 ~~~~~-~d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~ 146 (456) .+... .+ ......+.+++.. | .-..+...+....+.+|.||+++..+..|++ .+..++|..+.+..|+.. T Consensus 72 ~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~d~~~ 151 (432) T protein:vir:10 72 KIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPIDASKVTVYIDDVG 151 (432) T ss_pred EEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcc Confidence 64211 11 1111224444432 2 2346677788899999999999999988986 588999999988876532 Q ss_pred CceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCc Q lcl|NC_021301. 147 PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGE 226 (456) Q Consensus 147 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~ 226 (456) .........|.....+ .. ..+.+++++++.. . . ......|.|. T Consensus 152 ~~~~~~~~~y~~~~~g--~~-~~~~~~eiih~r~------------------------------~-~---~~~~~~G~s~ 194 (432) T protein:vir:10 152 LLNSKTKMWYVVNTGG--QQ-RVLKPEEILHFKN------------------------------G-I---TLDGLVGVPT 194 (432) T ss_pred cccccceEEEEEecCC--eE-EEEccccEEEecC------------------------------C-C---CCCCcccccH Confidence 1111111111111111 11 1223333332210 0 0 0111246666 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-hhhhhh-hh--hhccceeccCCCceeEeecc Q lcl|NC_021301. 227 VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDYASI-FE--AAPGALWELPPGVDIWESQT 302 (456) Q Consensus 227 ~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~~~~-~~--~~~~~~~~~~~d~~~~~~~~ 302 (456) +......++....+..-......-.+.|..+++. ... ..++.... ...... .. ...+.+..++.+.++.++.. 
T Consensus 195 ~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~ 271 (432) T protein:vir:10 195 MEYLKSTLENSASADKFINNFYKQGLQVKGLVQY-VGD--LNEDAKKVFRENFESMSSGLQNSHRIALMPVGYQFQPISL 271 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEc-CCC--CCHHHHHHHHHHHHHHhcccccCCcceecCCCceEEEccC Confidence 6655444443332222111111222234444432 111 11111111 111111 11 12345667788888888764 Q ss_pred cch-HHHHHHHHHHHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCc Q lcl|NC_021301. 303 NDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESV 379 (456) Q Consensus 303 ~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~ 379 (456) ... ..+++..+....+|+.+-|+|+..+|... ++-| .++.....+... .-.-+-..+++.+.. ++.-..... T Consensus 272 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s--~~e~~~~~~~~~---~l~P~~~~ie~~ln~kLl~~~~~~~ 346 (432) T protein:vir:10 272 NMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN--NIEQQQQQFYTD---TLQATLTMYEQEMTYKLFLDSELDK 346 (432) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc--cHHHHHHHHHHH---HHHHHHHHHHHHHHHhhcChhhcCC Confidence 322 23677788889999999999999997432 2211 111111111111 011122222222221 111011112 Q ss_pred ccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHH--HHHHHhhhhhhhccc Q lcl|NC_021301. 380 EDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRARE--QITLFAGNSVQRPQE 452 (456) Q Consensus 380 ~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~e--e~~~~~~~~~~~~~~ 452 (456) ...+++.+..-+..|..+.++++.+++++|+++..-+++.+|+.|.+--. .....+.+ +...-++........ 
T Consensus 347 g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~ 426 (432) T protein:vir:10 347 GFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSK 426 (432) T ss_pred CcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccchhhccccccCCCCCCCCCCC Confidence 23344445556678999999999999999999998889999887642110 00001100 000001111111111 Q ss_pred ccCC Q lcl|NC_021301. 453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) +|+. T Consensus 427 ~~~~ 430 (432) T protein:vir:10 427 EGNE 430 (432) T ss_pred CCCC Confidence 2222 No 136 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=99.18 E-value=3.8e-10 Score=72.24 Aligned_cols=386 Identities=10% Similarity=-0.044 Sum_probs=177.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCccc- Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSD- 85 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~- 85 (456) +-+++++-++....+. ... +.|-...- .........-....-+.+.....+|+.+++-+..-||.+-...+.. T Consensus 1 Mgl~~~~f~~~~~~~~----~~~-~~~~~~~~-~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~ 74 (409) T protein:vir:84 1 MSLFTRIFSGPSEERT----LTK-ISGIPSPA-EDWAMHGDRPGANSAMTLGAFYACVTLLADTVASLSIDAYRKKDNVR 74 (409) T ss_pred CchhhhhhcCCCcccc----ccc-cccccccc-chhhccCcccchhhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc Confidence 2233333222111100 000 00000000 0000000000001112344566788888888877787653221111 Q ss_pred -HHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEE-eeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEE Q lcl|NC_021301. 
86 -LALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTC-WRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWW 157 (456) Q Consensus 86 -~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v-~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~ 157 (456) ....+.+++.. | ........+....+.+|.+|+++ ..+..|.+ .+..++|..+.+........... .+. T Consensus 75 ~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~---~~~ 151 (409) T protein:vir:84 75 IPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWI---EPV 151 (409) T ss_pred cccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEE---EEE Confidence 11223444432 2 23466677888999999999876 46777775 58889999887665432221111 111 Q ss_pred EecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHH Q lcl|NC_021301. 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRI 237 (456) Q Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~ 237 (456) ...++ ..|..+.+.++.. ..+ .....|.|.++.....++.. T Consensus 152 ~~~~g-----~~~~~~dvih~~~-------------------------------~~~---~~~~~G~s~i~~~~~~i~~~ 192 (409) T protein:vir:84 152 YRIDG-----KVVPNHRIMHIKR-------------------------------YPV---AGCALGMSPIEKAASAIGLG 192 (409) T ss_pred ecCCc-----eEEchhhEEEecC-------------------------------CCC---CcccccccHHHHHHHHHHHH Confidence 11111 1122222222210 000 01124677666544444433 Q ss_pred HHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhhhhccceeccCCCceeEeecccch-HHHHHHHHHH Q lcl|NC_021301. 238 NRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFEAAPGALWELPPGVDIWESQTNDF-TPMLSAIKEH 315 (456) Q Consensus 238 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~l~~~ 315 (456) ..+..-..+...-.+.|-.+++.- . ...++....+ ..........+.++.++.+.++.++...+. ..|++..+.. 
T Consensus 193 ~~~~~~~~~~f~ng~~p~gil~~~-~--~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~ 269 (409) T protein:vir:84 193 LAAERYGLRWFRDSANPSGILSSD-A--DLTPDQVKQTQKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQ 269 (409) T ss_pred HHHHHHHHHHHhcCCCccEEEecC-C--CCCHHHHHHHHHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHH Confidence 322111111111122344444321 1 1111111111 111111123455677888889988764332 2377778888 Q ss_pred HHHHHhhcCCChhhhccccc-CcHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecC Q lcl|NC_021301. 316 IRQLSSATKTPLPMLMPDSA-NQSAEGA-----HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFES 389 (456) Q Consensus 316 ~~~i~~~~~~p~~~~~~~~~-N~Sg~Al-----~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~ 389 (456) +.+|+.+-|+|+..+|.... +.++..+ .+....|.-.+...+..|..- + .....+++.+.. T Consensus 270 ~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~~l~P~~~~ie~~l~~~--------L-----~~g~~i~fd~~~ 336 (409) T protein:vir:84 270 RSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRHTLLPWLRCIEQALDTF--------L-----PRGQFVKFNVDG 336 (409) T ss_pred HHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHh--------c-----cCCCeEEEechh Confidence 99999999999999874322 2222222 222223333332222222211 1 123445666666 Q ss_pred CCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHHHH-HHHhhhhhhhcccccCC Q lcl|NC_021301. 390 PDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAREQI-TLFAGNSVQRPQEDGSR 456 (456) Q Consensus 390 ~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~ee~-~~~~~~~~~~~~~d~~~ 456 (456) ..-.|.++.++++.+++++|+++.-.+++.+|+.|-+--. .-...+.... 
....+...+....+|+| T Consensus 337 l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 337 LMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVPLGYVPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccccccCCccccCcCCCCCCccCCCC Confidence 6778999999999999999999998899999887642110 0011110000 00011112233446666 No 137 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=99.18 E-value=1.6e-10 Score=74.28 Aligned_cols=406 Identities=11% Similarity=-0.008 Sum_probs=176.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--HhcccCcccccCc-ccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC- Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLAR--YSNGDAPLPELTR-NTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA- 82 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~--YY~g~~~i~~~~~-~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~- 82 (456) +=++..|..+..... ...... +-........... ......-....-+.+.=...+|+.+++-+..-|+.+-... T Consensus 1 Mg~~~~l~~r~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~ 78 (457) T protein:vir:13 1 MGFWSALFGRGHSPA--LDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRG 78 (457) T ss_pred Cchhhhhhccccccc--ccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecC Confidence 223333322211110 000000 0000000000000 0000000000011122234577777777777777642211 Q ss_pred ---cccHHHHHHHHHHh--cC--hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 83 ---DSDLALRARRIWRD--NR--MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 83 ---d~~~~~~l~~~~~~--n~--~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) .......+.+.++. |. ...+...+....+.+|.||+.+-.+ .|++ .+..++|..+.+..+.........+. 
T Consensus 79 ~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~~~~~ 157 (457) T protein:vir:13 79 GSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRRKVFE 157 (457) T ss_pred CcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccceeEE Confidence 11112223444332 12 2356677888899999999988655 4554 58889999988776544432222222 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC-CCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN-PDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n-~~g~s~~~~v~~l 233 (456) .|....++.......|.++.++++... + ..+ -.|.|.++..... T Consensus 158 ~y~~~~~~~~~~~~~~~~~diih~~~~--------------------------~---------~~~~~~G~s~i~~~~~~ 202 (457) T protein:vir:13 158 AYDIDADGNEVLLGWFTPRDVLHIPGM--------------------------M---------LPGDFVGCSPISYARES 202 (457) T ss_pred EEEEecCCceeeEEeeCccceEEecCC--------------------------C---------CCCccccccHHHHHHHH Confidence 222233343333334445554443210 0 001 2466666544444 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhh-hhh--hhccceeccCCCceeEeecccch-HHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYAS-IFE--AAPGALWELPPGVDIWESQTNDF-TPM 308 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~-~~~--~~~~~~~~~~~d~~~~~~~~~~~-~~~ 308 (456) |.....+..-......-.+.|..+++.- .. ..++.-..+ .... ... ...+.+..++.+.++.++...+. 
.-| T Consensus 203 i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~--ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~ 279 (457) T protein:vir:13 203 IGLALAAQKYGSKFFANGAMPGAVVEVP-GT--MSEEGLARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQF 279 (457) T ss_pred HHHHHHHHHHHHHHHhcCCCcceEEEcC-CC--CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHH Confidence 4333222211111112223344444321 11 111111111 1111 111 11245677888889888754322 236 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEe Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQ-SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSF 387 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f 387 (456) ++..+..+.+|+.+-++|+..+|....+. ++..++-....+...+ . .-+-..+++.+..-+--......+.+++.+ T Consensus 280 ~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~~f~~~t--l-~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~ 356 (457) T protein:vir:13 280 LQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNIAFTMFS--L-RPWLERIEAGFNRLLFAETADRFRFVKFNL 356 (457) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHHHHHHHH--H-HHHHHHHHHHHHHhhcCccccCceeEEeec Confidence 77778888999999999999997543222 2222222222221111 0 112222333222211111111223345555 Q ss_pred cCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH--H-H----HHHHHHH-------HHHHHhhhhhhhcccc Q lcl|NC_021301. 388 ESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK--Q-D----DLDRARE-------QITLFAGNSVQRPQED 453 (456) Q Consensus 388 ~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~--~-~----e~~~~~e-------e~~~~~~~~~~~~~~d 453 (456) ....-.|..+.++++.+++++|+++.--+++.+|+.|-+-. . . ......+ ......+.....++++ T Consensus 357 ~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 436 (457) T protein:vir:13 357 DEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEE 436 (457) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCC Confidence 56677799999999999999999998888999888653211 0 0 0000000 0000111111122222 Q ss_pred cCC Q lcl|NC_021301. 
454 GSR 456 (456) Q Consensus 454 ~~~ 456 (456) .++ T Consensus 437 ~~~ 439 (457) T protein:vir:13 437 PEP 439 (457) T ss_pred CCC Confidence 222 No 138 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=99.16 E-value=2.1e-10 Score=73.67 Aligned_cols=379 Identities=12% Similarity=0.012 Sum_probs=169.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccH Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~ 86 (456) +-|.+++.+....... -..++.+-.+-...+.......-.....+...-...+|+.+++-+..-|+.+...... T Consensus 1 M~~f~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~~p~~~~~~~~~-- 74 (386) T protein:vir:49 1 MPIFNITNLATESPPI----NQESFFDIADSDFLASLNSSEWVSAENALKNSDLFSIISQLSNDLATAKITTSRKQLQ-- 74 (386) T ss_pred CchhhhhccCCCCccc----chhhhhhhhhccccccccCCceechhhhhccHHHHHHHHHHHHHhhhCceeeccchhh-- Confidence 2233332221111000 0111111000000000000000000111223334457788888777778876532211 Q ss_pred HHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceE Q lcl|NC_021301. 87 ALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESD 165 (456) Q Consensus 87 ~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~ 165 (456) ..+.+-............+....+.+|.||+.+-.+.+|++ .+..++|..+.+..++.... +. +.+...+.... 
T Consensus 75 -~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~~~-~~---y~~~~~~~~~~ 149 (386) T protein:vir:49 75 -GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQNG-LY---YNITFDDPHIA 149 (386) T ss_pred -hhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCCce-EE---EEEEEcCcccc Confidence 11111111112345667788889999999999988888886 58889999988877654332 11 11111111111 Q ss_pred EEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 166 FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLL 245 (456) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~ 245 (456) ....+..+.++++.. +... ....|.|-+..+...++....+..-.. T Consensus 150 ~~~~~~~~evih~~~--------------------------~~~~--------~~~~G~s~l~~~~~~i~~~~~~~~~~~ 195 (386) T protein:vir:49 150 PKQHVPQNDILHFRL--------------------------LSVD--------GGLTSVSPLMALGREFNIQKASDKLTI 195 (386) T ss_pred ceeEEccccEEEecC--------------------------CCCC--------CccccccHHHHHHHHHHHHHHHHHHHH Confidence 112233333333210 0000 012466666655444443332222111 Q ss_pred HHHHHhhchhhhhhcCCCccccccccc-chhhhhhhhhhhccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhc Q lcl|NC_021301. 246 STMAIQAFRQRALKSAGHGLPKVDENG-NAIDYASIFEAAPGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSAT 323 (456) Q Consensus 246 ~~~~~~~~~~~~i~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~ 323 (456) +...-.+.|..+++--.. ..++.. .............+.++.++.+.++.++.... 
...+++..+....+|+.+- T Consensus 196 ~~~~ng~~~~~il~~~~~---~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~f 272 (386) T protein:vir:49 196 SALKNALNANGILKIKGG---GLLDFKTKVSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVY 272 (386) T ss_pred HHHHccCCccEEEEeCCC---CChHHHHHHHHHHHHhccCCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHh Confidence 112222334444432111 111111 11111122233455677788888998886332 2347888888999999999 Q ss_pred CCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHH Q lcl|NC_021301. 324 KTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAA 402 (456) Q Consensus 324 ~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~ 402 (456) |+|+..+|...++ .+++.++..+...+.-+ -+.+...+.+. + + ..+++......-.+..+.+..+ T Consensus 273 gVPp~~lg~~~~~~~~~~~~~~~~~~~i~~~---l~~i~~~~~~~----l---~----~~~~~~~~~~~~~d~~~~~~~~ 338 (386) T protein:vir:49 273 GIPESIVGGDGDQQSSLEMIYNIYFKSVSRY---LRPFVSEMSKK----L---S----CEVDVDISPAVDPTGSNYISLI 338 (386) T ss_pred CCCHHHhCCCCCccchHHHHHHHHHHHHHHH---HHHHHHHHHHH----h---c----chhcccchhhhccCHHHHHHHH Confidence 9999999865433 34444443332221111 01111111111 1 1 1122223333445666788888 Q ss_pred HHHHhcCCCcHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccc Q lcl|NC_021301. 403 SLAKAAGESWASIRRNIL---NYNADQIKQDDLDRAREQITLFAGNSVQRPQED 453 (456) Q Consensus 403 ~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d 453 (456) .+|..+|+++.-.+++++ |+.+.++...+.-. . ....+... 
.++| T Consensus 339 ~~l~~~g~~t~nE~r~~l~~~~~~~~~~~~~~~~~---~-~~~~gGd~--~~~~ 386 (386) T protein:vir:49 339 NSMVKSGTLAQNQGLYILQQAEILPKELPDGKNPN---R-TSLKGGEI--NEQD 386 (386) T ss_pred HHHHhCCCcCHHHHHHHHhhCCCCCCcCcchhccC---C-CCCCCCCC--CCCC Confidence 899999999988888765 45554433211000 0 00001011 1111 No 139 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=99.14 E-value=4.6e-10 Score=71.75 Aligned_cols=404 Identities=9% Similarity=0.008 Sum_probs=173.2 Q ss_pred CCCHHHHHHHHHH---------HHHH---------HHHHHHHHHHHhcccCccc--------ccC--------cccchhh Q lcl|NC_021301. 3 ASTPAEWLPVLTK---------RIDD---------GMSRVRLLARYSNGDAPLP--------ELT--------RNTSAAW 48 (456) Q Consensus 3 ~~t~~~~~~~l~~---------~~~~---------~~~r~~~~~~YY~g~~~i~--------~~~--------~~~~~~~ 48 (456) -+.-.-+++.+.. .|.. +.-.-+.+.++-.|+.... ..+ ...+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l 80 (551) T protein:vir:80 1 MKNKLGLFESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDL 80 (551) T ss_pred CchhhhhHHHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHH Confidence 1222333444331 0100 0111233444444433220 000 0111122 Q ss_pred hhhhhh-hccChHHHHHHHHHhhhcc-----------CCeecCC-C-------CcccHHHHHHHHHHhc---------Ch Q lcl|NC_021301. 49 RSFQRE-ARTNWGLMVRDSVADRIIP-----------NGITVGG-S-------ADSDLALRARRIWRDN---------RM 99 (456) Q Consensus 49 ~~~~~k-~~~n~~~~iVd~~a~~l~~-----------~~~~~~~-~-------~d~~~~~~l~~~~~~n---------~~ 99 (456) +...+. ......+.+|+..++.+.. -|+.+.. 
+ .+......+.+++..- .+ T Consensus 81 ~~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~ 160 (551) T protein:vir:80 81 HGVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSF 160 (551) T ss_pred HHHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchH Confidence 222112 2245677787777765532 2333211 1 1111222344554432 23 Q ss_pred hHHHHHHHHHHhhCCeEEEEEeeCCCCceE-EEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEE Q lcl|NC_021301. 100 DSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKF 178 (456) Q Consensus 100 ~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 178 (456) ..+...+..+.+.+|.||+.+-.+.+|++. +..++|..+.++.++... .....++|+...++... ..|..+.+.++ T Consensus 161 ~~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~-~~~~~~~y~~~~~g~~~--~~~~~~eiiH~ 237 (551) T protein:vir:80 161 SSFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGK-IPDNGNRFVQVIDQKIV--ATFNAREMAFA 237 (551) T ss_pred HHHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccc-cccCceEEEEEeCCcEE--EEEcccceEEe Confidence 456667888899999999999888999874 899999999888765432 11111223222222211 12333333332 Q ss_pred EEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chh Q lcl|NC_021301. 179 ARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQ 255 (456) Q Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~ 255 (456) .... ..+.. ...+|.|-++.....+.... .-......++. .|. 
T Consensus 238 ~~n~----------------------~~~~~---------~~~~G~spi~~a~~~i~~~~---a~~~~~~~~f~Ng~~p~ 283 (551) T protein:vir:80 238 VRNP----------------------RSDIY---------ATGYGYPELEIALKQFIAHE---NTEAFNDRFFSHGGTTR 283 (551) T ss_pred cccC----------------------CCCcc---------cccccccHHHHHHHHHHHHH---HHHHHHHHHHHcCCCcc Confidence 1100 00000 01246665654444443332 22222333333 233 Q ss_pred hhhhcCCCcccccccccchh-hhh-hhhh--hhccceec-cCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhh Q lcl|NC_021301. 256 RALKSAGHGLPKVDENGNAI-DYA-SIFE--AAPGALWE-LPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPM 329 (456) Q Consensus 256 ~~i~g~~~~~~~~~~~~~~~-~~~-~~~~--~~~~~~~~-~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~ 329 (456) .+|. +.......++....+ ... ..+. ...+.+.. ...+.++..+.... -..|++..+..+..|+.+-++|+.. T Consensus 284 giL~-~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~ 362 (551) T protein:vir:80 284 GILQ-IKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFEKWLNYLINVISALYGIDPAE 362 (551) T ss_pred eEEE-EcCCCCCCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhcCCHHH Confidence 3332 111111111111111 111 1111 12234333 35677887775322 2238888899999999999999999 Q ss_pred hcccccCc-----------H-HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCH Q lcl|NC_021301. 330 LMPDSANQ-----------S-AEGA--HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTL 395 (456) Q Consensus 330 ~~~~~~N~-----------S-g~Al--~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~ 395 (456) +|....+. | .+.. .+....|.-.+.. +++.+...+ .. .....+.+.|......+. 
T Consensus 363 lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~--------ie~~ln~~L--~~-~~~~~~~f~f~~~~~~~~ 431 (551) T protein:vir:80 363 INIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGF--------IEDFINKHI--VA-EFGDKYTFQFVGGDIKSE 431 (551) T ss_pred cCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHH--------HHHHHHhhh--cc-ccCCceEEEeeccChhhH Confidence 98432211 1 1111 1111112111211 122121111 11 112346777888777777 Q ss_pred HHHHHHHHHHHhcCCCcHHHHHHhCCCChh-HH-------------------HHHHHHHHHHHHHHHhh----hhhhhcc Q lcl|NC_021301. 396 GEKYAAASLAKAAGESWASIRRNILNYNAD-QI-------------------KQDDLDRAREQITLFAG----NSVQRPQ 451 (456) Q Consensus 396 ~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~-~~-------------------~~~e~~~~~ee~~~~~~----~~~~~~~ 451 (456) ++.+... ++..+|+++.--+++.+|+.|. +- ...+.++.++..+...+ ...+.++ T Consensus 432 ~~~~~~~-~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (551) T protein:vir:80 432 LESVKIL-AEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVE 510 (551) T ss_pred HHHHHHH-HHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCC Confidence 7777644 4667799999889999887652 10 00111111211111111 1111110 Q ss_pred -c-ccCC Q lcl|NC_021301. 452 -E-DGSR 456 (456) Q Consensus 452 -~-d~~~ 456 (456) + +++. T Consensus 511 ~~p~~~~ 517 (551) T protein:vir:80 511 DIPDGKD 517 (551) T ss_pred CCCCccc Confidence 0 1100 No 140 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=99.14 E-value=1.7e-10 Score=74.12 Aligned_cols=390 Identities=10% Similarity=0.063 Sum_probs=173.7 Q ss_pred CCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 
1 MTASTPA--EWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~--~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) =+..+|. +... .+..-|-+.... ..+....-.+.. .....+.....+|+.++.-+..-|+.+ T Consensus 7 ~~~~~p~~~~~~~--------------~~~~~~~~~~~~-g~~~~~~~~~~~-~~~~~~~~V~acV~~IA~~iA~lp~~l 70 (518) T protein:vir:78 7 QTLSAPAMAELSP--------------QMQDSYYYAPAV-GMQLERQFSLYG-GIYKNQPWVRTVIAKRAQALARLPVKC 70 (518) T ss_pred eeeccchhhhhhh--------------hhhhccccccee-ceecccccchhh-HHhhhhHHHHHHHHHHHHhhccCceEE Confidence 1222222 2211 122222221110 111111000000 001224466778888888887777664 Q ss_pred C---CCCcc-cHHHHHHHHHHh-cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCce Q lcl|NC_021301. 79 G---GSADS-DLALRARRIWRD-NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWR 149 (456) Q Consensus 79 ~---~~~d~-~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~ 149 (456) - .+... .....+..++.+ |. -..+...+....+.+|.||+++-++.+|.+ .+..++|..+.+..+...... T Consensus 71 ~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~~~~~~ 150 (518) T protein:vir:78 71 MFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNSRTGRY 150 (518) T ss_pred EEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcCCCCEE Confidence 2 11111 111223344443 32 235566788888999999999999999987 489999999998887654321 Q ss_pred EEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhH Q lcl|NC_021301. 150 IRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEP 229 (456) Q Consensus 150 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~ 229 (456) .+++...++.......+..+++.++... +..+ ...|.|-+.. 
T Consensus 151 ----~y~~~~~~~~~~~~~~~~~~eIiHir~~--------------------------~~dg--------~~~G~Spi~~ 192 (518) T protein:vir:78 151 ----EYYFQAGAGVGTQLVSFADDEVVPIRFF--------------------------NPDG--------LERGLSLMES 192 (518) T ss_pred ----EEEEEecCCccceeEEecCCcEEEecCC--------------------------CCCc--------ccccccHHHH Confidence 1122222222222222333333332100 0000 0135555543 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhh-hhh--hhccceeccCCCceeEeecccc- Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYAS-IFE--AAPGALWELPPGVDIWESQTND- 304 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~-~~~--~~~~~~~~~~~d~~~~~~~~~~- 304 (456) ....+.....+..-..+...-.+.|..+++.- .. ..++....+ .... .+. ...+.+..++.+.++..+.... T Consensus 193 ~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~-~~--ls~e~~~~~k~~~~~~~~G~~nag~~~vL~~G~~~~~l~~~~~ 269 (518) T protein:vir:78 193 LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KR--LSPEAQQRLREQFDRAHAGSSNTGKTMVVEEGMEPIPLQLTAV 269 (518) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CC--CCHHHHHHHHHHHHHHhcCcccCCceeEcCCCceEEeccCChh Confidence 33333322222111111112222344444321 11 111111111 1111 111 1234567788888888775432 Q ss_pred hHHHHHHHHHHHHHHHhhcCCChhhhccccc-CcH-HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_021301. 305 FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQS-AEGA--HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVE 380 (456) Q Consensus 305 ~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~S-g~Al--~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~ 380 (456) -..|++..+..+.+|+.+-++|+..+|.... +-| .+.. .+....+.-.+. .+++.+...+. ...... 
T Consensus 270 d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~f~~~tL~P~~~--------~ie~eln~~L~-~~~~~~ 340 (518) T protein:vir:78 270 EMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAFYRDTMAIPIA--------RIQSAMDKYVG-QYWVRK 340 (518) T ss_pred HHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHHHHHHHHHHHH--------HHHHHHHHhhc-ccccCc Confidence 2237787788889999999999999974321 111 1211 111112222222 22222221110 001112 Q ss_pred cceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHH-------HHHHHHHHH-HHhhhhhhhccc Q lcl|NC_021301. 381 DTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDD-------LDRAREQIT-LFAGNSVQRPQE 452 (456) Q Consensus 381 ~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e-------~~~~~ee~~-~~~~~~~~~~~~ 452 (456) +.+++.....+..|..+.++++.+++++|+++.--+++.+|+.+-+..-.. ...+....+ ...+.....+++ T Consensus 341 ~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~ 420 (518) T protein:vir:78 341 NRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYANSALQPLGATPDGAVEGEEAPAPKR 420 (518) T ss_pred ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccccccCCCCCCCCCC Confidence 334444445667899999999999999999999889999888653211000 000000000 000000000000 Q ss_pred ccCC Q lcl|NC_021301. 453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) .+++ T Consensus 421 ~~~~ 424 (518) T protein:vir:78 421 PAST 424 (518) T ss_pred CCcc Confidence 1111 No 141 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=99.13 E-value=8.4e-10 Score=70.33 Aligned_cols=391 Identities=14% Similarity=0.074 Sum_probs=173.1 Q ss_pred HHHHHHHHHHHHHH-HHHHhcccC-cccc------cC-cccchhhh-hhhhhhccChHHHHHHHHHhhhccCCeecCC-C Q lcl|NC_021301. 
13 LTKRIDDGMSRVRL-LARYSNGDA-PLPE------LT-RNTSAAWR-SFQREARTNWGLMVRDSVADRIIPNGITVGG-S 81 (456) Q Consensus 13 l~~~~~~~~~r~~~-~~~YY~g~~-~i~~------~~-~~~~~~~~-~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~-~ 81 (456) +.+-.++...+... +..| .|.. ...+ .+ ........ ....-+.+.-...+|+.+++-+..-|+.+-. . T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~g~~~s~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~~ 79 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKW-LGVPISLTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQTK 79 (437) T ss_pred CCcchhhhhhhhHHhhhhh-cCCcccCCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEEc Confidence 11112222222222 1222 2221 0000 00 00000000 0011122334455788888877766765411 1 Q ss_pred Ccc----cHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEE Q lcl|NC_021301. 82 ADS----DLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIR 151 (456) Q Consensus 82 ~d~----~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~ 151 (456) .+. .....+..++.. | ....+...+...++.+|.||+++-++ .|.+ .+..++|..+.+..+... . + T Consensus 80 ~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~-~g~~~~L~~l~p~~v~i~~~~~g-~-~- 155 (437) T protein:vir:10 80 PDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRS-AGVLIGLELMLPQRTTVKRLTSG-A-L- 155 (437) T ss_pred CCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEECCCC-e-E- Confidence 111 111223444432 2 23456677888999999999999888 4776 488899999887765432 1 1 Q ss_pred EEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHH Q lcl|NC_021301. 152 SAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHI 231 (456) Q Consensus 152 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~ 231 (456) ..++...+|.. ..+..+.+.++.. + .++-..|.|-++.+. 
T Consensus 156 --~y~~~~~~g~~---~~~~~~dIih~r~--------------------------------~---~~d~~~G~spi~~~~ 195 (437) T protein:vir:10 156 --QYTYRNVDGTV---STLAEDDVFHVRG--------------------------------F---SLDGLMGLTPIQYAR 195 (437) T ss_pred --EEEEEecCceE---EEEccccEEEecC--------------------------------c---CCCCcccccHHHHHH Confidence 11223333322 1223333332210 0 001124666655444 Q ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhh------hccceeccCCCceeEeecccch Q lcl|NC_021301. 232 DIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEA------APGALWELPPGVDIWESQTNDF 305 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~d~~~~~~~~~~~ 305 (456) ..++....+..-......-.+.|..+++.- .. ..++....+ ...+.. ..+.+..++.+.++.++..... T Consensus 196 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~--l~~e~~~~~--~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~ 270 (437) T protein:vir:10 196 EVLGNSTAANKTSASVFRNGLRPSGVLSTD-QI--LQKEKRAEI--RTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPG 270 (437) T ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEcC-CC--CCHHHHHHH--HHHHHHHhcCccccCcceeccCCceEEeccCChh Confidence 333322222211111112222344444321 11 111111111 111111 1245667788889888864332 Q ss_pred -HHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccce Q lcl|NC_021301. 
306 -TPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTV 383 (456) Q Consensus 306 -~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i 383 (456) ..|++..+....+|+.+-|+|+..+|....+ ..+..++.....+...| -.-+-..+++.+...+-..+......+ T Consensus 271 d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~t---l~P~~~~ie~~l~~kll~~~e~~~~~~ 347 (437) T protein:vir:10 271 DVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLGFLTFT---LRPWLTRIEQAARRSLLRPGERDQFYA 347 (437) T ss_pred hHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHHH---HHHHHHHHHHHHHhhccCccccCceEE Confidence 2378888888899999999999999754322 11122222222211111 011112222222211111111122334 Q ss_pred eEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHH-HH-HHHHHHHHHHHHHhhhhhhhccc--------- Q lcl|NC_021301. 384 DVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQI-KQ-DDLDRAREQITLFAGNSVQRPQE--------- 452 (456) Q Consensus 384 ~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~-~~-~e~~~~~ee~~~~~~~~~~~~~~--------- 452 (456) ++.+...+..|..+.++++.++..+|+++.--+++.+|+.|-+- .. +-...--..++...+.......+ T Consensus 348 ~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 427 (437) T protein:vir:10 348 EFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAAQDALKAWLYQ 427 (437) T ss_pred EEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCCCcchhccccccCCC Confidence 55555567778999999999999999999999999988865321 00 00000000011111111111112 Q ss_pred ------ccCC Q lcl|NC_021301. 
453 ------DGSR 456 (456) Q Consensus 453 ------d~~~ 456 (456) +.+| T Consensus 428 ~~~~~~~~e~ 437 (437) T protein:vir:10 428 EEKTRATQER 437 (437) T ss_pred CCCCCccccC Confidence 2222 No 142 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=99.12 E-value=6.3e-10 Score=71.03 Aligned_cols=387 Identities=13% Similarity=0.037 Sum_probs=174.0 Q ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCC-Cc- Q lcl|NC_021301. 7 AEWLPVLTKRIDDG-MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS-AD- 83 (456) Q Consensus 7 ~~~~~~l~~~~~~~-~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~-~d- 83 (456) +-+++.|.++.... ......+...+.+...-. .+..+..+ .-+.+.-...+|+.+++-+..-|+.+-.. .+ T Consensus 1 Mg~f~~lf~r~~~~~~~~~~~~~~~~~~~~~~~-~g~~v~~~-----~al~~~~v~~~i~~Ia~~ia~~p~~~~~~~~~~ 74 (414) T protein:vir:44 1 MVFFSGLFQRKSDAPVTTPAELADAIGLSYDTY-TGKQISSQ-----RAMRLTAVFSCVRVLAESVGMLPCNLYHLNGSL 74 (414) T ss_pred CchhhhhhccCccCcccchhhHhHhhccCcccc-CCceechh-----hhhccHHHHHHHHHHHHHhccCceEEEEecCCc Confidence 33444443332111 011111222222211100 01111110 11223445668888888887777664211 11 Q ss_pred --ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 84 --SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 84 --~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) ......+..++.. | ........+....+.+|.||+++..+ +|.+ .+..++|..+.+.+++... 
+ ++ T Consensus 75 ~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~--~---~y 148 (414) T protein:vir:44 75 KQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPKLNSSWE--P---VY 148 (414) T ss_pred eeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEEECCCCc--E---EE Confidence 1111223344331 2 23456677888999999999998776 5766 5888999999888765422 1 11 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liD 235 (456) .+...+|... .+..+.++++.. ++ +....|.|-+......++ T Consensus 149 ~~~~~~g~~~---~~~~~evih~~~--------------------------------~~---~d~~~G~s~i~~~~~~i~ 190 (414) T protein:vir:44 149 QVTFPDGSTD---VLSQEDIWHVRT--------------------------------LT---LDGLVGLNPIAYAREAIS 190 (414) T ss_pred EEEecCceEE---EEccccEEEecC--------------------------------CC---CCCcccccHHHHHHHHHH Confidence 2222333221 233333333210 00 011246666654444333 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhh-hhhh--hhccceeccCCCceeEeecccch-HHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYA-SIFE--AAPGALWELPPGVDIWESQTNDF-TPMLS 310 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~-~~~~--~~~~~~~~~~~d~~~~~~~~~~~-~~~~~ 310 (456) ....+..-..+...-.+.|..+++. .. ...++.-..+ ... .... ...+.+..++.+.++.++..... 
..|++ T Consensus 191 ~~~~~~~~~~~~f~ng~~p~gil~~-~~--~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e 267 (414) T protein:vir:44 191 LAAATEEHGARLFSNGAVTSGVLRT-EQ--TLSDQAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLE 267 (414) T ss_pred HHHHHHHHHHHHHhccCCCceEEEe-CC--CCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHH Confidence 3322221111111112234333332 11 1111111111 111 1111 12244667788888887754322 23778 Q ss_pred HHHHHHHHHHhhcCCChhhhcccc-cC-cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEE Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVS 386 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~ 386 (456) ..+....+|+.+-|+|+..+|... ++ ++.+... +....|.- +-..+++.+...+--......+.+++. T Consensus 268 ~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~~~~~~l~P--------~~~~ie~~ln~~L~~~~~~~~~~i~fd 339 (414) T protein:vir:44 268 TRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVP--------YLTRIEQRINTGLVRKSKQGVFYAKFN 339 (414) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHH--------HHHHHHHHHHhhcCCccccCceEEEEe Confidence 788888999999999999997532 12 2222211 11112211 112222222221111111112234444 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHH--HhhhhhhhcccccCC Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITL--FAGNSVQRPQEDGSR 456 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~--~~~~~~~~~~~d~~~ 456 (456) +...+..|..+.++++.++.++|+++.-.+++.+|+.|.+--..-. ....... ........+.++++. 
T Consensus 340 ~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~--~~~n~~~~~~~~~~~~~~~~~~~~ 409 (414) T protein:vir:44 340 AGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYL--TPMNMTTKPSDGSKAGKQKDNANA 409 (414) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceec--ccccccccCCccccCCCCCCCCCC Confidence 4455667889999999999999999999999999987642111000 0000000 000111111122222 No 143 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=99.12 E-value=1.4e-09 Score=69.16 Aligned_cols=389 Identities=11% Similarity=0.003 Sum_probs=172.8 Q ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcc Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSR--VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r--~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~ 84 (456) +=|..+..++-. ..+. ....-...-|..... ...+.+ ..-+.+.-.-.+|+.+++-+..-|+++..+... T Consensus 1 Mg~f~~~~~r~~-~~~~~~~~~~~~~~~~~~~~~--~~~~~~-----~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~ 72 (416) T protein:vir:45 1 MGIFYKNEKRDL-QYNEDDLQMMVQTLPGFQGTK--LRQYKD-----IEAIRHSDIFTAVMMIASDLARMPIRVTVNGQI 72 (416) T ss_pred CCcccccccccc-cCCCcchhHHHHHhccccccC--ccccch-----hhhhcchHHHHHHHHHHHhhccCceEEecCccc Confidence 111111100000 0000 000001111111000 000000 000111122337788888887778876543222 Q ss_pred cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 85 DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 85 ~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) .....+..++.. |. ...+...+....+.+|.||+++.++.+|.+ .+..++|..+.+..|... + +... .... 
T Consensus 73 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g-~-~~~~-~~~~ 149 (416) T protein:vir:45 73 NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARG-R-LYYF-HQRI 149 (416) T ss_pred cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCc-c-EEEE-EEEe Confidence 222334444432 32 235566788888999999999999999987 488999999988876442 2 1111 1111 Q ss_pred ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHH Q lcl|NC_021301. 159 DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRIN 238 (456) Q Consensus 159 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~ 238 (456) +..+... ...|..+.+.++.. . ......|.|-++.....++... T Consensus 150 ~~~~~~~-~~~~~~~evihir~--------------------------------~---~~d~~~G~s~i~~~~~~i~~~~ 193 (416) T protein:vir:45 150 DSNGNNI-ERNVKFEDMLDIKF--------------------------------Y---SLDGINGLSLLDTLSRTIESDN 193 (416) T ss_pred cCCCcee-EEEEccccEEEecc--------------------------------C---CCCCccccCHHHHHHHHHHHHH Confidence 1122111 11233333322210 0 0011246666665544444332 Q ss_pred HHHHHHHHHHHHhhchhhhhhcCCCccccccc-ccchh-hhh-hhhh--hhccceeccCCCceeEeecccc-hHHHHHHH Q lcl|NC_021301. 239 RAELQLLSTMAIQAFRQRALKSAGHGLPKVDE-NGNAI-DYA-SIFE--AAPGALWELPPGVDIWESQTND-FTPMLSAI 312 (456) Q Consensus 239 ~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~-~~~~~-~~~-~~~~--~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l 312 (456) ....-......-.+.|..+++- ... ..++ ....+ ... ..+. ...+.+..++.+.++.++.... ...|++.. 
T Consensus 194 ~~~~~~~~~f~ng~~~~gil~~-~~~--~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~ 270 (416) T protein:vir:45 194 NGKDFLNNFLRNGTHAGGILKM-KGV--LDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIREN 270 (416) T ss_pred HHHHHHHHHHhccCCCcEEEEe-CCC--CCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHH Confidence 2211111111222234343331 111 1111 11111 111 1111 1124456777888887775432 22377777 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEK-GFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~-~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~ 391 (456) +....+|+.+-|+|+..+|...++.|.+.....+. .|.-.+. .+++.+... +........+++.+.... T Consensus 271 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~l~P~~~--------~ie~~ln~~--l~~~~~~~~~~f~~~~l~ 340 (416) T protein:vir:45 271 KSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKPYIT--------CVCAELNFK--FNDEYVNREFKFDTTEIR 340 (416) T ss_pred HHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHHHHH--------HHHHHHhhh--ccccccCceEEEechhhh Confidence 88889999999999999986544433332222111 1111111 111111111 111122334555555556 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH----------HHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD----------DLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~----------e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) -.|..+.+++..+++++|+++.--+++.+|+.|-+--.. ..+.. ++...........+-..|+. 
T Consensus 341 ~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~-~~~~~~~~~~~~~~~kgGe~ 414 (416) T protein:vir:45 341 VVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV-DEYQMNKSRATDKKLKGGEE 414 (416) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc-cccCcccccccccccCCCCC Confidence 678999999999999999999999999998865321110 00100 00000000111111122222 No 144 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=99.12 E-value=1.4e-09 Score=69.16 Aligned_cols=389 Identities=11% Similarity=0.003 Sum_probs=172.8 Q ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcc Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSR--VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS 84 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r--~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~ 84 (456) +=|..+..++-. ..+. ....-...-|..... ...+.+ ..-+.+.-.-.+|+.+++-+..-|+++..+... T Consensus 1 Mg~f~~~~~r~~-~~~~~~~~~~~~~~~~~~~~~--~~~~~~-----~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~ 72 (416) T protein:vir:81 1 MGIFYKNEKRDL-QYNEDDLQMMVQTLPGFQGTK--LRQYKD-----IEAIRHSDIFTAVMMIASDLARMPIRVTVNGQI 72 (416) T ss_pred CCcccccccccc-cCCCcchhHHHHHhccccccC--ccccch-----hhhhcchHHHHHHHHHHHhhccCceEEecCccc Confidence 111111100000 0000 000001111111000 000000 000111122337788888887778876543222 Q ss_pred cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 85 DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 85 ~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) .....+..++.. |. ...+...+....+.+|.||+++.++.+|.+ .+..++|..+.+..|... + +... .... 
T Consensus 73 ~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g-~-~~~~-~~~~ 149 (416) T protein:vir:81 73 NYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARG-R-LYYF-HQRI 149 (416) T ss_pred cccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCCc-c-EEEE-EEEe Confidence 222334444432 32 235566788888999999999999999987 488999999988876442 2 1111 1111 Q ss_pred ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHH Q lcl|NC_021301. 159 DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRIN 238 (456) Q Consensus 159 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~ 238 (456) +..+... ...|..+.+.++.. . ......|.|-++.....++... T Consensus 150 ~~~~~~~-~~~~~~~evihir~--------------------------------~---~~d~~~G~s~i~~~~~~i~~~~ 193 (416) T protein:vir:81 150 DSNGNNI-ERNVKFEDMLDIKF--------------------------------Y---SLDGINGLSLLDTLSRTIESDN 193 (416) T ss_pred cCCCcee-EEEEccccEEEecc--------------------------------C---CCCCccccCHHHHHHHHHHHHH Confidence 1122111 11233333322210 0 0011246666665544444332 Q ss_pred HHHHHHHHHHHHhhchhhhhhcCCCccccccc-ccchh-hhh-hhhh--hhccceeccCCCceeEeecccc-hHHHHHHH Q lcl|NC_021301. 239 RAELQLLSTMAIQAFRQRALKSAGHGLPKVDE-NGNAI-DYA-SIFE--AAPGALWELPPGVDIWESQTND-FTPMLSAI 312 (456) Q Consensus 239 ~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~-~~~~~-~~~-~~~~--~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l 312 (456) ....-......-.+.|..+++- ... ..++ ....+ ... ..+. ...+.+..++.+.++.++.... ...|++.. 
T Consensus 194 ~~~~~~~~~f~ng~~~~gil~~-~~~--~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~ 270 (416) T protein:vir:81 194 NGKDFLNNFLRNGTHAGGILKM-KGV--LDNKKARDRAREEFHKSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIREN 270 (416) T ss_pred HHHHHHHHHHhccCCCcEEEEe-CCC--CCCHHHHHHHHHHHHHHhcCccccCceeecCCCceeEeccCCHHHHHHHHHH Confidence 2211111111222234343331 111 1111 11111 111 1111 1124456777888887775432 22377777 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEK-GFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPD 391 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~-~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~ 391 (456) +....+|+.+-|+|+..+|...++.|.+.....+. .|.-.+. .+++.+... +........+++.+.... T Consensus 271 ~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~l~P~~~--------~ie~~ln~~--l~~~~~~~~~~f~~~~l~ 340 (416) T protein:vir:81 271 KSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKPYIT--------CVCAELNFK--FNDEYVNREFKFDTTEIR 340 (416) T ss_pred HHHHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHHHHH--------HHHHHHhhh--ccccccCceEEEechhhh Confidence 88889999999999999986544433332222111 1111111 111111111 111122334555555556 Q ss_pred CcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH----------HHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 392 RVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD----------DLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 392 ~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~----------e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) -.|..+.+++..+++++|+++.--+++.+|+.|-+--.. ..+.. ++...........+-..|+. 
T Consensus 341 ~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~-~~~~~~~~~~~~~~~kgGe~ 414 (416) T protein:vir:81 341 VVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV-DEYQMNKSRATDKKLKGGEE 414 (416) T ss_pred ccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc-cccCcccccccccccCCCCC Confidence 678999999999999999999999999998865321110 00100 00000000111111122222 No 145 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=99.11 E-value=1.1e-09 Score=69.59 Aligned_cols=383 Identities=12% Similarity=0.093 Sum_probs=170.9 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVR----LLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA 82 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~----~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~ 82 (456) +=|.+++......+.+... .+..+.-|.. -. ...-+.+.-...+|+.+++-+..-|+.+-... T Consensus 1 MG~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~-------~~------~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~ 67 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNETVDMTNPLLLQWLGVDP-------DT------PRNQLSEATYFACLKILSESLGKLPLKMYQKT 67 (411) T ss_pred CchHHHHHhhccCcccccccchHHHHHHhcCcc-------cC------hhhhhccHHHHHHHHHHHHhHhhCceeEEEec Confidence 2233333222111111000 0011110100 00 00112233345678888887777776652110 Q ss_pred -c---ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 83 -D---SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 83 -d---~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) + ......+.+++.. | ....+...+....+.+|.||+++-.+ +|.+ .+..++|..+.++.|+........ 
T Consensus 68 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~-~g~~~~l~~l~~~~v~~~~~~~~~~~~~~ 146 (411) T protein:vir:81 68 ERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYS-GPQLQALWILPSQYVTIVVDDRGLLGEKN 146 (411) T ss_pred CCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCceEEEEEECCceEEEEEcCcccccccc Confidence 1 1111224444432 3 23466677888999999999998887 4554 588899999998887543211111 Q ss_pred EE-EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHH Q lcl|NC_021301. 153 AM-RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHI 231 (456) Q Consensus 153 ~~-~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~ 231 (456) .+ ..+....+ +.. ..+..+.+.++.. . ++ .+...|.|-+..+. T Consensus 147 ~~~~~~~~~~~-g~~-~~~~~~eiih~k~------------------------------~-~~---~~~~~G~s~~~~~~ 190 (411) T protein:vir:81 147 AIWYRYNDPYD-GKM-YVFRNDEILHFKT------------------------------S-VT---FDGITGLSVRDVLK 190 (411) T ss_pred eEEEEEEecCC-ceE-EEEccccEEEEcC------------------------------C-CC---CCCcccccHHHHHH Confidence 11 11111111 111 1123333332210 0 00 01124666555444 Q ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-hhhhhh-hh--hhccceeccCCCceeEeecccch-H Q lcl|NC_021301. 232 DIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDYASI-FE--AAPGALWELPPGVDIWESQTNDF-T 306 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~~~~-~~--~~~~~~~~~~~d~~~~~~~~~~~-~ 306 (456) ..++....+..-..+...-.+.|..+++.-. . ..++.... ...... .. ...+.++.++.+.++.++..... . 
T Consensus 191 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~--l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~ 267 (411) T protein:vir:81 191 HTVDGALESQKFMNNLYKTGLTGKAVLEYTG-D--LNQEARDRLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDS 267 (411) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceEEEeCC-C--CCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHH Confidence 4333333222211111122223444443311 1 11111111 111111 11 12345677888888888754322 3 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCccc Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVED 381 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~ 381 (456) .+++..+....+|+.+-|+|+..+|... ++ .+.+... +....|.- +-..+++.+.. ++.-....... T Consensus 268 q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~l~P--------~~~~ie~~l~~~ll~~~~~~~~~ 339 (411) T protein:vir:81 268 QFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQNLAFYVDTLLY--------VLKQYEEEITYKILSNDLISQGH 339 (411) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHHHHHHHHHHHH--------HHHHHHHHHHhhcCChhhcCCCc Confidence 4677778889999999999999997432 22 2222221 11111111 11122222211 11001112233 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH---HHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ---DDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~---~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .+++.+..-+-.|..+.++++.++.++|+++.--+++.+|+.|.+--. +..... .++...++.. 
..|+- T Consensus 340 ~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~~~~~n~~--pl~~~~~~~~----kgGd~ 411 (411) T protein:vir:81 340 YFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNLMANGNYI--PLSMLGANYG----KGGDS 411 (411) T ss_pred EEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCcc--chhhhhhhhc----cCCCC Confidence 355555555677899999999999999999998899999987643110 000000 0111111100 11111 No 146 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=99.08 E-value=2.1e-09 Score=68.16 Aligned_cols=392 Identities=14% Similarity=0.028 Sum_probs=175.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH---HHHHHhcccC-cc----cccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVR---LLARYSNGDA-PL----PELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~---~~~~YY~g~~-~i----~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |-++--.=++.++..-+....+.-. .......+.. .+ -..+..+. ...-+.+.-...+|+.+++-+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~-----~~~al~~~~V~~~i~~Ia~~ia 75 (432) T protein:vir:10 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVN-----ADAIMRLDAVAACVKLVSQAIA 75 (432) T ss_pred CCCCcccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccc-----hhhhhcchHHHHHHHHHHHhhh Confidence 5554444444444333322111000 0000000000 00 00000000 0111233445568888888887 Q ss_pred cCCeecCC-CCcc---cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEE Q lcl|NC_021301. 73 PNGITVGG-SADS---DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSV 142 (456) Q Consensus 73 ~~~~~~~~-~~d~---~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~ 142 (456) .-|+.+-. ..+. .....+..++.. |. -..+...+...++.+|.||+++..+ +|++ .+..++|..+.++. 
T Consensus 76 ~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~~~~v~v~~ 154 (432) T protein:vir:10 76 AMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITT 154 (432) T ss_pred hCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEEEE Confidence 77876421 1111 111224444432 32 2355667888899999999988776 5664 58889999998887 Q ss_pred eCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCC Q lcl|NC_021301. 143 DPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPD 222 (456) Q Consensus 143 d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~ 222 (456) |+... + ..++...+|... .+..+.++++.. . ..+-.. T Consensus 155 ~~~g~--~---~y~~~~~~g~~~---~~~~~~iih~~~-------------------------------~----~~dg~~ 191 (432) T protein:vir:10 155 DTKGN--T---AYRYRRTDGQMI---DIPKQQIWKIMG-------------------------------Y----SLDGEN 191 (432) T ss_pred cCCCc--E---EEEEEecCceEE---EEcCccEEEecC-------------------------------C----CCCCcc Confidence 65422 1 112223333211 122333322210 0 001113 Q ss_pred CCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhhhhhhhhh--hccceeccCCCcee Q lcl|NC_021301. 223 GMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAIDYASIFEA--APGALWELPPGVDI 297 (456) Q Consensus 223 g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~d~~~ 297 (456) |.|-++.... .+...+.-......++. .|-.+++. .. ...++.-..+ ...+.. ..+.+..++.+.++ T Consensus 192 G~spi~~~~~---~i~~~~~~~~~~~~~f~ng~~~~gil~~-~~--~l~~e~~~~~--~~~~~~~~nag~~~vl~~g~~~ 263 (432) T protein:vir:10 192 GLSAIRYGAQ---IFGTAIAAEAQAARAFRNGQLQSVYYQI-DR--FLTDDQYDSF--AKKVSGSVEAGRAPLLEGGMDV 263 (432) T ss_pred cccHHHHHHH---HHHHHHHHHHHHHHHHhcCCCcceEEec-CC--CCCHHHHHHH--HHHHhhhhhCCCceecCCCceE Confidence 5555554333 33332222222223332 23333332 11 1111111111 111221 23456678888898 Q ss_pred Eeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccC--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021301. 
298 WESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN--QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI 374 (456) Q Consensus 298 ~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N--~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 374 (456) .++.-.. -..|++..+....+|+.+-|+|+..+|....+ ..+..++.....+...+ -.-+-..|++.+..-+- T Consensus 264 ~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~f~~~t---l~P~~~~ie~~ln~kL~- 339 (432) T protein:vir:10 264 KSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLSMT---LSPWLRRIEQSIALNLL- 339 (432) T ss_pred EEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHHHHHHH---HHHHHHHHHHHHHhhhc- Confidence 8875432 22377878889999999999999999754221 11222322222221111 01111222222221111 Q ss_pred cCCCcccceeEEec--CCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH--HH-------HHHHHHHHHHHHh Q lcl|NC_021301. 375 EGESVEDTVDVSFE--SPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK--QD-------DLDRAREQITLFA 443 (456) Q Consensus 375 ~~~~~~~~i~v~f~--~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~--~~-------e~~~~~ee~~~~~ 443 (456) .........+.|. ..+-.|..+.++++.++.++|+++.--+++.+|+.|-+-. .. -.+...++. .. T Consensus 340 -~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~~~~~~~~~~~~pl~~~~~~~--~~ 416 (432) T protein:vir:10 340 -SPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGLQA--SP 416 (432) T ss_pred -CccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhcccC--CC Confidence 1111122334454 4456788999999999999999999999999988653211 00 011111100 01 Q ss_pred hhhhhhcccc---cCC Q lcl|NC_021301. 
444 GNSVQRPQED---GSR 456 (456) Q Consensus 444 ~~~~~~~~~d---~~~ 456 (456) +.....++++ -+| T Consensus 417 ~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 417 EPASGLGNQQQDKVSK 432 (432) T ss_pred CCCCCCCCcccccccC Confidence 1111111111 122 No 147 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=99.07 E-value=2.2e-09 Score=68.09 Aligned_cols=394 Identities=14% Similarity=0.031 Sum_probs=178.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHH----------HHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRV----------RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~----------~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~ 70 (456) |++.+.+-+..++..-+....+.- ....+.+ |-.. -..+..+.. ..-+.+.-...+|+.+++- T Consensus 1 ~~~~~~mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~g~~v~~-----~~al~~~~V~~~i~~Ia~~ 73 (432) T protein:vir:81 1 MPDEKKLGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDL-GIII-SDTGAAVNA-----DAIMRLDAVAACVKLVSQA 73 (432) T ss_pred CCchhhcchhhhhhhhcccccccccccccccccCccchhhh-cccc-cccCcccch-----HhhhccHHHHHHHHHHHHh Confidence 888777777776644443221100 0000000 0000 000000000 0112233345578888887 Q ss_pred hccCCeecCC-CCc---ccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEE Q lcl|NC_021301. 71 IIPNGITVGG-SAD---SDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVV 140 (456) Q Consensus 71 l~~~~~~~~~-~~d---~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~ 140 (456) +..-|+.+-. ..+ ......+..++.. |. 
-..+...+...++.+|.||+++..+ +|++ .+..++|..+.+ T Consensus 74 ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l~~~~v~v 152 (432) T protein:vir:81 74 IAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTI 152 (432) T ss_pred hhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCCceEE Confidence 7777776421 111 1111224444432 32 2355667888899999999988776 4665 578899999988 Q ss_pred EEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC Q lcl|NC_021301. 141 SVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN 220 (456) Q Consensus 141 ~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n 220 (456) ..|+.. + + ...+...+|... .+..+.+.++.. . + .+- T Consensus 153 ~~~~~g-~-~---~y~~~~~~g~~~---~~~~~~iih~r~-------------------------------~-~---~dg 189 (432) T protein:vir:81 153 TTDPKG-N-T---AYRYRRTDGQMI---DIPKQQIWKIMG-------------------------------Y-S---LDG 189 (432) T ss_pred EECCCC-c-E---EEEEEecCceEE---EEccccEEEecC-------------------------------C-C---CCC Confidence 877543 2 1 112222333221 122233322210 0 0 001 Q ss_pred CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhhhhhhhh--hhccceeccCCCc Q lcl|NC_021301. 221 PDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAIDYASIFE--AAPGALWELPPGV 295 (456) Q Consensus 221 ~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~d~ 295 (456) -.|.|-++.... ++...+.-......++. .|..+++- .. ...++.-... ...+. ...+.+..++.+. 
T Consensus 190 ~~G~spi~~~~~---~i~~~~~~~~~~~~~f~ng~~~~gil~~-~~--~l~~e~~~~~--~~~~~~~~nag~~~vl~~g~ 261 (432) T protein:vir:81 190 ENGLSAIRYGAQ---IFGTAIAAEAQAARAFRNGQLQSVYYQI-DR--FLTDDQYDSF--AKKVSGSVEAGRAPLLEGGM 261 (432) T ss_pred cccccHHHHHHH---HHHHHHHHHHHHHHHHhcCCCcceEEec-CC--CCCHHHHHHH--HHHHhhhhcCCCceecCCCc Confidence 135555544333 33332222222223333 23222221 11 1111111111 11121 1235567788888 Q ss_pred eeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccC--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 296 DIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN--QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 296 ~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N--~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) ++.++.-.. -..+++..+..+.+|+.+-++|+..+|....+ ..+..++.....+...+ -.-+-..|++.+..-+ T Consensus 262 ~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~t---l~P~~~~ie~~l~~kL 338 (432) T protein:vir:81 262 DVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMT---LSPWLRRIEQSIALNL 338 (432) T ss_pred eEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHHH---HHHHHHHHHHHHHhhc Confidence 988875432 23477778889999999999999999754221 12222322222221111 0111122222222211 Q ss_pred HhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH--HHHHHHHHHHHHHHhhh----- Q lcl|NC_021301. 373 QIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK--QDDLDRAREQITLFAGN----- 445 (456) Q Consensus 373 ~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~--~~e~~~~~ee~~~~~~~----- 445 (456) --......+.+++.+...+..|..+.++++.++.++|+++.--+++.+|+.|-+-. ..-.....--++..++. 
T Consensus 339 l~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~~~~~~~ 418 (432) T protein:vir:81 339 LSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGLQASPEP 418 (432) T ss_pred cCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecCcccchhhhccCCCCCC Confidence 11111122233433344466789999999999999999999999999988653211 00000000000111111 Q ss_pred hhhhcccccCC Q lcl|NC_021301. 446 SVQRPQEDGSR 456 (456) Q Consensus 446 ~~~~~~~d~~~ 456 (456) .....+++.++ T Consensus 419 ~~~~~n~~~~~ 429 (432) T protein:vir:81 419 ASGLGNQQQDK 429 (432) T ss_pred CCCCCCccccc Confidence 11111222222 No 148 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=99.04 E-value=4.1e-09 Score=66.59 Aligned_cols=391 Identities=10% Similarity=-0.014 Sum_probs=170.1 Q ss_pred HHHHHHHHHHHHHH-------HHHH-HHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 9 WLPVLTKRIDDGMS-------RVRL-LARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 9 ~~~~l~~~~~~~~~-------r~~~-~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) +.+.+...-....+ .+.. .....++-......+..+.. ..-+.+.=...+|+.+++-+..-|+.+-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~-----~~al~~~~V~~~v~~Ia~~iA~lp~~~~~ 75 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADP-----EAVLSFHAVFACISLISQDIAKMRLRLMQ 75 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccCh-----HHhhccHHHHHHHHHHHHhhccCceEEEE Confidence 22222211000000 0000 00000000000000000000 00011122344778877777777776421 Q ss_pred -CCcc----cHHHHHHHHHHh-cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceE Q lcl|NC_021301. 
81 -SADS----DLALRARRIWRD-NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRI 150 (456) Q Consensus 81 -~~d~----~~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~ 150 (456) ..+. .....+..++.+ |. .......+...++.+|.||+++-.+.+|.+ .+..++|..+.+++++.. . + T Consensus 76 ~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g-~-~ 153 (454) T protein:vir:93 76 TDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDG-E-V 153 (454) T ss_pred eccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCC-c-E Confidence 1111 111123334433 32 235667788899999999999999888887 588999999988876432 1 1 Q ss_pred EEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHH Q lcl|NC_021301. 151 RSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPH 230 (456) Q Consensus 151 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v 230 (456) .. ++....+........+..+++.++.. +. ......|.|-+... T Consensus 154 ~y--~~~~~~~~~~~~~~~~~~~eViH~k~------------------------------~~----~~~~~~G~sp~~~~ 197 (454) T protein:vir:93 154 FY--RITPDRNCGITEAVTVPAREVIHDRF------------------------------NC----FFHPLIGLPPVYAA 197 (454) T ss_pred EE--EEEeccccccceeEEecCcceEEecc------------------------------CC----CCCCceeccHHHHH Confidence 11 11111110001111233333333210 00 00112466666544 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhh--hhccceeccCCCceeEeecccc-hH Q lcl|NC_021301. 231 IDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWELPPGVDIWESQTND-FT 306 (456) Q Consensus 231 ~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~-~~ 306 (456) ...+.....+..-......-.+.|..+++- ... ..++....+ ....... ...+.+..++.+.++.++...+ -. 
T Consensus 198 ~~~i~~~~~~~~~~~~~f~ng~~p~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~ 274 (454) T protein:vir:93 198 GLAATQGHHIQENSTSFFRNGGRPSGVIEI-PGS--ITEENAKKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDS 274 (454) T ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEec-CCC--CCHHHHHHHHHHHHHHhcccccCCceeccCCceEEEcccChhHH Confidence 443333222221111111112224333332 111 111111111 1111111 1234566677888888775432 22 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhccccc-CcH-HHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQS-AEGA--HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDT 382 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~S-g~Al--~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~ 382 (456) .|++..+..+.+|+.+-|+|+..+|.... +-| .+.. .+....|.=.+.. |++.+...+ + ....+. T Consensus 275 q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~~l~P~~~~--------ie~~ln~~L-~--~~~~~~ 343 (454) T protein:vir:93 275 QTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQYYSQCLQTLIES--------IELLLDEAL-E--TGENES 343 (454) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHHHHHHHHHHHHH--------HHHHHHHhh-c--CCCCcE Confidence 37787888889999999999999974322 211 1111 1122222222222 222221111 0 122344 Q ss_pred eeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH-H-------HHHHHHHH---HHH---HHh---hh Q lcl|NC_021301. 383 VDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK-Q-------DDLDRARE---QIT---LFA---GN 445 (456) Q Consensus 383 i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~-~-------~e~~~~~e---e~~---~~~---~~ 445 (456) +++.+...+..|..+.++++.++.++|+++.--+++.+|+.|-+-- + .-.+...+ ..+ ..+ .. T Consensus 344 ~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 423 (454) T protein:vir:93 344 TEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASV 423 (454) T ss_pred EEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCC Confidence 5555666677899999999999999999999889999888653210 0 00000000 000 000 00 Q ss_pred hhhhcccccCC Q lcl|NC_021301. 
446 SVQRPQEDGSR 456 (456) Q Consensus 446 ~~~~~~~d~~~ 456 (456) ..+.+..|+++ T Consensus 424 ~~~~~~~d~~~ 434 (454) T protein:vir:93 424 PQAVAASDGNK 434 (454) T ss_pred CCCCCCCCCCC Confidence 11112234444 No 149 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=99.03 E-value=3.7e-09 Score=66.83 Aligned_cols=397 Identities=11% Similarity=0.023 Sum_probs=171.4 Q ss_pred CCCCCH-HHHHHHHHHHH--------------HHHH---HH--HHHHHHHhcccCcccccCcccchhhhhhhhhhccChH Q lcl|NC_021301. 1 MTASTP-AEWLPVLTKRI--------------DDGM---SR--VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWG 60 (456) Q Consensus 1 ~~~~t~-~~~~~~l~~~~--------------~~~~---~r--~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~ 60 (456) |-==.| --++.+-.++- .++- +. ....-....|-.... ...+.. ..-+.+.=. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-----~~al~~~~V 73 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK--LRQYKD-----IEAIRHSDI 73 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCccc--ccccch-----hhhhccHHH Confidence 221111 00111111110 0000 00 000011111110000 000000 000111212 Q ss_pred HHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh--cCh---hHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEc Q lcl|NC_021301. 61 LMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD--NRM---DSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADS 134 (456) Q Consensus 61 ~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~--n~~---~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~ 134 (456) -.+|+.+++-+..-|+.+..+........+..++.. |.. 
......+....+.+|.||+++-.+.+|++ .+..++ T Consensus 74 ~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 153 (441) T protein:vir:94 74 FTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 153 (441) T ss_pred HHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 336777777777778776433222222223444432 322 35566788889999999999999989987 589999 Q ss_pred cceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE Q lcl|NC_021301. 135 PETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP 214 (456) Q Consensus 135 p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) |..+.+..|+.. + +... .+..+.++... ...|..+.+.++.. .+ T Consensus 154 ~~~v~v~~d~~g-~-~~~~-~~~~~~~~~~~-~~~~~~~dvih~k~--------------------------------~~ 197 (441) T protein:vir:94 154 TSEIELKSDARG-R-LYYF-HQRIDSNGNNI-ERNVKFEDMLDIKF--------------------------------YS 197 (441) T ss_pred CceeEEEECCCc-c-EEEE-EEEeccCCcee-EEEEccccEEEecc--------------------------------CC Confidence 999998877542 2 2111 11111112111 12233333322210 00 Q ss_pred EEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhh-hhhhh--hccceec Q lcl|NC_021301. 215 VVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYA-SIFEA--APGALWE 290 (456) Q Consensus 215 vv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~-~~~~~--~~~~~~~ 290 (456) ...-.|.|-++.....++....+..-......-.+.|..+++- ..... .++....+ ... ..+.. ..+.+.. 
T Consensus 198 ---~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~-~~~~~-~~e~~e~~r~~~~~~~~G~~nag~~~v 272 (441) T protein:vir:94 198 ---LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKM-KGVLD-NKKARDRAREEFHKSFSGTKQAGKVVV 272 (441) T ss_pred ---CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEc-CCCCC-CHHHHHHHHHHHHHHhcCccccCccee Confidence 0111366665544443332221111111111122234333321 11110 11111111 111 11111 1245667 Q ss_pred cCCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 291 LPPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEK-GFLFKCEDRLSIAKIGLEAIL 368 (456) Q Consensus 291 ~~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~-~l~~k~~~~~~~f~~~l~~~~ 368 (456) ++.+.++.++..... ..|++..+....+|+.+-|+|+..+|...++.|.+.....+. .|.- +-..+++.+ T Consensus 273 l~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~~~tl~P--------~~~~ie~el 344 (441) T protein:vir:94 273 LDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKP--------YITCVCAEL 344 (441) T ss_pred cCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHH--------HHHHHHHHH Confidence 788888887754322 237787888899999999999999986554443222221111 1111 111222212 Q ss_pred HHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HH-----HHHHHHH Q lcl|NC_021301. 369 VKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DD-----LDRAREQ 438 (456) Q Consensus 369 ~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e-----~~~~~ee 438 (456) ...+ ........+++.+....-.|..+.+++..+++++|+++..-+++.+|+.|-+--. +- .+.. .+ T Consensus 345 n~kl--~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~-~~ 421 (441) T protein:vir:94 345 NFKF--NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV-DE 421 (441) T ss_pred hhhc--cccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc-cc Confidence 1111 1111223344444445667889999999999999999999999999886632110 00 0110 00 Q ss_pred HHHHhhhhhhhcccccCC Q lcl|NC_021301. 
439 ITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 439 ~~~~~~~~~~~~~~d~~~ 456 (456) .........+.+...|++ T Consensus 422 ~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:94 422 YQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccccccccccCCCCC Confidence 000000111112222333 No 150 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=99.03 E-value=3.7e-09 Score=66.83 Aligned_cols=397 Identities=11% Similarity=0.023 Sum_probs=171.4 Q ss_pred CCCCCH-HHHHHHHHHHH--------------HHHH---HH--HHHHHHHhcccCcccccCcccchhhhhhhhhhccChH Q lcl|NC_021301. 1 MTASTP-AEWLPVLTKRI--------------DDGM---SR--VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWG 60 (456) Q Consensus 1 ~~~~t~-~~~~~~l~~~~--------------~~~~---~r--~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~ 60 (456) |-==.| --++.+-.++- .++- +. ....-....|-.... ...+.. ..-+.+.=. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-----~~al~~~~V 73 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTK--LRQYKD-----IEAIRHSDI 73 (441) T ss_pred CccccCccccccccccccchhhhhccccccccccccccCCCcchHHHHHHhcccCccc--ccccch-----hhhhccHHH Confidence 221111 00111111110 0000 00 000011111110000 000000 000111212 Q ss_pred HHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh--cCh---hHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEc Q lcl|NC_021301. 61 LMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD--NRM---DSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADS 134 (456) Q Consensus 61 ~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~--n~~---~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~ 134 (456) -.+|+.+++-+..-|+.+..+........+..++.. |.. 
......+....+.+|.||+++-.+.+|++ .+..++ T Consensus 74 ~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 153 (441) T protein:vir:79 74 FTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 153 (441) T ss_pred HHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 336777777777778776433222222223444432 322 35566788889999999999999989987 589999 Q ss_pred cceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE Q lcl|NC_021301. 135 PETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP 214 (456) Q Consensus 135 p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) |..+.+..|+.. + +... .+..+.++... ...|..+.+.++.. .+ T Consensus 154 ~~~v~v~~d~~g-~-~~~~-~~~~~~~~~~~-~~~~~~~dvih~k~--------------------------------~~ 197 (441) T protein:vir:79 154 TSEIELKSDARG-R-LYYF-HQRIDSNGNNI-ERNVKFEDMLDIKF--------------------------------YS 197 (441) T ss_pred CceeEEEECCCc-c-EEEE-EEEeccCCcee-EEEEccccEEEecc--------------------------------CC Confidence 999998877542 2 2111 11111112111 12233333322210 00 Q ss_pred EEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhh-hhhhh--hccceec Q lcl|NC_021301. 215 VVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYA-SIFEA--APGALWE 290 (456) Q Consensus 215 vv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~-~~~~~--~~~~~~~ 290 (456) ...-.|.|-++.....++....+..-......-.+.|..+++- ..... .++....+ ... ..+.. ..+.+.. 
T Consensus 198 ---~dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~-~~~~~-~~e~~e~~r~~~~~~~~G~~nag~~~v 272 (441) T protein:vir:79 198 ---LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKM-KGVLD-NKKARDRAREEFHKSFSGTKQAGKVVV 272 (441) T ss_pred ---CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEc-CCCCC-CHHHHHHHHHHHHHHhcCccccCccee Confidence 0111366665544443332221111111111122234333321 11110 11111111 111 11111 1245667 Q ss_pred cCCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 291 LPPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEK-GFLFKCEDRLSIAKIGLEAIL 368 (456) Q Consensus 291 ~~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~-~l~~k~~~~~~~f~~~l~~~~ 368 (456) ++.+.++.++..... ..|++..+....+|+.+-|+|+..+|...++.|.+.....+. .|.- +-..+++.+ T Consensus 273 l~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~~~~tl~P--------~~~~ie~el 344 (441) T protein:vir:79 273 LDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYLSTLKP--------YITCVCAEL 344 (441) T ss_pred cCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHHHHHHH--------HHHHHHHHH Confidence 788888887754322 237787888899999999999999986554443222221111 1111 111222212 Q ss_pred HHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HH-----HHHHHHH Q lcl|NC_021301. 369 VKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DD-----LDRAREQ 438 (456) Q Consensus 369 ~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e-----~~~~~ee 438 (456) ...+ ........+++.+....-.|..+.+++..+++++|+++..-+++.+|+.|-+--. +- .+.. .+ T Consensus 345 n~kl--~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~-~~ 421 (441) T protein:vir:79 345 NFKF--NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV-DE 421 (441) T ss_pred hhhc--cccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc-cc Confidence 1111 1111223344444445667889999999999999999999999999886632110 00 0110 00 Q ss_pred HHHHhhhhhhhcccccCC Q lcl|NC_021301. 
439 ITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 439 ~~~~~~~~~~~~~~d~~~ 456 (456) .........+.+...|++ T Consensus 422 ~~~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:79 422 YQMNKSRATDKKLKGGEE 439 (441) T ss_pred cccccccccccccCCCCC Confidence 000000111112222333 No 151 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=99.02 E-value=5e-09 Score=66.11 Aligned_cols=398 Identities=11% Similarity=0.003 Sum_probs=170.8 Q ss_pred CCCCCH-HHHHHHHHHHH--------------HHHHH-----HHHHHHHHhcccCcccccCcccchhhhhhhhhhccChH Q lcl|NC_021301. 1 MTASTP-AEWLPVLTKRI--------------DDGMS-----RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWG 60 (456) Q Consensus 1 ~~~~t~-~~~~~~l~~~~--------------~~~~~-----r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~ 60 (456) |-==.| --++++-.++- +++.- ....+-....|-....-........ +.+.=. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a-------l~~~~V 73 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEA-------IRHSDI 73 (441) T ss_pred CceecCccceeccccccchhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhh-------hccHHH Confidence 211111 00111111110 00000 0000000000000000000000001 111112 Q ss_pred HHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEc Q lcl|NC_021301. 61 LMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADS 134 (456) Q Consensus 61 ~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~ 134 (456) -.+|+.+++-+..-|+.+...........+..++.. |. 
-......+...++.+|.||+++-++.+|++ .+..++ T Consensus 74 ~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 153 (441) T protein:vir:98 74 FTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRK 153 (441) T ss_pred HHHHHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEc Confidence 336777777777677776443222222234444432 32 235566788888999999999999988886 489999 Q ss_pred cceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE Q lcl|NC_021301. 135 PETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP 214 (456) Q Consensus 135 p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) |..+.+..|+.. + +... .+..+..+.... ..+..+.+.++.. ++ T Consensus 154 ~~~v~v~~~~~g-~-~~~~-~~~~~~~~~~~~-~~~~~~dviHir~--------------------------------~~ 197 (441) T protein:vir:98 154 TSEIELKLDARG-R-LYYF-HQRIDSNGNNIE-RNVKFEDMLDIKF--------------------------------YS 197 (441) T ss_pred CceeEEEECCCC-c-EEEE-EEEeccCcceee-EEEccccEEEecc--------------------------------CC Confidence 999998886542 2 2111 111111221111 2233333332210 00 Q ss_pred EEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhh-hhhh--hhccceec Q lcl|NC_021301. 215 VVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYA-SIFE--AAPGALWE 290 (456) Q Consensus 215 vv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~-~~~~--~~~~~~~~ 290 (456) ...-.|.|-+..+...++....+..-......-.+.|..+++- ..... .++....+ ... ..+. ...+.+.. 
T Consensus 198 ---~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~-~~~~~-~~e~~~~~~~~~~~~~~G~~nag~~~v 272 (441) T protein:vir:98 198 ---LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKM-KGVLD-NKKARDRAREEFHKSFSGTKQAGKVVV 272 (441) T ss_pred ---CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEe-CCCCC-CHHHHHHHHHHHHHHhcCccccCccee Confidence 0111355555544443333222211111111112233333321 11110 01111111 111 1111 11245667 Q ss_pred cCCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 291 LPPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV 369 (456) Q Consensus 291 ~~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~ 369 (456) ++.+.++.++..... ..|++..+....+|+.+-|+|+..+|...++.|.+.....+. .-+ .-+-..+++.+. T Consensus 273 l~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~~y~---~tl----~P~~~~ie~~ln 345 (441) T protein:vir:98 273 LDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANLDYL---STL----KPYITCVCAELN 345 (441) T ss_pred cCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHHHHH---HHH----HHHHHHHHHHHH Confidence 788888887754322 237787888889999999999999986555544332221111 111 111222222222 Q ss_pred HHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----H-----HHHHHHHHH Q lcl|NC_021301. 370 KALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----D-----DLDRAREQI 439 (456) Q Consensus 370 l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~-----e~~~~~ee~ 439 (456) ..+ ......+.+++.....+-.|.++.+++..++.++|+++..-+++.+|+.|-+--. + ..+.. ++. 
T Consensus 346 ~~L--~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~-~~~ 422 (441) T protein:vir:98 346 FKF--NDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELV-DEY 422 (441) T ss_pred hhc--cccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccc-ccc Confidence 111 1111223344333444667889999999999999999999999999886532110 0 00110 000 Q ss_pred HHHhhhhhhhcccccCC Q lcl|NC_021301. 440 TLFAGNSVQRPQEDGSR 456 (456) Q Consensus 440 ~~~~~~~~~~~~~d~~~ 456 (456) ........+..-..|+. T Consensus 423 q~~~~~~~~~~~kgGe~ 439 (441) T protein:vir:98 423 QMNKSRATDKKLKGGEE 439 (441) T ss_pred ccccccccccccCCCCC Confidence 00000001111112222 No 152 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=99.02 E-value=5e-09 Score=66.11 Aligned_cols=386 Identities=10% Similarity=0.007 Sum_probs=168.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCccc-ccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC---CC Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP-ELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG---SA 82 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~-~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~---~~ 82 (456) +=+.+.+.++-.+. +.. ......|-.... ..+..+. ....+.+.-...+|+.++.-+..-|+.+-. +. 
T Consensus 1 m~~~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~~g~~v~-----~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g 72 (419) T protein:vir:57 1 MFIPQFWKGRPSEN--RVN-WQVVPGGMRSSSSQAGVIIT-----PETALALSAVRACVTLLAESVAQLPCVLYRRTENG 72 (419) T ss_pred CcchhhhccCCccc--ccc-ccccccccccccccCCceec-----hHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCC Confidence 10111111100000 000 000000000000 0000000 011122333567888888888777776411 11 Q ss_pred c--ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 83 D--SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 83 d--~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) . ......+.+++.. | ....+...+....+.+|.||+++-.+.+|.+ .+..++|..+.+..++.. . T Consensus 73 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~g-~------ 145 (419) T protein:vir:57 73 GREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPDG-M------ 145 (419) T ss_pred ceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCCc-e------ Confidence 1 1112224555432 2 2345667788899999999999999998986 588889988877654331 1 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDII 234 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~li 234 (456) .+|.. .+.+. ++..+.++++. . + ......|.|-+......+ T Consensus 146 ~~y~~-~~~~~---~~~~~~vih~r-------------------------------~-~---~~d~~~G~s~i~~~~~~i 186 (419) T protein:vir:57 146 PYYDI-PSIGE---ILPMRMVHHIK-------------------------------S-F---SLDGYIGTSPIQTNPDVL 186 (419) T ss_pred EEEEE-cCCce---EEchhhEEEec-------------------------------C-c---CCCCcccccHHHHHHHHH Confidence 11211 11111 12222222211 0 0 011124777666555544 Q ss_pred HHHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchhh-hhhhhhh------hccceeccCCCceeEeecccc Q lcl|NC_021301. 
235 NRINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAID-YASIFEA------APGALWELPPGVDIWESQTND 304 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~~-~~~~~~~------~~~~~~~~~~d~~~~~~~~~~ 304 (456) +....+.. -...++ +.|-.+++.-........ ..... ....+.. ..+.+..++.+.++.++.... T Consensus 187 ~~~~~~~~---~~~~~f~ng~~p~gil~~~~~~~~~~~--~e~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~ 261 (419) T protein:vir:57 187 GLGIAVEQ---HAAQVFARGTTMSGVIERPFEAKAIAS--QAAVDAILAKWTERYGGVRNAFSVGMLQEGMTYKQLSQDN 261 (419) T ss_pred HHHHHHHH---HHHHHHHccCCccEEEEecCcCCcccC--HHHHHHHHHHHHHHhccccccccceecCCCceEEEcCCCh Confidence 43332211 122222 223333321110000011 01111 1111111 234566778888988876432 Q ss_pred h-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCcccc Q lcl|NC_021301. 305 F-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDT 382 (456) Q Consensus 305 ~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~ 382 (456) . ..|++..+....+|+.+-|+|+..+|.... .++..++.....+...+ -.-+-..+++.+.. ++. ......+. T Consensus 262 ~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~t~sn~e~~~~~f~~~~---l~P~~~~ie~~l~~~ll~-~~~~~~~~ 336 (419) T protein:vir:57 262 EKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQK-STNNNIEHQGLQYVIYT---MLAILKRHESAMMRDLLL-PSERRDFY 336 (419) T ss_pred hhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CccccHHHHHHHHHHHH---HHHHHHHHHHHHHhhccC-ccccCCeE Confidence 2 237787888889999999999999974321 11111111111111100 01111122221211 111 11112233 Q ss_pred eeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH--------HHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 383 VDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD--------DLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 383 i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~--------e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) +++.+...+..|..+.+++..+++++|+++.--+++.+|+.|.+--.. ..+...+......+...+...-.. 
T Consensus 337 i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~ 416 (419) T protein:vir:57 337 IEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTPLNMVDSKALTGIGKATPQQLKDIEAILC 416 (419) T ss_pred EEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccccccccccccCCCcccCcchhhhhh Confidence 444445556779999999999999999999999999999876422110 011111111111111111111122 Q ss_pred CC Q lcl|NC_021301. 455 SR 456 (456) Q Consensus 455 ~~ 456 (456) +| T Consensus 417 ~~ 418 (419) T protein:vir:57 417 TR 418 (419) T ss_pred cc Confidence 22 No 153 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=99.02 E-value=5.1e-09 Score=66.05 Aligned_cols=387 Identities=13% Similarity=0.034 Sum_probs=172.6 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcc-- Q lcl|NC_021301. 8 EWLPVLTKRIDD-GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADS-- 84 (456) Q Consensus 8 ~~~~~l~~~~~~-~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~-- 84 (456) =++..+-++... .......+...+-+.... ..+..+.. ..-+.+.....+|+.+++-+..-|+.+-...+. T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~~-~~g~~v~~-----~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~~~ 74 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYDT-YTGKRISS-----QRAMRLTAVYSCVRVLAESVGMLPCSLYKISGTLK 74 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCccc-ccCceech-----hhhhccHHHHHHHHHHHHhhhhCceEEEEecCCcc Confidence 012222111110 000011111111111100 01111100 111223445667888888887777664321111 Q ss_pred --cHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 
85 --DLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 85 --~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) .....+.+++.. | ........+....+.+|.||+++..+ .|++ .+..++|..+.+..+.... + +.. T Consensus 75 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~~--~---~y~ 148 (413) T protein:vir:48 75 TRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQWQ--P---VYQ 148 (413) T ss_pred eeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCce--E---EEE Confidence 111224444432 2 23456777888999999999998876 5665 4788899988887764421 1 112 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINR 236 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa 236 (456) +...+|... .|..+.++++.. . ......|.|-+......++. T Consensus 149 ~~~~~g~~~---~~~~~evih~~~--------------------------~---------~~d~~~G~s~i~~~~~~i~~ 190 (413) T protein:vir:48 149 VTFPDGSVD---VLTQDEIWHVRT--------------------------L---------TLDGLVGLNPIAYAREAISL 190 (413) T ss_pred EEecCceEE---EEccccEEEecC--------------------------c---------CCCCcccccHHHHHHHHHHH Confidence 222333221 233333333210 0 00112466666554444443 Q ss_pred HHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhh-hhh--hhccceeccCCCceeEeecccch-HHHHHH Q lcl|NC_021301. 237 INRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYAS-IFE--AAPGALWELPPGVDIWESQTNDF-TPMLSA 311 (456) Q Consensus 237 ~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~-~~~--~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~ 311 (456) ...+..-......-.+.|..+++.-. . ..++.-..+ .... ... ...+.++.++.+.++.++..... .-|++. 
T Consensus 191 ~~~~~~~~~~~~~ng~~p~gil~~~~-~--~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~ 267 (413) T protein:vir:48 191 AAATEEHGARLFGNGAVTSGVLRTEQ-K--LTPDAYERLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLET 267 (413) T ss_pred HHHHHHHHHHHHhccCCcceEEEeCC-C--CCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEeccCChhHHHHHHH Confidence 33222111111111223434443211 1 111111111 1111 111 12345667788888888764322 237787 Q ss_pred HHHHHHHHHhhcCCChhhhcccc-cC-cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEe Q lcl|NC_021301. 312 IKEHIRQLSSATKTPLPMLMPDS-AN-QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSF 387 (456) Q Consensus 312 l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f 387 (456) .+....+|+.+-|+|+..+|... ++ ++.+... +....+.- +-..+++.+..-+-.......+.+++.+ T Consensus 268 ~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~f~~~~i~P--------~~~~ie~~l~~~L~~~~~~~~~~~~fd~ 339 (413) T protein:vir:48 268 RKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLGFINYSLVP--------YLTRIEQRINTGLVRESKQGKFYAKFNA 339 (413) T ss_pred HHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHHHHHHHHHH--------HHHHHHHHHHhhccCccccCCeEEEEec Confidence 88889999999999999997532 12 2222222 22212221 1122222222111111111223345545 Q ss_pred cCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHH-HHHHHHHHHHhhhhhhhccc--ccCC Q lcl|NC_021301. 388 ESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDL-DRAREQITLFAGNSVQRPQE--DGSR 456 (456) Q Consensus 388 ~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~-~~~~ee~~~~~~~~~~~~~~--d~~~ 456 (456) ....-.|..+.++++.+++++|+++.--+++.+|+.|-+--..-. ..-...... 
.......+.+ +.++ T Consensus 340 ~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~-~~~~~~~~~~~~~~~~ 410 (413) T protein:vir:48 340 GALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPMNMTTSPS-AGDDNGKKKESGDADK 410 (413) T ss_pred hhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeecccccccccc-ccccCCCCCCCCCccc Confidence 555667899999999999999999998889999987642110000 000000000 0111111111 2222 No 154 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.01 E-value=5.6e-09 Score=65.83 Aligned_cols=399 Identities=10% Similarity=0.033 Sum_probs=164.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |--.+-.+--..++++. . ..-...-+ |.+++..-+.. ...+.... -...+.+.+|+..+.-+.+-|+.+.. T Consensus 49 ~~~~~~~d~~~~~~~r~----g-~~~~~~~~-g~~~~~epp~d-~~~l~~l~--~~np~V~~aI~iia~~ia~l~~~i~~ 119 (648) T protein:vir:79 49 GGGSAKRDPKMSLVKRI----G-LAIMDGGG-GGRDFEEPEFD-FNEITSAY--NTEGYVRQAVDKYIEMMFKADWDFVS 119 (648) T ss_pred ccccccccchhHHHHHh----H-HHHHhhcC-CccccccCCcC-HHHHHHHH--hcChHHHHHHHHHHHHHhhCcceEEe Confidence 00111111111111110 0 11111112 33333221111 12232221 13567888999999998888876544 Q ss_pred CCcccHHH--HHHHHHHhc---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCceE----------------EEEEccceeE Q lcl|NC_021301. 81 SADSDLAL--RARRIWRDN---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT----------------ITADSPETMV 139 (456) Q Consensus 81 ~~d~~~~~--~l~~~~~~n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~----------------i~~~~p~~~~ 139 (456) ..+..... ....+..-| ....+...+..+.+.+|.||+.+-.+.+|.+- +..++|..+. 
T Consensus 120 ~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~ 199 (648) T protein:vir:79 120 KNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMK 199 (648) T ss_pred cCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhhhccccceeeeEeecCceeE Confidence 33322111 111111222 33466777888999999999999888887431 1223333333 Q ss_pred EEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc Q lcl|NC_021301. 140 VSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ 219 (456) Q Consensus 140 ~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~ 219 (456) +..++.. . .. .|.+...+.... ..|.++.+.++. ..+ ... T Consensus 200 v~~d~~g-~-~~---~Y~y~~~g~~~~-~~~~~~dIIHik-------------------------------~~~---~~d 239 (648) T protein:vir:79 200 VKRDKFG-M-IK---GWQQEQEGQDKP-QKFKPEDIVHIY-------------------------------YKR---EKG 239 (648) T ss_pred EEEcCCC-c-ee---eeEEEecCCcee-EEecCccEEEEc-------------------------------cCC---CCC Confidence 3322111 0 00 000011111100 111111111110 000 011 Q ss_pred CCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee-- Q lcl|NC_021301. 220 NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI-- 297 (456) Q Consensus 220 n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~-- 297 (456) ..+|.|-+......|+....+.........-.+.|-.+++- ..+....+.... ....+..........+...+. 
T Consensus 240 ~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~-~~~~~~~e~~k~---~~e~~~~~~~~~~i~gg~v~~~~ 315 (648) T protein:vir:79 240 RAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKV-GLEQEGFGAEEG---EVDLVRGEVENMDVEGGMVTTER 315 (648) T ss_pred CceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEe-CCCccchHHHHH---HHHHHHHhcccccccccccccce Confidence 23577776655554443332222222222223344433321 111111111111 111122222222222222221 Q ss_pred Eeecc----cchHHHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_021301. 298 WESQT----NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI-LVK 370 (456) Q Consensus 298 ~~~~~----~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~-~~l 370 (456) ..++. .++ .|++..+....+|+.+-++|+..+|... +| +.+++....+. ..+...+..+...+... .+. T Consensus 316 ~~i~~~~s~~dl-qfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~~~~---~~i~~l~~~i~~~le~~~~~~ 391 (648) T protein:vir:79 316 VNISSIASNQII-DAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSSDFK---DRIKALQKVMATFINEFMVKE 391 (648) T ss_pred eeccccCCHHHH-HHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 11211 122 3777778889999999999999997421 22 23333332222 22222223333333221 111 Q ss_pred HHH---hcC-CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH--HHHH--H--HHHHHHH Q lcl|NC_021301. 371 ALQ---IEG-ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK--QDDL--D--RAREQIT 440 (456) Q Consensus 371 ~~~---~~~-~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~--~~e~--~--~~~ee~~ 440 (456) .+. +.. ...++.+++.|++....+....++.+.++.++|++|...+++.+|+.|-+.. .... + ....+.. 
T Consensus 392 ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~ 471 (648) T protein:vir:79 392 ILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATA 471 (648) T ss_pred HhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccc Confidence 111 111 1223457788888888888889999999999999999999999998763211 0000 0 0000000 Q ss_pred HHhhhhhhh----cccccC--C Q lcl|NC_021301. 441 LFAGNSVQR----PQEDGS--R 456 (456) Q Consensus 441 ~~~~~~~~~----~~~d~~--~ 456 (456) .......+. ...+++ + T Consensus 472 ~~~~~~~~~~~~~~~a~~eg~~ 493 (648) T protein:vir:79 472 LAALAPTPAGGSSASASGDKKK 493 (648) T ss_pred cccCCCCCCCCCCCCccccccc Confidence 000000000 000000 0 No 155 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=99.01 E-value=2e-09 Score=68.30 Aligned_cols=387 Identities=11% Similarity=0.015 Sum_probs=166.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMS---RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~---r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |- +..+.+.-.+... -+..+-....+... ..+..+.++ .-+.+.-...+|+.+++-+..-||. T Consensus 1 m~-------~~~~~~~~~~~~s~~~~w~~~~~~~~~~~~--~~g~~vt~~-----~al~~~~v~~~i~~Ia~~iA~lp~~ 66 (421) T protein:vir:10 1 MF-------IPQMFEGKKRSVSGGGFWEAMLGGVRSSHS--KAGVMITPE-----TALALSAVRACVTLLAESVAQLPVE 66 (421) T ss_pred CC-------CcchhcccccccCcchhhHHHhhhhccCcc--cCCceechH-----HhhccHHHHHHHHHHHHhhccCceE Confidence 10 0000000000000 00001111111100 000011110 0122333455788888887777776 Q ss_pred cCC-CCcc----cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_021301. 
78 VGG-SADS----DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQ 146 (456) Q Consensus 78 ~~~-~~d~----~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~ 146 (456) +-. ..+. .....+..++.. |. ...+...+....+.+|.||+++-++.+|.+ .+..++|..+.+..++. T Consensus 67 ~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~- 145 (421) T protein:vir:10 67 LYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPD- 145 (421) T ss_pred EEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCC- Confidence 421 1111 111124444432 32 345566788899999999999999999987 47888888888766533 Q ss_pred CceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCc Q lcl|NC_021301. 147 PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGE 226 (456) Q Consensus 147 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~ 226 (456) +. +.|.....|. .+..+.++++. . + ..+...|.|- T Consensus 146 g~-----~~y~~~~~g~-----~~~~~eiih~~-------------------------------~-~---~~d~~~G~sp 180 (421) T protein:vir:10 146 GM-----PYYEIPEIGE-----TLPMRMMHHVK-------------------------------V-F---SLDGYIGSSP 180 (421) T ss_pred ce-----EEEEEcCCCc-----EEchhhEEEec-------------------------------C-c---CCCCcccccH Confidence 11 1111111111 11112221110 0 0 0111236666 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCc-ccccccccchh-hhh-hhhh--hhccceeccCCCceeEeec Q lcl|NC_021301. 227 VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHG-LPKVDENGNAI-DYA-SIFE--AAPGALWELPPGVDIWESQ 301 (456) Q Consensus 227 ~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~-~~~~~~~~~~~-~~~-~~~~--~~~~~~~~~~~d~~~~~~~ 301 (456) ++.+...++....+..-......-.+.|..+++--... ....++.-..+ ... .... ...+.+..++.+.++.++. 
T Consensus 181 i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~ 260 (421) T protein:vir:10 181 IQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMS 260 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCCCceEEecC Confidence 65444444332211111111111122343344321100 00011111111 000 1111 1224566778888988886 Q ss_pred ccchH-HHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021301. 302 TNDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEG--AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG 376 (456) Q Consensus 302 ~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~A--l~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 376 (456) ....+ .|++..+..+.+|+.+-|+|+..+|... ++ .+.+. +.+....|.- +-..+++.+...+-... T Consensus 261 ~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f~~~tl~P--------~~~~ie~~ln~kL~~~~ 332 (421) T protein:vir:10 261 QDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEHQGLQFVMYTLLA--------WLKRHEGALQRDLLLPS 332 (421) T ss_pred CChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHHHHHHHHHHHHHH--------HHHHHHHHHhhhccCcc Confidence 43322 3778788889999999999999987432 11 12111 1122222222 11222222222111111 Q ss_pred CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHH---HHHHHHHHHHHhhhhhhhcccc Q lcl|NC_021301. 377 ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDD---LDRAREQITLFAGNSVQRPQED 453 (456) Q Consensus 377 ~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e---~~~~~ee~~~~~~~~~~~~~~d 453 (456) ......+++.+......|..+.++++.+++++|+++.--+++.+|+.|-+--..- ..-. ...+...+...+.+++. T Consensus 333 ~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~-~~~~~~~~~~~~~~~~~ 411 (421) T protein:vir:10 333 ERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLTPLNMV-DSAQIIPGDKKPTAQQM 411 (421) T ss_pred ccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccc-cccccccCCCCcccccC Confidence 1112234444444556789999999999999999999999999988754211100 0000 00011111112222222 Q ss_pred cCC Q lcl|NC_021301. 
454 GSR 456 (456) Q Consensus 454 ~~~ 456 (456) ++. T Consensus 412 ~e~ 414 (421) T protein:vir:10 412 AEI 414 (421) T ss_pred ccc Confidence 222 No 156 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=99.00 E-value=4.6e-09 Score=66.30 Aligned_cols=393 Identities=9% Similarity=0.017 Sum_probs=175.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccc-ccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP-ELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~-~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) |+.+. ++.++....-... +..-+.+-.+.. ..+........ +.-..+.-...+|+.+++-+..-|+.+- T Consensus 1 ~~~~~---~~~~~k~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~v~~--~~a~~~~~v~~~i~~Ia~~ia~lp~~~~ 70 (409) T protein:vir:94 1 MAKEN---IVTRIKKKLIDNW-----IDQSASKLYDFSPWKNKSFWGVIN--NTLETNETIFSAITKLSNSMASLPLKMY 70 (409) T ss_pred Ccccc---cchhhhhHHhhhh-----hcCCcccccccccccCccccccch--hhhhccHHHHHHHHHHHHhhhhCceeEe Confidence 55444 3333333221100 000011111100 00111000000 1112234456678888888877787663 Q ss_pred CCCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEE Q lcl|NC_021301. 80 GSADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSA 153 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~ 153 (456) ...+ .....+.+++.. |. -..+...+...++.+|.||+++.++.+|.+ .+..++|..+.+..++.... +. 
T Consensus 71 ~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~-~~-- 146 (409) T protein:vir:94 71 EDYK-VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE-LY-- 146 (409) T ss_pred eccc-ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE-EE-- Confidence 2221 122234444432 32 234556788888999999999999989986 58889999998887755332 11 Q ss_pred EEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHH Q lcl|NC_021301. 154 MRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDI 233 (456) Q Consensus 154 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~l 233 (456) +.+...++.. . .+..+.+.++.. ..+ .+.-.|.|.+...... T Consensus 147 -y~~~~~~g~~--~-~~~~~dvih~r~-------------------------------~~~---~~~~~G~s~l~~~~~~ 188 (409) T protein:vir:94 147 -YSIHAATGNK--L-IVHNMDMLHFKH-------------------------------IVA---SNMVQGISPIDVLKNT 188 (409) T ss_pred -EEEEcCCceE--E-EEccccEEEecC-------------------------------CCC---CCccccccHHHHHHHH Confidence 1122222221 1 123333333210 000 0112466666544444 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc-hhhhhhhhhhhccceeccCCCceeEeecccch-HHHHHH Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN-AIDYASIFEAAPGALWELPPGVDIWESQTNDF-TPMLSA 311 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~ 311 (456) ++....+.. . ....+..+-.++.-.+.. ..++.-. ........-...+.+..++.+.++.+++.... ..+++. 
T Consensus 189 i~~~~~~~~-~--~~~~~~~~~~~i~~~~~~--l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~ 263 (409) T protein:vir:94 189 TDFDNAVRT-F--NLTEMQKPDSFMLKYGSN--VGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVAS 263 (409) T ss_pred HHHHHHHHH-H--HHHhcCCCCeeEEecCCC--CCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHH Confidence 443322211 1 112222222222111110 1111111 11111111223455667788889888764322 247777 Q ss_pred HHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCcccceeEEecCC Q lcl|NC_021301. 312 IKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDTVDVSFESP 390 (456) Q Consensus 312 l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~i~v~f~~~ 390 (456) .+....+|+.+-++|+..+|.... ++...++.....+...| -.-+-..+++.+.. ++.-.+......+++....- T Consensus 264 ~~~~~~~Ia~~fgVPp~~lg~~~~-~~~sn~e~~~~~f~~~~---l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~l 339 (409) T protein:vir:94 264 ENLTRERVANVFQLPSVFLNARSN-TNFAKNEELNRFYLQHT---LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSY 339 (409) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHHHHHHHHH---HHHHHHHHHHHHHHhhCCcccccCcceEEeechhh Confidence 777889999999999999975322 12111222211111111 01111222222211 11111111223334333344 Q ss_pred CCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH-----HHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 391 DRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD-----DLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 391 ~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~-----e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +-.|..+.++++.+++++|+++.-.+++.+|+.|-+--.. ....+.. ....+...+-.+++++. 
T Consensus 340 l~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~--~~~~~~~~kGG~~n~~e 408 (409) T protein:vir:94 340 LRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDT--PLELRKSLKGGDKNVNE 408 (409) T ss_pred hccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEeeccccccccc--chhhcccccCCCCCcCC Confidence 5678899999999999999999988899998876431100 0000000 00000111111222222 No 157 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=99.00 E-value=1.8e-09 Score=68.55 Aligned_cols=333 Identities=10% Similarity=0.021 Sum_probs=152.2 Q ss_pred ccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh--c---ChhHHHHHHHHHHh Q lcl|NC_021301. 37 LPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD--N---RMDSVCKQWVKYGL 111 (456) Q Consensus 37 i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~ 111 (456) |- .-|+.+... ++.....+.+++.. | .-......++...+ T Consensus 1 ia----------------------------------~lp~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~ 45 (348) T protein:vir:93 1 MA----------------------------------SLPLKMYED-YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRN 45 (348) T ss_pred Cc----------------------------------ccceEeEec-CcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHh Confidence 22 223332111 11111223333321 2 12344566778888 Q ss_pred hCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccc Q lcl|NC_021301. 112 DFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRR 190 (456) Q Consensus 112 ~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (456) .+|.||+++-++..|.+ .+..++|..+.++.++.... +. +.+...++.. ..|..+.+.++.. 
T Consensus 46 l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~-~~---y~~~~~~g~~---~~~~~~eiih~r~---------- 108 (348) T protein:vir:93 46 EKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE-LY---YSIHAATGNK---LIVHNMDMLHFKH---------- 108 (348) T ss_pred hcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCCCcE-EE---EEEEcCCCeE---EEEccccEEEecC---------- Confidence 99999999999999987 58888998888877654331 11 1122222221 1233333332210 Q ss_pred eeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc Q lcl|NC_021301. 191 RLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE 270 (456) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 270 (456) ..| ..--.|.|-++.+...++..+.+. . .....+..+-.++.-.+. ...++ T Consensus 109 ---------------------~~~---~~~~~G~s~~~~~~~~i~~~~~~~-~--~~~~~~~~~~~~i~~~~~--~l~~e 159 (348) T protein:vir:93 109 ---------------------IVA---SNMVQGISPIDVLKNTTDFDNAVR-T--FNLTEMQKPDSFMLKYGS--NVSTE 159 (348) T ss_pred ---------------------CCC---CCceeeccHHHHHHHHHHHHHHHH-H--HHHHhcCCCceeEEecCC--CCCHH Confidence 000 011136665554444443322111 1 111222222122211111 11111 Q ss_pred ccc-hhhhhhhhhhhccceeccCCCceeEeecccchH-HHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHH Q lcl|NC_021301. 271 NGN-AIDYASIFEAAPGALWELPPGVDIWESQTNDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKG 348 (456) Q Consensus 271 ~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~ 348 (456) .-. ............+.+..++.+.++.+++.+..+ .|++..+....+|+.+-|+|+..+|.... ++...++..... 
T Consensus 160 ~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~-~~~~~~e~~~~~ 238 (348) T protein:vir:93 160 KRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSN-TNFAKNEELNRF 238 (348) T ss_pred HHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHHHH Confidence 111 111111112234566777888898888643322 47777788899999999999999975321 111122222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHH Q lcl|NC_021301. 349 FLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQI 427 (456) Q Consensus 349 l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~ 427 (456) +...| -.-+-..+++.+.. ++--.+......+++.+..-+-.|..+.++++.+++++|+++.--+++.+|+.|-+- T Consensus 239 ~~~~~---l~P~~~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~g 315 (348) T protein:vir:93 239 YLQHT---LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEG 315 (348) T ss_pred HHHHH---HHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCC Confidence 21111 01111122222221 111111112233454455556678999999999999999999999999999876421 Q ss_pred HH-----HHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 428 KQ-----DDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 428 ~~-----~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) -. .....+.. ....+...+-.+++++. 
T Consensus 316 gD~~~~~~n~~~~~~--~~~~~~~~~gg~~n~~~ 347 (348) T protein:vir:93 316 GDKPLISGDLYPIDT--PLELRKSLKGGDKNVNE 347 (348) T ss_pred cCeEeeccccccccc--chhhcccccCCCCCcCC Confidence 10 00000000 00000111111112222 No 158 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=98.97 E-value=6.4e-09 Score=65.51 Aligned_cols=393 Identities=12% Similarity=0.039 Sum_probs=178.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccc------cCcccchhh--h--hhhhhhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPE------LTRNTSAAW--R--SFQREARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~------~~~~~~~~~--~--~~~~k~~~n~~~~iVd~~a~~ 70 (456) |- .|.-..+ ...+..-+.+++..+.|...... -+.....-. . ....-+.+.-...+|+.+++- T Consensus 1 ~~--~~~~~~~-----~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:18 1 ME--EPKYTID-----LRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDSSINDERILQISTVWRCVSLISTL 73 (424) T ss_pred CC--CCccccc-----cCCCCchHHHHHhhccccccccccchhhccccccccccccccccHHHhhccHHHHHHHHHHHHh Confidence 21 1111111 11122333444445544321100 000000000 0 001112223345678888888 Q ss_pred hccCCeecCC-CCcc-----cHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEcccee Q lcl|NC_021301. 71 IIPNGITVGG-SADS-----DLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETM 138 (456) Q Consensus 71 l~~~~~~~~~-~~d~-----~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~ 138 (456) +..-|+.+-. ..+. .....+.+++.. 
| .-..+...+...++.+|.||+++-++.+|++ .+..++|..+ T Consensus 74 iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v 153 (424) T protein:vir:18 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) T ss_pred hccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcce Confidence 8777766421 1111 111224444432 3 2234566788899999999999988888886 5788889888 Q ss_pred EEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc Q lcl|NC_021301. 139 VVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY 218 (456) Q Consensus 139 ~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~ 218 (456) .+..+.. .+ .|....++.. ..|..+.+.++.. .. . T Consensus 154 ~v~~~~~---~~----~y~~~~~g~~---~~~~~~eVihir~--------------------------~~---------~ 188 (424) T protein:vir:18 154 DVKLVGK---KV----VYRYQRDSEY---ADFSQKEIFHLKG--------------------------FG---------F 188 (424) T ss_pred EEEEcCC---eE----EEEEEeCCeE---EEeccccEEEecC--------------------------cC---------C Confidence 7665422 11 1111222211 1233333332210 00 0 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchh-hhhhhhhh--hccceeccC Q lcl|NC_021301. 219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAI-DYASIFEA--APGALWELP 292 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~ 292 (456) ....|.|-+..... ++...++-..-...++ +.|..+++..+. ...++....+ ........ 
..+.+..++ T Consensus 189 dg~~G~spi~~~~~---~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~--~l~~e~~~~~~~~~~~~~~~~nag~~~vl~ 263 (424) T protein:vir:18 189 TGLVGLSPIAFACK---SAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQVEENFKEIAGGPVKKRLWILE 263 (424) T ss_pred CCcccccHHHHHHH---HHHHHHHHHHHHHHHHhccCCcceEEEeCCc--CCCHHHHHHHHHHHHHHhCCcccCCceecc Confidence 11235555543332 3322222111122222 233333332111 1111211111 11111111 124466777 Q ss_pred CCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccC-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 293 PGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK 370 (456) Q Consensus 293 ~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l 370 (456) .+.++.++.-.. -..|++..+....+|+.+-|+|+..+|....+ .+|..++.....+...+- .-+-..+++.+.. T Consensus 264 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~tl---~P~~~~ie~~ln~ 340 (424) T protein:vir:18 264 AGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYTL---QPYISRWENSIQR 340 (424) T ss_pred CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHHH---HHHHHHHHHHHHh Confidence 888888775332 23377777888899999999999999753322 222333333222222110 1111122222221 Q ss_pred HHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH-HHHHHHHHHHHHhhhhhhh Q lcl|NC_021301. 371 ALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD-DLDRAREQITLFAGNSVQR 449 (456) Q Consensus 371 ~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~-e~~~~~ee~~~~~~~~~~~ 449 (456) -+--......+.+++.+...+..|.++.++++.++.++|+++.--+++.+|+.|-+--.. -...--..++.. ...+. 
T Consensus 341 ~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~n~~~l~~~--~~~~~ 418 (424) T protein:vir:18 341 WLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDVAMRQAQYVPITDL--GTNKE 418 (424) T ss_pred hcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchhhh--hccCC Confidence 111011112233555555667789999999999999999999999999999876421110 000000011111 12345 Q ss_pred cccccC Q lcl|NC_021301. 450 PQEDGS 455 (456) Q Consensus 450 ~~~d~~ 455 (456) +.++|+ T Consensus 419 ~~~n~a 424 (424) T protein:vir:18 419 PRNNGA 424 (424) T ss_pred ccccCC Confidence 566666 No 159 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=98.97 E-value=7.9e-09 Score=65.00 Aligned_cols=377 Identities=11% Similarity=0.028 Sum_probs=162.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccH Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~ 86 (456) +-+.+.+...-..........-....+ .+ .+.......-....-+.+.-...+|+.+++-+..-|+++.... . T Consensus 1 M~~f~~~~~~~~~~~~~~~~~~~~~~~--~~--~~~~~~~~~v~~~~~~~~~~v~~~i~~ia~~ia~~p~~~~~~~---~ 73 (386) T protein:vir:48 1 MPIFNITNLATESPPISQGGFFDITDP--DF--LSTLNGSEWVSAESALRNSDLFSIINQLSNDLATVKLTASRKQ---L 73 (386) T ss_pred Ccccccccccccccccccccccccccc--hh--cccccCCceechhhhhcchHHHHHHHHHHHhhccCceeeccch---h Confidence 111111110000000000000000000 00 0000000000000012223345677888887777788764321 1 Q ss_pred HHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCC-ce Q lcl|NC_021301. 
87 ALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDA-ES 164 (456) Q Consensus 87 ~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~-~~ 164 (456) ...+.+-............+..+.+.+|.||+++-++.+|.+ .+..++|..+.+..++.... + .|....++ .. T Consensus 74 ~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~-~----~y~~~~~~~~~ 148 (386) T protein:vir:48 74 QGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDG-I----YYNITFDDPRI 148 (386) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEcCCCce-E----EEEEEecCccc Confidence 111111111122345667788899999999999999988886 58889999988877644321 1 11111111 11 Q ss_pred EEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHH Q lcl|NC_021301. 165 DFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQL 244 (456) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~ 244 (456) .....+..+.++++.. ++.. ....|.|.+......+.....+..-. T Consensus 149 ~~~~~~~~~evih~~~--------------------------~~~~--------~~~~G~s~i~~~~~~i~~~~~~~~~~ 194 (386) T protein:vir:48 149 PPKQHVPQGDVLHFKL--------------------------LSVD--------GGLTSVSPLMALSRELNIQKASDKLT 194 (386) T ss_pred cceeEecCccEEEecC--------------------------CCCC--------CceeeccHHHHHHHHHHHHHHHHHHH Confidence 1112233333332210 0000 00236666665444333332222211 Q ss_pred HHHHHHhhchhhhhhcCCCccccccccc-chhhhhhhhhhhccceeccCCCceeEeecccch-HHHHHHHHHHHHHHHhh Q lcl|NC_021301. 245 LSTMAIQAFRQRALKSAGHGLPKVDENG-NAIDYASIFEAAPGALWELPPGVDIWESQTNDF-TPMLSAIKEHIRQLSSA 322 (456) Q Consensus 245 ~~~~~~~~~~~~~i~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~ 322 (456) .....-.+.|..+++--. . ..++.. .............+.++.++.+.++.++..... 
..|++..+....+|+.+ T Consensus 195 ~~~~~ng~~~~~ii~~~~-~--~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~ 271 (386) T protein:vir:48 195 LNSLKNALNANGILKIKG-G--GLLDFKTKLSRSRQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKV 271 (386) T ss_pred HHHHhccCCcceEEEeCC-C--CCHHHHHHHHHHHHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHH Confidence 111122233444443211 1 111111 111111112223455677888889888764332 23788888889999999 Q ss_pred cCCChhhhcccccCcHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHH Q lcl|NC_021301. 323 TKTPLPMLMPDSANQSAE--GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYA 400 (456) Q Consensus 323 ~~~p~~~~~~~~~N~Sg~--Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad 400 (456) -|+|+..+|..+++++.+ .+.+....|.-.+...+..+...|-. .+++.+....-.+....+. T Consensus 272 fgVPp~~lg~~~~~~~~e~~~~~~~~~~l~P~~~~ie~~l~~~l~~---------------~~~~~~~~~~~~d~~~~~~ 336 (386) T protein:vir:48 272 YGIPENVVGGQGDQQSSLEMSLDLYNKAVSRYLRPFLSELSQKLSC---------------DVDADILPAVDPTGSNSVS 336 (386) T ss_pred hCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------hhhcchhhhhccChHHHHH Confidence 999999998644433222 23333333333333333322222210 1111122222245556677 Q ss_pred HHHHHHhcCCCcHHHHHHhCCC---ChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 401 AASLAKAAGESWASIRRNILNY---NADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 401 ~~~kl~~~g~~s~~t~~~~~~~---~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .+.++..+|+++.-.+++.+|. .+.++...+ . 
.........+.||+- T Consensus 337 ~~~~l~~~g~~t~nE~r~~lg~~~~~~~~~~~~~------~---~~~~~~~gGd~~~~~ 386 (386) T protein:vir:48 337 RINSMVKSGTLAQNQGLYILQQAEILPKELPEGE------N---PNKTTLKGGEINGED 386 (386) T ss_pred HHHHHHhCCCcCHHHHHHHhhcCCCCCccchhhc------C---CCCCccCCCCCCCCC Confidence 7888899999999888887654 333322111 0 000000011111111 No 160 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=98.97 E-value=4.2e-09 Score=66.53 Aligned_cols=389 Identities=13% Similarity=0.063 Sum_probs=179.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccc------Ccccchhh--h--hhhhhhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPEL------TRNTSAAW--R--SFQREARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~------~~~~~~~~--~--~~~~k~~~n~~~~iVd~~a~~ 70 (456) |..-+..-| +.++...+.|....... +....... . ....-+.+.-...+|+.+++- T Consensus 8 ~~~~~~~g~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~al~~~~v~~cv~~Ia~~ 73 (424) T protein:vir:18 8 IDLRTNNGW--------------WARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDSSINDERILQISTVWRCVSLISTL 73 (424) T ss_pred EeecCCCch--------------HHHHHhhhcccccccccccccccccccccccccccccHHHhhccHHHHHHHHHHHHh Confidence 333333333 33444444332211000 00000000 0 001112233345578888888 Q ss_pred hccCCeecCC-CCcc-----cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEcccee Q lcl|NC_021301. 71 IIPNGITVGG-SADS-----DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETM 138 (456) Q Consensus 71 l~~~~~~~~~-~~d~-----~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~ 138 (456) +..-|+.+-. ..+. .....+.+++.. |. 
-..+...+....+.+|.||+++-.+.+|++ .+..++|..+ T Consensus 74 iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~V 153 (424) T protein:vir:18 74 TACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSANM 153 (424) T ss_pred hccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcce Confidence 8777776521 1111 011224444432 32 234566688899999999999989989986 5888899988 Q ss_pred EEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc Q lcl|NC_021301. 139 VVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY 218 (456) Q Consensus 139 ~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~ 218 (456) .+..+.. .+ .|....++.. ..|..+.+.++.. . .. T Consensus 154 ~v~~~~~---~~----~y~~~~~g~~---~~~~~~eIih~r~--------------------------~---------~~ 188 (424) T protein:vir:18 154 DVKLVGK---KV----VYRYQRDSEY---ADFSQKEIFHLKG--------------------------F---------GF 188 (424) T ss_pred EEEEcCC---eE----EEEEEeCCeE---EEeccccEEEecC--------------------------c---------CC Confidence 7765432 11 1212223221 1233333333210 0 00 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhh--hhccceeccCCCc Q lcl|NC_021301. 219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWELPPGV 295 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~d~ 295 (456) ....|.|-++.....++....+..-..+...-.+.|-.+++..+. ...++....+ ....... ...+.+..++.+. 
T Consensus 189 dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~--~l~~e~~~~~~~~~~~~~~g~nag~~~vl~~g~ 266 (424) T protein:vir:18 189 TGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEK--VLTEQQRSQVEENFKEIAGGPVKKRLWILEAGF 266 (424) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCc--CCCHHHHHHHHHHHHHHhCCcccCCceeccCCc Confidence 112366655543333332221111111111222234344432111 1111111111 1111111 1134566778888 Q ss_pred eeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccCc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 296 DIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQ-SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ 373 (456) Q Consensus 296 ~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~ 373 (456) ++.++.-.. -.-|++..+....+|+.+-|+|+..+|....+. .+..++.....+...+ -.-+-..+++.+..-+- T Consensus 267 ~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~~f~~~t---l~P~~~~ie~~l~~~L~ 343 (424) T protein:vir:18 267 STSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNLGFLQYT---LQPYISRWENSIQRWLI 343 (424) T ss_pred eEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHHHHHHHH---HHHHHHHHHHHHHhhcC Confidence 888775432 223778788889999999999999997533222 1222332222221111 01111222222221111 Q ss_pred hcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-HHHHHHHHHHHHHhhhhhhhccc Q lcl|NC_021301. 374 IEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-DDLDRAREQITLFAGNSVQRPQE 452 (456) Q Consensus 374 ~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-~e~~~~~ee~~~~~~~~~~~~~~ 452 (456) -........+++.+...+..|..+.+++..++.++|+++.-.+++.+|+.|-+--. .-...--...+.. ...+.|.. 
T Consensus 344 ~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~~~~~~n~~~l~~~--~~~~~p~~ 421 (424) T protein:vir:18 344 PAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAMRQSQYVPITDL--GTNKEPRN 421 (424) T ss_pred CccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCccchHhh--hccCCCcc Confidence 01111223345445555678999999999999999999998899999887642110 0000000011111 12345666 Q ss_pred ccC Q lcl|NC_021301. 453 DGS 455 (456) Q Consensus 453 d~~ 455 (456) +|+ T Consensus 422 ~ga 424 (424) T protein:vir:18 422 NGA 424 (424) T ss_pred CCC Confidence 777 No 161 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=98.96 E-value=9.2e-09 Score=64.63 Aligned_cols=396 Identities=14% Similarity=0.042 Sum_probs=171.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHH---HHHHHhcccC-cc----cccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVR---LLARYSNGDA-PL----PELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~---~~~~YY~g~~-~i----~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |-++--.=++.++..-+....+... .......+.. .+ -..+..+.. ..-+.+.=...+|+.+++-+. T Consensus 1 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~-----~~a~~~~aV~~~v~~Ia~~ia 75 (432) T protein:vir:97 1 MPDEKKLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNA-----DAIMRLDAVAACVKLVSQAVA 75 (432) T ss_pred CCCcccCchhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccch-----HhhhcchHHHHHHHHHHHhhc Confidence 4444333333333322211111000 0000000000 00 000000000 001122333457777777777 Q ss_pred cCCeecCC-CCcc---cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEE Q lcl|NC_021301. 
73 PNGITVGG-SADS---DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSV 142 (456) Q Consensus 73 ~~~~~~~~-~~d~---~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~ 142 (456) .-|+.+-. ..|. ....-+..++.. |. -..+...+....+.+|.||+++..+ +|++ .+..++|..+.++. T Consensus 76 ~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l~p~~v~v~~ 154 (432) T protein:vir:97 76 AMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYLANDRLTITT 154 (432) T ss_pred cCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEcCcceEEEE Confidence 77766421 1111 111224444432 32 2355667888999999999998876 4665 57889999998887 Q ss_pred eCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCC Q lcl|NC_021301. 143 DPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPD 222 (456) Q Consensus 143 d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~ 222 (456) |.... + .+.+...+|.. ..+..+.+.++.. ++ +.--. T Consensus 155 ~~~g~--~---~y~~~~~~g~~---~~~~~~~iih~r~--------------------------------~~---~dg~~ 191 (432) T protein:vir:97 155 DTKGN--T---AYRYRRTDGQM---IDIPRQQIWKIMG--------------------------------YS---LDGEN 191 (432) T ss_pred cCCCc--E---EEEEEecCceE---EEEccccEEEecC--------------------------------cC---CCCcc Confidence 65432 1 11222233321 1122333322210 00 01113 Q ss_pred CCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhhhhhhhh--hhccceeccCCCcee Q lcl|NC_021301. 223 GMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAIDYASIFE--AAPGALWELPPGVDI 297 (456) Q Consensus 223 g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~d~~~ 297 (456) |.|-++.... .+...+.-......++. .|..+++- .. ...++.-... ...+. 
...+.+..++.+.++ T Consensus 192 G~spi~~~~~---~i~~~~a~~~~~~~~f~ng~~~~gil~~-~~--~l~~e~~~~~--~~~~~~~~nag~~~vl~~g~~~ 263 (432) T protein:vir:97 192 GLSAIRYGAQ---IFGTAIAAEAQAARAFRNGQLQSVYYQI-DR--FLTDDQYDSF--SKKVSGSVEAGRAPLLEGGMDV 263 (432) T ss_pred cccHHHHHHH---HHHHHHHHHHHHHHHHhccCCcceeEec-CC--CCCHHHHHHH--HHHHhhhhcCCCceecCCCceE Confidence 5555554333 33322222222223332 23223321 11 1111111111 11111 123456778888898 Q ss_pred Eeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021301. 298 WESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQ--SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI 374 (456) Q Consensus 298 ~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~--Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 374 (456) .++.-... ..+++..+....+|+.+-++|+..+|....+. .|..++.....+...| -.-+-..+++.+..-+-- T Consensus 264 ~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~~~f~~~t---l~P~~~~ie~~ln~kLl~ 340 (432) T protein:vir:97 264 KSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTMT---LSPWLRRIEQSIALNLLT 340 (432) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHHHHHHHHH---HHHHHHHHHHHHhhhccC Confidence 88764322 34777788889999999999999997532211 1222322222221111 011112222222211111 Q ss_pred cCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH-H-H-------HHHHHHHHHHH-Hhh Q lcl|NC_021301. 375 EGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK-Q-D-------DLDRAREQITL-FAG 444 (456) Q Consensus 375 ~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~-~-~-------e~~~~~ee~~~-~~~ 444 (456) ......+.+++.+...+-.|..+.++++.+++++|+++.--+++.+|+.+-+-. . . -.+...++... 
-.+ T Consensus 341 ~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~~~~~~~~~~~~pl~~~~~~~~~~~~~ 420 (432) T protein:vir:97 341 PAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGNAAVLTVQSAMVPLDSIGLQASPEPAS 420 (432) T ss_pred ccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCcceEeecccccchhhhcccCCCCCCC Confidence 111112234444444566789999999999999999999889999888653211 0 0 01111111000 011 Q ss_pred hhhhhcccccCC Q lcl|NC_021301. 445 NSVQRPQEDGSR 456 (456) Q Consensus 445 ~~~~~~~~d~~~ 456 (456) ...+..+..-+| T Consensus 421 ~~~~~~~~~~~~ 432 (432) T protein:vir:97 421 GLGNQQQDKVSK 432 (432) T ss_pred CCCCcccccccC Confidence 111111112222 No 162 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=98.92 E-value=1e-08 Score=64.36 Aligned_cols=387 Identities=13% Similarity=0.057 Sum_probs=169.1 Q ss_pred CCCCCHHHHHH---HHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhh-hhhhh-hhccChHHHHHHHHHhhhccCC Q lcl|NC_021301. 1 MTASTPAEWLP---VLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAW-RSFQR-EARTNWGLMVRDSVADRIIPNG 75 (456) Q Consensus 1 ~~~~t~~~~~~---~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~-~~~~~-k~~~n~~~~iVd~~a~~l~~~~ 75 (456) |+-..+..-+. .+..+-..-..+-......+...... ..+ ..-... ..... .........+|+.+++-+..-| T Consensus 1 ~~~~~~~~~~~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~v~~cI~~ia~~ia~~~ 78 (413) T protein:vir:96 1 MPGVSEIRKDKNLKFFNNKRSPTEESKAKDEIPKAPQVVM-TLP-NFFKELISDGYTKLSDSPEVRMAVDCIADLVSNMT 78 (413) T ss_pred CCccchhhhhhcCCccccCCCcchhhhhhccccccccccc-cch-hhHhhhccchhHHHhhchHHHHHHHHHHHhhccCc Confidence 66666544221 11111000000000000000000000 000 000000 00011 1235677788999999888878 Q ss_pred eecCC---CCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCc-e-EEEEEccceeEEEEeCC Q lcl|NC_021301. 
76 ITVGG---SADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGT-A-TITADSPETMVVSVDPL 145 (456) Q Consensus 76 ~~~~~---~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~-~-~i~~~~p~~~~~~~d~~ 145 (456) |.+-. +.+......+..++.. |. -..+...+....+.+|.||+++.++.+|. + .+..++|..+.+.+++. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~~l~~~~v~~~~~~~ 158 (413) T protein:vir:96 79 IQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLTPISPYKVTFNVSDD 158 (413) T ss_pred eEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEEEecCceeEEEEcCC Confidence 77521 1111122223444431 32 24666778899999999999999988874 3 57888898888776543 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC-CCCC Q lcl|NC_021301. 146 QPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN-PDGM 224 (456) Q Consensus 146 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n-~~g~ 224 (456) . +.|....++. .+..+++.++.. ... ..+ -.|. T Consensus 159 ~-------~~y~~~~~~~-----~~~~~evih~k~------------------------------~~~----~~~~~~G~ 192 (413) T protein:vir:96 159 D-------LDYSITFDNK-----EYDPSTLLHFVL------------------------------NPS----IERPFIGT 192 (413) T ss_pred e-------EEEEEeecCc-----EEchhhEEEEec------------------------------cCC----CCCccccc Confidence 1 1111111110 111222222110 000 000 1356 Q ss_pred CcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-hhhhhh-hhh--hccceeccCCC-ceeEe Q lcl|NC_021301. 225 GEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDYASI-FEA--APGALWELPPG-VDIWE 299 (456) Q Consensus 225 s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~~~~-~~~--~~~~~~~~~~d-~~~~~ 299 (456) |-++.+...+.....+..-......-.+.|..+++. ... ..++.... ...... +.. 
..+....++.+ .++.+ T Consensus 193 s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~ 269 (413) T protein:vir:96 193 GYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSV-DSD--SDELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQ 269 (413) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEe-CCC--CCHHHHHHHHHHHHHHhcCccccCceeeecCCcccccc Confidence 655544333332222211111112222334444442 111 11111111 111111 111 12333344333 33334 Q ss_pred ec---ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021301. 300 SQ---TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG 376 (456) Q Consensus 300 ~~---~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 376 (456) +. ..++ .+++..+....+|+.+-|+|+..+|...+ .+..+..+....|.-.+...+..+... + T Consensus 270 ~~~~~~~d~-q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~-~~~~~~~~~~~~l~P~~~~ie~~ln~~--------l---- 335 (413) T protein:vir:96 270 IKPLTLNDL-AINDAVTLDKKTVAGIFGVPAFLLGVGTY-NKDEFNNFINTKIMSIAQVIQQTYNKL--------I---- 335 (413) T ss_pred cccCChhHH-HHHHHHHHHHHHHHHHhCCCHHHcCCCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHh--------h---- Confidence 32 2233 36777788889999999999999975332 233333333323322222222222111 1 Q ss_pred CCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHH--HHhhhhhhhccccc Q lcl|NC_021301. 377 ESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQIT--LFAGNSVQRPQEDG 454 (456) Q Consensus 377 ~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~--~~~~~~~~~~~~d~ 454 (456) ......+++.+...+..|..+.++++.++..+|+++.--+++.+|+.|.+-. ++.--..+ ........+....| T Consensus 336 l~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~~g----d~~~~~~n~~~~~~~~~~~~~~~~ 411 (413) T protein:vir:96 336 VEEDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPDAEM----DDLLVLENYLQQKDLVNQKKLIQD 411 (413) T ss_pred CCCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc----ceeeecccccchhhcccccCCCCC Confidence 1123445555566677899999999999999999999999999998875321 11100000 00000000000001 Q ss_pred CC Q lcl|NC_021301. 
455 SR 456 (456) Q Consensus 455 ~~ 456 (456) +- T Consensus 412 dt 413 (413) T protein:vir:96 412 ET 413 (413) T ss_pred CC Confidence 11 No 163 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.92 E-value=1.5e-09 Score=69.04 Aligned_cols=371 Identities=11% Similarity=-0.001 Sum_probs=164.5 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCccc Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMS-RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSD 85 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~-r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~ 85 (456) +-|++++.+....... -.......+-+-. ..+..+.. ..-+...-...+|+.+++-+..-||.+...... T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~v~~-----~~~l~~~~v~~~i~~ia~~ia~~~~~~~~~~~~- 71 (382) T protein:vir:48 1 MPIFNLATESPPDNQGGFFDVVDSDFLASL---KGNEWVSA-----ETALRNSDLFSIINQLSNDLATVKLITSRKKLQ- 71 (382) T ss_pred CccccccccCCcccccccccchhhhccccc---cCCcccch-----HhhhccHHHHHHHHHHHHhhccCceeeecchhh- Confidence 2233332221110000 0000011010000 00000000 011223345568888888887778876543221 Q ss_pred HHHHHHH-HHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCc Q lcl|NC_021301. 86 LALRARR-IWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAE 163 (456) Q Consensus 86 ~~~~l~~-~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~ 163 (456) .|.+ =........+...+....+.+|.||+++-++.+|.+ .+..++|..+.+..++.... +. |....++. 
T Consensus 72 ---~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~~~-~~----y~~~~~~~ 143 (382) T protein:vir:48 72 ---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNKDG-IY----YNITFDDP 143 (382) T ss_pred ---hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCCe-EE----EEEEecCc Confidence 1211 111112346677788899999999999999988886 68889999998877654332 11 11111111 Q ss_pred -eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHH Q lcl|NC_021301. 164 -SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAEL 242 (456) Q Consensus 164 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s 242 (456) ......+..+.++++.. ++. .....|.|-+..+...++....+.. T Consensus 144 ~~~~~~~~~~~evih~~~--------------------------~~~--------~~~~~G~s~l~~~~~~i~~~~~~~~ 189 (382) T protein:vir:48 144 RIPPKQHVPQNDVLHFRL--------------------------LSV--------DGGMTSVSPLMALSRELDIQKASGN 189 (382) T ss_pred cccceeEEcCccEEEecC--------------------------CCC--------CCccccccHHHHHHHHHHHHHHHHH Confidence 11112223333332210 000 0112466666655544443332222 Q ss_pred HHHHHHHHhhchhhhhhcCCCcccccccccc-hhhhhhhhhhhccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHH Q lcl|NC_021301. 243 QLLSTMAIQAFRQRALKSAGHGLPKVDENGN-AIDYASIFEAAPGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLS 320 (456) Q Consensus 243 ~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~ 320 (456) -......-.+.|..+++- ... ..++... 
............+.++.++.+.++.++...+ ...+++..+....+|+ T Consensus 190 ~~~~~~~ng~~p~~il~~-~~~--~~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia 266 (382) T protein:vir:48 190 LTINSLKNALNANGILKI-KGG--GLLDFKTKLSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVSQLLKQADWTTGQFA 266 (382) T ss_pred HHHHHHhccCCCceEEEe-CCC--CChHHHHHHHHHHHhhccCCCCeeEcCCCceEEEccCChhHHHHHHHHHHHHHHHH Confidence 222222223344444432 111 1111111 1111111222345667788888888876332 2347788888999999 Q ss_pred hhcCCChhhhcccccCcH-HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHH Q lcl|NC_021301. 321 SATKTPLPMLMPDSANQS-AEGAHN-IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEK 398 (456) Q Consensus 321 ~~~~~p~~~~~~~~~N~S-g~Al~~-~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ 398 (456) .+-|+|+..+|....+.+ ..+.+. ....+.-.+...+..+...| + ......+...+. .+.... T Consensus 267 ~afgVp~~~lg~~~~~~~~~~~~~~~~~~~l~p~~~~i~~~l~~~l-------~----~~~~~~~~~~~~----~~~~~~ 331 (382) T protein:vir:48 267 KVYGIPDNVVGGQGDQQSSLEMSSDLYSKAVSRYLRPFLSELSQKL-------S----CDVDADIFPAVD----PTGSNY 331 (382) T ss_pred HHhCCCHHHhCCCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------c----Chhhhhhhhhhc----cchhHH Confidence 999999999986544432 223222 11112121211111111111 0 000111111111 122344 Q ss_pred HHHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccc Q lcl|NC_021301. 399 YAAASLAKAAGESWASIRRNIL---NYNADQIKQDDLDRAREQITLFAGNSVQRPQED 453 (456) Q Consensus 399 ad~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d 453 (456) ...+.+|..+|+.++..+++.+ |+.+.++.+.+. ......+... 
..+ + T Consensus 332 ~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~~-----~~~~~~GGd~-~~~-~ 382 (382) T protein:vir:48 332 ISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGEN-----PNSTLKGGEE-DGQ-D 382 (382) T ss_pred HHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhhc-----CCCCCCCCCC-CCC-C Confidence 5556678888999988888765 676665443211 0011111111 111 1 No 164 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=98.89 E-value=5.8e-09 Score=65.75 Aligned_cols=373 Identities=11% Similarity=-0.015 Sum_probs=151.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCccc---ccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCc Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLP---ELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD 83 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~---~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d 83 (456) +-|.+++...-........-...+. ...+. ..+..... ..-+...=...+|+.+++-+.+-||.+..... T Consensus 1 Mglf~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~v~~-----~~al~~~~V~~~i~~Ia~~ia~l~~~~~~~~~ 73 (384) T protein:vir:49 1 MPIFNITNLATESPPSNQDSFFDIT--DPEFLDALNGSEWVSA-----ETALKNSDLFSIISQLSNDLATAKITTSRKQL 73 (384) T ss_pred CccccccccCcccccccchhhcccc--chhhcccccCCceech-----hhhhccHHHHHHHHHHHHHHhhCceeeecchh Confidence 1122111100000000000000000 00000 00000000 00112223456788888888888888753221 Q ss_pred ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCC Q lcl|NC_021301. 84 SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDA 162 (456) Q Consensus 84 ~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~ 162 (456) . ..+.+-............+...++.+|.||+++-.+.+|++ .+..++|..+.+..++.... +. +.+...+. 
T Consensus 74 ~---~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~-~~---y~~~~~~~ 146 (384) T protein:vir:49 74 Q---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNG-LY---YNITFDDP 146 (384) T ss_pred h---hhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCce-EE---EEEEecCc Confidence 1 11111111123446677788899999999999999999886 58889999988876544322 11 11111111 Q ss_pred ceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHH Q lcl|NC_021301. 163 ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAEL 242 (456) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s 242 (456) .......+..+.++++.. ++.. ....|.|-+..+...++....+.. T Consensus 147 ~~~~~~~~~~~eVih~~~--------------------------~~~~--------~~~~G~s~i~~~~~~i~~~~~~~~ 192 (384) T protein:vir:49 147 RIPPKQHVPQGDILHFRL--------------------------LSVD--------GGLTSVSPLMALGRELNIQKASDK 192 (384) T ss_pred cccceeEecCccEEEecC--------------------------CCCC--------CceeeccHHHHHHHHHHHHHHHHH Confidence 111111222333332210 0000 012366655544443433222222 Q ss_pred HHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHHh Q lcl|NC_021301. 243 QLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLSS 321 (456) Q Consensus 243 ~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~ 321 (456) -..+...-.+.|..+++- ..... .++...............+.+..++.+.++.++...+ ...+++..+....+|+. 
T Consensus 193 ~~~~~~~ng~~~~~il~~-~~~~~-~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~ 270 (384) T protein:vir:49 193 LTLNALKNALNANGILKI-KGGGL-LDFKTKQSRSRQAMKQMQGGPLVLDDLEDFTPLEIKSNVAQLLSQADWTTGQFAK 270 (384) T ss_pred HHHHHHhccCCCceEEEe-CCCCC-hHHHHHHHHHHHhcccCCccceecCCCceEEEccCChhhHHHHHHHHHHHHHHHH Confidence 111222222334333332 11111 1111111111111223345667788888988876332 23477888889999999 Q ss_pred hcCCChhhhcccccC-cHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHH Q lcl|NC_021301. 322 ATKTPLPMLMPDSAN-QSAEGAHNIEKGFLFK-CEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKY 399 (456) Q Consensus 322 ~~~~p~~~~~~~~~N-~Sg~Al~~~~~~l~~k-~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~a 399 (456) +-|+|+..+|...++ .++..++..+...... +.-....+...|..-+.. . ..+..-.+..... T Consensus 271 ~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l~~-----------~----~~~~~~~~~~~~~ 335 (384) T protein:vir:49 271 VYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEVDA-----------D----ILPAVDPTGSNYI 335 (384) T ss_pred HhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhhh-----------h----hhhhhhccchHHH Confidence 999999999875443 3444444433332221 111111222211110000 0 0000001111122 Q ss_pred HHHHHHHhcCCCcHHHHHHhC---CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 400 AAASLAKAAGESWASIRRNIL---NYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 400 d~~~kl~~~g~~s~~t~~~~~---~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ..+..|..+|+.++..+++.+ |+.+.++.++ +...+. 
.-.+.+++ T Consensus 336 ~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~~------~~~~p~------~gGd~~~~ 383 (384) T protein:vir:49 336 GLINSMVKTGTLAQNQGLYVLQQAEILPKDLPEG------ETDSTL------KGGETNEQ 383 (384) T ss_pred HHHHHHhhcCcccHHHHHHHHhhCCCCChhHHHH------cCCCCC------CCCCCCCC Confidence 223334555666655555443 4444332211 011111 11222233 No 165 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=98.89 E-value=1.5e-08 Score=63.45 Aligned_cols=386 Identities=12% Similarity=-0.016 Sum_probs=169.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCccc-ccCcccchhh----------h-hhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 9 WLPVLTKRIDDGMSRVRLLARYSNGDAPLP-ELTRNTSAAW----------R-SFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 9 ~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~-~~~~~~~~~~----------~-~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) ++-+++.++.---.-.-.+..+++.+.... ..+. ..... . ....-+.+.=...+|+.+++-+..-|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~~-~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp~ 79 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSLENPSTPI-TGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMPL 79 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCCCCCcccc-chhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCce Confidence 233333322111111111222222221000 0000 00000 0 000111222244578888888877777 Q ss_pred ecCCCCc----ccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_021301. 77 TVGGSAD----SDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQ 146 (456) Q Consensus 77 ~~~~~~d----~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~ 146 (456) .+-...+ ......+.+++.. |. -..+...+...++.+|.||+++-++..|.+ .+..++|..+.+..+. 
T Consensus 80 ~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~-- 157 (424) T protein:vir:45 80 HVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTG-- 157 (424) T ss_pred EEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcC-- Confidence 6521111 1111224444432 32 234556688889999999999999888886 4888888887655332 Q ss_pred CceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCc Q lcl|NC_021301. 147 PWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGE 226 (456) Q Consensus 147 ~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~ 226 (456) ++ + ++.+....+ . ..+.++.+.++.. . ..+...|.|. T Consensus 158 ~~-~---~y~~~~~~~--~--~~~~~~eVih~r~-------------------------------~----~~d~~~G~sp 194 (424) T protein:vir:45 158 GR-Y---TYGLYNEYG--A--FAISPDDMIHIRA-------------------------------L----GNNQKMGLSP 194 (424) T ss_pred Ce-E---EEEEEecCc--e--EEECcccEEEecC-------------------------------c----CCCCcccccH Confidence 11 1 111111111 1 1122233322210 0 0011236665 Q ss_pred HhHHHHHHHHHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchh-hhh-hhhh---hhccceeccCCCceeE Q lcl|NC_021301. 227 VEPHIDIINRINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAI-DYA-SIFE---AAPGALWELPPGVDIW 298 (456) Q Consensus 227 ~~~v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~-~~~-~~~~---~~~~~~~~~~~d~~~~ 298 (456) ++.....|+. ..+-..-...++ +.|..+++- ... ..++....+ ... .... ...+.+..++.+.++. T Consensus 195 i~~~~~~i~~---~~~~~~~~~~~f~ng~~p~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~g~~~~ 268 (424) T protein:vir:45 195 IMQHAETIGM---GMSGQKYTESFFSGNARPAGIVSV-KSG--LNKESWGWLKDQWQKASQALRRQENKTMLLPADLDYK 268 (424) T ss_pred HHHHHHHHHH---HHHHHHHHHHHHhccCCccEEEEe-CCC--CCHHHHHHHHHHHHHHhccccccCCceeEcCCCceEE Confidence 5544333332 222111122232 234444432 111 111111111 111 1111 1235667788888998 Q ss_pred eecccchH-HHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_021301. 
299 ESQTNDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEG--AHNIEKGFLFKCEDRLSIAKIGLEAILVK-AL 372 (456) Q Consensus 299 ~~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~A--l~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~ 372 (456) ++...+.+ -|++..+....+|+..-|+|+..+|... ++ ++.+. +.+....|.-.+ ..+++.+.. ++ T Consensus 269 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~f~~~tL~P~~--------~~ie~~ln~kLl 340 (424) T protein:vir:45 269 ALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQFVRYTMMPWV--------TNWEQELNRRLF 340 (424) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH--------HHHHHHHHHhcC Confidence 87644332 3778888889999999999999997532 22 12121 112222222222 222222221 11 Q ss_pred HhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccc Q lcl|NC_021301. 373 QIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQE 452 (456) Q Consensus 373 ~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~ 452 (456) .-......+.+++.....+-.|..+.++++.+++++|+++..-+++.+|+.|-+--..-. ..-......+...+.++. T Consensus 341 ~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD~~~--~~~n~~~~~~~~~~~~~~ 418 (424) T protein:vir:45 341 TRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDEML--VSVNAANPAGDFKPPKND 418 (424) T ss_pred ChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceee--ecccccccccccCCCCCC Confidence 001111123344444445567889999999999999999998899999987642110000 000000011111111111 Q ss_pred ccCC Q lcl|NC_021301. 
453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) +|+- T Consensus 419 ~~~~ 422 (424) T protein:vir:45 419 EGKT 422 (424) T ss_pred CCCC Confidence 2222 No 166 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=98.87 E-value=2.4e-08 Score=62.37 Aligned_cols=392 Identities=10% Similarity=0.026 Sum_probs=176.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHH--HHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGM--SRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~--~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) |+.+. ++.++......+. +.-..+..+ ....++.... . ..+.-..+.....+|+.+++-+..-|+.+ T Consensus 1 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~-v-~~~~~~~~~~V~~ci~~Ia~~ia~lp~~~ 69 (409) T protein:vir:93 1 MAKEN---IVTRIKKKLIDNWIDQSTSKLYDF------SPWKNRSFWG-V-INNTLETNETIFSAITKLSNSMASLPLKM 69 (409) T ss_pred CCccc---hhhhhhhhhhhhhhcccccccccc------ccccCccccc-c-chhhhhccHHHHHHHHHHHHhhhhCceeE Confidence 55443 3333333221110 000111111 0001110000 0 00111234455667888888877778776 Q ss_pred CCCCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 79 GGSADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 79 ~~~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) -...+ .....+..++.. |. -......+...++.+|.||+++.++.+|.+ .+..++|..+.+..++.... +. 
T Consensus 70 ~~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~~~-~~- 146 (409) T protein:vir:93 70 YEDYK-VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE-LY- 146 (409) T ss_pred eeccc-cccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE-EE- Confidence 33221 122334444432 32 235567788899999999999999988886 58889999988877654332 11 Q ss_pred EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHH Q lcl|NC_021301. 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHID 232 (456) Q Consensus 153 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~ 232 (456) +.+...+|.. ..+..+.+.++.. ..+ ..--.|.|-++.+.. T Consensus 147 --y~~~~~~g~~---~~~~~~eVih~r~-------------------------------~~~---~~~~~G~s~i~~~~~ 187 (409) T protein:vir:93 147 --YSIHAATGNK---LIVHNMDMLHFKH-------------------------------IVA---SNMVQGISPIDVLKN 187 (409) T ss_pred --EEEEcCCceE---EEEccccEEEeCC-------------------------------CCC---CCccccccHHHHHHH Confidence 1122222221 1233333333310 000 011146666655444 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc-hhhhhhhhhhhccceeccCCCceeEeecccch-HHHHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN-AIDYASIFEAAPGALWELPPGVDIWESQTNDF-TPMLS 310 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~ 310 (456) .++..+.+.. . .......+-..+.-.+.. ..++.-. ........-...+.+..++.+.++.+++..+. 
..|++ T Consensus 188 ~i~~~~~~~~--~-~~~~~~~~~~~i~~~~~~--l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e 262 (409) T protein:vir:93 188 TTDFDNAVRT--F-NLTEMQKPDSFMLKYGSN--VGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVA 262 (409) T ss_pred HHHHHHHHHH--H-HHHhcCCCCceEEecCCC--CCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHH Confidence 4443322111 1 111222221122111111 1111111 11111111123455667788889888764322 24777 Q ss_pred HHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCcccceeEEecC Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDTVDVSFES 389 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~i~v~f~~ 389 (456) ..+....+|+.+-++|+..+|.... ++...++.....+...| -.-+-..+++.+.. ++.-.+....+.+++.+.. T Consensus 263 ~r~~~~~~Ia~~fgVPp~~lg~~~~-~~~sn~e~~~~~f~~~~---l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ 338 (409) T protein:vir:93 263 SENLTRERVANVFQLPSVFLNARSN-TNFAKNEELNRFYLQHT---LLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKS 338 (409) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCC-CCcccHHHHHHHHHHHH---HHHHHHHHHHHHHhhcCCcccccCcceEEeechh Confidence 7777889999999999999975422 11111121111111111 01111222222221 1111111122334433344 Q ss_pred CCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 390 PDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 390 ~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) -+-.|..+.++++.+++++|+++.-.+++.+|+.|-+--. .....+.. ....+......+++++. 
T Consensus 339 ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~--~~~~~~~~~gG~~n~~e 408 (409) T protein:vir:93 339 YLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDT--PLELRKSLKGGDKNVNE 408 (409) T ss_pred hhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccccccc--chhhcccccCCCCCcCC Confidence 4567889999999999999999999999999987642110 00000000 00001111112222222 No 167 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=98.85 E-value=2.6e-08 Score=62.14 Aligned_cols=404 Identities=10% Similarity=0.010 Sum_probs=173.9 Q ss_pred CCC------CC---------HHHHHHH--HHHHHHHHHHHHH-HHHHHhcccCcccccCc-ccchhhhh----hhhhhcc Q lcl|NC_021301. 1 MTA------ST---------PAEWLPV--LTKRIDDGMSRVR-LLARYSNGDAPLPELTR-NTSAAWRS----FQREART 57 (456) Q Consensus 1 ~~~------~t---------~~~~~~~--l~~~~~~~~~r~~-~~~~YY~g~~~i~~~~~-~~~~~~~~----~~~k~~~ 57 (456) +|- +| -++++++ ++.-+..+.+... .+. -+.++. +...+. ..++.... .+..... T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~f-~~s~es-~s~vtsls~pdaf~~vnVs~~~Alkn 124 (945) T protein:vir:10 47 LTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNLF-EYSPES-LMYLPSISDPDAFFLINLFRKYRFNN 124 (945) T ss_pred hhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhhh-hccCcc-ceecccccCccceeeehhhhhhhhcc Confidence 110 01 0111111 1111111111110 011 122221 111111 11111100 0111223 Q ss_pred ChHHHHHHHHHhhhccCCeecCC-CCccc---------HHHHHHHHHHh-cCh-------hHHHHHHHHHHhhCCeEEEE Q lcl|NC_021301. 58 NWGLMVRDSVADRIIPNGITVGG-SADSD---------LALRARRIWRD-NRM-------DSVCKQWVKYGLDFGESYLT 119 (456) Q Consensus 58 n~~~~iVd~~a~~l~~~~~~~~~-~~d~~---------~~~~l~~~~~~-n~~-------~~~~~~~~~~a~~~G~a~~~ 119 (456) .-...+|+.+++-+.+-|+.+-. ..+.. ....+.+++.+ |.. ......+..+.+.+|.+|+. 
T Consensus 125 saV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYie 204 (945) T protein:vir:10 125 DSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIV 204 (945) T ss_pred HHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEE Confidence 34566888888888888876421 11111 12234455543 221 12445677899999999999 Q ss_pred EeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCe-EEEEEEeeeecccccceeeccCC Q lcl|NC_021301. 120 CWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDG-WQKFARPCFVQSSSRRRLVTRIS 197 (456) Q Consensus 120 v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 197 (456) +.++.+|++ .+..++|..+.+..++.... . .+|....++... ..|.... ++.... T Consensus 205 IiRd~~G~ii~L~pLdPs~Vti~~ddDG~~-~---y~Yv~~idG~~~--~~v~a~DvIlhirn----------------- 261 (945) T protein:vir:10 205 KIRDEQGNLVAITPVDGTTIKPILSEDTGI-V---VGYVQEVDGAIV--AHFDKRDVVLFRQN----------------- 261 (945) T ss_pred EEECCCCcEEEEEEECCcceEEEEcCCCcE-E---EEEEEecCCceE--EEecCCceEEEecc----------------- Confidence 999999987 58999999998887655432 1 111112222211 1122221 111100 Q ss_pred CceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh----chhhhhhcCCC--cccc---- Q lcl|NC_021301. 198 DSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA----FRQRALKSAGH--GLPK---- 267 (456) Q Consensus 198 ~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~----~~~~~i~g~~~--~~~~---- 267 (456) .+..+ .....|.|.++ .+..++...+.-......++. .|..++.-... .... T Consensus 262 ---------~s~DG------~~~GyGlSPIe---aa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~ 323 (945) T protein:vir:10 262 ---------LTPDV------YMYGYSLPPIE---ILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQ 323 (945) T ss_pred ---------CCCCc------ccccCCchHHH---HHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccc Confidence 00000 00012444444 444444433333222333332 23333321110 0000 Q ss_pred -cccccchhhhhhhhhh-----hccceeccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHH Q lcl|NC_021301. 
268 -VDENGNAIDYASIFEA-----APGALWELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAE 340 (456) Q Consensus 268 -~~~~~~~~~~~~~~~~-----~~~~~~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~ 340 (456) .++....+ ...+.. ..+..+.++.+.++.++...+ -..+++..+..+.+|+++-|+|+..+|.... .++. T Consensus 324 LseEq~erl--Ke~wee~~sG~NnG~piVLdeGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~-st~S 400 (945) T protein:vir:10 324 LSREQLESI--QRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEG-SNKA 400 (945) T ss_pred cCHHHHHHH--HHHHHHHhCCcccccceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCC-CCcc Confidence 01111111 111111 123334567788888775432 2337788888899999999999999974322 1111 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHh Q lcl|NC_021301. 341 GAHNIEKGFLFK-CEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNI 419 (456) Q Consensus 341 Al~~~~~~l~~k-~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~ 419 (456) .++.....+... ..-....+...+.+.+ . .......+.+.|+.....+..+.++++.++.++|+++.--+++. T Consensus 401 NiEqq~~~Fv~~tL~Pil~~IEqeLNrkL---l---~~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~ 474 (945) T protein:vir:10 401 TAEVMASLTKAKGLEPLMATISKGFDEVV---S---EFRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARME 474 (945) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhc---c---ccccCceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 122222222111 1111122222222211 1 11223457788888877888999999999999999999889999 Q ss_pred CCCChhHHHHH---H------HHH-HHHHH----HHHhhhhhhhc-------ccccCC Q lcl|NC_021301. 420 LNYNADQIKQD---D------LDR-AREQI----TLFAGNSVQRP-------QEDGSR 456 (456) Q Consensus 420 ~~~~~~~~~~~---e------~~~-~~ee~----~~~~~~~~~~~-------~~d~~~ 456 (456) +|+.|-+--.. - .+. .+... 
+...+...+++ ++++.+ T Consensus 475 lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~ 532 (945) T protein:vir:10 475 KGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSV 532 (945) T ss_pred hCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCC Confidence 88765311000 0 000 00000 00000101111 011111 No 168 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.85 E-value=2.8e-08 Score=61.97 Aligned_cols=406 Identities=11% Similarity=0.056 Sum_probs=167.4 Q ss_pred CCCCCHHHHHHHHHHH-HH---HHHH------HHHHHHHHhcccCccc--------c-------cC-cccchhhhhhhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKR-ID---DGMS------RVRLLARYSNGDAPLP--------E-------LT-RNTSAAWRSFQRE 54 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~-~~---~~~~------r~~~~~~YY~g~~~i~--------~-------~~-~~~~~~~~~~~~k 54 (456) +--+|-+++-+-+... |. .... .-+.+.+.-+++.... . .+ ...+..+....+. T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 90 (574) T protein:vir:80 11 IEKSSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKK 90 (574) T ss_pred cchhhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHh Confidence 3334444443333332 10 0000 0022233222221110 0 00 0111122221111 Q ss_pred -hccChHHHHHHHHHhhhc-----------cCCeecC-CCC-------cccHHHHHHHHHHhc---------ChhHHHHH Q lcl|NC_021301. 55 -ARTNWGLMVRDSVADRII-----------PNGITVG-GSA-------DSDLALRARRIWRDN---------RMDSVCKQ 105 (456) Q Consensus 55 -~~~n~~~~iVd~~a~~l~-----------~~~~~~~-~~~-------d~~~~~~l~~~~~~n---------~~~~~~~~ 105 (456) ......+.++++.++.+. +-|+.+- .+. +......+.+++... .+..+... 
T Consensus 91 ~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~ 170 (574) T protein:vir:80 91 FGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKK 170 (574) T ss_pred hccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHH Confidence 122344455555443221 2344431 011 111222344554321 23456677 Q ss_pred HHHHHhhCCeEEEEEeeCCCCceE-EEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeee Q lcl|NC_021301. 106 WVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFV 184 (456) Q Consensus 106 ~~~~a~~~G~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 184 (456) +..+.+.+|.+|+.+-.+.+|++. +..++|..+.+..|..... .....+|+...++... ..|..+.+.++... T Consensus 171 lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~-~~~~~~y~~~~~g~~~--~~~~~~eiih~~~~--- 244 (574) T protein:vir:80 171 LVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKL-IKNGERFVQVIDNRIV--AKFNERELAFAVRN--- 244 (574) T ss_pred HHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccc-ccCceEEEEEeCCceE--EEEccccEEEEecc--- Confidence 888899999999998888888874 8889999998887654211 1111222222222211 12222332222110 Q ss_pred cccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcC Q lcl|NC_021301. 185 QSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSA 261 (456) Q Consensus 185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~ 261 (456) |.--.....+|.|.++.+...|+....+.. ....++. .|..+|.- T Consensus 245 ----------------------------~~~~~~~~~~G~spi~~a~~~i~~~~~a~~---~~~~~f~ng~~p~gil~~- 292 (574) T protein:vir:80 245 ----------------------------PRADIEVGQYGYPELEIALKQFIAHENTEV---FNDRFFSHGGTTRGILHV- 292 (574) T ss_pred ----------------------------CCCCcccccccccHHHHHHHHHHHHHHHHH---HHHHHHhccCCCceEEEe- Confidence 000000112466766655444443332222 2223332 34433321 Q ss_pred CCcccccccccchh-hhh-hhhh--hhccce-eccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhccccc Q lcl|NC_021301. 
262 GHGLPKVDENGNAI-DYA-SIFE--AAPGAL-WELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA 335 (456) Q Consensus 262 ~~~~~~~~~~~~~~-~~~-~~~~--~~~~~~-~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~ 335 (456) ..+....++.-..+ ... ..+. ...+.+ +..+.+.++.++.... -..|++..+..+..|+.+-++|+..+|.... T Consensus 293 ~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~ 372 (574) T protein:vir:80 293 KTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNN 372 (574) T ss_pred CCCCCCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccc Confidence 11111111111111 111 1111 122333 3446778888776332 2337888888999999999999999974321 Q ss_pred -----------C-cHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHH Q lcl|NC_021301. 336 -----------N-QSAEG--AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAA 401 (456) Q Consensus 336 -----------N-~Sg~A--l~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~ 401 (456) | ++.+. +.+....|.-.+. .|++.+...+ +. .....+.+.|.+....+..+...+ T Consensus 373 ~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~--------~ie~~ln~~L-l~--~~~~~~~~~f~~~d~~~~~~~~~~ 441 (574) T protein:vir:80 373 GGATGSKGGSLNEGNSKEKMQASQNKGLQPLLR--------FIEDTVNTYI-VA--EFGEKYQFQFRGGDLSAQLDKLKI 441 (574) T ss_pred ccccccccccccchhHHHHHHHHHHHHHHHHHH--------HHHHHHHhhh-hh--hcCCceEEEecccchhhHHHHHHH Confidence 1 11111 1111111211121 1222221111 01 112345677888777666666544 Q ss_pred HHHHHhcCCCcHHHHHHhCCCChhHH-------------HH------HHHHHHHHHHH----HHhhhhhhhc-cc----- Q lcl|NC_021301. 
402 ASLAKAAGESWASIRRNILNYNADQI-------------KQ------DDLDRAREQIT----LFAGNSVQRP-QE----- 452 (456) Q Consensus 402 ~~kl~~~g~~s~~t~~~~~~~~~~~~-------------~~------~e~~~~~ee~~----~~~~~~~~~~-~~----- 452 (456) .++..+|+++.--+++.+|+.|-+- .+ .+.+..++..+ ..+......+ .+ T Consensus 442 -~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 520 (574) T protein:vir:80 442 -IEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQ 520 (574) T ss_pred -HHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCCCcc Confidence 3466789999988888887754210 00 00001111111 0111111000 00 Q ss_pred --ccCC Q lcl|NC_021301. 453 --DGSR 456 (456) Q Consensus 453 --d~~~ 456 (456) +++. T Consensus 521 ~d~~~~ 526 (574) T protein:vir:80 521 NDTDVS 526 (574) T ss_pred ccccch Confidence 0000 No 169 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.84 E-value=2.9e-08 Score=61.87 Aligned_cols=378 Identities=12% Similarity=0.022 Sum_probs=175.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhh--ccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~--~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) -..-||..|-.-|...-.-...++-.+ ...+ ...+..-++.+....+.+.++++ T Consensus 37 ~~gltp~~l~~iLr~a~~gd~~~~~~L------------------------~e~m~e~D~~i~s~l~~Rk~av~~~~w~I 92 (526) T protein:vir:99 37 AKGLTPAKLARILVEAEQGNLQAQAEL------------------------FMDMEERDAHLFAEMSKRKRAILGLDWAV 92 (526) T ss_pred cCCCCHHHHHHHHHhhhCCCHHHHHHH------------------------HHHHHhhChHHHHHHHHHHHHHhCCCceE Confidence 111122222222221111011111111 1111 13455556666666667777766 Q ss_pred CCCC-----cccHHHHHHHHHHhc-ChhHHHHHHHHHHhhCCe-EEEEEeeCCCCceEE---EEEccceeEEEEeCCCCc Q lcl|NC_021301. 
79 GGSA-----DSDLALRARRIWRDN-RMDSVCKQWVKYGLDFGE-SYLTCWRRDDGTATI---TADSPETMVVSVDPLQPW 148 (456) Q Consensus 79 ~~~~-----d~~~~~~l~~~~~~n-~~~~~~~~~~~~a~~~G~-a~~~v~~d~dg~~~i---~~~~p~~~~~~~d~~~~~ 148 (456) .... +....+.+.+++.+- +|...+.++. +|.-||. +++++|...+|...+ ...+|+.+ .||+..+. T Consensus 93 ~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~f--~~~~~~~~ 169 (526) T protein:vir:99 93 EPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWF--QLNPEDQN 169 (526) T ss_pred ecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeecccce--eeccCCCc Confidence 4322 222334456666543 5777766655 6888998 567888766665443 33444322 23332221 Q ss_pred eEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEE---ccCCCCCC Q lcl|NC_021301. 149 RIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVV---YQNPDGMG 225 (456) Q Consensus 149 ~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~---~~n~~g~s 225 (456) .+ + +.....++ ..-+.++++...+ ..|+.|.| T Consensus 170 ~l----~-~~~~~~~g----------------------------------------~~l~~~k~i~~~~~~~~g~p~g~g 204 (526) T protein:vir:99 170 EL----R-LRDNSPAG----------------------------------------EALQPFGWIIHRPRARSGYVARSG 204 (526) T ss_pred EE----E-ecCCCCCc----------------------------------------eeecCCCeEEEeecCCcCCccccc Confidence 11 0 00000000 0001122222222 25678999 Q ss_pred cHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhhhhhhhccceeccCCCcee--Eeecc Q lcl|NC_021301. 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YASIFEAAPGALWELPPGVDI--WESQT 302 (456) Q Consensus 226 ~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~d~~~--~~~~~ 302 (456) .+..+.-..--=+..+.+.....+.++.|.++.+- ..+. .++....+. .+..+ +.+....++.+..+ .+... 
T Consensus 205 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky-~~~a--~~~ek~~L~~av~~i--~~d~~~iiP~~~~ie~~ea~~ 279 (526) T protein:vir:99 205 LFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKY-PPGT--ADEEKATLLRAVTGL--GHAAAGIIPETMAIDFQQAAQ 279 (526) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEec-CCCC--CHHHHHHHHHHHHHH--hhCcEEEecCCceeEEeecCC Confidence 88875443332333577888888999988776652 1111 122222221 12222 22334455555554 44334 Q ss_pred cchHHHHHHHHHHHHHHHhhc-CCChhhhcccccCcHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCc Q lcl|NC_021301. 303 NDFTPMLSAIKEHIRQLSSAT-KTPLPMLMPDSANQSAEGAHN-IEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGESV 379 (456) Q Consensus 303 ~~~~~~~~~l~~~~~~i~~~~-~~p~~~~~~~~~N~Sg~Al~~-~~~~l~~k~~~~~~~f~~~l~-~~~~l~~~~~~~~~ 379 (456) .+...|...++.+-.+|+.+. |=...... ..++.|.-|+-- ...-....++.-.+.+...+. ++++.++.+..... T Consensus 280 ~~~~~f~~li~~~d~~Isk~iLGqtlTs~~-~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~~Li~~l~~~N~~~~ 358 (526) T protein:vir:99 280 GSSEPFLAMMRQSEDAISKAVLGGTLTSTT-SQSGGGAFALGQVHNEVRHDLLASDARQLAATLSRDLLWPLLVLNRPGS 358 (526) T ss_pred CCHHHHHHHHHHHHHHHHHHHhhhhhcccc-ccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCc Confidence 456778888888888887653 10011100 001111123221 122222333344456666774 57777777765322 Q ss_pred ---ccceeEEecCCCCcCHHHHHHHHHHHHhcCC-CcHHHHHHhCCCChhHHHHHHHHHHHHH---HHHHhhhhhhhccc Q lcl|NC_021301. 380 ---EDTVDVSFESPDRVTLGEKYAAASLAKAAGE-SWASIRRNILNYNADQIKQDDLDRAREQ---ITLFAGNSVQRPQE 452 (456) Q Consensus 380 ---~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~-~s~~t~~~~~~~~~~~~~~~e~~~~~ee---~~~~~~~~~~~~~~ 452 (456) ...-+++|....+.|.++.++.+.+|..+|+ ++.+.+.+.+|+......+.-....... ...........+.. 
T Consensus 359 ~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~ 438 (526) T protein:vir:99 359 PDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNEPVLRSAAQPAILSRQHGQRVAALATI 438 (526) T ss_pred CCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCcccccCCCCCCccccccccccccccccc Confidence 2235678988999999999999999999996 7889999999984321110000000000 00000000000000 Q ss_pred cc---------------------CC Q lcl|NC_021301. 453 DG---------------------SR 456 (456) Q Consensus 453 d~---------------------~~ 456 (456) .+ +. T Consensus 439 ~~~~~~~~~~~d~~l~~~~~~~~~~ 463 (526) T protein:vir:99 439 VGPRYGDQQALDKALADLPAKDMQN 463 (526) T ss_pred ccccCcchhhHHHHHHHHHHHHHHH Confidence 00 00 No 170 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.83 E-value=1.6e-08 Score=63.32 Aligned_cols=387 Identities=8% Similarity=-0.019 Sum_probs=179.6 Q ss_pred HHH--HHHHHHHHHHHHHHHHHHhcccCcccccCcccchh--------hhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 10 LPV--LTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA--------WRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 10 ~~~--l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~--------~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) +++ |..+......--+....|..|-. .+. ++- ++.......-.+..-++++....+.+.++++. T Consensus 1 v~~~~l~~e~at~~~~~d~~~~~~~~l~----~~~--~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~ 74 (488) T protein:vir:99 1 MEKPALGREIATSGDGRDITRPFISGLQ----VPN--DSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWKVE 74 (488) T ss_pred CCccchhHHHHHHHhhhhhhccccCCCC----CCC--hHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCceEE Confidence 211 11111111111112222322211 010 111 01111112346777788888888888888875 Q ss_pred CCCccc----HHHHHHHHHHhcChhHHHHHHHHHHhhCCeE-EEEEeeCCCCceEE---EEEccceeEEEEeCCCCceEE Q lcl|NC_021301. 
80 GSADSD----LALRARRIWRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTATI---TADSPETMVVSVDPLQPWRIR 151 (456) Q Consensus 80 ~~~d~~----~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~d~dg~~~i---~~~~p~~~~~~~d~~~~~~~~ 151 (456) ...++. ..+.+.+.+..-.|...+.++. +|..||.+ ++++|...+|...+ ..++|+. ..||+..+..+ T Consensus 75 p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~--f~~d~~~~l~~- 150 (488) T protein:vir:99 75 AGGDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRR--FRYDQDGGLRL- 150 (488) T ss_pred cCCCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccc--eeecCCCceEE- Confidence 433332 2244566666667877777765 68889985 67888766666543 3334432 22333221100 Q ss_pred EEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEE---ccCCCCCCcHh Q lcl|NC_021301. 152 SAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVV---YQNPDGMGEVE 228 (456) Q Consensus 152 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~---~~n~~g~s~~~ 228 (456) ....+. ..+.. .+...+++...+ ..|+.|.|.+. T Consensus 151 -----~~~~~~---------------------------------~~g~~-----lp~~~~~i~~~~~~~~g~p~g~gLl~ 187 (488) T protein:vir:99 151 -----LTPNNM---------------------------------FEGEP-----CPAPYFWHFSTGADNDDEPYGLGLAH 187 (488) T ss_pred -----eccCCC---------------------------------CCccc-----cccCceEEEEeecCCCCCcccchHHH Confidence 000000 00000 000111111111 35788999988 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee--EeecccchH Q lcl|NC_021301. 229 PHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI--WESQTNDFT 306 (456) Q Consensus 229 ~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~--~~~~~~~~~ 306 (456) .+....--=+..+.+.....+.++.|.++.+- ++....++....+... ....+.+....++.+.++ .+....+.. 
T Consensus 188 ~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky--~~~~a~~~ek~~l~~a-v~~~~~~~~~viP~~~~ie~~ea~~~~~~ 264 (488) T protein:vir:99 188 WLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRY--DDKTATPEDKAKLLAA-LHAIQTDSAIIMPAGMQAELLEAGRSGTA 264 (488) T ss_pred HHHHHHHHHHhhHHHHHHHHHHcCCceeeeec--CCCCCCHHHHHHHHHH-HHHHhcCcEEEecCCceeEEeecCCCChH Confidence 76544333344477778888999999765542 1111112222222111 112223334455555554 443444556 Q ss_pred HHHHHHHHHHHHHHhhc-CCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCccccee Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSAT-KTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGESVEDTVD 384 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~-~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~~l~~~~~~~~~~~~i~ 384 (456) .|...++.+-.+|+.+. |=....-++.++++.|. ....-....++.-.+.+...+. +++..++.+.... ...-. T Consensus 265 ~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~---vh~~v~~d~~~aDa~~i~~tln~~li~~l~~~N~~~-~~~p~ 340 (488) T protein:vir:99 265 DYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDD---LQADVRLDLVKADADLICESFNLGPARWLTEWNFPG-AQPPR 340 (488) T ss_pred HHHHHHHHHHHHHHHHHhhhhhcccccccchhhHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcCC-cCCce Confidence 78888888777777653 10001111111222222 1122233334444466666664 4777767665432 22245 Q ss_pred EEecCCCCcCHHHHHHHHHHHHhc-CC-CcHHHHHHhCCCChhHHHHHH----------------------HHHHH---- Q lcl|NC_021301. 385 VSFESPDRVTLGEKYAAASLAKAA-GE-SWASIRRNILNYNADQIKQDD----------------------LDRAR---- 436 (456) Q Consensus 385 v~f~~~~~~~~~e~ad~~~kl~~~-g~-~s~~t~~~~~~~~~~~~~~~e----------------------~~~~~---- 436 (456) +.|....+.+.++.++.+.+|..+ |+ ++.+.+++.+|+.+.+..... .+... T Consensus 341 ~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 420 (488) T protein:vir:99 341 VYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQAEATAPTPSTEFAEGDQPSDPAAAMAPQLAEAMQ 420 (488) T ss_pred eEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCcccccccccCCCcccCCCCCCCCCchHHHHHHHHHHHH Confidence 678888889999999999999986 74 788888888887643211000 00000 Q ss_pred HHHHHHhhhhhhhccccc------CC Q lcl|NC_021301. 
437 EQITLFAGNSVQRPQEDG------SR 456 (456) Q Consensus 437 ee~~~~~~~~~~~~~~d~------~~ 456 (456) ...+.+.......-++-+ ++ T Consensus 421 ~~~~~~~~~i~~~l~~a~s~ee~~~~ 446 (488) T protein:vir:99 421 PVVGNWTTQLRTLIEQASSLEDLRER 446 (488) T ss_pred HHHHHHHHHHHHHHHhcCCHHHHHHH Confidence 000000000000000000 00 No 171 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.75 E-value=6.7e-08 Score=59.92 Aligned_cols=442 Identities=10% Similarity=0.027 Sum_probs=178.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHhcccCcccc--cC---cccchhhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG----MSRVRLLARYSNGDAPLPE--LT---RNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~----~~r~~~~~~YY~g~~~i~~--~~---~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) |+.++-..+|..+++..... .++++.+.+||.......+ .+ +..-.......+++..+.+...|+.+++.| T Consensus 20 ~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~s~L 99 (641) T protein:vir:94 20 LSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADWRHRINTGHTFEVVETLVAYF 99 (641) T ss_pred CCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcccccccchhHHHHHHHHhhHH Confidence 77766555555544444332 2456677788766433221 11 111111112234678888888888888877 Q ss_pred cc----CC--eecC--CCCcccHHHHHHH----HHHhcChhHHHHHHHHHHhhCCeEEEEEeeC---------------- Q lcl|NC_021301. 72 IP----NG--ITVG--GSADSDLALRARR----IWRDNRMDSVCKQWVKYGLDFGESYLTCWRR---------------- 123 (456) Q Consensus 72 ~~----~~--~~~~--~~~d~~~~~~l~~----~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d---------------- 123 (456) .+ .+ |.+. 
..+|.+..+.+++ .+.++++........++++.+|.+++.++-+ T Consensus 100 m~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~~~~ 179 (641) T protein:vir:94 100 KGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVETGD 179 (641) T ss_pred hhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhcccchh Confidence 54 22 4432 2234444433333 3446777777788999999999998877521 Q ss_pred ------------CCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec--------CCc-----------eEEE----- Q lcl|NC_021301. 124 ------------DDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL--------DAE-----------SDFA----- 167 (456) Q Consensus 124 ------------~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~--------d~~-----------~~~~----- 167 (456) ....+++..++|.+++ +|+.....-..++++.... +|. ..+. T Consensus 180 ~~~~~~~~~v~~~~~~~r~~~v~~~di~--~dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~~~d 257 (641) T protein:vir:94 180 IFGGWEDVAVNRQRSELRIEPLSPYDVW--LDTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFADPD 257 (641) T ss_pred hcccccccceecccceeeEEecchhhee--ecCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhcccccccccc Confidence 0122456666677654 5554432211122111000 000 0000 Q ss_pred ----EEEcCC---eEEEEEEeeeeccc-ccceeeccCCCceeecccccccCceeEEEEc-----cCCCCCCcHhHHHHHH Q lcl|NC_021301. 168 ----IVWSGD---GWQKFARPCFVQSS-SRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-----QNPDGMGEVEPHIDII 234 (456) Q Consensus 168 ----~~~~~~---~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-----~n~~g~s~~~~v~~li 234 (456) .-+... +++.|......... ...+.....++..+.....+++.++|.+++. 
...+|.|-...+++.+ T Consensus 258 ~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l~dq 337 (641) T protein:vir:94 258 TPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNLGAL 337 (641) T ss_pred cccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCCcccCCChHHHHHHHH Confidence 000000 11111110000000 0011112222222212222223344443321 3458999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeecc--cchHHHHHHH Q lcl|NC_021301. 235 NRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQT--NDFTPMLSAI 312 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~--~~~~~~~~~l 312 (456) ..+|.+.-.+...++...+|...+.. .+.. ....+...+|.++..+....++.+.. .+.......+ T Consensus 338 k~ln~l~r~~ld~~~~~~~p~~~~~~--~~~~----------~~~~l~~~PG~ii~~~~~~~v~pl~~~~~~~~~~~~~~ 405 (641) T protein:vir:94 338 HVLNVLTNGRLDNLVLHINKMWTLVE--DGIL----------KREDVKAKPGAVFKVAQHGSLQPIDMGRQDFVVTYQEA 405 (641) T ss_pred HHHHHHHHHHHHHHHHHhCCeeeecc--cccc----------ccceeeccCCcceeeCCCCcceeecCCccccchhHHHH Confidence 99999999888888888877553321 0000 01112333444444333233333221 1222222223 Q ss_pred HHHHHHHHhhcCCChhhhcc---cccCcHHHHHHHHHHHHHHHHHHHHHHHHH-HH----HHHHHHHHHhc--------- Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMP---DSANQSAEGAHNIEKGFLFKCEDRLSIAKI-GL----EAILVKALQIE--------- 375 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~---~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~-~l----~~~~~l~~~~~--------- 375 (456) +.+...+-..+++.....+. ++.+.+|..+.........+....-+.|.+ .+ ++++.++.... 
T Consensus 406 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~ 485 (641) T protein:vir:94 406 QVQESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMY 485 (641) T ss_pred HHHHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhh Confidence 33333333334433221111 111224444444444444444444444442 33 33333332211 Q ss_pred -------CC----CcccceeEEecCCCCcCHHHHHHHHHHHHh----cCCCc--------H---HHHHHhCCCC-h---- Q lcl|NC_021301. 376 -------GE----SVEDTVDVSFESPDRVTLGEKYAAASLAKA----AGESW--------A---SIRRNILNYN-A---- 424 (456) Q Consensus 376 -------~~----~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~----~g~~s--------~---~t~~~~~~~~-~---- 424 (456) |. +++...++...+-......+.++.+..|.+ .|..+ . ....+..|+. + T Consensus 486 ~~~~~~~~~~~~~p~~L~~~~~iv~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~~~p~~~i 565 (641) T protein:vir:94 486 VPEEQMDGFFEVSPEYLHYPYKFLALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRFTDPMRYI 565 (641) T ss_pred chhhhcccCCCCCccceeeeeeEeecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCCCCchhhc Confidence 10 011111222222112222233333333322 11111 1 1112233431 1 Q ss_pred ---hHH---HHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 425 ---DQI---KQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 425 ---~~~---~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +.. ..++.++.++++-..++......+..+.. 
T Consensus 566 r~~~~~~~~~~~~~~~~q~~~~~~a~~~~~~~~~~a~~ 603 (641) T protein:vir:94 566 KKAEAPPAAPPIAPAEPGALPPEMMNSVGGGLNDQAIA 603 (641) T ss_pred cCccCchhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHH Confidence 100 01111111222222222111122222211 No 172 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=98.75 E-value=6.7e-08 Score=59.91 Aligned_cols=388 Identities=9% Similarity=0.017 Sum_probs=171.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHhcccCcccc-cCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeec Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLA-RYSNGDAPLPE-LTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITV 78 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~-~YY~g~~~i~~-~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~ 78 (456) |+-+. ++.++....- ..+. .-..+-++... .++....... +.-....-...+|+.++.-+..-|+.+ T Consensus 1 ~~~~~---~~~~~k~~~~------~~~~~~~~~~~~~~~~~~~~~~~~v~~--~~a~~~~~V~~ci~~ia~~ia~lp~~~ 69 (409) T protein:vir:96 1 MAKEN---IVTRIKKKLI------DNWIDQSASKLYDFSPWKNKSFWGVIN--NTLETNETIFSAITKLSNSMASLPLKM 69 (409) T ss_pred Ccccc---chhhhhhHHh------hhhhccccccccccccccCccccccch--hhHhhhHHHHHHHHHHHHhhhhCceEE Confidence 44333 2222222210 0000 00011111100 1111100000 111223445667888888777777765 Q ss_pred CCCCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 79 GGSADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 79 ~~~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) ....+ .....+.+++.. |. -......++..++.+|.||+++-++.+|.+ .+..++|..+.++.++.... +. 
T Consensus 70 ~~~~~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~-~~- 146 (409) T protein:vir:96 70 YEDYK-VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE-LY- 146 (409) T ss_pred eeccc-ccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE-EE- Confidence 33222 122234444432 32 234556788899999999999999889886 58888999988887654332 11 Q ss_pred EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHH Q lcl|NC_021301. 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHID 232 (456) Q Consensus 153 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~ 232 (456) +.+...++.. ..|..+.+.++. ..++ .+.-.|.|-++.... T Consensus 147 --y~~~~~~g~~---~~~~~~evih~r-------------------------------~~~~---~~~~~G~s~l~~~~~ 187 (409) T protein:vir:96 147 --YSIHAATGNK---LIVHNMDMLHFK-------------------------------HIVA---SNMVQGISPIDVLKN 187 (409) T ss_pred --EEEEcCCceE---EEEccccEEEeC-------------------------------CCCC---CCccccccHHHHHHH Confidence 1112222211 123333333221 0000 011246666655444 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc-hhhhhhhhhhhccceeccCCCceeEeecccch-HHHHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN-AIDYASIFEAAPGALWELPPGVDIWESQTNDF-TPMLS 310 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~ 310 (456) .++..+.+ ... . ......+-.++.-.+. ...++.-. ........-...+.+..++.+.++.++..... 
..+++ T Consensus 188 ~i~~~~~~-~~~-~-~~~~~~~~~~i~~~~~--~l~~e~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e 262 (409) T protein:vir:96 188 TTDFDNAV-RTF-N-LTEMQKPDSFMLKYGS--NVSTEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVA 262 (409) T ss_pred HHHHHHHH-HHH-H-HHhcCCCceeEEecCC--CCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHH Confidence 44432211 111 1 1111111111211111 11111111 11111111123456777888889888754322 23777 Q ss_pred HHHHHHHHHHhhcCCChhhhccccc-C-cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCcccceeE Q lcl|NC_021301. 311 AIKEHIRQLSSATKTPLPMLMPDSA-N-QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDTVDV 385 (456) Q Consensus 311 ~l~~~~~~i~~~~~~p~~~~~~~~~-N-~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~i~v 385 (456) ..+....+|+.+-++|+..+|.... + ++.+... +....|.-.+ ..+++.+.. ++.-.+......+++ T Consensus 263 ~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~f~~~~l~P~~--------~~ie~~l~~~Ll~~~~~~~g~~i~f 334 (409) T protein:vir:96 263 SENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRFYLQHTLLPIV--------KQYEEEFNRKLLTKTDREKNRYFKF 334 (409) T ss_pred HHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHH--------HHHHHHHHhhcCCcccccCcceEEe Confidence 7777888999999999999975321 1 1222211 2222221111 122221111 111111112233443 Q ss_pred EecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH-----HHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 386 SFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD-----DLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 386 ~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~-----e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ....-+-.|..+.++++.++.++|+++.-.+++.+|+.|-+--.. ..-.+.. ....+...+-.+++++. 
T Consensus 335 d~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~n~~~~~~--~~~~~~~~~gG~~n~~e 408 (409) T protein:vir:96 335 NVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPIDT--PLELRKSLKGGDKNVNE 408 (409) T ss_pred echhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeeccccccccc--chhhcccccCCCCCcCC Confidence 334445678899999999999999999988899998865421100 0000000 00000001111111111 No 173 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=98.75 E-value=6.7e-08 Score=59.90 Aligned_cols=394 Identities=11% Similarity=0.023 Sum_probs=169.7 Q ss_pred CCCCCHHHHHHHHHHHHHHH-HH----HH----HHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG-MS----RV----RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~-~~----r~----~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) |+.+-.. ++......-... .. .+ ......+-|...- .+..+.. ..-+.+.=...+|+.+++-+ T Consensus 1 ~~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~--~g~~v~~-----~~al~~~~V~~~i~~ia~~i 72 (434) T protein:vir:43 1 MSKSLGK-VLSSATSAPRSSLFGWGGKTIRLTDGAFWSQFLGRESS--SGKKVTV-----DKAMKLSAVWACVRLISTSV 72 (434) T ss_pred Cccchhh-hhhhcccccchhhhcccccccccCchHHHHHHhcCCcc--CCceech-----hhhhccHHHHHHHHHHHHhh Confidence 5443322 222222111000 00 00 0111111111000 0000000 00122222345778777777 Q ss_pred ccCCeecC-CCCc----ccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEE Q lcl|NC_021301. 72 IPNGITVG-GSAD----SDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVV 140 (456) Q Consensus 72 ~~~~~~~~-~~~d----~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~ 140 (456) ..-|+.+- ...+ ....-.+.+++.. |. 
-..+...+....+.+|.||+++..+ +|.+ .+..++|..+.+ T Consensus 73 a~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v~~ 151 (434) T protein:vir:43 73 AGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRVDL 151 (434) T ss_pred hhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcceEE Confidence 76676641 1111 1112234455432 33 2356677888999999999988766 5775 578899999888 Q ss_pred EEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC Q lcl|NC_021301. 141 SVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN 220 (456) Q Consensus 141 ~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n 220 (456) ..++.. + + ..++...+|.. ..+..+.+.++.. . .+.. T Consensus 152 ~~~~~g-~-~---~y~~~~~~g~~---~~~~~~eVih~~~-------------------------------~----~~dg 188 (434) T protein:vir:43 152 ECDENG-R-L---KYFYTTKKGAR---REIERTNMLHIPA-------------------------------F----TLDG 188 (434) T ss_pred EEcCCC-e-E---EEEEEecCceE---EEEccccEEEecC-------------------------------c----CCCC Confidence 776442 1 1 11222222221 1223333332210 0 0111 Q ss_pred CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchh-hhhhhhh--hhccceeccCCC Q lcl|NC_021301. 221 PDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWELPPG 294 (456) Q Consensus 221 ~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~~d 294 (456) ..|.|-++.....+.. ...--.....++. .|-.+++. ... ..++....+ ....... 
...+.+..++.+ T Consensus 189 ~~G~spi~~~~~~i~~---~~~~~~~~~~~f~ng~~~~gil~~-~~~--l~~e~~~~~r~~~~~~~g~~nag~~~vl~~g 262 (434) T protein:vir:43 189 RIGLSAIRYGVDVFGS---VMSAEDAANGTFKNGLLPTVAFKV-DRI--LQPAQREEFREYVKSVSGAMNSGRSPVLEQG 262 (434) T ss_pred ccccCHHHHHHHHHHH---HHHHHHHHHHHHhccCCcceEEec-CCC--CCHHHHHHHHHHHHHhcCccccCCccccCCC Confidence 2355655543333322 2221112222322 23333322 111 111111111 1111111 113456667788 Q ss_pred ceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhccccc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 295 VDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 295 ~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) .++.++.... -.-|++..+....+|+.+-|+|+..+|.... +..+..++.....+...| -.-+-..+++.+..-+ T Consensus 263 ~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~f~~~~---L~P~~~~ie~~ln~kL 339 (434) T protein:vir:43 263 ITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLAFLTFS---ISSITNQIQQCVNKRL 339 (434) T ss_pred ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHHHHHHH---HHHHHHHHHHHHHhhc Confidence 8888875332 2237788888899999999999999875321 111222222222221111 0111122222221111 Q ss_pred HhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH--------HHHHHHHHHHHHHhh Q lcl|NC_021301. 373 QIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ--------DDLDRAREQITLFAG 444 (456) Q Consensus 373 ~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~--------~e~~~~~ee~~~~~~ 444 (456) --......+.+++.+...+..|..+.+++..++.++|+++.-.+++.+|+.|-+--. .-.+...+. +.... 
T Consensus 340 ~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~-~~~~~ 418 (434) T protein:vir:43 340 LTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGDILTVQSNLVPIDQLGQS-NKSQA 418 (434) T ss_pred CChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccCccchhhhhcc-CCCcc Confidence 001111123345444555677999999999999999999999999998886532100 001111110 00000 Q ss_pred hhhhhcccccCC Q lcl|NC_021301. 445 NSVQRPQEDGSR 456 (456) Q Consensus 445 ~~~~~~~~d~~~ 456 (456) .....++..+.+ T Consensus 419 ~~~~~~~~~~~~ 430 (434) T protein:vir:43 419 VRAALMNWFSQP 430 (434) T ss_pred hhhhhhccCCCC Confidence 000111111111 No 174 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=98.73 E-value=7.7e-08 Score=59.60 Aligned_cols=390 Identities=9% Similarity=0.051 Sum_probs=175.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cCcccccCc-ccchhhhh-hhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNG-DAPLPELTR-NTSAAWRS-FQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g-~~~i~~~~~-~~~~~~~~-~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |+=-.-.-++.++.+.. ..++... ...+..... ........ ...-........+|+.+++-+..-||. T Consensus 1 m~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~a~~~~~v~~~i~~ia~~iA~lp~~ 71 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKL---------IDNWIDQSTSKLYDFSPWKNRSFWGVINNTLETNETIFSAITKLSNSMASLPLK 71 (412) T ss_pred CccchhhhhhhhhhhhH---------hhhhhcccccccccccccCCccccccchhhhhccHHHHHHHHHHHHhHhhCcee Confidence 32211111121111111 1111100 001000000 00000000 011223445566788888877777876 Q ss_pred cCCCCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEE Q lcl|NC_021301. 
78 VGGSADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIR 151 (456) Q Consensus 78 ~~~~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~ 151 (456) +-...+ .....+..++.. |. -......++..++.+|.||+++.++.+|.+ .+..++|..+.+..++.... +. T Consensus 72 ~~~~~~-~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~-~~ 149 (412) T protein:vir:26 72 MYEDYK-VVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQSRE-LY 149 (412) T ss_pred Eeeccc-cccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCCcE-EE Confidence 532221 222334444432 32 234557788899999999999999999986 58889999998888765432 11 Q ss_pred EEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHH Q lcl|NC_021301. 152 SAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHI 231 (456) Q Consensus 152 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~ 231 (456) +.+...++.. ..+..+.+.++.. ..+ ...-.|.|-++... T Consensus 150 ---y~~~~~~g~~---~~~~~~evih~~~-------------------------------~~~---~~~~~G~s~i~~~~ 189 (412) T protein:vir:26 150 ---YSIHAATGNK---LIVHNMDMLHFKH-------------------------------IVA---SNMVQGISPIDVLK 189 (412) T ss_pred ---EEEEcCCceE---EEEccccEEEeCC-------------------------------CCC---CCCcccccHHHHHH Confidence 1122222211 1233333333310 000 01124666665444 Q ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccccc-chhhhhhhhhhhccceeccCCCceeEeecccch-HHHH Q lcl|NC_021301. 232 DIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENG-NAIDYASIFEAAPGALWELPPGVDIWESQTNDF-TPML 309 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~ 309 (456) ..++..+.+. .. .......+-..+.-... ...++.. .............+.+..++.+.++.+++..+. 
..|+ T Consensus 190 ~~i~~~~a~~-~~--~~~~~~~~~~~i~~~~~--~l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~ 264 (412) T protein:vir:26 190 NTTDFDNAVR-TF--NLTEMQKPDSFMLKYGS--NVGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIV 264 (412) T ss_pred HHHHHHHHHH-HH--HHHhcCCCCceEEecCC--CCCHHHHHHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHH Confidence 4333322111 11 11112222111111111 1111111 111111111223455667788888888763322 2477 Q ss_pred HHHHHHHHHHHhhcCCChhhhccccc-C-cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCccccee Q lcl|NC_021301. 310 SAIKEHIRQLSSATKTPLPMLMPDSA-N-QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDTVD 384 (456) Q Consensus 310 ~~l~~~~~~i~~~~~~p~~~~~~~~~-N-~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~i~ 384 (456) +..+..+.+|+.+-|+|+..+|.... + ++.+... +....|.- +-..+++.+.. ++.-.+......++ T Consensus 265 e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f~~~~l~P--------~~~~ie~~ln~kLl~~~~~~~~~~~~ 336 (412) T protein:vir:26 265 ASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLP--------IVKQYEEEFNRKLLTKTDREKNRYFK 336 (412) T ss_pred HHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHH--------HHHHHHHHHHhhcCCcccccCcceEE Confidence 77777889999999999999976432 2 1222211 11111111 11222222221 11111111223344 Q ss_pred EEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHH-----HHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 385 VSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQD-----DLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 385 v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~-----e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +.+..-+..|..+.++++.++.++|+++.--+++.+|+.|-+--.. ....+. .....+....-.+++++. 
T Consensus 337 fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~--~~~~~~~~~~gG~~n~~e 411 (412) T protein:vir:26 337 FNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPID--TPLELRKSLKGGDKNVNE 411 (412) T ss_pred eechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccc--cchhhcccccCCCCCcCC Confidence 4444556778999999999999999999999999999876421100 000000 000011111122223333 No 175 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=98.73 E-value=8e-08 Score=59.50 Aligned_cols=427 Identities=12% Similarity=0.023 Sum_probs=160.8 Q ss_pred CC-------------------CCCHHHHHHHHHHHH-------HHHHHHHHHHHHH--hcccCcccccCcccchhhhhhh Q lcl|NC_021301. 1 MT-------------------ASTPAEWLPVLTKRI-------DDGMSRVRLLARY--SNGDAPLPELTRNTSAAWRSFQ 52 (456) Q Consensus 1 ~~-------------------~~t~~~~~~~l~~~~-------~~~~~r~~~~~~Y--Y~g~~~i~~~~~~~~~~~~~~~ 52 (456) |- .=+-.+++..|.+.+ ...+.+.....+| |+|+.+ +++. + ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~-~--gr-- 71 (763) T protein:vir:95 1 MEQNTDSMVPLPDPSQATKLTSWKNELSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAK----PPKV-K--GR-- 71 (763) T ss_pred CCcCccCcCCCccccchhcCCCCCChHHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCc----cccc-C--CC-- Confidence 11 111123344444443 2333333334444 555432 1111 1 11 Q ss_pred hhhccChHHHHHHHHHhhh----ccC-C-eec--CCCCcccHHHHHHH-----HHHhcChhHHHHHHHHHHhhCCeEEEE Q lcl|NC_021301. 53 REARTNWGLMVRDSVADRI----IPN-G-ITV--GGSADSDLALRARR-----IWRDNRMDSVCKQWVKYGLDFGESYLT 119 (456) Q Consensus 53 ~k~~~n~~~~iVd~~a~~l----~~~-~-~~~--~~~~d~~~~~~l~~-----~~~~n~~~~~~~~~~~~a~~~G~a~~~ 119 (456) .+++..-.+..|+.....| ++. . |.+ .+..|.+..+.... ++..|+=.......+++++++|.+++. 
T Consensus 72 s~vv~~~v~~~ve~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k 151 (763) T protein:vir:95 72 SQVQPKLVRRQAEWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVR 151 (763) T ss_pred ccccCHHHHHHHHHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEE Confidence 1355555555666555544 222 2 333 23344444443332 455565556677899999999999777 Q ss_pred EeeC---------------------------------------------------------------------------- Q lcl|NC_021301. 120 CWRR---------------------------------------------------------------------------- 123 (456) Q Consensus 120 v~~d---------------------------------------------------------------------------- 123 (456) ||-+ T Consensus 152 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (763) T protein:vir:95 152 VGWNREIRKEKQEVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVE 231 (763) T ss_pred EeeeeeeeeeeeeehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEE Confidence 6421 Q ss_pred --CCCceEEEEEccceeEEEEeCCCC-c--eEEEEEEEEEecC----Cc-----------eEE----EE---------EE Q lcl|NC_021301. 124 --DDGTATITADSPETMVVSVDPLQP-W--RIRSAMRWWRDLD----AE-----------SDF----AI---------VW 170 (456) Q Consensus 124 --~dg~~~i~~~~p~~~~~~~d~~~~-~--~~~~~~~~~~~~d----~~-----------~~~----~~---------~~ 170 (456) ..+.|+|..++|+++++-.+-..+ . ..++...+.+..+ +. ... .+ -+ T Consensus 232 ~~~k~~p~ie~V~p~d~~iDp~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 311 (763) T protein:vir:95 232 VPLANHPTVEMLNPENIIIDPSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQI 311 (763) T ss_pred EEecCceEEEeecHHHheecCCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccC Confidence 013557788999998743321111 1 1221111111000 00 000 00 00 Q ss_pred cC---CeE--EEEEEeeeec-cc-ccceeeccCCCceeecccccccCceeEEEE------ccCCCCCCcHhHHHHHHHHH Q lcl|NC_021301. 
171 SG---DGW--QKFARPCFVQ-SS-SRRRLVTRISDSWVPVGDAVVTGSPPPVVV------YQNPDGMGEVEPHIDIINRI 237 (456) Q Consensus 171 ~~---~~~--~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~------~~n~~g~s~~~~v~~liDa~ 237 (456) .+ ..+ +.+....... +. .....+...++........+...+.+|++. ....+|.|.+..++++++.+ T Consensus 312 ~d~~~~~V~v~E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~ 391 (763) T protein:vir:95 312 SDPMRKRVVAYEYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVL 391 (763) T ss_pred CCcccceEEEEEeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHH Confidence 00 001 1111100000 00 011112222333333333333333334332 24568999999999999999 Q ss_pred HHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee------Eeecc--cchHHHH Q lcl|NC_021301. 238 NRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI------WESQT--NDFTPML 309 (456) Q Consensus 238 ~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~------~~~~~--~~~~~~~ 309 (456) |...+.+..+....+.+.. +...+. + .....+...+|.++...++.+. ..++. ....... T Consensus 392 N~~~~~~~d~l~~~~~~~~-~v~~ga----v-------~~~d~~~~~pg~v~~v~~g~~~~~~~~~~~~p~~~~~~~~~l 459 (763) T protein:vir:95 392 GAVMRGMIDLLGRSANGQR-GMPKGM----L-------DALNSRRYREGEDYEYNPTQNPAQMIIEHKFPELPQSALTMA 459 (763) T ss_pred HHHHHHHHHHHHhhcCCcE-Eeeccc----c-------cchhhhcccCCceEEeeCCCChhhhcccccCCCCcchHHHHH Confidence 9999999988888777743 222111 0 1111122233444433332221 12222 1233344 Q ss_pred HHHHHHHHHHHhhcCCChhhhccc--ccC--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHhcCCC--- Q lcl|NC_021301. 310 SAIKEHIRQLSSATKTPLPMLMPD--SAN--QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILV----KALQIEGES--- 378 (456) Q Consensus 310 ~~l~~~~~~i~~~~~~p~~~~~~~--~~N--~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~----l~~~~~~~~--- 378 (456) .+++.. +-..||++....|.. ..| +||++. ....-........+.|..+++.+++ |+....+.. 
T Consensus 460 ~~~~~~---~e~~TGv~~~~~G~~~~~~~~tat~v~~--l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rvi 534 (763) T protein:vir:95 460 TLQNQE---AESLTGVKAFAGGVTGESYGDVAAGIRG--VLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVV 534 (763) T ss_pred HHHHHH---HHHhhCcchhhcCcCcccccchhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEE Confidence 444444 445566666554422 112 233332 2222223333344555555555544 443322211 Q ss_pred ----------------cccceeEEecCCCCcCH-HHHHHHHHHHHh-cC-CCcH----HHHH---HhCC---CC------ Q lcl|NC_021301. 379 ----------------VEDTVDVSFESPDRVTL-GEKYAAASLAKA-AG-ESWA----SIRR---NILN---YN------ 423 (456) Q Consensus 379 ----------------~~~~i~v~f~~~~~~~~-~e~ad~~~kl~~-~g-~~s~----~t~~---~~~~---~~------ 423 (456) ..+++.+.-. +.+. .+.+..+..+.+ .| .+.. .... +... +. T Consensus 535 RI~g~e~v~v~~~~~~~~~DV~V~~~---~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~ 611 (763) T protein:vir:95 535 RITNEEFVTIKREDLKGNFDLEVDIS---TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTW 611 (763) T ss_pred EEeCCccccccHHHhcCCcceEEecc---cchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhc Confidence 0112222221 1122 122222232222 11 1111 1111 1111 10 Q ss_pred ---hhHH----HHHHHHHHHHHHHH------Hh--hhhhhhcccccCC Q lcl|NC_021301. 424 ---ADQI----KQDDLDRAREQITL------FA--GNSVQRPQEDGSR 456 (456) Q Consensus 424 ---~~~~----~~~e~~~~~ee~~~------~~--~~~~~~~~~d~~~ 456 (456) +++. ++.+..+.+.++.. +. 
+......+.+-.+ T Consensus 612 q~~~d~~~q~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq~e~~~ 659 (763) T protein:vir:95 612 QPQPDPVQEQLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAERDNKN 659 (763) T ss_pred CCCccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0111 11111111100000 00 0000000000000 No 176 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=98.70 E-value=1e-07 Score=58.90 Aligned_cols=437 Identities=9% Similarity=0.015 Sum_probs=203.9 Q ss_pred CC------------CCCHHHHHHHHHHHHHH-------HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHH Q lcl|NC_021301. 1 MT------------ASTPAEWLPVLTKRIDD-------GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGL 61 (456) Q Consensus 1 ~~------------~~t~~~~~~~l~~~~~~-------~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~ 61 (456) |+ -+++.+|+..++.+|.+ +.+.++.++.|-.... .+..+.. .+ .-.+++.+|-.- T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~~~-tr~t~~~---~~-~w~~s~t~~k~~ 75 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDATD-TRKTSNS---KL-PFKNSTTINKLA 75 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhhhc-ccccccC---CC-CcccccchHHHH Confidence 43 34456776666666532 2234567778744432 2211111 11 112357778778 Q ss_pred HHHHHHHhhhccCC------eecCC--CC-cc-cHHHHH----HHHHHhcChhHHHHHHHHHHhhCCeEEEEEee----- Q lcl|NC_021301. 62 MVRDSVADRIIPNG------ITVGG--SA-DS-DLALRA----RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWR----- 122 (456) Q Consensus 62 ~iVd~~a~~l~~~~------~~~~~--~~-d~-~~~~~l----~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~----- 122 (456) .+++.+..++++-- |.+.+ +. +. +....+ ++-+.+.+|...+..++.+.+.+|.||..+.. 
T Consensus 76 ~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~ 155 (599) T protein:vir:31 76 HLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMT 155 (599) T ss_pred HHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEcce Confidence 88888888876532 22221 11 11 111112 23344568888899999999999999887641 Q ss_pred -CCCC-------ceEEEEEccceeEEEEeCCCCceEEEEEEEEEec--------CCc---------------eE---E-- Q lcl|NC_021301. 123 -RDDG-------TATITADSPETMVVSVDPLQPWRIRSAMRWWRDL--------DAE---------------SD---F-- 166 (456) Q Consensus 123 -d~dg-------~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~--------d~~---------------~~---~-- 166 (456) -+|| .|++..++|..+|+--+-.+.......+|.+... ++. .. . T Consensus 156 ~~~d~~v~~~~~~P~~ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~ 235 (599) T protein:vir:31 156 VTAENQVIKNYSGTVTERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREAL 235 (599) T ss_pred eecccccccccccceEEeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccc Confidence 1232 3789999999977433222222222223322200 000 00 0 Q ss_pred EEEE-----cC----C---eEEEEEEe-------eee-ccccc-------ceeeccCCCceeecccccccCce-eEEEE- Q lcl|NC_021301. 167 AIVW-----SG----D---GWQKFARP-------CFV-QSSSR-------RRLVTRISDSWVPVGDAVVTGSP-PPVVV- 217 (456) Q Consensus 167 ~~~~-----~~----~---~~~~~~~~-------~~~-~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~-~pvv~- 217 (456) ...| .+ + .++.|... .|+ ..+.. ...+..+.+...-....+...+. |.++. T Consensus 236 ~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~~ 315 (599) T protein:vir:31 236 ADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIAV 315 (599) T ss_pred cchhhhhhhccccccccccchhhhcccchhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEEE Confidence 0000 00 0 01111100 010 00000 01111111111111112223332 33221 Q ss_pred ----ccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCC Q lcl|NC_021301. 
218 ----YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPP 293 (456) Q Consensus 218 ----~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (456) ..+.+|.|.+..+..+++.+|.+--.+.+..+.+.+|+....|.-.+ + .+.-.++++|.+.. T Consensus 316 ~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~~-----e---------D~~~~P~~v~~~~d 381 (599) T protein:vir:31 316 YEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVRE-----K---------GMRGGPNHVFEVEE 381 (599) T ss_pred eeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhcccccccccccc-----c---------CccCCCCcceeecC Confidence 13568999999999999999988777777777777776665543111 0 11223677776666 Q ss_pred CceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_021301. 294 GVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA-ILVK 370 (456) Q Consensus 294 d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~-~~~l 370 (456) .+.+..+.+ +++..-...+......+-..+|+|..+-|..+ ++..+..++............+.+.|...+-+ +++. T Consensus 382 ~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~lepll~~ 461 (599) T protein:vir:31 382 TGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELLTPVLND 461 (599) T ss_pred CCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHH Confidence 665554433 22222222334444445567899999988543 34567777888878777777888888776543 5554 Q ss_pred HHHhc----CCCc-------------ccce-------eEEecCCCCcCHHHHHHHHHHHHhc-------CCC---cHHH- Q lcl|NC_021301. 371 ALQIE----GESV-------------EDTV-------DVSFESPDRVTLGEKYAAASLAKAA-------GES---WASI- 415 (456) Q Consensus 371 ~~~~~----~~~~-------------~~~i-------~v~f~~~~~~~~~e~ad~~~kl~~~-------g~~---s~~t- 415 (456) ++... ...+ ..+| .+.+.+--..-.++..+..++|.+. ++. ++.. 
T Consensus 462 l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~~~~k~l 541 (599) T protein:vir:31 462 YLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPHMSRTKL 541 (599) T ss_pred HHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchhhHHHHH Confidence 33221 1110 0001 1122222223335566666655432 122 2211 Q ss_pred --HHH----hCCCC--hhHHH----H-----HHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 416 --RRN----ILNYN--ADQIK----Q-----DDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 416 --~~~----~~~~~--~~~~~----~-----~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +++ +..|. +..++ + ++.+-++++-.++.+.-...|..|--+ T Consensus 542 ~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 542 FNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 (599) T ss_pred HHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCcccCC Confidence 111 11221 11111 1 111222222233333333333333333 No 177 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=98.70 E-value=1e-07 Score=58.90 Aligned_cols=400 Identities=8% Similarity=-0.008 Sum_probs=165.0 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCC-Cc Q lcl|NC_021301. 6 PAEWLPVLTKRIDDGMSR-VRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS-AD 83 (456) Q Consensus 6 ~~~~~~~l~~~~~~~~~r-~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~-~d 83 (456) -..+++++.++....... ....-+ +-|.. +.-.+......... .-....+...+|+.+++-+.+-|+.+... 
.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~~-~~~~~~~~~~~~~~--~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~ 76 (460) T protein:vir:10 1 MANRIIRALRELTGLDNKFNDAFIK-YIGQT-FTKYDNNGKTYLEQ--GYNINPDVYSCISQMAAKTVAVPYTIKVVKDT 76 (460) T ss_pred CchhHHHHHhhhhccCCCchHHHHH-hhccc-cCCCccchhhhhHH--HHhcchHHHHHHHHHHHhhhhCceEEEeccCC Confidence 122444444333221111 111111 11211 10011111111111 11234566678888888888777765321 11 Q ss_pred ccH-------------------------------HHHHHHHHHh-c---ChhHHHHHHHHHHhhCCeEEEEEeeCCC--- Q lcl|NC_021301. 84 SDL-------------------------------ALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDD--- 125 (456) Q Consensus 84 ~~~-------------------------------~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~d~d--- 125 (456) ... ......++.+ | ....+...+....+.+|.||+++.++.. T Consensus 77 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~ 156 (460) T protein:vir:10 77 KAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGIN 156 (460) T ss_pred ccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCcc Confidence 100 0011112221 2 2235556678899999999999887543 Q ss_pred -CceE-EEEEccceeEEEEeCCCCceEEE-EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceee Q lcl|NC_021301. 126 -GTAT-ITADSPETMVVSVDPLQPWRIRS-AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVP 202 (456) Q Consensus 126 -g~~~-i~~~~p~~~~~~~d~~~~~~~~~-~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (456) |.++ +..++|..+.+..+......... .+..+....+... ..+.++.+.+++.. T Consensus 157 ~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~--~~~~~~evih~r~~--------------------- 213 (460) T protein:vir:10 157 AGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQF--IEFNEDEVIHTKYA--------------------- 213 (460) T ss_pred CceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCcee--EEecccceEEEecC--------------------- Confidence 5553 78888998887765443221111 1111111111111 12233333332110 Q ss_pred cccccccCceeEEEEc-cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhc--hhhhhhcCCCcccccccccchh-hhh Q lcl|NC_021301. 
203 VGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAF--RQRALKSAGHGLPKVDENGNAI-DYA 278 (456) Q Consensus 203 ~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~--~~~~i~g~~~~~~~~~~~~~~~-~~~ 278 (456) .++.-+. ..-.|.|.++.+...+.....+.. ....++.. +...+...+. ...++.-..+ ... T Consensus 214 ---------~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~---~~~~~f~ng~~~~~i~~~~~--~l~~e~~~~~~~~~ 279 (460) T protein:vir:10 214 ---------NPNFDLQGSHLYGMSPIRAILRNINSQNSTID---NNVKTMQNGGVFGFIHGGST--GLTQPQADSLKQRL 279 (460) T ss_pred ---------CCCcccccCccccccHHHHHHHHHHHHHHHHH---HHHHHHhcCCCcceeeecCC--CCCHHHHHHHHHHH Confidence 0000000 012466666544443333222211 12222222 1122222111 1111111111 111 Q ss_pred h-hhh--hhccceeccCCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccc---cCc-HHH--HHHHHHHH Q lcl|NC_021301. 279 S-IFE--AAPGALWELPPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQ-SAE--GAHNIEKG 348 (456) Q Consensus 279 ~-~~~--~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~N~-Sg~--Al~~~~~~ 348 (456) . .+. ...+.+..++.+.++.++...+. ..+++..+....+|+.+-|+|+..+|... +|- +.+ .+.+.... T Consensus 280 ~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~~ 359 (460) T protein:vir:10 280 TEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTDN 359 (460) T ss_pred HHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHHH Confidence 1 111 12345667788888888764332 23778888899999999999999997532 111 111 12222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhH-- Q lcl|NC_021301. 349 FLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQ-- 426 (456) Q Consensus 349 l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~-- 426 (456) |.-.+...+..|... ++.-.+....+. +.|.-.......+...+...+..+|+++..-+++.+|+.|-. 
T Consensus 360 l~P~~~~ie~~ln~k-------l~~~~~~~~~~~--i~~d~~~l~~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~pi~~~ 430 (460) T protein:vir:10 360 IQPDLVILKQAFDKK-------FIKRFKGYENAV--IEWDISELPEMQTDMVAMASWLNTIPVTPNEIRIAMKYETLNQD 430 (460) T ss_pred HHHHHHHHHHHHHHh-------hcCcccccCCce--EEeecchhhhHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Confidence 222222222211111 111011112233 344332222233334445557788999999999999887531 Q ss_pred HHHH-HHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 427 IKQD-DLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 427 ~~~~-e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) -... -...--..++... .......++.+| T Consensus 431 ~gD~~~~~~n~~~~~~~~-~~~~~~~~nq~~ 460 (460) T protein:vir:10 431 GMDIVFMPSNKVRIDDVS-NNLIDSAFNQNQ 460 (460) T ss_pred CCCeeeecccccchhhcc-cccCCCcccCCC Confidence 1110 0000000011111 112233344444 No 178 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.67 E-value=1.1e-07 Score=58.81 Aligned_cols=393 Identities=12% Similarity=-0.008 Sum_probs=156.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -|.+++..+...-.. .+.-......+||+. |. .+..+.... -...+...+|+..+..+.+-|+.+.. 
T Consensus 9 ~s~~~~~~i~~~~~~---s~~~~~~~~~~~~~p-------p~-~~~~la~l~--~~n~~v~scI~~ia~~IA~l~~~~~~ 75 (542) T protein:vir:41 9 RSLEKYKAIKREEVE---SQALGETRFEEYVEP-------KV-NPLVLLSLL--QVNPYHASACSIKANDIIRTGYILEG 75 (542) T ss_pred cccccchhhhhcccc---ccccccccCCccccC-------CC-CHHHHHHHH--hhcHHHHHHHHHHHHHHhhCceeeec Confidence 222223322211110 000000111122211 11 122222221 12356788999999999999998754 Q ss_pred CCcccHHHHHHHHHHhc--ChhHHHHHHHHHHhhCCeEEEEEeeCCCCceE-EEEEccceeEEEEeCCCCceEEEEEEEE Q lcl|NC_021301. 81 SADSDLALRARRIWRDN--RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRSAMRWW 157 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n--~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~~~~~~~~~ 157 (456) ... ..+..++-.. ....+...++.+.+.+|.||+.+-++.+|++. +..++|..+.+..|... .+.+ T Consensus 76 ~~~----~~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~-------~~~~ 144 (542) T protein:vir:41 76 DDE----GVVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSR-------YRQT 144 (542) T ss_pred ccc----hhhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCe-------eEee Confidence 322 2233333211 23456777888999999999999999888874 88889988877665331 1111 Q ss_pred EecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc------CCCCCCcHhHHH Q lcl|NC_021301. 158 RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ------NPDGMGEVEPHI 231 (456) Q Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~------n~~g~s~~~~v~ 231 (456) ....+ ..+...|.....+... .+... ....... |+|+. ...|.|.+.... 
T Consensus 145 ~~~~~-~~~~~~y~~~~~~~~~-----------------~g~~~---~~~~~~e---IiHir~~~~~~~~~Glspi~~~~ 200 (542) T protein:vir:41 145 WDGVN-ITHFKDYRYEGEINPE-----------------TGEDQ---DSVGANE---LVFIHIPSPVCSYYGVPRYVSAA 200 (542) T ss_pred ecCCc-ceeEEeeccccccccc-----------------ccccc---cccCccc---EEEecCCCCCCCcccccHHHHHH Confidence 11111 1111112111100000 00000 0000001 12221 135677666544 Q ss_pred HHHHHHHHHHHHHHHHHHHhhc---hhhhhh--cCCCcccccccc--cchhhh-hhhh-------hhhccceecc----- Q lcl|NC_021301. 232 DIINRINRAELQLLSTMAIQAF---RQRALK--SAGHGLPKVDEN--GNAIDY-ASIF-------EAAPGALWEL----- 291 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~---~~~~i~--g~~~~~~~~~~~--~~~~~~-~~~~-------~~~~~~~~~~----- 291 (456) .-+... ..-......++.+ |..+++ |...+....+.. ...... ...+ ....+....+ T Consensus 201 ~~i~~~---~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~~~~ 277 (542) T protein:vir:41 201 PAILAM---QKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSIPGG 277 (542) T ss_pred HHHHHH---HHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeeccCC Confidence 333222 2111112223322 322332 211111110100 000000 0111 1112333332 Q ss_pred -CCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhccccc---Cc-HHHHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 292 -PPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA---NQ-SAEGAH--NIEKGFLFKCEDRLSIAKIG 363 (456) Q Consensus 292 -~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~---N~-Sg~Al~--~~~~~l~~k~~~~~~~f~~~ 363 (456) +.+.++..+.... -..|++..+....+|+++-++|+..+|.... |. +.+... +....+.- ..+.+... 
T Consensus 278 ~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~~~f~~~tL~P----~~~~ie~~ 353 (542) T protein:vir:41 278 DTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTRRTYYESVVRP----QQNIISSI 353 (542) T ss_pred cccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHHHHHHHHHHHH----HHHHHHHH Confidence 2445666654321 2347787888899999999999999975422 21 122221 11111111 11222222 Q ss_pred HHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhC-CCChhHHHHH------------ Q lcl|NC_021301. 364 LEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNIL-NYNADQIKQD------------ 430 (456) Q Consensus 364 l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~-~~~~~~~~~~------------ 430 (456) |.+.+ + ......+.+.|+....... ..+..+.+++++|+++..-+++.+ |+.+-+..-+ T Consensus 354 ln~~L---~----~~~~~~~~~~f~~~~ll~~-d~~~~~~~~v~~GilT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~ 425 (542) T protein:vir:41 354 LTDFF---Q----VKFNPKTRFKFNDETLLES-DSVRNCALLVQSGVLTPAEARERLFGLDGGPDIFMVPSKGAAKSVKR 425 (542) T ss_pred HHhhc---c----cccCCceEEEecchhhcch-HHHHHHHHHHhCCCCCHHHHHHhhCCCCCCCcccccccccccccccc Confidence 22111 1 1112234566665432222 223345567889999887777643 5543211000 Q ss_pred -----HHHHHHHHHH--HHhh--------hhhhhcccccCC Q lcl|NC_021301. 431 -----DLDRAREQIT--LFAG--------NSVQRPQEDGSR 456 (456) Q Consensus 431 -----e~~~~~ee~~--~~~~--------~~~~~~~~d~~~ 456 (456) +.+..++-.. .-.. 
.....+...|-| T Consensus 426 ~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~ 466 (542) T protein:vir:41 426 QERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKI 466 (542) T ss_pred CCcCCCCCchhhhhhcccccCccccccccccccchhhcccc Confidence 0000000000 0000 000001111111 No 179 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=98.66 E-value=1.3e-07 Score=58.25 Aligned_cols=378 Identities=13% Similarity=0.033 Sum_probs=166.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMS-RVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~-r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) |--+-+ .........+ .-..+....-|.. . ..+..+.. ..-+.+.-...+|+.+++-+.+-|+.+- T Consensus 1 m~~~~~------~~~~~~~~~~~~~~~~~~~~g~~~-s-~~~~~v~~-----~~al~~~~v~~cv~~ia~~ia~lp~~~~ 67 (419) T protein:vir:80 1 MFFSRQ------LLSNLGQTQPGSGGWVSALLGSAR-S-EAGQVVTP-----ASALSLTVLQNCVTLLAESIAQLPVELY 67 (419) T ss_pred CCcccc------cccccCcCCCCcchhhHHhhcccc-c-ccCcccCh-----HHhhccHHHHHHHHHHHHhhccCceEEE Confidence 111000 0000000000 0011111111111 0 00000100 1112334456688888888877787642 Q ss_pred C-CCcc---cHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceE-EEEEccceeEEEEeCCCCce Q lcl|NC_021301. 80 G-SADS---DLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWR 149 (456) Q Consensus 80 ~-~~d~---~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~ 149 (456) . ..+. .....+..++.. |. -..+...+....+.+|.||+++-++.+|.+. +..++|..+.+..+.... 
T Consensus 68 ~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~-- 145 (419) T protein:vir:80 68 ERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVMKGPDLK-- 145 (419) T ss_pred EecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCce-- Confidence 1 1111 111224445442 32 2355677888899999999999999999864 888999888776654311 Q ss_pred EEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhH Q lcl|NC_021301. 150 IRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEP 229 (456) Q Consensus 150 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~ 229 (456) + +|.. .+.. .+..+ ...|.-.. ......|.|.++. T Consensus 146 ~-----~y~~-~~~~----~~~~~-------------------------------~i~h~~~~----~~d~~~G~s~i~~ 180 (419) T protein:vir:80 146 P-----MYRV-AGAD----PLPQR-------------------------------LVHHVRWM----SINGYTGLSPVLL 180 (419) T ss_pred E-----EEEE-cCcc----ccchh-------------------------------heEEecCC----CCCCcccccHHHH Confidence 0 1110 0100 00001 11111100 0112346666654 Q ss_pred HHHHHHHHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchhhh-hhhhh------hhccceeccCCCceeEe Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAIDY-ASIFE------AAPGALWELPPGVDIWE 299 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~~~-~~~~~------~~~~~~~~~~~d~~~~~ 299 (456) ....++....+. .-...++ +.|..+++-........++ ..... ...+. ...+.+..++.+.++.+ T Consensus 181 ~~~~i~~~~~~~---~~~~~~f~ng~~~~gil~~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~ 255 (419) T protein:vir:80 181 HANAIGHAQAIQ---QYAGKSFMNGTALSGVIERPTDAPALKDQ--ASVDRITDGWNAKFGGSGNAKKVALLQEGMKFKP 255 (419) T ss_pred HHHHHHHHHHHH---HHHHHHHhcCCCccEEEEecCCCCcccCH--HHHHHHHHHHHHHhcCccccCCceecCCCceEEe Confidence 444333222221 1122222 2344444321100001111 11111 11111 12355677888889888 Q ss_pred ecccchH-HHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021301. 
300 SQTNDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEG--AHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI 374 (456) Q Consensus 300 ~~~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~A--l~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 374 (456) +.-.+.+ .+++..+....+|+.+-|+|+..+|... ++ .+.+. +.+....|.-.+. .+++.+..-+-. T Consensus 256 l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~~f~~~~l~P~~~--------~ie~~l~~kll~ 327 (419) T protein:vir:80 256 LSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSLQFVIYTLLPWVK--------RHEQAKTRDLLL 327 (419) T ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHHHHHHHHHHHHH--------HHHHHHhhhccC Confidence 7643332 3777778888999999999999997432 12 12121 1222222222222 222222111101 Q ss_pred cCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 375 EGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 375 ~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) ......+.+++.+......|..+.+++..++++.|+++.-.+++.+|+.|-+--. +.--..+. .......+.+.| T Consensus 328 ~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gGD----~~~~~~n~-~~~~~~~~~~~~ 402 (419) T protein:vir:80 328 PSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGD----IYLSPMNM-VDASKPQPIPMG 402 (419) T ss_pred ccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcc----eeeecccc-ccccccccccCC Confidence 1111223344444455677899999999999999999999999999886532110 00000000 000111111111 Q ss_pred CC Q lcl|NC_021301. 455 SR 456 (456) Q Consensus 455 ~~ 456 (456) .. 
T Consensus 403 ~~ 404 (419) T protein:vir:80 403 KT 404 (419) T ss_pred CC Confidence 11 No 180 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.65 E-value=1.5e-07 Score=58.05 Aligned_cols=388 Identities=13% Similarity=0.029 Sum_probs=173.2 Q ss_pred CCCCCHHHHHHHHHHHHHHH------HHHHHHH-HHHhcccCcccccCcccchhhhhhhhhh--ccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG------MSRVRLL-ARYSNGDAPLPELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~------~~r~~~~-~~YY~g~~~i~~~~~~~~~~~~~~~~k~--~~n~~~~iVd~~a~~l 71 (456) +.+.|+. +..+.+.+... -.|+... +..-.|. +. ....+...+ ...+..-++++....+ T Consensus 18 ~~~~~~~--~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd--~~--------~~~~L~~~m~e~D~~i~s~l~~Rk~av 85 (528) T protein:vir:10 18 RKQQTAH--LAGLAKEFANHPAKGLTPAKLAHILIEAEQGH--LQ--------AQAELFMDMEERDAHLFAEMSKRKRAV 85 (528) T ss_pred cchhhhh--hhhhhhhhcccCCCCCCHHHHHHHHHhhhCCC--HH--------HHHHHHHHHHhhChHHHHHHHHHHHHH Confidence 1111110 11111111100 0011111 1101110 00 000111111 2455666777777777 Q ss_pred ccCCeecCCCCcc-----cHHHHHHHHHHh-cChhHHHHHHHHHHhhCCeE-EEEEeeCCCCceEE---EEEccceeEEE Q lcl|NC_021301. 72 IPNGITVGGSADS-----DLALRARRIWRD-NRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTATI---TADSPETMVVS 141 (456) Q Consensus 72 ~~~~~~~~~~~d~-----~~~~~l~~~~~~-n~~~~~~~~~~~~a~~~G~a-~~~v~~d~dg~~~i---~~~~p~~~~~~ 141 (456) .+.++++....+. ...+.+.+.+.+ ..|+..+.. ..+|.-||.+ ++++|...+|...+ ..++|+.+ . 
T Consensus 86 ~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~-~lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f--~ 162 (528) T protein:vir:10 86 LGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLD-CMDGVGHGYSAIELDWSLQGREWLPQAFDHRPQSWF--Q 162 (528) T ss_pred hcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHH-HHhhhhhcceeEEEEEeecCCceeEEEeeeecccce--e Confidence 7778776543221 222334455544 246655554 4457789984 67888765665443 33333321 1 Q ss_pred EeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEE---c Q lcl|NC_021301. 142 VDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVV---Y 218 (456) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~---~ 218 (456) |++..+..+ +. .++... +..-+..+++...+ . T Consensus 163 ~~~~~~~~l----~~---~~~~~~--------------------------------------g~~l~~~k~iv~~~~~~~ 197 (528) T protein:vir:10 163 LNPDDQDEL----RL---RDNSIA--------------------------------------GEVLQPFGWIMHKPRSRS 197 (528) T ss_pred eccCCCcEE----ec---cCCCCC--------------------------------------ceeecCCCeEEEeecCCC Confidence 222221110 00 000000 00001112222221 3 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee- Q lcl|NC_021301. 219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI- 297 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~- 297 (456) .|+.|.|.+..+....---+..+.+.....+.|+.|.++.+= ..+ ..+++...+... ....+.+....++.+..+ T Consensus 198 g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky-~~~--a~~~ek~~L~~a-l~~i~~~~~~iiP~~~~ie 273 (528) T protein:vir:10 198 GYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKY-PPG--TPDEEKVTLLRA-VTGLGHAAAGIIPESMSID 273 (528) T ss_pred CCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEec-CCC--CCHHHHHHHHHH-HHHHhhCcEEEecCCceeE Confidence 567899988876554444445577888888999998776642 111 112222222111 112222334445555544 Q ss_pred -EeecccchHHHHHHHHHHHHHHHhhcCC----ChhhhcccccCcHHHHHHH-HHHHHHHHHHHHHHHHHHHHH-HHHHH Q lcl|NC_021301. 
298 -WESQTNDFTPMLSAIKEHIRQLSSATKT----PLPMLMPDSANQSAEGAHN-IEKGFLFKCEDRLSIAKIGLE-AILVK 370 (456) Q Consensus 298 -~~~~~~~~~~~~~~l~~~~~~i~~~~~~----p~~~~~~~~~N~Sg~Al~~-~~~~l~~k~~~~~~~f~~~l~-~~~~l 370 (456) .+....+...|...++.+-.+|+.+.-- ....-|+.++ -|+.- ...-....++.-.+.+...+. ++++- T Consensus 274 ~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS----~Alg~vh~~v~~di~~aDa~~i~~tln~~li~~ 349 (528) T protein:vir:10 274 FQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGA----YALGQVHNEVRHDLLAADARQLAATLSRDLLWP 349 (528) T ss_pred EeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccch----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444455677888888887777765411 1001011111 22221 112223333344456666774 47777 Q ss_pred HHHhcCCCc---ccceeEEecCCCCcCHHHHHHHHHHHHhcCC-CcHHHHHHhCCCChhHH-HHHHHHHHHHHHHHHhh- Q lcl|NC_021301. 371 ALQIEGESV---EDTVDVSFESPDRVTLGEKYAAASLAKAAGE-SWASIRRNILNYNADQI-KQDDLDRAREQITLFAG- 444 (456) Q Consensus 371 ~~~~~~~~~---~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~-~s~~t~~~~~~~~~~~~-~~~e~~~~~ee~~~~~~- 444 (456) ++.+..... ..--.+.|....+.|..+.++.+.+|.+.|+ ++.+.+.+.+|+...+. +................ T Consensus 350 l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~ 429 (528) T protein:vir:10 350 LLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRR 429 (528) T ss_pred HHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCCCCCCCCcccccCCCcccccccCcc Confidence 776664322 2234678999999999999999999999996 89999999999853211 11000000000000000 Q ss_pred ---hhhhhcccccCC Q lcl|NC_021301. 
445 ---NSVQRPQEDGSR 456 (456) Q Consensus 445 ---~~~~~~~~d~~~ 456 (456) .....++..+.+ T Consensus 430 ~~~~~~~~~~~~~~~ 444 (528) T protein:vir:10 430 PGPRIAALAQVIGPR 444 (528) T ss_pred ccccccccccccccc Confidence 000000000000 No 181 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=98.65 E-value=1.5e-07 Score=58.02 Aligned_cols=386 Identities=10% Similarity=0.031 Sum_probs=178.2 Q ss_pred CC--------------CCCHHHHHHHHHHHHHHHHHHHHHHHHHhc-cc-----CcccccCcccchhhhhhhhhhccChH Q lcl|NC_021301. 1 MT--------------ASTPAEWLPVLTKRIDDGMSRVRLLARYSN-GD-----APLPELTRNTSAAWRSFQREARTNWG 60 (456) Q Consensus 1 ~~--------------~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~-g~-----~~i~~~~~~~~~~~~~~~~k~~~n~~ 60 (456) || ...+ .+.+.+. .+. + ..++-- |- ..|+.........+.. .....+. T Consensus 1 m~~~i~~~~g~p~~~~~~~~-~~~~~ia----~~~-~---~~~~~~~~~~~~~~~~iLr~~~~~~~~y~~---m~~D~~i 68 (491) T protein:vir:10 1 MSKGLWVSPTEFVTFGEPDK-SLSSQIA----TRA-R---SIDFFALGMYLPNPDPVLKALGKDIRVYRE---LRADAHV 68 (491) T ss_pred CCCceeCCCCCccCcccCCh-HHHHHHH----hhh-c---ccccccccCCccchHHHHHhcCCCHHHHHH---HhhChHH Confidence 32 2211 1222111 110 0 001100 10 0010000000011111 1235677 Q ss_pred HHHHHHHHhhhccCCeecCCC-CcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeE-EEEEeeCCCCceE---EEEEcc Q lcl|NC_021301. 61 LMVRDSVADRIIPNGITVGGS-ADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTAT---ITADSP 135 (456) Q Consensus 61 ~~iVd~~a~~l~~~~~~~~~~-~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~d~dg~~~---i~~~~p 135 (456) .-++++...-+.+.++++... .+....+.+.+.+.+-.|+..+.++. +|..||.+ ++++|...+|... 
+..++| T Consensus 69 ~s~l~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~ 147 (491) T protein:vir:10 69 GGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPA 147 (491) T ss_pred HHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeecc Confidence 778888888888889887543 33445567788887778888887775 78889995 6788876666544 444444 Q ss_pred ceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEE Q lcl|NC_021301. 136 ETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPV 215 (456) Q Consensus 136 ~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pv 215 (456) +.+ .||+..... +...++... +..-+..+++.. T Consensus 148 ~~f--~~d~~~~l~-------~~~~~~~~~--------------------------------------g~~l~~~k~i~~ 180 (491) T protein:vir:10 148 DWF--VYDPENQLR-------FRSKDHWMQ--------------------------------------GEELPARKFLVP 180 (491) T ss_pred cce--eeccCCceE-------EecCCCCCC--------------------------------------cceecCCCEEEE Confidence 432 233322110 000111000 000011222222 Q ss_pred EE---ccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccC Q lcl|NC_021301. 216 VV---YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELP 292 (456) Q Consensus 216 v~---~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (456) .+ ..|+.|.|.+..+....---+..+.+.....+.++.|.++.+- ..+ ..+++...+... 
....+.+....++ T Consensus 181 ~~~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky-~~~--a~~~ek~~l~~a-l~~~~~~a~~viP 256 (491) T protein:vir:10 181 RQEATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKH-PRS--ASDGEKNLLLDC-LEDMVQDAVAVVP 256 (491) T ss_pred EecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEec-CCC--CCHHHHHHHHHH-HHHHhcCcEEEec Confidence 21 3568899999887665555555677888889999988766542 111 112222222111 1122233445566 Q ss_pred CCceeE--eec--ccchHHHHHHHHHHHHHHHhhcCC-ChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 293 PGVDIW--ESQ--TNDFTPMLSAIKEHIRQLSSATKT-PLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAI 367 (456) Q Consensus 293 ~d~~~~--~~~--~~~~~~~~~~l~~~~~~i~~~~~~-p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~ 367 (456) .+.++. +.. ..+...|...++.+-.+|+.+.-= .... +..++++.|.. + ..-....++.-.+.....+.++ T Consensus 257 ~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt-~~~gs~a~~~v-h--~~v~~di~~~D~~~i~~tln~l 332 (491) T protein:vir:10 257 DDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTT-EATSTRASAQA-G--LEVTDDIRDGDKAVVSEAMNML 332 (491) T ss_pred CCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhccc-CcccchhHHHH-H--HHHHHHHHHHHHHHHHHHHHHH Confidence 665553 332 223456777777776666654210 0000 11122222221 1 1112222333345566667777 Q ss_pred HHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCC-CcHHHHHHhCCCChhHHHHHH--------------- Q lcl|NC_021301. 368 LVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGE-SWASIRRNILNYNADQIKQDD--------------- 431 (456) Q Consensus 368 ~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~-~s~~t~~~~~~~~~~~~~~~e--------------- 431 (456) ++-++.+.+.... ...+.|..+. ....+.++.+.+|.+.|+ ++.+.+++.+|+.+.+..+.. 
T Consensus 333 i~~l~~~N~~~~~-~p~f~~~~~~-e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~~~~~~~~~~~~~~~~~~~~~~ 410 (491) T protein:vir:10 333 IRWICDLNFDGAD-RPVFDMWEQE-QVDEIQAGRDQKLTQAGARFTPAYFKRAYNLQDGDLDERPLPVSAVDTVGAASFA 410 (491) T ss_pred HHHHHHhcCCCCC-cceEEecCcC-chhHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCcCccccccCCCCCccccccc Confidence 7766666654332 3455665443 333678999999999996 688888888887432111000 Q ss_pred ------------------HHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 432 ------------------LDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 432 ------------------~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .+..++..+.+.......-++-++= T Consensus 411 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~ 453 (491) T protein:vir:10 411 EFEAPDQDALDAALNTLSARDLNADAQALVAPLLKRIANGASA 453 (491) T ss_pred ccCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCH Confidence 0000000000000000000000000 No 182 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=98.64 E-value=1.6e-07 Score=57.85 Aligned_cols=391 Identities=10% Similarity=-0.065 Sum_probs=156.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-hcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC---CCC Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARY-SNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG---GSA 82 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~Y-Y~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~---~~~ 82 (456) +=++++|..+- ......... ..|. +..-............--..+.+...+|+.+++-+..-|+.+- .+. 
T Consensus 1 Mg~~~~~~~~~----~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia~~ia~lp~~~~~~~~dg 74 (423) T protein:vir:81 1 MGFLQKLGLAP----SVVATPEPIELVGP--IFESLKLSTKNMTVEQIWEDQPHLRTVTTFIARNVASLQLQAFERVEDG 74 (423) T ss_pred CchhHhhcccc----ccccCccccccccc--cccccccccchhhHHHHHHhhhHHHHHHHHHHHhHhhCceEEEEEecCC Confidence 23344432100 000000000 0000 0000000000001101011245677889999998887787641 111 Q ss_pred c--ccHHHHHHHHHHh-c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccc---eeEEEEe-CCCCceEEE Q lcl|NC_021301. 83 D--SDLALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPE---TMVVSVD-PLQPWRIRS 152 (456) Q Consensus 83 d--~~~~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~---~~~~~~d-~~~~~~~~~ 152 (456) + .-....+.+++.+ | ....+...+....+.+|.||+++..+..+...+..+.|. .+.+..+ +..+. +.. T Consensus 75 ~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~~v~~~~~~~~~~~-~~Y 153 (423) T protein:vir:81 75 GRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVSWVQRRAYKDGWGS-LDY 153 (423) T ss_pred ceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccceeeeeeccCCCcc-eEE Confidence 1 1112224445443 2 234556667888899999999998765433333333332 2222111 11111 100 Q ss_pred EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC-CCCCCcHhHHH Q lcl|NC_021301. 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN-PDGMGEVEPHI 231 (456) Q Consensus 153 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n-~~g~s~~~~v~ 231 (456) .+......+| .... +..+++.++. .. .... ..|.|-+..+. 
T Consensus 154 ~~~~~~~~~g--~~~~-~~~~evih~r-------------------------------~~----~~~~~~~G~spi~~~~ 195 (423) T protein:vir:81 154 IIIESGDNDG--RSVK-VPGERVIHRH-------------------------------GY----NPKTMKRGKSPVQSLR 195 (423) T ss_pred EEEEecCCCc--eEEE-EcccceEEec-------------------------------CC----CCCCccccccHHHHHH Confidence 0000001111 1111 1111221110 00 0001 13666555443 Q ss_pred HHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCc--ccccccccchh-hhhh-hh---hhhccceeccCCCceeEeec Q lcl|NC_021301. 232 DIINRINRAELQLLSTMAIQA---FRQRALKSAGHG--LPKVDENGNAI-DYAS-IF---EAAPGALWELPPGVDIWESQ 301 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~--~~~~~~~~~~~-~~~~-~~---~~~~~~~~~~~~d~~~~~~~ 301 (456) . .+...+.-......++. .|-.+++--... ....++....+ .... .+ ....|.+..++.+.++.+++ T Consensus 196 ~---~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~ 272 (423) T protein:vir:81 196 D---ILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFH 272 (423) T ss_pred H---HHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcceecCCCceEEecc Confidence 3 33322222222233332 344444321100 00011111111 1111 11 12235667788888988875 Q ss_pred ccchH-HHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcC-CC Q lcl|NC_021301. 302 TNDFT-PMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQ-IEG-ES 378 (456) Q Consensus 302 ~~~~~-~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~-~~~-~~ 378 (456) -++.+ .|++..+....+|+.+-|+|+..+|... +++...++.....+...+ . .-+-..+++.+...+- -.+ .. 
T Consensus 273 ~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~-~~t~sn~e~~~~~f~~~~--L-~P~~~~ie~~l~~~L~~~~~~~~ 348 (423) T protein:vir:81 273 TTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLD-NANYSNVREFRKALYGDN--L-GSWIRIIQDVMNLFLLPRVGIDN 348 (423) T ss_pred CChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCC-CCCcccHHHHHHHHHHHH--H-HHHHHHHHHHHhhhhcCcccccc Confidence 43322 3677777888899999999999987422 221111221111111111 0 1111222222222111 011 11 Q ss_pred cccceeEEecCCCCcCHHHHHHHHHHHH-hcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 379 VEDTVDVSFESPDRVTLGEKYAAASLAK-AAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 379 ~~~~i~v~f~~~~~~~~~e~ad~~~kl~-~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ..+-+++.+..-+..|..+.+++..++. ++|+++.--+++.+|+.|.+-- ++.--..+... .+.++..|.+ T Consensus 349 ~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R~~~gl~p~~gG----D~~~~p~n~~~---~~~~~~~~~~ 420 (423) T protein:vir:81 349 EKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVRAMDNLPSIDGG----DDLARPLNTEF---GDSEDAPGEE 420 (423) T ss_pred CccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHHHHhCCCCCCCc----ceeeccccccc---CccCCCCCCC Confidence 1222333334446678888898888876 4699998888999998764311 11111111111 1112222333 No 183 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=98.63 E-value=1.7e-07 Score=57.69 Aligned_cols=395 Identities=10% Similarity=0.003 Sum_probs=152.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |.. .|.=+-..-. .....-+..+.+.|.+. ++...-.++ ..+ .+.-|+ ++...+.-..+=++.+.. 
T Consensus 53 ~~~-~~~g~~~~~~---~~~~~~~~~l~~~~~~~-~~~~~~i~t-----~~~--~va~~~--~i~~~s~~~~~~~i~l~~ 118 (535) T protein:vir:10 53 ADG-NVAGQYSVAS---ISDVLSTKKLLKAYADN-DIVQAIIRT-----RTN--QVLTYS--NPSRYNRNGVGFKVELKD 118 (535) T ss_pred ccC-CcccccccCc---cccccCHHHHHHHhccC-hhHHHHHHH-----HHH--HHHHHH--HHHHHhcccCcceeEEEe Confidence 331 1110000000 00000111111111111 111000000 000 111122 333333333333333211 Q ss_pred -C-----CcccHHHHHHHHHHh--cC-------hhHHHHHHHHHHhhCC-eEEEEEeeCCCCceE-EEEEccceeEEEEe Q lcl|NC_021301. 81 -S-----ADSDLALRARRIWRD--NR-------MDSVCKQWVKYGLDFG-ESYLTCWRRDDGTAT-ITADSPETMVVSVD 143 (456) Q Consensus 81 -~-----~d~~~~~~l~~~~~~--n~-------~~~~~~~~~~~a~~~G-~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d 143 (456) + ........+.+++.. |. +..+...+..+++.+| .+|+++..+..|++. +..++|..+.+..| T Consensus 119 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d 198 (535) T protein:vir:10 119 ATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYS 198 (535) T ss_pred ccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEc Confidence 0 001111123333321 22 1234555666666665 689999999899875 89999999998887 Q ss_pred CCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCC Q lcl|NC_021301. 144 PLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDG 223 (456) Q Consensus 144 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g 223 (456) +....... .++...++... ..|..+.+.++... ++--...-..| T Consensus 199 ~~~~~~~~---~~~~~~~~~~~--~~~~~~eiih~~~~-------------------------------~~~~~~~~~~G 242 (535) T protein:vir:10 199 PRSKDQPR---KFEQFVSETKS--VKFSERNLTFINYW-------------------------------NLSDTDRRGYG 242 (535) T ss_pred CccccCce---EEEEEecCcee--EEECcccEEEEecc-------------------------------CCCCccccccc Confidence 65432211 12211222111 12233333332110 00000001236 Q ss_pred CCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCccccc-ccccchhhhhhhhhh------hccce-eccC Q lcl|NC_021301. 
224 MGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKV-DENGNAIDYASIFEA------APGAL-WELP 292 (456) Q Consensus 224 ~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~-~~~~~~~~~~~~~~~------~~~~~-~~~~ 292 (456) .|.++.+...+.... .-......++. .|..+|+--....... ++.-..+ ...|.. ..+.+ +..+ T Consensus 243 ~Spi~~~~~~i~~~~---aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~l--k~~~~~~~~G~~nag~~~vl~~ 317 (535) T protein:vir:10 243 YSPVEASIPLIRAIY---DTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGI--RRQWTSQGSGLGGAWKIPILAA 317 (535) T ss_pred ccHHHHHHHHHHHHH---HHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHH--HHHHHHHhcCcccccccccccC Confidence 666654444443333 22222333333 3433333211101111 1111111 111211 12333 3334 Q ss_pred CCceeEeecc--cchHHHHHHHHHHHHHHHhhcCCChhhhcccc----cCcHHHHHHHHHHHHHHHHHHH-HHHHHH--- Q lcl|NC_021301. 293 PGVDIWESQT--NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS----ANQSAEGAHNIEKGFLFKCEDR-LSIAKI--- 362 (456) Q Consensus 293 ~d~~~~~~~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~----~N~Sg~Al~~~~~~l~~k~~~~-~~~f~~--- 362 (456) .+.++..+.. .+++ |++..+..+..|+.+-|+|+..+|... +|.++.....-...++...... +..+.+ T Consensus 318 ~g~~~~~l~~~~~D~q-fle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~ 396 (535) T protein:vir:10 318 KDAKFVNMTQNSRDME-FDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLS 396 (535) T ss_pred CCceEEecCCChhHHH-HHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHH Confidence 5778877653 3333 888888899999999999999997432 1211111111111111111100 011111 Q ss_pred HHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHH Q lcl|NC_021301. 363 GLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRARE 437 (456) Q Consensus 363 ~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~e 437 (456) .|++.+...+ . ...+..+++.|......+.++.+++.. +..+|.++.--+++.+|+.|-+--. ...+.... 
T Consensus 397 ~ie~~ln~~L--l-~~~~~~~~f~f~~l~~~d~~~r~~~~~-~~~~g~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~ 472 (535) T protein:vir:10 397 FIEQVINDKI--M-RYVDTDYRFSFTLGDAQDKLQEEQVWK-LKLANGYFINEYRKDHGLKTVDGLDVPGFIGSAENFIN 472 (535) T ss_pred HHHHHHhhhc--c-cccCCeEEEEeccccccCHHHHHHHHH-HHHcCCCCHHHHHHHhCCCCCCCccccccccchhhccc Confidence 1222221111 1 111235677888888888888777665 4445677888888888876532100 00000000 Q ss_pred HHHHHhhhhh--------------------------hhcccccCC Q lcl|NC_021301. 438 QITLFAGNSV--------------------------QRPQEDGSR 456 (456) Q Consensus 438 e~~~~~~~~~--------------------------~~~~~d~~~ 456 (456) . ....+... .....|++| T Consensus 473 ~-~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~ 516 (535) T protein:vir:10 473 A-TGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKS 516 (535) T ss_pred c-cccccccCCCCCCCccccCCccccCcccccccccccCCCCCCC Confidence 0 00000000 001112222 No 184 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=98.63 E-value=1.7e-07 Score=57.65 Aligned_cols=368 Identities=13% Similarity=0.028 Sum_probs=156.9 Q ss_pred CCCCCHHHHHHHHH-HHHHHHHHHHHHHHHHh---cccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 1 MTASTPAEWLPVLT-KRIDDGMSRVRLLARYS---NGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~-~~~~~~~~r~~~~~~YY---~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) |-- +..+. .+.+.+......-..++ -|...... +. 
...-+.+.-...+|+.+++-+..-|+ T Consensus 1 Mg~------~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~----v~-----~~~~l~~~~v~~~i~~ia~~ia~~~~ 65 (383) T protein:vir:10 1 MGL------LTPKNFSKRNAKNMVYPSNPAFFTTTVGGMQLSY----VS-----ALSALQNTNVYSVINRIASDVSSAHF 65 (383) T ss_pred CCc------ccccccccccccccccccchhhhhhhccCccccc----cc-----hhHhhcchHHHHHHHHHHHhhccCce Confidence 222 11110 11000000000000111 11100000 00 01112234456678888888877788 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEE Q lcl|NC_021301. 77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRW 156 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~ 156 (456) ++..... ...+.+-............+..+++.+|.||+++..+. ..+...+|.++.+..+.. . . +.+ T Consensus 66 ~~~~~~~---~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~~---~~~~p~~~~~v~~~~~~~--~-~---~~~ 133 (383) T protein:vir:10 66 KTENTAT---LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNM--G-I---VYT 133 (383) T ss_pred eecccch---hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCcceEEEEEcCC--c-e---EEE Confidence 8753221 11111111111334566778888889999999886542 223333333333332211 1 0 111 Q ss_pred EEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHH Q lcl|NC_021301. 157 WRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINR 236 (456) Q Consensus 157 ~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa 236 (456) +....+ .. ...|..+.+.+++.. + ++ .++-..|.|.++.....++. 
T Consensus 134 ~~~~~~-~~-~~~~~~~evih~r~~-------------------------~-----~~--~~~~~~G~s~l~~~~~~i~~ 179 (383) T protein:vir:10 134 VLESND-RP-KMVLRQDQMLHFRLM-------------------------P-----DP--QYRYLIGRSPLESLQNALNL 179 (383) T ss_pred EEEcCC-ce-EEEEcccceEEeccC-------------------------C-----CC--cccccccccHHHHHHHHHHH Confidence 111111 11 111222222222100 0 00 00112467766655555544 Q ss_pred HHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-hhhhhhhh--hhccceeccCCCceeEeeccc--chHHHHHH Q lcl|NC_021301. 237 INRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDYASIFE--AAPGALWELPPGVDIWESQTN--DFTPMLSA 311 (456) Q Consensus 237 ~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~--~~~~~~~~ 311 (456) ...+..-......-.+.|..+++- .... ..++.... ........ ...+.++.++.+.++..+... +.+.+.+. T Consensus 180 ~~~~~~~~~~~f~ng~~~~~il~~-~~~~-~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~ 257 (383) T protein:vir:10 180 DDKASKSNMSAMENQINPAGKLTI-SNYL-SDGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADN 257 (383) T ss_pred HHHHHHHHHHHHhccCCcceEEEe-CCCC-CCHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHHHHHH Confidence 333332221212222233333332 1111 00111111 11111111 123456777888888777533 34444577 Q ss_pred HHHHHHHHHhhcCCChhhhcccc-cC---cHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEE Q lcl|NC_021301. 312 IKEHIRQLSSATKTPLPMLMPDS-AN---QSAEGAHNIE-KGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVS 386 (456) Q Consensus 312 l~~~~~~i~~~~~~p~~~~~~~~-~N---~Sg~Al~~~~-~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~ 386 (456) .+....+|+.+-|+|+..+|... ++ ++.+.++..+ ..|.-.+ ..+++.+... +. ...+++. 
T Consensus 258 ~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~~~~l~P~~--------~~ie~~l~~~--l~----~~~~~f~ 323 (383) T protein:vir:10 258 SAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLANLNSYV--------NPIVDELRLK--MN----APDLELD 323 (383) T ss_pred HHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHHHHHHHHHH--------HHHHHHHHHh--hC----CceEEee Confidence 77788999999999999998532 22 2222222111 1111111 1111111111 11 1245666 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +...+..|..+.++++.++.++|+++..-+++.+|+.+-+-.. ..+. . ....+.+-|.. T Consensus 324 ~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d--~~~~-------~--~~~~~~~gGd~ 382 (383) T protein:vir:10 324 IKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDN--LPEF-------K--PLTNETKGGDD 382 (383) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCc--cccc-------C--CCcccCCCCCC Confidence 6777788999999999999999999999999988876532111 0000 0 01112222222 No 185 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=98.62 E-value=1.5e-07 Score=58.05 Aligned_cols=376 Identities=7% Similarity=-0.028 Sum_probs=149.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |. +++++..+..... ..-+.+ .......... -+.......+|+.+++-+..-|+.+.. 
T Consensus 1 Mg------l~d~~~~~~~~~~------~~~~~~--------~~~~~~~~~~--~l~~~~v~~~i~~Ia~~ia~lp~~v~~ 58 (395) T protein:vir:96 1 MG------ILDFFSFKKSGTL------SDDDSG--------STTSEKLTNV--VLKEDALYKCVNYLARIISKSTFRIKA 58 (395) T ss_pred Cc------chhhhcCCCCccc------cccccc--------cchhhhcchh--hhhhHHHHHHHHHHHHhhccceeEEEe Confidence 22 2222211110000 000000 0000000010 112334556788888888777877543 Q ss_pred CC-cccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 81 SA-DSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 81 ~~-d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) .. +......+..+++. |. -......+...++.+|.||+++..+..+. .+......+ ...+... T Consensus 59 ~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~~------~~~~~~~~~-~~~~~~~---- 127 (395) T protein:vir:96 59 PEKLTENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGIY------VADAFTQDK-KLSGNKF---- 127 (395) T ss_pred CCccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCcee------cCCcccccc-cccccee---- Confidence 21 11122234445432 32 24556678888999999999887654321 111111100 0000000 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCc---HhHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGE---VEPHI 231 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~---~~~v~ 231 (456) .....++ ..+...+..+.+.++. ...++. ...+.+- ..+++ T Consensus 128 -~~v~~~~-~~~~~~~~~~dvih~k------------------------------~~~~~~----~~~~~~~~~~~~~~~ 171 (395) T protein:vir:96 128 -KVSRVQG-QTYEKIFTFDQVIYLK------------------------------NDNSDL----MLKVESLWEEYGELL 171 (395) T ss_pred -eeeeecc-ceeeeEeccCceEEec------------------------------ccCCcc----ccccccccchHHHHH Confidence 0000011 0001112222222221 000000 0112222 22333 Q ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCc-ccccccc-cchhh-hhhhhhhhccceeccCCCceeEeecccchH-- Q lcl|NC_021301. 
232 DIINRINRAELQLLSTMAIQAFRQRALKSAGHG-LPKVDEN-GNAID-YASIFEAAPGALWELPPGVDIWESQTNDFT-- 306 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~-~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~-- 306 (456) .+..+....-+........+... ....|.... .+..++. ..... .......+.+.++.++.+.++.+++..+.+ T Consensus 172 ~~~i~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q 250 (395) T protein:vir:96 172 GHVINNQKIANQIRFTMTPPKDK-VRERAQENSDGGRQPKSDKDFFKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSV 250 (395) T ss_pred HHHHHHHHHHHHHHHHhhhcccc-cccceeeccCchhhHHHHHHHHHHHHHHhhcCCcceEEccCCceeEecccChhhhh Confidence 33222211111111111111111 111111100 0000000 00000 111122334455667778888776543222 Q ss_pred -----HHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_021301. 307 -----PMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVED 381 (456) Q Consensus 307 -----~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~ 381 (456) .+.+.....+.+|+.+-|+|++.+++..+|.+...+.+....|.-.+...+..+...| + ....-.. T Consensus 251 ~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~~L-------l--~~~e~~~ 321 (395) T protein:vir:96 251 KSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYELLLEGPIESLITNIVDGLEYAI-------F--DKSETLE 321 (395) T ss_pred hhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCccHHHHHHHHHHHHHHHHHHHHHHHHHhhc-------C--ChhhhcC Confidence 2333344557889999999999998665554433333333333332222222221111 1 0000112 Q ss_pred ceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccC Q lcl|NC_021301. 
382 TVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGS 455 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~ 455 (456) .+.+.|......|..+.+++..++.++|+++.--+++.+|+.|-+......-.+........+.-....+++++ T Consensus 322 ~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~~~gD~~~~~~N~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 322 GSFIKVTGLKNYDLFSISSQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred ceeEeecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceechhccCCCCCCCCC Confidence 24466777788899999999999999999999889999988764221100000000000011111122233333 No 186 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=98.62 E-value=1.9e-07 Score=57.49 Aligned_cols=383 Identities=12% Similarity=0.012 Sum_probs=165.1 Q ss_pred HHHHHHHHHHH-----HHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC-CCc- Q lcl|NC_021301. 11 PVLTKRIDDGM-----SRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG-SAD- 83 (456) Q Consensus 11 ~~l~~~~~~~~-----~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~-~~d- 83 (456) -.+.++..... .....+....-+.... -+..+.. ..-+.+.-...+|+.+++-+..-|+.+-. ..+ T Consensus 1 ~~~~r~~~~~~~~~~~~~~~~~~~~~g~~~s~--~~~~vt~-----~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~ 73 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMSAGGWVSALLGSSRSD--SGQVVTP-----ASALALTVLQNCVTLLAESIAQLPIELYERSGED 73 (419) T ss_pred CcccccccccccccccCcchhhHHhhcCCCcc--CCcccch-----HHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 00000000000 0000111111111100 0001100 11122334566888888887777766421 111 Q ss_pred --ccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCceE-EEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 
84 --SDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 84 --~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) ......+..++.. | .-......++...+.+|.||+++-++.+|.+. +..++|..+.+..+... . + T Consensus 74 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~-~-~----- 146 (419) T protein:vir:14 74 RKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDL-K-P----- 146 (419) T ss_pred cccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc-e-E----- Confidence 1111224444442 3 22355666788999999999999999889864 88899988887665332 1 0 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liD 235 (456) +|.. .+.. .+..+ ...|.-.. ......|.|-++.+...++ T Consensus 147 ~y~~-~~~~----~~~~~-------------------------------~i~h~~~~----~~dg~~G~s~i~~~~~~i~ 186 (419) T protein:vir:14 147 VYRV-RGSD----PMPQR-------------------------------LVHHVRWM----SINGYTGLSPVLLHANAIG 186 (419) T ss_pred EEEE-ccCc----ccchh-------------------------------heeEecCc----CCCCcccccHHHHHHHHHH Confidence 1100 0000 00000 11111100 0111246666654444443 Q ss_pred HHHHHHHHHHHHHHHh---hchhhhhhcCCCccccccc-ccchh-hhh-hhhh--hhccceeccCCCceeEeecccchH- Q lcl|NC_021301. 236 RINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDE-NGNAI-DYA-SIFE--AAPGALWELPPGVDIWESQTNDFT- 306 (456) Q Consensus 236 a~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~-~~~~~-~~~-~~~~--~~~~~~~~~~~d~~~~~~~~~~~~- 306 (456) ....+. .....++ +.|..+++-........++ ....+ ... .... 
...+.+..++.+.++.++...+.+ T Consensus 187 ~~~~~~---~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~ 263 (419) T protein:vir:14 187 HAQAIQ---QYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQEGMTFRPLSMTNVDA 263 (419) T ss_pred HHHHHH---HHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCCCceEEEccCChhhH Confidence 322222 1222222 2344444321111011111 11101 000 1111 122456778888898887644333 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEE Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVS 386 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~ 386 (456) .+++..+....+|+..-|+|+..+|.... .+...++.....+...| -.-+-..+++.+..-+-.......+.+++. T Consensus 264 q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~-~t~s~~E~~~~~f~~~~---L~P~~~~ie~~l~~kll~~~~~~~~~i~fd 339 (419) T protein:vir:14 264 ALIDALRLSALDIARIYKIPAHMVNELER-ATFSNIEHQSLQFVIYT---LLPWVKRHEQAKTRDLLLPSERKQYFIEYN 339 (419) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHhcCCCC-CCcccHHHHHHHHHHHH---HHHHHHHHHHHHhhhccCccccCCeEEEEe Confidence 37777778889999999999999974321 11111222211111111 011112222222211111111122334444 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH---HHHHHHHHHHHHHhhhhhhhcc------cccCC Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ---DDLDRAREQITLFAGNSVQRPQ------EDGSR 456 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~---~e~~~~~ee~~~~~~~~~~~~~------~d~~~ 456 (456) +......|..+.+++..+++++|+++.--+++.+|+.|-+--. .-..-. 
..+...+.....++ ++..| T Consensus 340 ~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~--~~~~~~~~~~~~~~~~~~~~~e~~~ 416 (419) T protein:vir:14 340 LAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMV--DASKPQQLPVGKSEPTKAAIDEIGR 416 (419) T ss_pred chhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccc--cccccccccCCCCCCccccccchhc Confidence 4455667899999999999999999999999999887542110 000000 00000000000000 01111 No 187 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=98.62 E-value=1.9e-07 Score=57.49 Aligned_cols=377 Identities=10% Similarity=0.010 Sum_probs=158.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC-ccc Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA-DSD 85 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~-d~~ 85 (456) +=+++++..+..+....-..+...+.+..... +..+ .. ..-+.+.-...+|+.+++-+..-|+.+.... +.. T Consensus 1 MGl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v----t~-~~al~~~~v~~~i~~Ia~~iA~lp~~v~~~~g~~~ 73 (394) T protein:vir:62 1 MGLRDRFSNYLFKKAEKRGYLDNVLGKSIRYS--GVYV----TD-SNILQSSDVYELLQDISNQMVLADIVVEDEFGNEI 73 (394) T ss_pred CchhhhhhhhccCCCCchhhhhhhhhcccccC--cccc----Ch-hhhhccHHHHHHHHHHHHhhcccceEEEcCCCccc Confidence 33445544333222211112222222221111 0001 00 1113345567788988888888888764321 111 Q ss_pred HHHHHHHHHHh-cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecC Q lcl|NC_021301. 86 LALRARRIWRD-NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLD 161 (456) Q Consensus 86 ~~~~l~~~~~~-n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d 161 (456) ....+..++.+ |. .......++..++.+|.||+++-.+.-+ -+..+.+..++.. ..++. .. 
T Consensus 74 ~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-------~~~~~~~~~~~~~-------~~~~~-~~ 138 (394) T protein:vir:62 74 KDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-------LASNVFTELDDNL-------VEHFN-IG 138 (394) T ss_pred chhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-------ccccceEEECCce-------EEEEe-eC Confidence 11223344433 32 2355667888999999999987432211 1223333332210 00110 00 Q ss_pred CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHH Q lcl|NC_021301. 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAE 241 (456) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~ 241 (456) + ..|..+.+.+++ . ++ ...-.|.|-+..+...++....+. T Consensus 139 ~-----~~~~~~eiih~r-------------------------------~-~~---~d~~~G~s~~~~~~~~i~~~~~~~ 178 (394) T protein:vir:62 139 G-----HEIPPCMIRHVK-------------------------------N-IG---ADHLRGKGILDLGRDTLEGVMSAE 178 (394) T ss_pred C-----EEechhheEEec-------------------------------C-cC---CCCccccChHHHHHHHHHHHHHHH Confidence 0 001111111110 0 00 011135665554444333332222 Q ss_pred HHHHHHHHHhhchhhhhhcCCCcccccccccch-hhh-hhhhh--hhccceeccC--CCceeEeecccch-HHHHHHHHH Q lcl|NC_021301. 242 LQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDY-ASIFE--AAPGALWELP--PGVDIWESQTNDF-TPMLSAIKE 314 (456) Q Consensus 242 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~-~~~~~--~~~~~~~~~~--~d~~~~~~~~~~~-~~~~~~l~~ 314 (456) .-......-.+.|..+++- .......++.... ... ...+. ...+.+..++ .+.++.+++.... .-+++..+. 
T Consensus 179 ~~~~~~~~ng~~~~~il~~-~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~ 257 (394) T protein:vir:62 179 KTLTDKYKKGGLLTFLLNL-DAHINPQNGAQSKLINAILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNV 257 (394) T ss_pred HHHHHHHHccCCcceEEEe-CCCCCcCHHHHHHHHHHHHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHH Confidence 2111111222334333332 1111111110011 111 11111 1123333343 3445555543222 237777788 Q ss_pred HHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCc Q lcl|NC_021301. 315 HIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRV 393 (456) Q Consensus 315 ~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~ 393 (456) ...+|+.+-|+|+..+|... +|.+.....+....|.- +-..+++.+..- +.+......+.+.|+..... T Consensus 258 ~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~~~~~~l~P--------~~~~ie~~l~~k--ll~~~~~~~~~~~fd~~~~~ 327 (394) T protein:vir:62 258 YKKDLGKFLGINVDTYTELIKEDIEKAMMYIHNKAVRP--------IMKNFEDHLSLL--FYAQNSGKRIKFKINILDFV 327 (394) T ss_pred HHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHHHHHHH--------HHHHHHHHHhhh--hcCccccCceEEEechhhhc Confidence 88999999999999997532 22221122222222222 222222222211 11222234577888877767 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHh-hhhhhhcccccCC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFA-GNSVQRPQEDGSR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~-~~~~~~~~~d~~~ 456 (456) +....++++.++.++|+++..-+++.+|+.|-+.+....-.......... ....+.+...|+. 
T Consensus 328 ~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n~~~~~~~~~~~~~~kgge~ 391 (394) T protein:vir:62 328 TYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISNDVTEIGKKEATDGSLGGGEE 391 (394) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeecccccccccccccccccCCCCCC Confidence 77788889999999999999999999998753211100000000000000 0111122222322 No 188 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=98.61 E-value=1.9e-07 Score=57.41 Aligned_cols=388 Identities=12% Similarity=0.023 Sum_probs=176.1 Q ss_pred CCCCCHHHHHHHHHHHHHHH------HHHHHHH-HHHhcccCcccccCcccchhhhhhhhhh--ccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG------MSRVRLL-ARYSNGDAPLPELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~------~~r~~~~-~~YY~g~~~i~~~~~~~~~~~~~~~~k~--~~n~~~~iVd~~a~~l 71 (456) |.+.|+. +-.+.+.+... -.|+... +..=.|.. . ..-.+...+ .-.+..-++.+....+ T Consensus 18 ~~~~~~~--~~~~~~~~~~~~~~gltp~~l~~il~~a~~gd~--~--------~~~~L~edm~e~D~~i~s~l~~Rk~av 85 (526) T protein:vir:79 18 REPQTSR--LAGLAKEFAQHPAKGLTPAKLARILVEAEQGNL--Q--------AQAELFMDMEERDAHLFAEMSKRKRAI 85 (526) T ss_pred chhhhhh--hhhhhhhcccCCCCCcCHHHHHHHHHHhhCCCH--H--------HHHHHHHHHHhhChHHHHHHHHHHHHH Confidence 1111111 11111111100 0111111 11111110 0 000111111 2456666777777777 Q ss_pred ccCCeecCCCC-----cccHHHHHHHHHHhc-ChhHHHHHHHHHHhhCCe-EEEEEeeCCCCceEE---EEEccceeEEE Q lcl|NC_021301. 72 IPNGITVGGSA-----DSDLALRARRIWRDN-RMDSVCKQWVKYGLDFGE-SYLTCWRRDDGTATI---TADSPETMVVS 141 (456) Q Consensus 72 ~~~~~~~~~~~-----d~~~~~~l~~~~~~n-~~~~~~~~~~~~a~~~G~-a~~~v~~d~dg~~~i---~~~~p~~~~~~ 141 (456) .+.++++.... +....+.+.+++.+- +|...+.++. +|.-||. +++++|...+|...+ ...+|+.+ . 
T Consensus 86 ~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-dA~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~F--~ 162 (526) T protein:vir:79 86 LGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWF--Q 162 (526) T ss_pred hCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHH-hhhhhcceeEEEEEeecCCceeEEEeeeecccce--E Confidence 77787764322 222333455666543 4766666554 4788998 467888766665443 33333322 2 Q ss_pred EeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEE---c Q lcl|NC_021301. 142 VDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVV---Y 218 (456) Q Consensus 142 ~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~---~ 218 (456) ||+..+..+ + +.....++ ..-+..+++...+ . T Consensus 163 ~~~~~~~~l----~-~~~~~~~g----------------------------------------~~l~~~k~iv~~~~~~~ 197 (526) T protein:vir:79 163 LNPEDQNEL----R-LRDNSPAG----------------------------------------EALQPFGWIIHRPRARS 197 (526) T ss_pred eccCCCcEE----E-ecCCCCCc----------------------------------------eeecCCceEEEeecCCc Confidence 333322111 0 00000000 0001122222222 2 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee- Q lcl|NC_021301. 219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI- 297 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~- 297 (456) .|+.|.|.+..+....--=+..+.+....++.|+.|.++.+- ..+. .+++...+... ....+.+....++.+..+ T Consensus 198 g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky-~~~a--~~~ek~~L~~a-v~~i~~da~~iiP~~~~ie 273 (526) T protein:vir:79 198 GYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKY-PPGT--ADEEKATLLRA-VTGLGHAAAGIIPETMAID 273 (526) T ss_pred CCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEec-CCCC--CHHHHHHHHHH-HHHHhcCcEEEecCCceeE Confidence 567899988875443222233577788888999988776652 1111 12222222111 112223344455665554 Q ss_pred -EeecccchHHHHHHHHHHHHHHHhhc-C-C--ChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_021301. 
298 -WESQTNDFTPMLSAIKEHIRQLSSAT-K-T--PLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AILVKA 371 (456) Q Consensus 298 -~~~~~~~~~~~~~~l~~~~~~i~~~~-~-~--p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~~l~ 371 (456) .+....+...|...++.+-.+|+.+. | + ....-|+.++++.|.. ...-....++.-.+.+...+. ++++.+ T Consensus 274 ~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~v---h~~v~~di~~aDa~~i~~tln~~Li~~l 350 (526) T protein:vir:79 274 FQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQV---HNEVRHDILASDARQLAATLSRDLLWPL 350 (526) T ss_pred EeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44334455678888888888887753 1 1 1111111122222211 111222333334456667774 577777 Q ss_pred HHhcCCCc---ccceeEEecCCCCcCHHHHHHHHHHHHhcCC-CcHHHHHHhCCCChhH-HHHHHHHHHHHHHHH---Hh Q lcl|NC_021301. 372 LQIEGESV---EDTVDVSFESPDRVTLGEKYAAASLAKAAGE-SWASIRRNILNYNADQ-IKQDDLDRAREQITL---FA 443 (456) Q Consensus 372 ~~~~~~~~---~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~-~s~~t~~~~~~~~~~~-~~~~e~~~~~ee~~~---~~ 443 (456) +.+..... ...-++.|....+.|.++.++.+.+|..+|+ ++.+.+.+.+|+...+ .+.. .......... .. T Consensus 351 ~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gip~~~~~e~~-l~~~~~~~~~~~~~~ 429 (526) T protein:vir:79 351 LVLNRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGIPQPAKNEPV-LRPAAQPAILSRQHG 429 (526) T ss_pred HHhCCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCCCCCCCchhh-ccccCCccccccccc Confidence 77765322 2234678888999999999999999999996 7889999999984221 1110 0000000000 00 Q ss_pred hhhhhhccccc--CC Q lcl|NC_021301. 444 GNSVQRPQEDG--SR 456 (456) Q Consensus 444 ~~~~~~~~~d~--~~ 456 (456) ......+...+ +. 
T Consensus 430 ~~~~~~~~~~~~~~~ 444 (526) T protein:vir:79 430 QRVAALATIVGPRYG 444 (526) T ss_pred cccccccccccccCc Confidence 00000000000 00 No 189 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=98.60 E-value=2.2e-07 Score=57.12 Aligned_cols=392 Identities=10% Similarity=0.023 Sum_probs=176.2 Q ss_pred CCCC------CHHH---HHHHHHHHHHHHHHHHHHHHHHhcc----cCcccccCcccchhhhhhhhhhccChHHHHHHHH Q lcl|NC_021301. 1 MTAS------TPAE---WLPVLTKRIDDGMSRVRLLARYSNG----DAPLPELTRNTSAAWRSFQREARTNWGLMVRDSV 67 (456) Q Consensus 1 ~~~~------t~~~---~~~~l~~~~~~~~~r~~~~~~YY~g----~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~ 67 (456) ||+- .|-. .-+.+..+.......++.. .+.| .-.++ .... ..+..........+..-++++. T Consensus 1 ~~~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~--~~~~~~p~~~~il--~~~~-~~~~~y~~m~~D~~i~s~l~~R 75 (491) T protein:vir:79 1 MSKGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFF--ALGMYLPNPDPVL--KALG-KDIRVYRELRADAHVGGCVRRR 75 (491) T ss_pred CCCeeeCCCCCcccccccchhHHHHHhhhccccccc--cccccCcchhHHH--hhcc-CCHHHHHHHhhChHHHHHHHHH Confidence 3321 1110 0011222222111111110 0111 11111 0000 0011111112356777788888 Q ss_pred HhhhccCCeecCCCC-cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeE-EEEEeeCCCCce---EEEEEccceeEEEE Q lcl|NC_021301. 68 ADRIIPNGITVGGSA-DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTA---TITADSPETMVVSV 142 (456) Q Consensus 68 a~~l~~~~~~~~~~~-d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~d~dg~~---~i~~~~p~~~~~~~ 142 (456) ..-+.+.++++..+. +....+.+.+.+..-+|.....++ .+|..||.+ ++++|...+|.. .+..++|+.+. 
| T Consensus 76 k~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~--~ 152 (491) T protein:vir:79 76 KAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFV--Y 152 (491) T ss_pred HHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeeEEeeeeeccccee--e Confidence 888888888875433 333456777887777787777766 468889985 667887666664 34555554332 3 Q ss_pred eCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEE---cc Q lcl|NC_021301. 143 DPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVV---YQ 219 (456) Q Consensus 143 d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~---~~ 219 (456) |+..+.. +...++... +..-+..+++...+ .. T Consensus 153 d~~~~l~-------l~~~~~~~~--------------------------------------g~~lp~~k~i~~~~~~~~g 187 (491) T protein:vir:79 153 DPENQLR-------FRSKEHWVQ--------------------------------------GEELPARKFLVPRQEATYL 187 (491) T ss_pred ccCCceE-------EeecCCCCC--------------------------------------ceeecCCCeEEEEecCCCC Confidence 3322110 000000000 00001122222222 35 Q ss_pred CCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhccceeccCCCcee-- Q lcl|NC_021301. 220 NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDI-- 297 (456) Q Consensus 220 n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~-- 297 (456) |+.|.|.+..+....---+..+.+.....+.++.|.++.+- ..+ ..+++...+... ....+.+....++.+.++ T Consensus 188 ~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky-~~~--a~~~ek~~l~~a-l~~~~~~a~~viP~~~~ie~ 263 (491) T protein:vir:79 188 NPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKH-PRS--ASDAETNLLLDR-LEDMVQDAVAVIPDDSSIEI 263 (491) T ss_pred CcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEec-CCC--CCHHHHHHHHHH-HHHHhcCeEEEecCCceeEE Confidence 67899998876544333344577778888999988766542 111 112222222111 112233344456666554 Q ss_pred Eeec--ccchHHHHHHHHHHHHHHHhhcCC-ChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_021301. 
298 WESQ--TNDFTPMLSAIKEHIRQLSSATKT-PLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQI 374 (456) Q Consensus 298 ~~~~--~~~~~~~~~~l~~~~~~i~~~~~~-p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 374 (456) .+.. ..+...|...++.+-.+|+.+.-= .... +..++++.|.. ...-....++.-.+.....+.++++-++.+ T Consensus 264 ~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt-~~~gs~a~~~v---h~~v~~~i~~~D~~~i~~tln~li~~l~~~ 339 (491) T protein:vir:79 264 KEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTT-EATSTRASAQA---GLEVTDDIRDGDKAIVVEAMNMLIRWICDL 339 (491) T ss_pred EeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhcc-CcccchhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3332 233456777777776776654310 0000 11122222221 111222333344455666777777777776 Q ss_pred cCCCcccceeEEecCCCCcCH-HHHHHHHHHHHhcCC-CcHHHHHHhCCCChhHHHHHHHH------------------- Q lcl|NC_021301. 375 EGESVEDTVDVSFESPDRVTL-GEKYAAASLAKAAGE-SWASIRRNILNYNADQIKQDDLD------------------- 433 (456) Q Consensus 375 ~~~~~~~~i~v~f~~~~~~~~-~e~ad~~~kl~~~g~-~s~~t~~~~~~~~~~~~~~~e~~------------------- 433 (456) .+... ....+.|.. +.+. ...++.+.+|.++|+ ++.+.+++.+|+.+.+..+.... T Consensus 340 N~~~~-~~p~f~~~e--~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 416 (491) T protein:vir:79 340 NFDGA-ARPVFDMWE--QEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNLQDGDLDERPLPVSAVDAVGAASFAEFEAPD 416 (491) T ss_pred cCCCC-CcceEeecC--cCchhHHHHHHHHHHHhCCCccCHHHHHHHhCCCCCCCCccccCcCcccccccccccccCCCC Confidence 65322 223444443 3344 457889999999996 68888888888753211110000 Q ss_pred -------------H-HHHHHHHHhhhhhhhcccccC------C Q lcl|NC_021301. 
434 -------------R-AREQITLFAGNSVQRPQEDGS------R 456 (456) Q Consensus 434 -------------~-~~ee~~~~~~~~~~~~~~d~~------~ 456 (456) + .++..+.+.......-++-++ + T Consensus 417 ~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~~e~~~~ 459 (491) T protein:vir:79 417 QDALDAALNALSARDLNADAQALVAPLLKRIANGASADELLGM 459 (491) T ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHH Confidence 0 000000000000000000000 0 No 190 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=98.58 E-value=2.4e-07 Score=56.83 Aligned_cols=369 Identities=14% Similarity=0.059 Sum_probs=154.0 Q ss_pred HHHhcccCcccccCc-------ccchhhh--hh-hhhhccChHHHHHHHHHhhhccCCeecCC-CCcccH-HHHHHHHHH Q lcl|NC_021301. 28 ARYSNGDAPLPELTR-------NTSAAWR--SF-QREARTNWGLMVRDSVADRIIPNGITVGG-SADSDL-ALRARRIWR 95 (456) Q Consensus 28 ~~YY~g~~~i~~~~~-------~~~~~~~--~~-~~k~~~n~~~~iVd~~a~~l~~~~~~~~~-~~d~~~-~~~l~~~~~ 95 (456) ++.|.+......... ...+... .. ..-+.+.=.-.+|+.+++-+..-|+.+-. ..+... ...+..++. T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~~V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~lL~ 80 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNSDVLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYLMN 80 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccHHHHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHHHh Confidence 333333321111000 0000000 00 00001111224778888877776776521 111111 112334443 Q ss_pred h--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCC-ce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEE Q lcl|NC_021301. 96 D--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDG-TA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAI 168 (456) Q Consensus 96 ~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg-~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~ 168 (456) . |. ...+...+...++.+|.||+++.++..| .+ .+..++|..+.+..++... + .+.+...++... . 
T Consensus 81 ~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~--~---~y~~~~~~~~~~--~ 153 (417) T protein:vir:38 81 TKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDN--I---IYRFTPYNSSMQ--K 153 (417) T ss_pred cccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCe--E---EEEEEEcCCcEE--E Confidence 2 32 2355667888899999999999887654 34 3677888888776543221 1 112222222211 1 Q ss_pred EEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 169 VWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTM 248 (456) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~ 248 (456) ++..+.+.++.. ++ .+.-.|.|.++.+...+.... +-..-.. T Consensus 154 ~~~~~dviH~r~--------------------------------~~---~d~~~G~s~l~~~~~~i~~~~---~~~~~~~ 195 (417) T protein:vir:38 154 VCGFEDVIHWKF--------------------------------FS---YDTIMGRSPLLSLGDEIGLQE---SGVSTLQ 195 (417) T ss_pred EecCcceEEecC--------------------------------CC---CCCccccCHHHHHHHHHHHHH---HHHHHHH Confidence 222222222210 00 011136665554433332222 1111122 Q ss_pred HHhh---chhhhhhcCCCcccccccccchh-hhhhh-hh-hhccceeccCCCceeEeecccch-HHHHHHHHHHHHHHHh Q lcl|NC_021301. 249 AIQA---FRQRALKSAGHGLPKVDENGNAI-DYASI-FE-AAPGALWELPPGVDIWESQTNDF-TPMLSAIKEHIRQLSS 321 (456) Q Consensus 249 ~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~~~~-~~-~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~ 321 (456) .++. .|-.+++ ..... .++.-..+ ..... .. ...+....++.+.++..+.-... ..|++..+..+.+|+. 
T Consensus 196 ~~f~ng~~p~~il~-~~~~l--~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~ 272 (417) T protein:vir:38 196 KFFKSGLKGSIIKA-KESRL--SAEARQKIREDFERAQAGADAGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAK 272 (417) T ss_pred HHHhccCCCcEEEE-eCCCC--CHHHHHHHHHHHHHHhcccccCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHH Confidence 2222 2333322 11111 11111111 11111 11 12344566778888887653222 2377777888999999 Q ss_pred hcCCChhhhcccccCcHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHH Q lcl|NC_021301. 322 ATKTPLPMLMPDSANQSAEGAHNIE--KGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKY 399 (456) Q Consensus 322 ~~~~p~~~~~~~~~N~Sg~Al~~~~--~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~a 399 (456) +-|+|+..+|....++|.+.+...+ ..|.-.+...+..+...| + .........+.|.... .+.+..+ T Consensus 273 ~fgVPp~~lg~~~~~s~~e~~~~~~~~~tl~P~~~~ie~~l~~~L-------l---~~~~~~~~~~~fd~~~-l~~~~~~ 341 (417) T protein:vir:38 273 ALRVPAYRLAQNSPNQSVKQLADDYIRNDLPFYFEPITSEFELKL-------L---DDAQRHQYCIGFDTKS-VNGLPIA 341 (417) T ss_pred HhCCCHHHhCCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHhhh-------c---ChhhcccceEEechhh-hhHHHHH Confidence 9999999998644444433222221 222222222211111111 1 1111123446664322 1222233 Q ss_pred HHHHHHHhcCCCcHHHHHHhCCCChhHH---HHH-------HHHHH-----HHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 400 AAASLAKAAGESWASIRRNILNYNADQI---KQD-------DLDRA-----REQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 400 d~~~kl~~~g~~s~~t~~~~~~~~~~~~---~~~-------e~~~~-----~ee~~~~~~~~~~~~~~d~~~ 456 (456) .+.+++.+|+++.--+++++|+.|-+. ++. ..+.. .+..+.-++....+.+.+++. 
T Consensus 342 -~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~~~~~~ 412 (417) T protein:vir:38 342 -DVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAAELKGGDTNAKGNQNGSG 412 (417) T ss_pred -HHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeecccccccccccccccccccccCCCCCCCCCCCcCCC Confidence 355677899999999999998865311 110 00100 011111111111111111111 No 191 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=98.56 E-value=1.3e-07 Score=58.32 Aligned_cols=430 Identities=12% Similarity=0.017 Sum_probs=169.5 Q ss_pred CCCCCHH---HHHHH-------HH--HHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTASTPA---EWLPV-------LT--KRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t~~---~~~~~-------l~--~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a 68 (456) ||..+-+ ..+.- .. ....+.. .-.++|.++.-|. |+=.+..|+.+.. ..++++.+|+..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~--p~~~~~~L~~~~e--~~~~~~~~i~~~~ 72 (651) T protein:vir:99 1 MTDTTGETQETKVHVEGLGGEADLAKSPNSTQI----PDHRIQSHNVGVN--PPYNPDRLAAFLE--LNETLATGIRKKS 72 (651) T ss_pred CCCccceeeeeEEEeeccccccccccccccccc----chhhhcccCCCCC--CCCCHHHHHHHHh--cChHHHHHHHHHh Confidence 6655511 00000 00 0011111 1224455554332 3334555665421 2689999999999 Q ss_pred hhhccCCeecCC-------CCcccHHHHHHHHHHhc---------------ChhHHHHHHHHHHhhCCeEEEEEeeCCCC Q lcl|NC_021301. 69 DRIIPNGITVGG-------SADSDLALRARRIWRDN---------------RMDSVCKQWVKYGLDFGESYLTCWRRDDG 126 (456) Q Consensus 69 ~~l~~~~~~~~~-------~~d~~~~~~l~~~~~~n---------------~~~~~~~~~~~~a~~~G~a~~~v~~d~dg 126 (456) ..+.|-||.+.. +.+.+..+.++++|... 
.+......+..+...+|.+|+-+..+..| T Consensus 73 ~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g 152 (651) T protein:vir:99 73 RYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEG 152 (651) T ss_pred hhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCcc Confidence 999998876421 12222233455555331 23345555666777888888877676666 Q ss_pred ce-EEEEEccceeEEEEeCCC-CceEEE---------------------------EEEEEEecCCceEEEEEEcCCeEEE Q lcl|NC_021301. 127 TA-TITADSPETMVVSVDPLQ-PWRIRS---------------------------AMRWWRDLDAESDFAIVWSGDGWQK 177 (456) Q Consensus 127 ~~-~i~~~~p~~~~~~~d~~~-~~~~~~---------------------------~~~~~~~~d~~~~~~~~~~~~~~~~ 177 (456) .+ .+..+++..+-..-+... ...... ++..+....+............+.. T Consensus 153 ~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~ 232 (651) T protein:vir:99 153 RPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTI 232 (651) T ss_pred chhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeE Confidence 53 233344433221111000 000000 0000000000000000000000000 Q ss_pred EEEeeeecccccce---eeccCCCceeecccccccCceeE--EEEcc--C----CCCCCcHhHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 178 FARPCFVQSSSRRR---LVTRISDSWVPVGDAVVTGSPPP--VVVYQ--N----PDGMGEVEPHIDIINRINRAELQLLS 246 (456) Q Consensus 178 ~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~p--vv~~~--n----~~g~s~~~~v~~liDa~~~~~s~~~~ 246 (456) .. ..+..... ......+.+..... .....+++ |+++. 
+ ..|.|.+..+...+ ....+-..- T Consensus 233 ~~----~~d~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i---~~a~~a~~~ 304 (651) T protein:vir:99 233 RY----REDEESEREPIFVDRETGDVTTGDA-NGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTI---SADEAAKDY 304 (651) T ss_pred Ee----ccCcceeeeeecccceeeeEEEcCC-CceeEecccceEEecCCCCCCCcccccHHHHHHHHH---HHHHHHHHH Confidence 00 00000000 00000000000000 00001111 33332 1 24666665444333 322222222 Q ss_pred HHHHhh---chhhhhhcCCCcccccccccchh-hhhhhhhhhccceecc-----------CCCceeEeecccc--hHHHH Q lcl|NC_021301. 247 TMAIQA---FRQRALKSAGHGLPKVDENGNAI-DYASIFEAAPGALWEL-----------PPGVDIWESQTND--FTPML 309 (456) Q Consensus 247 ~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-----------~~d~~~~~~~~~~--~~~~~ 309 (456) ...++. .|..+|+--+ + ...++....+ ........+.++.+.+ +.+.++..++..+ -..|+ T Consensus 305 ~~~~f~NG~~p~gil~~~~-~-~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfl 382 (651) T protein:vir:99 305 NRDFFDNDTIPRMVIKVTG-G-ELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFR 382 (651) T ss_pred HHHHHhccCCCceEEEecC-C-CCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHH Confidence 223333 2444443211 1 1111111111 1111112223333332 2356676665432 23478 Q ss_pred HHHHHHHHHHHhhcCCChhhhcccc-cC-cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCccccee Q lcl|NC_021301. 310 SAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVK-ALQIEGESVEDTVD 384 (456) Q Consensus 310 ~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l-~~~~~~~~~~~~i~ 384 (456) +..+....+|+++-++|+..+|... +| ++.+... +....|.- +-..+++.+.. ++.-......+.+. 
T Consensus 383 e~r~~~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f~~~tL~P--------~~~~ie~eln~kLl~~~e~~~~~~i~ 454 (651) T protein:vir:99 383 QFREKNEHEIAKVLEVPPVKIGVTDSANRSNSDQQDKDFALEVIQP--------EQHTFAEWLYQIIHQQALGVTDWTIE 454 (651) T ss_pred HHHHHHHHHHHHHhCCCHHHhccCCCCCcccHHHHHHHHHHHHHHH--------HHHHHHHHHHHhhcCccccccCceEE Confidence 8888889999999999999987432 22 2222211 11111111 11222222221 11111112234556 Q ss_pred EEecC--CCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHH--HHHHHHHHHH--HHHHHhhhhhh------hccc Q lcl|NC_021301. 385 VSFES--PDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQI--KQDDLDRARE--QITLFAGNSVQ------RPQE 452 (456) Q Consensus 385 v~f~~--~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~--~~~e~~~~~e--e~~~~~~~~~~------~~~~ 452 (456) +.|.. .+-.|....++.+.+++++|+++..-+++.+|+.+-.. ..+-+..... ..+...+...+ ..++ T Consensus 455 ~ef~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~ 534 (651) T protein:vir:99 455 YELRGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENK 534 (651) T ss_pred EEeccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCccccc Confidence 66654 45568888999999999999999999999998865311 1110100000 00000000000 0011 Q ss_pred ccCC Q lcl|NC_021301. 453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) .|.+ T Consensus 535 ~~~~ 538 (651) T protein:vir:99 535 IGER 538 (651) T ss_pred cccc Confidence 1111 No 192 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.56 E-value=2.9e-07 Score=56.44 Aligned_cols=367 Identities=10% Similarity=0.070 Sum_probs=147.1 Q ss_pred CCCCCH---------------------------------HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchh Q lcl|NC_021301. 
1 MTASTP---------------------------------AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAA 47 (456) Q Consensus 1 ~~~~t~---------------------------------~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~ 47 (456) |+...| ..++..++.........+.....+..+...+ T Consensus 54 ~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~---------- 123 (576) T protein:vir:96 54 QAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGF---------- 123 (576) T ss_pred chhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccc---------- Confidence 111111 0122222222222222222111111111100 Q ss_pred hhhhhhhhccChHHHHHHHHHhhhccCCeecCC---CCcccHH---HHHHHHHH-----h----cChhHHHHHHHHHHhh Q lcl|NC_021301. 48 WRSFQREARTNWGLMVRDSVADRIIPNGITVGG---SADSDLA---LRARRIWR-----D----NRMDSVCKQWVKYGLD 112 (456) Q Consensus 48 ~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~---~~d~~~~---~~l~~~~~-----~----n~~~~~~~~~~~~a~~ 112 (456) ++.... ....+.. ..+..++. . ..+..+...+..+.+. T Consensus 124 ---------------------------~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll 176 (576) T protein:vir:96 124 ---------------------------EVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYT 176 (576) T ss_pred ---------------------------eeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHh Confidence 111100 0000000 00111110 0 1234566778888999 Q ss_pred CCeEEEEEeeCCC--Cce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeeccccc Q lcl|NC_021301. 113 FGESYLTCWRRDD--GTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSR 189 (456) Q Consensus 113 ~G~a~~~v~~d~d--g~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 189 (456) +|.+|+++..+.+ |++ .+..++|..+.++.+.... .+....+++...++... ..+..+.+++... 
T Consensus 177 ~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~-~~~~~~~~~~~~~~~~~--~~~~~~dii~~~~--------- 244 (576) T protein:vir:96 177 YDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGK-IIKGGKRFVQVINKKVV--ASFTSREMAMGIR--------- 244 (576) T ss_pred cCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCc-eeeeeeEEEEecCCceE--EEecccceEEEee--------- Confidence 9999998765554 444 5888999999888765432 22222222222222111 1122222221110 Q ss_pred ceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHh---hchhhhhhcCCCccc Q lcl|NC_021301. 190 RRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQ---AFRQRALKSAGHGLP 266 (456) Q Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~ 266 (456) ...+ -......|.|-++.....+.....+.. ....++ +.|-.+|+--+ ... T Consensus 245 ---------------------~~~~-d~~~~~~G~Spi~~a~~~i~~~~~~~~---~~~~~f~Ng~~p~giL~~~~-~~~ 298 (576) T protein:vir:96 245 ---------------------NPRT-ELSSSGYGLSEVEIAMKQFIAYNNTET---FNDRFFSHGGTTRGILQIKS-EQQ 298 (576) T ss_pred ---------------------cCCC-CcccCcccccHHHHHHHHHHHHHHHHH---HHHHHHhccCCCceEEEeCC-CCC Confidence 0000 000012466666554444433332222 222333 23443433111 111 Q ss_pred ccccccchhhhhhhhhh------hccce-eccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccC-- Q lcl|NC_021301. 267 KVDENGNAIDYASIFEA------APGAL-WELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN-- 336 (456) Q Consensus 267 ~~~~~~~~~~~~~~~~~------~~~~~-~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N-- 336 (456) ..++.-..+ ...|.. ..+.+ ..++.+.++..+.... -..|++..+..+..|+.+-++|+..+|..... 
T Consensus 299 ls~e~~~~l--r~~~~~~~~G~~nag~~p~vl~~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~ 376 (576) T protein:vir:96 299 QSQRALENF--KREWKSSFSGINGSWQVPVVMADDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGA 376 (576) T ss_pred CCHHHHHHH--HHHHHHHhccccccccceeecCCCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccc Confidence 111111111 111211 12332 4567788888875432 23478888999999999999999999742110 Q ss_pred -----------cHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHH Q lcl|NC_021301. 337 -----------QSAEGAH--NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAAS 403 (456) Q Consensus 337 -----------~Sg~Al~--~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~ 403 (456) ++.+... +....|.-.+...+..|...| +. .....+.+.|.+..+.+.++..+.. T Consensus 377 ~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L-------l~----~~~~~~~~~f~r~d~~~~~e~~~~~- 444 (576) T protein:vir:96 377 TGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINTHI-------IS----EYSDKYVFQFVGGDTKSELDKIKIL- 444 (576) T ss_pred cccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhh-------ch----hccCceEEEeccCCHHHHHHHHHHH- Confidence 1111111 111122222212111111111 10 1123456678777776666665443 Q ss_pred HHHhcCCCcHHHHHHhCCCChhHHH-------------------HHHHHHHHHHHHHH----hhhhhhhcccccCC Q lcl|NC_021301. 404 LAKAAGESWASIRRNILNYNADQIK-------------------QDDLDRAREQITLF----AGNSVQRPQEDGSR 456 (456) Q Consensus 404 kl~~~g~~s~~t~~~~~~~~~~~~~-------------------~~e~~~~~ee~~~~----~~~~~~~~~~d~~~ 456 (456) ++..+|+++.--+++.+|+.|-+-- ..+.+..++..+.. .....+.+.+.++. 
T Consensus 445 ~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~ 520 (576) T protein:vir:96 445 QEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTE 520 (576) T ss_pred HHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCC Confidence 3445699998888888887543210 00011111111111 11111111111111 No 193 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=98.55 E-value=2.5e-07 Score=56.78 Aligned_cols=365 Identities=10% Similarity=-0.021 Sum_probs=150.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccH Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~ 86 (456) +-++++|-++... ... .+.+. ....+. .. .-+.......+|+.+++-+..-|+.+-.... .. T Consensus 1 Mg~f~~lf~~~~~----~~~---~~~~~-----~~~~v~---~~--~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~ 62 (395) T protein:vir:10 1 MSILEKIFKTRKD----ITY---MLDLD-----MIEDLS---QQ--AYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQ 62 (395) T ss_pred CchhhhhhccCcc----ccc---cccch-----hccccc---hh--hhhhhHHHHHHHHHHHHhhccceeEeccCCc-cc Confidence 2223332221100 000 00000 000000 00 0123455677888888888777876543221 12 Q ss_pred HHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEcccee--EEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 87 ALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETM--VVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 87 ~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~--~~~~d~~~~~~~~~~~~~~~~ 159 (456) ...+...+.. |. -......+....+..|.+|+++..+ .+ . ...++..+ ..+++.. ...+. 
T Consensus 63 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~~-~--~~~~~~~~~~~~~~~~~-----~~~~~---- 129 (395) T protein:vir:10 63 KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-KE-L--LIADSFYREEYALYDDI-----FKDVT---- 129 (395) T ss_pred cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC-CC-e--EecCCccceeEeecCcc-----eeEEE---- Confidence 2223344332 32 2344556777778888888766433 22 1 11222211 1111110 00000 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~ 239 (456) ..+.. ....+..++++++.. ++ + ..+..|.|-++.....++.... T Consensus 130 ~~~~~-~~~~~~~~evih~~~--------------------------~~----~----~~~~~G~spi~~~~~~~~~~~~ 174 (395) T protein:vir:10 130 VKDYT-YQRTFTMQEVIYLKY--------------------------NN----N----KVTHFVESLFEDYGKIFGRMIG 174 (395) T ss_pred EcCce-eeeeeccccEEEEcc--------------------------CC----C----CcccccchHHHHHHHHHHHHHH Confidence 00000 001111222211100 00 0 0112455555544333333221 Q ss_pred HHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhh----hhhhhccceeccCCCceeEeecccch------HHH Q lcl|NC_021301. 240 AELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YAS----IFEAAPGALWELPPGVDIWESQTNDF------TPM 308 (456) Q Consensus 240 ~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~d~~~~~~~~~~~------~~~ 308 (456) . ....+.+.-+++ .... ...++...... ... ....+...++.++.+.++.+++-.+. ..| T Consensus 175 ~-------~~~~~~~~gii~-~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~ 245 (395) T protein:vir:10 175 A-------QLKNYQIRGILK-SASS-AYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSEL 245 (395) T ss_pred H-------HHhcCCCceEEE-eCCC-CCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHH Confidence 1 111122222221 1111 11111111111 111 11122233445677778877643221 247 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEec Q lcl|NC_021301. 
309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFE 388 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~ 388 (456) ++..+....+|+.+-++|+..+++..+|.+.....+....|.-.+...+..|...| +.-.. ....+++.+. T Consensus 246 ~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL-------~~~~~--~~~~~~f~~~ 316 (395) T protein:vir:10 246 SELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFCLTPLLKKIQNELNAKL-------ITQSM--YLKDTRIEIV 316 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhh-------cChhh--hcccceecch Confidence 77778888999999999999998765554433333333333333333322222211 10000 0112344555 Q ss_pred CCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH---H--H--HHHHH---HHHH--HHHhhhhhhhcccccC Q lcl|NC_021301. 389 SPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK---Q--D--DLDRA---REQI--TLFAGNSVQRPQEDGS 455 (456) Q Consensus 389 ~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~---~--~--e~~~~---~ee~--~~~~~~~~~~~~~d~~ 455 (456) ..+-.|..+.++++.+++++|+++.--+++.+|+.|-+.. + + ....+ .+.. ...........+++|+ T Consensus 317 ~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 317 GVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 5667788999999999999999999889999988663211 0 0 00000 0000 0001111122333444 No 194 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=98.55 E-value=2.5e-07 Score=56.78 Aligned_cols=365 Identities=10% Similarity=-0.021 Sum_probs=150.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccH Q lcl|NC_021301. 
7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~ 86 (456) +-++++|-++... ... .+.+. ....+. .. .-+.......+|+.+++-+..-|+.+-.... .. T Consensus 1 Mg~f~~lf~~~~~----~~~---~~~~~-----~~~~v~---~~--~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~ 62 (395) T protein:vir:95 1 MSILEKIFKTRKD----ITY---MLDLD-----MIEDLS---QQ--AYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQ 62 (395) T ss_pred CchhhhhhccCcc----ccc---cccch-----hccccc---hh--hhhhhHHHHHHHHHHHHhhccceeEeccCCc-cc Confidence 2223332221100 000 00000 000000 00 0123455677888888888777876543221 12 Q ss_pred HHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEcccee--EEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 87 ALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETM--VVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 87 ~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~--~~~~d~~~~~~~~~~~~~~~~ 159 (456) ...+...+.. |. -......+....+..|.+|+++..+ .+ . ...++..+ ..+++.. ...+. T Consensus 63 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~~-~--~~~~~~~~~~~~~~~~~-----~~~~~---- 129 (395) T protein:vir:95 63 KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-KE-L--LIADSFYREEYALYDDI-----FKDVT---- 129 (395) T ss_pred cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC-CC-e--EecCCccceeEeecCcc-----eeEEE---- Confidence 2223344332 32 2344556777778888888766433 22 1 11222211 1111110 00000 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~ 239 (456) ..+.. ....+..++++++.. ++ + ..+..|.|-++.....++.... 
T Consensus 130 ~~~~~-~~~~~~~~evih~~~--------------------------~~----~----~~~~~G~spi~~~~~~~~~~~~ 174 (395) T protein:vir:95 130 VKDYT-YQRTFTMQEVIYLKY--------------------------NN----N----KVTHFVESLFEDYGKIFGRMIG 174 (395) T ss_pred EcCce-eeeeeccccEEEEcc--------------------------CC----C----CcccccchHHHHHHHHHHHHHH Confidence 00000 001111222211100 00 0 0112455555544333333221 Q ss_pred HHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhh----hhhhhccceeccCCCceeEeecccch------HHH Q lcl|NC_021301. 240 AELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YAS----IFEAAPGALWELPPGVDIWESQTNDF------TPM 308 (456) Q Consensus 240 ~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~d~~~~~~~~~~~------~~~ 308 (456) . ....+.+.-+++ .... ...++...... ... ....+...++.++.+.++.+++-.+. ..| T Consensus 175 ~-------~~~~~~~~gii~-~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~ 245 (395) T protein:vir:95 175 A-------QLKNYQIRGILK-SASS-AYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSEL 245 (395) T ss_pred H-------HHhcCCCceEEE-eCCC-CCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHH Confidence 1 111122222221 1111 11111111111 111 11122233445677778877643221 247 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEec Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFE 388 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~ 388 (456) ++..+....+|+.+-++|+..+++..+|.+.....+....|.-.+...+..|...| +.-.. ....+++.+. 
T Consensus 246 ~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL-------~~~~~--~~~~~~f~~~ 316 (395) T protein:vir:95 246 SELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFCLTPLLKKIQNELNAKL-------ITQSM--YLKDTRIEIV 316 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhh-------cChhh--hcccceecch Confidence 77778888999999999999998765554433333333333333333322222211 10000 0112344555 Q ss_pred CCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH---H--H--HHHHH---HHHH--HHHhhhhhhhcccccC Q lcl|NC_021301. 389 SPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK---Q--D--DLDRA---REQI--TLFAGNSVQRPQEDGS 455 (456) Q Consensus 389 ~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~---~--~--e~~~~---~ee~--~~~~~~~~~~~~~d~~ 455 (456) ..+-.|..+.++++.+++++|+++.--+++.+|+.|-+.. + + ....+ .+.. ...........+++|+ T Consensus 317 ~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 317 GVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 5667788999999999999999999889999988663211 0 0 00000 0000 0001111122333444 No 195 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=98.55 E-value=2.5e-07 Score=56.78 Aligned_cols=365 Identities=10% Similarity=-0.021 Sum_probs=150.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccH Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~ 86 (456) +-++++|-++... ... .+.+. ....+. .. .-+.......+|+.+++-+..-|+.+-.... .. 
T Consensus 1 Mg~f~~lf~~~~~----~~~---~~~~~-----~~~~v~---~~--~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~-~~ 62 (395) T protein:vir:10 1 MSILEKIFKTRKD----ITY---MLDLD-----MIEDLS---QQ--AYVKRLAIDSCIEFVARAVAQSHFKVLEGNR-IQ 62 (395) T ss_pred CchhhhhhccCcc----ccc---cccch-----hccccc---hh--hhhhhHHHHHHHHHHHHhhccceeEeccCCc-cc Confidence 2223332221100 000 00000 000000 00 0123455677888888888777876543221 12 Q ss_pred HHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEcccee--EEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 87 ALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETM--VVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 87 ~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~--~~~~d~~~~~~~~~~~~~~~~ 159 (456) ...+...+.. |. -......+....+..|.+|+++..+ .+ . ...++..+ ..+++.. ...+. T Consensus 63 ~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~-~~-~--~~~~~~~~~~~~~~~~~-----~~~~~---- 129 (395) T protein:vir:10 63 KNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDS-KE-L--LIADSFYREEYALYDDI-----FKDVT---- 129 (395) T ss_pred cchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecC-CC-e--EecCCccceeEeecCcc-----eeEEE---- Confidence 2223344332 32 2344556777778888888766433 22 1 11222211 1111110 00000 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~ 239 (456) ..+.. ....+..++++++.. ++ + ..+..|.|-++.....++.... T Consensus 130 ~~~~~-~~~~~~~~evih~~~--------------------------~~----~----~~~~~G~spi~~~~~~~~~~~~ 174 (395) T protein:vir:10 130 VKDYT-YQRTFTMQEVIYLKY--------------------------NN----N----KVTHFVESLFEDYGKIFGRMIG 174 (395) T ss_pred EcCce-eeeeeccccEEEEcc--------------------------CC----C----CcccccchHHHHHHHHHHHHHH Confidence 00000 001111222211100 00 0 0112455555544333333221 Q ss_pred HHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhh----hhhhhccceeccCCCceeEeecccch------HHH Q lcl|NC_021301. 
240 AELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YAS----IFEAAPGALWELPPGVDIWESQTNDF------TPM 308 (456) Q Consensus 240 ~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~d~~~~~~~~~~~------~~~ 308 (456) . ....+.+.-+++ .... ...++...... ... ....+...++.++.+.++.+++-.+. ..| T Consensus 175 ~-------~~~~~~~~gii~-~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~~~~q~ 245 (395) T protein:vir:10 175 A-------QLKNYQIRGILK-SASS-AYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIEGFDYEELSNGGKNSNMPFSEL 245 (395) T ss_pred H-------HHhcCCCceEEE-eCCC-CCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceeeeccccccccchhHHHH Confidence 1 111122222221 1111 11111111111 111 11122233445677778877643221 247 Q ss_pred HHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEec Q lcl|NC_021301. 309 LSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFE 388 (456) Q Consensus 309 ~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~ 388 (456) ++..+....+|+.+-++|+..+++..+|.+.....+....|.-.+...+..|...| +.-.. ....+++.+. T Consensus 246 ~e~~~~~~~~Ia~~f~VPp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~kL-------~~~~~--~~~~~~f~~~ 316 (395) T protein:vir:10 246 SELMRDAIKNVALMIGIPPGLIYGETADLEKNTLVFEKFCLTPLLKKIQNELNAKL-------ITQSM--YLKDTRIEIV 316 (395) T ss_pred HHHHHHHHHHHHHHhCCCHHHhcCcccCHHHHHHHHHHHHHHHHHHHHHHHHHHhh-------cChhh--hcccceecch Confidence 77778888999999999999998765554433333333333333333322222211 10000 0112344555 Q ss_pred CCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH---H--H--HHHHH---HHHH--HHHhhhhhhhcccccC Q lcl|NC_021301. 389 SPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK---Q--D--DLDRA---REQI--TLFAGNSVQRPQEDGS 455 (456) Q Consensus 389 ~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~---~--~--e~~~~---~ee~--~~~~~~~~~~~~~d~~ 455 (456) ..+-.|..+.++++.+++++|+++.--+++.+|+.|-+.. + + ....+ .+.. 
...........+++|+ T Consensus 317 ~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 317 GVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITKNYEKANSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred hhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeeccccccccccccccCcccccccCCCCCCCCCC Confidence 5667788999999999999999999889999988663211 0 0 00000 0000 0001111122333444 No 196 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=98.54 E-value=3.3e-07 Score=56.13 Aligned_cols=374 Identities=7% Similarity=-0.053 Sum_probs=152.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCcccH Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSADSDL 86 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d~~~ 86 (456) +-+++++.............. +...... ........ -+...-...+|+.+++-+..-|+.+.... ... T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~--------~~~~~~~-~~~~~~~~--~l~~~~v~~~v~~Ia~~ia~~p~~~~~~~-~~~ 68 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLT--------DTVWCSI-PSEKLKEL--SIKKWAIDSCANKIANTLSCAEVLTYEKG-EEV 68 (395) T ss_pred CchHHHHHhhhcccccccccc--------cchhhcc-ccccchhh--hhhhHHHHHHHHHHHHHHhhCceeeccCC-ccc Confidence 334444444432221111000 0000000 00000111 12233455677888887777787764322 222 Q ss_pred HHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecC Q lcl|NC_021301. 87 ALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLD 161 (456) Q Consensus 87 ~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d 161 (456) ...+..++.. |. -..+...+...++.+|.||+++..+.- ..|..+........+. 
.......+ T Consensus 69 ~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~~~~~-------~~~~~~~~~~~~~~~~-----~~~~v~~~ 136 (395) T protein:vir:40 69 RKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQDEYI-------YVADSFTKNDKSLYEN-----TYTEVTLK 136 (395) T ss_pred cchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEecCce-------eecCCccccccccccc-----eeeeeeec Confidence 2334444432 32 234456678888999999987754321 1111111100000000 00000001 Q ss_pred CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHH Q lcl|NC_021301. 162 AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAE 241 (456) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~ 241 (456) +.. +...|..+.++++ .+++..+.+.+.. +...+.... T Consensus 137 ~~~-~~~~~~~~evih~--------------------------------------r~~~~~~~~~~~~---l~~~~~~~~ 174 (395) T protein:vir:40 137 DLT-LKKEFKESEVLHL--------------------------------------TLNNESIKSIIDG---FYLLYGDLL 174 (395) T ss_pred Cce-eeeeeccccEEEe--------------------------------------ecCCCCccccchh---HHHHHHHHH Confidence 100 0011111111111 1223323322222 222333333 Q ss_pred HHHHHHHHHhhchhhhhhcCCCcccccccccch-hhhh----hhhhhhccceeccCCCceeEeecccc-hHHHHHH---H Q lcl|NC_021301. 242 LQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDYA----SIFEAAPGALWELPPGVDIWESQTND-FTPMLSA---I 312 (456) Q Consensus 242 s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~~----~~~~~~~~~~~~~~~d~~~~~~~~~~-~~~~~~~---l 312 (456) +...+...+...+.-.+..-... ...++.-.. .... .......+.++.++.+.++.++.... ...+++. . 
T Consensus 175 ~~~~~~~~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~ 253 (395) T protein:vir:40 175 TAAVNKYKKLNSRKIIVKLKAMF-GQTPEAEEKLRLMLSERMKKFLAEGDSALPVEDGMEIDELAGDSKIAESRDIKKMI 253 (395) T ss_pred HHHHHHHHhcCCCCceEEEeccc-CCCHHHHHHHHHHHHHHHHHhhccCCceeecCCCceEEeccCChhhhhHHHHHHHH Confidence 33322222222221111110111 111111111 1111 11112345566788888888875332 1224442 2 Q ss_pred HHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCC Q lcl|NC_021301. 313 KEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDR 392 (456) Q Consensus 313 ~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~ 392 (456) +..+.+|+.+-|+|+..+++..+|.+...+.+....|.-.+...+..+...| +--........+++.+..-+- T Consensus 254 ~~~~~~Ia~~fgVPp~~l~~~~sn~e~~~~~f~~~~L~P~~~~ie~~l~~kL-------l~~~~~~~g~~i~fd~~~ll~ 326 (395) T protein:vir:40 254 DDVFEMVANSFNIPLGLAKGDTVGLSEQVNSFLMFSINPIAEMFTDEGNRKF-------YGRDSVLERTYMKLDTTRIKV 326 (395) T ss_pred HHHHHHHHHHhCCCHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CChhhhcCCceEEEechhhhc Confidence 3346789999999999998766664443333333333332222222221111 110111123345555566677 Q ss_pred cCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhc----ccccCC Q lcl|NC_021301. 393 VTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRP----QEDGSR 456 (456) Q Consensus 393 ~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~----~~d~~~ 456 (456) .|..+.+++..++.++|+++.--+++.+|+.|-+-.. .++.--..+.......... 
+.++++ T Consensus 327 ~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi~~~~--gD~~~~~~n~~~~~~~~~~~kgge~~~~~ 392 (395) T protein:vir:40 327 QDIQEIASSMDVLFHIGVNTIDDNLRMIGREPVMSPE--TQERFVTKNYAPLGENEEDLKGGDINENK 392 (395) T ss_pred cCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCC--CceeeeccccccccccccccCCCCCCCCc Confidence 8899999999999999999999999999987642211 0100000000000001111 111111 No 197 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=98.53 E-value=1.9e-07 Score=57.45 Aligned_cols=362 Identities=13% Similarity=0.054 Sum_probs=154.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |. +++.+-++... ....|.+.- .. ... . ..-+.......+|+.+++-+..-|+.+.. T Consensus 1 Mg------~f~~~f~~~~~-------~~~~~~~~~----~~-~~~---~--~~a~~~~~v~~~i~~ia~~ia~~p~~~~~ 57 (385) T protein:vir:95 1 MG------LFDSVFKRHSE-------LSWMYDLEF----LQ-DKS---K--KAYLKQIALNTVVEMVARTISQSEFRVMK 57 (385) T ss_pred Cc------hhhhhhccCcc-------cccccchhh----hh-ccc---h--hhhhhhHHHHHHHHHHHHHHcccceeeee Confidence 22 44444332211 111111110 00 000 0 00122345567888888888888877632 Q ss_pred CCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceE--EEEEccceeEEEEeCCCCceEEEE Q lcl|NC_021301. 81 SADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTAT--ITADSPETMVVSVDPLQPWRIRSA 153 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~--i~~~~p~~~~~~~d~~~~~~~~~~ 153 (456) ... .....+..++.. |. -......++...+.+|.||++... +|... .....+..+ .++.. 
T Consensus 58 ~~~-~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~--~~~~~~~~~~~~~~~~-~~~~~--------- 124 (385) T protein:vir:95 58 NNT-KEKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKND--EGHFFVADDFEKEDEL-GLYSH--------- 124 (385) T ss_pred cCc-cccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEec--CCCeeecccccccccc-ccccc--------- Confidence 211 122234444432 22 245567788899999999976543 33211 111111111 01000 Q ss_pred EEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC----CCCCCcHhH Q lcl|NC_021301. 154 MRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN----PDGMGEVEP 229 (456) Q Consensus 154 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n----~~g~s~~~~ 229 (456) ..+....++ ......+..+.+.++ .+.+ ..|.|-++. T Consensus 125 ~~~~~~~~~-~~~~~~~~~~eiih~--------------------------------------~~~~~~~~~~G~s~~~~ 165 (385) T protein:vir:95 125 RFTNVLVND-FEFKRVFTMDDVIYL--------------------------------------KYNNQKLDAFSLGLFED 165 (385) T ss_pred cceeeeecc-cceeeeeccccEEEe--------------------------------------cCCCCCcccccchHHHH Confidence 000000000 000011111111111 1111 124444433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhh----hhhhhhccceeccCCCceeEeeccc- Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYA----SIFEAAPGALWELPPGVDIWESQTN- 303 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~----~~~~~~~~~~~~~~~d~~~~~~~~~- 303 (456) ....+ ........+.+.+.-+++.-. .....++....+ ... .......+.++.++.+.++.+++.. T Consensus 166 ~~~~i-------~~~~~~~~~~~~~~g~l~~~~-~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~g~~~~~l~~~~ 237 (385) T protein:vir:95 166 YGEIF-------GRMIDLQMLNNQIRGILKVDA-TKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTEGLAYEEHSNRG 237 (385) T ss_pred HHHHH-------HHHHHHHHhcCCCceEEEeCC-ccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCCCceeEeecccc Confidence 32222 222222233333322222111 111111111111 111 1111233446667888888776421 Q ss_pred ------chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021301. 
304 ------DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE 377 (456) Q Consensus 304 ------~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~ 377 (456) ...-|++..+....+|+.+-|+|+..+++..+|.+...+.+....|.-.+...+..+...| +. ... T Consensus 238 ~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~sn~e~~~~~~~~~~l~P~~~~ie~~l~~~L-------~~-~~~ 309 (385) T protein:vir:95 238 AAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLGEMADLEKTIESYLQFCINPLLRKIEAELNSKF-------FY-QDE 309 (385) T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHhhc-------CC-hhh Confidence 1234788888899999999999999997655554443344443333333322222222111 10 001 Q ss_pred CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccC Q lcl|NC_021301. 378 SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGS 455 (456) Q Consensus 378 ~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~ 455 (456) .....+++.+......|..+.++++.+++++|+++.--+++.+|+.|-+.+- .++.--..+...-...+..+.+++ T Consensus 310 ~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~~--gd~~~~~~n~~~~~~~kgge~~~e 385 (385) T protein:vir:95 310 YLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDPE--LDKFIITKNLQSADAFKGGESNEE 385 (385) T ss_pred cccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--CceeeecccceecccccCCCCCCC Confidence 1122355555566778889999999999999999999999999987632110 000000000000001111122222 No 198 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=98.52 E-value=3.8e-07 Score=55.79 Aligned_cols=368 Identities=14% Similarity=0.049 Sum_probs=154.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHH-H---HHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCe Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSR-V---RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r-~---~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~ 76 (456) |. +.+++.....+.... . .....+.-|.-.. ..+.. ..-+...-...+|+.+++-+..-|| T Consensus 1 Mg------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~v~~-----~~al~~~~v~~~i~~ia~~ia~~p~ 65 (385) T protein:vir:10 1 MG------LLTPRNFNKRKAKNMVYPSNPAFFTTTVGGMQL----SYVSA-----LSALQNTNVYSVINRIASDVASAHF 65 (385) T ss_pred Cc------cccchhcccccccccccccchhhhhhhccccCc----cccCH-----HHhhccHHHHHHHHHHHHHHhhCce Confidence 22 222111100000000 0 0000111110000 00000 1112233456688888888888888 Q ss_pred ecCCCCcccHHHHHHHHHHh-c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 77 TVGGSADSDLALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) ++.... ...++.+ | ........+....+.+|.||+++..+. ..+...++..+.+..+.. . T Consensus 66 ~v~~~~-------~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---~~~~p~~~~~v~~~~~~~--~---- 129 (385) T protein:vir:10 66 KTENTA-------TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNM--G---- 129 (385) T ss_pred eeeccc-------hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCCceEEEEEcCC--c---- Confidence 874321 1222222 2 234556667888889999999986542 223333333333332211 1 Q ss_pred EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHH Q lcl|NC_021301. 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHID 232 (456) Q Consensus 153 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~ 232 (456) ..+++....+.. ...+..+.++++... -++ .++...|.|.+..... 
T Consensus 130 ~~~~~~~~~~~~--~~~~~~~eiihik~~------------------------------~~~--~~~~~~G~s~i~~~~~ 175 (385) T protein:vir:10 130 IVYTVLESNDRP--QMVLRQDQMLHFRLM------------------------------PDP--QYRYLIGRSPLESLQN 175 (385) T ss_pred eEEEEEEcCCce--EEEEccccEEEeccC------------------------------CCC--cccccccccHHHHHHH Confidence 011111111111 111222232222100 000 0011246676665544 Q ss_pred HHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhhh--hccceeccCCCceeEeeccc--chHH Q lcl|NC_021301. 233 IINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFEA--APGALWELPPGVDIWESQTN--DFTP 307 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~d~~~~~~~~~--~~~~ 307 (456) .++....+..-..+...-.+.|..+++- ..... .++....+ ........ ..+.+..++.+.++.++... +++. T Consensus 176 ~i~~~~~~~~~~~~~~~ng~~~~gil~~-~~~~~-~~e~~~~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~ 253 (385) T protein:vir:10 176 ALNLDDKASKSNMSAMENQINPAGKLTI-SNYLS-DGKDLESAREEFEKANTGDNSGRLMVLPDGFDYTQLEMKTDVFKA 253 (385) T ss_pred HHHHHHHHHHHHHHHHhccCCcceEEEe-CCCCC-CHHHHHHHHHHHHHHhCccccCCccccCCCceEEecCCChhHHHH Confidence 4443332222111111112233333322 11110 11111111 11111111 23456677888888777543 3443 Q ss_pred HHHHHHHHHHHHHhhcCCChhhhccc-ccCc---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccce Q lcl|NC_021301. 308 MLSAIKEHIRQLSSATKTPLPMLMPD-SANQ---SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTV 383 (456) Q Consensus 308 ~~~~l~~~~~~i~~~~~~p~~~~~~~-~~N~---Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i 383 (456) +.+..+..+.+|+.+-|+|+..+|.. .+++ +.+..+..+. . + . 
.-+-..+++.+...+ .+ ..+ T Consensus 254 l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~---~-~--l-~P~~~~ie~~l~~~l--~~----~~~ 320 (385) T protein:vir:10 254 LADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYL---A-N--L-NSYVNPIVDELRLKM--NA----PDL 320 (385) T ss_pred HHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHH---H-H--H-HHHHHHHHHHHHHhh--CC----ceE Confidence 45777888899999999999999753 2332 2222222211 1 1 1 111112222221111 11 235 Q ss_pred eEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccC Q lcl|NC_021301. 384 DVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGS 455 (456) Q Consensus 384 ~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~ 455 (456) ++.+..-+..|..+.++++.++++.|+++.--+++.+|+.+-.... .....-... ..+..+++.+ T Consensus 321 ~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~~~--~~~~~~~~~-----~~~~g~~~dn 385 (385) T protein:vir:10 321 ELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLPDN--LPEFKPLTT-----QVKGGDEGDN 385 (385) T ss_pred EeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCCCC--CccccCccc-----ccCCCCCCCC Confidence 5555666778999999999999999999988888877654311000 111110000 0111111222 No 199 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=98.51 E-value=4e-07 Score=55.65 Aligned_cols=374 Identities=13% Similarity=0.027 Sum_probs=159.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCC----C Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS----A 82 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~----~ 82 (456) +-+..+|..+...-. +..+.++ .+.......+..... .-....-...+|+.+++-+..-|+++... . 
T Consensus 1 mg~~~~~~~~~~~~~----~~~~~~~---~~~~~~~~~~~~t~~--~~~~~~~v~~cv~~Ia~~ia~~p~~v~~~~~~~~ 71 (403) T protein:vir:10 1 MGFKSWITEKLNPGQ----RIIRDME---PVSHRTNRKPFTTGQ--AYSKIEILNRTANMVIDSAAECSYTVGDKYNIVT 71 (403) T ss_pred Ccchhhhhhccchhh----hhhhccc---ccccccCCcccccHH--HHHHHHHHHHHHHHHHHHHhhCceeEeecccccc Confidence 334555544432110 1111111 111111100000000 01112334456777777666667654211 1 Q ss_pred c--ccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 83 D--SDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 83 d--~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) + .-....+..++.. |. ...+...+...++.+|.||+++- +. .+..++|..+.+..+.. .. +. T Consensus 72 ~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~~----~~-~l~~l~~~~~~v~~~~~--~~----~~ 140 (403) T protein:vir:10 72 YANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYWD----GT-SLYHVPAALMQVEADAN--KF----IK 140 (403) T ss_pred cccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEe----Cc-eeEeecCcceEEEEcCC--ce----EE Confidence 1 1111224444443 32 23555667888889999997652 21 24556666555443321 10 11 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc-cCCCCCCcHhHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY-QNPDGMGEVEPHIDII 234 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~-~n~~g~s~~~~v~~li 234 (456) ++...++ ..|..+++. |+-...++... ....|.|.+..+...+ T Consensus 141 ~~~~~~~-----~~~~~~eii-------------------------------h~~~~~~~~~~~~~~~G~s~i~~~~~~i 184 (403) T protein:vir:10 141 KFIFNNQ-----INYRVDEII-------------------------------FIKDNSYVCGTNSQISGQSRVATVIDSL 184 (403) T ss_pred EEEecCc-----eeecccceE-------------------------------EecccccccCCCCCcccccHHHHHHHHH Confidence 1110000 001111111 11111111111 2235666666555444 Q ss_pred HHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhh-hhh--hhccceeccCCCceeEeecc----cchH Q lcl|NC_021301. 
235 NRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YAS-IFE--AAPGALWELPPGVDIWESQT----NDFT 306 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~-~~~--~~~~~~~~~~~d~~~~~~~~----~~~~ 306 (456) +....+..-..+...-.+.|-.+++. ... ..++.-..+. ... ... ...++++.++.+.++..++. .++ T Consensus 185 ~~~~~~~~~~~~~f~ng~~~~gil~~-~~~--l~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~~~~~~~~~d~- 260 (403) T protein:vir:10 185 EKRSKMLNFKEKFLDNGTVIGLILET-DEI--LNKKLRERKQEELQLDYNPSTGQSSVLILDGGMKAKPYSQISSFKDL- 260 (403) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEe-CCC--CCHHHHHHHHHHHHHHhCCcccCcceeecCCCceeEEecccCCHHHH- Confidence 44333322211111112223333332 111 1111111111 111 111 11345667788888877652 122 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhccc-ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeE Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPD-SANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDV 385 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~-~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v 385 (456) -|++..+....+|+.+-|+|+..+|.. .+|.+...+.+....|.-.+...+..+...| + +.+.+ T Consensus 261 q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~~L-----------~----~~~~~ 325 (403) T protein:vir:10 261 DFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRPNIELFYYMTIIPMLNKLTSSLTFFF-----------G----YKITP 325 (403) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHhc-----------C----ceeee Confidence 367777888999999999999999743 2343333333444444443433333333222 1 22333 Q ss_pred EecCC--CCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHH----HHHhhhhh--hhcccccC Q lcl|NC_021301. 386 SFESP--DRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQI----TLFAGNSV--QRPQEDGS 455 (456) Q Consensus 386 ~f~~~--~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~----~~~~~~~~--~~~~~d~~ 455 (456) .+..- +-.|..+.++++.++++.|+++..-+++.+|+.|-+.+....-.+-... ....+... 
+....+|+ T Consensus 326 d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 326 NTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDEQMNKIRIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred ccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccccccccccccccccCCCCcCCCCCCCcCCC Confidence 33322 3457788889999999999999999999999876432211111111000 00111111 11122222 No 200 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=98.51 E-value=4.1e-07 Score=55.59 Aligned_cols=358 Identities=13% Similarity=0.031 Sum_probs=155.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |. ++++|.++... ......+-.+. ... . +.-+.......+|+.+++-+..-|+.+.. T Consensus 1 Mg------~f~~l~~~~~~----~~~~~~~~~~~--------~~~---~--~~~l~~~~v~~~i~~Ia~~ia~~p~~~~~ 57 (376) T protein:vir:78 1 MG------FFSELFKRNKE----IEWMWDLDFLE--------DKT---T--KVYLKKMALNTCVKHIARTIAKSDFRLKN 57 (376) T ss_pred Cc------hhhhhhccCCc----cccccchhhcc--------ccc---h--hhhhhhHHHHHHHHHHHHhhcccceeecc Confidence 22 33333322100 00000000000 000 0 00122344567888888888777877643 Q ss_pred CCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceE-EEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 81 SADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) .. ......+..++.. |. .......+....+.+|.||+++..+.+|.+. ...+.|..+.+ . 
T Consensus 58 ~~-~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~-------------~ 123 (376) T protein:vir:78 58 GE-TSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFP-------------D 123 (376) T ss_pred cc-ccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceee-------------e Confidence 22 2222234444432 32 2455667888888999999998877665421 11111111100 0 Q ss_pred EEEE-ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHH Q lcl|NC_021301. 155 RWWR-DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDI 233 (456) Q Consensus 155 ~~~~-~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~l 233 (456) .++. ..++ ......|..+.+.++ .+.+..+.+-.. .+ T Consensus 124 ~~~~~~~~~-~~~~~~~~~~evih~--------------------------------------~~~~~~~~~~~~---~~ 161 (376) T protein:vir:78 124 VFEGVTVKD-YRYNRNFSMDDVIFL--------------------------------------EYGNERLSAFTD---GM 161 (376) T ss_pred eeeeeeeec-ceeeeeeccccEEEe--------------------------------------ccCCCCchhhhh---HH Confidence 0000 0000 000001111111111 011111222111 23 Q ss_pred HHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhh-hh---hhhccceeccCCCceeEeecccch--- Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YAS-IF---EAAPGALWELPPGVDIWESQTNDF--- 305 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~-~~---~~~~~~~~~~~~d~~~~~~~~~~~--- 305 (456) ...+..++....... .++...+...-........++....+. ... .. ....+.++.++.+.++.++...+. T Consensus 162 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~g~~~~~l~~~~~~~~ 240 (376) T protein:vir:78 162 FEDYGELFGKMIRAQ-MRNFQIRGAVNFKMAGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQLEGFNYEEFGTTSVNNS 240 (376) T ss_pred HHHHHHHHHHHHHHH-HhcCCCceeEEEccCCCCCHHHHHHHHHHHHHHhccccccCcceEEcCCCceEEeeccCccccc Confidence 333333333322222 122211111101111111111111111 111 11 122334556778888877643221 Q ss_pred ---HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_021301. 
306 ---TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDT 382 (456) Q Consensus 306 ---~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~ 382 (456) .-|++..+....+|+.+-|+|+..+++..+|.+...+.+....+.-.+...+..+...| + . ..... T Consensus 241 ~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~~~~f~~~~l~P~~~~ie~~l~~kl-------l---~-~~~~~ 309 (376) T protein:vir:78 241 QSFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNNMKAYMEYCIDPLTKKLEDELNAKL-------F---T-FSEFL 309 (376) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhhh-------C---C-cccce Confidence 24777778889999999999999998766665544444444444333333332222221 1 1 11222 Q ss_pred eeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 383 VDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 383 i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) +...+....-.|..+.++++.++..+|+++.--+++.+|+.|-+-.. .++.--. ..-....+..|+| T Consensus 310 ~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~--~d~~~~~---~n~~~~~~~~e~g 376 (376) T protein:vir:78 310 AGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPE--LDKYLIT---KNYQSADEGGEDG 376 (376) T ss_pred ecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cceeeec---cCceehhccccCC Confidence 22333334456888999999999999999999999999987642210 0000000 0011122446666 No 201 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=98.48 E-value=4.8e-07 Score=55.22 Aligned_cols=371 Identities=12% Similarity=0.009 Sum_probs=154.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccc-cCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPE-LTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~-~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) ||+-. -|...... -+..... ... ..-..+.....+|+.+++-+..-|+.+. T Consensus 1 ~~~~~--------------------------~~~g~~~~~~~~~~~~-~~~-~~~~~~~~V~acV~~Ia~~iA~lpl~l~ 52 (723) T protein:vir:94 1 MTTFP--------------------------SGAGGWNAWSADSVFG-NGA-KGWSNSAVAYRCISMLANNAASVDLVVR 52 (723) T ss_pred Ccccc--------------------------cCCCcccccccccccc-ccH-HHHhhhHHHHHHHHHHHHhhccceeEEE Confidence 11100 00000000 0000000 000 0012234556678888887777787753 Q ss_pred CC-CcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeC---CCCce-EEEEEccceeEEEEeCCCCce Q lcl|NC_021301. 80 GS-ADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRR---DDGTA-TITADSPETMVVSVDPLQPWR 149 (456) Q Consensus 80 ~~-~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d---~dg~~-~i~~~~p~~~~~~~d~~~~~~ 149 (456) .. .+......+.+++.. |. ...+...+...++.+|.+|+++-.+ ..|.| .+..++|..+.++.......- T Consensus 53 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~ 132 (723) T protein:vir:94 53 GPDGELDELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAV 132 (723) T ss_pred cCCCccchhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccc Confidence 22 111122234455542 32 2345566777888999999998754 23544 356666655544432221110 Q ss_pred EEEEEE--EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcH Q lcl|NC_021301. 150 IRSAMR--WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEV 227 (456) Q Consensus 150 ~~~~~~--~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~ 227 (456) ...... .+...+|.. ..+..+.+.++. 
..- +++.-.|.|.+ T Consensus 133 ~~~~~~~y~~~~~~G~~---~~~~~~dIiHir-------------------------------~~~---~~dg~~G~Spi 175 (723) T protein:vir:94 133 PQAQIIGYVIERTDGVR---VPVLADEMLWLR-------------------------------FSD---PYDPLAVMAPW 175 (723) T ss_pred eeeeeeEEEEEecCcee---EEecccceEEec-------------------------------CCC---CCCCcccccHH Confidence 000000 011111110 001111111110 000 01112466666 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhc---hhhhhhcCCCcccccccccchhhhhhhhhh------hccceecc------- Q lcl|NC_021301. 228 EPHIDIINRINRAELQLLSTMAIQAF---RQRALKSAGHGLPKVDENGNAIDYASIFEA------APGALWEL------- 291 (456) Q Consensus 228 ~~v~~liDa~~~~~s~~~~~~~~~~~---~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~------- 291 (456) +.... .+...++-......+|.+ |-.+++. + . ..++.... ....+.. ..|+...+ T Consensus 176 ~~a~~---~i~~~~aa~~~~~~~f~NG~~p~giL~~-~-~--l~~e~~~~--~~~~~~~~~~G~~Nagk~~vL~g~~~~~ 246 (723) T protein:vir:94 176 KAARA---AVDADFYAATWQRQSFKNGARPGGVVNL-G-D--MDEQTFTK--TVAAFRSQVEGVQNAGRHLLIAGQGSDG 246 (723) T ss_pred HHHHH---HHHHHHHHHHHHHHHHhcCCCcceEEEc-C-C--CCHHHHHH--HHHHHHHHhhchhhcCcceeeccccccc Confidence 54433 333222222222333332 3344431 1 1 11111111 1111111 11222222 Q ss_pred ---CCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHH-HH-HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 292 ---PPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSA-EG-AHNIEKGFLFKCEDRLSIAKIGLE 365 (456) Q Consensus 292 ---~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg-~A-l~~~~~~l~~k~~~~~~~f~~~l~ 365 (456) +.+.++..+.-... ..|++..+..+.+|+.+-|+|+..+++.+.+++. 
.+ +.+....|.-.+ ..|+ T Consensus 247 ~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~~~f~~~tL~P~~--------~~ie 318 (723) T protein:vir:94 247 GAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAKAAVWTETLIPQM--------EVMA 318 (723) T ss_pred ccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHHHHHHHHHHHHHH--------HHHH Confidence 34557766653322 2377888888999999999999988765433322 22 222222222222 2222 Q ss_pred HHHHHHHHhcCCCcccceeEEecCC--CCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhH--HHHH----------- Q lcl|NC_021301. 366 AILVKALQIEGESVEDTVDVSFESP--DRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQ--IKQD----------- 430 (456) Q Consensus 366 ~~~~l~~~~~~~~~~~~i~v~f~~~--~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~--~~~~----------- 430 (456) +.+..-+. ....+.+.+.|+.. +-.|..+.++++.+++++|+++..-+++.+|+.|-+ ..++ T Consensus 319 ~~ln~~Ll---~~~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~ 395 (723) T protein:vir:94 319 SITDLQLL---PDIGWTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAP 395 (723) T ss_pred HHHhHhhc---ccccCceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccC Confidence 22221110 11234566777653 457888999999999999999998899988875421 0000 Q ss_pred -------HHHHHHHHHHHHhhhhhhhc-ccccCC Q lcl|NC_021301. 
431 -------DLDRAREQITLFAGNSVQRP-QEDGSR 456 (456) Q Consensus 431 -------e~~~~~ee~~~~~~~~~~~~-~~d~~~ 456 (456) ..+-.......+...+.+.| .+-..| T Consensus 396 ~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~ 429 (723) T protein:vir:94 396 APAPAPAVEEGAARMLALLERVAADRPLPELPVR 429 (723) T ss_pred CCCCCccchhhhHhhhhhccccccccCcCCCCCC Confidence 00000000000000011111 011111 No 202 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=98.47 E-value=3.6e-07 Score=55.89 Aligned_cols=371 Identities=8% Similarity=-0.023 Sum_probs=151.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCc-cc Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD-SD 85 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d-~~ 85 (456) +=+++++..+... .+...+.+. ......... -+.......+|+.+++-+..-|+.+....+ .. T Consensus 1 MGlf~~~~~~~~~------~~~~~~~~~--------~~~~~~~~~--~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~~~~ 64 (395) T protein:vir:98 1 MGILDFFSFKKSG------TLSDDDSGS--------TTSEKLTNV--VLKEDALYKCVNYLARIISKSTFRLKTPEKLTE 64 (395) T ss_pred CcchhhhcCCCcc------cccccccch--------hhhhhcchh--hhhhHHHHHHHHHHHHHHhhCceeEEecCCccc Confidence 2223332211100 000001110 000000000 012334566788888888777776533221 11 Q ss_pred HHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec Q lcl|NC_021301. 86 LALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL 160 (456) Q Consensus 86 ~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~ 160 (456) ....+..++.. |. -......+...++.+|.||+++-.+..+. + |......+. ..+. ..+.... 
T Consensus 65 ~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~-----~-~~~~~~~~~-~~~~-----~~~~~~~ 132 (395) T protein:vir:98 65 NQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIY-----V-ADSFTQDKK-ISGS-----QFKVSRV 132 (395) T ss_pred ccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCcee-----c-CCccccccc-ccCc-----ccceeee Confidence 12234444442 32 23556678888999999998876653211 1 111111110 0000 0000000 Q ss_pred CCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCC----CCCcHhHHHHHHH- Q lcl|NC_021301. 161 DAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPD----GMGEVEPHIDIIN- 235 (456) Q Consensus 161 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~----g~s~~~~v~~liD- 235 (456) ++. .+...|..+.+.++. +.+.. +.|-+.....++. T Consensus 133 ~~~-~~~~~~~~~evih~k--------------------------------------~~~~~~~~~~~~~~~~~~~~~~~ 173 (395) T protein:vir:98 133 QGQ-TYEKTFTFDQVIYLK--------------------------------------NDNSDLMSKVESLWEEYGELLGH 173 (395) T ss_pred cCc-eeeeEecCccEEEec--------------------------------------CCCCCccccccchhhhHHHHHHH Confidence 110 001111222222211 11111 1121221111111 Q ss_pred HHHHHHHHHHHHHHHh---hchhhhhhcCCCcccc-cc-cccchhh-hhhhhhhhccceeccCCCceeEeeccc------ Q lcl|NC_021301. 236 RINRAELQLLSTMAIQ---AFRQRALKSAGHGLPK-VD-ENGNAID-YASIFEAAPGALWELPPGVDIWESQTN------ 303 (456) Q Consensus 236 a~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~-~~-~~~~~~~-~~~~~~~~~~~~~~~~~d~~~~~~~~~------ 303 (456) +++.... .....++ ..+...+.+....... .. ....... .........+.++.++.+.++.++... T Consensus 174 ~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~~~~~ 251 (395) T protein:vir:98 174 VINNQKI--ANQIRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTANTNYEEYGSKNTGAVK 251 (395) T ss_pred HHHHHHH--HHHHHHhhccccccccccccccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCCCceeEecccccccccC Confidence 1111110 0111111 1111111111100000 00 0000011 111122334455667778888776422 Q ss_pred -chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccc Q lcl|NC_021301. 
304 -DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDT 382 (456) Q Consensus 304 -~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~ 382 (456) ....+.+..+..+.+|+.+-|+|+..+++..+|.+...+.+....|.-.+...+..+...| +.-.. -... T Consensus 252 ~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~sn~e~~~~~f~~~tl~P~~~~ie~~l~~kl-------l~~~~--~~~g 322 (395) T protein:vir:98 252 SYVDDIKKLKDQYMAEFAEMLGIPISLLHGDIADNQKNYELLLEGPIESLITNIVDGLEYAI-------FDKSE--TLQG 322 (395) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CChhh--hcCc Confidence 1234667677778899999999999998665555444444444444443333333322221 10000 0122 Q ss_pred eeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccC Q lcl|NC_021301. 383 VDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGS 455 (456) Q Consensus 383 i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~ 455 (456) +.+.|......|..+.+++..++.+.|+++.-.+++.+|+.|-+.+....-.+..........-....+++++ T Consensus 323 ~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~~~~~~n~~~~~~~gge~~~~~~~ 395 (395) T protein:vir:98 323 SFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKVLYMTKNYESVLERGGEVDEEVET 395 (395) T ss_pred ceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecccceecccccCCCCCCCCC Confidence 3466777788899999999999999999999999999998764221100000000000010011112222333 No 203 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=98.47 E-value=5.4e-07 Score=54.94 Aligned_cols=380 Identities=14% Similarity=0.028 Sum_probs=160.2 Q ss_pred HHHHHHHHHHHHHHHHHH-HHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCCc-- Q lcl|NC_021301. 
7 AEWLPVLTKRIDDGMSRV-RLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSAD-- 83 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~-~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~d-- 83 (456) +-+++.+.....+...+. .....++-+..... .... ....-........+|+.+++-+..-||.+-...+ T Consensus 1 Mg~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-----~~~~~~~~~~v~~~i~~ia~~ia~~~~~~~~~~~~~ 73 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKIRADTGYVGLFMSGEDVS--FLVP-----GYVRLSDNPEVRMAVHKIADLISSMTIYLMQNTEDG 73 (406) T ss_pred CcchhhhccccccccccccchhhhhhccCcccC--cccc-----CHHHHhhcHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 222222221111111100 01111121111110 0000 0011123467788999999999888877521111 Q ss_pred -ccHHHHHHHHHH-h-c---ChhHHHHHHHHHHhhCCeE--EEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 84 -SDLALRARRIWR-D-N---RMDSVCKQWVKYGLDFGES--YLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 84 -~~~~~~l~~~~~-~-n---~~~~~~~~~~~~a~~~G~a--~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) ......+...+. . | ...++...+....+.+|.+ |+.+-.+..|.+ .+..++|..+.+..+... . T Consensus 74 ~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~~~~~~~-------~ 146 (406) T protein:vir:95 74 DIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNFLDTPDG-------Y 146 (406) T ss_pred ceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEEEEcCCe-------E Confidence 111111222222 1 2 2345667788888888765 555556667776 478888888877765431 0 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDII 234 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~li 234 (456) ++. .++ ..|..+.++++... 
+ .+ ..--.|.|-++.....+ T Consensus 147 ~~~--~~~-----~~~~~~evih~~~~---------------------------~---~~---~~~~~G~s~i~~~~~~i 186 (406) T protein:vir:95 147 QVL--YGG-----QTFNYDEVLHFIYN---------------------------P---DP---ERPYIGRGYRVVLKDIA 186 (406) T ss_pred EEE--ecc-----EEEchhHEEEeecc---------------------------C---CC---CCCccccCHHHHHHHHH Confidence 000 011 01222222222100 0 00 00013666555444444 Q ss_pred HHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch-hhh-hhhhhh--hccceecc-CCCceeEeec---ccchH Q lcl|NC_021301. 235 NRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA-IDY-ASIFEA--APGALWEL-PPGVDIWESQ---TNDFT 306 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~-~~~-~~~~~~--~~~~~~~~-~~d~~~~~~~---~~~~~ 306 (456) +....+..-......-.+.|..+++. .... .++.-.. ... ...+.. ..+....+ .+..++.++. ..+ . T Consensus 187 ~~~~~~~~~~~~~~~ng~~~~~il~~-~~~l--~~e~~~~~~~~~~~~~~g~~n~~~~~v~~~~~~~~~~~~~~~~~d-~ 262 (406) T protein:vir:95 187 DNLKQATATKKSFMSGKYMPSLIVKV-DAAT--AELSSEEGRNAVFKKYLQATEAGQPWIIPAELLEVEQVKPLSLKD-I 262 (406) T ss_pred HHHHHHHHHHHHHHhccCCcceEEEe-CCCC--CHHHHHHHHHHHHHHhccccccCCceeecCCCccccccccCChhH-H Confidence 33332222111111222233333322 1111 1111111 111 111111 11222223 3333444432 222 3 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEE Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVS 386 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~ 386 (456) .+++..+....+|+.+-|+|+..+|... +.+.....+....+. -+-..+++.+...+ . ....+.+++. 
T Consensus 263 q~~e~~~~~~~~Ia~~fgVp~~~lg~~~-~~~~~~~~~~~~~l~--------P~~~~ie~~l~~~l--~-~~~~~~~~fd 330 (406) T protein:vir:95 263 AINEAVELDKRTVAGMFGVPAFLLGIGE-FNRDEYNNFINSTIL--------PIAKGIEQELTRKL--L-ISPDLYFKFN 330 (406) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCC-chHHHHHHHHHHHHH--------HHHHHHHHHHHHhc--C-CCCCcEEEee Confidence 3678788889999999999999987432 222222222111111 11222222221111 1 1223345555 Q ss_pred ecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 387 FESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 387 f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +......|..+.++.+.++..+|+++..-+++.+|+.|.+--. .....+...............+++|.+ T Consensus 331 ~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~~~n~~~~~~~~~~~~~k~g~~~~~~~~~ 405 (406) T protein:vir:95 331 PRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVILENYIPLDKIGDQSKLKGGDNSGADGQT 405 (406) T ss_pred chhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccCccchhhcccccccCCCCCCCCCCCC Confidence 5556677899999999999999999999999999987642110 000000000000000111122334444 No 204 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=98.45 E-value=6e-07 Score=54.71 Aligned_cols=377 Identities=11% Similarity=0.004 Sum_probs=154.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |+ |.+ +.-......-.....+.-|...-...... . +.+.-.-.+|+.+++-+..-|+.... 
T Consensus 1 m~------~f~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---A-------l~~~~V~~~i~~Ia~~iA~lp~~~~~ 61 (406) T protein:vir:97 1 MS------FFQ---PLGTSKVSYDDYISSVLAGDVSQKYLGVS---A-------LKNSDILTATSIIAGDIARFPLVKKD 61 (406) T ss_pred Cc------ccc---ccCCCCCCcchHHHHHhcCCCCcccccch---h-------hccHHHHHHHHHHHHhhhhCeeEEEe Confidence 11 000 00000000000011111111100000000 0 11112223566666666555655321 Q ss_pred -CCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCC-CCce-EEEEEccceeEEEEeCCCCceEEE Q lcl|NC_021301. 81 -SADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRD-DGTA-TITADSPETMVVSVDPLQPWRIRS 152 (456) Q Consensus 81 -~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~-dg~~-~i~~~~p~~~~~~~d~~~~~~~~~ 152 (456) +.+......+..++.. |. ...+...+...++.+|.||+++.++. .|.+ .+..++|..+.+..++... +. T Consensus 62 ~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~~~~--~~- 138 (406) T protein:vir:97 62 VNGDIIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETDNHE--IV- 138 (406) T ss_pred cCccccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcCCce--EE- Confidence 1111111224445432 32 24566778889999999999998874 4655 6888899988876654321 11 Q ss_pred EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHH Q lcl|NC_021301. 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHID 232 (456) Q Consensus 153 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~ 232 (456) +.+....+... ..+..+.+.++.. . ....-.|.|.++.+.. T Consensus 139 --y~~~~~~~~~~--~~~~~~evih~r~-------------------------------~----~~dg~~G~spi~~~~~ 179 (406) T protein:vir:97 139 --YTFTDMLTAKQ--VKCFAHDVIHWKF-------------------------------F----SHDTILGRSPLLSLGD 179 (406) T ss_pred --EEEEecCCceE--EEEccccEEEecC-------------------------------C----CCCCcccccHHHHHHH Confidence 11111111111 1122223222210 0 0011126665554333 Q ss_pred HHHHHHHHHHHHHHHHHHhhc--hhhhhhcCCCcccccccccchh-hhhhhhhh--hccceeccCCCceeEeecccc-hH Q lcl|NC_021301. 
233 IINRINRAELQLLSTMAIQAF--RQRALKSAGHGLPKVDENGNAI-DYASIFEA--APGALWELPPGVDIWESQTND-FT 306 (456) Q Consensus 233 liDa~~~~~s~~~~~~~~~~~--~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~~~~~~d~~~~~~~~~~-~~ 306 (456) .++. ...-......++.. +..++.-... ...++....+ ........ ..+.++.++.+.++.++.-.+ .. T Consensus 180 ~i~~---~~a~~~~~~~~f~ng~~~~~i~~~~~--~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~ 254 (406) T protein:vir:97 180 EIDL---QTGGINTLIKFFKDGFSSGILTMKGA--QLSGDARQRARQEFEKMREGSVGGSPLVFDSTMEYTPLEIDTNVL 254 (406) T ss_pred HHHH---HHHHHHHHHHHHhccCCCceEEecCC--CCCHHHHHHHHHHHHHHhcccccCceeecCCCceEEEccCCHHHH Confidence 3332 21111111222222 1112211111 1111111111 11111111 124566778888888775332 23 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhcccccCcHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccee Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAE-GAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVD 384 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~-Al~-~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~ 384 (456) .+++..+..+.+|+.+-++|+..+|....+.+-+ ..+ +....|.-.+ ..|++.+..-+ .......... T Consensus 255 q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~~~~f~~~~l~P~~--------~~ie~~l~~kl--l~~~~~~~~~ 324 (406) T protein:vir:97 255 QLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQLMEDYVTNDLPFYF--------DAITSELGLKT--LNDKDRRLYH 324 (406) T ss_pred HHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHHHHHHHHHHHHHHH--------HHHHHHHhhhh--cChhhcccee Confidence 3677777788999999999999998644333211 111 1111121111 12222221111 1111111233 Q ss_pred EEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH---H-------HHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 385 VSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIK---Q-------DDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 385 v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~---~-------~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) +.|.- ..+....++++.++.++|+++..-+++.+|+.|.+.. + ...+...+..+.... 
..+..+.+| T Consensus 325 i~fd~--~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~-~~~gg~~~~ 401 (406) T protein:vir:97 325 IEFDT--RSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGI-KGKGGEVNA 401 (406) T ss_pred EEEec--CccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccchhccccccccccc-ccCCCCCCC Confidence 44532 2344556777888999999999999999988653211 0 011111111111111 111122233 Q ss_pred CC Q lcl|NC_021301. 455 SR 456 (456) Q Consensus 455 ~~ 456 (456) ++ T Consensus 402 ~~ 403 (406) T protein:vir:97 402 EE 403 (406) T ss_pred CC Confidence 33 No 205 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.45 E-value=6e-07 Score=54.68 Aligned_cols=401 Identities=9% Similarity=0.034 Sum_probs=161.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---------------ccccc--Ccccchhhhhhhhhh-ccChHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDA---------------PLPEL--TRNTSAAWRSFQREA-RTNWGLM 62 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~---------------~i~~~--~~~~~~~~~~~~~k~-~~n~~~~ 62 (456) + .+.++.-++.+-.. -..+..+.+--+++. .+... ....+..+...-+.+ ...+.+. T Consensus 25 ~-~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~ 99 (563) T protein:vir:99 25 I-DEGLQANIKKIEQD----NKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNA 99 (563) T ss_pred c-cCChhhhHhhhhcc----chhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHH Confidence 2 33333333322221 011111111111111 00000 011111122211122 2456677 Q ss_pred HHHHHHhhhcc-----------C--CeecCC-CC---cc--cHHHHHHHHHH-----h----cChhHHHHHHHHHHhhCC Q lcl|NC_021301. 63 VRDSVADRIIP-----------N--GITVGG-SA---DS--DLALRARRIWR-----D----NRMDSVCKQWVKYGLDFG 114 (456) Q Consensus 63 iVd~~a~~l~~-----------~--~~~~~~-~~---d~--~~~~~l~~~~~-----~----n~~~~~~~~~~~~a~~~G 114 (456) +|++.++.+.. - ++.+.. +. +. .....+..++. . 
..+..+...+..+++.+| T Consensus 100 ~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~G 179 (563) T protein:vir:99 100 IILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYD 179 (563) T ss_pred HHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcC Confidence 77776665431 1 222211 00 11 11122223221 1 124466777889999999 Q ss_pred eEEEEEe--eCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccce Q lcl|NC_021301. 115 ESYLTCW--RRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR 191 (456) Q Consensus 115 ~a~~~v~--~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (456) .+|+++. .+..|++ .+..++|..+.+..++... ......+++...++... ..|..+.+++.. T Consensus 180 n~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~g~~~--~~~~~~evI~~~------------ 244 (563) T protein:vir:99 180 QVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVDKRVV--ASFTSRELAMGI------------ 244 (563) T ss_pred CeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeCCcee--EEecCcceEEEe------------ Confidence 9988765 5556765 5888999999888765432 12222223322232211 112222211110 Q ss_pred eeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCccccc Q lcl|NC_021301. 192 LVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKV 268 (456) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~ 268 (456) .. |..-......|.|-++.....+... +.-......++. .|..+|+--+ +.... 
T Consensus 245 ------------------~~-~~~d~~~~~~G~Spi~~a~~~i~~~---~~~~~~~~~~f~ng~~p~giL~~~~-~~~ls 301 (563) T protein:vir:99 245 ------------------RN-PRTELSSSGYGLSEVEIAMKEFIAY---NNTESFNDRFFSHGGTTRGILQIRS-DQQQS 301 (563) T ss_pred ------------------cc-CCCCcccCcccchHHHHHHHHHHHH---HHHHHHHHHHHHccCCCceEEEeCC-CCCCC Confidence 00 0000000124666655444333322 221112223332 3433332111 11111 Q ss_pred ccccchh-hhhh-hhhh--hccce-eccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccC------ Q lcl|NC_021301. 269 DENGNAI-DYAS-IFEA--APGAL-WELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN------ 336 (456) Q Consensus 269 ~~~~~~~-~~~~-~~~~--~~~~~-~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N------ 336 (456) ++....+ .... .+.. ..+.+ ..++.+.++..+.... -..|++..+.....|+.+-++|+..+|..... T Consensus 302 ~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~ 381 (563) T protein:vir:99 302 QHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSK 381 (563) T ss_pred HHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccc Confidence 1111111 1111 1111 12333 5567888888876432 23388888999999999999999999742110 Q ss_pred -------cHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHh Q lcl|NC_021301. 337 -------QSAEGA--HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKA 407 (456) Q Consensus 337 -------~Sg~Al--~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~ 407 (456) ++.+.+ .+....|.-.+.. |++.+...+. 
......+.+.|.+..+.+..+..+ +.++.+ T Consensus 382 ~~ss~~~sn~e~~~~~f~~~tL~P~l~~--------ie~~ln~~L~---~~~~~~~~~~f~r~D~~~~~e~~~-~~~~~~ 449 (563) T protein:vir:99 382 GGSTLNEADPGKKQQQSQNKGLQPLLRF--------IEDLVNRHII---SEYGDKYTFQFVGGDTKSATDKLN-ILKLET 449 (563) T ss_pred cccchhhccHHHHHHHHHHHHHHHHHHH--------HHHHHHhhhc---hhcccccEEEeccCCHHHHHHHHH-HHHHhc Confidence 011111 1111112111111 2221111110 011234567787777666666554 445678 Q ss_pred cCCCcHHHHHHhCCCChhHHH-------------------HHHHHHHHHHHHHHhh-------hhhhhcccccCC Q lcl|NC_021301. 408 AGESWASIRRNILNYNADQIK-------------------QDDLDRAREQITLFAG-------NSVQRPQEDGSR 456 (456) Q Consensus 408 ~g~~s~~t~~~~~~~~~~~~~-------------------~~e~~~~~ee~~~~~~-------~~~~~~~~d~~~ 456 (456) +|+++.--+++.+|+.|-+-- ..+.+..+...+...+ ...+.+..+++. T Consensus 450 ~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (563) T protein:vir:99 450 QIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSN 524 (563) T ss_pred CCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 899998888888887643210 0000111111111110 000011111111 No 206 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.45 E-value=6e-07 Score=54.68 Aligned_cols=401 Identities=9% Similarity=0.034 Sum_probs=161.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---------------ccccc--Ccccchhhhhhhhhh-ccChHHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDA---------------PLPEL--TRNTSAAWRSFQREA-RTNWGLM 62 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~---------------~i~~~--~~~~~~~~~~~~~k~-~~n~~~~ 62 (456) + .+.++.-++.+-.. -..+..+.+--+++. .+... ....+..+...-+.+ ...+.+. 
T Consensus 25 ~-~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~n~i~~~ 99 (563) T protein:vir:95 25 I-DEGLQANIKKIEQD----NKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNLHDVLKKFGNNPILNA 99 (563) T ss_pred c-cCChhhhHhhhhcc----chhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccHHHHHHHhhcchHHHH Confidence 2 33333333322221 011111111111111 00000 011111122211122 2456677 Q ss_pred HHHHHHhhhcc-----------C--CeecCC-CC---cc--cHHHHHHHHHH-----h----cChhHHHHHHHHHHhhCC Q lcl|NC_021301. 63 VRDSVADRIIP-----------N--GITVGG-SA---DS--DLALRARRIWR-----D----NRMDSVCKQWVKYGLDFG 114 (456) Q Consensus 63 iVd~~a~~l~~-----------~--~~~~~~-~~---d~--~~~~~l~~~~~-----~----n~~~~~~~~~~~~a~~~G 114 (456) +|++.++.+.. - ++.+.. +. +. .....+..++. . ..+..+...+..+++.+| T Consensus 100 ~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~~lll~G 179 (563) T protein:vir:95 100 IILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKKIVRDTYIYD 179 (563) T ss_pred HHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHHHHHHHHhcC Confidence 77776665431 1 222211 00 11 11122223221 1 124466777889999999 Q ss_pred eEEEEEe--eCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccce Q lcl|NC_021301. 115 ESYLTCW--RRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR 191 (456) Q Consensus 115 ~a~~~v~--~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (456) .+|+++. .+..|++ .+..++|..+.+..++... ......+++...++... ..|..+.+++.. 
T Consensus 180 n~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~-~~~~~~~y~~~~~g~~~--~~~~~~evI~~~------------ 244 (563) T protein:vir:95 180 QVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGK-IIKGGKRFVQVVDKRVV--ASFTSRELAMGI------------ 244 (563) T ss_pred CeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCc-eeccceeEEEEeCCcee--EEecCcceEEEe------------ Confidence 9988765 5556765 5888999999888765432 12222223322232211 112222211110 Q ss_pred eeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCccccc Q lcl|NC_021301. 192 LVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKV 268 (456) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~ 268 (456) .. |..-......|.|-++.....+... +.-......++. .|..+|+--+ +.... T Consensus 245 ------------------~~-~~~d~~~~~~G~Spi~~a~~~i~~~---~~~~~~~~~~f~ng~~p~giL~~~~-~~~ls 301 (563) T protein:vir:95 245 ------------------RN-PRTELSSSGYGLSEVEIAMKEFIAY---NNTESFNDRFFSHGGTTRGILQIRS-DQQQS 301 (563) T ss_pred ------------------cc-CCCCcccCcccchHHHHHHHHHHHH---HHHHHHHHHHHHccCCCceEEEeCC-CCCCC Confidence 00 0000000124666655444333322 221112223332 3433332111 11111 Q ss_pred ccccchh-hhhh-hhhh--hccce-eccCCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccC------ Q lcl|NC_021301. 269 DENGNAI-DYAS-IFEA--APGAL-WELPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN------ 336 (456) Q Consensus 269 ~~~~~~~-~~~~-~~~~--~~~~~-~~~~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N------ 336 (456) ++....+ .... .+.. ..+.+ ..++.+.++..+.... -..|++..+.....|+.+-++|+..+|..... 
T Consensus 302 ~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~ 381 (563) T protein:vir:95 302 QHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSK 381 (563) T ss_pred HHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccc Confidence 1111111 1111 1111 12333 5567888888876432 23388888999999999999999999742110 Q ss_pred -------cHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHh Q lcl|NC_021301. 337 -------QSAEGA--HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKA 407 (456) Q Consensus 337 -------~Sg~Al--~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~ 407 (456) ++.+.+ .+....|.-.+.. |++.+...+. ......+.+.|.+..+.+..+..+ +.++.+ T Consensus 382 ~~ss~~~sn~e~~~~~f~~~tL~P~l~~--------ie~~ln~~L~---~~~~~~~~~~f~r~D~~~~~e~~~-~~~~~~ 449 (563) T protein:vir:95 382 GGSTLNEADPGKKQQQSQNKGLQPLLRF--------IEDLVNRHII---SEYGDKYTFQFVGGDTKSATDKLN-ILKLET 449 (563) T ss_pred cccchhhccHHHHHHHHHHHHHHHHHHH--------HHHHHHhhhc---hhcccccEEEeccCCHHHHHHHHH-HHHHhc Confidence 011111 1111112111111 2221111110 011234567787777666666554 445678 Q ss_pred cCCCcHHHHHHhCCCChhHHH-------------------HHHHHHHHHHHHHHhh-------hhhhhcccccCC Q lcl|NC_021301. 408 AGESWASIRRNILNYNADQIK-------------------QDDLDRAREQITLFAG-------NSVQRPQEDGSR 456 (456) Q Consensus 408 ~g~~s~~t~~~~~~~~~~~~~-------------------~~e~~~~~ee~~~~~~-------~~~~~~~~d~~~ 456 (456) +|+++.--+++.+|+.|-+-- ..+.+..+...+...+ ...+.+..+++. 
T Consensus 450 ~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (563) T protein:vir:95 450 QIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSN 524 (563) T ss_pred CCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCC Confidence 899998888888887643210 0000111111111110 000011111111 No 207 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=98.42 E-value=7.2e-07 Score=54.27 Aligned_cols=388 Identities=14% Similarity=0.045 Sum_probs=168.1 Q ss_pred HHHHHHHHHHHHHH---HHHHHHH-----------HHHhcccCc--ccc-cCcccchhhh-hhhhhhccChHHHHHHHHH Q lcl|NC_021301. 7 AEWLPVLTKRIDDG---MSRVRLL-----------ARYSNGDAP--LPE-LTRNTSAAWR-SFQREARTNWGLMVRDSVA 68 (456) Q Consensus 7 ~~~~~~l~~~~~~~---~~r~~~~-----------~~YY~g~~~--i~~-~~~~~~~~~~-~~~~k~~~n~~~~iVd~~a 68 (456) +-+++++..+-... .++.+-- -+.+.|-.+ +.. +......... ....-+.+.-...+|+.++ T Consensus 1 Mgl~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~ci~~Ia 80 (431) T protein:vir:10 1 MGLFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGELNGGTGRETRALRNMAVLRCVTLIS 80 (431) T ss_pred CcchhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCccCcceechhhhhccHHHHHHHHHHH Confidence 33444443321100 0110000 001111100 000 0000000000 0011122334456788888 Q ss_pred hhhccCCeecCCCCc---ccHHHHHHHHHHh--cCh---hHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEE Q lcl|NC_021301. 69 DRIIPNGITVGGSAD---SDLALRARRIWRD--NRM---DSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVV 140 (456) Q Consensus 69 ~~l~~~~~~~~~~~d---~~~~~~l~~~~~~--n~~---~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~ 140 (456) +-+..-|+.+-...+ ......+.+++.. |.. 
..+...++..++.+|.||+++-.+......+..++|..+.+ T Consensus 81 ~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~L~pl~~~~v~~ 160 (431) T protein:vir:10 81 GTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNRPIRLIPMDRGSAKG 160 (431) T ss_pred HhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCceEEEEEEcCceeEE Confidence 877777776422111 1111224444432 322 34556788899999999999988753334678888888887 Q ss_pred EEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC Q lcl|NC_021301. 141 SVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN 220 (456) Q Consensus 141 ~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n 220 (456) ..+... . + ...+...+|... .+..+.+.++.. . ..+. T Consensus 161 ~~~~~~-~-~---~y~~~~~~g~~~---~~~~~dViHir~-------------------------------~----~~dg 197 (431) T protein:vir:10 161 RLTSTW-Q-I---VYDYTTPTGDKI---ELPAREVFHLRD-------------------------------L----SIDG 197 (431) T ss_pred EEcCCC-e-E---EEEEEeCCceEE---EEchhhEEEecC-------------------------------c----CCCC Confidence 765432 1 1 122333333321 122222222210 0 0011 Q ss_pred CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchh-hhh-hhhh--hhccceeccCC Q lcl|NC_021301. 221 PDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAI-DYA-SIFE--AAPGALWELPP 293 (456) Q Consensus 221 ~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~~-~~~~--~~~~~~~~~~~ 293 (456) ..|.|-++.... .+.....-......++. .|-.+++- .. ...++.-..+ ... .... ...+.+..++. 
T Consensus 198 ~~G~spi~~~~~---~i~~~~~~~~~~~~~f~ng~~p~gil~~-~~--~ls~e~~~~~~~~~~~~~~g~~n~g~~~vl~~ 271 (431) T protein:vir:10 198 VSGVSRVKLSGN---ALELAEQAERAASRTFRTGVMAGGAIEV-PK--ELSDNAYGRMKASVQENHTGSENAGSWMLLEE 271 (431) T ss_pred cccccHHHHHHH---HHHHHHHHHHHHHHHHhccCCccEEEec-CC--CCCHHHHHHHHHHHHHHhcCccccCCceecCC Confidence 235555543332 22222221112222322 23333322 11 1111111111 111 1111 12345667788 Q ss_pred CceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 294 GVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 294 d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) +.++.++.-.+. ..+++..+..+.+|+.+-|+|++.+|.... +++..++.....+...| -.-+-..|++.+...+ T Consensus 272 g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~-~t~sn~eq~~~~f~~~t---L~P~~~~ie~~ln~~L 347 (431) T protein:vir:10 272 GATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDT-SWGSGIEQLAIFFIQYG---LSHWFVSWEQAAARAF 347 (431) T ss_pred CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCC-CccccHHHHHHHHHHHH---HHHHHHHHHHHHHhhc Confidence 889888764322 236777777889999999999999985421 12222222222222111 0111222222222111 Q ss_pred HhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCC----CcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_021301. 373 QIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGE----SWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQ 448 (456) Q Consensus 373 ~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~----~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~ 448 (456) --......+.+++.+...+-.|..+.+++..++.++|+ ++.--+++.+|+.+-+... .++..- ..+. T Consensus 348 l~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~~g~lT~NE~R~~~gl~p~~~~~--gD~~~~-------p~n~ 418 (431) T protein:vir:10 348 LPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGGQSPWMKQNEVREMLDLPRADDPV--ADQLRN-------PMTQ 418 (431) T ss_pred cChhhcCCceEEEechhhhccCHHHHHHHHHHHHhcccccCccCHHHHHHHhCCCCCCCcc--ccceec-------cccc Confidence 10111122334544455567788999999999987765 7888888898887642211 111111 1111 Q ss_pred hcccccCC Q lcl|NC_021301. 
449 RPQEDGSR 456 (456) Q Consensus 449 ~~~~d~~~ 456 (456) .+..+++. T Consensus 419 ~~~~~~~~ 426 (431) T protein:vir:10 419 KQKGSGDE 426 (431) T ss_pred ccCCCCCC Confidence 22222222 No 208 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=98.42 E-value=7.3e-07 Score=54.21 Aligned_cols=348 Identities=12% Similarity=0.025 Sum_probs=149.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |+=-+| +.++ .............+.... ..+..+.. ..-+.+.=..-+|+.+++-+.+-|+.. T Consensus 1 M~~~~~------f~~r---~~~~~~~~~~~~~~~~~~-~~~~~v~~-----~~al~~~av~~cv~~ia~~ia~~p~~~-- 63 (359) T protein:vir:10 1 MSILNP------FERR---SSITPNNYYPFMVQNGSI-VPNSLVDA-----TEALKNSDLYAVTSLISSDIAGTRFIG-- 63 (359) T ss_pred Ccccch------hhcc---ccCCCCcchhhhhccccc-cCCcccCH-----HHhhcchHHHHHHHHHHHhhhcCcccc-- Confidence 332221 1100 000000000010000000 00000100 000112222347888888777777642 Q ss_pred CCcccHHHHHHHHHHh-c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 81 SADSDLALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) ... +..++.+ | .-..+...+....+.+|.||+++-++..|.+ .+..++|..+.+..++. . 
+ .+ T Consensus 64 ---~~~---~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~--~-~---~y 131 (359) T protein:vir:10 64 ---NQV---FTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD--T-L---TY 131 (359) T ss_pred ---chH---HHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC--e-E---EE Confidence 111 2223332 2 1234456677788899999999999988985 47888888887655432 1 1 11 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liD 235 (456) .+...++ .. ...+..+++.++.... . +.-....-.|.|-++.+...+. T Consensus 132 ~~~~~~~-~~-~~~~~~~evih~~~~~-------------------------~-----~~~~~dg~~G~spi~~~~~~i~ 179 (359) T protein:vir:10 132 EVNQFDD-YP-SAKYNASEMIHVKIMA-------------------------Y-----GVDTLHNLVGHSPLESLTSEIG 179 (359) T ss_pred EEEecCC-ce-EEEEcccceEEeccCC-------------------------C-----CCCccCccccccHHHHHHHHHH Confidence 1111111 11 1223333333332100 0 0000011236665554444333 Q ss_pred HHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc-hhhhhhhhh--hhccceeccCCCceeEeecccchH-HHHHH Q lcl|NC_021301. 236 RINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN-AIDYASIFE--AAPGALWELPPGVDIWESQTNDFT-PMLSA 311 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-~~~~~~~~~--~~~~~~~~~~~d~~~~~~~~~~~~-~~~~~ 311 (456) ....+..-..+...-.+.|..+++- ..+. ..++... ......... ...|.+..++.+.++..+...+.+ .+++. 
T Consensus 180 ~~~~~~~~~~~~f~ng~~~~gil~~-~~~~-l~~e~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~ 257 (359) T protein:vir:10 180 QQKEANRLSLSTLKGALNPTSVVKV-PQGT-LSSEAKDSIRKEFEKANGGNNSGRVMVLDQSADFSTVSINADVANYLNS 257 (359) T ss_pred HHHHHHHHHHHHHhccCCcceEEEe-CCCC-CCHHHHHHHHHHHHHHhCccccCCceecCCCcceeeecCCHHHHHHHHH Confidence 3222221111111112223333321 1110 1111111 111111111 112456677888888877644333 37788 Q ss_pred HHHHHHHHHhhcCCChhhhcccc-cCcHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecC Q lcl|NC_021301. 312 IKEHIRQLSSATKTPLPMLMPDS-ANQSAEGAHNIEKGFL-FKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFES 389 (456) Q Consensus 312 l~~~~~~i~~~~~~p~~~~~~~~-~N~Sg~Al~~~~~~l~-~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~ 389 (456) .+....+|+.+-|+|++.+|... .+.+...++..+.... .-+.-....+...|.+- +. . + ....+.| T Consensus 258 ~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~------~~-~-~-~~~~~~~-- 326 (359) T protein:vir:10 258 MNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIEPLISELRIKCDSS------IG-V-D-MSPITDY-- 326 (359) T ss_pred HHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh------hc-c-c-chhhhhc-- Confidence 88889999999999999997532 2334444443333221 11111111111111110 00 0 0 0001111 Q ss_pred CCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhH Q lcl|NC_021301. 
390 PDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQ 426 (456) Q Consensus 390 ~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~ 426 (456) +.......+.+++++|+++.-.+++.++..|-- T Consensus 327 ----d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 327 ----SNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred ----CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 223334456678889999999999988876543 No 209 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.39 E-value=9.1e-07 Score=53.71 Aligned_cols=432 Identities=13% Similarity=0.021 Sum_probs=176.7 Q ss_pred CCCCCH--HHHHHHHHHHHHH----HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTASTP--AEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~~t~--~~~~~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~ 74 (456) |...|- .+-++....++.. ...+++.+.+|..-..-.. ..... +....++--+-+...++.+++.|.+- T Consensus 1 ~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~---~~~~~--~~~~~~~~dst~~~a~~~Las~l~~~ 75 (522) T protein:vir:94 1 MAEREGFAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSLFPK---ESDNS--STEYTTPWQAVGARCLNNLAAKLMLA 75 (522) T ss_pred CcccchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCC---CCCcc--cccccccccccHHHHHHHHHHHHHhh Confidence 766552 3334444444433 3455666677754431111 01001 11112344566778888888877542 Q ss_pred -----C-eecCCC--------CcccHHHH-----------HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce- Q lcl|NC_021301. 75 -----G-ITVGGS--------ADSDLALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA- 128 (456) Q Consensus 75 -----~-~~~~~~--------~d~~~~~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~- 128 (456) | |++... .+.+.... 
+.+.+..++|.....++.++..++|.+.+++..+..|.+ T Consensus 76 ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~~ 155 (522) T protein:vir:94 76 LFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGTYS 155 (522) T ss_pred cCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCcee Confidence 2 333211 11111222 234455688999999999999999999988877666654 Q ss_pred EEEEEccceeEEEEeCCCCceEEEEEEEEEecC-------CceEEEEEEcCCeEEEEEEeeeecccccceee-ccCCCce Q lcl|NC_021301. 129 TITADSPETMVVSVDPLQPWRIRSAMRWWRDLD-------AESDFAIVWSGDGWQKFARPCFVQSSSRRRLV-TRISDSW 200 (456) Q Consensus 129 ~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 200 (456) .++.++-.+.++.-|+ .++ +...++.++-.- ++.....-+.++..+.+....+.. ..++.. ....+.. T Consensus 156 ~~~~~pl~~y~v~~d~-~G~-vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~~~--~~~~~~~~~~~g~~ 231 (522) T protein:vir:94 156 PMRMYRLVSYVVQRDA-FGN-ILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIYRQ--DDEYLRYEEVEGIE 231 (522) T ss_pred eEEEEEcceEEEeeCC-CcC-eEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEEee--CCceeEEeeccCce Confidence 4777776665555543 333 444444432100 000000001111111111111111 111111 1111222 Q ss_pred eecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhh--cCCCccccccccc Q lcl|NC_021301. 201 VPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK--SAGHGLPKVDENG 272 (456) Q Consensus 201 ~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~ 272 (456) ........++. +|.+++ | .+.+|+|-.+..++.+..+|...-......+....|...+. |... 
T Consensus 232 ~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~~~g~~~--------- 302 (522) T protein:vir:94 232 VTGTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVNPNGITQ--------- 302 (522) T ss_pred ecccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeeccccccc--------- Confidence 11111212222 332222 2 34689999999999999999888888888888888764431 1111 Q ss_pred chhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHHHHH Q lcl|NC_021301. 273 NAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIEKGF 349 (456) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~~~l 349 (456) .......+.|.+....+ +....++. .+++..-.+.++.+...|....-+. .++ .+..+-+|.-++.....+ T Consensus 303 ----~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~ 376 (522) T protein:vir:94 303 ----PRRLNKAATGEFVAGRVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLN--SAVQRNAERVTAEEIRYVAGEL 376 (522) T ss_pred ----chheeccCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHHHHHHHH Confidence 11112333344333222 23333432 2344443444444444443322111 111 122233444443333222 Q ss_pred HHHHH----HHH-HHHHHHHHHHHHHHHHhcC--CCcccceeEEecCCCCcC-HHHHHHHHHHHHh--cCC--------C Q lcl|NC_021301. 350 LFKCE----DRL-SIAKIGLEAILVKALQIEG--ESVEDTVDVSFESPDRVT-LGEKYAAASLAKA--AGE--------S 411 (456) Q Consensus 350 ~~k~~----~~~-~~f~~~l~~~~~l~~~~~~--~~~~~~i~v~f~~~~~~~-~~e~ad~~~kl~~--~g~--------~ 411 (456) .+... 
+.+ ..+.+-+++.+.++....- ......+++.+..++..- ....++.+....+ +++ + T Consensus 377 ~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~i 456 (522) T protein:vir:94 377 EATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDI 456 (522) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcC Confidence 22211 111 2223333444444433211 122334666665543221 1111111111111 011 1 Q ss_pred cH----HHHHHhCCCCh-------hHHHHHHHHHHHHHH-HHHhhhhhhhc-ccccCC Q lcl|NC_021301. 412 WA----SIRRNILNYNA-------DQIKQDDLDRAREQI-TLFAGNSVQRP-QEDGSR 456 (456) Q Consensus 412 s~----~t~~~~~~~~~-------~~~~~~e~~~~~ee~-~~~~~~~~~~~-~~d~~~ 456 (456) .- ......+|+++ +|++++..+..+++. ...+....+.. .-.|++ T Consensus 457 d~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~ 514 (522) T protein:vir:94 457 NLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQG 514 (522) T ss_pred CHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcc Confidence 11 11234456643 233322222111111 11111111110 101111 No 210 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.34 E-value=1.2e-06 Score=53.03 Aligned_cols=433 Identities=9% Similarity=0.041 Sum_probs=186.4 Q ss_pred CCCCCHHHHHHHHHHHH-HHH---HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc--- Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRI-DDG---MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP--- 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~-~~~---~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~--- 73 (456) |.......+.+++ +.+ ..| ..+++.+.+|.--...-..-....+. 
...+.++..+-+...|+++++.|.+ T Consensus 1 m~~~~~~~l~~r~-~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~--~~~~~~~~dst~~~a~~~Las~l~~~lt 77 (556) T protein:vir:73 1 MAETEKERLLKQL-AQLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRD--DRRNTKIVDPTGSMAQRILSSGMMSGIT 77 (556) T ss_pred CChhhHHHHHHHH-HHHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcc--hhhcCccccchHHHHHHHHHHHHHHhhc Confidence 8886666554443 333 333 44556666664221000000001111 1112345567788888888887754 Q ss_pred ---CC-eecCCC-CcccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccce Q lcl|NC_021301. 74 ---NG-ITVGGS-ADSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPET 137 (456) Q Consensus 74 ---~~-~~~~~~-~d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~ 137 (456) .+ |++... .+.... ..+.+.+..++|.....++.++..++|.+.+++..+..+.+++..++..+ T Consensus 78 pp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~~l~~ 157 (556) T protein:vir:73 78 SPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPFPIGS 157 (556) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEeecce Confidence 23 334321 111111 12345566678889899999999999999998888777778899999989 Q ss_pred eEEEEeCCCCceEEEEEEEEEec-------CCceE----EEEEEcCC---eEEEEEEeeee--cccccc----------e Q lcl|NC_021301. 138 MVVSVDPLQPWRIRSAMRWWRDL-------DAESD----FAIVWSGD---GWQKFARPCFV--QSSSRR----------R 191 (456) Q Consensus 138 ~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~----~~~~~~~~---~~~~~~~~~~~--~~~~~~----------~ 191 (456) +++.-|+. + ++...++.++-. .|... ....|..+ ..+......+. ..+... . 
T Consensus 158 ~~~~~d~~-G-~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~V~pr~~~~~~~~~~~~~p~~s~ 235 (556) T protein:vir:73 158 YYLANSPR-G-SVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHCITPNVNRDSGKMDSKNKPYRSV 235 (556) T ss_pred eEEeeCCC-C-CeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEEEeccccccccccCcccceEEEE Confidence 88776654 3 344444443211 00000 00001000 00000000000 000000 0 Q ss_pred eecc-CCCc-eeecccccccCceeEEEE-c----cCCCCCCc-HhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCC Q lcl|NC_021301. 192 LVTR-ISDS-WVPVGDAVVTGSPPPVVV-Y----QNPDGMGE-VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGH 263 (456) Q Consensus 192 ~~~~-~~~~-~~~~~~~~~~~~~~pvv~-~----~n~~g~s~-~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~ 263 (456) .++. ..+. ...+.. +..+|.+++ | .+.+|+|- .+..++.+..+|...-......+....|.+.+-.- T Consensus 236 ~~~~~~~~~~vl~esg---~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~~-- 310 (556) T protein:vir:73 236 YFESGGDSDKLLRESG---FDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPPMVAPTS-- 310 (556) T ss_pred EEEecCCCceecccCC---cccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCceecccc-- Confidence 0110 0111 111111 112222221 1 35689995 89999999999988888888888888775443211 Q ss_pred cccccccccchhhhhhhhhhhccceecc--CCC-ceeE--eecccchHHHHHHHHHHHHHHHhhcCCCh-hhhc-ccccC Q lcl|NC_021301. 264 GLPKVDENGNAIDYASIFEAAPGALWEL--PPG-VDIW--ESQTNDFTPMLSAIKEHIRQLSSATKTPL-PMLM-PDSAN 336 (456) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~d-~~~~--~~~~~~~~~~~~~l~~~~~~i~~~~~~p~-~~~~-~~~~N 336 (456) +.. ......+|.+... ..+ ..+. +...++.....+.+..+...|....-.+. 
..++ .+..+ T Consensus 311 --------~~~----~~~~~~pgg~~~~~~~~~~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r 378 (556) T protein:vir:73 311 --------LKN----QRVSLLPGDVTYLDVISGQDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRS 378 (556) T ss_pred --------ccc----cceeeccCccccccCCCCccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCC Confidence 100 0112223322111 111 1122 11223344444444444444433322211 1111 12233 Q ss_pred cHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHhcCC---C---cccceeEEecCCCCcCHH--------H Q lcl|NC_021301. 337 QSAEGAHNIEKGFLFK----CEDRL-SIAKIGLEAILVKALQIEGE---S---VEDTVDVSFESPDRVTLG--------E 397 (456) Q Consensus 337 ~Sg~Al~~~~~~l~~k----~~~~~-~~f~~~l~~~~~l~~~~~~~---~---~~~~i~v~f~~~~~~~~~--------e 397 (456) -+|.-++.....+... ..+.+ ..+.+-+.+.+.++.+..-. + ....+++.+..++-..-. . T Consensus 379 ~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~ 458 (556) T protein:vir:73 379 MPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQ 458 (556) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHH Confidence 3555444443333322 22222 23344455666655542211 1 123467777665432111 1 Q ss_pred HHHHHHHHHhcC-----CCcHHHH----HHhCCCC------hhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 398 KYAAASLAKAAG-----ESWASIR----RNILNYN------ADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 398 ~ad~~~kl~~~g-----~~s~~t~----~~~~~~~------~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .++.+..+.+++ .+.-..+ ...+|++ +++++++..+|++++............ 
.+|.| T Consensus 459 ~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a-~~~~~ 531 (556) T protein:vir:73 459 TVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAA-AQGAK 531 (556) T ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 112222222211 0111111 2345553 445555555554444333222111111 11222 No 211 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.29 E-value=1.6e-06 Score=52.35 Aligned_cols=433 Identities=9% Similarity=0.018 Sum_probs=183.0 Q ss_pred CCCCCHHHHHHH---HHHHHHHHHHHHHHHHHHh---cccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc- Q lcl|NC_021301. 1 MTASTPAEWLPV---LTKRIDDGMSRVRLLARYS---NGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP- 73 (456) Q Consensus 1 ~~~~t~~~~~~~---l~~~~~~~~~r~~~~~~YY---~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~- 73 (456) |...+.+++.++ |..+......+++.+.+|. .+...-.+. .+ ....+.++..+-+...|+++++.|.+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~---~~--~~~~~~~~~dst~~~a~~~Las~l~~~ 75 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSDYINPRGSRFLTSEV---NR--NDRRNTRIIDSTGTMAARTLASGMMSG 75 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCcCCCCC---Cc--ccccccccccchHHHHHHHHHHHHHHh Confidence 888887765443 2223333445566666663 222111000 00 11123345567788888888887754 Q ss_pred -----CC-eecCCC-CcccHHHH-----------HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEcc Q lcl|NC_021301. 74 -----NG-ITVGGS-ADSDLALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSP 135 (456) Q Consensus 74 -----~~-~~~~~~-~d~~~~~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p 135 (456) .+ |++... .+...... +.+.+..++|.....++.++..++|.+.+++..+..+.+++..++. 
T Consensus 76 ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~~l 155 (559) T protein:vir:95 76 ITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPFPI 155 (559) T ss_pred hcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEeec Confidence 23 444322 11111111 3345666788888999999999999998888777666788999999 Q ss_pred ceeEEEEeCCCCceEEEEEEEEEec-------CCceE----EEEEEc--C-CeEEEEEEeeee--cccccc--------- Q lcl|NC_021301. 136 ETMVVSVDPLQPWRIRSAMRWWRDL-------DAESD----FAIVWS--G-DGWQKFARPCFV--QSSSRR--------- 190 (456) Q Consensus 136 ~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~----~~~~~~--~-~~~~~~~~~~~~--~~~~~~--------- 190 (456) .++++.-|+.. ++...++.++-. .+... ....+. + +..+.+....+. ..+... T Consensus 156 ~~~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~pf~ 233 (559) T protein:vir:95 156 GSYYLANSPRG--SVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHSVYPNIDRDTSKLDSKNKPFK 233 (559) T ss_pred CeEEEeeCCCC--CeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEEEeccccccccccccccceEE Confidence 99888776543 344444433210 00000 000000 0 000111100000 000000 Q ss_pred -eeecc-CCCc-eeecccccccCceeEEEEc---cCCCCCCc-HhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCC Q lcl|NC_021301. 191 -RLVTR-ISDS-WVPVGDAVVTGSPPPVVVY---QNPDGMGE-VEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGH 263 (456) Q Consensus 191 -~~~~~-~~~~-~~~~~~~~~~~~~~pvv~~---~n~~g~s~-~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~ 263 (456) ...+. ..+. ...+... +.+.++|.-|. 
...+|+|- ....++.+..+|...-......+....|.+.+-.- T Consensus 234 s~~~e~~~~~~~~l~esg~-~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp~~v~~~-- 310 (559) T protein:vir:95 234 SVYYEVGGDNDKLLRESGF-DEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPPMVAPTS-- 310 (559) T ss_pred EEEEEecCCCceeeecCCc-ccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCceecccc-- Confidence 00111 1111 1111111 11222222222 34689995 88899999999988888888888888874443211 Q ss_pred cccccccccchhhhhhhhhhhccceeccCC-----CceeEeecccchHHHHHHHHHHHHHHHhhcCCCh-hhhc-ccccC Q lcl|NC_021301. 264 GLPKVDENGNAIDYASIFEAAPGALWELPP-----GVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPL-PMLM-PDSAN 336 (456) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~-~~~~-~~~~N 336 (456) +.. ......+|.++..+. .....+....+.......+..+...|....-... .++. .+..+ T Consensus 311 --------~~~----~~~~l~pgg~~~~~~~~~~~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~r 378 (559) T protein:vir:95 311 --------LKN----QRASLLPGDITYIDQITGQDGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) T ss_pred --------ccc----cceeeeccceeeeCCCCCcccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCC Confidence 100 011122333322211 1111111112233222233333333333222211 1121 12233 Q ss_pred cHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHhcC---CC---cccceeEEecCCCCcCH--------HH Q lcl|NC_021301. 337 QSAEGAHNIEKGFLFK----CEDRL-SIAKIGLEAILVKALQIEG---ES---VEDTVDVSFESPDRVTL--------GE 397 (456) Q Consensus 337 ~Sg~Al~~~~~~l~~k----~~~~~-~~f~~~l~~~~~l~~~~~~---~~---~~~~i~v~f~~~~~~~~--------~e 397 (456) .+|.-++.....+... ..+.+ ..+.+-+.+.+.++.+..- .+ ....+++.+..++..-- .. 
T Consensus 379 vTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~ 458 (559) T protein:vir:95 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHH Confidence 3555544443333222 22222 2334555566666554211 11 12346677765543211 01 Q ss_pred HHHHHHHHHhcC-----CCcHHHH----HHhCCCC------hhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 398 KYAAASLAKAAG-----ESWASIR----RNILNYN------ADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 398 ~ad~~~kl~~~g-----~~s~~t~----~~~~~~~------~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .++.+..+.+++ .+.-..+ ...+|++ ++|++++..+|.+++............ ..+.| T Consensus 459 ~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~a-a~~~~ 531 (559) T protein:vir:95 459 TVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAA-AQGVK 531 (559) T ss_pred HHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhh Confidence 112222222221 1211111 2345553 345554444443333221111111111 11111 No 212 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.27 E-value=1.8e-06 Score=52.06 Aligned_cols=427 Identities=12% Similarity=0.022 Sum_probs=177.9 Q ss_pred CCCCC----HHHHHHHHHHHH----HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTAST----PAEWLPVLTKRI----DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t----~~~~~~~l~~~~----~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |...- .++-++....++ .....+++.+.+|..-..- +...... +....++--+-+...++.+++.|. 
T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~----~~~~~~~-~~~~~~~~dst~~~a~~~Laa~l~ 75 (535) T protein:vir:33 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLF----PKESDNE-STDYTTPWQAVGARGLNNLASKLM 75 (535) T ss_pred CChhhhhccChhHHHHHHHHHHHHhhHHHHHHHHHHHHhccccc----CCCCCcc-cccccccccccHHHHHHHHHHHHH Confidence 54433 233344333333 3445566677777544310 1100000 111112334556778888888775 Q ss_pred cC-----C-eecCCCC--------cccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc Q lcl|NC_021301. 73 PN-----G-ITVGGSA--------DSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT 127 (456) Q Consensus 73 ~~-----~-~~~~~~~--------d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~ 127 (456) +- | |++...+ +.... ..+.+.+..++|.....++.++..++|.+.+++-.+..+. T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 155 (535) T protein:vir:33 76 LALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSY 155 (535) T ss_pred HhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCc Confidence 42 2 3332111 00111 1233456678899999999999999999988887777677 Q ss_pred eEEEEEccceeEEEEeCCCCceEEEEEEEEEec-------CCce-----EEEEEEcCCeEEEEEEeeeecccccceeec- Q lcl|NC_021301. 128 ATITADSPETMVVSVDPLQPWRIRSAMRWWRDL-------DAES-----DFAIVWSGDGWQKFARPCFVQSSSRRRLVT- 194 (456) Q Consensus 128 ~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 194 (456) .+++.++-.++++.-|+ .++ +...++.++-. .+.. .....+..-.+|+ +.+...+...+... 
T Consensus 156 ~~f~~~pl~~~~v~~d~-~G~-vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~---~v~~~~~~~~~~~~~ 230 (535) T protein:vir:33 156 NPMKLYRLSSYVVQRDA-YGN-VLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYT---HVYLDEESGDYLKYE 230 (535) T ss_pred eeeEEEEcCeeEEeeCC-CCC-eeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEE---EEEeeCCCCcEEEEE Confidence 88888877776666553 343 34444444211 0100 0000111111111 11111111111110 Q ss_pred cCCCceeecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc Q lcl|NC_021301. 195 RISDSWVPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV 268 (456) Q Consensus 195 ~~~~~~~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~ 268 (456) ...+..........+++ +|.+++ | .+.+|+|-.++.++.+..+|...-......+....|...+- T Consensus 231 ~~~~~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~--------- 301 (535) T protein:vir:33 231 EVEDVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN--------- 301 (535) T ss_pred EEeCccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec--------- Confidence 01111111111111122 232222 2 34689999999999999999888888888888777754331 Q ss_pred ccccchhhhhhhhhhhccceeccC-CCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHH Q lcl|NC_021301. 269 DENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIE 346 (456) Q Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~ 346 (456) .+| .........++.|.+.... .+....++. .+++....+.++.+...|.... +.+.....+..+-+|.-++.. 
T Consensus 302 -~~g-~~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r~TAtEV~~r- 377 (535) T protein:vir:33 302 -PAG-ITQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGERVTAEEIRYV- 377 (535) T ss_pred -ccc-ccchhhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCccccHHHHHHH- Confidence 001 0011122233334433222 223344433 3345555555555555544332 111111112222344433333 Q ss_pred HHHHHHHHHHHHHHHHHH------------HHHHHHHHHhcC--CCcccceeEEecCCCCcCH-HHHH----HHHHHHHh Q lcl|NC_021301. 347 KGFLFKCEDRLSIAKIGL------------EAILVKALQIEG--ESVEDTVDVSFESPDRVTL-GEKY----AAASLAKA 407 (456) Q Consensus 347 ~~l~~k~~~~~~~f~~~l------------~~~~~l~~~~~~--~~~~~~i~v~f~~~~~~~~-~e~a----d~~~kl~~ 407 (456) .+++...++..+ ++.+.++.+..- ..+...+++.|..++..-- .+.+ +.+..+.+ T Consensus 378 ------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~ 451 (535) T protein:vir:33 378 ------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAA 451 (535) T ss_pred ------HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHh Confidence 333444444433 344444432111 1223446777765542211 1111 11222211 Q ss_pred cC--C----CcHHH----HHHhCCCCh-------hHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 408 AG--E----SWASI----RRNILNYNA-------DQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 408 ~g--~----~s~~t----~~~~~~~~~-------~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) .+ + +.-.. 
....+|+++ ++++++..++.+++.................+ T Consensus 452 ~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~ 517 (535) T protein:vir:33 452 LAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALAT 517 (535) T ss_pred hChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhh Confidence 11 0 11111 123456543 33333222221111111000000000111111 No 213 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.26 E-value=1.9e-06 Score=51.94 Aligned_cols=392 Identities=12% Similarity=0.024 Sum_probs=156.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC-cccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDA-PLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) |+--+-..+ ....... ....-+.+.. ++. -|+-.+..+.... -...+...+|+..+..+.+.|+.+. T Consensus 6 ~~~~~~~~~--------~~~~~~~-~~~~~~~~~~~~~~-~pp~~~~~La~~~--~~n~~v~scI~~ia~~ia~~~~~i~ 73 (540) T protein:vir:41 6 LSIKSLEKY--------RAIKGDT-DSQALKEDRFEEYV-EPKVHPLVLLSLL--QVNPYHASACSIKANDILRTGYLID 73 (540) T ss_pred cChhhccch--------hhhhccc-cccccccCCCCccc-cCCCCHHHHHHHH--HhcHHHHHHHHHHHHHHhcCCceEe Confidence 443322111 0000000 0001111111 111 0111112222221 1245778899999999999998875 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEE Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWR 158 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~ 158 (456) .. +......+- ...-....+...+..+.+.+|.||+.+-.+..|.+ .+..++|..+.+..+... ++. 
T Consensus 74 ~~-~~~~~~~lp--N~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~---------~~~ 141 (540) T protein:vir:41 74 GD-DGGVEELLR--ACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSR---------YMQ 141 (540) T ss_pred cC-ccchhhhcc--CCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCce---------eEe Confidence 43 222211110 00112345667788899999999999999888876 588889998877654331 122 Q ss_pred ecCCceEE-EEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EEEcc------CCCCCCcHhH Q lcl|NC_021301. 159 DLDAESDF-AIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVVYQ------NPDGMGEVEP 229 (456) Q Consensus 159 ~~d~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~~~------n~~g~s~~~~ 229 (456) ..++.... ...|.....+.. ..+.. ...+++ |+|+. ...|.|.+.. T Consensus 142 ~~d~~~~~~~~~~~~~~~~~~-----------------~~g~~--------~~~~~~~eViHir~~~~~~~~~G~Spi~~ 196 (540) T protein:vir:41 142 TWDGIHVTYFKDYRYEGEVNP-----------------DNGED--------QDGVGANEIIFIHLPSPICSYYGVPRYLS 196 (540) T ss_pred eecCceeeeeecccccceeec-----------------ccccc--------ceeecccceEEecCCCCCCCcccccHHHH Confidence 22222111 111110000000 00000 000111 22221 1256665553 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCC--cccccccccch--hhhh-hhh-------hhhccceecc--- Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQA---FRQRALKSAGH--GLPKVDENGNA--IDYA-SIF-------EAAPGALWEL--- 291 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~--~~~~~~~~~~~--~~~~-~~~-------~~~~~~~~~~--- 291 (456) .. .++.....-......++. .|-.+++--+. +.....+.... .... ..+ ....+....+ T Consensus 197 ~~---~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~~ 273 (540) T protein:vir:41 197 AA---PSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSIP 273 (540) T ss_pred HH---HHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEecC Confidence 32 333222221111222322 23333321110 00000000000 0000 001 1122333332 Q ss_pred ---CCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhccc---ccC-cHHHH--HHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 
292 ---PPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPD---SAN-QSAEG--AHNIEKGFLFKCEDRLSIAK 361 (456) Q Consensus 292 ---~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~---~~N-~Sg~A--l~~~~~~l~~k~~~~~~~f~ 361 (456) +.+.++..+.... -..|++..+.....|+++-++|+..+|.. +.| ++.+. +.+....|.-.+...+. T Consensus 274 ~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~~~tL~P~~~~ie~--- 350 (540) T protein:vir:41 274 GGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYYESVVRPQQEIVSS--- 350 (540) T ss_pred CCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHHHHHHHHHHHHHHH--- Confidence 2455676664322 22388888889999999999999999742 222 12222 22222223222222222 Q ss_pred HHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHh-CCCChhHHHHH--------HH Q lcl|NC_021301. 362 IGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNI-LNYNADQIKQD--------DL 432 (456) Q Consensus 362 ~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~-~~~~~~~~~~~--------e~ 432 (456) .|.+.+ +. . ....+.+.|........ +.+..+.+++++|+++.--+++. .|..+-...-+ +. T Consensus 351 -~ln~~L---~~-~---~~~~~~i~f~~~~ll~~-D~~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p~n~~~~~~ 421 (540) T protein:vir:41 351 -VLTDFI---QL-K---LDPGARFVFNEEILMES-EFVHNYALLVQCGVLTPSEVREKLFGLDGGPDMFMVPSSIGKSAM 421 (540) T ss_pred -HHHHhh---hh-c---cCCceEEEecchhhcch-HHHHHHHHHHhCCCCCHHHHHHHhCcCcCCCcccccccccccccc Confidence 222211 11 1 12235566765533222 34444566788999988777764 45543211000 00 Q ss_pred HHHH--------HHHHHHhhhhhhh---------cccccCC Q lcl|NC_021301. 433 DRAR--------EQITLFAGNSVQR---------PQEDGSR 456 (456) Q Consensus 433 ~~~~--------ee~~~~~~~~~~~---------~~~d~~~ 456 (456) .... ++.........+. 
+.+++++ T Consensus 422 ~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~ 462 (540) T protein:vir:41 422 KRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKK 462 (540) T ss_pred cccccccCCCCccccccccchhcccccCccccccccccccc Confidence 0000 0000000000000 1111111 No 214 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.22 E-value=2.5e-06 Score=51.30 Aligned_cols=431 Identities=9% Similarity=-0.077 Sum_probs=180.6 Q ss_pred CCCCCHHHHHHHHHHHHHH-------HHHHHHHHHHH---hcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDD-------GMSRVRLLARY---SNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~-------~~~r~~~~~~Y---Y~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~ 70 (456) |+.++.. +++.|.++++. ...+++.+.+| |.|.-... ++.........+.++--+-+...++.+++. T Consensus 1 m~~d~~~-~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~--~~~~~~~~~~~~~~~~dstg~~a~~~LAs~ 77 (549) T protein:vir:10 1 MTNDDAK-ILQALNADHGRMKEKRQSYEAVWNDVIDYLMPRLDKFGQL--PRPDSEKGRERSQKMFDSTAPLALRNFVAA 77 (549) T ss_pred CCcchHH-HHHHHHHHHHHHHHHhhhHHHHHHHHHHHhcccccccccc--CCCCCCcccccccccccchHHHHHHHHHHH Confidence 9998844 56665555432 23344455555 22221111 111111112223345556778888888887 Q ss_pred hccC------C-eecCCCCc-ccH----HHHHH-------HHH--HhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceE Q lcl|NC_021301. 71 IIPN------G-ITVGGSAD-SDL----ALRAR-------RIW--RDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTAT 129 (456) Q Consensus 71 l~~~------~-~~~~~~~d-~~~----~~~l~-------~~~--~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~ 129 (456) |.+- | |++....+ ... ...+. 
..+ ...+|.....++.++..++|.+.+++..+..+.++ T Consensus 78 l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~~~~~ 157 (549) T protein:vir:10 78 MDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVGKGIV 157 (549) T ss_pred HHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCCCeeE Confidence 7542 3 33433211 111 11121 211 24678888888999999999999998887777788 Q ss_pred EEEEccceeEEEEeCCCCceEEEEEEEEEec-------CC---------------ceEEEEEEcCCeEEEEEEe--eee- Q lcl|NC_021301. 130 ITADSPETMVVSVDPLQPWRIRSAMRWWRDL-------DA---------------ESDFAIVWSGDGWQKFARP--CFV- 184 (456) Q Consensus 130 i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~---------------~~~~~~~~~~~~~~~~~~~--~~~- 184 (456) +..++-.++++.-|+. ++ +...++.++-. .| .....++|+ .+.. ... T Consensus 158 f~~~pl~~~~v~~d~~-G~-vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~------~V~pr~~~~~ 229 (549) T protein:vir:10 158 YRNVPMQRLWFAENNS-GL-IDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYH------AVEPRADRDP 229 (549) T ss_pred EEEEEcCeEEEeeCCC-CC-eEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEE------EeecCCCCCc Confidence 8889888887777754 43 34444433210 00 011112211 1000 000 Q ss_pred ---cccc---cceeeccCCCceeecccccccCceeEEEEc---cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchh Q lcl|NC_021301. 185 ---QSSS---RRRLVTRISDSWVPVGDAVVTGSPPPVVVY---QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQ 255 (456) Q Consensus 185 ---~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~---~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~ 255 (456) .... .....+...+....+... +.+.++|.-|. .+.+|+|-.+..++.+..+|.+.-......+....|. 
T Consensus 230 ~~~~~~~~pf~sv~~e~~~~~il~esg~-~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~ 308 (549) T protein:vir:10 230 RKLDGRNMQFASYWLDEGRDRIVQNSGF-RTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQKLVDPP 308 (549) T ss_pred cccccccCceEEEEEEecCCEeeccCCc-ccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 0000 000011111111111111 11222222221 3468999999999999999988888888888877775 Q ss_pred hhhhcCCCcccccccccchhhhhhhhhhhccceec-cCCCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhccc Q lcl|NC_021301. 256 RALKSAGHGLPKVDENGNAIDYASIFEAAPGALWE-LPPGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPD 333 (456) Q Consensus 256 ~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~ 333 (456) +.+- -+. ..++ ......+.+.+.. .+.+..+.++. ..++......++.+...|...--........+ T Consensus 309 ~~v~-~~g-------~~~~---~~l~pgg~~~~~~~~~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~ 377 (549) T protein:vir:10 309 LLAN-EDG-------VLDG---FDLRSGALNWGGLNDKGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVD 377 (549) T ss_pred eeec-ccc-------cccc---ceeccCCccccccCCCCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcC Confidence 5431 000 0011 1111111111111 11222344432 22333333334443333333222111111112 Q ss_pred ccCcHHHHHHHHHHHHHHH----HHHHH-HHHHHHHHHHHHHHHHhcCC-C--------cccceeEEecCCCCcCHH-H- Q lcl|NC_021301. 334 SANQSAEGAHNIEKGFLFK----CEDRL-SIAKIGLEAILVKALQIEGE-S--------VEDTVDVSFESPDRVTLG-E- 397 (456) Q Consensus 334 ~~N~Sg~Al~~~~~~l~~k----~~~~~-~~f~~~l~~~~~l~~~~~~~-~--------~~~~i~v~f~~~~~~~~~-e- 397 (456) ..+-+|.-++.....+.+. ..+.+ ..+.+-+.+.+.++.. .|. + ....+++.|..++-..-. 
+ T Consensus 378 ~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r-~g~lP~~p~~l~~~~~~~~i~yis~La~aq~~~~ 456 (549) T protein:vir:10 378 SGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAE-AGQLPDMPQELIDAGADVDVEYDSPLNKAMRAGE 456 (549) T ss_pred CCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHh-cCCCCCCChhhhcCCceeEEEeecHHHHHHHHHH Confidence 2233555444433333222 22222 3334455566665554 222 1 112456666554432110 1 Q ss_pred ------HHHHHHHHHhcC-----CCcHHH----HHHhCCCC------hhHHHHHHHHHHHHHH----HHHhhhhhhhccc Q lcl|NC_021301. 398 ------KYAAASLAKAAG-----ESWASI----RRNILNYN------ADQIKQDDLDRAREQI----TLFAGNSVQRPQE 452 (456) Q Consensus 398 ------~ad~~~kl~~~g-----~~s~~t----~~~~~~~~------~~~~~~~e~~~~~ee~----~~~~~~~~~~~~~ 452 (456) .++.+..+.+.+ .+.-.. ....+|++ ++|++++..++.++++ ...+....+ ... T Consensus 457 ~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~~-~a~ 535 (549) T protein:vir:10 457 GAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAAG-AIK 535 (549) T ss_pred HHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH Confidence 112222222221 111111 12345543 4455544333333222 112211111 111 Q ss_pred ccCC Q lcl|NC_021301. 453 DGSR 456 (456) Q Consensus 453 d~~~ 456 (456) +.++ T Consensus 536 ~~~~ 539 (549) T protein:vir:10 536 DLSD 539 (549) T ss_pred hhhh Confidence 2222 No 215 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=98.16 E-value=3.5e-06 Score=50.53 Aligned_cols=372 Identities=12% Similarity=0.080 Sum_probs=150.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) |. +.+.+.++-.... ..-..++-+.......+ ...... .........+|+.+++-+..-|+.+-. T Consensus 1 Mg------~~~~f~~k~~~~~---~~~~~~~~~~~~~~~~~---~~~~~~---~~~~~~V~~~I~~ia~~iA~~p~~~~~ 65 (403) T protein:vir:80 1 MG------LFNFFRRKTRSEP---TNAISWFLTQEAYDTLA---IPGYTR---LSDNPEVRMAVHKIAELISSMTIHLMQ 65 (403) T ss_pred Cc------ccccccccccccc---cchhhhhcccccccccc---cchhhh---hhhhHHHHHHHHHHHHhhhhCceEEEE Confidence 11 1111111100000 00000000000000000 000011 111234456788888877766766421 Q ss_pred C-Cc--ccHHHHHHHHHHh--cCh---hHHHHHHHHHHhh--CCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCce Q lcl|NC_021301. 81 S-AD--SDLALRARRIWRD--NRM---DSVCKQWVKYGLD--FGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWR 149 (456) Q Consensus 81 ~-~d--~~~~~~l~~~~~~--n~~---~~~~~~~~~~a~~--~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~ 149 (456) . ++ ......+.+++.. |.. ..+...++...+. .|.||+++..+..|.+ .+..++|..+.++.++.. .. T Consensus 66 ~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~~~~~~g-~~ 144 (403) T protein:vir:80 66 NTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSFVDTDTG-YQ 144 (403) T ss_pred ecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEEEEcCCc-eE Confidence 1 11 1112223344332 322 2344455666666 4778998888888876 577888888776655332 11 Q ss_pred EEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccC-CCCCCcHh Q lcl|NC_021301. 150 IRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN-PDGMGEVE 228 (456) Q Consensus 150 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n-~~g~s~~~ 228 (456) .+|. + ..|..+++.++.. ...| .+ -.|.|-+. 
T Consensus 145 -----~~y~---~-----~~~~~~eiih~~~------------------------------~~~~----~~~~~G~s~~~ 177 (403) T protein:vir:80 145 -----IWYQ---G-----KAYNYDEVLHFIV------------------------------NPDP----EKPYMGRGYRV 177 (403) T ss_pred -----EEEe---e-----cccchhhEEEEec------------------------------cCCC----cCccccccHHH Confidence 1111 0 1122222222210 0000 01 13555444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccc-hhhh-hhhhhh--hccceeccCC-CceeEee Q lcl|NC_021301. 229 PHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGN-AIDY-ASIFEA--APGALWELPP-GVDIWES 300 (456) Q Consensus 229 ~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~-~~~~-~~~~~~--~~~~~~~~~~-d~~~~~~ 300 (456) .+. +.++....-......++. .|-.++.. .... .++... .... ...+.. ..|..+.++. ..++.++ T Consensus 178 ~~~---~~i~~~~~~~~~~~~~~~ng~~p~~il~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 251 (403) T protein:vir:80 178 VLK---DIVNNLKQATTTKKSFMSGKYMPSLIVKV-DAAT--AELSSEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQV 251 (403) T ss_pred HHH---HHHHHHHHHHHHHHHHHhccCCcceEEEe-CCCC--ChHHHHHHHHHHHHHHhhhhhcCCeeeeccccccccee Confidence 333 333322221111222222 23333321 1111 111111 1100 111111 1223333332 2334443 Q ss_pred c---ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021301. 301 Q---TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE 377 (456) Q Consensus 301 ~---~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~ 377 (456) . ..++ .+++..+....+|+.+-++|++.+|... +.+.....+....|.-.+ ..+++.+..- +. . 
T Consensus 252 ~~l~~~d~-q~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~~~~~~f~~~~l~P~~--------~~ie~~l~~k--ll-~ 318 (403) T protein:vir:80 252 KPLSLKDL-AIHETVELDKRTVAGIFGVPAFLLGVGK-YDKDEYNNFINSTILPIA--------KGIEQELTRK--LL-I 318 (403) T ss_pred ccCCHHHH-HHHHHHHHhHHHHHHHhCCCHHHcCCCC-ccHHHHHHHHHHHHHHHH--------HHHHHHHHHh--cc-C Confidence 3 2233 3677788889999999999999987432 223322222222222222 1222211110 11 1 Q ss_pred CcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH---HHHHHHHHHHHHHhhhhh----hhc Q lcl|NC_021301. 378 SVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ---DDLDRAREQITLFAGNSV----QRP 450 (456) Q Consensus 378 ~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~---~e~~~~~ee~~~~~~~~~----~~~ 450 (456) ..++.+++....-+..|..+.++++.++.++|+++.-.+++.+|+.|.+-.. +...... ++..++... ... T Consensus 319 ~~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~n~~p--l~~~~~~~~~k~ge~~ 396 (403) T protein:vir:80 319 SPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVILENYIP--LDKIGDQNKLKGGEKG 396 (403) T ss_pred CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccc--hhhccchhhccCCCCC Confidence 1223333333445677899999999999999999999999999987642111 0000000 011111111 111 Q ss_pred ccccCC Q lcl|NC_021301. 451 QEDGSR 456 (456) Q Consensus 451 ~~d~~~ 456 (456) +++|.+ T Consensus 397 ~~~~~~ 402 (403) T protein:vir:80 397 GADGQT 402 (403) T ss_pred CCCCCC Confidence 222222 No 216 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.07 E-value=5.4e-06 Score=49.47 Aligned_cols=434 Identities=7% Similarity=-0.020 Sum_probs=183.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCc-ccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc------ Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAP-LPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP------ 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~-i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~------ 73 (456) |+...-..-.+.|..+-.....+++.+.+|.--... ...............+.++-.+-+...|+++++.|.+ T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~~~ltPp~ 80 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDSTAGDGLETLSSSLHGSLTSPA 80 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccchHHHHHHHHHHHHHHhhcCCC Confidence 555544444444555445556667777777431110 0000000000011224456667888888988888764 Q ss_pred CC-eecCCC-Cc----ccHH-------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC--CCceEEEEEcccee Q lcl|NC_021301. 74 NG-ITVGGS-AD----SDLA-------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD--DGTATITADSPETM 138 (456) Q Consensus 74 ~~-~~~~~~-~d----~~~~-------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~--dg~~~i~~~~p~~~ 138 (456) .+ |++... .+ .+.. ..+.+.+..++|.....++.++..++|.+.+++..++ .+.+++..++..++ T Consensus 81 ~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~~~pl~~~ 160 (547) T protein:vir:10 81 TKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQSSPIQDS 160 (547) T ss_pred CcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEEEeecceE Confidence 23 333221 11 1111 1133456667888889999999999999988887654 35678999998888 Q ss_pred EEEEeCCCCceEEEEEEEEEec-------CCceE-----EEEE-EcCC------eEEEEEEeeeecccc----------- Q lcl|NC_021301. 139 VVSVDPLQPWRIRSAMRWWRDL-------DAESD-----FAIV-WSGD------GWQKFARPCFVQSSS----------- 188 (456) Q Consensus 139 ~~~~d~~~~~~~~~~~~~~~~~-------d~~~~-----~~~~-~~~~------~~~~~~~~~~~~~~~----------- 188 (456) ++.-|+.. ++...++.++-. .|... ...+ ..++ .+++.+......... 
T Consensus 161 ~v~~d~~G--~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v~~~~~~~~~~~~~~~~~~~~ 238 (547) T protein:vir:10 161 YFEEDSRG--QVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCVFTRYDKKQNRNAGTVLAPTE 238 (547) T ss_pred EEeeCCCc--CeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEEeeccCCCCCccccceeeccc Confidence 87776543 334444433210 00000 0000 0000 011111000000000 Q ss_pred ---cceeeccCC-CceeecccccccCceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhh Q lcl|NC_021301. 189 ---RRRLVTRIS-DSWVPVGDAVVTGSPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK 259 (456) Q Consensus 189 ---~~~~~~~~~-~~~~~~~~~~~~~~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~ 259 (456) ..+..+..+ .....++. +..+|.+++ | .+.+|+|-.+..++.+..+|...-.....++....|.+.+- T Consensus 239 ~p~~s~~~e~~~~~~~l~esg---~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~ 315 (547) T protein:vir:10 239 RPFGKKWILKEGAVQLGEEGG---YYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSSEKVIDPAIMVT 315 (547) T ss_pred cceeEEEEEecCceeeeecCC---cccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceecc Confidence 000001111 11111111 112232222 1 34689999999999999999888888888888887765331 Q ss_pred --cCCCcccccccccchhhhhhhhhhhccceeccCCCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccC Q lcl|NC_021301. 260 --SAGHGLPKVDENGNAIDYASIFEAAPGALWELPPGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSAN 336 (456) Q Consensus 260 --g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N 336 (456) |.. ++ +...+|.+...+....+..++ .++...-...++.+...|...--...... 
.+..+ T Consensus 316 ~~g~~----------~~------~~~~pgg~~~~~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~-~~~~~ 378 (547) T protein:vir:10 316 ERGLI----------SD------IDLGASGLTVVRDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQM-KDSPA 378 (547) T ss_pred ccccc----------cc------ceecCCeeeecCCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhc-CCCcc Confidence 111 11 112233332222222333232 12333333333333333332211111111 11222 Q ss_pred cHHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHhcCCC---------cccceeEEecCCCCcCHH------ Q lcl|NC_021301. 337 QSAEGAHNIEKGFLFKC----EDRL-SIAKIGLEAILVKALQIEGES---------VEDTVDVSFESPDRVTLG------ 396 (456) Q Consensus 337 ~Sg~Al~~~~~~l~~k~----~~~~-~~f~~~l~~~~~l~~~~~~~~---------~~~~i~v~f~~~~~~~~~------ 396 (456) -+|.-++.....+.+.. .+.+ ..+.+-+.+.+.++....-.+ ....++|++..++-..-. T Consensus 379 ~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~ 458 (547) T protein:vir:10 379 MTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAAS 458 (547) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHHHHHHHHHHHH Confidence 34444444333322221 1222 233444455555554321111 123466777665433211 Q ss_pred --HHHHHHHHHHhcC--C---CcHHH----HHHhCCCC------hhHHHHHHHHHHHHHHHHH--------hhhhhhhcc Q lcl|NC_021301. 397 --EKYAAASLAKAAG--E---SWASI----RRNILNYN------ADQIKQDDLDRAREQITLF--------AGNSVQRPQ 451 (456) Q Consensus 397 --e~ad~~~kl~~~g--~---~s~~t----~~~~~~~~------~~~~~~~e~~~~~ee~~~~--------~~~~~~~~~ 451 (456) ..++.+..+.+++ + +.-.. ....+|++ ++|++++..+|.+.+.... ++....... T Consensus 459 i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~~~ 538 (547) T protein:vir:10 459 IERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQGK 538 (547) T ss_pred HHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 1112222222211 1 11111 22345653 4455544444333222111 111111112 Q ss_pred cccC-C Q lcl|NC_021301. 
452 EDGS-R 456 (456) Q Consensus 452 ~d~~-~ 456 (456) .+++ | T Consensus 539 ~~a~~~ 544 (547) T protein:vir:10 539 GQAALK 544 (547) T ss_pred cccchh Confidence 2222 2 No 217 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=98.04 E-value=6.4e-06 Score=49.07 Aligned_cols=404 Identities=9% Similarity=-0.029 Sum_probs=151.3 Q ss_pred CCCCC--HHHHHHHHHHHHHHHHHHHHHHHH----H-hcccC-----cccccCcccchhhhhhhhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTAST--PAEWLPVLTKRIDDGMSRVRLLAR----Y-SNGDA-----PLPELTRNTSAAWRSFQREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t--~~~~~~~l~~~~~~~~~r~~~~~~----Y-Y~g~~-----~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a 68 (456) |.-.+ |.+.+..-........+++...+. | |.|-. .|+. .+....-++.. ....+..-++++.. T Consensus 1 m~kk~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~iLr-~~~~~~ly~~m---~~D~hi~s~l~~Rk 76 (448) T protein:vir:77 1 MAKRGRKPKELVPGPGSIDPSDVPKLEGASVPVMSTSYDVVVDREFDELLQ-GKDGLLVYHKM---LSDGTVKNALNYIF 76 (448) T ss_pred CCCCCCCCcccCCcccccchhhhhhhccchhhhcccccccccccchhHhhc-cccchHHHHHH---hhChHHHHHHHHHH Confidence 21111 111110000000000000000000 0 11110 0110 00000011111 12455566677777 Q ss_pred hhhccCCeecCCCCcccHHHHH----HHHHHh-------cChhHHHHHHHHHHhhCCeE-EEEEee-CCCCceEEEEE-- Q lcl|NC_021301. 69 DRIIPNGITVGGSADSDLALRA----RRIWRD-------NRMDSVCKQWVKYGLDFGES-YLTCWR-RDDGTATITAD-- 133 (456) Q Consensus 69 ~~l~~~~~~~~~~~d~~~~~~l----~~~~~~-------n~~~~~~~~~~~~a~~~G~a-~~~v~~-d~dg~~~i~~~-- 133 (456) ..+.+.++++....++....++ .+.+.. ..|..++.++ .+|..||.+ ++++|. 
..+|...+..+ T Consensus 77 ~av~~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~~-lda~~~G~s~~Eivw~~~~dg~~~~~~l~~ 155 (448) T protein:vir:77 77 GRIRSAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIY-ENAYIYGMAAGEIVLTLGADGKLILDKIVP 155 (448) T ss_pred HHHhcCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHHH-HHhhhhcceeEEEEEeecCCCceeeccccc Confidence 7777777776432222222333 333222 2466666665 588999995 668885 46776543322 Q ss_pred -cccee-EEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCc Q lcl|NC_021301. 134 -SPETM-VVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGS 211 (456) Q Consensus 134 -~p~~~-~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (456) +++.. ...||+..+ ... .+....+.. ......+...++.. T Consensus 156 r~~~~~~~f~~~~~~~----------------l~~---~~~~~~~~~-------------------~~~~~~~~~lP~~~ 197 (448) T protein:vir:77 156 IHPFNIDEVLYDEEGG----------------PKA---LKLSGEVKG-------------------GSQFVNGLEIPIWK 197 (448) T ss_pred cCCCccceeeeecCCc----------------eEE---EecCCcccc-------------------cccCCCccccccce Confidence 33221 112222211 110 000000000 00000000011111 Q ss_pred eeEEE--EccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhhhhccce Q lcl|NC_021301. 212 PPPVV--VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFEAAPGAL 288 (456) Q Consensus 212 ~~pvv--~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 288 (456) ++-.. ...|+.|.|.+..+.-..--=+..+.+.+..++.++.|.++.+--. +....++..... ..+..+..+.... 
T Consensus 198 ~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~-ga~~~~~~~~~l~~av~~i~~g~~a~ 276 (448) T protein:vir:77 198 TVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPK-SVRQGTKQWEAAKEIVKNFVQKPRHG 276 (448) T ss_pred EEEEecCCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCC-CCCCCHHHHHHHHHHHHHHhcCCceE Confidence 11111 1247788998887544322233446777888999999987665211 111111111111 1122222233344 Q ss_pred eccCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_021301. 289 WELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-A 366 (456) Q Consensus 289 ~~~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~ 366 (456) ..++.+.++..+.. .....+...++.+-.+|+.+.--.....+.. +..++.+......-.......-.+.+...+. + T Consensus 277 ~iiP~g~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~-~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~ 355 (448) T protein:vir:77 277 IILPDDWKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLN-MGVQAVNIGEFVSLTQQTIISLQREFASAVNLY 355 (448) T ss_pred EEecCCceEEEEecCCCccCHHHHHHHHHHHHHHHHhccccccccc-cchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 45666665533322 1222344556666666665442111111111 1112222222111111112222233444453 4 Q ss_pred HHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHH-HHHHHHHHHhhh Q lcl|NC_021301. 367 ILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLD-RAREQITLFAGN 445 (456) Q Consensus 367 ~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~-~~~ee~~~~~~~ 445 (456) +++-++.+..-....--.+.|....+.|.++.++.+.+|++. .++.+|+........+.. ...++.....+. 
T Consensus 356 Li~~l~~lNfg~~~~~P~~~f~~~e~eDl~~~a~~~~~l~~~-------~~~~~~ip~~~~~~~~~~~~~~~~~~~~~~~ 428 (448) T protein:vir:77 356 LIPKLVLPNWPGATRFPRLTFEMEERNDFSAAANLMGMLINA-------VKDSEDIPTELKALIDALPSKMRRALGVVDE 428 (448) T ss_pred HHHHHHHhcCCCCCCCCEEEecCCChhhHHHHHHHhHHHHHH-------HHHHhcCCccCCcCCCCCchhcccccCCCCC Confidence 555555544222222236788888999999999988888742 233333321100000000 000000000000 Q ss_pred -hhhhcccccCC Q lcl|NC_021301. 446 -SVQRPQEDGSR 456 (456) Q Consensus 446 -~~~~~~~d~~~ 456 (456) .........+| T Consensus 429 ~~~~~~~~~~~~ 440 (448) T protein:vir:77 429 VREAVRQPADSR 440 (448) T ss_pred CCchhhcchhhH Confidence 00001111122 No 218 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=98.00 E-value=7.7e-06 Score=48.62 Aligned_cols=411 Identities=10% Similarity=-0.010 Sum_probs=173.3 Q ss_pred CCCCC----HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhh--ccChHHHHHHHHHhhhccC Q lcl|NC_021301. 1 MTAST----PAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRIIPN 74 (456) Q Consensus 1 ~~~~t----~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~--~~n~~~~iVd~~a~~l~~~ 74 (456) ||..| |.--+.+...+= .... .+.|+--.....+.. +..++ +...+ .-.+..-+++.....+.+- T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~--~~~~----~~~~~~~e~~~~lr~--~~~~~-ly~~m~e~D~~i~s~l~~rk~av~~~ 71 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSG--VVDG----WTVWDPFEQTPELQW--PQSVA-VYSRMDNEDSRVTSLLEAISLPIRST 71 (469) T ss_pred CCCcccCCCCccchhhhhhcc--cccc----hhhcccccccccccc--ccchH-HHHHHHhhChHHHHHHHHHHHHHhcC Confidence 66554 432222222110 0111 122221100111100 11111 11111 2567777888888888888 Q ss_pred CeecCCCCcc-cHHHHHHH----HHH-------------hcChhHHHHHHHHHHhhCCeE-EEEEeeCC----CCceEEE Q lcl|NC_021301. 
75 GITVGGSADS-DLALRARR----IWR-------------DNRMDSVCKQWVKYGLDFGES-YLTCWRRD----DGTATIT 131 (456) Q Consensus 75 ~~~~~~~~d~-~~~~~l~~----~~~-------------~n~~~~~~~~~~~~a~~~G~a-~~~v~~d~----dg~~~i~ 131 (456) ++++....+. +..+.+.+ .+. +..+...+.++...++.||.+ ++++|... +|...+. T Consensus 72 ~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~ 151 (469) T protein:vir:10 72 PWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLR 151 (469) T ss_pred CceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeee Confidence 8887543322 22222222 111 123456667777778889995 66888632 4555444 Q ss_pred EEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCc Q lcl|NC_021301. 132 ADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGS 211 (456) Q Consensus 132 ~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (456) .+.++.- ....++..+.++.....+......-..... . ..+.. +...+..+ T Consensus 152 ~l~~rp~------------~~i~~~~~~~~~~l~~~~~~~~~~~~~~~~-----------~--~~~~~----~~~lp~~k 202 (469) T protein:vir:10 152 KLAPRPQ------------WTISKFNVAPDGGLESIEQIAPPARTRGSL-----------Y--VANIA----PPEIPVNR 202 (469) T ss_pred eeeecCc------------ccceeeeeccCCceeeeeecCccccccccc-----------c--cCCCC----ccccccCc Confidence 3332210 000011122222221111100000000000 0 00000 01112223 Q ss_pred eeEEEE---ccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhhhhccc Q lcl|NC_021301. 212 PPPVVV---YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFEAAPGA 287 (456) Q Consensus 212 ~~pvv~---~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 287 (456) ++...+ ..|+.|.|.+..+....--=+..+.+.+...+.++.|.++.+- +... .++....+ .....+..+... 
T Consensus 203 ~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky--~~~a-~~~ek~~l~~a~~~~~~g~~a 279 (469) T protein:vir:10 203 LVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTA--SSAT-DEDEVRKMAALARSVRGGINA 279 (469) T ss_pred EEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEec--CCCC-CHHHHHHHHHHHHHHhcCCce Confidence 333332 2578899998876544333334577778888999988776542 1111 11122221 122222223344 Q ss_pred eeccCCCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 288 LWELPPGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE 365 (456) Q Consensus 288 ~~~~~~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~ 365 (456) ...++.+.++.-+. ..+...|...++.+-.+|+.+.--..-..+ ..++.+.|.. ...-....++.-.+.+...+. T Consensus 280 ~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~v---h~ev~~d~~~sDa~~i~~tln 356 (469) T protein:vir:10 280 GVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALASV---LEDPFTQAVHAYATSICRIAN 356 (469) T ss_pred EEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 45567666654333 234456777777777777665421111111 1111221221 111122233333345556664 Q ss_pred -HHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCC-----CcHHHHHHhCCCChhHHHHHHHHHHHHHH Q lcl|NC_021301. 366 -AILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGE-----SWASIRRNILNYNADQIKQDDLDRAREQI 439 (456) Q Consensus 366 -~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~-----~s~~t~~~~~~~~~~~~~~~e~~~~~ee~ 439 (456) ++++-++.+.........+++|.... .+....++.+.+|.++|+ ++.+-+++.+|+.+.+..+......+... 
T Consensus 357 ~~li~~l~~lN~g~~~~~P~~~~~~~e-~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~~ 435 (469) T protein:vir:10 357 QHIIEDLVDINFGVDTPAPVLTFDPIG-SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRFNLPSELNDTPSAEPEEPAA 435 (469) T ss_pred HHHHHHHHHhcCCCCCCccEEEecCCC-CcHHHHHHHHHHHHhcCCccCccccHHHHHHHhCCCCCCCCcccccchhccc Confidence 46665555553223333456776554 455678899999999997 34566788888853221111111111110 Q ss_pred HH----Hhhhhhhhccc-ccCC Q lcl|NC_021301. 440 TL----FAGNSVQRPQE-DGSR 456 (456) Q Consensus 440 ~~----~~~~~~~~~~~-d~~~ 456 (456) .. ........... ..++ T Consensus 436 ~~~~~~~~~~~~~~~~~~~~~~ 457 (469) T protein:vir:10 436 VPNQSAAPARTRSSGNADARAR 457 (469) T ss_pred CCCCCccccccCCCCCcccccc Confidence 00 00000000000 0000 No 219 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=97.99 E-value=7.9e-06 Score=48.56 Aligned_cols=376 Identities=12% Similarity=0.009 Sum_probs=174.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -..-||..|..-|...-.-...++-.+ ||+-. -.-.+..-++.+....+.+.++++.. T Consensus 37 ~~gltp~~l~~iL~~a~~gd~~~~~~L--~~dm~--------------------~~D~hi~s~l~~Rk~av~~~~w~I~p 94 (512) T protein:vir:19 37 SSGVTPNRAAQMLRDAERGDLTAQADL--AFDME--------------------EKDTHLFSELSKRRLAIQALEWRIAP 94 (512) T ss_pred ccCCCHHHHHHHHHHhhCCCHHHHHHH--HHHHH--------------------hhChHHHHHHHHHHHHHhCCCceEec Confidence 123334443333332221111111111 11100 01344555666666667777777653 Q ss_pred CCcc-----cHHHHHHHHHHhc-ChhHHHHHHHHHHhhCCeE-EEEEeeCCCCceE---EEEEccceeEEEEeCCCCceE Q lcl|NC_021301. 
81 SADS-----DLALRARRIWRDN-RMDSVCKQWVKYGLDFGES-YLTCWRRDDGTAT---ITADSPETMVVSVDPLQPWRI 150 (456) Q Consensus 81 ~~d~-----~~~~~l~~~~~~n-~~~~~~~~~~~~a~~~G~a-~~~v~~d~dg~~~---i~~~~p~~~~~~~d~~~~~~~ 150 (456) +.+. ...+.+.+.+..- +|+..+..+. +|.-||.+ ++++|.-.+|... +..++|+.+ .|++.....+ T Consensus 95 ~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f--~~~~~~~~~l 171 (512) T protein:vir:19 95 ARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKGYSMQEIEWGWLGKMRVPVALHHRDPALF--CANPDNLNEL 171 (512) T ss_pred CCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhcceeeeeEeeeeCCceeeeeeeeeccccc--eeccCCCcEE Confidence 3221 2223355555443 5777776654 68889984 6678865555433 444444432 2333222111 Q ss_pred EEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEE---ccCCCCCCcH Q lcl|NC_021301. 151 RSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVV---YQNPDGMGEV 227 (456) Q Consensus 151 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~---~~n~~g~s~~ 227 (456) . + .....++ ..-+..+++...+ ..|+.|.|.+ T Consensus 172 r----~-~~~~~~G----------------------------------------~~l~~~k~i~~~~~~~~g~p~g~gLl 206 (512) T protein:vir:19 172 R----L-RDASYHG----------------------------------------LELQPFGWFMHRAKSRTGYVGTNGLV 206 (512) T ss_pred E----e-cCCCCCc----------------------------------------eeecCCceEEEeccCCCCCcccccHH Confidence 0 0 0000000 0001112222221 3577899998 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhhhhccceeccCCCcee--Eeecccc Q lcl|NC_021301. 228 EPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFEAAPGALWELPPGVDI--WESQTND 304 (456) Q Consensus 228 ~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~--~~~~~~~ 304 (456) ..+....---+..+.+....++.++.|.++.+- +... .++....+ ..+. 
..+.+....++.+.++ .+....+ T Consensus 207 r~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky--~~~a-~~~ek~~L~~al~--~~~~~a~~iiP~~~~ie~~ea~~~~ 281 (512) T protein:vir:19 207 RTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKY--PTGS-TNREKATLMQAVM--DIGRRAGGIIPMGMTLDFQSAADGQ 281 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEec--CCCC-CHHHHHHHHHHHH--HHhhCcEEEecCCceEEEeecCCCC Confidence 876544444445577888889999988665541 1111 11122221 1112 2233344456666554 3333345 Q ss_pred hHHHHHHHHHHHHHHHhhc-CCChhh-hcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCcc- Q lcl|NC_021301. 305 FTPMLSAIKEHIRQLSSAT-KTPLPM-LMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGESVE- 380 (456) Q Consensus 305 ~~~~~~~l~~~~~~i~~~~-~~p~~~-~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~~l~~~~~~~~~~- 380 (456) ...|...++.+-.+|+.+. |=.... -|..++++.|. ....-....++.-.+.+...+. ++++-++.+...... T Consensus 282 ~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~---vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~~~~~~ 358 (512) T protein:vir:19 282 SDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGE---VHDEVRREIRNADVGQLARSINRDLIYPLLALNSDSTID 358 (512) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCC Confidence 5668888888888887652 211000 01111222221 1122233334444466677774 577777776653221 Q ss_pred --cceeEEecCCCCcCHHHHHHHHHHHHhcC-CCcHHHHHHhCCCChhHHH-HH-------------------------- Q lcl|NC_021301. 381 --DTVDVSFESPDRVTLGEKYAAASLAKAAG-ESWASIRRNILNYNADQIK-QD-------------------------- 430 (456) Q Consensus 381 --~~i~v~f~~~~~~~~~e~ad~~~kl~~~g-~~s~~t~~~~~~~~~~~~~-~~-------------------------- 430 (456) .--.+.|....+.|....++.+.++. .| .++.+.+.+.+|+...+.. .. 
T Consensus 359 ~~~~p~~~f~~~e~eDl~~~a~~~~~l~-~G~~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 437 (512) T protein:vir:19 359 INRLPGIVFDTSEAGDITALSDAIPKLA-AGMRIPVSWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQKEAALSAEDIPQE 437 (512) T ss_pred ccccceEEecCCChhhHHHHHHHHHHHh-cCCCCCHHHHHHHhCCCCCCCccccccCCCccccccccccccccccCCCch Confidence 23467888889999999999999886 67 4688888898887421100 00 Q ss_pred -HHHHHH-------HHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 431 -DLDRAR-------EQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 431 -e~~~~~-------ee~~~~~~~~~~~~~~d~~~ 456 (456) +.++.. +.++.........-.. ++- T Consensus 438 ~~~d~~~~~~~~~~~~~~~~~~~i~~~~~~-~s~ 470 (512) T protein:vir:19 438 DDIDRMGVSPEDWQRSVDPLLKPVIFSVLK-DGP 470 (512) T ss_pred hhHhHHhhhHHHHHHHHHHHHHHHHHHHHh-CCH Confidence 000000 0000000000000000 000 No 220 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=97.99 E-value=6.2e-08 Score=60.10 Aligned_cols=182 Identities=15% Similarity=0.049 Sum_probs=91.2 Q ss_pred hhhhhcCCCcccccccccchhhhhhhhhhh---ccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhh- Q lcl|NC_021301. 255 QRALKSAGHGLPKVDENGNAIDYASIFEAA---PGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPML- 330 (456) Q Consensus 255 ~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~- 330 (456) ...++|+... ..+..+.....+..+... .+.+...+.+.++.++ .++++++.+.+......+++.++||..-| T Consensus 1 V~k~~~l~~~--~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~-~~~lsGl~d~l~~~~~~iaa~s~iP~t~Lf 77 (201) T protein:vir:10 1 MWKAKGLADL--CDDSDGAARLRLAQVDNNSGVGQAIGIDADSEEYNVL-NSDIGGIDTFLSQKFDRIVALSGIHEIILK 77 (201) T ss_pred CccchHHHHH--hcCChHHHHHHHHHHHHhhhhhhhheeecCCcceeee-ecCcCChHHHHHHHHHHHHhHhcCchhhhc Confidence 0001111100 000001111111111111 1222233333445444 35678888999999999999999998555 Q ss_pred ccccc--CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHH-------HH Q lcl|NC_021301. 
331 MPDSA--NQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKY-------AA 401 (456) Q Consensus 331 ~~~~~--N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~a-------d~ 401 (456) |...+ |+||+.-..-|...+.- ..+..+.+.+++++.++.. ...+.+.|+|-...+.++.| ++ T Consensus 78 G~sp~Glnatge~d~~nyyd~i~~--~Qe~~l~p~le~l~~~~~~------~~~~~~~f~pL~~~s~kekAei~~~~a~a 149 (201) T protein:vir:10 78 GKNVGGVSASQNTALETFYGYVDR--KRKAELLPLLEFLLPFIVT------EQEWSVEFNPLSQVSDKDKSEILEKNVNS 149 (201) T ss_pred CCCCccccccchhHHHHHHHHHHH--HHHHHHHHHHHHHHHhhcC------CCCceEeeCCCCCCCHHHHHHHHHHHHHH Confidence 54322 56787544444433332 2236678888888886542 34678889999988888765 55 Q ss_pred HHHHHhcCCCcHHHHHHhC------CCChhHHHHHHHHHHHHHHHHHhhhhhhhcccc Q lcl|NC_021301. 402 ASLAKAAGESWASIRRNIL------NYNADQIKQDDLDRAREQITLFAGNSVQRPQED 453 (456) Q Consensus 402 ~~kl~~~g~~s~~t~~~~~------~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d 453 (456) ..+++++|+++...+++.| |+.++..-+.+.+ ..+..+ ....|..+ T Consensus 150 ~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~-~~e~~d-----p~~~~~~~ 201 (201) T protein:vir:10 150 VAALIAAGIIDADEARDTLRAISTEVKIGEGSIQTEVV-INESED-----PLDVSANN 201 (201) T ss_pred HHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCCCcccc-ccccCC-----CCCCCCCC Confidence 6666777888877666543 2332211111111 111111 11111112 No 221 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=97.99 E-value=7.9e-06 Score=48.55 Aligned_cols=435 Identities=11% Similarity=0.049 Sum_probs=185.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc--- Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP--- 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~--- 73 (456) |.+.++...+..-.+.+..+ ..+++.+.+|.--...-...+.... 
......++..+-+...++.+++.|.+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~--~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNR--GEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCc--chhcccccccccHHHHHHHHHHHHHHhhc Confidence 99999876555544444333 3445555566422110000001111 12223456667788888888888764 Q ss_pred ---CC-eecCCC-CcccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccce Q lcl|NC_021301. 74 ---NG-ITVGGS-ADSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPET 137 (456) Q Consensus 74 ---~~-~~~~~~-~d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~ 137 (456) .| |++... .+.+.. ..+.+.+..++|.....++.++..++|.+.+++..+..+.+++..++..+ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:98 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 23 333321 111111 11334566688888899999999999999998888877778889999888 Q ss_pred eEEEEeCCCCceEEEEEEEEEec-------CCceEE----EEEEc---CCeEEEEEEeeeecccc--c----------ce Q lcl|NC_021301. 138 MVVSVDPLQPWRIRSAMRWWRDL-------DAESDF----AIVWS---GDGWQKFARPCFVQSSS--R----------RR 191 (456) Q Consensus 138 ~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~~----~~~~~---~~~~~~~~~~~~~~~~~--~----------~~ 191 (456) +++.-|+.. ++..+++.++-. .|...- ...+. .+..+.+....+...+. . 
.+ T Consensus 159 ~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~ 236 (555) T protein:vir:98 159 YAIAADNQG--RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSV 236 (555) T ss_pred eEEeeCCCC--CEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEE Confidence 887766543 344444443210 000000 00000 00101110000000000 0 00 Q ss_pred eecc-CCC-ceeecccccccCceeEEE--Ec---cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCc Q lcl|NC_021301. 192 LVTR-ISD-SWVPVGDAVVTGSPPPVV--VY---QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHG 264 (456) Q Consensus 192 ~~~~-~~~-~~~~~~~~~~~~~~~pvv--~~---~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~ 264 (456) .++. ..+ ....+.. +..+|.++ |. .+.+|+|-.+..++-+..+|...-......+....|.+.+-- T Consensus 237 ~~~~~~d~~~vl~esg---y~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~---- 309 (555) T protein:vir:98 237 YFEPGADETRTLRESG---YRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPV---- 309 (555) T ss_pred EEEeccCCccccccCC---cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc---- Confidence 1110 011 1111111 11223222 21 346899999999999999988776677777777766444311 Q ss_pred ccccccccchhhhhhhhhhhccceecc----CCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCCh-hhhc-ccccCc Q lcl|NC_021301. 265 LPKVDENGNAIDYASIFEAAPGALWEL----PPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPL-PMLM-PDSANQ 337 (456) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~-~~~~-~~~~N~ 337 (456) ++.. .. +...+|.+... +.+.-.-.++. .++....+.++.+...|....-.+. ..+. 
.+..+- T Consensus 310 ------~~~~-~~---~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~ 379 (555) T protein:vir:98 310 ------SAKN-QD---ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQM 379 (555) T ss_pred ------cccc-cc---ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcc Confidence 1100 01 11222222111 11221222222 2444444444444444443322211 1111 122233 Q ss_pred HHHHHHHHHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHhcCCC------cccceeEEecCCCCcCHH--------HH Q lcl|NC_021301. 338 SAEGAHNIEKGFLFKCE----DR-LSIAKIGLEAILVKALQIEGES------VEDTVDVSFESPDRVTLG--------EK 398 (456) Q Consensus 338 Sg~Al~~~~~~l~~k~~----~~-~~~f~~~l~~~~~l~~~~~~~~------~~~~i~v~f~~~~~~~~~--------e~ 398 (456) +|.-++.....+.+... +. ...+.+-+.+.+.++.+..-.+ ....++|.|..++-..-. .. T Consensus 380 TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~ 459 (555) T protein:vir:98 380 TATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRF 459 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHH Confidence 55544433322222211 11 1223344445555544321111 123466777665432111 11 Q ss_pred HHHHHHHHhcC--C---CcHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHHhhhhhhh----cccccC---- Q lcl|NC_021301. 399 YAAASLAKAAG--E---SWAS----IRRNILNYN------ADQIKQDDLDRAREQITLFAGNSVQR----PQEDGS---- 455 (456) Q Consensus 399 ad~~~kl~~~g--~---~s~~----t~~~~~~~~------~~~~~~~e~~~~~ee~~~~~~~~~~~----~~~d~~---- 455 (456) ++.+..+.+.+ + +.-. ...+.+|++ +++++++..+|.+.+..........+ .+.-|+ T Consensus 460 l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~ 539 (555) T protein:vir:98 460 VGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTS 539 (555) T ss_pred HHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccC Confidence 12222222211 0 1111 122345653 45555554444443322211111111 111111 Q ss_pred C Q lcl|NC_021301. 
456 R 456 (456) Q Consensus 456 ~ 456 (456) + T Consensus 540 ~ 540 (555) T protein:vir:98 540 K 540 (555) T ss_pred c Confidence 1 No 222 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=97.99 E-value=7.9e-06 Score=48.55 Aligned_cols=435 Identities=11% Similarity=0.049 Sum_probs=185.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc--- Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP--- 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~--- 73 (456) |.+.++...+..-.+.+..+ ..+++.+.+|.--...-...+.... ......++..+-+...++.+++.|.+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~--~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNR--GEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCc--chhcccccccccHHHHHHHHHHHHHHhhc Confidence 99999876555544444333 3445555566422110000001111 12223456667788888888888764 Q ss_pred ---CC-eecCCC-CcccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccce Q lcl|NC_021301. 74 ---NG-ITVGGS-ADSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPET 137 (456) Q Consensus 74 ---~~-~~~~~~-~d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~ 137 (456) .| |++... .+.+.. 
..+.+.+..++|.....++.++..++|.+.+++..+..+.+++..++..+ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 23 333321 111111 11334566688888899999999999999998888877778889999888 Q ss_pred eEEEEeCCCCceEEEEEEEEEec-------CCceEE----EEEEc---CCeEEEEEEeeeecccc--c----------ce Q lcl|NC_021301. 138 MVVSVDPLQPWRIRSAMRWWRDL-------DAESDF----AIVWS---GDGWQKFARPCFVQSSS--R----------RR 191 (456) Q Consensus 138 ~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~~----~~~~~---~~~~~~~~~~~~~~~~~--~----------~~ 191 (456) +++.-|+.. ++..+++.++-. .|...- ...+. .+..+.+....+...+. . .+ T Consensus 159 ~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~ 236 (555) T protein:vir:10 159 YAIAADNQG--RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSV 236 (555) T ss_pred eEEeeCCCC--CEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEE Confidence 887766543 344444443210 000000 00000 00101110000000000 0 00 Q ss_pred eecc-CCC-ceeecccccccCceeEEE--Ec---cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCc Q lcl|NC_021301. 192 LVTR-ISD-SWVPVGDAVVTGSPPPVV--VY---QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHG 264 (456) Q Consensus 192 ~~~~-~~~-~~~~~~~~~~~~~~~pvv--~~---~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~ 264 (456) .++. ..+ ....+.. +..+|.++ |. 
.+.+|+|-.+..++-+..+|...-......+....|.+.+-- T Consensus 237 ~~~~~~d~~~vl~esg---y~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~---- 309 (555) T protein:vir:10 237 YFEPGADETRTLRESG---YRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPV---- 309 (555) T ss_pred EEEeccCCccccccCC---cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc---- Confidence 1110 011 1111111 11223222 21 346899999999999999988776677777777766444311 Q ss_pred ccccccccchhhhhhhhhhhccceecc----CCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCCh-hhhc-ccccCc Q lcl|NC_021301. 265 LPKVDENGNAIDYASIFEAAPGALWEL----PPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPL-PMLM-PDSANQ 337 (456) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~-~~~~-~~~~N~ 337 (456) ++.. .. +...+|.+... +.+.-.-.++. .++....+.++.+...|....-.+. ..+. .+..+- T Consensus 310 ------~~~~-~~---~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~ 379 (555) T protein:vir:10 310 ------SAKN-QD---ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQM 379 (555) T ss_pred ------cccc-cc---ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcc Confidence 1100 01 11222222111 11221222222 2444444444444444443322211 1111 122233 Q ss_pred HHHHHHHHHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHhcCCC------cccceeEEecCCCCcCHH--------HH Q lcl|NC_021301. 338 SAEGAHNIEKGFLFKCE----DR-LSIAKIGLEAILVKALQIEGES------VEDTVDVSFESPDRVTLG--------EK 398 (456) Q Consensus 338 Sg~Al~~~~~~l~~k~~----~~-~~~f~~~l~~~~~l~~~~~~~~------~~~~i~v~f~~~~~~~~~--------e~ 398 (456) +|.-++.....+.+... +. ...+.+-+.+.+.++.+..-.+ ....++|.|..++-..-. .. 
T Consensus 380 TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~ 459 (555) T protein:vir:10 380 TATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRF 459 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHH Confidence 55544433322222211 11 1223344445555544321111 123466777665432111 11 Q ss_pred HHHHHHHHhcC--C---CcHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHHhhhhhhh----cccccC---- Q lcl|NC_021301. 399 YAAASLAKAAG--E---SWAS----IRRNILNYN------ADQIKQDDLDRAREQITLFAGNSVQR----PQEDGS---- 455 (456) Q Consensus 399 ad~~~kl~~~g--~---~s~~----t~~~~~~~~------~~~~~~~e~~~~~ee~~~~~~~~~~~----~~~d~~---- 455 (456) ++.+..+.+.+ + +.-. ...+.+|++ +++++++..+|.+.+..........+ .+.-|+ T Consensus 460 l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~ 539 (555) T protein:vir:10 460 VGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTS 539 (555) T ss_pred HHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccC Confidence 12222222211 0 1111 122345653 45555554444443322211111111 111111 Q ss_pred C Q lcl|NC_021301. 456 R 456 (456) Q Consensus 456 ~ 456 (456) + T Consensus 540 ~ 540 (555) T protein:vir:10 540 K 540 (555) T ss_pred c Confidence 1 No 223 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=97.99 E-value=7.9e-06 Score=48.55 Aligned_cols=435 Identities=11% Similarity=0.049 Sum_probs=185.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc--- Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP--- 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~--- 73 (456) |.+.++...+..-.+.+..+ ..+++.+.+|.--...-...+.... 
......++..+-+...++.+++.|.+ T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~--~~~~~~~~~dst~~~a~~~LAa~L~~~lt 78 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISDYLLPRAGRFFVQDRNR--GEKRHNNILDNTGTRALRVLAAGMMAGMT 78 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhCcccccccCCCCCc--chhcccccccccHHHHHHHHHHHHHHhhc Confidence 99999876555544444333 3445555566422110000001111 12223456667788888888888764 Q ss_pred ---CC-eecCCC-CcccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccce Q lcl|NC_021301. 74 ---NG-ITVGGS-ADSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPET 137 (456) Q Consensus 74 ---~~-~~~~~~-~d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~ 137 (456) .| |++... .+.+.. ..+.+.+..++|.....++.++..++|.+.+++..+..+.+++..++..+ T Consensus 79 pp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~~pl~~ 158 (555) T protein:vir:10 79 SPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHSLTAGE 158 (555) T ss_pred CCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEEeecce Confidence 23 333321 111111 11334566688888899999999999999998888877778889999888 Q ss_pred eEEEEeCCCCceEEEEEEEEEec-------CCceEE----EEEEc---CCeEEEEEEeeeecccc--c----------ce Q lcl|NC_021301. 138 MVVSVDPLQPWRIRSAMRWWRDL-------DAESDF----AIVWS---GDGWQKFARPCFVQSSS--R----------RR 191 (456) Q Consensus 138 ~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~~----~~~~~---~~~~~~~~~~~~~~~~~--~----------~~ 191 (456) +++.-|+.. ++..+++.++-. .|...- ...+. .+..+.+....+...+. . 
.+ T Consensus 159 ~~v~~d~~G--~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~V~pr~~~~~~~~~~~~~p~~s~ 236 (555) T protein:vir:10 159 YAIAADNQG--RVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHAIEPRADRDPSKRDDRNMAWKSV 236 (555) T ss_pred eEEeeCCCC--CEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEEEeeccCcCcCCCCccccceEEE Confidence 887766543 344444443210 000000 00000 00101110000000000 0 00 Q ss_pred eecc-CCC-ceeecccccccCceeEEE--Ec---cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCc Q lcl|NC_021301. 192 LVTR-ISD-SWVPVGDAVVTGSPPPVV--VY---QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHG 264 (456) Q Consensus 192 ~~~~-~~~-~~~~~~~~~~~~~~~pvv--~~---~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~ 264 (456) .++. ..+ ....+.. +..+|.++ |. .+.+|+|-.+..++-+..+|...-......+....|.+.+-- T Consensus 237 ~~~~~~d~~~vl~esg---y~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~v~~---- 309 (555) T protein:vir:10 237 YFEPGADETRTLRESG---YRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQLPV---- 309 (555) T ss_pred EEEeccCCccccccCC---cccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeecc---- Confidence 1110 011 1111111 11223222 21 346899999999999999988776677777777766444311 Q ss_pred ccccccccchhhhhhhhhhhccceecc----CCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCCh-hhhc-ccccCc Q lcl|NC_021301. 265 LPKVDENGNAIDYASIFEAAPGALWEL----PPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPL-PMLM-PDSANQ 337 (456) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~-~~~~-~~~~N~ 337 (456) ++.. .. +...+|.+... +.+.-.-.++. .++....+.++.+...|....-.+. ..+. 
.+..+- T Consensus 310 ------~~~~-~~---~~~~pgg~~~v~~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~ 379 (555) T protein:vir:10 310 ------SAKN-QD---ISTVPGGLSYVDAAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQM 379 (555) T ss_pred ------cccc-cc---ceeccccccccccCCCCcceecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcc Confidence 1100 01 11222222111 11221222222 2444444444444444443322211 1111 122233 Q ss_pred HHHHHHHHHHHHHHHHH----HH-HHHHHHHHHHHHHHHHHhcCCC------cccceeEEecCCCCcCHH--------HH Q lcl|NC_021301. 338 SAEGAHNIEKGFLFKCE----DR-LSIAKIGLEAILVKALQIEGES------VEDTVDVSFESPDRVTLG--------EK 398 (456) Q Consensus 338 Sg~Al~~~~~~l~~k~~----~~-~~~f~~~l~~~~~l~~~~~~~~------~~~~i~v~f~~~~~~~~~--------e~ 398 (456) +|.-++.....+.+... +. ...+.+-+.+.+.++.+..-.+ ....++|.|..++-..-. .. T Consensus 380 TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~~i~~~ 459 (555) T protein:vir:10 380 TATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATNSVDRF 459 (555) T ss_pred cHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHHHHHHH Confidence 55544433322222211 11 1223344445555544321111 123466777665432111 11 Q ss_pred HHHHHHHHhcC--C---CcHH----HHHHhCCCC------hhHHHHHHHHHHHHHHHHHhhhhhhh----cccccC---- Q lcl|NC_021301. 399 YAAASLAKAAG--E---SWAS----IRRNILNYN------ADQIKQDDLDRAREQITLFAGNSVQR----PQEDGS---- 455 (456) Q Consensus 399 ad~~~kl~~~g--~---~s~~----t~~~~~~~~------~~~~~~~e~~~~~ee~~~~~~~~~~~----~~~d~~---- 455 (456) ++.+..+.+.+ + +.-. ...+.+|++ +++++++..+|.+.+..........+ .+.-|+ T Consensus 460 l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~~~~~~ 539 (555) T protein:vir:10 460 VGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLGSVDTS 539 (555) T ss_pred HHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccC Confidence 12222222211 0 1111 122345653 45555554444443322211111111 111111 Q ss_pred C Q lcl|NC_021301. 
456 R 456 (456) Q Consensus 456 ~ 456 (456) + T Consensus 540 ~ 540 (555) T protein:vir:10 540 K 540 (555) T ss_pred c Confidence 1 No 224 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=97.97 E-value=8.6e-06 Score=48.34 Aligned_cols=418 Identities=14% Similarity=0.103 Sum_probs=177.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhhc-----cC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRII-----PN 74 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l~-----~~ 74 (456) ..++..+--..... ..||-.-.++.. ..+...++-...+ .+.+.=+-.+|+..+.=.+ .+ T Consensus 23 ~~~~~~dg~~~~~~-------------~~~~g~~~~~e~-~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~ 88 (537) T protein:vir:10 23 VQKDSLDGSQPIVG-------------GGYFGYSVDFDG-TIRNDHELITRYREMVLNPECDSAVDDVVNETICGNFDDV 88 (537) T ss_pred cCCCcccccceeec-------------cccccccccccc-ccchHHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCc Confidence 00100000000000 011111111110 0000011111111 1222333444444443221 23 Q ss_pred CeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC----CceEEEEEccceeEEEEe Q lcl|NC_021301. 75 GITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD----GTATITADSPETMVVSVD 143 (456) Q Consensus 75 ~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d----g~~~i~~~~p~~~~~~~d 143 (456) ||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+...|.+ |-..++.++|+.+..+.. 
T Consensus 89 pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~lDPr~i~~vR~ 168 (537) T protein:vir:10 89 PISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYVDPRKIRKVTE 168 (537) T ss_pred eEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeEe Confidence 44432211 1122344556666678888999999999999999999877643 667899999998866542 Q ss_pred CCCCceEEEEEEEE---EecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EEEc Q lcl|NC_021301. 144 PLQPWRIRSAMRWW---RDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVVY 218 (456) Q Consensus 144 ~~~~~~~~~~~~~~---~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~~ 218 (456) -.... ...++.. ...........+|.+.+.+ . .++..+..... ..-.+. ++.. T Consensus 169 i~~~~--~~~~~~~~~~~~v~~~~~eyf~ynp~g~~--------~----------~~~~~vkI~~d--AI~y~hSGl~d~ 226 (537) T protein:vir:10 169 YEAKR--PEALRTQDLNQQLTQQSASYFLYNPKGLK--------N----------STNQGMKIAPD--SIAYCHSGIQDL 226 (537) T ss_pred ecccC--CccceEEecceeeeecccceeeecccccc--------c----------cCCCceeccHh--heeeecccceeC Confidence 11000 0001000 0000001111223322111 0 00000000000 000000 1223 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Ccccccccccchhhhhhh------hhhhccc---- Q lcl|NC_021301. 219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDENGNAIDYASI------FEAAPGA---- 287 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~~~~------~~~~~~~---- 287 (456) ++....|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+.-=..+.+.. ..+..|. 
T Consensus 227 n~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~dd 305 (537) T protein:vir:10 227 NKNMVLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDD 305 (537) T ss_pred CCCeeeeeehhhhHHHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceeccc Confidence 3444556666543322222 234444444444333433222111 112211110000000000 0000000 Q ss_pred ---------eec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-Cc-HHHHHHHHHHHHHHH Q lcl|NC_021301. 288 ---------LWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGAHNIEKGFLFK 352 (456) Q Consensus 288 ---------~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~-Sg~Al~~~~~~l~~k 352 (456) .|. .+.+..+..|+...--+-++-++-+-..++...++|.+-++...+ |. -|..|--....+..- T Consensus 306 rk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KF 385 (537) T protein:vir:10 306 KKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF 385 (537) T ss_pred chhhhhhhhhcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHH Confidence 011 112233444554432233444677777888999999888764422 21 223466666677778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------HHHHHHHh-cC-CCcHHHH Q lcl|NC_021301. 353 CEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------AAASLAKA-AG-ESWASIR 416 (456) Q Consensus 353 ~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d~~~kl~~-~g-~~s~~t~ 416 (456) +.+.+..|..-|.++++.-+.++|.-. + ..|.+.|.....-.+...+ +++..+.. 
.| .+|.+++ T Consensus 386 I~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi 465 (537) T protein:vir:10 386 IARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYI 465 (537) T ss_pred HHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHH Confidence 889999999999999998877777521 1 3477788765443333332 33332221 12 4688887 Q ss_pred H-HhCCCChhHHHHHHHHHHHHHHHH-Hhh-------------------hhhhhcccccCC Q lcl|NC_021301. 417 R-NILNYNADQIKQDDLDRAREQITL-FAG-------------------NSVQRPQEDGSR 456 (456) Q Consensus 417 ~-~~~~~~~~~~~~~e~~~~~ee~~~-~~~-------------------~~~~~~~~d~~~ 456 (456) + .+|.+++++++++. +.+++|... ... .....|+.|+|- T Consensus 466 ~k~ILr~tDeeI~~~~-k~I~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 525 (537) T protein:vir:10 466 RTKVLKQTESEIKEID-KEIKQEIADGVIMDPQAMQAMEMGIGDEEPVPEGGEEPQTDPNS 525 (537) T ss_pred HHHHhccCHHHHHHHH-HHHHHHhhCCCCCCcccccccccCCCCcccCCCCCCCcccCCcc Confidence 6 57899998887643 334444322 110 001122223322 No 225 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=97.92 E-value=1.1e-05 Score=47.78 Aligned_cols=430 Identities=12% Similarity=0.028 Sum_probs=179.9 Q ss_pred CCCCCH----HHHHHHHHHHHH----HHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTASTP----AEWLPVLTKRID----DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t~----~~~~~~l~~~~~----~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |...-. ++.++....++. ....+++.+.+|..-..- .+.... . +....++--+-+...++.+++.|. 
T Consensus 1 m~~~~~~~~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~~~---~~~~~~-~-~~~~~~~~dst~~~a~~~Laa~l~ 75 (535) T protein:vir:15 1 MADSKRTGLGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSLF---PKESDN-E-STDYTTPWQAVGARGLNNLASKLM 75 (535) T ss_pred CCccchhccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccc---CCCCCc-c-cccccccccccHHHHHHHHHHHHH Confidence 766653 333443444443 335566666777544310 011000 0 111123445567778888888775 Q ss_pred cC-----C-eecCCCC--------cccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc Q lcl|NC_021301. 73 PN-----G-ITVGGSA--------DSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT 127 (456) Q Consensus 73 ~~-----~-~~~~~~~--------d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~ 127 (456) +- | |++.... +.... ..+.+.+..++|.....++.++..++|.+-+++..+.++. T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 155 (535) T protein:vir:15 76 LALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSY 155 (535) T ss_pred HhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCc Confidence 42 2 3332111 11111 1234456678899999999999999999988887776677 Q ss_pred eEEEEEccceeEEEEeCCCCceEEEEEEEEEec-------CCceEEEEE--EcCCeEEEEEEeeeecccccceeec-cCC Q lcl|NC_021301. 128 ATITADSPETMVVSVDPLQPWRIRSAMRWWRDL-------DAESDFAIV--WSGDGWQKFARPCFVQSSSRRRLVT-RIS 197 (456) Q Consensus 128 ~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 197 (456) .+++.++-.++++.-|+. ++ +...++.++-. .+....... ..++....+..+.+...+...+... ... 
T Consensus 156 ~~f~~~pl~~~~v~~d~~-G~-vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~ 233 (535) T protein:vir:15 156 NPMKLYRLSSYVVQRDAY-GN-VLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLDEESGDYLKYEEVE 233 (535) T ss_pred eeeEEEEcCeeEEeeCCC-CC-eeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEecCCCcEEEEEEee Confidence 888888777776665544 33 34444443211 000000000 0011111111111111111111110 011 Q ss_pred CceeecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccc Q lcl|NC_021301. 198 DSWVPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDEN 271 (456) Q Consensus 198 ~~~~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~ 271 (456) +..........+++ +|.+++ | .+.+|+|-.++.++.+..+|...-......+....|...+- .+ T Consensus 234 g~~~~~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~----------~~ 303 (535) T protein:vir:15 234 DVEIDGSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN----------PA 303 (535) T ss_pred CccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec----------cc Confidence 11111011111122 222222 2 34689999999999999999888888888888777754331 00 Q ss_pred cchhhhhhhhhhhccceeccC-CCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHH Q lcl|NC_021301. 272 GNAIDYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGF 349 (456) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l 349 (456) |. ........++.|.+.... .+....++. .+++....+.++.+...|.... +.+.....+..+-+|.-++.. 
T Consensus 304 g~-~~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af-~~~~~~~~~~~r~TAtEV~~r---- 377 (535) T protein:vir:15 304 GI-TQPRRLTKAQTGDFVPGRREDIDFLQLEKQADFTVAKAVSDQIEARLSYAF-MLNSAVQRTGERVTAEEIRYV---- 377 (535) T ss_pred cc-ccchhcccCCceeeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHH-hhhhcccCCCccccHHHHHHH---- Confidence 10 011112233334333222 223344433 3345555555555555544332 111111112222344433333 Q ss_pred HHHHHHHHHHHHHHH------------HHHHHHHHHhcC--CCcccceeEEecCCCCcCH-HHHH----HHHHHHHhcC- Q lcl|NC_021301. 350 LFKCEDRLSIAKIGL------------EAILVKALQIEG--ESVEDTVDVSFESPDRVTL-GEKY----AAASLAKAAG- 409 (456) Q Consensus 350 ~~k~~~~~~~f~~~l------------~~~~~l~~~~~~--~~~~~~i~v~f~~~~~~~~-~e~a----d~~~kl~~~g- 409 (456) .+++...++..+ ++.+.++.+..- ..+...+++.|..++..-- .+.+ +.+..+.+.+ T Consensus 378 ---~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~la~~~P 454 (535) T protein:vir:15 378 ---ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAWAALAP 454 (535) T ss_pred ---HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHHHhcCh Confidence 333444444433 344444432111 1223346777765542211 1111 1122222111 Q ss_pred -C----CcHHHH----HHhCCCCh-------hHHHHHHHHHHHH-HHHHHhhhhhhhcccccCC Q lcl|NC_021301. 
410 -E----SWASIR----RNILNYNA-------DQIKQDDLDRARE-QITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 410 -~----~s~~t~----~~~~~~~~-------~~~~~~e~~~~~e-e~~~~~~~~~~~~~~d~~~ 456 (456) + +.-..+ ...+|+++ ++++++.++..++ +....+..........+.+ T Consensus 455 ~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~ 518 (535) T protein:vir:15 455 MQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALATS 518 (535) T ss_pred hhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhcc Confidence 0 111111 23355543 3333322211111 1111111111111122222 No 226 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=97.90 E-value=1.2e-05 Score=47.61 Aligned_cols=427 Identities=11% Similarity=0.029 Sum_probs=169.5 Q ss_pred CCCCCHHH-----HHHHHHHHHHHH----HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTASTPAE-----WLPVLTKRIDDG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~t~~~-----~~~~l~~~~~~~----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) |+++|..+ -++...+++..+ ..+++.+.+|.--..-..+ .... +....++..+-+...++.+++.| T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~---~~~~--~~~~~~~~dst~~~a~~~Laa~l 75 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSLFPKD---SDNA--STDYTTPWQAVGARGLNNLASKL 75 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCC---CCcc--ccccCCcccccHHHHHHHHHHHH Confidence 99988644 222233333222 4455556666443211000 0000 11122344566777888888877 Q ss_pred ccC-----C-eecCCCC--------cccHHHHH-----------HHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCC Q lcl|NC_021301. 
72 IPN-----G-ITVGGSA--------DSDLALRA-----------RRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDG 126 (456) Q Consensus 72 ~~~-----~-~~~~~~~--------d~~~~~~l-----------~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg 126 (456) .+- + |++...+ +.....++ ...+..++|.....++.++..++|.+.+++..+.+. T Consensus 76 ~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~ 155 (535) T protein:vir:94 76 MLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLYIPEPEGT 155 (535) T ss_pred HhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeccCcCc Confidence 542 2 3332211 00111122 334556889999999999999999998887666544 Q ss_pred ceEEEEEccceeEEEEeCCCCceEEEEEEEEEecC-------CceE-EEEEEcCCeEEEEEEeeeecccccceeecc-CC Q lcl|NC_021301. 127 TATITADSPETMVVSVDPLQPWRIRSAMRWWRDLD-------AESD-FAIVWSGDGWQKFARPCFVQSSSRRRLVTR-IS 197 (456) Q Consensus 127 ~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d-------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 197 (456) ..+++.++-.+.++.-| ..++ +...++.++-.- ++.. ...-+.++....+....+...+...+.... .. T Consensus 156 ~~~f~~~pl~~y~v~~d-~~G~-vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~ 233 (535) T protein:vir:94 156 YNPMKLYRLSSYVVQRD-AFGT-VLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLDEESGEYLKYEEID 233 (535) T ss_pred ccceEEEEcCeEEEeeC-CCCC-eEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEeeCCCCcEEEEEEec Confidence 45677776666555544 3343 444444432100 0000 000011112222222212111211221110 11 Q ss_pred CceeecccccccC-ceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccc Q lcl|NC_021301. 198 DSWVPVGDAVVTG-SPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDEN 271 (456) Q Consensus 198 ~~~~~~~~~~~~~-~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~ 271 (456) +..........++ .+|.+++ | .+.+|+|-.+..++-+..+|...-...........|...+. 
.+ T Consensus 234 g~~~~~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~----------p~ 303 (535) T protein:vir:94 234 GVEVEGTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVN----------PA 303 (535) T ss_pred CeeeccccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----------cc Confidence 1111101111122 2233222 2 34689999999999888888766655555555454432221 11 Q ss_pred cchhhhhhhhhhhccceeccC-CCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHHHH Q lcl|NC_021301. 272 GNAIDYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIEKG 348 (456) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~~~ 348 (456) |. .........+.|.+.... .+....++. .++++.-...++.+...|....-+ ..+. .+...-+|.-++... T Consensus 304 g~-~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~--~~~~~~d~~rvTAtEV~~r~-- 378 (535) T protein:vir:94 304 GI-TQVRRLTKAQTGDFVSGRPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFML--NSAVQRTGERVTAEEIRYVA-- 378 (535) T ss_pred cc-cchhhcccCCCceeecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhH--hhhccCCCCCccHHHHHHHH-- Confidence 11 111112233334443322 222334443 234444444444444444332211 1111 122223444444333 Q ss_pred HHHHHHHHHHHHHHH------------HHHHHHHHHHhcCC--CcccceeEEecCCCCcCHHHHHHHHHH-------HHh Q lcl|NC_021301. 349 FLFKCEDRLSIAKIG------------LEAILVKALQIEGE--SVEDTVDVSFESPDRVTLGEKYAAASL-------AKA 407 (456) Q Consensus 349 l~~k~~~~~~~f~~~------------l~~~~~l~~~~~~~--~~~~~i~v~f~~~~~~~~~e~ad~~~k-------l~~ 407 (456) +++...++.. +.+.+.++.+..-. 
....-+++.+..++ ..+..++.+.+ +.+ T Consensus 379 -----~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l--a~l~r~~~~~~l~~~~~~laq 451 (535) T protein:vir:94 379 -----SELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGM--EALGRGQDLDKLERCIAAWSA 451 (535) T ss_pred -----HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEeehH--HHHHHHHHHHHHHHHHHHHHh Confidence 3333444433 34444444332111 12223445443332 11111111111 111 Q ss_pred cC------CCcHHHH----HHhCCCC-------hhHHHHHHHHHHHHHHHH-HhhhhhhhcccccCC Q lcl|NC_021301. 408 AG------ESWASIR----RNILNYN-------ADQIKQDDLDRAREQITL-FAGNSVQRPQEDGSR 456 (456) Q Consensus 408 ~g------~~s~~t~----~~~~~~~-------~~~~~~~e~~~~~ee~~~-~~~~~~~~~~~d~~~ 456 (456) .+ .+.-..+ .+.+|+. ++|++++.+++.+++... ............+.. T Consensus 452 ~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~ 518 (535) T protein:vir:94 452 LAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMATA 518 (535) T ss_pred hChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 11 1111111 2334543 344443333322222111 111111111111111 No 227 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=97.88 E-value=1.3e-05 Score=47.37 Aligned_cols=265 Identities=9% Similarity=0.028 Sum_probs=121.8 Q ss_pred hccCCeecCCCCcccHHHHHHHHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeC Q lcl|NC_021301. 71 IIPNGITVGGSADSDLALRARRIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDP 144 (456) Q Consensus 71 l~~~~~~~~~~~d~~~~~~l~~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~ 144 (456) +..-|+.+... +......+..++.. 
| ...+....++...+.+|.||+.+..+.+|.+ .+..++|..+.+..++ T Consensus 1 ia~l~~~~~~~-~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~ 79 (278) T protein:vir:78 1 MASLPLKMYED-YKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIEN 79 (278) T ss_pred CccceeEEEec-CcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcC Confidence 22334433211 11122223333331 2 2345677788999999999999999988986 6888999999887765 Q ss_pred CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCC Q lcl|NC_021301. 145 LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGM 224 (456) Q Consensus 145 ~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~ 224 (456) .... + . +.+...+|.. ..+..+++.++.. .++ .....|. T Consensus 80 ~~~~-~-~--y~~~~~~g~~---~~~~~~evih~~~-------------------------~~~---------~~~~~G~ 118 (278) T protein:vir:78 80 QSRE-L-Y--YSIHAATGNK---LIVHNMDMLHFKH-------------------------IVA---------SNMVQGI 118 (278) T ss_pred CCce-E-E--EEEEcCCceE---EEEccccEEEECC-------------------------CCC---------CCCeeec Confidence 4321 1 1 1112222211 1223333332210 000 0112467 Q ss_pred CcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchh-hhhhhhhhhccceeccCCCceeEeeccc Q lcl|NC_021301. 225 GEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAI-DYASIFEAAPGALWELPPGVDIWESQTN 303 (456) Q Consensus 225 s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~~~~~~~ 303 (456) |.+......++....+.. .....++.+-..+...+.. ..++....+ ..........+.+..++++.++.+++.. 
T Consensus 119 s~~~~~~~~i~~~~~~~~---~~~~~~~~~~~~i~~~~~~--l~~e~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~ 193 (278) T protein:vir:78 119 SPIDVLKNTTDFDNAVRT---FNLTEMQKPDSFMLKYGSN--VGKEKRQQVLEDFKQYYEENGGILFQEPGVEIEPLPKK 193 (278) T ss_pred cHHHHHHHHHHHHHHHHH---HHHHHhcCCCcEEEEeCCC--CCHHHHHHHHHHHHHHhccCCCceecCCCceEEEccCC Confidence 766665555554332221 1222333222222211111 111111111 1111111234556778888888877643 Q ss_pred c-hHHHHHHHHHHHHHHHhhcCCChhhhcccc-cC-cHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Q lcl|NC_021301. 304 D-FTPMLSAIKEHIRQLSSATKTPLPMLMPDS-AN-QSAE-GAH-NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES 378 (456) Q Consensus 304 ~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~-~N-~Sg~-Al~-~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~ 378 (456) . -..+++..+....+|+.+-|+|+..+|... +| ++.+ +.+ +....+.-.++..+.. +...+ -... T Consensus 194 ~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~~~~~~l~P~~~~i~~~--------ln~~L--~~~~ 263 (278) T protein:vir:78 194 YVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFYLQHTLLPIVKQYEEE--------FNRKL--LTKT 263 (278) T ss_pred hhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHHHHHHHHHHHHHHHHH--------HHhhc--CChh Confidence 2 234777788899999999999999997543 22 2222 221 1111222222111111 11111 1111 Q ss_pred c-ccceeEEecCCCC Q lcl|NC_021301. 379 V-EDTVDVSFESPDR 392 (456) Q Consensus 379 ~-~~~i~v~f~~~~~ 392 (456) + .....+.|+-+.. T Consensus 264 e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 264 DREKIGILNLTLNLI 278 (278) T ss_pred HhcCCceEEEecccC Confidence 1 1223466665443 No 228 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=97.83 E-value=1.6e-05 Score=46.86 Aligned_cols=395 Identities=9% Similarity=-0.035 Sum_probs=155.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccC-----cccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCC Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDA-----PLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNG 75 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~-----~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~ 75 (456) +.+.+|. +......+... =|.|-. .|+. .+....-++.. ....+..-++++....+.+.+ T Consensus 19 ~~~~~~~--~~~~~~~~~~~---------~~~g~~~~~~~~iLr-~~~~~~ly~~m---~~D~hi~s~l~~Rk~av~~~~ 83 (448) T protein:vir:79 19 DPSDVPK--LEGASVPVMST---------SYDVVVDREFDELLQ-GKDGLLVYHKM---LSDGTVKNALNYIFGRIRSAK 83 (448) T ss_pred ccccchh--hhhhhhhhccc---------ccccccccchhHhhc-cccchHHHHHH---hhChHHHHHHHHHHHHHhcCC Confidence 1111111 00000000000 011100 0000 00000011111 124556667777777777888 Q ss_pred eecCCCCcccHHHHHHHHHH----h-------cChhHHHHHHHHHHhhCCeE-EEEEee-CCCCceEE---EEEcccee- Q lcl|NC_021301. 76 ITVGGSADSDLALRARRIWR----D-------NRMDSVCKQWVKYGLDFGES-YLTCWR-RDDGTATI---TADSPETM- 138 (456) Q Consensus 76 ~~~~~~~d~~~~~~l~~~~~----~-------n~~~~~~~~~~~~a~~~G~a-~~~v~~-d~dg~~~i---~~~~p~~~- 138 (456) +++....++....++.+.+. . ..|..++. -..+|..||.+ ++++|. ..+|...+ ...+|+.. T Consensus 84 w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~-~~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r~~~~~~ 162 (448) T protein:vir:79 84 WYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFA-IYENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFNID 162 (448) T ss_pred ceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHH-HHHHhhhhcceeEEEEeeecCCCceecccccccCCcccc Confidence 77743222222333322222 1 13444443 34568889995 567885 45776543 22333321 Q ss_pred EEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEE-- Q lcl|NC_021301. 139 VVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVV-- 216 (456) Q Consensus 139 ~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv-- 216 (456) ...||+... ... .+.+..+.. . .....+ ...++..++-.. 
T Consensus 163 ~f~~~~d~~----------------l~~---~~~~~~~~~------------~-~~~~~~------~~lP~~~~i~~~~~ 204 (448) T protein:vir:79 163 EVLYDEEGG----------------PKA---LKLSGEVKG------------G-SQFVSG------LEIPIWKTVVFLHN 204 (448) T ss_pred ceeeecCCc----------------eEE---eecCCcccc------------c-ccCCCc------cccccceEEEEecC Confidence 122232221 110 000000000 0 000000 001111111111 Q ss_pred EccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhh-hhhhhhhhccceeccCCCc Q lcl|NC_021301. 217 VYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAID-YASIFEAAPGALWELPPGV 295 (456) Q Consensus 217 ~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~d~ 295 (456) ...|+.|.|.+..+.-..---+..+.+.+..++.++.|.++.+--. +....++...... .+..+..+..+...++.+. T Consensus 205 ~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~-ga~~~~~~~~~l~~av~~i~~g~~a~~iiP~~~ 283 (448) T protein:vir:79 205 DDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPK-SVRQGTKQWEAAKEIVKNFVQKPRHGIILPDDW 283 (448) T ss_pred ccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCC-CCCcCHHHHHHHHHHHHHHhcCCceEEEecCCc Confidence 1246788898887554333334457777888999999987655311 1111111111221 1222222333444566666 Q ss_pred eeEeeccc-chHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Q lcl|NC_021301. 296 DIWESQTN-DFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AILVKALQ 373 (456) Q Consensus 296 ~~~~~~~~-~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~~l~~~ 373 (456) ++..+..+ ....+...++.+-.+|+.+.--.....+.. ++.++.++.....-....++.-.+.+...+. +++.-++. 
T Consensus 284 ~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~-~g~~~~~~~~~~~v~~~~~~aDa~~i~~tln~~li~~l~~ 362 (448) T protein:vir:79 284 KFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLN-MGVQAINIGEFVSLTQQTIISLQREFASAVNLYLIPKLVL 362 (448) T ss_pred eEEEEecCCCcccHHHHHHHHHHHHHHHHhhhhhccccc-cchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65433221 222344555655566655432110111111 1111122221111111122222244455554 35655555 Q ss_pred hcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccc Q lcl|NC_021301. 374 IEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNSVQRPQED 453 (456) Q Consensus 374 ~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d 453 (456) +..-....--.+.|....+.|.++.++.+.+|+..+.....-..+.++..+.... .+ .+............++.. T Consensus 363 lNfg~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~---~~--~~a~~~~~~~~~~~~~~~ 437 (448) T protein:vir:79 363 PNWPSATRFPRLTFEMEERNDFSAAANLMGMLINAVKDSEDIPTELKALIDALPS---KM--RRALGVVDEVREAVRQPA 437 (448) T ss_pred hcCCCcCCCcEEEecCCChHHHHHHHHHhhhhhccchhhHHHHHHhhcCCCCCCC---cc--ccccCCCCcccccccCCc Confidence 5422222223677888888999999999999987765554445555554321100 00 000000000111112222 Q ss_pred cCC Q lcl|NC_021301. 454 GSR 456 (456) Q Consensus 454 ~~~ 456 (456) -+| T Consensus 438 ~~~ 440 (448) T protein:vir:79 438 DSR 440 (448) T ss_pred ccc Confidence 233 No 229 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=97.82 E-value=1.7e-05 Score=46.72 Aligned_cols=436 Identities=11% Similarity=0.018 Sum_probs=171.5 Q ss_pred CCCCCH---HHHHHHHHHHHHH----HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 
1 MTASTP---AEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~---~~~~~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) |+..-. .+-++....++.. ...+++.+.+|..-..-..+ ... -.....++..+-+...++.+++.|.+ T Consensus 1 m~~~~~~~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~~~~~---~~~--~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) T protein:vir:21 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKD---SDN--ASTDYQTPWQAVGARGLNNLASKLML 75 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCC---CCc--ccccccccccccHHHHHHHHHHHHHH Confidence 776221 2334444444433 24555666666544321111 111 11112245566778888888887754 Q ss_pred C-----C-eecCCCC-c-------cc-----------HHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce Q lcl|NC_021301. 74 N-----G-ITVGGSA-D-------SD-----------LALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA 128 (456) Q Consensus 74 ~-----~-~~~~~~~-d-------~~-----------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~ 128 (456) - | |++.... + .. ....+...+..++|.....++.++..++|.+.+++-.+.++.+ T Consensus 76 ~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~ 155 (536) T protein:vir:21 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) T ss_pred hhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCce Confidence 2 2 3332111 0 00 1122445666788999999999999999999887765544434 Q ss_pred -EEEEEccceeEEEEeCCCCceEEEEEEEEEec-------CCceE--EEEEEcCCeEEEEEEeeeecccccceeec-cCC Q lcl|NC_021301. 129 -TITADSPETMVVSVDPLQPWRIRSAMRWWRDL-------DAESD--FAIVWSGDGWQKFARPCFVQSSSRRRLVT-RIS 197 (456) Q Consensus 129 -~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 197 (456) .++.++-.++++.-|+. ++ +...++.++-. .+... ...-..++....+....+...+...+... ... 
T Consensus 156 ~~f~~~pl~~~~v~~d~~-G~-vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~e~~ 233 (536) T protein:vir:21 156 NPMKLYRLSSYVVQRDAF-GN-VLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEDSGEYLRYEEVE 233 (536) T ss_pred eeEEEEEcCeEEEeeCCC-CC-eeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEecCCCcEEEEeccC Confidence 46777766766666643 43 44444443211 01000 00000111111111111111111122111 111 Q ss_pred CceeecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccc Q lcl|NC_021301. 198 DSWVPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDEN 271 (456) Q Consensus 198 ~~~~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~ 271 (456) +..+.......++. +|.+++ | .+.+|+|-.+..++.+..+|...-...........+...+. .. T Consensus 234 g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~----------p~ 303 (536) T protein:vir:21 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN----------PA 303 (536) T ss_pred CeeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC----------cc Confidence 22111111222222 233322 2 34689999999999999998776665555555554332221 11 Q ss_pred cchhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHHHH Q lcl|NC_021301. 272 GNAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIEKG 348 (456) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~~~ 348 (456) |. .........+.|.+....+ +..+.++. .+++..-...++.+...|....-+. .+. .+...-+|.-++..... 
T Consensus 304 g~-~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E 380 (536) T protein:vir:21 304 GI-TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASE 380 (536) T ss_pred cc-cchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHH Confidence 10 1111222334444433222 22233332 2334333333433333333222111 111 12223355444443333 Q ss_pred HHHHH----HHHH-HHHHHHHHHHHHHHHHhcCC--CcccceeEEecCCCC-cCHHHHHHHHHH----HHhcC--C---- Q lcl|NC_021301. 349 FLFKC----EDRL-SIAKIGLEAILVKALQIEGE--SVEDTVDVSFESPDR-VTLGEKYAAASL----AKAAG--E---- 410 (456) Q Consensus 349 l~~k~----~~~~-~~f~~~l~~~~~l~~~~~~~--~~~~~i~v~f~~~~~-~~~~e~ad~~~k----l~~~g--~---- 410 (456) +.+.. .+.+ ..+.+-+++.+.++.+..-. ....-+++.+..++. -.....++.+.. +.+.+ + T Consensus 381 ~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~ 460 (536) T protein:vir:21 381 LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPD 460 (536) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhccc Confidence 32211 1111 22233344444444332111 122234555543321 111111221111 11111 1 Q ss_pred CcHHHH----HHhCCCCh-------hHHHHHHHHHHHHHH-HHHhhhhhhhcc-------cccCC Q lcl|NC_021301. 411 SWASIR----RNILNYNA-------DQIKQDDLDRAREQI-TLFAGNSVQRPQ-------EDGSR 456 (456) Q Consensus 411 ~s~~t~----~~~~~~~~-------~~~~~~e~~~~~ee~-~~~~~~~~~~~~-------~d~~~ 456 (456) +.-..+ .+.+|+++ +|++++..++..++. ...+....+... ++++. 
T Consensus 461 id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 525 (536) T protein:vir:21 461 INLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAA 525 (536) T ss_pred CCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHh Confidence 111112 23457643 344443333222211 111111111111 11111 No 230 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=97.79 E-value=1.9e-05 Score=46.47 Aligned_cols=436 Identities=11% Similarity=0.020 Sum_probs=170.7 Q ss_pred CCCCCH---HHHHHHHHHHHHH----HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc Q lcl|NC_021301. 1 MTASTP---AEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP 73 (456) Q Consensus 1 ~~~~t~---~~~~~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~ 73 (456) |+..-. .+-++...+++.. ...+++.+.+|..-..-..+ ... -.....++.-+-+...++.+++.|.+ T Consensus 1 m~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~~---~~~--~~~~~~~~~dst~~~a~~~Laa~l~~ 75 (536) T protein:vir:10 1 MAEKRTGLAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSLFPKD---SDN--ASTDYQTPWQAVGARGLNNLASKLML 75 (536) T ss_pred CcchhhchhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccccCCC---CCc--ccccccccccccHHHHHHHHHHHHHh Confidence 776221 2334444444433 24556666677544321111 111 11112245556778888888887754 Q ss_pred C-----C-eecCCCC-c-------cc-----------HHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce Q lcl|NC_021301. 74 N-----G-ITVGGSA-D-------SD-----------LALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA 128 (456) Q Consensus 74 ~-----~-~~~~~~~-d-------~~-----------~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~ 128 (456) - | |++.... + .. 
....+...+..++|.....++.++..++|.+.+++-.+.++.+ T Consensus 76 ~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~~ 155 (536) T protein:vir:10 76 ALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSNY 155 (536) T ss_pred hhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCce Confidence 2 2 3332111 0 00 1122445666788999999999999999999887765544434 Q ss_pred -EEEEEccceeEEEEeCCCCceEEEEEEEEEec-------CCceE--EEEEEcCCeEEEEEEeeeecccccceeec-cCC Q lcl|NC_021301. 129 -TITADSPETMVVSVDPLQPWRIRSAMRWWRDL-------DAESD--FAIVWSGDGWQKFARPCFVQSSSRRRLVT-RIS 197 (456) Q Consensus 129 -~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 197 (456) .++.++-.++++.-|+. ++ +...++.++-. .+... ...-..++....+....+.......+... ... T Consensus 156 ~~~~~~pl~~~~v~~d~~-G~-vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~~~e~~ 233 (536) T protein:vir:10 156 NPMKLYRLSSYVVQRDAF-GN-VLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLDEASGEYLRYEEVE 233 (536) T ss_pred eeEEEEEcCeEEEeeCCC-CC-eeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEecCCCcEEEEEeec Confidence 46777766766666643 43 44444443211 01000 00000111111111111111111111110 111 Q ss_pred CceeecccccccC-ceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccc Q lcl|NC_021301. 198 DSWVPVGDAVVTG-SPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDEN 271 (456) Q Consensus 198 ~~~~~~~~~~~~~-~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~ 271 (456) +..+.......++ .+|.+++ | .+.+|+|-.+..++.+..+|...-...........+...+. .. 
T Consensus 234 g~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~----------p~ 303 (536) T protein:vir:10 234 GMEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGLVN----------PA 303 (536) T ss_pred CccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccC----------cc Confidence 1111111111122 2233322 1 34689999999999999998776665555555554332221 11 Q ss_pred cchhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHHHH Q lcl|NC_021301. 272 GNAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIEKG 348 (456) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~~~ 348 (456) |. .........+.|.+....+ +..+.++. .+++..-...++.+...|....-+. .+. .+...-+|.-++..... T Consensus 304 g~-~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E 380 (536) T protein:vir:10 304 GI-TQPRRLTKAQTGDFVTGRPEDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLN--SAVQRTGERVTAEEIRYVASE 380 (536) T ss_pred cc-cchhhhccCCCcceecCCcccceeeeccccccchHHHHHHHHHHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHH Confidence 10 1111222334444433222 22233332 2334333333433333333222111 111 12222355444443333 Q ss_pred HHHHH----HHHH-HHHHHHHHHHHHHHHHhcCC--CcccceeEEecCCCC-cCHHHHHHHHH----HHHhcC--C---- Q lcl|NC_021301. 349 FLFKC----EDRL-SIAKIGLEAILVKALQIEGE--SVEDTVDVSFESPDR-VTLGEKYAAAS----LAKAAG--E---- 410 (456) Q Consensus 349 l~~k~----~~~~-~~f~~~l~~~~~l~~~~~~~--~~~~~i~v~f~~~~~-~~~~e~ad~~~----kl~~~g--~---- 410 (456) +.+.. .+.+ ..+.+-+++.+.++.+..-. ....-+++.+..++. -.....++.+. 
.+.+.+ + T Consensus 381 ~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~ 460 (536) T protein:vir:10 381 LEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPD 460 (536) T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhccc Confidence 32211 1111 22233344444444332111 122234555543321 11111111111 111111 1 Q ss_pred CcHHHH----HHhCCCCh-------hHHHHHHHHHHHHHH-HHHhhhhhhhcc-------cccCC Q lcl|NC_021301. 411 SWASIR----RNILNYNA-------DQIKQDDLDRAREQI-TLFAGNSVQRPQ-------EDGSR 456 (456) Q Consensus 411 ~s~~t~----~~~~~~~~-------~~~~~~e~~~~~ee~-~~~~~~~~~~~~-------~d~~~ 456 (456) +.-..+ .+.+|+++ +|++++..++..++. ...+....+.-+ ++.+. T Consensus 461 id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 525 (536) T protein:vir:10 461 INLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAA 525 (536) T ss_pred CCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHh Confidence 111122 23457643 344443333222211 111111111111 11111 No 231 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=97.78 E-value=2e-05 Score=46.33 Aligned_cols=402 Identities=11% Similarity=-0.004 Sum_probs=172.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhh-ccChHHHHHHHHHhhhccCCeecC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA-RTNWGLMVRDSVADRIIPNGITVG 79 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~-~~n~~~~iVd~~a~~l~~~~~~~~ 79 (456) =.+-||. |.++........+....|..-..-++..+....+.++-..... .-.+..-.+++.-..+.+-.+++. 
T Consensus 6 ~~~p~~~-----~~~~~~~~~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~~v~s~l~~Rk~av~~~~w~V~ 80 (446) T protein:vir:98 6 RNAPTPA-----IRRRTIYAMEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEPIIAQGLDSIALSVLNKVGPYQ 80 (446) T ss_pred cCCCchh-----hhhhhhhccccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcchHHHHHHHHHHHHhhcCCceec Confidence 1122332 2222222233333334444211112222222222222211111 246667777777777777788876 Q ss_pred CCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeE-EEEEeeCCCCceE-EEEEccceeEEEEeCCCCceEEEEEEEE Q lcl|NC_021301. 80 GSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGES-YLTCWRRDDGTAT-ITADSPETMVVSVDPLQPWRIRSAMRWW 157 (456) Q Consensus 80 ~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a-~~~v~~d~dg~~~-i~~~~p~~~~~~~d~~~~~~~~~~~~~~ 157 (456) . .+.+..+.+.+.+..-.++..... ..++..||.+ .+++|.-.+|.-. .+++++.. ...... +++. T Consensus 81 p-~~~~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~---------~~~~~~-~r~~ 148 (446) T protein:vir:98 81 H-GDKRIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGARDNMPATVLDDIV---------NYHPLQ-VMLI 148 (446) T ss_pred C-ccHHHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeecccccccchhhcccc---------cccccc-ceee Confidence 4 455566667777776666555544 5788889994 6688875444211 11111111 000000 0111 Q ss_pred EecCCceEEEEEEcCCeEEEEEEeeeecc-----cccceeeccCCCceeecccccccCceeEEEE---ccCCCCCCcHhH Q lcl|NC_021301. 158 RDLDAESDFAIVWSGDGWQKFARPCFVQS-----SSRRRLVTRISDSWVPVGDAVVTGSPPPVVV---YQNPDGMGEVEP 229 (456) Q Consensus 158 ~~~d~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~---~~n~~g~s~~~~ 229 (456) .+.++...... ..+...+....+... ........ ..+. ...-+...++...+ ..|+.|.|.+.. 
T Consensus 149 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g~----~~~iP~~kfi~~~~~~~~~~p~G~gLlr~ 220 (446) T protein:vir:98 149 ANDNGRIVDGD---TVTASQYKSGYWVPLPPYRIGDPPKKVD-VVGS----HVRLPSHKRLFINYNTKGNNPWGTSCLTS 220 (446) T ss_pred eccCCcccccc---ccchhhcccccccCcccchhhhhhhhcc-cCcc----cccccccceEEEEecCCCCCccccchHHH Confidence 11111000000 000000000000000 00000000 0000 00001111111112 257889998876 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhchhhhhhc---CCCcccccccccch---------hhhhhhhhhhcccee---ccCCC Q lcl|NC_021301. 230 HIDIINRINRAELQLLSTMAIQAFRQRALKS---AGHGLPKVDENGNA---------IDYASIFEAAPGALW---ELPPG 294 (456) Q Consensus 230 v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g---~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~---~~~~d 294 (456) +--.---=+..+-+.+...+.++.|.++.+- .+..+.. +.++.. ...........+.+. ..+.+ T Consensus 221 ~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~-~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g 299 (446) T protein:vir:98 221 VLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEE-APDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQP 299 (446) T ss_pred HHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCccccc-chhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCC Confidence 4332222233466677788899988776652 2211111 001100 000111111111111 12445 Q ss_pred ceeEee--cccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-CcHHHHHH-HHHHHHHHHHHHHHHHHHHHHH-HHHH Q lcl|NC_021301. 295 VDIWES--QTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAH-NIEKGFLFKCEDRLSIAKIGLE-AILV 369 (456) Q Consensus 295 ~~~~~~--~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~Sg~Al~-~~~~~l~~k~~~~~~~f~~~l~-~~~~ 369 (456) ..+.-+ ..+....|...++.+-.+|+.+.--..-.+|...+ +.|. |+- ....-....++.-.+.+...+. 
++++ T Consensus 300 ~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~-ala~vh~~V~~d~~~aDa~~i~~tln~~Li~ 378 (446) T protein:vir:98 300 VQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTG-RASEIQLELFDGKINSIFDTVIHAFTEQVIG 378 (446) T ss_pred ceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554333 22333457777888778888765433223332211 1121 221 1111122222333355556664 5777 Q ss_pred HHHHhcCCCcccce-----eEEecCCCCcCHHHHHHHHHHHHhcCCC---cHHHHHHhCCCChhHHHH Q lcl|NC_021301. 370 KALQIEGESVEDTV-----DVSFESPDRVTLGEKYAAASLAKAAGES---WASIRRNILNYNADQIKQ 429 (456) Q Consensus 370 l~~~~~~~~~~~~i-----~v~f~~~~~~~~~e~ad~~~kl~~~g~~---s~~t~~~~~~~~~~~~~~ 429 (456) -++.+.+....... .+.|....+.|..+.++++.+|.++|+. +.+.+++.+|+.+.+-.- T Consensus 379 ~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 379 NLIRLNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred HHHHhCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 66666654332222 2456666788999999999999999963 355678888874321111 No 232 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=97.77 E-value=2e-05 Score=46.30 Aligned_cols=436 Identities=11% Similarity=0.030 Sum_probs=171.8 Q ss_pred CCC----CCHHHHHHHHHHHHHH----HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTA----STPAEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~----~t~~~~~~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |.. -.+.+.++....++.. ...+++.+.+|..-..-..+-. . -+....++--+-+...++.+++.|. 
T Consensus 1 ~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~-~----~~~~~~~~~dst~~~a~~~Laa~l~ 75 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSLFPKDSD-N----SSTDYTTPWQAVGARGLNNLSAKVM 75 (543) T ss_pred CcccccCcchHHHHHHHHHHHHHHHhHHHHHHHHHHHHhccccCCCCCC-c----ccccccccccchHHHHHHHHHHHHH Confidence 554 2245566665555543 3345566666655421100000 0 0111123445667778888888775 Q ss_pred cC-----C-eecCCCC--------cccHHHH-----------HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCc Q lcl|NC_021301. 73 PN-----G-ITVGGSA--------DSDLALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGT 127 (456) Q Consensus 73 ~~-----~-~~~~~~~--------d~~~~~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~ 127 (456) +- + |++.... +.+.... +...+..++|.....++.++..++|.+.+++..+.... T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~ 155 (543) T protein:vir:88 76 LALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASS 155 (543) T ss_pred HhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCcccc Confidence 42 2 3332211 1111122 33445568899999999999999999987776554332 Q ss_pred eE---EEEEccceeEEEEeCCCCceEEEEEEEEEecCCc--------eEEEEEEcCCeEEEEEEeeeecccccceeec-c Q lcl|NC_021301. 128 AT---ITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAE--------SDFAIVWSGDGWQKFARPCFVQSSSRRRLVT-R 195 (456) Q Consensus 128 ~~---i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 195 (456) .+ ++.++-.+.++..|. .++ +...++.++-.-.. .....-+.++....+....+...+...+... . 
T Consensus 156 ~~~~~~~~~pl~~y~v~~d~-~G~-v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~~~~~~~~~~~ 233 (543) T protein:vir:88 156 NSYNPMKLYTLHNHVVQRDA-FGN-VLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDDESGDFLSYQE 233 (543) T ss_pred ceecceEEeEcceEEEeeCC-CCC-eeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeecCCCccccccc Confidence 32 344433443333343 343 44444444211000 0000000111111111111111111111111 1 Q ss_pred CCCceeecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccc Q lcl|NC_021301. 196 ISDSWVPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVD 269 (456) Q Consensus 196 ~~~~~~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~ 269 (456) ..+..+........++ +|.+++ | .+.+|+|-.+..++.+..+|...-......+....|.+.+. T Consensus 234 ~~~~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~---------- 303 (543) T protein:vir:88 234 IEGVEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVGLVN---------- 303 (543) T ss_pred ccCeeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec---------- Confidence 1122221111111122 232222 2 34689999999999999999888888888888777764431 Q ss_pred cccchhhhhhhhhhhccceecc-CCCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHH Q lcl|NC_021301. 270 ENGNAIDYASIFEAAPGALWEL-PPGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIE 346 (456) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~-~~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~ 346 (456) .+|. ........++.|.+... +.+....++. ..++....+.++.+...|....-+. .+. .+..+-+|.-++... 
T Consensus 304 ~~g~-~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~ 380 (543) T protein:vir:88 304 PNGI-TQVRRLVKAQTGDFVAGRKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLN--SAVQRSGERVTAEEIRYVA 380 (543) T ss_pred cccc-cchhhcccCCCceeecCCCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCCcccHHHHHHHH Confidence 0010 01111223333433222 2233333443 2344444444444444443322111 111 122233444443333 Q ss_pred HHHHHHHH----HHH-HHHHHHHHHHHHHHHHhcCC--CcccceeEEecCCC-CcCHHHHHHHHHHHHh-cCCCc----- Q lcl|NC_021301. 347 KGFLFKCE----DRL-SIAKIGLEAILVKALQIEGE--SVEDTVDVSFESPD-RVTLGEKYAAASLAKA-AGESW----- 412 (456) Q Consensus 347 ~~l~~k~~----~~~-~~f~~~l~~~~~l~~~~~~~--~~~~~i~v~f~~~~-~~~~~e~ad~~~kl~~-~g~~s----- 412 (456) ..+.+... +.+ ..+.+-+.+.+.++.+..-. .....+++.+..++ +-.....++.+....+ .|.+. T Consensus 381 ~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vl 460 (543) T protein:vir:88 381 SELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGD 460 (543) T ss_pred HHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhh Confidence 22222211 111 22233334444444332111 12234555554321 2122222222222221 11111 Q ss_pred ----HHHH----HHhCCCCh-------hHHHHHHHHHHHHHHHHHhhhhhh-hcccccCC Q lcl|NC_021301. 413 ----ASIR----RNILNYNA-------DQIKQDDLDRAREQITLFAGNSVQ-RPQEDGSR 456 (456) Q Consensus 413 ----~~t~----~~~~~~~~-------~~~~~~e~~~~~ee~~~~~~~~~~-~~~~d~~~ 456 (456) -..+ ...+|+++ ++++++..++..++.......... 
....+.++ T Consensus 461 d~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~~~ 520 (543) T protein:vir:88 461 PDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQATA 520 (543) T ss_pred ccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcc Confidence 1111 23356643 334433333322221111111000 01111112 No 233 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=97.72 E-value=2.5e-05 Score=45.78 Aligned_cols=420 Identities=10% Similarity=0.043 Sum_probs=182.6 Q ss_pred CCCCCHH--H-HHHHHHHHHHHHHHHHHHHHHHhcccC-cccccCcccchhhhhhhh-hhccChHHHHHHHHHhhh---- Q lcl|NC_021301. 1 MTASTPA--E-WLPVLTKRIDDGMSRVRLLARYSNGDA-PLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRI---- 71 (456) Q Consensus 1 ~~~~t~~--~-~~~~l~~~~~~~~~r~~~~~~YY~g~~-~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l---- 71 (456) =|+-.|. + ....-+..+.......-..+++|-|.. .+. ...++-+..+ .+.+.=+-.+|+..++=. T Consensus 29 ~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~-----~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d 103 (523) T protein:vir:68 29 ESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLK-----STRELIDTYRNLMTNYEVDNAVSEIVSDAIVYE 103 (523) T ss_pred CCccccCCCCcceeeeccccccccccchhhhhhhhccccccc-----hHHHHHHHHHHHhhccchhhHHHHhhcceeeec Confidence 0111111 0 000000000011111122223333211 000 0001111111 111222333333333322 Q ss_pred -ccCCeecCCCCc-------ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC----CCceEEEEEccceeE Q lcl|NC_021301. 72 -IPNGITVGGSAD-------SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----DGTATITADSPETMV 139 (456) Q Consensus 72 -~~~~~~~~~~~d-------~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~----dg~~~i~~~~p~~~~ 139 (456) ..+||.+.-+.. +....++..+.+--+|+....+..|.-.+.|+.|.+...|. 
+|-..++.++|+.+- T Consensus 104 ~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~ 183 (523) T protein:vir:68 104 DDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQ 183 (523) T ss_pred CCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCCCccccceeeeeeCCccee Confidence 123444432211 12234455666667888889999999999999999987764 366788999999775 Q ss_pred EEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EEE Q lcl|NC_021301. 140 VSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVV 217 (456) Q Consensus 140 ~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~ 217 (456) .+..-..... ..+.. -+......+|.+... .+...... . . ......+|. |++ T Consensus 184 ~vr~i~~~~~--~g~~v----i~~~~e~f~Y~~~~~-~~~~~g~~---------~-~---------~~~~ikI~~dAI~y 237 (523) T protein:vir:68 184 YVREVITTTE--AGVKI----VKGYKEYFIYDTSHE-SYACDGRI---------Y-E---------AGTKIKIPKAAIVY 237 (523) T ss_pred EEEeecCCCC--cchhh----hhhhhhheeeccccc-cccccccc---------c-C---------CCcceecchhheee Confidence 5432111100 00000 011222334443221 11000000 0 0 001111111 233 Q ss_pred ccC-------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc------------------- Q lcl|NC_021301. 218 YQN-------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE------------------- 270 (456) Q Consensus 218 ~~n-------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~------------------- 270 (456) .+. 
..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 238 ~hSGL~d~~~~~i~gyLhkAiKp~NQL-kmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa 316 (523) T protein:vir:68 238 AHSGLVDCCGKNIIGYLHRAIKPANQL-KLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDA 316 (523) T ss_pred eeccceeCCCCceeccchhhhHHHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEec Confidence 221 11124444322222222 233344444344333332221111 11221111 Q ss_pred -ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc--cCc-HHHHH Q lcl|NC_021301. 271 -NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS--ANQ-SAEGA 342 (456) Q Consensus 271 -~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~--~N~-Sg~Al 342 (456) +|...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-+.... -|. -|..| T Consensus 317 ~TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EI 395 (523) T protein:vir:68 317 TTGKIKNQQHIMSMTED-YWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSI 395 (523) T ss_pred cCCeeccchhhhhhHhh-hcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcceecccccch Confidence 1111000000000000 011 11223344555543223444467777788889999987774331 121 22356 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------HHHHHHHh- Q lcl|NC_021301. 343 HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------AAASLAKA- 407 (456) Q Consensus 343 ~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d~~~kl~~- 407 (456) --....+..-+.+.+..|..-+.++++.-+.++|.-. + ..|.+.|.....-.+...+ +++..+.. 
T Consensus 396 tRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpy 475 (523) T protein:vir:68 396 TRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPF 475 (523) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhh Confidence 6666677778889999999999999998877777521 1 3477788765444333332 33333321 Q ss_pred cC-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 408 AG-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 408 ~g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) .| .+|.++++ .+|.+++++++++ .+.+++|...-.-....++.+|. T Consensus 476 vGky~s~~yi~k~ILr~tDeei~~~-~kqI~~E~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 476 IGKYISHRTAMKDILQMSDEEIEQE-AKQIEEESKEARFQDPDQEQEDF 523 (523) T ss_pred hcccchhHHHHHHHhccCHHHHHHH-HHHHHHHhhcCCCCCCchhhhcC Confidence 13 46888876 5789999888764 44556665443333444445555 No 234 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=97.70 E-value=2.8e-05 Score=45.58 Aligned_cols=348 Identities=10% Similarity=-0.047 Sum_probs=144.6 Q ss_pred HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC--CCc-------ccHHHHHH Q lcl|NC_021301. 21 MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG--SAD-------SDLALRAR 91 (456) Q Consensus 21 ~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~--~~d-------~~~~~~l~ 91 (456) ..-+.+...+.......-..+-.. ..-+... ........+|+.+++-+..-|+.+-. ..+ +.....+. 
T Consensus 1 M~if~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~~v~~~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVTA-WQNEAVE--YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred CchhHHhHhhhhcccccCcceeee-eecchhh--hhhHHHHHHHHHHHHhHhhCceeeeeecccccccccccccccchHH Confidence 222222222222211000000000 0000000 11234567888888888777764311 000 01122344 Q ss_pred HHHHh--cC---hhHHHHHHHHHHhhCCeEEEE-EeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceE Q lcl|NC_021301. 92 RIWRD--NR---MDSVCKQWVKYGLDFGESYLT-CWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESD 165 (456) Q Consensus 92 ~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~-v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~ 165 (456) .+++. |. -......+....+..|.||++ ++.+.+|.+... ++. .++ T Consensus 78 ~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~------------------------~~~-~~~--- 129 (378) T protein:vir:94 78 EVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELLDL------------------------LFA-NDK--- 129 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEEEE------------------------EEe-cCc--- Confidence 55542 22 235556688888999999976 444444432110 000 011 Q ss_pred EEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHH-HHHHHHHHHHHHH Q lcl|NC_021301. 166 FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHI-DIINRINRAELQL 244 (456) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~-~liDa~~~~~s~~ 244 (456) ..|..+.+ +++.++.+.+.+...+ .+.++++..+. T Consensus 130 --~~~~~~dv----------------------------------------ih~~~~~~~~~~~~~~~~~~~~~~~~~~-- 165 (378) T protein:vir:94 130 --KEYKPEEL----------------------------------------VRLTSPFYINEDTSILDNALASIQTKLE-- 165 (378) T ss_pred --EEechhce----------------------------------------eeecCcCCcccchhHHHHHHHHHHHHHh-- Confidence 00111111 1222222222222111 12222222111 Q ss_pred HHHHHHhhchhhhhhcCCCcccccccccchhhhh-hhhh-----hhccceeccCCCceeEeecccchHHHHHHHHHHHHH Q lcl|NC_021301. 
245 LSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYA-SIFE-----AAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQ 318 (456) Q Consensus 245 ~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~-~~~~-----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~ 318 (456) .+.+.-+++. ..... .+......... ..+. ...+.+..++.+.++.+++..+...-.+.++....+ T Consensus 166 ------~~~~~g~l~~-~~~l~-~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~~ 237 (378) T protein:vir:94 166 ------QGKLRGLLKI-NAFLD-IDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKSE 237 (378) T ss_pred ------hCCcccceee-CCcCC-HHHHHHHHHHHHHHHHHhhcccccccceeccCCceEEEccCChHHhhHHHHHHHHHH Confidence 1111112211 11100 00001111111 1111 122346777888898888543322223556778889 Q ss_pred HHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--ceeEEecCCCCcCHH Q lcl|NC_021301. 319 LSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVED--TVDVSFESPDRVTLG 396 (456) Q Consensus 319 i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~--~i~v~f~~~~~~~~~ 396 (456) |+.+-|+|+..+.+..+ ....+.+....|.-.+...+..+...|-. ---...|....+ .+.+.+..-.-.|.. T Consensus 238 Ia~~fgvPp~~l~g~~~--e~~~~~f~~~tl~P~~~~ie~~l~~~Ll~---~~e~~~g~~~~~~~~~~f~~~~l~~~d~~ 312 (378) T protein:vir:94 238 LLTGYFMNENILLGTAT--QEQQIYFYNSTIIPLLIQLEKELTYKLIS---TNRRRVVKGNLYYERIIVDNQLFKFATLK 312 (378) T ss_pred HHHHhCCCHHHhcCCch--HHHHHHHHHHHHHHHHHHHHHHHHhhcCC---hhHhhhhhhhcccceeEeecchhhhcCHH Confidence 99999999998864322 11222232233322222222222211100 000011222222 344445566778999 Q ss_pred HHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHHHH-HHHhhhhhhhcccccCC Q lcl|NC_021301. 397 EKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAREQI-TLFAGNSVQRPQEDGSR 456 (456) Q Consensus 397 e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~ee~-~~~~~~~~~~~~~d~~~ 456 (456) +.++++.++.+.|+++.--+++.+|+.|-+--. .....+.... 
...........+|+.++ T Consensus 313 ~~~e~~~~~~~~G~~t~NE~R~~~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 313 ELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeecccccchhcchhcccccCCCCCCCCCCCC Confidence 999999999999999998889998886532100 0011111000 01112223344556666 No 235 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=97.64 E-value=3.4e-05 Score=45.12 Aligned_cols=415 Identities=10% Similarity=0.050 Sum_probs=179.8 Q ss_pred CCCCCHHH------H---HHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAE------W---LPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~------~---~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~ 70 (456) -|.-.|.. . +..... +.+-...+.|-|... ..+...++-+..+ .+.+.=+-.+|+..++= T Consensus 29 ~S~~~p~~~Dga~e~~~~~~~~a~------~~~g~~~~~~g~~e~----~~~~~~eLI~~YR~ma~~pEvd~Av~eIVne 98 (524) T protein:vir:10 29 VSITAPKLDDGAREFEVSSNEAAS------PYNAAFQTIFGSYEP----GMKTTRELIDTYRNLMNNYEVDNAVSEIVSD 98 (524) T ss_pred ccccCccCCCCceeeeeccccccc------ccceeeeehhccccc----ccchHHHHHHHHHHHhhccchhhHHHHhhcc Confidence 12222211 0 000000 001111111111000 0000001111111 11122233333333332 Q ss_pred h-----ccCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC----CCceEEEEEc Q lcl|NC_021301. 71 I-----IPNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----DGTATITADS 134 (456) Q Consensus 71 l-----~~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~----dg~~~i~~~~ 134 (456) . ..+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+...|. 
+|-..++.++ T Consensus 99 aiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lD 178 (524) T protein:vir:10 99 AIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLD 178 (524) T ss_pred eeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCCCccccceeeeeeC Confidence 2 12344443211 111234455666667888889999999999999999987764 3667789999 Q ss_pred cceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE Q lcl|NC_021301. 135 PETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP 214 (456) Q Consensus 135 p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) |+.+-.+..-... ...... ..+......+|.+.. ..|...... . .......+|. T Consensus 179 Pr~i~~vr~i~~~--~~~~~~----vi~~~~e~f~Y~~~~-~~y~~~g~~---------~----------~~~~~ikI~~ 232 (524) T protein:vir:10 179 PRQVQYVREIITE--TEAGTK----IVKGYKEYFIYDTAH-ESYACDGRM---------Y----------EAGTKIKIPK 232 (524) T ss_pred CccceeeeeeccC--CCccch----hhcchhhheeeccCc-cccccCccc---------c----------CCCcceecch Confidence 9987554321100 000000 001122223444321 111000000 0 0011111111 Q ss_pred --EEEccC-------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc-------------- Q lcl|NC_021301. 215 --VVVYQN-------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE-------------- 270 (456) Q Consensus 215 --vv~~~n-------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~-------------- 270 (456) |++.+. 
..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 233 dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQL-kmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNk 311 (524) T protein:vir:10 233 AAIVYAHSGLVDCCGKNIIGYLHRAVKPANQL-KLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNR 311 (524) T ss_pred hheeeeeccceeCCCCceeccchhhhHHHHhh-hHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 333221 11123444322222221 233344444344333332221111 11221111 Q ss_pred ------ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc---cCc Q lcl|NC_021301. 271 ------NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQ 337 (456) Q Consensus 271 ------~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~N~ 337 (456) +|...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-+..+. -|. T Consensus 312 lvYDa~TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~ 390 (524) T protein:vir:10 312 VVYDASTGKIKNQQHNMSMTED-YWLQRRDGKAVTEVDTLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMF 390 (524) T ss_pred eEEeCCCCeeccchhhhhhHhh-hcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccc Confidence 1111000000000000 011 11223344455443223444467777888899999988873221 121 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------HHH Q lcl|NC_021301. 338 -SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------AAA 402 (456) Q Consensus 338 -Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d~~ 402 (456) -|..|--....+..-+.+.+..|..-+.++++.-+.++|.-. 
+ ..|.+.|.....-.+...+ +++ T Consensus 391 gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l 470 (524) T protein:vir:10 391 DSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINML 470 (524) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH Confidence 233455666677778889999999999999998877777521 1 3477788765444443333 333 Q ss_pred HHHHh-cC-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 403 SLAKA-AG-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 403 ~kl~~-~g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) ..+.. .| .+|.++++ .+|.+++++++++ .+.+++|...-.-....++++|. T Consensus 471 ~~~dpyvGky~s~~yi~k~ILr~tDeei~~~-~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 471 TMAEPFIGKYISHRTAMKDILQMTDEEIEQE-AKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHhhhhhcccchhHHHHHHHhccCHHHHHHH-HHHHHHHhhcCCCCCCchhhhcC Confidence 33321 13 46888876 5789999888764 44556665443333444445555 No 236 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=97.63 E-value=3.6e-05 Score=44.97 Aligned_cols=415 Identities=11% Similarity=0.049 Sum_probs=179.1 Q ss_pred CCCCCHHH------H---HHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAE------W---LPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~------~---~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~ 70 (456) -|.-.|.. . +..... +.+-...+.|-|... 
..+...++-+..+ .+.+.=+-.+|+..++= T Consensus 29 ~S~~~p~~~Dga~e~~~~~~~~a~------~~~g~~~~~~g~~e~----~~~~~~eLI~~YR~ma~~pEvd~Av~eIVne 98 (524) T protein:vir:72 29 VSITAPKLDDGAREFEVSSNEAAS------PYNAAFQTIFGSYEP----GMKTTRELIDTYRNLMNNYEVDNAVSEIVSD 98 (524) T ss_pred ccccCccCCCCceeeeeccccccc------ccceeeeehhccccc----ccchHHHHHHHHHHHhhccchhhHHHHhhcc Confidence 12222211 0 000000 001111111111000 0000001111111 11122233333333332 Q ss_pred h-----ccCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC----CCceEEEEEc Q lcl|NC_021301. 71 I-----IPNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----DGTATITADS 134 (456) Q Consensus 71 l-----~~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~----dg~~~i~~~~ 134 (456) . ..+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+...|. +|-..++.++ T Consensus 99 aiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lD 178 (524) T protein:vir:72 99 AIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLD 178 (524) T ss_pred eeEecCCCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeC Confidence 2 12344443211 111234455666667888889999999999999999987764 3667789999 Q ss_pred cceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE Q lcl|NC_021301. 135 PETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP 214 (456) Q Consensus 135 p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 214 (456) |+.+-.+..-... ...... ..+......+|.+.. ..|...... . .......+|. 
T Consensus 179 Pr~i~~vr~i~~~--~~~~~~----vi~~~~e~f~Y~~~~-~~y~~~g~~---------~----------~~~~~ikI~~ 232 (524) T protein:vir:72 179 PRQVQYVREIITE--TEAGTK----IVKGYKEYFIYDTAH-ESYACDGRM---------Y----------EAGTKIKIPK 232 (524) T ss_pred CccceeeeeeccC--CCccch----hhcchhhheeeccCc-cccccCccc---------c----------CCCcceecch Confidence 9987554321100 000000 001122223444321 111000000 0 0011111111 Q ss_pred --EEEccC-------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc-------------- Q lcl|NC_021301. 215 --VVVYQN-------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE-------------- 270 (456) Q Consensus 215 --vv~~~n-------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~-------------- 270 (456) |++.+. ..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 233 dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQL-kmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNk 311 (524) T protein:vir:72 233 AAVVYAHSGLVDCCGKNIIGYLHRAVKPANQL-KLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNR 311 (524) T ss_pred hheeeeeccceeCCCCceeccchhhhHhHHhh-hHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 333221 11123444322222221 233344444343333332221111 11221111 Q ss_pred ------ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc---cCc Q lcl|NC_021301. 271 ------NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---ANQ 337 (456) Q Consensus 271 ------~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~N~ 337 (456) +|...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-+..+. -|. 
T Consensus 312 lvYDa~TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~ 390 (524) T protein:vir:72 312 VVYDASTGKIKNQQHNMSMTED-YWLQRRDGKAVTEVDTLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMF 390 (524) T ss_pred eEEeCCCCeeccchhhhhhHhh-hcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccc Confidence 1111000000000000 011 11223344455443223444467777888899999988873221 121 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------HHH Q lcl|NC_021301. 338 -SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------AAA 402 (456) Q Consensus 338 -Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d~~ 402 (456) -|..|--....+..-+.+.+..|..-+.++++.-+.++|.-. + ..|.+.|.....-.+...+ +++ T Consensus 391 gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l 470 (524) T protein:vir:72 391 DSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINML 470 (524) T ss_pred cccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHH Confidence 233455666677778889999999999999998877777521 1 3477788765444433333 333 Q ss_pred HHHHh-cC-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 403 SLAKA-AG-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 403 ~kl~~-~g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) ..+.. .| .+|.++++ .+|.+++++++++ .+.+++|...-.-....++.+|. 
T Consensus 471 ~~~dpyvGky~s~~yi~k~ILr~tDeei~~~-~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 471 TMAEPFIGKYISHRTAMKDILQMTDEEIEQE-AKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHhhhhhcccchhHHHHHHHhccCHHHHHHH-HHHHHHHhhcCCCCCCchhhhcC Confidence 33321 13 46888876 5789999888764 44555555433333333444555 No 237 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=97.57 E-value=4.3e-05 Score=44.51 Aligned_cols=422 Identities=11% Similarity=0.023 Sum_probs=159.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) -+--||..+..-+..-. -...-.||+..-.++. ..+.-.-++.. ..-.+..-++++....+.+..+++.. T Consensus 8 ~~gl~p~rl~~i~~~~~------~~~~~~~~~~~~~~Lr-~~~~~~ly~~m---~~D~hi~s~l~~Rk~av~~~~w~v~p 77 (488) T protein:vir:95 8 QESLPPFRMGEVGSLGL------KVKNGRIYEEPRQALR-FPESIKTFQLM---MRDPAVAASVNIIKMFVRKVNWRFVP 77 (488) T ss_pred CCCCCHHHHHHHHHHhh------ccccchhhccchhhhc-ccchHHHHHHH---hhChHHHHHHHHHHHHHhcCCceEec Confidence 23334543333221100 0111234442222221 11111111221 23567777888888888888877643 Q ss_pred CCc--c-cH----HHHHHHHHHh--cChhHHHHHHHHHHhhCCe-EEEEEeeCCCCceEEEEEccceeEEEEeCCCCceE Q lcl|NC_021301. 81 SAD--S-DL----ALRARRIWRD--NRMDSVCKQWVKYGLDFGE-SYLTCWRRDDGTATITADSPETMVVSVDPLQPWRI 150 (456) Q Consensus 81 ~~d--~-~~----~~~l~~~~~~--n~~~~~~~~~~~~a~~~G~-a~~~v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~ 150 (456) ..+ + .. .+.+.+.+.. +.|..++.++. +|.-||. +++++|....+...............+.....+.. 
T Consensus 78 ~~~~~~d~~~~~~a~~v~~~l~~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~~~~~~~~~~~~dg~~~~~~i~~Rpq 156 (488) T protein:vir:95 78 PKGKEQDPKMLERADFFNSLMDDMEHDWADFINSVM-SFCTYGFCVNEKVYKKRQGKKGKYQSKFDDGLIGWAKLPIRNQ 156 (488) T ss_pred CCCCchhHHHHHHHHHHHHHHhccCccHHHHHHHHH-HhhcccceeeeeeeeccccccccccccccCCeeeeeeeeecCc Confidence 211 1 11 2233444433 23556666654 7888998 46788864322111100000000000000000000 Q ss_pred EEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE---EEE-----ccCCC Q lcl|NC_021301. 151 RSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP---VVV-----YQNPD 222 (456) Q Consensus 151 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p---vv~-----~~n~~ 222 (456) ....++..+.++..... ......... ........+.......+|| +++ ..|+. T Consensus 157 ~~~~~f~~d~d~~l~~~---~~~~~~~~~----------------~~~~~~~~~~~~~~~~lP~~kfi~~~~~~~~g~p~ 217 (488) T protein:vir:95 157 STLDKWYFDEDFRRVTG---VRQNLRNVS----------------HIAGAINLGERPLTRKLPRAKFMLFKYDDEYGNPE 217 (488) T ss_pred ccccceeeccCCCceee---ccccccccc----------------ccccccccccccccccccccceEEEeecCCCCccc Confidence 00000111122211100 000000000 0000000000011111221 222 25678 Q ss_pred CCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcc-cccccccchh-hhhhhh----hhhccceeccCCC-- Q lcl|NC_021301. 223 GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGL-PKVDENGNAI-DYASIF----EAAPGALWELPPG-- 294 (456) Q Consensus 223 g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~-~~~~~~~~~~-~~~~~~----~~~~~~~~~~~~d-- 294 (456) |.|.+..+.-..--=+..+.+.+..++.+..+..+.+|...-. ...++....+ ...... 
..+..+...++.+ T Consensus 218 g~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~ 297 (488) T protein:vir:95 218 GRSPLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLPPDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYID 297 (488) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeeccCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeeccccc Confidence 8888876433222123345566667777777776666632110 0111111111 111100 0111111122222 Q ss_pred ceeE----e---ec--ccchHHHHHHHHHHHHHHHhhcCCChhh--hcccccCcHHH-HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 295 VDIW----E---SQ--TNDFTPMLSAIKEHIRQLSSATKTPLPM--LMPDSANQSAE-GAHNIEKGFLFKCEDRLSIAKI 362 (456) Q Consensus 295 ~~~~----~---~~--~~~~~~~~~~l~~~~~~i~~~~~~p~~~--~~~~~~N~Sg~-Al~~~~~~l~~k~~~~~~~f~~ 362 (456) .++. + +. ..+...|...++.+-.+|+.+.--.... -+..++++.|. ..+... ..+..-.+.+.. T Consensus 298 ~~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~~~~~~Gs~Al~~vh~ev~~----~i~~aDa~~i~~ 373 (488) T protein:vir:95 298 PDTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAMGQSKYGSFSLADSKTSLLA----MSVDILLKQIKN 373 (488) T ss_pred cccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhccccccccCcchhhhHHHHHHHHHH----HHHHHHHHHHHH Confidence 1111 1 11 1223346666666666666543211101 11111222111 222222 222233344555 Q ss_pred HHH-HHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCC-c----HHHHHHhCCCChhHHHHHHHHHHH Q lcl|NC_021301. 363 GLE-AILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGES-W----ASIRRNILNYNADQIKQDDLDRAR 436 (456) Q Consensus 363 ~l~-~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~-s----~~t~~~~~~~~~~~~~~~e~~~~~ 436 (456) .+. +++.-++.+..-......++.|....+.|.++.++++.+|.++|+. + .+.+++.+|+.+.+..+....... 
T Consensus 374 tln~~li~~l~~~Nfg~~~~~P~~~~~~~e~~Dl~~~ae~~~~L~~~G~~i~~~~~~~~i~e~~gip~~~~~e~~~~~~~ 453 (488) T protein:vir:95 374 VINRDLVAQTYALNMWDDEEHVQITYDDIETPDLEAIGSYIQKTVAVGALEVDKELSNKLREHIGLPPADESQPVSEKLS 453 (488) T ss_pred HHHHHHHHHHHHhcCCCCCCccEEEecCcChhhHHHHHHHHHHHHhCCCccccHHHHHHHHHHhCCCCCCCCccccccCC Confidence 553 4666555555322333346788888899999999999999999964 4 356778888764321111010000 Q ss_pred HHHHHHhhhh---------hhhcccccCC Q lcl|NC_021301. 437 EQITLFAGNS---------VQRPQEDGSR 456 (456) Q Consensus 437 ee~~~~~~~~---------~~~~~~d~~~ 456 (456) .+....++.. .....++++. T Consensus 454 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 482 (488) T protein:vir:95 454 PNSQSRSGDGYKTAGEGTAKTPSAKDPST 482 (488) T ss_pred CCCCCCCCcccCCCcccCCcccccccchh Confidence 0000001100 0011111111 No 238 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=97.51 E-value=5.3e-05 Score=44.01 Aligned_cols=418 Identities=12% Similarity=0.071 Sum_probs=173.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhhc-----cC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRII-----PN 74 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l~-----~~ 74 (456) -.++.++-. ..+ .---....||.-...+ .+. .++-+..+ .+.+.=+-.+|+..+.=.+ .+ T Consensus 23 ~~p~~ddg~-~~~--------~~~g~~~~~~~~~~~~----~~~-~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~~~ 88 (558) T protein:vir:10 23 VPKNNEDGV-DNF--------ISSGFYGQYVDIEGAY----RSE-YDLIRRYREMALHPEADGAIEDVVNEAIVSDLYDS 88 (558) T ss_pred cCCCccccc-cce--------eccceeeeeecccchh----hhH-HHHHHHHHHHhhccchhhHHHHhhcceeEecCCCc Confidence 111111100 000 0000000111100000 000 01111111 1122233334443333221 23 Q ss_pred CeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC----CCceEEEEEccceeEEEEe Q lcl|NC_021301. 
75 GITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----DGTATITADSPETMVVSVD 143 (456) Q Consensus 75 ~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~----dg~~~i~~~~p~~~~~~~d 143 (456) ||.+.-+. ......++..+.+--+|+....+..|.-.+.|+.|.+...|. +|-..++.++|+.+-.+.. T Consensus 89 pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~ 168 (558) T protein:vir:10 89 PVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQ 168 (558) T ss_pred eEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCcccceeeee Confidence 44432211 112223455566667888889999999999999999987764 3667899999998876654 Q ss_pred CCCC-ceEEEEEEEEEecC-----CceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--E Q lcl|NC_021301. 144 PLQP-WRIRSAMRWWRDLD-----AESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--V 215 (456) Q Consensus 144 ~~~~-~~~~~~~~~~~~~d-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--v 215 (456) -... ..... +...+... .......+|.+...+........ ..+ ....++. | T Consensus 169 i~~~~~~~~~-~~~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~-----------~~~---------~~vkI~~dAI 227 (558) T protein:vir:10 169 EKRKPGNQDP-AIRVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQM-----------GGK---------NSIKIAKDSI 227 (558) T ss_pred eccccccccc-eeeeecccceeeccceeEeeeecCCcccccccceee-----------cCC---------Cceeechhhe Confidence 2111 00000 00011111 11122233333221111100000 000 0111111 2 Q ss_pred EEc-------cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Ccccccccccchhhhhhh------h Q lcl|NC_021301. 216 VVY-------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDENGNAIDYASI------F 281 (456) Q Consensus 216 v~~-------~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~~~~------~ 281 (456) ++. ++..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+.-=..+.+.. . 
T Consensus 228 ~y~hSGL~d~~~~~i~syLhkAIKp~NQL-kmlEDAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVY 306 (558) T protein:vir:10 228 TMCTSGLVDRNKNRVLSYLHKAIKALNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVY 306 (558) T ss_pred eeecccceecCCCeeeecchHhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEE Confidence 222 1111124444332222222 233344444344333332221111 112211110000000000 0 Q ss_pred hhhccc-------------eec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-Cc-HHHHH Q lcl|NC_021301. 282 EAAPGA-------------LWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGA 342 (456) Q Consensus 282 ~~~~~~-------------~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~-Sg~Al 342 (456) .+..|. .|. .+.+..+..|+...--+-++-++-+-..++...++|.+-++...+ |. -|..| T Consensus 307 Da~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EI 386 (558) T protein:vir:10 307 DANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEI 386 (558) T ss_pred eccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchh Confidence 000000 011 112233444554432223344667777888889999887754321 21 22245 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHH-------HHHHHHHHh- Q lcl|NC_021301. 343 HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEK-------YAAASLAKA- 407 (456) Q Consensus 343 ~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~-------ad~~~kl~~- 407 (456) --....+..-+.+.+..|..-|.++++.-+.++|.-. + ..|.+.|.....-.+... ++++..+.. 
T Consensus 387 tRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpy 466 (558) T protein:vir:10 387 LRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPY 466 (558) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhh Confidence 5566667777889999999999999998877777521 1 347778876543333333 333333321 Q ss_pred cC-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHH-----------HHhhhhhhhcccccCC Q lcl|NC_021301. 408 AG-ESWASIRR-NILNYNADQIKQDDLDRAREQIT-----------LFAGNSVQRPQEDGSR 456 (456) Q Consensus 408 ~g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~-----------~~~~~~~~~~~~d~~~ 456 (456) .| .+|.++++ .+|.+++++++++.. .+++|.. .+.+..-+. +.|+.. T Consensus 467 vGky~S~dyi~k~ILr~tDeeI~~~~k-qI~~E~k~~~~~~p~~~~~~~~~~~~~-~~~~~~ 526 (558) T protein:vir:10 467 IGKYYSTEYVRKRVLRQTDMEIEEIDT-QIEDEIQKGIIPDPSQIDPITGEPLPQ-EGDPAM 526 (558) T ss_pred hccccchHHHHHHHhccCHHHHHHHHH-HHHHHHhCCCCCCccccChhhccccCc-cCCchh Confidence 13 46888876 578999988876433 3333321 121111100 001111 No 239 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=97.41 E-value=7.3e-05 Score=43.27 Aligned_cols=395 Identities=9% Similarity=-0.037 Sum_probs=157.7 Q ss_pred HHHHHHHHHHHHHH----HHHHH--------------------HHHHHhcccCcccccCcccchhhhhh--hhhhccChH Q lcl|NC_021301. 7 AEWLPVLTKRIDDG----MSRVR--------------------LLARYSNGDAPLPELTRNTSAAWRSF--QREARTNWG 60 (456) Q Consensus 7 ~~~~~~l~~~~~~~----~~r~~--------------------~~~~YY~g~~~i~~~~~~~~~~~~~~--~~k~~~n~~ 60 (456) +-|+.+|....... .+.+. ....+..|.... ..+...... ..-+.+... 
T Consensus 1 M~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~-----~~~~~g~~v~~~~a~~~~~v 75 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPRMSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTE-----LAPDTFVGLATQAYQANGPV 75 (466) T ss_pred CchhHHHhhccCcccccchhhhhhhhhhhhccccccccccccHHHHHhhcccccc-----ccCccccccchhhhhccHHH Confidence 45666666553211 01110 011111111100 000011111 112335667 Q ss_pred HHHHHHHHhhhccCCeecCCCCccc----HHHHHHHHHHh-c---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCc----- Q lcl|NC_021301. 61 LMVRDSVADRIIPNGITVGGSADSD----LALRARRIWRD-N---RMDSVCKQWVKYGLDFGESYLTCWRRDDGT----- 127 (456) Q Consensus 61 ~~iVd~~a~~l~~~~~~~~~~~d~~----~~~~l~~~~~~-n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~----- 127 (456) ..+|+.+++-+..-|+.+....+.. ....+..++.+ | ....+...+...++.+|.||+++..++.|. T Consensus 76 ~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~~~~ 155 (466) T protein:vir:81 76 FACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMRPDW 155 (466) T ss_pred HHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCcccccccc Confidence 7889999998888887754322211 11123334432 2 223556778889999999999998876654 Q ss_pred ----eEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceE--EEEEEcCCeEEEEEEeeeecccccceeeccCCCcee Q lcl|NC_021301. 128 ----ATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESD--FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWV 201 (456) Q Consensus 128 ----~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (456) ..+..++|..+.+..+......+ .|++...+... ....|..+.+.++.. T Consensus 156 ~g~~~~l~~l~~~~v~~~~~~~~~~~~----~y~~~~~~~~~~~~~~~~~~~dviHir~--------------------- 210 (466) T protein:vir:81 156 VDVVVEERMVRGGRGELGGGQLGWRKV----GYLYTEGGRQSGNESVGFLAEDVVHFAP--------------------- 210 (466) T ss_pred CcceeEEEEecCcceEEEEcCCCceEE----EEEEEecCcccccceeeeccccEEEEcC--------------------- Confidence 24666677766666543322111 11111111100 001122222221100 Q ss_pred ecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchh-hh Q lcl|NC_021301. 
202 PVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAI-DY 277 (456) Q Consensus 202 ~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~ 277 (456) ..+ | +.--.|.|-+......++....+. .....++. .|..+++- ... ..++....+ .. T Consensus 211 ----~~~-----~---~d~~~G~s~i~~~~~~i~~~~a~~---~~~~~~f~ng~~p~gil~~-~~~--l~~e~~~~~~~~ 272 (466) T protein:vir:81 211 ----IPD-----P---LASYRGMSWLTPILREIRADQAMS---KHQAKFFDNGATVNLVIKH-NPM--ADPAAVKKWADE 272 (466) T ss_pred ----CCC-----c---ccccccccHHHHHHHHHHHHHHHH---HHHHHHHhcCCCcceEEec-CCC--CCHHHHHHHHHH Confidence 000 0 000136665554433333222111 11222222 24333321 111 111111111 11 Q ss_pred h-hhhh--hhccceeccCCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccc--cCcHHHHHHHHHHHHHH Q lcl|NC_021301. 278 A-SIFE--AAPGALWELPPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDS--ANQSAEGAHNIEKGFLF 351 (456) Q Consensus 278 ~-~~~~--~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~--~N~Sg~Al~~~~~~l~~ 351 (456) . ..+. ...+.+..++.+.++.++...+. ..|++..+..+.+|+.+-++|+..+|... +.+++..++.....+.. T Consensus 273 ~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~~st~sn~eq~~~~f~~ 352 (466) T protein:vir:81 273 VNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLAAATYSNYGQARRRLAD 352 (466) T ss_pred HHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCCccccccHHHHHHHHHH Confidence 1 1111 11245667888888888764322 33778888899999999999999997431 11222222222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCC--CCcCHHHHHHH-------HHHHHhcCCCcHHHHHHhCCC Q lcl|NC_021301. 352 KCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESP--DRVTLGEKYAA-------ASLAKAAGESWASIRRNILNY 422 (456) Q Consensus 352 k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~--~~~~~~e~ad~-------~~kl~~~g~~s~~t~~~~~~~ 422 (456) .+ -.-+-..|++.+...+- .......+.+.|+.. +-.|..+.+++ +.+++++|+ +..-++...+. 
T Consensus 353 ~t---l~P~~~~ie~~l~~~L~--~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~ 426 (466) T protein:vir:81 353 GT---AHPLWQNLSGCIGHVMP--DMGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAGY-EPESVVAAVNS 426 (466) T ss_pred HH---HHHHHHHHHHHHHhhcC--CcccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcCC-ChhhccccccC Confidence 11 01111222222221111 111222344556543 34455544443 455666775 33333332211 Q ss_pred ChhHH-H---HHHHHHHHH-HHHHHh--hhhhhhcccccC Q lcl|NC_021301. 423 NADQI-K---QDDLDRARE-QITLFA--GNSVQRPQEDGS 455 (456) Q Consensus 423 ~~~~~-~---~~e~~~~~e-e~~~~~--~~~~~~~~~d~~ 455 (456) -+... . ..-.+.... ...... ....+-.+++|+ T Consensus 427 gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 427 GDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred CccccccCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 11000 0 000000000 000000 001112234555 No 240 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=97.36 E-value=8.5e-05 Score=42.91 Aligned_cols=345 Identities=9% Similarity=-0.071 Sum_probs=141.0 Q ss_pred HHHHHHHHHHhcccCcccccCcccchhhhhhhhhh--ccChHHHHHHHHHhhhccCCeecCC-C-C----cc---cHHHH Q lcl|NC_021301. 21 MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA--RTNWGLMVRDSVADRIIPNGITVGG-S-A----DS---DLALR 89 (456) Q Consensus 21 ~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~--~~n~~~~iVd~~a~~l~~~~~~~~~-~-~----d~---~~~~~ 89 (456) ..=+.++..+..+.... +.. ... ......+ .......+|+.+++-+..-|+.+-. . . +. -.... 
T Consensus 1 Mg~f~~~~~~~~~~~~~-~~~-~~~---~~~~~~~~~~~~~v~~~v~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~ 75 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN-DTQ-RVT---AWQNEAVEYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSD 75 (378) T ss_pred CCccccchhcccccccC-Ccc-eee---eeccchhHHHHHHHHHHHHHHHhhhhhCceeeEEEcccCcccccccccccch Confidence 11111111111111000 000 000 0000111 1235566788888877777765311 0 0 10 01123 Q ss_pred HHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEE-eeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCc Q lcl|NC_021301. 90 ARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTC-WRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAE 163 (456) Q Consensus 90 l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v-~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~ 163 (456) +.++++. |. .......+...++.+|.||++. +.+..|++... -| .++ T Consensus 76 l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~~~l--~p------------------------~~~- 128 (378) T protein:vir:94 76 LDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGELLDL--LF------------------------ADD- 128 (378) T ss_pred HHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceEEEE--Ee------------------------cCC- Confidence 4555542 32 2355677888999999999864 44433332110 00 000 Q ss_pred eEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCC-CCcHhHHHHHHHHHHHHHH Q lcl|NC_021301. 164 SDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDG-MGEVEPHIDIINRINRAEL 242 (456) Q Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g-~s~~~~v~~liDa~~~~~s 242 (456) .. -|..+.+++ +.++.. ..-..++-.+..+++..++ T Consensus 129 ~~---~~~~~diiH----------------------------------------~~~~~~~~~g~s~l~~~~~~i~~~~~ 165 (378) T protein:vir:94 129 KK---EYKPEELVR----------------------------------------LTSPFYINEDTSILDNALASIQTKLE 165 (378) T ss_pred ee---EeeeeeeEE----------------------------------------ecCcCCccchhHHHHHHHHHHHHHHh Confidence 00 011112221 111111 1111222223333332222 Q ss_pred HHHHHHHHhhchhhhhhcCCCccccccc-ccchhhhh-hhhh-----hhccceeccCCCceeEeecccchHHHHHHHHHH Q lcl|NC_021301. 
243 QLLSTMAIQAFRQRALKSAGHGLPKVDE-NGNAIDYA-SIFE-----AAPGALWELPPGVDIWESQTNDFTPMLSAIKEH 315 (456) Q Consensus 243 ~~~~~~~~~~~~~~~i~g~~~~~~~~~~-~~~~~~~~-~~~~-----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~ 315 (456) . +.+.-+++- ... ..++ .+...... ..+. ...+.+..++.+.++.+++..+...-.+.++.. T Consensus 166 ~--------~~~~gil~~-~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~ 234 (378) T protein:vir:94 166 Q--------GKLRGLLKI-NAF--LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLI 234 (378) T ss_pred c--------ccccceeee-CCc--CCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEEccCChhhhhHHHHHHH Confidence 1 112112211 110 0011 11111111 1111 123346778888899887643333323456778 Q ss_pred HHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc--ceeEEecCCCCc Q lcl|NC_021301. 316 IRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVED--TVDVSFESPDRV 393 (456) Q Consensus 316 ~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~--~i~v~f~~~~~~ 393 (456) ..+|+.+-|+|+..+.+..+ ....+.+....|.-.+...+..|...|- ----...|....+ .+++.+..-.-. T Consensus 235 ~~~Ia~~fgVP~~~l~~~~s--e~~~~~f~~~tL~P~~~~ie~~l~~~Ll---~~~er~~g~~~~~~~~~~f~~~~l~~~ 309 (378) T protein:vir:94 235 KSELLTGYFMNENILLGTAS--QEQQIYFYNSTIIPLLIQLEKELTYKLI---STNRRRVVKGNLYYERIIVDNQLFKFA 309 (378) T ss_pred HHHHHHHhCCCHHHhcCChH--HHHHHHHHHHHHHHHHHHHHHHHHhhcC---ChhHhhhhhhcccccceeecchhhhhc Confidence 88999999999998854321 1112222222222222111111111110 0000011111222 234444555677 Q ss_pred CHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHHHH-HHHhhhhhhhcccccCC Q lcl|NC_021301. 394 TLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAREQI-TLFAGNSVQRPQEDGSR 456 (456) Q Consensus 394 ~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~ee~-~~~~~~~~~~~~~d~~~ 456 (456) |..+.++++.++.++|+++.--+++.+|+.|.+--. .....+.... 
...........+|+.++ T Consensus 310 d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 310 TLKELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred CHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeecccccccccchhhcCCcCCCCCCCCCCCC Confidence 889999999999999999998899998886542110 0111111000 01112222333444444 No 241 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=97.35 E-value=8.9e-05 Score=42.80 Aligned_cols=430 Identities=11% Similarity=-0.002 Sum_probs=166.2 Q ss_pred CCCCC----HHHHHHHHHHHHHH----HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhc Q lcl|NC_021301. 1 MTAST----PAEWLPVLTKRIDD----GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII 72 (456) Q Consensus 1 ~~~~t----~~~~~~~l~~~~~~----~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~ 72 (456) |...- ..+-++.....+.. ...+++.+.+|..-..-.. ...+ ......++--+-+...++.+++.|. T Consensus 1 m~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~~~~---~~~~--~~~~~~~~~dst~~~a~~~LAa~L~ 75 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSVFPS---ATAD--GSTSYTTPWQSIGARGLNNLASKLM 75 (532) T ss_pred CcchhhccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCC---CCCc--chhhccccccchHHHHHHHHHHHHH Confidence 65432 23444444444433 3455566666654432111 1111 1112234555667888888888775 Q ss_pred c------CC-eecCCCCccc---------HHHH-----------HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC Q lcl|NC_021301. 73 P------NG-ITVGGSADSD---------LALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD 125 (456) Q Consensus 73 ~------~~-~~~~~~~d~~---------~~~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d 125 (456) + .| |++... |.+ ...+ +...+..++|.....++.++..++|.|.+++..++. 
T Consensus 76 ~~ltpp~~~WF~l~~~-d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~ 154 (532) T protein:vir:99 76 LALFPVGSSFFKLNVS-ELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQ 154 (532) T ss_pred HhhcCCCCccccccCC-HHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccccc Confidence 4 23 333221 110 1111 234556688999999999999999999988866432 Q ss_pred ---CceEEEEEccceeEEEEeCCCCceEEEEEEEEEec-C---C---ceEEEEEE--cC-CeEEEEEEeeeeccccccee Q lcl|NC_021301. 126 ---GTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL-D---A---ESDFAIVW--SG-DGWQKFARPCFVQSSSRRRL 192 (456) Q Consensus 126 ---g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-d---~---~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~~~ 192 (456) +..+++.++-.+.++.-|. .++ +...++..+-. + . .......+ .+ ..+..|+.. +...+...+. T Consensus 155 ~~~~~~~f~~~pl~~y~v~~d~-~G~-v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v-~~~~~~~~~~ 231 (532) T protein:vir:99 155 VEGQSNAPKLYKLHNFVVERDA-YDN-VLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHV-YRDPEAMVFR 231 (532) T ss_pred ccCcccceEEEEcCeEEEeeCC-CCC-eeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEE-EecCCCCeeE Confidence 3456777776665444443 343 44444433210 0 0 00000000 00 111111111 1111111111 Q ss_pred e-ccCCCceeecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcc Q lcl|NC_021301. 193 V-TRISDSWVPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGL 265 (456) Q Consensus 193 ~-~~~~~~~~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~ 265 (456) . ....+..........++. +|.+++ | .+.+|+|-.+..++-+..+|...-......+....|...+. 
T Consensus 232 ~~~~~~g~~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~lv~------ 305 (532) T protein:vir:99 232 SYQEIDGEIVAGTEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKVLFFVN------ 305 (532) T ss_pred EEEeecCceecccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCceec------ Confidence 0 001111111111111122 233322 2 34689999999999999998776666666666665543221 Q ss_pred cccccccchhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHH Q lcl|NC_021301. 266 PKVDENGNAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAH 343 (456) Q Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~ 343 (456) .+|. .........+.|.+....+ +....++. .+++..-.+.++.+...|....-+ +.....+...-+|.-++ T Consensus 306 ----p~g~-~~~~~~~~~~~g~~v~g~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~-~~~~~~d~~r~TAtEV~ 379 (532) T protein:vir:99 306 ----PNGV-TQIRRVAKANTGDFVAGRKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFML-NSAVQRGGDRVTAEEIR 379 (532) T ss_pred ----cccc-cchhhhccCCCcceecCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhh-hhcccCCCCcccHHHHH Confidence 1111 1111222334444433222 22333333 234444444444444444332211 11111122223444443 Q ss_pred HHHHHHHHHHH----HHH-HHHHHHHHHHHHHHHHhcCC---Ccc-ccee-EEecCCCCcCHHHHHHHHHHH-------H Q lcl|NC_021301. 344 NIEKGFLFKCE----DRL-SIAKIGLEAILVKALQIEGE---SVE-DTVD-VSFESPDRVTLGEKYAAASLA-------K 406 (456) Q Consensus 344 ~~~~~l~~k~~----~~~-~~f~~~l~~~~~l~~~~~~~---~~~-~~i~-v~f~~~~~~~~~e~ad~~~kl-------~ 406 (456) .....+.+... +.+ ..+.+-+.+.+.++.+..-. +++ .... +++- +....++.+..+ . 
T Consensus 380 ~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~i-----s~Laraq~~~~l~~~~~~la 454 (532) T protein:vir:99 380 YVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGL-----EALGRGHDLNKLNVFIDYMI 454 (532) T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceeecc-----hHHHHHHHHHHHHHHHHHHH Confidence 33322222211 111 22233334444444332111 111 1111 2222 222222222222 1 Q ss_pred hc-C----CCcHH----HHHHhCCCCh-------hHHHHHHHHHHHHHHHHHhh-hhhhhcccccCC Q lcl|NC_021301. 407 AA-G----ESWAS----IRRNILNYNA-------DQIKQDDLDRAREQITLFAG-NSVQRPQEDGSR 456 (456) Q Consensus 407 ~~-g----~~s~~----t~~~~~~~~~-------~~~~~~e~~~~~ee~~~~~~-~~~~~~~~d~~~ 456 (456) +. + .+.-. ...+.+|+++ ++++++.+++..++....+. ........-+.+ T Consensus 455 q~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~ 521 (532) T protein:vir:99 455 KLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAA 521 (532) T ss_pred hhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcch Confidence 11 1 11111 1224456643 23332222211111111111 111111111111 No 242 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=97.34 E-value=9.1e-05 Score=42.74 Aligned_cols=358 Identities=12% Similarity=0.048 Sum_probs=152.2 Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHhcccCcccccCccc-chhhhh-----------------------h--hhhhcc Q lcl|NC_021301. 7 AEWLPVLTKRIDDGMSRV---RLLARYSNGDAPLPELTRNT-SAAWRS-----------------------F--QREART 57 (456) Q Consensus 7 ~~~~~~l~~~~~~~~~r~---~~~~~YY~g~~~i~~~~~~~-~~~~~~-----------------------~--~~k~~~ 57 (456) +-+...|-. ....+++ .=..+|.-|+.++....+.. +.+-+. . 
..-+.+ T Consensus 1 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~t~~~~~~~ 78 (409) T protein:vir:83 1 MGFWSNLFG--IPSIPDLPNDNGPVDYNPGDPDMVEFRGPEEEPEARALPWIRPTAWSGYPESWATPSWGSAQDKLRTLI 78 (409) T ss_pred Cchhhhhcc--cccCCCcccccccccccCCCCceeeccCCCcchhhhhcccccccccccccccccccCccccchhhHhhh Confidence 000000000 0001111 11123444443332221100 000000 0 001122 Q ss_pred ChHHHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHh--cCh---hHHHHHHHHHHhhCCeEEEE-EeeCCCCce-EE Q lcl|NC_021301. 58 NWGLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRD--NRM---DSVCKQWVKYGLDFGESYLT-CWRRDDGTA-TI 130 (456) Q Consensus 58 n~~~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~--n~~---~~~~~~~~~~a~~~G~a~~~-v~~d~dg~~-~i 130 (456) .....+|+.+++-+..-|+.+...... ...+..++.. |.+ ..+...+....+ .|.+|++ +..+.+|.+ .+ T Consensus 79 ~~v~acV~~Ia~~iA~lpl~~~~~~~~--~~~~~~ll~~~PN~~~t~~~f~~~l~~~ll-lGnay~~~i~r~~~G~~~~L 155 (409) T protein:vir:83 79 DVAWACIDLNASVLSSMPIYRMRNGRI--IDSVAWMSNPDPEVYTSWQEFAKQLFWDFQ-LGEAFVLPMAHGSDGYPIRF 155 (409) T ss_pred HHHHHHHHHHHHhhccCceEEeeCCcc--ccchhhhcccCCCCCCCHHHHHHHHHHHHh-hCCcEEEEEEECCCCcEEEE Confidence 344567888888887778765322111 1112212221 221 233333444444 4889876 567888876 58 Q ss_pred EEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccC Q lcl|NC_021301. 131 TADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTG 210 (456) Q Consensus 131 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (456) ..++|..+.+..++.. . .+|+- .+ .+..+++.++ - T Consensus 156 ~pl~p~~v~v~~~~~g-~------~~y~~-~~------~~~~~eiiHi-------------------------------r 190 (409) T protein:vir:83 156 RVVPPWLVNVELKKGA-R------REYRI-GG------LNVTDEILHI-------------------------------R 190 (409) T ss_pred EEECCcceEEEEcCCc-e------EEEEE-cc------ccCccceEEe-------------------------------C Confidence 8888888776665331 1 01110 00 0011111110 0 Q ss_pred ceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchh-hhh-hhhhhhc Q lcl|NC_021301. 
211 SPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAI-DYA-SIFEAAP 285 (456) Q Consensus 211 ~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~~-~~~~~~~ 285 (456) .+. ....-.|.|-++.....++..+.+.. ....++. .|..+++ .... ..++....+ ... ....... T Consensus 191 ~~~---~~~~~~G~spi~~~~~~i~~~~a~~~---~~~~~f~nga~p~gil~-~~~~--ls~e~~~~~~~~~~~~~~~na 261 (409) T protein:vir:83 191 YQG---NTADAHGHGPLESAAPRQVVIGLLQK---YVQNLAETGGVPLYWLG-VERR--LSETEAVDLMDRWIESRSKYA 261 (409) T ss_pred CCC---CCCCcccccHHHHHHHHHHHHHHHHH---HHHHHHhcCCCcceEee-cCCC--CCHHHHHHHHHHHHHhhCCcc Confidence 000 00112466666655444443332211 1222222 3433332 1111 111111111 111 1111123 Q ss_pred cceeccCCCceeEe-e--cccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-------CcHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 286 GALWELPPGVDIWE-S--QTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-------NQSAEGAHNIEKGFLFKCED 355 (456) Q Consensus 286 ~~~~~~~~d~~~~~-~--~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-------N~Sg~Al~~~~~~l~~k~~~ 355 (456) +..+.+.++.++.+ + ++.+++ |++..+..+.+|+.+-++|++.+|.... |.....+.+....|.-.+.+ T Consensus 262 g~~~il~~g~~~~~~~~~s~~d~q-~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn~eq~~~~f~~~tL~P~~~~ 340 (409) T protein:vir:83 262 GHPALVTGGATLNQAKSMSAQDLS-LMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSNIEQLFSFHDRSSLRPKATA 340 (409) T ss_pred CccceecCCcccccccCCCHHHHH-HHHHHHhhHHHHHHHhCCCHHHccCCCCccccccccHHHHHHHHHHHHHHHHHHH Confidence 34444555555533 2 233333 7777788899999999999999974321 21112222222333333333 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHHHHHHHH Q lcl|NC_021301. 356 RLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGESWASIRRNILNYNADQIKQDDLDRA 435 (456) Q Consensus 356 ~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~~e~~~~ 435 (456) .+..+...| + .....+++.+..-+-.|.++.+++..++.++|+++.-.+++.+|+.+.+--. 
+ T Consensus 341 ie~~l~~~L-------l-----~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~lT~NE~R~~~glpp~~ggd----~- 403 (409) T protein:vir:83 341 VMAALDRWA-------L-----PSPQHLELNRDDYTRPSLVERATAYKIMIEAGVMEPNEARAMERLHSEAAAV----R- 403 (409) T ss_pred HHHHHHHhh-------C-----CCCcEEEeehhhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCc----c- Confidence 222222211 1 1123355445555667889999999999999999998888888876532110 0 Q ss_pred HHHHHHHhhhhh Q lcl|NC_021301. 436 REQITLFAGNSV 447 (456) Q Consensus 436 ~ee~~~~~~~~~ 447 (456) +....+ T Consensus 404 ------l~~~gv 409 (409) T protein:vir:83 404 ------LSGGGV 409 (409) T ss_pred ------cCCCCC Confidence 000000 No 243 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=97.31 E-value=0.0001 Score=42.52 Aligned_cols=347 Identities=10% Similarity=-0.062 Sum_probs=144.5 Q ss_pred HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCC--C----c---ccHHHHHH Q lcl|NC_021301. 21 MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS--A----D---SDLALRAR 91 (456) Q Consensus 21 ~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~--~----d---~~~~~~l~ 91 (456) ..=+.+...+-.+.... ....-..-...... ........+|+.+++-+..-|+.+-.. . + +-....+. T Consensus 1 Mg~f~~~~~~~~~~~~~-~~~~~~~~~~~~~~--~~~~~v~~~i~~Ia~~iA~l~~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:16 1 MNLFGKVVSFSRGKLNN-DTQRVTAWQNEAVE--YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred CccchhhhhhhcccccC-Ccceeeecccchhh--HHHHHHHHHHHHHHhhhhhCceeEEEEcccccccccccccccchHH Confidence 11112222211111100 00000000000000 123345667888888777777653110 0 0 01123355 Q ss_pred HHHHh--c---ChhHHHHHHHHHHhhCCeEEEEEeeCC-CCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceE Q lcl|NC_021301. 
92 RIWRD--N---RMDSVCKQWVKYGLDFGESYLTCWRRD-DGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESD 165 (456) Q Consensus 92 ~~~~~--n---~~~~~~~~~~~~a~~~G~a~~~v~~d~-dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~ 165 (456) ++++. | ........+...++.+|.||++...+. .|++. .+-| ++.. T Consensus 78 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~--~l~~-------------------------~~~~- 129 (378) T protein:vir:16 78 EVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELL--DLLF-------------------------ADDK- 129 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEE--EEEe-------------------------cCCe- Confidence 55542 2 234556678889999999998653332 22211 0000 0000 Q ss_pred EEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCC-CCCcHhHHHHHHHHHHHHHHHH Q lcl|NC_021301. 166 FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPD-GMGEVEPHIDIINRINRAELQL 244 (456) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~-g~s~~~~v~~liDa~~~~~s~~ 244 (456) ..|..+.++++ .++. +.....++..+.++++..++ T Consensus 130 --~~~~~~diih~----------------------------------------r~~~~~~~~~s~l~~~~~~i~~~~~-- 165 (378) T protein:vir:16 130 --KEYKPEELVRL----------------------------------------TSPFYINEDTSILDNALASIQTKLE-- 165 (378) T ss_pred --eEecccceEEe----------------------------------------cCccCccchhHHHHHHHHHHHHHHh-- Confidence 00111122221 1111 11122233333444432222 Q ss_pred HHHHHHhhchhhhhhcCCCcccccccc-cchhhh-hhhhh-----hhccceeccCCCceeEeecccchHHHHHHHHHHHH Q lcl|NC_021301. 245 LSTMAIQAFRQRALKSAGHGLPKVDEN-GNAIDY-ASIFE-----AAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIR 317 (456) Q Consensus 245 ~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~~~~-~~~~~-----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~ 317 (456) .+.+.-+++. ... ..++. +..... ...++ ...+.+..++.+.++.+++..+...-.+.++.+.. 
T Consensus 166 ------~~~~~g~l~~-~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~ 236 (378) T protein:vir:16 166 ------QGKLRGLLKI-NAF--LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKS 236 (378) T ss_pred ------cCccceeeEe-CCc--CCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHH Confidence 1112222211 111 11111 111111 11111 12345677888889988764332222344677888 Q ss_pred HHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--ccceeEEecCCCCcCH Q lcl|NC_021301. 318 QLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--EDTVDVSFESPDRVTL 395 (456) Q Consensus 318 ~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~~~i~v~f~~~~~~~~ 395 (456) +|+.+-|+|+..+.+..+ ....+.+....|.-.+...+..+...|-. ---...+... ...+++.+..-...|. T Consensus 237 ~Ia~~fgVPp~~l~g~~~--e~~~~~f~~~tl~P~~~~ie~~l~~kLl~---~~e~~~~~~~~~~~~~~f~~~~l~~~d~ 311 (378) T protein:vir:16 237 ELLTGYFMNENILLGTAS--QEQQIYFYNSTIIPLLIQLEKELTYKLIS---TNRRRVVKGNLYYERIIVDNQLFKFATL 311 (378) T ss_pred HHHHHhCCCHHHhcCCch--HHHHHHHHHHHHHHHHHHHHHHHHhhcCC---hhhhhhhhhcccccceeeccchhhhcCH Confidence 999999999998864321 22222222222222222222222211100 0000111111 2234455566677889 Q ss_pred HHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHHHHHH-HhhhhhhhcccccCC Q lcl|NC_021301. 396 GEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAREQITL-FAGNSVQRPQEDGSR 456 (456) Q Consensus 396 ~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~ee~~~-~~~~~~~~~~~d~~~ 456 (456) .+.++++.+++.+|+++.--+++.+|+.|-+--. .....+.+..+. 
.........+|+.++ T Consensus 312 ~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 312 KELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 9999999999999999998899999887542110 111111111111 111222223444444 No 244 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=97.22 E-value=0.00013 Score=41.94 Aligned_cols=421 Identities=11% Similarity=0.024 Sum_probs=159.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc------C Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP------N 74 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~------~ 74 (456) |-.. -....+.|..+-.....+++.+.+|..-.... ...... .....++.-+-+...++.+++.|.+ . T Consensus 1 m~~~-~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~---~~~~~~--~~~~~~~~dst~~~a~~~Laa~l~~~ltpp~~ 74 (555) T protein:vir:17 1 MKHS-AQAKYMMLRADREDYLDSGRQSARLTLPYILT---DEGHVQ--GGYLPTPWQSVGSKGVNVLASKLMLSLFPVNT 74 (555) T ss_pred ChhH-HHHHHHHHHHHhhHHHHHHHHHHHHhcccccC---CCCCcc--cccccccccccHHHHHHHHHHHHHHhhcCCCC Confidence 3322 22233334333344455666777775443111 111111 1112345566778888888887754 2 Q ss_pred C-eecCCCC--------cccHHH-----------HHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEc Q lcl|NC_021301. 75 G-ITVGGSA--------DSDLAL-----------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADS 134 (456) Q Consensus 75 ~-~~~~~~~--------d~~~~~-----------~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~ 134 (456) | |++.... +.+... 
.+...+..++|.....++.++..++|.+.++ .++++ +++++ T Consensus 75 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly--~~~~~---~~~~p 149 (555) T protein:vir:17 75 SFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALLY--QGKKN---LKLYP 149 (555) T ss_pred cccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--ecCCc---eeEEE Confidence 3 3343221 111111 2334455688999999999999999998654 45553 44454 Q ss_pred cceeEEEEeCCCCceEEEEEEEEEec-------CCce----------------EEE---EE-----EcCCeEEEEEEeee Q lcl|NC_021301. 135 PETMVVSVDPLQPWRIRSAMRWWRDL-------DAES----------------DFA---IV-----WSGDGWQKFARPCF 183 (456) Q Consensus 135 p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~----------------~~~---~~-----~~~~~~~~~~~~~~ 183 (456) -.+.++..|+ .++ +..+++.++-. .|.. ... .+ +....+..|. +. T Consensus 150 l~~y~v~~d~-~G~-vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~v~t-~~- 225 (555) T protein:vir:17 150 LDRFVVSRDG-EGN-VMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDALVYT-YV- 225 (555) T ss_pred cCeEEEeeCC-CcC-eeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcceeEee-cc- Confidence 4554444443 343 34444433210 0100 000 00 0000000010 00 Q ss_pred ecccccceeec-cCCCceeeccccc-ccCceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhh Q lcl|NC_021301. 184 VQSSSRRRLVT-RISDSWVPVGDAV-VTGSPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQR 256 (456) Q Consensus 184 ~~~~~~~~~~~-~~~~~~~~~~~~~-~~~~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~ 256 (456) ....+.+.+. ...+......... .+..+|.+++ | .+.+|+|-.+..++.+..+|...-......+....|.. 
T Consensus 226 -~~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 304 (555) T protein:vir:17 226 -CRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAMVEGSAASAKVVF 304 (555) T ss_pred -cccCCeeEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCce Confidence 0000111000 0111110000011 1222333322 2 34689999999999999999887777778888777764 Q ss_pred hhh--cCCCcccccccccchhhhhhhhhhhccceeccCC-CceeEeec-ccchHH---HHHHHHHHHHHHHhhcCCChhh Q lcl|NC_021301. 257 ALK--SAGHGLPKVDENGNAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTP---MLSAIKEHIRQLSSATKTPLPM 329 (456) Q Consensus 257 ~i~--g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~---~~~~l~~~~~~i~~~~~~p~~~ 329 (456) .+. |... .......+.+.+....+ +....++. ..+++. .++.++.-+.......+. T Consensus 305 lv~~~g~~~-------------~~~l~~~~~g~v~~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~~~~---- 367 (555) T protein:vir:17 305 MVSPSATTK-------------PQNLALAANGAIIQGRPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLMLQV---- 367 (555) T ss_pred eeccccccC-------------cceeecCCCceeecCCcccceeeeccccchhhHHHHHHHHHHHHHHHHHhhcCC---- Confidence 431 1111 11111222233322111 12222322 223332 333333333333222211 Q ss_pred hcccccCcHHHHHHHHHHHHHHHH----HHHH-HHHHHHHHHHHHHHHHhcCCC--cccceeEEecCCCCc-----CHHH Q lcl|NC_021301. 330 LMPDSANQSAEGAHNIEKGFLFKC----EDRL-SIAKIGLEAILVKALQIEGES--VEDTVDVSFESPDRV-----TLGE 397 (456) Q Consensus 330 ~~~~~~N~Sg~Al~~~~~~l~~k~----~~~~-~~f~~~l~~~~~l~~~~~~~~--~~~~i~v~f~~~~~~-----~~~e 397 (456) .+..+-+|.-++.....+.+.. .+.+ ..+.+-+.+.+.++.+..-.+ ...-+.+...-++.. +... 
T Consensus 368 --~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~l~r~~~~~~ 445 (555) T protein:vir:17 368 --RQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWGVGRGQDKQQ 445 (555) T ss_pred --CCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHHHHHHHHHHH Confidence 1222334444433333222221 1111 223344444555544321111 111122222211100 1111 Q ss_pred HHHHHHHHHhcCC-------CcH----HHHHHhCCCC-------hhHHHHHHHHHHHHHHHH----Hhhhhhh------- Q lcl|NC_021301. 398 KYAAASLAKAAGE-------SWA----SIRRNILNYN-------ADQIKQDDLDRAREQITL----FAGNSVQ------- 448 (456) Q Consensus 398 ~ad~~~kl~~~g~-------~s~----~t~~~~~~~~-------~~~~~~~e~~~~~ee~~~----~~~~~~~------- 448 (456) ..+.+..+.+.+- +.- ....+.+|++ +++++++..++.++++.. .+..... T Consensus 446 l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~~~~~~~~~~ 525 (555) T protein:vir:17 446 LMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQLAKTPMAEQA 525 (555) T ss_pred HHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhH Confidence 1122222222211 111 1223456764 344443322222211111 1011101 Q ss_pred --hcccccCC Q lcl|NC_021301. 449 --RPQEDGSR 456 (456) Q Consensus 449 --~~~~d~~~ 456 (456) ....++.. T Consensus 526 ~~~~~~~~~~ 535 (555) T protein:vir:17 526 MQLIQQQQEG 535 (555) T ss_pred Hhccccchhh Confidence 11111111 No 245 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=97.15 E-value=0.00015 Score=41.54 Aligned_cols=398 Identities=11% Similarity=0.050 Sum_probs=174.7 Q ss_pred CCCC-CH-HHHHHHH---HHHH---HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhh- Q lcl|NC_021301. 
1 MTAS-TP-AEWLPVL---TKRI---DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI- 71 (456) Q Consensus 1 ~~~~-t~-~~~~~~l---~~~~---~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l- 71 (456) |... +. .-+-+.. -.+. .+...+|+.+..+++=. .+|+..+.=. T Consensus 46 ~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd---------------------------~Av~eIVneai 98 (521) T protein:vir:81 46 DNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNHEVE---------------------------NAVQNIVNDAI 98 (521) T ss_pred CCCcceeecceeeeecccccchhhHHHHHHHHHHHhhccchh---------------------------hHHHHhhccee Confidence 1110 00 0000000 0000 11122223332222222 2222222211 Q ss_pred ----ccCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC---CCceEEEEEccce Q lcl|NC_021301. 72 ----IPNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD---DGTATITADSPET 137 (456) Q Consensus 72 ----~~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~---dg~~~i~~~~p~~ 137 (456) ..+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+.-.|+ +|-..+..++|+. T Consensus 99 v~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~ 178 (521) T protein:vir:81 99 VFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRN 178 (521) T ss_pred EecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEcCCccccceeeeeeCCcc Confidence 12344433211 112234455666667888889999999999999998876553 3556789999998 Q ss_pred eEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--E Q lcl|NC_021301. 138 MVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--V 215 (456) Q Consensus 138 ~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--v 215 (456) +..+..-......- +. ..+......+|.+.... +......... + ....+|. 
| T Consensus 179 i~~vr~i~k~~~~~--~~----v~~~~~e~f~Y~~~~~~-~~~~g~~~~~----------~---------~~vkI~~dAI 232 (521) T protein:vir:81 179 LEYVREIITEDTPE--GK----IYKATKEYFIYTVGNSS-YCAGGQVFSP----------N---------SRVKIPRSAI 232 (521) T ss_pred eeeeeeecccccCc--cc----eecceeeeeeeecCCcc-ccccceeecC----------C---------cceeechhhe Confidence 87665321110000 00 01122333444443211 1100000000 0 0111111 2 Q ss_pred EEc-------cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc----------------- Q lcl|NC_021301. 216 VVY-------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE----------------- 270 (456) Q Consensus 216 v~~-------~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~----------------- 270 (456) ++. ++..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 233 ~y~hSGl~d~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvY 311 (521) T protein:vir:81 233 TYAHSGLMDCDDKYIIGYLHRAVKPANQL-KLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVY 311 (521) T ss_pred eeeeccceeCCCCeeeecchhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEe Confidence 222 2111224444432222222 233344444444333433222111 11221111 Q ss_pred ---ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc---C-cHH Q lcl|NC_021301. 271 ---NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA---N-QSA 339 (456) Q Consensus 271 ---~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~---N-~Sg 339 (456) +|+.-+.......-.. .|. 
.+.+..+..++..+--+-++-++-+-..++...++|.+-++...+ | --| T Consensus 312 Da~TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~ 390 (521) T protein:vir:81 312 DASTGKLKNQQANLSMTED-YWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDG 390 (521) T ss_pred ecccccccccccccchhhh-hcccccCCCcccceeecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceecccc Confidence 1111000000000000 011 112233445554332234444677778889999999888842211 1 122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHH-------HHHHHHH Q lcl|NC_021301. 340 EGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEK-------YAAASLA 405 (456) Q Consensus 340 ~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~-------ad~~~kl 405 (456) ..|--....+..-+.+.+..|..-+.++++.-+.++|... + ..|.+.|.....-.+... ++++..+ T Consensus 391 ~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~ 470 (521) T protein:vir:81 391 SEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERI 470 (521) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHh Confidence 3455566667777889999999999999998877777532 2 247778876544333333 2333333 Q ss_pred Hh-cC-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 406 KA-AG-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 406 ~~-~g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) .. .| .+|.++++ .+|.+++++++++ .+.+++|...-.-....+..++. 
T Consensus 471 dpyvGky~s~dyi~k~ILr~tDeei~~~-~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 471 TPYIGKYFSNQTVMRDILKYTDDQMDTE-KKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred hhhhccccchHHHHHHHhccCHHHHHHH-HHHHHHHhhCCCCCCCcccccCC Confidence 21 13 46888876 5789999888764 44455554322212222222233 No 246 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=97.13 E-value=0.00016 Score=41.41 Aligned_cols=422 Identities=13% Similarity=0.091 Sum_probs=178.8 Q ss_pred CCCCCHHHH--HHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhhc----- Q lcl|NC_021301. 1 MTASTPAEW--LPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRII----- 72 (456) Q Consensus 1 ~~~~t~~~~--~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l~----- 72 (456) -|.-+|... ...+.. ..||-.-.++.. ..+...++-...+ .+.+.=+-.+|+..+.=.+ T Consensus 19 ~s~~~~~~~dg~~~i~~------------~~~~~~~~~~e~-~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~ 85 (533) T protein:vir:10 19 PSFVQKDNLDGSQPVSG------------GGYYGYTVDFDG-QVRNEYQLISRYREMVLQPECDSAVDDIVNETICGNFD 85 (533) T ss_pred CCCCCCCcccccceeec------------ccccceeeeccc-ccchHHHHHHHHHHHhhccchhhHHHHhhcceeeecCC Confidence 111111100 000000 012211111110 0011111111111 1223334444444444322 Q ss_pred cCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC----CCceEEEEEccceeEEE Q lcl|NC_021301. 73 PNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD----DGTATITADSPETMVVS 141 (456) Q Consensus 73 ~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~----dg~~~i~~~~p~~~~~~ 141 (456) .+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+.-.|. 
+|-..++.+||+.+-.+ T Consensus 86 ~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr~i~~v 165 (533) T protein:vir:10 86 DVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPRKIRKI 165 (533) T ss_pred CceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeeccccceeee Confidence 2344443221 111233455566667888889999999999999999875553 36677999999988776 Q ss_pred EeCC--CCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEcc Q lcl|NC_021301. 142 VDPL--QPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQ 219 (456) Q Consensus 142 ~d~~--~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~ 219 (456) ..-. ....... ...-....++.....+|.+.+.+. ......++.. +. +. ..|. |. +..+ T Consensus 166 r~i~~~~~~~~~~-~~~~~~v~~~~~eyf~Ynp~g~~~--------~~~~~vkI~~--dA-I~---y~hS-Gl---~d~~ 226 (533) T protein:vir:10 166 NETEQKRPEQLRG-LPLNQQLSPKSAEYFLYDPKGLKN--------STTQGLKIAP--DS-IC---YVHS-GI---MDLN 226 (533) T ss_pred eeeeccCCCccce-eecchhhhccceeeeeeccccccc--------cCCCceecch--hh-ee---eeec-cc---eeCC Confidence 5211 1110000 000001112222234444432210 0000000000 00 00 0010 00 1112 Q ss_pred CCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Ccccccccccchhhhhhh------hhhhccc----- Q lcl|NC_021301. 220 NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDENGNAIDYASI------FEAAPGA----- 287 (456) Q Consensus 220 n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~~~~------~~~~~~~----- 287 (456) +..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+.-=..+.+.. ..+..|. 
T Consensus 227 ~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddr 305 (533) T protein:vir:10 227 KNMTLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDK 305 (533) T ss_pred CCceeccchHhHHHHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccc Confidence 222224444432222222 233344444444333332221111 112211110000000000 0000000 Q ss_pred --------eec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-Cc-HHHHHHHHHHHHHHHH Q lcl|NC_021301. 288 --------LWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGAHNIEKGFLFKC 353 (456) Q Consensus 288 --------~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N~-Sg~Al~~~~~~l~~k~ 353 (456) .|. .+.+..+..|+...--+-++-++-+-..++...++|.+-++...+ |. -|..|--....+..-+ T Consensus 306 k~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI 385 (533) T protein:vir:10 306 KFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFV 385 (533) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHH Confidence 011 112233444554432233444677777888899999888764422 21 2234666666777788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------HHHHHHHh-cC-CCcHHHHH Q lcl|NC_021301. 354 EDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------AAASLAKA-AG-ESWASIRR 417 (456) Q Consensus 354 ~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d~~~kl~~-~g-~~s~~t~~ 417 (456) .+.+..|..-|.++++.-+.++|.-. + ..|.+.|.....-.+...+ +++..+.. 
.| .+|.++++ T Consensus 386 ~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~ 465 (533) T protein:vir:10 386 ARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMR 465 (533) T ss_pred HHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHH Confidence 89999999999999998877777521 1 3477788765433333332 33332211 13 56888876 Q ss_pred -HhCCCChhHHHHHHHHHHHHHHHH----------HhhhhhhhcccccCC Q lcl|NC_021301. 418 -NILNYNADQIKQDDLDRAREQITL----------FAGNSVQRPQEDGSR 456 (456) Q Consensus 418 -~~~~~~~~~~~~~e~~~~~ee~~~----------~~~~~~~~~~~d~~~ 456 (456) .+|.+++++++++.. .+++|... ........|+.+|.- T Consensus 466 k~ILr~tDeei~~~~k-qI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~ 514 (533) T protein:vir:10 466 RQVLKQTDVEMKEIDK-QIESEMESGIIADPAAEMDPAMAAGDPDAGGAP 514 (533) T ss_pred HHHhccCHHHHHHHHH-HHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcc Confidence 578999988876433 33333221 111122223333333 No 247 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=97.13 E-value=0.00016 Score=41.40 Aligned_cols=347 Identities=9% Similarity=-0.063 Sum_probs=143.6 Q ss_pred HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCCC------cc---cHHHHHH Q lcl|NC_021301. 21 MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGSA------DS---DLALRAR 91 (456) Q Consensus 21 ~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~~------d~---~~~~~l~ 91 (456) ..=+.++..+-.+.... .......-...... ........+|+.+++-+..-|+.+-... +. -....+. 
T Consensus 1 Mg~f~~~~~f~~~~~~~-~~~~~~~~~~~~~~--~~~~~v~~~i~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:93 1 MNLFGKVVSFSRGKLNN-DTQRVTAWQNEAVE--YTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred CccchhhhhhhccccCC-CcceeeecccchhH--HHHHHHHHHHHHHHhhhhhCceeeEEEcccccccccccccccchHH Confidence 11111111111111000 00000000000000 1123456677888888777776541110 00 1122355 Q ss_pred HHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeC-CCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceE Q lcl|NC_021301. 92 RIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRR-DDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESD 165 (456) Q Consensus 92 ~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d-~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~ 165 (456) ++++. |. ...+...++...+.+|.||+++-.+ ..|++.. +| . .+... T Consensus 78 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~i~~~~~~~~g~~~~----------l~---------------~--~~~~~ 130 (378) T protein:vir:93 78 EVLNWSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLD----------LL---------------F--ADDKK 130 (378) T ss_pred HHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEE----------EE---------------e--cCCee Confidence 55542 32 2355667888999999999875433 2222111 00 0 00000 Q ss_pred EEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCC-CCCcHhHHHHHHHHHHHHHHHH Q lcl|NC_021301. 166 FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPD-GMGEVEPHIDIINRINRAELQL 244 (456) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~-g~s~~~~v~~liDa~~~~~s~~ 244 (456) .|..+.+.+ +.++. +.....++..+..+++..++ T Consensus 131 ---~~~~~diih----------------------------------------~r~~~~~~~~~s~l~~~~~~i~~~~~-- 165 (378) T protein:vir:93 131 ---EYKTEELVR----------------------------------------LTSPFYINEDTSILDNALASIQTKLE-- 165 (378) T ss_pred ---EeccceeEE----------------------------------------ecCccccchhhHHHHHHHHHHHHHHh-- Confidence 011112211 12221 11112223233333332221 Q ss_pred HHHHHHhhchhhhhhcCCCcccccccc-cchhhhh-hhhh-----hhccceeccCCCceeEeecccchHHHHHHHHHHHH Q lcl|NC_021301. 
245 LSTMAIQAFRQRALKSAGHGLPKVDEN-GNAIDYA-SIFE-----AAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIR 317 (456) Q Consensus 245 ~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~~~~~-~~~~-----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~ 317 (456) .+.+.-+++. ... ..++. ....... ..++ ...+.+..++.+.++.+++..+...-.+.++.... T Consensus 166 ------~~~~~g~l~~-~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~~~~~l~~~~~~~~~~~~~~~~~ 236 (378) T protein:vir:93 166 ------QGKLRGLLKI-NAF--LDIDNTQEYREKALTTIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIDLIKS 236 (378) T ss_pred ------cCcccceeee-CCc--CCHHHHHHHHHHHHHHHHHhhcccccccceEcCCCceEEEccCChhhhhHHHHHHHHH Confidence 1122222211 110 01111 1111111 1111 12345677788889888764333333355677888 Q ss_pred HHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc--cceeEEecCCCCcCH Q lcl|NC_021301. 318 QLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVE--DTVDVSFESPDRVTL 395 (456) Q Consensus 318 ~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~--~~i~v~f~~~~~~~~ 395 (456) +|+.+-|+|+..+.+..+ ....+.+....|.-.+...+..+...| +.-.-.-.+.... ..+++.+..-.-.|. T Consensus 237 ~Ia~~fgVPp~~l~g~~~--e~~~~~f~~~tl~P~~~~ie~~l~~kL---l~~~er~~~~~~~~~~~~~fd~~~l~~~d~ 311 (378) T protein:vir:93 237 ELLTGYFMNENILLGTAT--QEQQIYFYNSTIIPLLIQLEKELTYKL---ISTNRRRVVKGNLYYERIIVDNQLFKFATL 311 (378) T ss_pred HHHHHhCCCHHHhcCCcH--HHHHHHHHHHHHHHHHHHHHHHHHhhc---CChhHhhhhhhcccccceeeccchhhhcCH Confidence 999999999988854221 111122211222222222111111111 0000001111112 234444555677899 Q ss_pred HHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHHH-----HHHHHHHHHHH-HHhhhhhhhcccccCC Q lcl|NC_021301. 396 GEKYAAASLAKAAGESWASIRRNILNYNADQIKQ-----DDLDRAREQIT-LFAGNSVQRPQEDGSR 456 (456) Q Consensus 396 ~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~~-----~e~~~~~ee~~-~~~~~~~~~~~~d~~~ 456 (456) .+.++++.++..+|+++.--+++.+|+.|.+--. .....+..... 
..........+|+.+| T Consensus 312 ~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 312 KELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccccccccchhhhcCccCCCCCCCCCCCC Confidence 9999999999999999999999999987642210 01111111100 1122233344566666 No 248 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=97.09 E-value=0.00018 Score=41.17 Aligned_cols=406 Identities=11% Similarity=0.038 Sum_probs=174.8 Q ss_pred CCC---CCH----HHHHHHHHH---HH---HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHH Q lcl|NC_021301. 1 MTA---STP----AEWLPVLTK---RI---DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSV 67 (456) Q Consensus 1 ~~~---~t~----~~~~~~l~~---~~---~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~ 67 (456) +.+ .+| .-+.+.... +. .+...+|+.+..++ =+-.+|+.. T Consensus 41 ~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma~~p---------------------------Evd~Av~eI 93 (521) T protein:vir:65 41 EVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLMNNH---------------------------EVENAVQNI 93 (521) T ss_pred eecccCCccccccccceeeeccccchhhhHHHHHHHHHHHhhcc---------------------------chhhHHHHh Confidence 110 000 000000000 00 11112222222222 222222222 Q ss_pred Hhhh-----ccCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC---CCceEEEE Q lcl|NC_021301. 68 ADRI-----IPNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD---DGTATITA 132 (456) Q Consensus 68 a~~l-----~~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~---dg~~~i~~ 132 (456) +.=. ..+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+.-.|+ +|-..++. 
T Consensus 94 Vneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~ELr~ 173 (521) T protein:vir:65 94 VNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQ 173 (521) T ss_pred hcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCccccceeeee Confidence 2211 12344443221 112234455666667888889999999999999998876553 35567899 Q ss_pred EccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCC-eEEEEEEeeeecccccceeeccCCCceeecccccccCc Q lcl|NC_021301. 133 DSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGD-GWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGS 211 (456) Q Consensus 133 ~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (456) ++|+.+..+..-......- + ...+......+|.++ ..+.+...... ......++... +. ..|. | T Consensus 174 lDPr~i~~vr~i~k~~~~~--~----~v~~~~~e~f~Y~~~~~~~~~~g~~~~--~~~~vkI~~dA---I~---y~hS-G 238 (521) T protein:vir:65 174 LDPRNLEYVREIITEDTPE--G----KIYKATKEYFIYTVGNSSYCAGGQVFS--PNSRVKIPRSA---IT---YAHS-G 238 (521) T ss_pred eCCcceeeeeeecccccCC--c----ceecceeeeeeeecCCcceeccceeec--CCcceeechhh---ee---eeec-c Confidence 9999887665321111000 0 001122233444431 11211111000 00000000000 00 0010 1 Q ss_pred eeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc-------------------- Q lcl|NC_021301. 212 PPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE-------------------- 270 (456) Q Consensus 212 ~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~-------------------- 270 (456) . 
+..++..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 239 l---~d~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~ 314 (521) T protein:vir:65 239 L---MDCDDKYIIGYLHRAVKPANQL-KLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDAS 314 (521) T ss_pred c---eeCCCCeeeecchhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecc Confidence 0 1112222224444432222222 233444444444333433222111 11221111 Q ss_pred ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc---C-cHHHHH Q lcl|NC_021301. 271 NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA---N-QSAEGA 342 (456) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~---N-~Sg~Al 342 (456) +|+.-+.......-.. .|. .+.+..+..++..+--+-++-++-+-..++...++|.+-++.... | --|..| T Consensus 315 TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EI 393 (521) T protein:vir:65 315 TGKLKNQQANLSMTED-YWLQRRDGKAITDVTTLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEI 393 (521) T ss_pred cccccccccccchhhh-hcccccCCCCccceeecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchh Confidence 1111000000000000 011 112233444554332234444677777888899999877632211 1 123345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHH-------HHHHHHHHh- Q lcl|NC_021301. 343 HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEK-------YAAASLAKA- 407 (456) Q Consensus 343 ~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~-------ad~~~kl~~- 407 (456) --....+..-+.+.+..|..-+.++++.-+.++|... + ..|.+.|.....-.+... ++++..+.. 
T Consensus 394 tRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpy 473 (521) T protein:vir:65 394 TRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPY 473 (521) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhh Confidence 5666677778889999999999999998877777532 2 247778876543333333 333333321 Q ss_pred cC-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 408 AG-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 408 ~g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) .| .+|.++++ .+|.+++++++++ .+.+++|...-.-....+..++. T Consensus 474 vGky~S~dyi~k~ILr~tDeei~~~-~k~I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 474 IGKYFSNQTVMRDILKYTDDQMDTE-KKQIEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred hccccchHHHHHHHhccCHHHHHHH-HHHHHHhhhCCCCCCCcccccCC Confidence 13 56888876 5789999888764 44455554322211222222333 No 249 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=97.03 E-value=0.0002 Score=40.85 Aligned_cols=424 Identities=12% Similarity=0.037 Sum_probs=174.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc------CC-e Q lcl|NC_021301. 4 STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP------NG-I 76 (456) Q Consensus 4 ~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~------~~-~ 76 (456) ++-....+.|..+......+++.+.+|.--...... + .... 
.+...+++--+-+...++.+++.|.+ .| | T Consensus 1 m~~~~r~~~L~~~R~~~e~~w~e~~~~tlP~~~~~~-~-~~~~-~~~~~~~~~dstg~~a~~~LAa~l~~~ltpp~~~WF 77 (522) T protein:vir:10 1 MKARERYNQLTTARQMFLDKAVECSELTLPYLIDDD-I-SSRP-NHKSLTVPWQSVGAKCCVTLAAKLMLAVLPPQTSFF 77 (522) T ss_pred CchHHHHHHHHHHhhHHHHHHHHHHHHhhhcccCCC-C-CCCc-ccccccccccchHHHHHHHHHHHHHHhhcCCCCccc Confidence 445566677776666667777888888643211110 0 0111 11122235556777888888887753 23 3 Q ss_pred ecCCCCc-------ccH----HH-------HHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEcccee Q lcl|NC_021301. 77 TVGGSAD-------SDL----AL-------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETM 138 (456) Q Consensus 77 ~~~~~~d-------~~~----~~-------~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~ 138 (456) ++....+ .+. .. .+...+..++|.....++.++..++|.+.+++ ++++ +++++-.+. T Consensus 78 ~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~~~~---~~~~pl~~y 152 (522) T protein:vir:10 78 KLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNALIFM--GKDG---LKTFPLTRY 152 (522) T ss_pred cccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCceeEEE--cCCC---ceEEEcceE Confidence 3432111 001 11 13345667889999999999999999987654 5554 455555555 Q ss_pred EEEEeCCCCceEEEEEEEEEec-------CCce---E-EEEEEcCCeEEEEEEeeeecccccceeec-cCCCceeecccc Q lcl|NC_021301. 139 VVSVDPLQPWRIRSAMRWWRDL-------DAES---D-FAIVWSGDGWQKFARPCFVQSSSRRRLVT-RISDSWVPVGDA 206 (456) Q Consensus 139 ~~~~d~~~~~~~~~~~~~~~~~-------d~~~---~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 206 (456) ++.-| ..++ +...++.++-. .+.. . ...-..++..+.+....+...+.+.+... ...+........ 
T Consensus 153 ~v~~d-~~G~-vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~~~~~~~~~~~~~~~~~~~s 230 (522) T protein:vir:10 153 VINRD-GDGN-VLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSSGRWVWHQEAFDKIIPDSRS 230 (522) T ss_pred EEeeC-CCCC-eeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccCCceEEEEccCCcccccccc Confidence 44444 3343 34344433210 0100 0 00001111111111111111111111111 011111111111 Q ss_pred cccCceeE-EEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhh--cCCCcccccccccchhhhh Q lcl|NC_021301. 207 VVTGSPPP-VVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALK--SAGHGLPKVDENGNAIDYA 278 (456) Q Consensus 207 ~~~~~~~p-vv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~--g~~~~~~~~~~~~~~~~~~ 278 (456) ..++.-+| +++ | .+.+|+|-.+..++-+..++...-......+....|...+. |.... . T Consensus 231 ~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~lv~~~~~~~~-------------~ 297 (522) T protein:vir:10 231 TAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVFLVSPSSTTKP-------------A 297 (522) T ss_pred ccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeecccccccc-------------c Confidence 11222223 222 2 34689999999999999998877777777777777654431 21111 1 Q ss_pred hhhhhhccceeccC-CCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHH--- Q lcl|NC_021301. 279 SIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKC--- 353 (456) Q Consensus 279 ~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~--- 353 (456) .....+.+.+.... .+....++. ..++......++.+...|....-+-. ..+..+-+|.-++.....+.+.. 
T Consensus 298 ~l~~~~~~~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~~~---~~d~~rvTAtEV~~r~~E~~~~LGpv 374 (522) T protein:vir:10 298 TIAKAGNGAIVQGRPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLVMN---VRNAERVTAEEVRLTQLELEQQLGGI 374 (522) T ss_pred cccCCCCcceecCCCccceeecccccccchHHHHHHHHHHHHHHHHHhhcc---CCCCCCCCHHHHHHHHHHHHHHhhHH Confidence 11122223222211 122233332 23444444444444444443211100 11222335555544433332221 Q ss_pred -HHH-HHHHHHHHHHHHHHHHHhcCC----Ccc--cceeEEecCCCCcCHHHHHHHH----HHHHh-cCC------CcHH Q lcl|NC_021301. 354 -EDR-LSIAKIGLEAILVKALQIEGE----SVE--DTVDVSFESPDRVTLGEKYAAA----SLAKA-AGE------SWAS 414 (456) Q Consensus 354 -~~~-~~~f~~~l~~~~~l~~~~~~~----~~~--~~i~v~f~~~~~~~~~e~ad~~----~kl~~-~g~------~s~~ 414 (456) .+. ...+.+-+.+.+.++.+ .|. +++ ....+++..++-+ ++.++.+ ..+.+ +|- +.-. T Consensus 375 ~~rl~~E~l~Pli~r~~~il~r-~g~lP~~p~~~~~~~~v~~is~Lar--aq~~~~l~~~~~~i~~~~~p~~~~~~id~d 451 (522) T protein:vir:10 375 FSLLVIEFLIPYLNRTLLVLQR-SNQIPKLPKDIVRPTIVAGVNALGR--GQDRESLTAFVGTIAQTLGPEALMQYLNPL 451 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHh-cCCCCCCCccccccccccchhHHHH--HHHHHHHHHHHHHHHHhhCchhhhhcCCHH Confidence 122 23334445555555443 221 111 1122344433321 1111111 11111 111 1111 Q ss_pred H----HHHhCCCC-------hhHHHHHHHHHHHHHH----HHHhhhhhhhcccccCC Q lcl|NC_021301. 415 I----RRNILNYN-------ADQIKQDDLDRAREQI----TLFAGNSVQRPQEDGSR 456 (456) Q Consensus 415 t----~~~~~~~~-------~~~~~~~e~~~~~ee~----~~~~~~~~~~~~~d~~~ 456 (456) . 
....+|++ ++++++++.++++.++ ...++.....+-.++.+ T Consensus 452 ~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~ 508 (522) T protein:vir:10 452 EAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTK 508 (522) T ss_pred HHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccc Confidence 1 12234543 3444443333332221 11222222333444444 No 250 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=97.03 E-value=0.0002 Score=40.83 Aligned_cols=417 Identities=12% Similarity=0.070 Sum_probs=180.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhh-----ccC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRI-----IPN 74 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l-----~~~ 74 (456) +.+..-.+=...+-.......+.-.....|+.+...++. . .++-+..+ .+.+.=+-.+|+..+.=. ..+ T Consensus 31 ~~~p~~~dGa~~I~~~~~~~~~~~~~~~~~~~~~~~~~n----~-~eLI~~YR~ma~~pEvd~Av~eIvneaiv~d~~~~ 105 (521) T protein:vir:10 31 FAVPDTADGAIEVDKQIDTTAPKTAIVQSVLGYAPKIQN----T-KDLINQYRSLSKYHEVDNAIDEIINDAIVQEDNRD 105 (521) T ss_pred cccccCCCCceeeccCCCccccccchhhhhhccccccch----H-HHHHHHHHHHhhccchhhHHHhhhcceEEecCCCc Confidence 111100000000000000000000111122222221110 0 01111111 112223333344333322 123 Q ss_pred CeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC----CCCceEEEEEccceeEEEEe Q lcl|NC_021301. 75 GITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVVSVD 143 (456) Q Consensus 75 ~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d----~dg~~~i~~~~p~~~~~~~d 143 (456) ||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+.-.| .+|-..+..++|+.+-.+.. 
T Consensus 106 pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~lDPr~i~~vr~ 185 (521) T protein:vir:10 106 TVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRLLDPRNVEYYRV 185 (521) T ss_pred eEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCCCccccceeeeeeCCcceeeeee Confidence 44443211 11123345556666778888999999999999999987554 34667789999998755542 Q ss_pred CCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EEEc--- Q lcl|NC_021301. 144 PLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVVY--- 218 (456) Q Consensus 144 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~~--- 218 (456) -.... ...+. .-+......+|.+.....+... + .......+|. |++. T Consensus 186 i~k~~--~~~~~----v~~~~~e~f~Y~~~~~~~~~~~----------------g------~~~~~vkI~~daI~y~hSG 237 (521) T protein:vir:10 186 NLKSN--ENGND----VYKGVKEFFTYGATEDNRYNIS----------------G------NSNNLVQIPIDAIVYSHSG 237 (521) T ss_pred ecCCC--CCcch----hhccceeeeeeccCCCceecCC----------------C------CCCcceeechhheeeeccc Confidence 11110 00000 0011223344443221111100 0 0011111111 2222 Q ss_pred ----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc--------------------ccc Q lcl|NC_021301. 219 ----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE--------------------NGN 273 (456) Q Consensus 219 ----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~--------------------~~~ 273 (456) +.....|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ +|. T Consensus 238 L~d~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGe 316 (521) T protein:vir:10 238 KVDIDGKTIVGYLHNVIKPANQL-KMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGK 316 (521) T ss_pred ceeCCCCceeccchhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCce Confidence 2334455555433322222 233444444444333433222111 11221111 111 Q ss_pred hhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc----CcHHHHHHHH Q lcl|NC_021301. 
274 AIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA----NQSAEGAHNI 345 (456) Q Consensus 274 ~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~----N~Sg~Al~~~ 345 (456) ..+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-++...+ +-|+ .|--. T Consensus 317 v~ddrk~msMlED-yWLpRReGgrgTEI~TLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~-EItRD 394 (521) T protein:vir:10 317 VKNSSNNLAMTED-YWLMRRDGKATTEVSTLPGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGN-DITRD 394 (521) T ss_pred eccchhhhhhHhh-hcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceeccccc-chhHH Confidence 1000000000000 011 112233444554432234444677777888899999887754321 2233 36666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHH-------HHHHHHHHHh---c Q lcl|NC_021301. 346 EKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGE-------KYAAASLAKA---A 408 (456) Q Consensus 346 ~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e-------~ad~~~kl~~---~ 408 (456) ...+..-+.+.+..|..-+.++++.-+.++|.-. + ..|.+.|.....-.+.. .++++.++.. . T Consensus 395 EikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yv 474 (521) T protein:vir:10 395 ELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVT 474 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCcccc Confidence 6677778889999999999999998877777521 1 34677786654333332 2344444432 2 Q ss_pred C-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 409 G-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 409 g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) | .+|.++++ .+|.+++++++++ .+.+++|...-.-...+++.+|. 
T Consensus 475 Gky~s~dyi~k~ILr~tDeeik~~-~k~I~~E~~~~~~~~p~~e~~df 521 (521) T protein:vir:10 475 GKYLSHEYVMKNILRMSDEDIKTE-REKIDGELKDSVYKNPEDPMEEF 521 (521) T ss_pred ccccchHHHHHHHhcCCHhHHHHH-HHHHHHhhhCCCCCCCcchhhcC Confidence 3 67888875 5789998887754 34455554322111222333344 No 251 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=97.00 E-value=0.00022 Score=40.67 Aligned_cols=398 Identities=10% Similarity=0.043 Sum_probs=174.8 Q ss_pred CCCCC----H-HHHHHH----HHHHHHH---HHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHH Q lcl|NC_021301. 1 MTAST----P-AEWLPV----LTKRIDD---GMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVA 68 (456) Q Consensus 1 ~~~~t----~-~~~~~~----l~~~~~~---~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a 68 (456) |.... + .-+.+. +...... ...+|+.+.. ++=+-.+|+..+ T Consensus 44 ~I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~---------------------------~pEvd~Av~eIV 96 (524) T protein:vir:10 44 EIETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMN---------------------------NYEVDNAVQEIV 96 (524) T ss_pred eeccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhh---------------------------ccchhhHHHHhh Confidence 11100 0 001010 0011111 1111222211 222222333333 Q ss_pred hhh-----ccCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC----CCCceEEEE Q lcl|NC_021301. 69 DRI-----IPNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITA 132 (456) Q Consensus 69 ~~l-----~~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d----~dg~~~i~~ 132 (456) .=. ..+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+.-.| .+|-..++. 
T Consensus 97 neaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~ 176 (524) T protein:vir:10 97 SDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKDGVQELRR 176 (524) T ss_pred cceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCccccceeeee Confidence 221 12344443211 11123445566666788888999999999999999987554 346677899 Q ss_pred EccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCce Q lcl|NC_021301. 133 DSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSP 212 (456) Q Consensus 133 ~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (456) ++|+.+-.+..-... ....+.. .. ......+|.+..- .|.... .. ...+ ....+ T Consensus 177 lDPr~i~~vr~i~~~--~~~~~~v---i~-~~~e~f~Y~~~~~-~~~~~~---------~~-~~~~---------~~ikI 230 (524) T protein:vir:10 177 LDPRQVQYIREIVTR--MEDGVKI---VD-GYREFFVYDTGHE-SYCADG---------RI-YSAG---------TKVKI 230 (524) T ss_pred eCCccceeeeeeccc--Ccccchh---hc-chhhheeecCCCc-ccccCc---------ce-ecCC---------cceec Confidence 999987555321100 0000000 01 1112233332110 000000 00 0001 11111 Q ss_pred eE--EEEccC-------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc------------ Q lcl|NC_021301. 213 PP--VVVYQN-------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE------------ 270 (456) Q Consensus 213 ~p--vv~~~n-------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~------------ 270 (456) |. |++.+. 
..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 231 ~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~k 309 (524) T protein:vir:10 231 PRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQL-KLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMK 309 (524) T ss_pred chhheeeeccCcccCCCCceeccchHhhHHHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcC Confidence 11 333221 11124444332222222 233344444344333332221111 11221111 Q ss_pred --------ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc---c Q lcl|NC_021301. 271 --------NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS---A 335 (456) Q Consensus 271 --------~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~---~ 335 (456) +|...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-++... - T Consensus 310 NKlvYDa~TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f 388 (524) T protein:vir:10 310 NRVVYDASTGKIKNQQHNMSMTED-YWLQRRDGKAVTEVDTMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGV 388 (524) T ss_pred ceeEEeccCCeeccchhhhhhHhh-hcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccc Confidence 1111000000000000 011 11223344455443223444467777788899999988884321 1 Q ss_pred Cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------H Q lcl|NC_021301. 336 NQ-SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------A 400 (456) Q Consensus 336 N~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d 400 (456) |. -|..|--....+..-+.+.+..|..-+.++++.-+.++|.-. 
+ ..|.+.|.....-.+...+ + T Consensus 389 ~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 468 (524) T protein:vir:10 389 MFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRIN 468 (524) T ss_pred cccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHH Confidence 11 223455566667777889999999999999998877777521 1 3477788765444433332 3 Q ss_pred HHHHHHh-cC-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 401 AASLAKA-AG-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 401 ~~~kl~~-~g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) ++..+.. .| .+|.++++ .+|.+++++++++ .+.+++|...-.-....++.+|. T Consensus 469 ~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~-~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 469 MLTMAEPFIGKYISHQTAMKDFLQMTDEEINQE-AKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred HHHHhhhhhcccchhHHHHHHHhccCHHHHHHH-HHHHHHHhhcCCCCCCChhhhcC Confidence 3333321 13 46888875 5789999888764 44555554433333333344444 No 252 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=96.98 E-value=0.00023 Score=40.56 Aligned_cols=294 Identities=12% Similarity=-0.002 Sum_probs=118.8 Q ss_pred EEEEEeeCCCCceEE---EEEccceeE-EEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccce Q lcl|NC_021301. 116 SYLTCWRRDDGTATI---TADSPETMV-VSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRR 191 (456) Q Consensus 116 a~~~v~~d~dg~~~i---~~~~p~~~~-~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 191 (456) .++++|.-.+|...+ ...+|+.+. ..+++. +.......... 
T Consensus 1 v~Eivw~~~~g~~~~~~l~~r~~~~~~~f~~~~~----------------~~l~~~~~~~~------------------- 45 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWRPPRTISRFDVAPD----------------GGLVAIEQWGV------------------- 45 (355) T ss_pred CeEEEEEeeCCeEEEeeeeecCccceeeeeeccC----------------CceeEEEecCC------------------- Confidence 777888766665432 333333211 112211 11110000000 Q ss_pred eeccCCCceeecccccccCceeEEEE---ccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccc Q lcl|NC_021301. 192 LVTRISDSWVPVGDAVVTGSPPPVVV---YQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKV 268 (456) Q Consensus 192 ~~~~~~~~~~~~~~~~~~~~~~pvv~---~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~ 268 (456) .+. .+..-+..+++...+ ..|+.|.|.+..+.-..--=+..+.+.+..++.++.|+.+.+|-....... T Consensus 46 ----~g~----~~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~ 117 (355) T protein:vir:78 46 ----FGK----ATVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIA 117 (355) T ss_pred ----CCC----CcceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCccc Confidence 000 000011112221111 356789998887543322223346677778888877777777643221111 Q ss_pred cccc----------c-hhhhhhhhhhhccceeccCCCceeEeecc-cchHHHHHHHHHHHHHHHhhcCCChhhhc---cc Q lcl|NC_021301. 269 DENG----------N-AIDYASIFEAAPGALWELPPGVDIWESQT-NDFTPMLSAIKEHIRQLSSATKTPLPMLM---PD 333 (456) Q Consensus 269 ~~~~----------~-~~~~~~~~~~~~~~~~~~~~d~~~~~~~~-~~~~~~~~~l~~~~~~i~~~~~~p~~~~~---~~ 333 (456) ++.. + ..........+......++.+.++.-+.. .....|...++.+-.+|+.+.--.....+ +. 
T Consensus 118 ~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~~g 197 (355) T protein:vir:78 118 RDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAHFLTLGGDKST 197 (355) T ss_pred chhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHHhhhhhccccCCcc Confidence 1110 0 01111111223223445666666543321 12223555566666666654421111111 11 Q ss_pred ccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHHHhcCCC- Q lcl|NC_021301. 334 SANQSAEGAHNIEKGFLFKCEDRLSIAKIGLE-AILVKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLAKAAGES- 411 (456) Q Consensus 334 ~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~-~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl~~~g~~- 411 (456) ++++.|+. ...-....++.-.+.+...+. ++++-++.+.......--.++|.. .+.+..+.++.+.+|..+|+. T Consensus 198 GS~Alg~v---h~~v~~~~~~aD~~~i~~~ln~~li~~l~~lN~~~~~~~P~~~~~~-~~~~~~~~a~~~~~l~~~G~~~ 273 (355) T protein:vir:78 198 GSYALGDT---FASFFTGSLNAVMKHIADVTQQHVVEDLVDQNWGPEEPAPRLVPAQ-LGKEQPVTAEAIRALVECGAFT 273 (355) T ss_pred chhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCEEEecC-cChhHHHHHHHHHHHHhCCCcc Confidence 11221221 111222233333355556664 466666655532222223566754 345666789999999999964 Q ss_pred cH----HHHHHhCCCChhHHHHHHH----HHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 412 WA----SIRRNILNYNADQIKQDDL----DRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 412 s~----~t~~~~~~~~~~~~~~~e~----~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +. ..+++.+|+.+.+..+.+. 
+..................+..++ T Consensus 274 ~~~~~~~~~~e~~gip~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 326 (355) T protein:vir:78 274 ADPELEKDLRARYGLPAPAERDDGADAAAAKAAGRRRAKRLPGQRQGAALPSR 326 (355) T ss_pred ccHHHHHHHHHHhCCCCCCCCCcccCCccccccccccccccCCcccccccccc Confidence 32 3467788874221110000 000000000000000000111111 No 253 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=96.87 E-value=0.00029 Score=40.01 Aligned_cols=421 Identities=12% Similarity=0.034 Sum_probs=165.2 Q ss_pred CCCCCHHHHH---HHHHHHHHH---H----HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAEWL---PVLTKRIDD---G----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~~~---~~l~~~~~~---~----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~ 70 (456) |-.++.+++- +.|.+.++. + ..+++.+.+|.--. . .+. +..+....++--+-+...++.+++. T Consensus 1 ~~~~~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~---~-~~~---~~~~~~~~~~~dstg~~a~~~LAa~ 73 (516) T protein:vir:10 1 MKQSTDLEYGGKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPY---L-MND---KGDNETSQNGWQGVGAQATNHLANK 73 (516) T ss_pred CCchhhHhhhhHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc---c-cCC---CCCcccccccccchHHHHHHHHHHH Confidence 8777766432 444444432 2 33445555554442 1 010 0111111123345667778888877 Q ss_pred hcc------CC-eecCCCCcc--------cHHH-----------HHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC Q lcl|NC_021301. 71 IIP------NG-ITVGGSADS--------DLAL-----------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD 124 (456) Q Consensus 71 l~~------~~-~~~~~~~d~--------~~~~-----------~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~ 124 (456) |.+ .| |++...+.. .... .+...+..++|.....++.++...+|.+. 
+|.++ T Consensus 74 l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~d~ 151 (516) T protein:vir:10 74 LAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM--LYKPS 151 (516) T ss_pred HHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe--EEecC Confidence 753 23 433322110 1111 13345667889999999999999999985 45566 Q ss_pred CCceEEEEEccceeEEEEeCCCCceEEEEEEEEEec-------CCceEE----EEEEcCCeEEEEEEeeeecccccceee Q lcl|NC_021301. 125 DGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDL-------DAESDF----AIVWSGDGWQKFARPCFVQSSSRRRLV 193 (456) Q Consensus 125 dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (456) ++. ++.++-.+.++.-| ..++ +...++..+-. -.+... ..-.-++....+..+.... ...++.. T Consensus 152 ~~~--~~~~pl~~y~v~~d-~~G~-v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~~-~~~~~~~ 226 (516) T protein:vir:10 152 KGA--ISAIPMHHYVVNRD-TNGD-LLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKYL-GEGFWEL 226 (516) T ss_pred CCC--eEEEEcCeEEEeeC-CCCC-eEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEec-CCCceEE Confidence 654 44554455444444 4443 33333332200 000000 0000011111111111111 1111211 Q ss_pred ccCCCceeecccccc-cCceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc Q lcl|NC_021301. 194 TRISDSWVPVGDAVV-TGSPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPK 267 (456) Q Consensus 194 ~~~~~~~~~~~~~~~-~~~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~ 267 (456) ....+.......... +..+|.+++ | .+.+|+|-.+..++-+..+|...-...........|...+. 
T Consensus 227 ~~~~d~~~~~~~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~-------- 298 (516) T protein:vir:10 227 KQSADDIPVGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIR-------- 298 (516) T ss_pred EEeeCceeeccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCcccC-------- Confidence 111111111111111 122333322 2 34689999998888888888666666555655555433321 Q ss_pred cccccchhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHH Q lcl|NC_021301. 268 VDENGNAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNI 345 (456) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~ 345 (456) .+|. .........+.|.+....+ +....++. .++++.-.+.++.+...|....-+.. ..-.+..+-+|.-++ T Consensus 299 --p~g~-~~~~~l~~~~~g~~~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~-l~~rd~~rvTAtEV~-- 372 (516) T protein:vir:10 299 --PGAQ-TDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMET-MTRRDAERVTAVEIQ-- 372 (516) T ss_pred --cccc-cchhhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhh-hhccCCccccHHHHH-- Confidence 1111 1111122333343322222 22333333 23444444444444444433221111 000111223444333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---------HHHHHhcCCCcccceeEEecCCCCcCHHHHHHHHHHH---Hh-----c Q lcl|NC_021301. 346 EKGFLFKCEDRLSIAKIGLEAIL---------VKALQIEGESVEDTVDVSFESPDRVTLGEKYAAASLA---KA-----A 408 (456) Q Consensus 346 ~~~l~~k~~~~~~~f~~~l~~~~---------~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~ad~~~kl---~~-----~ 408 (456) .+.++++..++..+.++- +.+..+.......-+.+.. 
..+.+.+..++.+..+ .+ + T Consensus 373 -----~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~--v~~i~~L~raq~~~~i~~~~q~i~~~~ 445 (516) T protein:vir:10 373 -----RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVI--ITGIEALGRMAELDKLANFAQYMSLPL 445 (516) T ss_pred -----HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChhhcCcce--ehhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 444566666666665531 1111111111111122111 1112222222222111 11 1 Q ss_pred CCCc-----------HHHHHHhCCC------ChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 409 GESW-----------ASIRRNILNY------NADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 409 g~~s-----------~~t~~~~~~~------~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ++.. -+.....+|. ++++++++..++.+.+...+.....-+....+-+ T Consensus 446 q~~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~~~~ 510 (516) T protein:vir:10 446 QWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPGVIQ 510 (516) T ss_pred cCChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHhhhcccchhh Confidence 1211 0122334443 3556666555554444333332222122221111 No 254 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=96.66 E-value=0.00043 Score=39.06 Aligned_cols=420 Identities=11% Similarity=0.057 Sum_probs=164.9 Q ss_pred CCCC-----CHHHHHHHHHHHH----HHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhh Q lcl|NC_021301. 1 MTAS-----TPAEWLPVLTKRI----DDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI 71 (456) Q Consensus 1 ~~~~-----t~~~~~~~l~~~~----~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l 71 (456) |-.. -+.+.+.....++ .....+++.+.+|.--.- . +.. 
.......++--+-+...++.+++.| T Consensus 1 ~~~~~~~~~~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~~-~---~~~---~~~~~~~~~~dstg~~a~~~LAa~l 73 (515) T protein:vir:70 1 MQDTILEYGGQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPYL-M---NNK---GDNETSQNGWQGVGAQATNHLANKL 73 (515) T ss_pred CcchhhhhcCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccc-c---CCC---CCcccccccccchHHHHHHHHHHHH Confidence 3221 1222333333333 222345566666655421 1 110 0011111233455677777777776 Q ss_pred cc------CC-eecCCCC------cc--cHHH-----------HHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC Q lcl|NC_021301. 72 IP------NG-ITVGGSA------DS--DLAL-----------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD 125 (456) Q Consensus 72 ~~------~~-~~~~~~~------d~--~~~~-----------~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d 125 (456) .+ .| |++...+ +. .... .+...+..++|.....++.++...+|.+.+++ |++ T Consensus 74 ~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--d~~ 151 (515) T protein:vir:70 74 AQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLLYK--PSK 151 (515) T ss_pred HHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEEEE--eCC Confidence 53 23 3433211 00 1111 13445667889999999999999999986555 555 Q ss_pred CceEEEEEccceeEEEEeCCCCceEEEEEEEEEe-------cCCceE----EEEEEcCC-eEEEEEEeeeecccccceee Q lcl|NC_021301. 126 GTATITADSPETMVVSVDPLQPWRIRSAMRWWRD-------LDAESD----FAIVWSGD-GWQKFARPCFVQSSSRRRLV 193 (456) Q Consensus 126 g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~-------~d~~~~----~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 193 (456) +. ++.++-.+.++.-| ..++ +...++.++- ..+... ...-+.++ .+..|....... .+.+.. 
T Consensus 152 ~~--~~~~pl~~y~v~~d-~~G~-v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~~~--~~~~~~ 225 (515) T protein:vir:70 152 GA--MSAVPMHHYVVNRD-TNGD-LMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQYAG--EGFWKI 225 (515) T ss_pred CC--eEEEEcCeEEEeeC-CCcC-eeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEecC--CCceEE Confidence 54 44555555444433 4444 3333333321 001000 00000111 111111111111 111211 Q ss_pred ccCCCceeecccccccC-ceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccc Q lcl|NC_021301. 194 TRISDSWVPVGDAVVTG-SPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPK 267 (456) Q Consensus 194 ~~~~~~~~~~~~~~~~~-~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~ 267 (456) ....+..........++ .+|.+++ | .+.+|+|-.+..++-+..+|...-......+....|...+. T Consensus 226 ~~e~d~~~~~~es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~~lv~-------- 297 (515) T protein:vir:70 226 NQSADDIPVGKESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIKYLIR-------- 297 (515) T ss_pred EEecCceeeccccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCCeeeC-------- Confidence 11111111111111112 2333322 2 34689999999999999998777777777777666644331 Q ss_pred cccccchhhhhhhhhhhccceeccC-CCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHH Q lcl|NC_021301. 268 VDENGNAIDYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNI 345 (456) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~ 345 (456) .+|. .........+.|.+.... .+....++. .++++.....++.+...|....-+.. 
..-.++.+-+|.-++ T Consensus 298 --~~g~-~~~~~l~~~~~g~iv~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~-l~~rd~~rvTAtEV~-- 371 (515) T protein:vir:70 298 --PGSQ-TDVDHFVNSGTGEVITGVAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMET-MTRRDAERVTAVEIQ-- 371 (515) T ss_pred --cccc-cchhhccccCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhh-hhccCCccccHHHHH-- Confidence 1111 011111122223332221 122333433 23444444444444444433221111 111112223444333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--------HHHHHhcC-CCcccceeEEecCCCCcCHHHHH---HHHHHH---Hh--c Q lcl|NC_021301. 346 EKGFLFKCEDRLSIAKIGLEAIL--------VKALQIEG-ESVEDTVDVSFESPDRVTLGEKY---AAASLA---KA--A 408 (456) Q Consensus 346 ~~~l~~k~~~~~~~f~~~l~~~~--------~l~~~~~~-~~~~~~i~v~f~~~~~~~~~e~a---d~~~kl---~~--~ 408 (456) .+.++++..++..+.++- .+.....+ ......+++.+.. +.+.+..+ +.+... .+ + T Consensus 372 -----~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~v~~~~vs--~l~~L~r~q~~~~i~~~~q~i~~~~ 444 (515) T protein:vir:70 372 -----RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVT--GIEALGRMAELDKLANFAQYMSLPQ 444 (515) T ss_pred -----HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhhcccceeh--hHHHHHHHHHHHHHHHHHHHHHHHh Confidence 444566667777766532 11111111 1112223333322 11222221 111111 11 1 Q ss_pred CCCc-------H----HHHHHhCCC------ChhHHHHHHHHHHHHHHHHHhh----hhhhhcccccCC Q lcl|NC_021301. 409 GESW-------A----SIRRNILNY------NADQIKQDDLDRAREQITLFAG----NSVQRPQEDGSR 456 (456) Q Consensus 409 g~~s-------~----~t~~~~~~~------~~~~~~~~e~~~~~ee~~~~~~----~~~~~~~~d~~~ 456 (456) ++.. - ..+....|. ++++++++.+++.+.+...... 
.+....-.+..| T Consensus 445 ~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~~~ 513 (515) T protein:vir:70 445 TWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQEMK 513 (515) T ss_pred ccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhhhc Confidence 1111 0 111223332 4556665554444433322222 222222233333 No 255 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=96.60 E-value=0.00048 Score=38.80 Aligned_cols=400 Identities=12% Similarity=0.057 Sum_probs=176.3 Q ss_pred CCCCCHHHHHHHHH-------HHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhh-- Q lcl|NC_021301. 1 MTASTPAEWLPVLT-------KRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI-- 71 (456) Q Consensus 1 ~~~~t~~~~~~~l~-------~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l-- 71 (456) |..-+..-..+.+. +--.+...+|+.+..+++ +-.+|+..+.=. T Consensus 51 ~~~~~~~g~~~~~y~~~e~~~~~~~eLI~~YR~ma~~pE---------------------------vd~Av~eIVneaIv 103 (524) T protein:vir:98 51 LNNQKYAGVFQQFYSGQDPAIQNKEQLINTYRGIMSYPE---------------------------VENAVSEIIDDAIV 103 (524) T ss_pred CCcceecceeeeeccccccccchHHHHHHHHHHHhhccc---------------------------hhhHHHhhhcceeE Confidence 00000000000000 000011122222222222 222222222211 Q ss_pred ---ccCCeecCCCCc-------ccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCC---CceEEEEEcccee Q lcl|NC_021301. 72 ---IPNGITVGGSAD-------SDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDD---GTATITADSPETM 138 (456) Q Consensus 72 ---~~~~~~~~~~~d-------~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~d---g~~~i~~~~p~~~ 138 (456) ..+||.+.-+.. 
+....++..+.+--+|+....+..|.-.+.|+.|.+.-.|++ |-..++.++|+.+ T Consensus 104 ~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i 183 (524) T protein:vir:98 104 NEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCM 183 (524) T ss_pred ecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccc Confidence 123444432211 112344556666678888899999999999999998766543 4457888999988 Q ss_pred EEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EE Q lcl|NC_021301. 139 VVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VV 216 (456) Q Consensus 139 ~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv 216 (456) -.+..-.... ...++..+ .......+|.+.... +............ ..+|. |+ T Consensus 184 ~~vr~~~~~~-~~~~~~v~----~~~~e~f~Y~~~~~~-~~~~g~~~~~~~~-------------------ikI~~dAIv 238 (524) T protein:vir:98 184 ELIRESITET-LDGGVKVF----RGYREFFVYSAPKAG-YTYNGQIYQANQK-------------------IKIPRSAIV 238 (524) T ss_pred eeeeeccccc-cccchhhc----cceeeeeeeccCCCc-cccccceecCCCc-------------------eeechhhee Confidence 6664321111 11111111 112223344332211 1110100000000 11111 22 Q ss_pred EccC------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Ccccccccccchhhhhhhh------hh Q lcl|NC_021301. 217 VYQN------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDENGNAIDYASIF------EA 283 (456) Q Consensus 217 ~~~n------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~~~~~------~~ 283 (456) +.+. 
..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+.-=..+.+..+ .+ T Consensus 239 y~hSGL~d~~~~iisyLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa 317 (524) T protein:vir:98 239 YAHSGLEDCSNNIIGYLHRAVKPANQL-RLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDA 317 (524) T ss_pred eeccCcccCCCCeeeehhHhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeec Confidence 2211 11124444332222222 234444444444434433222111 1122211100000000000 00 Q ss_pred hccc-------------eec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccc--ccCc-HHHHHH Q lcl|NC_021301. 284 APGA-------------LWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPD--SANQ-SAEGAH 343 (456) Q Consensus 284 ~~~~-------------~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~--~~N~-Sg~Al~ 343 (456) ..|. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-++.. +-|. -|..|- T Consensus 318 ~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EIt 397 (524) T protein:vir:98 318 RTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEIT 397 (524) T ss_pred cCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchh Confidence 0010 011 1122334455544322334446777778888999998777421 1121 223466 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHH-------HHHHHHHHh-c Q lcl|NC_021301. 344 NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEK-------YAAASLAKA-A 408 (456) Q Consensus 344 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~-------ad~~~kl~~-~ 408 (456) -....+..-+.+.+..|..-+.++++.-+.++|... + ..|.+.|.....-.+... ++++..+.. . 
T Consensus 398 RDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyv 477 (524) T protein:vir:98 398 RDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVV 477 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhcccc Confidence 666677778889999999999999998877777532 2 247778876543333333 333333322 2 Q ss_pred C-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 409 G-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 409 g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) | .+|.++++ .+|.+++++++++ ...+++|...-.-....++.+|. T Consensus 478 Gky~s~dyi~k~ILr~tDeei~~~-~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 478 GKYVSHKYIMKEILRMSDEDIDEQ-AKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred ccccchHHHHHHHhccCHHHHHHH-HHHHHHHHhCCCCcCCccccccC Confidence 3 67888875 5789999988753 44455554433333333444444 No 256 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=96.17 E-value=0.00091 Score=37.24 Aligned_cols=347 Identities=10% Similarity=-0.067 Sum_probs=140.0 Q ss_pred HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCCC------Cc---ccHHHHHH Q lcl|NC_021301. 21 MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGGS------AD---SDLALRAR 91 (456) Q Consensus 21 ~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~~------~d---~~~~~~l~ 91 (456) ..-+.++..++..+.... .+.......+.. .........+|+.+++-+..-|+.+-.. .+ +.....+. 
T Consensus 1 M~~f~k~~~~~~~~~~~~-~~~~~~~~~~~~--~~~~~~v~~~v~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~ 77 (378) T protein:vir:85 1 MNLFGKVVSFSRGKLNND-TQRVTAWQNEAV--EYTSAFVTNIHNKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLD 77 (378) T ss_pred CchhhhhhhhhhcccccC-Ccceeeeeccch--hhhhHHHHHHHHHHHHhHhhCceeEEEEeccccccccccccccchHH Confidence 222222222222221100 000000000010 0123345667888888877777653110 00 11122345 Q ss_pred HHHHh--cC---hhHHHHHHHHHHhhCCeEEEE-EeeCCCCceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceE Q lcl|NC_021301. 92 RIWRD--NR---MDSVCKQWVKYGLDFGESYLT-CWRRDDGTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESD 165 (456) Q Consensus 92 ~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~-v~~d~dg~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~ 165 (456) ++++. |. -..+...+....+.+|.||++ +..+.+|.+.... +. ++.. T Consensus 78 ~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~~~~-------------------------~~-~~~~- 130 (378) T protein:vir:85 78 EVLNWSYKGEHNSMEFWQKVIKKLLCTRYVDLYPIFDSETGELLDLL-------------------------FA-NDKK- 130 (378) T ss_pred HHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEeecCCCceEEEEE-------------------------ec-CCCE- Confidence 55542 32 234556678888899999975 3444443321111 00 1110 Q ss_pred EEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCC-CcHhHHHHHHHHHHHHHHHH Q lcl|NC_021301. 166 FAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGM-GEVEPHIDIINRINRAELQL 244 (456) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~-s~~~~v~~liDa~~~~~s~~ 244 (456) .|..+.+.++ .++... +....+-...++++... T Consensus 131 ---~~~~~dvih~----------------------------------------~~~~~~~~~~~~~~~a~~~~~~~~--- 164 (378) T protein:vir:85 131 ---EYKPEELVRL----------------------------------------VSPFYINEDTSILDNALASIQTKL--- 164 (378) T ss_pred ---EEcccceEEE----------------------------------------ecCcCccchhhHHHHHHHHHHHHH--- Confidence 1111111111 111111 01111111222222111 Q ss_pred HHHHHHhhchhhhhhcCCCccccccccc-chhhhhhh-hh-----hhccceeccCCCceeEeecccchHHHHHHHHHHHH Q lcl|NC_021301. 
245 LSTMAIQAFRQRALKSAGHGLPKVDENG-NAIDYASI-FE-----AAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIR 317 (456) Q Consensus 245 ~~~~~~~~~~~~~i~g~~~~~~~~~~~~-~~~~~~~~-~~-----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~ 317 (456) ..+.+.-+++ .... ..++.. ........ +. ...+.+..++.+.++.+++-.+...-.+.++.... T Consensus 165 -----~~~~~~g~l~-~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~~~~~~~~~~~~~ 236 (378) T protein:vir:85 165 -----EQGKLRGLLK-INAF--LDIDNTQEYREKALATIKNMQEGSSYNGLTPVDNKTEIVELKKDYSVLNKDEIELIKS 236 (378) T ss_pred -----hcCCcceEEE-eCCc--CCHHHHHHHHHHHHHHHHHhhcccccccceecCCCceEEeccCChhhhhHHHHHHHHH Confidence 1112211111 1110 011111 11111110 11 12345677788888888753321122244567778 Q ss_pred HHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEe--cCCCCcCH Q lcl|NC_021301. 318 QLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSF--ESPDRVTL 395 (456) Q Consensus 318 ~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f--~~~~~~~~ 395 (456) +|+.+-|+|+..+.+.. .....+.+....|.-.+...+..+...| +.---...+......+++.| ..-.-.|. T Consensus 237 ~Ia~~fgVPp~~l~~s~--~e~~~~~f~~~tL~P~~~~ie~~l~~kL---l~~~er~~~~~~~~~~~~~f~~~~l~~~d~ 311 (378) T protein:vir:85 237 ELLTGYFMNENILLGTA--TQEQQIYFYNSTIIPLLIQLEKELTYKL---ISTNRRRVVKGNLYYERIIVDNQLFKFATL 311 (378) T ss_pred HHHHHhCCCHHHhcCCc--hHHHHHHHHHHHHHHHHHHHHHHHHhhc---CChhhhhhhhhccccceeeecchhhhhcCH Confidence 89999999999885432 1111122222222222222111111111 00000001111122233444 44566788 Q ss_pred HHHHHHHHHHHhcCCCcHHHHHHhCCCChhHHH-----HHHHHHHHHHH-HHHhhhhhhhcccccCC Q lcl|NC_021301. 396 GEKYAAASLAKAAGESWASIRRNILNYNADQIK-----QDDLDRAREQI-TLFAGNSVQRPQEDGSR 456 (456) Q Consensus 396 ~e~ad~~~kl~~~g~~s~~t~~~~~~~~~~~~~-----~~e~~~~~ee~-~~~~~~~~~~~~~d~~~ 456 (456) .+.++++.++.+.|+++.--+++.+|+.|-+-- ......+.... 
....+...+..+|+.++ T Consensus 312 ~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~gGD~~~~~~N~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:85 312 KELIDLYHENINGPIFTQNQLLVKMGEQPIEGGDIYIANLNAVAVKNLSDLQGSRKDVASTDETNNQ 378 (378) T ss_pred HHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeecccccccccchhhcCccCCCCCCCCCCCC Confidence 999999999999999999888999988653210 00111111111 11222333445566666 No 257 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=96.08 E-value=0.001 Score=36.96 Aligned_cols=413 Identities=10% Similarity=0.041 Sum_probs=172.2 Q ss_pred CCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhh-----c Q lcl|NC_021301. 1 MTASTPAE--WLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRI-----I 72 (456) Q Consensus 1 ~~~~t~~~--~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l-----~ 72 (456) -|.-.|.. =...+..+. ....-.--+..||.....+. ...++-+..+ .+.++=+-.+|+..+.=. . T Consensus 27 ~s~~~p~~~dGa~~i~~~~-~~~~~~g~~~~~~~~~~~~~-----~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~ 100 (516) T protein:vir:10 27 ESIATPKKDDGATEIETRE-GEATYNAVMQQFFGIDNNIS-----GTKDLINTYRQLINNPEVERAVANIVNEAIVYERG 100 (516) T ss_pred CcccCCCCCCCceeeecCC-Ccccccceeeeeeccccccc-----hHHHHHHHHHHHhhccchhhHHHHhhcceeEecCC Confidence 00000000 000000000 00000000011111111110 0000100000 111222222333333221 1 Q ss_pred cCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC--CCCceEEEEEccceeEEEEe Q lcl|NC_021301. 73 PNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR--DDGTATITADSPETMVVSVD 143 (456) Q Consensus 73 ~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d--~dg~~~i~~~~p~~~~~~~d 143 (456) .+||.+.-+. 
.+....++..+.+--+|+....+..|.-.+.|+.|.+...| .+|-..+..++|+.+..+.- T Consensus 101 ~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~ 180 (516) T protein:vir:10 101 HKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE 180 (516) T ss_pred CceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee Confidence 2344443211 11123445556666778888999999999999999986554 35667789999998776542 Q ss_pred CCCCceEEEEEEEEEecCC-----ceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EE Q lcl|NC_021301. 144 PLQPWRIRSAMRWWRDLDA-----ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VV 216 (456) Q Consensus 144 ~~~~~~~~~~~~~~~~~d~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv 216 (456) -.. .+.++ ......+|.+.. ..+......... +. ...+|. |+ T Consensus 181 i~~-----------~~~~~~~v~~~~~e~~~Y~~~~-~~~~~~g~~~~~----------~~---------~ikI~~dAI~ 229 (516) T protein:vir:10 181 IVT-----------SDIGGTTIVKGYREFFIYTTGN-EGYSYNGRIFEP----------NT---------RIKIPRSAVV 229 (516) T ss_pred ecc-----------cccccchhhhhhhheeeeccCc-cccccccceeCC----------Cc---------ceeechhhee Confidence 100 01111 111223333211 111110000000 00 011111 22 Q ss_pred Ecc-------CCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc------------------ Q lcl|NC_021301. 
217 VYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE------------------ 270 (456) Q Consensus 217 ~~~-------n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~------------------ 270 (456) +.+ ...=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 230 y~hSGL~d~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYD 308 (516) T protein:vir:10 230 YASSGLMDCSDRGIIGYLHNAVKPANQL-KLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYD 308 (516) T ss_pred eecccceeCCCCceeeeehhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEe Confidence 221 111124444332222222 233344444343333332221111 11221111 Q ss_pred --ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-C---cHHH Q lcl|NC_021301. 271 --NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-N---QSAE 340 (456) Q Consensus 271 --~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N---~Sg~ 340 (456) +|...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-++...+ | .-|. T Consensus 309 a~TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~ 387 (516) T protein:vir:10 309 SNTGTVKNQKRNLSMTED-YWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDT 387 (516) T ss_pred CCCCeeccchhhhhhHhh-hcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccc Confidence 1111000000000000 011 112233444554432234444677777888999999888754321 1 1223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHH-------HHHHHHHH Q lcl|NC_021301. 341 GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEK-------YAAASLAK 406 (456) Q Consensus 341 Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~-------ad~~~kl~ 406 (456) .+--....+..-+.+.+..|..-+.++++.-+.++|.-. + ..|.+.|.....-.+... ++++..+. 
T Consensus 388 EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~d 467 (516) T protein:vir:10 388 AITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIE 467 (516) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhh Confidence 344555566677888999999999999998877777521 1 346778876543333332 33333332 Q ss_pred h--cCCCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 407 A--AGESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 407 ~--~g~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) . .+.+|.++++ .+|.++++++++++ +.+++|...-. .-.+..++|. T Consensus 468 pyvGky~s~~yi~k~ILr~tDeei~~e~-k~I~~E~~~~~-~~~p~~~~~f 516 (516) T protein:vir:10 468 PYVGKYVSHDYVMKNILQMTEEQIAQEE-KQIEQEAGIKR-FQNPENEDDF 516 (516) T ss_pred hhhccccchHHHHHHHhcCCHhhHHHHH-HHHHHhhhCCC-CCCCCccccC Confidence 1 2467888876 57899998887643 34444433211 1122333444 No 258 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=96.08 E-value=0.001 Score=36.96 Aligned_cols=413 Identities=10% Similarity=0.041 Sum_probs=172.2 Q ss_pred CCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhh-----c Q lcl|NC_021301. 1 MTASTPAE--WLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRI-----I 72 (456) Q Consensus 1 ~~~~t~~~--~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l-----~ 72 (456) -|.-.|.. =...+..+. ....-.--+..||.....+. ...++-+..+ .+.++=+-.+|+..+.=. . 
T Consensus 27 ~s~~~p~~~dGa~~i~~~~-~~~~~~g~~~~~~~~~~~~~-----~~~eLI~~YR~ma~~pEvd~Av~eIVneaiv~d~~ 100 (516) T protein:vir:10 27 ESIATPKKDDGATEIETRE-GEATYNAVMQQFFGIDNNIS-----GTKDLINTYRQLINNPEVERAVANIVNEAIVYERG 100 (516) T ss_pred CcccCCCCCCCceeeecCC-Ccccccceeeeeeccccccc-----hHHHHHHHHHHHhhccchhhHHHHhhcceeEecCC Confidence 00000000 000000000 00000000011111111110 0000100000 111222222333333221 1 Q ss_pred cCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC--CCCceEEEEEccceeEEEEe Q lcl|NC_021301. 73 PNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR--DDGTATITADSPETMVVSVD 143 (456) Q Consensus 73 ~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d--~dg~~~i~~~~p~~~~~~~d 143 (456) .+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+...| .+|-..+..++|+.+..+.- T Consensus 101 ~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~ 180 (516) T protein:vir:10 101 HKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYRE 180 (516) T ss_pred CceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEee Confidence 2344443211 11123445556666778888999999999999999986554 35667789999998776542 Q ss_pred CCCCceEEEEEEEEEecCC-----ceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EE Q lcl|NC_021301. 144 PLQPWRIRSAMRWWRDLDA-----ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VV 216 (456) Q Consensus 144 ~~~~~~~~~~~~~~~~~d~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv 216 (456) -.. .+.++ ......+|.+.. ..+......... +. ...+|. 
|+ T Consensus 181 i~~-----------~~~~~~~v~~~~~e~~~Y~~~~-~~~~~~g~~~~~----------~~---------~ikI~~dAI~ 229 (516) T protein:vir:10 181 IVT-----------SDIGGTTIVKGYREFFIYTTGN-EGYSYNGRIFEP----------NT---------RIKIPRSAVV 229 (516) T ss_pred ecc-----------cccccchhhhhhhheeeeccCc-cccccccceeCC----------Cc---------ceeechhhee Confidence 100 01111 111223333211 111110000000 00 011111 22 Q ss_pred Ecc-------CCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc------------------ Q lcl|NC_021301. 217 VYQ-------NPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE------------------ 270 (456) Q Consensus 217 ~~~-------n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~------------------ 270 (456) +.+ ...=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ T Consensus 230 y~hSGL~d~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYD 308 (516) T protein:vir:10 230 YASSGLMDCSDRGIIGYLHNAVKPANQL-KLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYD 308 (516) T ss_pred eecccceeCCCCceeeeehhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEe Confidence 221 111124444332222222 233344444343333332221111 11221111 Q ss_pred --ccchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-C---cHHH Q lcl|NC_021301. 271 --NGNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-N---QSAE 340 (456) Q Consensus 271 --~~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N---~Sg~ 340 (456) +|...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-++...+ | .-|. 
T Consensus 309 a~TGev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~ 387 (516) T protein:vir:10 309 SNTGTVKNQKRNLSMTED-YWLMRRDGKSVTEVSSLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDT 387 (516) T ss_pred CCCCeeccchhhhhhHhh-hcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccc Confidence 1111000000000000 011 112233444554432234444677777888999999888754321 1 1223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHH-------HHHHHHHH Q lcl|NC_021301. 341 GAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEK-------YAAASLAK 406 (456) Q Consensus 341 Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~-------ad~~~kl~ 406 (456) .+--....+..-+.+.+..|..-+.++++.-+.++|.-. + ..|.+.|.....-.+... ++++..+. T Consensus 388 EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~d 467 (516) T protein:vir:10 388 AITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIE 467 (516) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhh Confidence 344555566677888999999999999998877777521 1 346778876543333332 33333332 Q ss_pred h--cCCCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 407 A--AGESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 407 ~--~g~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) . .+.+|.++++ .+|.++++++++++ +.+++|...-. .-.+..++|. 
T Consensus 468 pyvGky~s~~yi~k~ILr~tDeei~~e~-k~I~~E~~~~~-~~~p~~~~~f 516 (516) T protein:vir:10 468 PYVGKYVSHDYVMKNILQMTEEQIAQEE-KQIEQEAGIKR-FQNPENEDDF 516 (516) T ss_pred hhhccccchHHHHHHHhcCCHhhHHHHH-HHHHHhhhCCC-CCCCCccccC Confidence 1 2467888876 57899998887643 34444433211 1122333444 No 259 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=95.72 E-value=0.0016 Score=35.95 Aligned_cols=416 Identities=12% Similarity=0.071 Sum_probs=179.0 Q ss_pred CCCCCHH--HHHHHHH-HHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhhc---- Q lcl|NC_021301. 1 MTASTPA--EWLPVLT-KRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRII---- 72 (456) Q Consensus 1 ~~~~t~~--~~~~~l~-~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l~---- 72 (456) -|.-+|. +=...+- .-+..... --+..+|-+--. ..+.. ++-...+ .+.++=+-.+|+..+.=.+ T Consensus 21 ~S~~~p~~~DGa~~i~~~~~~~~~~--g~~~~~~~~~~~----~~~~~-eLI~~YR~ma~~pEvd~Av~eIvne~iv~d~ 93 (511) T protein:vir:56 21 RSFSAPDNVDGAKEIHTNLLAPQLG--HAIIPSDAQSEG----TIPVK-ELIKSYRALAEYHEVDDAIQEIVDEAIVYEN 93 (511) T ss_pred ccccCCCCCCCceEEecccccceec--ceeccccccccC----ccchH-HHHHHHHHHhhccchhhHHHHhhcceeEecC Confidence 0111110 0000000 00000000 000000000000 00000 1111111 1112223333333333221 Q ss_pred -cCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCC-CCceEEEEEccceeEEEEe Q lcl|NC_021301. 73 -PNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRD-DGTATITADSPETMVVSVD 143 (456) Q Consensus 73 -~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~-dg~~~i~~~~p~~~~~~~d 143 (456) .+||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+.-.|+ +|-..++.+||+.+-.+.. 
T Consensus 94 ~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~i~~vr~ 173 (511) T protein:vir:56 94 DKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMKMELVRE 173 (511) T ss_pred CCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeehhhcCcccchhhhh Confidence 2344443211 111234455666667888889999999999999988875554 4666788999998766543 Q ss_pred CCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EEEc--- Q lcl|NC_021301. 144 PLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVVY--- 218 (456) Q Consensus 144 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~~--- 218 (456) -.. .....+. ..+......+|.+...... .+.... +.......+|. |++. T Consensus 174 i~~--~~~~~~~----v~~~~~ey~~Y~~~~~~~~---~~~~~~----------------~~~~~~vkI~~daI~y~hSG 228 (511) T protein:vir:56 174 IQK--ETIDGVE----VVKGTLEYYVYKQSDYKMP---SWMSAT----------------NRAQTSFRIPKDAIVFAHSG 228 (511) T ss_pred hhc--ccccccc----cccceeeeeEecCCCcccC---cccccc----------------cccccceeechhheeeeccc Confidence 111 0000000 0011223344443221110 000000 00001111111 2221 Q ss_pred ------cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc--------------------c Q lcl|NC_021301. 219 ------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE--------------------N 271 (456) Q Consensus 219 ------~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~--------------------~ 271 (456) ++....|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ + T Consensus 229 L~d~~~~~g~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~T 307 (511) T protein:vir:56 229 LMRGCADDPYIIGYLDRAIKPANQL-KMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQT 307 (511) T ss_pred ceeccCCCCeeeccchhhhHHHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccC Confidence 2223445555433322222 233444444444333332222111 11221111 1 Q ss_pred cchhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc----C-cHHHHH Q lcl|NC_021301. 
272 GNAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA----N-QSAEGA 342 (456) Q Consensus 272 ~~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~----N-~Sg~Al 342 (456) |...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-++.... | --|..| T Consensus 308 Gev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EI 386 (511) T protein:vir:56 308 GQVKNTTNAMSMLED-YYLPRREGSKGTEVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEI 386 (511) T ss_pred ceeccchhhhhhHhh-hcccccCCCCccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhh Confidence 111000000000000 011 112233444554432234444677777888999999888853321 1 123345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------HHHHHHHhc Q lcl|NC_021301. 343 HNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------AAASLAKAA 408 (456) Q Consensus 343 ~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d~~~kl~~~ 408 (456) --....+..-+.+.+..|..-+.++++.-+.++|.-. + ..|.+.|.....-.+...+ +++..+..- T Consensus 387 tRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpy 466 (511) T protein:vir:56 387 TRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDY 466 (511) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcch Confidence 6666677778889999999999999998877777521 1 3477788765444333333 333322210 Q ss_pred -C-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccccc Q lcl|NC_021301. 409 -G-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDG 454 (456) Q Consensus 409 -g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~ 454 (456) | .+|.++++ .+|.+++++++++ .+.+++|.. ....+++++|. 
T Consensus 467 vGky~S~~yi~k~ILr~tDeei~~~-~k~I~~E~k---~~~~~~~e~~f 511 (511) T protein:vir:56 467 AGKYYSHKYIQKNILRLSDDQITAM-QSEIDEEET---NPRFQQDDQGF 511 (511) T ss_pred hccccchHHHHHHHhccCHHHHHHH-HHHHHHhhc---CCCCCCcccCC Confidence 3 46888876 5789999887764 334444432 24555677777 No 260 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=95.70 E-value=0.0016 Score=35.92 Aligned_cols=417 Identities=11% Similarity=0.047 Sum_probs=171.8 Q ss_pred CCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhh-----c Q lcl|NC_021301. 1 MTASTPA--EWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRI-----I 72 (456) Q Consensus 1 ~~~~t~~--~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l-----~ 72 (456) -|.-+|. +=...+.... ....-.-.+..||.-...+.. ..++-...+ .+.+.=+-.+|+..+.=. . T Consensus 27 ~s~~~p~~~DGa~~i~~~~-~~~~~~g~~~~~~d~~~~~~~-----~~~LI~~YR~ma~~pEvd~Av~eIvneaiv~d~~ 100 (516) T protein:vir:10 27 ESIATPKKDDGATEIEARE-GESSYNALMQQFFGIDNNISG-----TKDLINTYRQLTNNPEVERAVANIVNEAVVYEKG 100 (516) T ss_pred CcccCCCCccCceeeecCc-ccccccceeeeeecccCcccc-----HHHHHHHHHHhhhccchhHHHHHhhcceeEecCC Confidence 1111110 0000000000 000000011112221111110 011111111 111222223333333322 1 Q ss_pred cCCeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC--CCCceEEEEEccceeEEEEe Q lcl|NC_021301. 73 PNGITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR--DDGTATITADSPETMVVSVD 143 (456) Q Consensus 73 ~~~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d--~dg~~~i~~~~p~~~~~~~d 143 (456) .+||.+..+. 
......++..+.+--+|+....+..|.-.+.|+.|.+...| .+|-..+..++|+.+..+.- T Consensus 101 ~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~elr~lDPr~i~~vR~ 180 (516) T protein:vir:10 101 HKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMPNPKEGIVELRRLDPRHVEYYRE 180 (516) T ss_pred CceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEecCcccceeeeeeeCCcceeeEEe Confidence 2345543221 11123345556666678888999999999999999986554 35666789999998776542 Q ss_pred CCCCceEEEEEEEEEecCCceEEEEEEcC-CeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EEEccC Q lcl|NC_021301. 144 PLQPWRIRSAMRWWRDLDAESDFAIVWSG-DGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVVYQN 220 (456) Q Consensus 144 ~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~~~n 220 (456) -... ....+. ...+ .....+|.+ +..|.+.... .. ......+|. |++.+. T Consensus 181 i~~~--~~~~~~---v~~~-~~e~~~Y~~~~~~~~~~g~~--~~-------------------~~~~ikI~~daI~y~hS 233 (516) T protein:vir:10 181 IVTS--DVGGTS---VVKG-YREFFVYTTGNEGYAYNGRL--FE-------------------PNTRIKIPRSAIVYAHS 233 (516) T ss_pred eecc--cCcchh---hhhc-eeeeeeeecCccceeccccc--cC-------------------CCCceecchhheeeeec Confidence 1000 000000 0011 112223322 2212111000 00 000111111 222221 Q ss_pred -------CCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc--------------------cc Q lcl|NC_021301. 
221 -------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE--------------------NG 272 (456) Q Consensus 221 -------~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~--------------------~~ 272 (456) ..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+ +| T Consensus 234 Gl~d~~~~~i~syLhkAiKp~NQL-km~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TG 312 (516) T protein:vir:10 234 GLQDCSDRGIVGYLHNAVKPANQL-KLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTG 312 (516) T ss_pred CcccCCCCceeceehhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCC Confidence 11123344322222221 233344444444333332222111 11221111 11 Q ss_pred chhhhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-C---cHHHHHHH Q lcl|NC_021301. 273 NAIDYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-N---QSAEGAHN 344 (456) Q Consensus 273 ~~~~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N---~Sg~Al~~ 344 (456) ...+.......-.. .|. .+.+..+..|+..+--+-++-++-+-..++...++|.+-+....+ | ..+..+-- T Consensus 313 ev~ddrk~msMlED-yWLpRReGgrgTEItTLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItR 391 (516) T protein:vir:10 313 TVKNQKRNLSMTED-YWLMRRDGKSVTEVTSLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITR 391 (516) T ss_pred eeccchhhhhhHhh-hcccccCCCcccceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhH Confidence 11000000000000 011 112233444554432234444677778889999999888754321 1 12234544 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHH-------HHHHHHHHh--c Q lcl|NC_021301. 345 IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEK-------YAAASLAKA--A 408 (456) Q Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~-------ad~~~kl~~--~ 408 (456) ....+..-+.+.+..|..-+..+++.-+.++|.-. + ..|.+.|.....-.+... ++++..+.. . 
T Consensus 392 DEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvG 471 (516) T protein:vir:10 392 DELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVG 471 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhc Confidence 55566677788899999999999987777777421 1 246777866543333332 333333321 2 Q ss_pred CCCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 409 GESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 409 g~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) +.+|.++++ .+|.++++++++++ +.+++|... ...+.|+++.+= T Consensus 472 ky~s~~yi~k~ILr~tDeei~~~~-k~I~~E~~~---~~~~~p~~e~~f 516 (516) T protein:vir:10 472 KYVSHDYVMKNILQMTDEQIAQEE-KQIEKEANV---KRFQNPENEDDF 516 (516) T ss_pred cccchHHHHHHHhcCCHhHHHHHH-HHHHHhhhC---CCCCCCCccccC Confidence 467888876 57899998887643 344444332 112223322222 No 261 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=95.16 E-value=0.0026 Score=34.72 Aligned_cols=417 Identities=12% Similarity=0.063 Sum_probs=160.7 Q ss_pred CCCCCH--HHHHHHHHHHHHHH----HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc- Q lcl|NC_021301. 1 MTASTP--AEWLPVLTKRIDDG----MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP- 73 (456) Q Consensus 1 ~~~~t~--~~~~~~l~~~~~~~----~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~- 73 (456) |...|. .+.++...+++..+ ..+++.+.+|.--.. ++.. 
.......++--+-+...++.+++.|.+ T Consensus 5 ~~~~~~~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~~----~~~~---~~~~~~~~~~dstg~~a~~~LAa~l~~~ 77 (516) T protein:vir:96 5 IDLEYGGKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPYL----MNDK---GDNETSQNGWQGVGAQATNHLANKLAQV 77 (516) T ss_pred hhhhhhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhcccc----cCCC---CCccccCCcccchHHHHHHHHHHHHHhh Confidence 444443 34455555554433 345556666655421 1110 011111123345677788888877754 Q ss_pred -----CC-eecCCCCc---------c---cHHH-------HHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce Q lcl|NC_021301. 74 -----NG-ITVGGSAD---------S---DLAL-------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA 128 (456) Q Consensus 74 -----~~-~~~~~~~d---------~---~~~~-------~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~ 128 (456) .| |++...++ . +... .+...+..++|.....++.++...+|.+.+ |.++++. T Consensus 78 ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~d~~~~- 154 (516) T protein:vir:96 78 LFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML--YKPSKGA- 154 (516) T ss_pred hcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE--EecCCCC- Confidence 23 44432211 0 0111 134456678899999999999999999865 4466654 Q ss_pred EEEEEccceeEEEEeCCCCceEEEEEEEEE-e------cCCceE-EEEEE---cCC-eEEEEEEeeeecccccceeeccC Q lcl|NC_021301. 129 TITADSPETMVVSVDPLQPWRIRSAMRWWR-D------LDAESD-FAIVW---SGD-GWQKFARPCFVQSSSRRRLVTRI 196 (456) Q Consensus 129 ~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~-~------~d~~~~-~~~~~---~~~-~~~~~~~~~~~~~~~~~~~~~~~ 196 (456) ++.++-.+.++.-| ..++ +...++... . ..+... ....+ -++ .+..|............+..+. 
T Consensus 155 -~~~~pl~~y~v~~d-~~G~-v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~~~~- 230 (516) T protein:vir:96 155 -ISAIPMHHYVVNRD-TNGD-LLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKYLGDGFWELKQSA- 230 (516) T ss_pred -EEEEEcCeEEEeeC-CCCC-eeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeeeeCCceeEEEEEe- Confidence 44554455444433 3343 222222221 0 000000 00000 011 1111111111111000011111 Q ss_pred CCceeecccccc-cCceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc Q lcl|NC_021301. 197 SDSWVPVGDAVV-TGSPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE 270 (456) Q Consensus 197 ~~~~~~~~~~~~-~~~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 270 (456) ++.. ....... +..+|.+++ | .+.+|+|-.+..++-+..+|...-...........|...+. . T Consensus 231 d~~~-~~~es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~~lv~----------p 299 (516) T protein:vir:96 231 DDIP-VGKVSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIKYLIR----------P 299 (516) T ss_pred Ccee-eccccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCccccC----------c Confidence 1111 1111111 122333332 2 34689999998988888888666666666666555543221 1 Q ss_pred ccchhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhc-ccccCcHHHHHHHHHH Q lcl|NC_021301. 271 NGNAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLM-PDSANQSAEGAHNIEK 347 (456) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~-~~~~N~Sg~Al~~~~~ 347 (456) +|. .........+.|.+....+ +....++. .++++.-...++.+...|....-+. .+. 
.+..+-+|.-++ T Consensus 300 ~g~-~~~~~l~~~~~g~i~~g~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~--~l~~r~~~rvTAtEV~---- 372 (516) T protein:vir:96 300 GAQ-TDVDHFVNSGTGEVVTGVEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMME--TMTRRDAERVTAVEIQ---- 372 (516) T ss_pred ccc-cchhhhccCCCceeecCCcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhh--hhccCCCccccHHHHH---- Confidence 111 0111112233333322222 22333333 2344444444444444443322111 011 112223444333 Q ss_pred HHHHHHHHHHHHHHHHHHHH---------HHHHHHhcCCCcccceeEEecCCCCcCHHHH----------HHHHHHHHhc Q lcl|NC_021301. 348 GFLFKCEDRLSIAKIGLEAI---------LVKALQIEGESVEDTVDVSFESPDRVTLGEK----------YAAASLAKAA 408 (456) Q Consensus 348 ~l~~k~~~~~~~f~~~l~~~---------~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~----------ad~~~kl~~~ 408 (456) .+.++++..++..+.++ .+++..+........+++.+... -+.+.. ++.+..+. T Consensus 373 ---~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp~~~v~~~~vs~--l~~l~r~~~~~~i~~~~~~i~~~~-- 445 (516) T protein:vir:96 373 ---RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFTSDLVDPVIITG--IEALGRMAELDKLANFAQYMSLPL-- 445 (516) T ss_pred ---HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCccccccceeech--HHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 44455666677666552 12222222222222233333222 111211 12222111 Q ss_pred CCCc-------HHH----HHHhCCCC------hhHHHHHHHHHHHHHHHH-Hhhhhhhh-cccccCC Q lcl|NC_021301. 409 GESW-------ASI----RRNILNYN------ADQIKQDDLDRAREQITL-FAGNSVQR-PQEDGSR 456 (456) Q Consensus 409 g~~s-------~~t----~~~~~~~~------~~~~~~~e~~~~~ee~~~-~~~~~~~~-~~~d~~~ 456 (456) ++.. -.. ..+.+|++ ++|++++.+++.+.+... .+....+. 
+..-+.+ T Consensus 446 ~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~ 512 (516) T protein:vir:96 446 QWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQE 512 (516) T ss_pred cCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcc Confidence 1111 111 12334543 445554433333222211 11111111 1111111 No 262 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=94.88 E-value=0.0033 Score=34.22 Aligned_cols=423 Identities=12% Similarity=0.056 Sum_probs=158.6 Q ss_pred CCCC---CHHHHHHH---HHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc- Q lcl|NC_021301. 1 MTAS---TPAEWLPV---LTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP- 73 (456) Q Consensus 1 ~~~~---t~~~~~~~---l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~- 73 (456) |+-. +-..+.++ |..+-.....+++.+.+|.--.. ....+ ......++--+-+...++.+++.|.+ T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP~~-~~~~~------~~~~~~~~~dstg~~a~~~LAa~l~~~ 73 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLPYL-MADVN------DDLSSQNAWQDDGASATNFLSNKLSQV 73 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcccc-ccCCC------CCccccccccchHHHHHHHHHHHHHHh Confidence 5444 33332222 22222233445666666654421 00000 01112234456677788888877753 Q ss_pred -----CC-eecCCCC--------cccHHHH-----------HHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce Q lcl|NC_021301. 74 -----NG-ITVGGSA--------DSDLALR-----------ARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA 128 (456) Q Consensus 74 -----~~-~~~~~~~--------d~~~~~~-----------l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~ 128 (456) .| |++.... +.+...+ +...+..++|.....++.++...+|.+.+++ + ++.. 
T Consensus 74 ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~--~-~~~~ 150 (517) T protein:vir:10 74 LFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMMYH--P-DKTS 150 (517) T ss_pred hcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEE--e-CCCC Confidence 23 3333211 1111111 2345566789999999999999999986543 3 3344 Q ss_pred EEEEEccceeEEEEeCCCCceEEEEEEEEEe-------cCCce----EEEEEEcCCeEEEEEEeeeecccccceee-ccC Q lcl|NC_021301. 129 TITADSPETMVVSVDPLQPWRIRSAMRWWRD-------LDAES----DFAIVWSGDGWQKFARPCFVQSSSRRRLV-TRI 196 (456) Q Consensus 129 ~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~-------~d~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 196 (456) .++.++-.+.++.-| ..++ +...++...- .-+.. ....-+.++..+.+....+... .+.+.. ... T Consensus 151 ~~~~~pl~~y~v~~d-~~G~-v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~-~~~~~~~~~~ 227 (517) T protein:vir:10 151 PIQAVPLHHYCVRRD-NNGT-VLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRTK-DGKYLIRQSA 227 (517) T ss_pred cEEEEEcCeEEEeeC-CCcC-eEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEeC-CCceEEEEEe Confidence 566665556544444 3444 3333333221 00100 0000111211111111111111 111111 111 Q ss_pred CCceeecccccc-cCceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccc Q lcl|NC_021301. 197 SDSWVPVGDAVV-TGSPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDE 270 (456) Q Consensus 197 ~~~~~~~~~~~~-~~~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~ 270 (456) ++... ...... +..+|.+++ | .+.+|+|-.+..++-+..++...-...........|...+. . 
T Consensus 228 d~~~~-~~~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~~a~~~~~lv~----------~ 296 (517) T protein:vir:10 228 DDVPV-GKESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMALMADVKYLVK----------P 296 (517) T ss_pred Cceee-ccccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHHHhccCCcccC----------c Confidence 11111 111111 112232222 1 34689999898888888888666655555555554433321 0 Q ss_pred ccchhhhhhhhhhhccceeccC-CCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHH Q lcl|NC_021301. 271 NGNAIDYASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKG 348 (456) Q Consensus 271 ~~~~~~~~~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~ 348 (456) +|. .........+.|.+.... .+....++. ..++....+.+..+...|....-+.. ..-.+..+-+|.-++ T Consensus 297 ~~~-~~~~~l~~~~~g~~~~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~-l~~~~~~rvTAtEV~----- 369 (517) T protein:vir:10 297 GSY-TDINQFVEGGSGAVLHGVEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEA-MTRRDAERVTAYEIQ----- 369 (517) T ss_pred ccc-cchhhccCCCccccccCCcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhh-hhccCCccccHHHHH----- Confidence 010 001111122223222211 122223332 22344333334333333333221111 101111223444333 Q ss_pred HHHHHHHHHHHHHHHHHHH--------HHHHHH-hcCCCcccceeEEecCCCC-cCHHHHHHHHHHHHh-----cCC--- Q lcl|NC_021301. 349 FLFKCEDRLSIAKIGLEAI--------LVKALQ-IEGESVEDTVDVSFESPDR-VTLGEKYAAASLAKA-----AGE--- 410 (456) Q Consensus 349 l~~k~~~~~~~f~~~l~~~--------~~l~~~-~~~~~~~~~i~v~f~~~~~-~~~~e~ad~~~kl~~-----~g~--- 410 (456) .+.++++..++..+.++ +...+. +........+++.+..++. -.....++.+..+.+ +.. 
T Consensus 370 --~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~~l~~~~v~~~~~s~la~l~r~~~~~~i~~~~~~i~~~a~~~~~ 447 (517) T protein:vir:10 370 --RDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISSILTSKNVSPTILTGIEALGRMAELDKLGTFNGYVSMTAQWPEP 447 (517) T ss_pred --HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhhhcCCCCccceeeccHHHHHHHHHHHHHHHHHHHHHHhhcCChH Confidence 34445555666655442 111111 1111112223333322211 011111222221111 011 Q ss_pred ----CcHHH----HHHhCCCC------hhHHHHHHHHHHHHHHH-H---Hhhhhh------hhcccccCC Q lcl|NC_021301. 411 ----SWASI----RRNILNYN------ADQIKQDDLDRAREQIT-L---FAGNSV------QRPQEDGSR 456 (456) Q Consensus 411 ----~s~~t----~~~~~~~~------~~~~~~~e~~~~~ee~~-~---~~~~~~------~~~~~d~~~ 456 (456) +.-.. ..+.+|+. ++|++++..++..+++. . .++... .+.+.+|.+ T Consensus 448 ~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 448 LQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred HHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 11111 12345553 34554433333222211 1 111111 122345555 No 263 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=94.70 E-value=0.0037 Score=33.90 Aligned_cols=426 Identities=12% Similarity=0.044 Sum_probs=175.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhh-hhccChHHHHHHHHHhhhc-----cC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQR-EARTNWGLMVRDSVADRII-----PN 74 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~-k~~~n~~~~iVd~~a~~l~-----~~ 74 (456) +.+-..++=...+...+ +-.+..+|-| +.. .+ ..++-+..+ .+.+.=+-.+|+..+.=.. 
.+ T Consensus 20 ~vpp~~~~~~~~i~~g~------~g~~v~~~g~-~~~----~n-~~eLI~~YR~ma~~pEVd~Av~eIVneaIv~d~~~~ 87 (564) T protein:vir:10 20 PVPPNDEASVSTVAGGY------FGTYVDTSGG-QNS----RN-EYELIRRYRDMSLHPEVDSAIDEIVNEFVVNDGDDK 87 (564) T ss_pred cccCCcCCChhhhhccc------cceeeecccc-cch----hh-HHHHHHHHHHHhhccchhhHHHHhhcceeEecCCCc Confidence 22222111111111000 0011111111 000 00 111111111 1223334444444444221 23 Q ss_pred CeecCCCC-------cccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC----CCCceEEEEEccceeEEEEe Q lcl|NC_021301. 75 GITVGGSA-------DSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR----DDGTATITADSPETMVVSVD 143 (456) Q Consensus 75 ~~~~~~~~-------d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d----~dg~~~i~~~~p~~~~~~~d 143 (456) ||.+.-+. .+....++..+.+--+|+....+..|.-.+.|+.|.+.-.| .+|-..++.+||+.+-.++. T Consensus 88 pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDPr~i~~vr~ 167 (564) T protein:vir:10 88 PVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDSLKIRKVRQ 167 (564) T ss_pred eEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcccceeeeee Confidence 45443211 01123445566666788888999999999999999987554 23556789999998887774 Q ss_pred CCCCc--eEEEEEEEEE--ecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeE--EEE Q lcl|NC_021301. 144 PLQPW--RIRSAMRWWR--DLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPP--VVV 217 (456) Q Consensus 144 ~~~~~--~~~~~~~~~~--~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--vv~ 217 (456) ..... .....++-+. ...+......+|.+... . . ..... .+...+. ......++. 
|++ T Consensus 168 i~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~-~--g------~~~~~---~~~~~~~----~~~~ikI~~daI~y 231 (564) T protein:vir:10 168 KLKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGF-A--G------NIPMV---TGSMDWS----NQEGIKIASDAIAQ 231 (564) T ss_pred eccccccccceeeeeeeeeccccccccceeeccccc-c--C------ccccc---ccccccc----cccceeechhhcce Confidence 32211 1111111110 01111122233322210 0 0 00000 0000000 000011111 222 Q ss_pred c-------cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Ccccccccccchhhhhhh------hhh Q lcl|NC_021301. 218 Y-------QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDENGNAIDYASI------FEA 283 (456) Q Consensus 218 ~-------~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~~~~~~~~~~~------~~~ 283 (456) . ++..=.|-+.+.+.-...+ +++-|...+-...-.|.+=+.-.+ +.++...+.-=..+.+.. ..+ T Consensus 232 ~hSGL~d~~~~~i~gyLhkAIKp~NQL-kmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa 310 (564) T protein:vir:10 232 STSGLMDLNKKMTLSFLHKAIKSLNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDG 310 (564) T ss_pred ecccceeCCCCceeccchhhhHhHHhh-HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEec Confidence 1 1111223444322222222 233344444344333332221111 112211110000000000 000 Q ss_pred hccc-------------eec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccc--cCc-HHHHHH Q lcl|NC_021301. 284 APGA-------------LWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS--ANQ-SAEGAH 343 (456) Q Consensus 284 ~~~~-------------~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~--~N~-Sg~Al~ 343 (456) ..|. .|. .+.+..+..|+...--+-++-++-+-..++...++|.+-+.... -|. 
-+..|- T Consensus 311 ~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EIt 390 (564) T protein:vir:10 311 QTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEIL 390 (564) T ss_pred cCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchh Confidence 0000 011 11223344455433222334466777788889999988775432 121 123455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc--c-----cceeEEecCCCCcCHHHHH-------HHHHHHHh-c Q lcl|NC_021301. 344 NIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV--E-----DTVDVSFESPDRVTLGEKY-------AAASLAKA-A 408 (456) Q Consensus 344 ~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~--~-----~~i~v~f~~~~~~~~~e~a-------d~~~kl~~-~ 408 (456) -....+..-+.+.+..|..-|.++++.-+.++|.-. + ..|.+.|.....-.+...+ +++..+.. . T Consensus 391 RDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyv 470 (564) T protein:vir:10 391 RDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFV 470 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhh Confidence 556667777889999999999999998877777521 1 3477788765443333332 33332211 1 Q ss_pred C-CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhh-------hhcccccC--------C Q lcl|NC_021301. 409 G-ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSV-------QRPQEDGS--------R 456 (456) Q Consensus 409 g-~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~-------~~~~~d~~--------~ 456 (456) | .+|.++++ .+|.+++++++++ .+.+++|...-.-... +.+.+.+. . 
T Consensus 471 Gky~S~dyi~k~ILr~tDeei~~~-~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~ 534 (564) T protein:vir:10 471 GKYFSTEYIRRKILMQTENEFKEI-DKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQ 534 (564) T ss_pred ccccchHHHHHHHhccCHHHHHHH-HHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhc Confidence 3 56888876 5789999888754 3334444332000000 00111100 0 No 264 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=94.50 E-value=0.0042 Score=33.59 Aligned_cols=297 Identities=8% Similarity=-0.091 Sum_probs=112.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc----CCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP----NGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~----~~~ 76 (456) |+-.+-.++++++- -+ ++-..+||+---+. ..+.++.+ + ..++.-++......+.+ +|. T Consensus 27 ~~~~~~~~~~~~~~-~~------~~~~~~~~epp~~~--------~~La~l~~-~-n~~h~~~i~~k~N~l~~~~~Pn~~ 89 (348) T protein:vir:26 27 EPVDTNSWMTRYCE-LF------YNDFDDYWEPPISL--------KGLAEIAN-A-NGYHGSLLKARANYVAGRFMNGGG 89 (348) T ss_pred eeecCcchHHHHHH-HH------hcCCCccccCCCCH--------HHHHHHHh-h-hhhhhhhHhhhhhHHhhcccCCCC Confidence 11111112222221 00 11123566543211 12222211 1 22333344444444432 221 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) .. . ..+.+++.+.+.+|.||+.+-++..|++ .+..++|..+.+.-|. . . 
T Consensus 90 ~t--------~-------------~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d~---~------~ 139 (348) T protein:vir:26 90 LP--------M-------------YKMNSACWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKNG---D------F 139 (348) T ss_pred CC--------H-------------HHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeeecC---c------E Confidence 10 0 1123456677889999999999888875 4777777665443221 0 1 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liD 235 (456) ++...+++. ..|.++.+.++.. ..+ ...-.|.|.+...+. T Consensus 140 ~~~~~~g~~---~~f~~~dIiHir~-------------------------~~~---------~~~~~Gls~~~~a~~--- 179 (348) T protein:vir:26 140 VQLLRNNEQ---KVFKAKDVIFIPQ-------------------------YDP---------QQQIYGLPDYLGSIQ--- 179 (348) T ss_pred EEEEecCeE---EEEcCccEEEEcC-------------------------CCC---------CCCcccccHHHHHHH--- Confidence 122222211 1123333322210 000 001236665543332 Q ss_pred HHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchhhhhhhhhhh-----ccceecc-----CCCceeEeecc Q lcl|NC_021301. 236 RINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAIDYASIFEAA-----PGALWEL-----PPGVDIWESQT 302 (456) Q Consensus 236 a~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~-----~~d~~~~~~~~ 302 (456) ++....+-..-...++ +.|-.++.-.+.. ..++.-..+ ...+... .+.++.. +.+.++..+.. T Consensus 180 si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~--ls~e~~~~l--k~~~~~~~G~~n~~~~~vl~~~g~~~Gi~~~pis~ 255 (348) T protein:vir:26 180 SSLLNRDATLFRRRYYLNGAHMGFIFYATDPN--LSEADEKAL--KEKIASSKGIGNFRSMFVNIPNGKEKGIQLIPVGD 255 (348) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEecCCC--CCHHHHHHH--HHHHHHhcCcccccceeEEcCCCCccceeEEEccC Confidence 2221111111111222 2233332211111 111111111 1111111 1223333 23445666543 Q ss_pred cch-HHHHHHHHHHHHHHHhhcCCChhhhcccccC------cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021301. 
303 NDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSAN------QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE 375 (456) Q Consensus 303 ~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N------~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 375 (456) .+. ..|++..+....+|+++-++|+..+|....| ....+..+....|.-.+ +.+..++.+ .. T Consensus 256 ~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~f~~~~l~P~~----~~ie~~ln~-------~l 324 (348) T protein:vir:26 256 IATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQVYDFYEVIPVC----KRFMDAVNN-------DP 324 (348) T ss_pred ChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHH----HHHHHHHhh-------hh Confidence 322 2388877778889999999999998743222 22222222222222112 222222211 11 Q ss_pred CCCcccceeEEecCCCCcCHHHHHHHH Q lcl|NC_021301. 376 GESVEDTVDVSFESPDRVTLGEKYAAA 402 (456) Q Consensus 376 ~~~~~~~i~v~f~~~~~~~~~e~ad~~ 402 (456) +......+++.|++..-++.++ ++ T Consensus 325 ~~~~~~~~~fdl~~~~e~~~~~---a~ 348 (348) T protein:vir:26 325 EIPDNLKLKFNLNPGVESANGS---AV 348 (348) T ss_pred CCCCccEEEEecCcccccchhh---cC Confidence 2233333444444332222121 11 No 265 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=94.18 E-value=0.0051 Score=33.13 Aligned_cols=294 Identities=13% Similarity=0.057 Sum_probs=111.1 Q ss_pred CCC-------------------------CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhh Q lcl|NC_021301. 1 MTA-------------------------STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA 55 (456) Q Consensus 1 ~~~-------------------------~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~ 55 (456) ||- .+|.-++. ...-++.+.-++.|+. 
+.|+-....+.++.+ + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~v~~--------~~~~~~~~~~~~~~~~---~~pp~~~~~la~~~~-a 68 (344) T protein:vir:56 1 MSKKKGKTPQPAAKTMTASAPKMEAFTFGEPVPVLD--------RRDILDYVECISNGRW---YEPPVSFTGLAKSLR-A 68 (344) T ss_pred CCCCCCCCCchhhHHhhcCCCceEEEEcCCceeecC--------cchhhhHHHhhhcCcc---ccCCCCHHHHHHHHh-h Confidence 221 11111110 1111334444555543 222222223333221 1 Q ss_pred ccChHHHHHHHHHhhh----ccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EE Q lcl|NC_021301. 56 RTNWGLMVRDSVADRI----IPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TI 130 (456) Q Consensus 56 ~~n~~~~iVd~~a~~l----~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i 130 (456) ..++.-++.....++ .++|... . ..+..++.+.+.+|.||+.+-++..|++ .+ T Consensus 69 -~~~h~s~i~~k~n~l~~~~~Pnp~~t--------~-------------~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L 126 (344) T protein:vir:56 69 -AVHHSSPIYVKRNILASTFIPHPWLS--------Q-------------QDFSRFVLDFLVFGNAFLEKRYSTTGKVIRL 126 (344) T ss_pred -hhhhCccceehhhhHHhhcCCCCCCC--------H-------------HHHHHHHHHHHhcCCeEEEEEECCCCcEEEE Confidence 122222333333333 2333210 0 1123456677889999999988888876 46 Q ss_pred EEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccC Q lcl|NC_021301. 131 TADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTG 210 (456) Q Consensus 131 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (456) ..++|..+.+..+.. ..++....+.. ..|.++.+.++.. ..+ T Consensus 127 ~pl~~~~v~~~~~~~--------~~~~~~~~g~~---~~~~~~dIiHir~-------------------------~~~-- 168 (344) T protein:vir:56 127 ETSPAKYTRRGVEED--------VYWWVPSFNEP---TAFAPGSVFHLLE-------------------------PDI-- 168 (344) T ss_pred EEeCCceeEEeecCC--------EEEEEecCCeE---EEEcCccEEEECC-------------------------CCC-- Confidence 777777665433221 01111111111 1122222222210 000 Q ss_pred ceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchhhhhhhhhh---- Q lcl|NC_021301. 
211 SPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAIDYASIFEA---- 283 (456) Q Consensus 211 ~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~---- 283 (456) ...-.|.|.+... +.++....+-..-...++ +.|-.++.-.+. ...++.-+.+ ...+.. T Consensus 169 -------~~~~~Gls~~~~a---~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~--~ls~e~~~~l--k~~~~~~~g~ 234 (344) T protein:vir:56 169 -------NQELYGLPEYLSA---LNSAWLNESATLFRRKYYENGAHAGYIMYVTDA--VQDRNDIEML--RENMVKSKGR 234 (344) T ss_pred -------CCCcccccHHHHH---HHHHHHHHHHHHHHHHHHhccCCCceEEEecCC--CCCHHHHHHH--HHHHHHhcCC Confidence 0012456655432 233322222111122333 223333321111 1111111111 111211 Q ss_pred hccce-e-cc----CCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcH------HHHHHHHHHHHH Q lcl|NC_021301. 284 APGAL-W-EL----PPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS------AEGAHNIEKGFL 350 (456) Q Consensus 284 ~~~~~-~-~~----~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~S------g~Al~~~~~~l~ 350 (456) +.++. + .. +++.++..+..... ..|++..+....+|+++-++|+..+|..-.|.+ ..++.+....|. T Consensus 235 ~~~r~l~l~~p~g~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL~ 314 (344) T protein:vir:56 235 NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELI 314 (344) T ss_pred CCccceEEecCCCCccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHH Confidence 11222 2 22 23456766653322 238888888899999999999999985322221 112222222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHH Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGE 397 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e 397 (456) =.+ ..++++... .+.. 
.+.|.+.......+ T Consensus 315 Pl~--------~~ie~~n~~----l~~~-----~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 315 PLQ--------DRIREINGW----IGQE-----VIRFKNYSLDTDNG 344 (344) T ss_pred HHH--------HHHHHHHhh----hccc-----cccCCCccccccCC Confidence 111 111222111 1111 13354443222221 No 266 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=93.94 E-value=0.0059 Score=32.81 Aligned_cols=419 Identities=10% Similarity=0.018 Sum_probs=164.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc------C Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP------N 74 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~------~ 74 (456) |... -....+.|..+......+++.+.+|.--..-..+ . ...+....++--+-+..+++.+++.|.+ . T Consensus 1 mk~~-a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~~~~~---~--~~~~~~~~~~~dstg~~a~~~Laa~l~~~ltpp~~ 74 (542) T protein:vir:78 1 MKGL-AQARYSAMRADREDFLDMARRCAALTLPYLLTED---G--HASGGRLQQPYQSLGSKGVNALSSKLMLSLFPIQT 74 (542) T ss_pred ChhH-HHHHHHHHHHHhhHHHHHHHHHHHHhccccCCCC---C--CcccccccccccchHHHHHHHHHHHHHHhhcCCCC Confidence 4321 1223444444444445666777777543211110 0 0001112234456677888888887754 2 Q ss_pred C-eecCCCC---------cccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEE Q lcl|NC_021301. 75 G-ITVGGSA---------DSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITAD 133 (456) Q Consensus 75 ~-~~~~~~~---------d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~ 133 (456) + |++.... +.+.. 
..+.+.+..++|.....++.++..++|.+.+++ ++++ ++.+ T Consensus 75 ~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~--~~~~---~~~~ 149 (542) T protein:vir:78 75 SFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLVFA--GKKT---LKVY 149 (542) T ss_pred ccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEe--cCCC---ceEE Confidence 3 3333211 11111 123456667889999999999999999996654 4442 4555 Q ss_pred ccceeEEEEeCCCCceEEEEEEEEEec-------CCc----------------eEE---EEEEcCCeEEEEEEeeeeccc Q lcl|NC_021301. 134 SPETMVVSVDPLQPWRIRSAMRWWRDL-------DAE----------------SDF---AIVWSGDGWQKFARPCFVQSS 187 (456) Q Consensus 134 ~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~----------------~~~---~~~~~~~~~~~~~~~~~~~~~ 187 (456) +-.+.++.-| ..++ +...++.++-. .+. ..+ ..++..+....+.... .. T Consensus 150 pl~~y~v~~d-~~G~-vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~~~---~~ 224 (542) T protein:vir:78 150 PLDRYVIERD-GDGN-VIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTCCK---LV 224 (542) T ss_pred ecceeEEeeC-CCCC-eEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccccc---cC Confidence 5555444444 3343 33344443211 010 000 0011111110000000 00 Q ss_pred ccceeec-cCCCceeecccccccC-ceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhc Q lcl|NC_021301. 188 SRRRLVT-RISDSWVPVGDAVVTG-SPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKS 260 (456) Q Consensus 188 ~~~~~~~-~~~~~~~~~~~~~~~~-~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g 260 (456) ...+.+. 
...+..+.......++ .+|.+++ | .+.+|+|-.+..++.+..+|...-......+....|...+- T Consensus 225 ~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~pp~lv~- 303 (542) T protein:vir:78 225 DGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAAAAKVVFMVS- 303 (542) T ss_pred CCeEEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeec- Confidence 0000000 0001100000001112 2232222 2 34689999999999999999888887778887777753331 Q ss_pred CCCcccccccccchhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcH Q lcl|NC_021301. 261 AGHGLPKVDENGNAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS 338 (456) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~S 338 (456) .+|. ........++.|.+....+ +....++. .+++..-.+.++.+...|....-+- . . .+...-+ T Consensus 304 ---------~~g~-~~~~~~~~~~~g~iv~g~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~~-~-~-~d~~rvT 370 (542) T protein:vir:78 304 ---------PSAT-TKPQSLARAGTGAIIQGRAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLIL-N-V-RQSERTT 370 (542) T ss_pred ---------cccc-cchhhcccCCCceeecCCccceeeeecccccchhHHHHHHHHHHHHHHHHhccc-c-c-CCccccc Confidence 0110 1111222344444432222 22233332 3344444444444444443322111 0 0 1112224 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHhcC--CCcccceeEEecCCCCcC-HHHHHHH-- Q lcl|NC_021301. 339 AEGAHNIEKGFLFKCEDRLSIAKIG------------LEAILVKALQIEG--ESVEDTVDVSFESPDRVT-LGEKYAA-- 401 (456) Q Consensus 339 g~Al~~~~~~l~~k~~~~~~~f~~~------------l~~~~~l~~~~~~--~~~~~~i~v~f~~~~~~~-~~e~ad~-- 401 (456) |.-++.. .+++...++.. +++.+.++.+..- .....-+++.+..++..- ....++. 
T Consensus 371 AtEV~~r-------~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r~~~~~~l~ 443 (542) T protein:vir:78 371 ATEVREV-------QMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGRGEDRAALI 443 (542) T ss_pred HHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHHHHHHHHHH Confidence 4433332 33344444443 4444444433211 122233566665554210 0111111 Q ss_pred --HHHHHhc-CC------CcHHHH----HHhCCCC-------hhHHHHHHHHHHHHHH-HHH-hhhhhhhcccccCC Q lcl|NC_021301. 402 --ASLAKAA-GE------SWASIR----RNILNYN-------ADQIKQDDLDRAREQI-TLF-AGNSVQRPQEDGSR 456 (456) Q Consensus 402 --~~kl~~~-g~------~s~~t~----~~~~~~~-------~~~~~~~e~~~~~ee~-~~~-~~~~~~~~~~d~~~ 456 (456) +..+.+. |- +.-..+ ...+|++ +++++++..+..+++. ... .+...-++..-|++ T Consensus 444 ~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~~~~~~ 520 (542) T protein:vir:78 444 EFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKSPIGEK 520 (542) T ss_pred HHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 1111111 11 111111 2234554 2333322222111111 111 11111111111111 No 267 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=93.51 E-value=0.0073 Score=32.29 Aligned_cols=399 Identities=15% Similarity=0.073 Sum_probs=170.7 Q ss_pred CCC------CCHHHHHHHHHHHHHHH-------------------HHHHHHHHHHhcccCcccccCcccchhhhhhhhhh Q lcl|NC_021301. 1 MTA------STPAEWLPVLTKRIDDG-------------------MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA 55 (456) Q Consensus 1 ~~~------~t~~~~~~~l~~~~~~~-------------------~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~ 55 (456) |.+ .++...-+.|..++... ..-.-....||-|.. . + ..++-...+.. 
T Consensus 1 ~~~~~~w~~~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~--~----n-~~eLI~~YR~m 73 (533) T protein:vir:58 1 MPSLEKYKKLNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIE--F----N-RFFLYDMYDRM 73 (533) T ss_pred CCCcchhhhhhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhcccc--c----c-HHHHHHHHHHh Confidence 211 11222222222222110 001111233454421 1 1 11222222223 Q ss_pred c--cChHHHHHHHHHhhhc-----cCCeecCCCCcccHHHHHH-HHHHhcChhHHHHHHHHHHhhCCeEEEEEeeC--CC Q lcl|NC_021301. 56 R--TNWGLMVRDSVADRII-----PNGITVGGSADSDLALRAR-RIWRDNRMDSVCKQWVKYGLDFGESYLTCWRR--DD 125 (456) Q Consensus 56 ~--~n~~~~iVd~~a~~l~-----~~~~~~~~~~d~~~~~~l~-~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d--~d 125 (456) . +.=+..+|+..+.-.+ .+||.+.-+ +.+..+.+. .+..-.+|+....+..|.-.+.|+.|.+.-.+ .+ T Consensus 74 a~~~pEVd~AideIvneaiv~d~~~~pV~v~l~-~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~ 152 (533) T protein:vir:58 74 DYTDPLISTVLDIIADECTIPNENGNIVDVVTK-DIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGSDG 152 (533) T ss_pred hccCcchhhHHHhhhceeeEecCCCceeEeecc-cccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCccc Confidence 2 3455666666665443 345655432 222333332 34455678999999999999999999987542 23 Q ss_pred CceEEEEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeeccc Q lcl|NC_021301. 126 GTATITADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGD 205 (456) Q Consensus 126 g~~~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (456) |-..++.+||+.+-.+++..+.. ...+|.+...... ... ....+. .+. ..- T Consensus 153 GI~elr~lDPr~i~~vr~~~t~~-----------------eyyvy~~~~~~~~------s~~-~~~kI~--~da---I~y 203 (533) T protein:vir:58 153 TIEKFQVVSPYIFSKRYNPETDT-----------------WYYVITDVYRNVV------SGY-FNEDIP--EED---VIH 203 (533) T ss_pred chhhheecCCeeeEEEEeeccce-----------------EEEeecccccccc------cCc-cccccc--hhh---eee Confidence 44478999999988887654331 1223333211100 000 000000 000 000 Q ss_pred ccccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCC-Cccccccc-------------- Q lcl|NC_021301. 
206 AVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAG-HGLPKVDE-------------- 270 (456) Q Consensus 206 ~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~~~-------------- 270 (456) .|+++ ...+.+.+.|-+.+.+.-...+ +++.|...+-...-.|.+=+.=.+ +.++...+ T Consensus 204 ~~SGl-----~d~~~~~iisyLhkAiKp~NQL-kmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNk 277 (533) T protein:vir:58 204 FSHKI-----DTNFFPYGRSYLESARAIWNQL-RLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRD 277 (533) T ss_pred eeecc-----ccCCCCceehhhhHHHHHHHHH-HHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccc Confidence 01110 0112334556666543332222 233344444333333332221111 11111111 Q ss_pred ------ccchh---hhhhhhhhhccceec----cCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccccc-C Q lcl|NC_021301. 271 ------NGNAI---DYASIFEAAPGALWE----LPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-N 336 (456) Q Consensus 271 ------~~~~~---~~~~~~~~~~~~~~~----~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~-N 336 (456) +|... ..+.....-.. .|. .+.+..+..|+..++ +-++-++-+-..++...++|.+-++...+ + T Consensus 278 lvYDa~TGev~ddrk~m~~~sMlED-yWLpRReGgrgTEI~TLpGg~l-gemeDV~YF~kkLy~ALnVP~sRl~~e~~fg 355 (533) T protein:vir:58 278 YWVRNNQNQFLGIDNYFSIESILKD-YFIPRRGDRRAVEIDILQGSKV-DLAEDVEYMLNRLISALKVPKAFIGYEGDVN 355 (533) T ss_pred eEEeccCCeEeeccchhhhhhhHhh-hcccccCCCccceeeecCCCCC-CcHHHHHHHHHHHHHHhCCCeeecCCCCCCc Confidence 11110 00000000000 011 112233444554443 33455677778889999999888764432 2 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHHHH-------HHHHHHHhcC Q lcl|NC_021301. 
337 QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGEKY-------AAASLAKAAG 409 (456) Q Consensus 337 ~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e~a-------d~~~kl~~~g 409 (456) .|+ .|--....+..-+.+.+..|..-|++- +.++|+.....+++.|.....-.+...+ +++..+ .+ T Consensus 356 r~~-eItRDEiKF~KFI~rLR~rF~~ll~~q----Lilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~--dp 428 (533) T protein:vir:58 356 AKN-TLATQDIKFNNTIKRIQGFFVEELERM----VRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERL--KG 428 (533) T ss_pred cch-hhhHHHHHHHHHHHHHHHHHHHHHhcc----cccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHh--cc Confidence 222 333344445556667777777666553 2345654445567777665433333333 222222 15 Q ss_pred CCcHHHHH-HhCCCChhHHHHHHHHHHHHHHHHHhhhhhhhccc---------ccCC Q lcl|NC_021301. 410 ESWASIRR-NILNYNADQIKQDDLDRAREQITLFAGNSVQRPQE---------DGSR 456 (456) Q Consensus 410 ~~s~~t~~-~~~~~~~~~~~~~e~~~~~ee~~~~~~~~~~~~~~---------d~~~ 456 (456) .++..+++ .+|.+++++.++ .+.+++|... ...+++.+ +|+. T Consensus 429 yvgk~yi~k~ILr~tdei~~q--~e~ie~E~~~---~~~~~~~~~~e~~~~~~~~~~ 480 (533) T protein:vir:58 429 WVREDWIYSNILQIPYDLKPQ--EEVAEAAGGG---GLFDTGGFGEETTPADFLGER 480 (533) T ss_pred hhhHHHHHHHHhcCChhhhHH--HHHHHHhhcC---CCCCCCCcccccCCcccCccc Confidence 67777665 578898754443 2345554322 11112211 1222 No 268 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=93.09 E-value=0.0088 Score=31.84 Aligned_cols=301 Identities=13% Similarity=0.028 Sum_probs=118.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) .+-.+|.-++. ...-++.++-+|.|+. +.|+-....+.+..+ + ..++.-++...+..+.+. 
|+- T Consensus 56 f~fg~p~~v~~--------~~~~~~~~~~~~~~~~---~~pp~~~~~La~~~~-~-~~~h~s~l~~k~n~l~~~-~~P-- 119 (376) T protein:vir:10 56 FTFDDPTPVMN--------RAEILDYVECWSNGEW---FEPPVSFAGLAKSFR-A-STHHSSALFFKANVLAST-FRP-- 119 (376) T ss_pred EEcCCceeccC--------cchhhhhhhhhhcCce---ecCCCCHHHHHHHHh-h-hHHhhhhHHHHhHHHHhc-cCC-- Confidence 22222221111 0111334455566642 122222223333221 1 235555566555555331 110 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) ...-. . ..+.+++.+.+.+|.||+.+-++.+|.+ .+..++|..+.+..|... .++.. T Consensus 120 np~lT-~-------------~~f~~~v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~~--------~~~~~ 177 (376) T protein:vir:10 120 HRWLS-R-------------HAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFNG--------FVYVN 177 (376) T ss_pred CCCCC-H-------------HHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceEEEeeCCe--------EEEEE Confidence 00000 0 1134556677889999999999988876 588888888776655321 11111 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~ 239 (456) ..+. ...|..+.+.++.. ... ...-.|.|.+...+. ++.. T Consensus 178 ~~~~---~~~~~~~eViHir~-------------------------~~~---------~~~~yGls~~~~a~~---si~l 217 (376) T protein:vir:10 178 GWQE---RHEFEPDSVFQLVR-------------------------PDI---------NQEVYGLPEYLSSLH---SAWL 217 (376) T ss_pred cCCe---EEEEccccEEEecC-------------------------CCC---------CCCcccccHHHHHHH---HHHH Confidence 1111 11233333322210 000 001235555554333 2222 Q ss_pred HHHHHHHHHHHhh---chhhhhhcCCCcccccccccchh-hhhhhhh--hhccceeccC-----CCceeEeecc--cchH Q lcl|NC_021301. 
240 AELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWELP-----PGVDIWESQT--NDFT 306 (456) Q Consensus 240 ~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~-----~d~~~~~~~~--~~~~ 306 (456) ..+-..-...++. .|-.++.-.+. ...++.-..+ ....... ...+.++... .+.++..+.. .+. T Consensus 218 ~~aa~~f~~~~f~NGa~pggIl~~~d~--~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~~~Gi~~~pls~~~~d~- 294 (376) T protein:vir:10 218 NESSTLFRRKYYENGSHAGFILYMTDA--AQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKD- 294 (376) T ss_pred HHHHHHHHHHHHhccCCCceEEEecCC--CCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEccCCHHHH- Confidence 1221111122322 23222221111 0111111111 1111111 1122344432 3446666643 233 Q ss_pred HHHHHHHHHHHHHHhhcCCChhhhcccccCc------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc Q lcl|NC_021301. 307 PMLSAIKEHIRQLSSATKTPLPMLMPDSANQ------SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVE 380 (456) Q Consensus 307 ~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~------Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~ 380 (456) .|++..+....+|+.+-++|+..+|..-.|. ......+....|.=.++.. +++.. ..+.. T Consensus 295 qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~~~L~Pl~~~i--------eeln~----~L~~~-- 360 (376) T protein:vir:10 295 EFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQARF--------AELND----WLGEE-- 360 (376) T ss_pred HHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHH--------HHHHh----hcccc-- Confidence 3888888889999999999999987532222 1112222222222222111 11111 11111 Q ss_pred cceeEEecCCCCcCHHHHH Q lcl|NC_021301. 
381 DTVDVSFESPDRVTLGEKY 399 (456) Q Consensus 381 ~~i~v~f~~~~~~~~~e~a 399 (456) -+.|++.....-.+.+ T Consensus 361 ---~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 361 ---VVRFDDYEIPPAPVAA 376 (376) T ss_pred ---ccccChhHhhcccccC Confidence 1456554211111111 No 269 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=91.77 E-value=0.014 Score=30.70 Aligned_cols=302 Identities=12% Similarity=0.028 Sum_probs=119.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) .|-.+|.-++. ...-++..+-+|.|+. +.|+-....+.+..+ + ..++.-++...+..+.+. |+- T Consensus 31 ~~~~~p~~v~~--------~~~~~~~~~~~~~~~~---~~pp~~~~~la~~~~-~-~~~h~~~l~~k~n~l~~~-~~P-- 94 (351) T protein:vir:78 31 FTFDDPTPVMN--------RAEILDYVECWSNGEW---FEPPVSFAGLAKSFR-A-STHHSSALFFKANVLAST-FRP-- 94 (351) T ss_pred EEcCCceeecC--------cchhhhhhhhhccCce---ecCCCCHHHHHHHHh-h-hHhhhhhhhhhhhHHhhc-ccC-- Confidence 11112221100 1112445566777753 223222333333321 1 335555666666665432 110 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) ...-. . ..+.+++.+.+.+|.||+.+-++..|++ .+..++|..+.+..+... .++.. 
T Consensus 95 n~~~t-~-------------~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~~--------~~~~~ 152 (351) T protein:vir:78 95 HRWLS-R-------------HAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSG--------FVYVN 152 (351) T ss_pred CCCCC-H-------------HHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCCe--------EEEEe Confidence 00000 0 1134466778899999999999988875 477777777665443220 11111 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~ 239 (456) .++.. ..|.++.+.++.. ..+ ...-.|.|.+...+.-+..-+. T Consensus 153 ~~~~~---~~~~~~eVihir~-------------------------~~~---------~~~~yGl~~~~~a~~si~l~~~ 195 (351) T protein:vir:78 153 GWQER---HEFAPDSVFQLVR-------------------------PDI---------NQEVYGLPEYLSSLHSAWLNES 195 (351) T ss_pred cCCeE---EEEccccEEEEcC-------------------------CCC---------CCCcccccHHHHHHHHHHHHHH Confidence 11111 1222233222210 000 0112466665543332222111 Q ss_pred HHHHHHHHHHHh---hchhhhhhcCCCcccccccccchh-hhhhhhh--hhccceeccC-----CCceeEeecccch-HH Q lcl|NC_021301. 240 AELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWELP-----PGVDIWESQTNDF-TP 307 (456) Q Consensus 240 ~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~~-----~d~~~~~~~~~~~-~~ 307 (456) +.. -...++ +.|-.++.-.+.. ..++.-..+ ....... ...+.++... .+.++..+..... .. T Consensus 196 a~~---~~~~~f~NGa~pggIl~~~~~~--ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~~~g~k~~pls~~~~d~q 270 (351) T protein:vir:78 196 STL---FRRKYYENGSHAGFILYMTDAA--QKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDE 270 (351) T ss_pred HHH---HHHHHHhccCCCceEEEecCCC--CCHHHHHHHHHHHHHhcCcccccceeeecCCCCccceeEEEcCCChhHHH Confidence 111 111222 2232222211110 111111111 1111111 1122344332 3345666543221 23 Q ss_pred HHHHHHHHHHHHHhhcCCChhhhcccccCc------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_021301. 
308 MLSAIKEHIRQLSSATKTPLPMLMPDSANQ------SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVED 381 (456) Q Consensus 308 ~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~------Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~ 381 (456) |++..+....+|+++-++|+..+|....|. ...++.+....+.=.++. | +++.. ..+.. T Consensus 271 f~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~P~~~~----i----ee~n~----~l~~~--- 335 (351) T protein:vir:78 271 FFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQAR----F----AELND----WLGDE--- 335 (351) T ss_pred HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHH----H----HHHHh----hcCcc--- Confidence 888778888999999999999987532222 111222222222111111 1 11111 11211 Q ss_pred ceeEEecCCCCcCHHHHH Q lcl|NC_021301. 382 TVDVSFESPDRVTLGEKY 399 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~a 399 (456) -+.|++.....-.+.+ T Consensus 336 --~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 336 --VVRFDDYEIPPAPVAA 351 (351) T ss_pred --ceecChhhhccccccC Confidence 1456654322211111 No 270 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=91.57 E-value=0.015 Score=30.55 Aligned_cols=293 Identities=11% Similarity=0.068 Sum_probs=107.5 Q ss_pred CCC-----------CC-----------HHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccC Q lcl|NC_021301. 1 MTA-----------ST-----------PAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTN 58 (456) Q Consensus 1 ~~~-----------~t-----------~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n 58 (456) |+- .+ |.-++. ...-++.+.-++.|+. ..|+-....+..+.+ + .. 
T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~--------~~~~~~~~~~~~~~~~---~~pp~~~~~la~l~~-a-~~ 67 (340) T protein:vir:98 1 MSKRKPRKAVAMTASAPQKMEAFTFGEPVPVLD--------KRDILDYVECISNGKW---YEPPVSFSGLAKSLR-S-AV 67 (340) T ss_pred CCCCCCCccccccccCccceeEEEcCCceeecC--------cchhhhhhhhhhcCce---ecCCCCHHHHHHHHH-h-cc Confidence 221 11 111110 0011223333444432 112222223333221 1 22 Q ss_pred hHHHHHHHHHhhhc----cCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEE Q lcl|NC_021301. 59 WGLMVRDSVADRII----PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITAD 133 (456) Q Consensus 59 ~~~~iVd~~a~~l~----~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~ 133 (456) +..-++...+..+. ++|... . ..+..++.+.+.+|.||+.+-.+..|++ .+..+ T Consensus 68 ~h~s~i~~k~n~l~~~~~Pn~~lt--------~-------------~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl 126 (340) T protein:vir:98 68 HHSSPIYVKRNVLASTYIPHPLLS--------R-------------QDFSRFALDYLVFGNAFLEQRHSVTGQLIKLLTS 126 (340) T ss_pred ccchhhhhhhhHHhhccCCCCCCC--------H-------------HHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEe Confidence 33334444444432 222210 0 1123456677889999999988888875 46666 Q ss_pred ccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCcee Q lcl|NC_021301. 134 SPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPP 213 (456) Q Consensus 134 ~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (456) +|..+....+.. ..|+...++.. ..|.++.+.++.. ..+. T Consensus 127 ~~~~vr~~~~~~--------~~~~~~~~~~~---~~~~~~eViHir~-------------------------~~~~---- 166 (340) T protein:vir:98 127 PAKYTRRGVDDS--------VFWFVENFTQP---HEFAPDTVFHLLE-------------------------PDIN---- 166 (340) T ss_pred CCceEEEcccCc--------EEEEEecCCeE---EEEccccEEEEcC-------------------------CCCC---- Confidence 666654432211 11111122211 1123333322210 0000 Q ss_pred EEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhhhhhhhhhh-----c Q lcl|NC_021301. 
214 PVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAIDYASIFEAA-----P 285 (456) Q Consensus 214 pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-----~ 285 (456) ..-.|.|.+...+. ++....+-..-...++. .|-.++.-.+.. ..++.-..+ ...++.. . T Consensus 167 -----~~~~Gls~~~~a~~---si~l~~aa~~~~~~~f~NGa~pg~il~~~~~~--ls~e~~~~l--k~~~~~~~G~~n~ 234 (340) T protein:vir:98 167 -----QEIYGLPEYLSALN---SAWLNESATLFRRKYYQNGAHAGYIMYVTDPA--QSATDVESL--RDAMRNSKGLGNF 234 (340) T ss_pred -----CCcccccHHHHHHH---HHHHHHHHHHHHHHHHhccCCCceEEEecCCC--CCHHHHHHH--HHHHHHhcCcccc Confidence 01135555543322 22211111111112222 232222211110 111111111 1111111 1 Q ss_pred cceeccC-----CCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCc------HHHHHHHHHHHHHHHH Q lcl|NC_021301. 286 GALWELP-----PGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQ------SAEGAHNIEKGFLFKC 353 (456) Q Consensus 286 ~~~~~~~-----~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~------Sg~Al~~~~~~l~~k~ 353 (456) +.++... .+.++..+..... ..|++..+....+|+++-++|+..+|..-.|. ...+..+....|.=.+ T Consensus 235 ~~~~vl~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~ 314 (340) T protein:vir:98 235 KNLFFYSPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVFVRNELSPLQ 314 (340) T ss_pred CceeEecCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHHHHH Confidence 2344432 3445555543221 23888888889999999999999987532222 1112222222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHH Q lcl|NC_021301. 354 EDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLG 396 (456) Q Consensus 354 ~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~ 396 (456) ++ |+++. ...+.. -+.|++....+.. 
T Consensus 315 ~~--------iee~n----~~L~~e-----~~rF~~~~l~~~d 340 (340) T protein:vir:98 315 DR--------FREVN----DWLGME-----VIRFKEYTLDNPE 340 (340) T ss_pred HH--------HHHHH----hccccc-----ccccCccccccCC Confidence 11 11111 111111 1345443221111 No 271 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=89.69 E-value=0.025 Score=29.37 Aligned_cols=294 Identities=14% Similarity=0.084 Sum_probs=108.6 Q ss_pred CCC-------------------------CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhh Q lcl|NC_021301. 1 MTA-------------------------STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA 55 (456) Q Consensus 1 ~~~-------------------------~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~ 55 (456) |+- .+|.-++. ...-++.+.-++.|+. +.|+-....+.++.+ + T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~--------~~~~~~~~~~~~~~~~---~~pp~~~~~la~~~~-a 68 (344) T protein:vir:20 1 MSKKKGKTPQPAAKTMTASGPKMEAFTFGEPVPVLD--------RRDILDYVECISNGRW---YEPPVSFTGLAKSLR-A 68 (344) T ss_pred CCcccCCCCcchhhhhhccCCceEEEEcCCceEecC--------cchhhhhhhhhhcCce---ecCCCCHHHHHHHHh-h Confidence 221 11111100 0011233344445543 222222223333211 1 Q ss_pred ccChHHHHHHHHHhhh----ccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EE Q lcl|NC_021301. 56 RTNWGLMVRDSVADRI----IPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TI 130 (456) Q Consensus 56 ~~n~~~~iVd~~a~~l----~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i 130 (456) ..++.-++.....++ .++|... . 
..+..++.+.+.+|.||+.+-++..|++ .+ T Consensus 69 -~~~h~~~i~~k~n~l~~~~~Pn~~lt--------~-------------~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L 126 (344) T protein:vir:20 69 -AVHHSSPIYVKRNILASTFIPHPWLS--------Q-------------QDFSRFVLDFLVFGNAFLEKRYSTTGKVIRL 126 (344) T ss_pred -hhhhCccceehhhhHHHhccCCCCCC--------H-------------HHHHHHHHHHHhcCCeEEEEEECCCCcEEEE Confidence 122222333223322 2232210 0 1123456677889999999988888875 46 Q ss_pred EEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccC Q lcl|NC_021301. 131 TADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTG 210 (456) Q Consensus 131 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (456) ..++|..+.+..+.. . .++...++.. ..|..+.+.++.. ..+. T Consensus 127 ~pl~~~~vr~~~~~~---~-----~~~~~~~~~~---~~~~~~eIiHir~-------------------------~~~~- 169 (344) T protein:vir:20 127 ETSPAKYTRRGVEED---V-----YWWVPSFNEP---TAFAPGSVFHLLE-------------------------PDIN- 169 (344) T ss_pred EEcCCceeEeeecCC---E-----EEEEccCCeE---EEEcCccEEEeCC-------------------------CCCC- Confidence 666666654432211 0 1111111110 1122222221100 0000 Q ss_pred ceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchhhhhhhhhhh--- Q lcl|NC_021301. 211 SPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAIDYASIFEAA--- 284 (456) Q Consensus 211 ~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~--- 284 (456) ..-.|.|.+.. .++++....+-..-...++ +.|-.++.-.+.. ..++.-..+ ...++.. 
T Consensus 170 --------~~~yGls~~~~---a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d~~--l~~e~~~~i--k~~~~~~~g~ 234 (344) T protein:vir:20 170 --------QELYGLPEYLS---ALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAV--QDRNDIEML--RENMVKSKGR 234 (344) T ss_pred --------CCcccccHHHH---HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcC--CCHHHHHHH--HHHHHHhcCC Confidence 01135565543 3333332222111222333 2233333211111 111111111 1112111 Q ss_pred -ccc-eecc-----CCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcH------HHHHHHHHHHHH Q lcl|NC_021301. 285 -PGA-LWEL-----PPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS------AEGAHNIEKGFL 350 (456) Q Consensus 285 -~~~-~~~~-----~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~S------g~Al~~~~~~l~ 350 (456) .++ ++.. +++.++..+..... ..|++..+....+|+++-++|+..+|..-.|.+ ..++.+....|. T Consensus 235 ~n~r~l~l~~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~ 314 (344) T protein:vir:20 235 NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELI 314 (344) T ss_pred CCccceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHH Confidence 111 2222 23456666653322 238888888999999999999999974322221 112222222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHH Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGE 397 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e 397 (456) =.+++ | +++ ..+.|... 
+.|.++......| T Consensus 315 P~~~~----~----e~i----n~~lg~~~-----i~F~~~~l~~~d~ 344 (344) T protein:vir:20 315 PLQDR----I----REI----NGWLGQEV-----IRFKNYSLDTDND 344 (344) T ss_pred HHHHH----H----HHH----HHhcCCcc-----cccCccccccCCC Confidence 11111 1 111 11223211 3455443222222 No 272 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=89.64 E-value=0.025 Score=29.35 Aligned_cols=292 Identities=14% Similarity=0.032 Sum_probs=111.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc----CCe Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP----NGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~----~~~ 76 (456) .|-.+|.-++. ...-++.++-||.|+. ..|+=.+..+.+..+ + ..++.-++......+.. +|. T Consensus 34 ~~~~~p~~v~~--------~~~~~~y~~~~~~~~~---~~pp~~~~~la~~~~-~-~~~h~~~l~~k~n~l~~~~~Pn~~ 100 (350) T protein:vir:11 34 FTFGDPMPVLD--------GRGILDYLECWPNGRW---YEPPLSMEGLAKSVG-S-SVYLQSGLKFKRNMLAKTFIPHRL 100 (350) T ss_pred EEeCCceeecC--------cchhhHHHHHhhcCcc---ccCCCCHHHHHHHHh-h-hhhhccchhhhhhhhhhcccCCCC Confidence 11112211100 0111344455666653 112212222322211 1 12333334433343322 221 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) . . ...+.+++.+.+.+|.||+.+-++..|++ .+..++|..+.+..+.. . . 
T Consensus 101 ~-------t--------------~~~f~~~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~----~----~ 151 (350) T protein:vir:11 101 L-------S--------------RATFEQFSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLE----T----F 151 (350) T ss_pred C-------C--------------HHHHHHHHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCC----e----E Confidence 1 0 01123456677889999999999988875 47777777765433211 0 1 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liD 235 (456) ++...++.. ..|.++.+.++.. ..+ ...-.|.|.+...+. T Consensus 152 ~~~~~~~~~---~~~~~~eVihir~-------------------------~~~---------~~~~yGls~~~~a~~--- 191 (350) T protein:vir:11 152 YQVRSWKDE---HEFEKGSVIQLRE-------------------------ADI---------NQEIYGVPEWFCALQ--- 191 (350) T ss_pred EEEeeCCeE---EEECcccEEEeCC-------------------------CCC---------CCCcccccHHHHHHH--- Confidence 111111111 1122222222210 000 001146665554333 Q ss_pred HHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhhhhhhhhhh-----ccceeccC-----CCceeEeecc Q lcl|NC_021301. 236 RINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAIDYASIFEAA-----PGALWELP-----PGVDIWESQT 302 (456) Q Consensus 236 a~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~-----~d~~~~~~~~ 302 (456) ++....+-..-...++. .|-.+++-.+.. ..++....+ ...+... .+.++... ++.++..+.. T Consensus 192 si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~--ls~e~~~~l--~~~~~~~~G~~N~~~~~v~~~~g~~~g~~~~pl~~ 267 (350) T protein:vir:11 192 SALLNESATLFRRKYYNNGSHAGFILYMTDAA--QNEEDIDAL--RTALKTAKGPGNFRNLFVYAPNGKKEGIQLIPVSE 267 (350) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEecCCC--CCHHHHHHH--HHHHHHhcCccccCceeeecCCCCccceEEEEcCC Confidence 22221111111112222 222222211110 111111111 1111111 22334332 2345555543 Q ss_pred cch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCc------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_021301. 
303 NDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQ------SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIE 375 (456) Q Consensus 303 ~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~------Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 375 (456) ... ..|++..+....+|+++-++|+..+|..-.|. ...+..+....|.=.+. . ++++.+. . T Consensus 268 ~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~L~P~~~----~----ie~ln~~----l 335 (350) T protein:vir:11 268 VAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWASLELAPMQT----R----LQQVNEM----I 335 (350) T ss_pred ChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHHHHHHHHH----H----HHHHHhh----c Confidence 322 23888888889999999999999998532222 12222222222222121 1 1222211 1 Q ss_pred CCCcccceeEEecCCCCcCH Q lcl|NC_021301. 376 GESVEDTVDVSFESPDRVTL 395 (456) Q Consensus 376 ~~~~~~~i~v~f~~~~~~~~ 395 (456) +.. .+.|.+.....+ T Consensus 336 ~~~-----~~~F~~~~~~~l 350 (350) T protein:vir:11 336 GEE-----VVRFAQFDAPGL 350 (350) T ss_pred Ccc-----ccccCcccccCC Confidence 211 123444333333 No 273 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=88.87 E-value=0.03 Score=28.96 Aligned_cols=302 Identities=12% Similarity=0.043 Sum_probs=118.3 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) .|-.+|.-++. ...-++..+-+|.|+. +.|+-....+.+..+ + ..++.-++...+..+.+. 
|+- T Consensus 31 ~~~~~p~~v~~--------~~~~~~~~~~~~~~~~---~~pp~~~~~la~~~~-~-~~~h~~~l~~k~n~l~~~-~~P-- 94 (351) T protein:vir:79 31 FTFDDPTPVMN--------RAEILDYVECWSNGEW---FEPPVSFAGLAKSFR-A-STHHSSALFFKANVLAST-FRP-- 94 (351) T ss_pred EEcCCceeecC--------cchhhhhhhhhhcCce---ecCCCCHHHHHHHHh-h-hHhhhhhhhhhhhHHhhc-ccC-- Confidence 11122221100 1112445566777753 223222333333321 1 335555666666665432 110 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) ...-. . ..+.+++.+.+.+|.||+.+-++..|++ .+..++|..+.+..+... .++.. T Consensus 95 np~~t-~-------------~~f~~~v~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~~--------~~~~~ 152 (351) T protein:vir:79 95 HRWLS-R-------------HAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFSG--------FVYVN 152 (351) T ss_pred CCCCC-H-------------HHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceeeeecCCe--------EEEEe Confidence 00000 0 1133456677889999999999888875 477788877655433220 11111 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~ 239 (456) .++.. ..|.++.+.++.. ..+. ..-.|.|.+...+ +++.. T Consensus 153 ~~g~~---~~~~~~eIihir~-------------------------~~~~---------~~~yGl~~~~~a~---~si~l 192 (351) T protein:vir:79 153 GWQER---HEFEPDSVFQLVR-------------------------PDIN---------QEVYGLPEYLSSL---HSAWL 192 (351) T ss_pred cCceE---EEEcCccEEEeCC-------------------------CCCC---------CCcccccHHHHHH---HHHHH Confidence 11111 1223333322210 0000 0113555554332 22221 Q ss_pred HHHHHHHHHHHh---hchhhhhhcCCCcccccccccchh-hhhhhhh--hhccceecc-----CCCceeEeecccch-HH Q lcl|NC_021301. 
240 AELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWEL-----PPGVDIWESQTNDF-TP 307 (456) Q Consensus 240 ~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~-----~~d~~~~~~~~~~~-~~ 307 (456) ..+-..-...++ +.|-.++.-.+.. ..++.-..+ ....... ...+.++.. +.+.++..+..... .. T Consensus 193 ~~~a~~~~~~~f~NGa~pg~il~~~~~~--ls~e~~~~lk~~~~~~~G~~N~~~~~v~~~~g~~~gi~~~pl~~~~~d~e 270 (351) T protein:vir:79 193 NESSTLFRRKYYENGSHAGFILYMTDAA--QKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGKKDGIQLIPVSEVAAKDE 270 (351) T ss_pred HHHHHHHHHHHHhccCCCceEEEecCCC--CCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceEEEEcCCChhHHH Confidence 111111112222 2232222211111 111111111 1111111 111233333 23345655543221 23 Q ss_pred HHHHHHHHHHHHHhhcCCChhhhcccccCcH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_021301. 308 MLSAIKEHIRQLSSATKTPLPMLMPDSANQS------AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVED 381 (456) Q Consensus 308 ~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~S------g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~ 381 (456) |++..+....+|+++-++|+..+|..-.|.+ ..++.+....|.=.++. ++++ ..+.|.. T Consensus 271 f~e~k~~s~~eI~~a~~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~l~Pl~~~--------ie~l----n~~lg~~--- 335 (351) T protein:vir:79 271 FFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRNEIRPLQAR--------FAEL----NDWLGDE--- 335 (351) T ss_pred HHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHH--------HHHH----HhhcCcc--- Confidence 8888888899999999999999975322221 11222222222111111 1111 1112221 Q ss_pred ceeEEecCCCCcCHHHHH Q lcl|NC_021301. 
382 TVDVSFESPDRVTLGEKY 399 (456) Q Consensus 382 ~i~v~f~~~~~~~~~e~a 399 (456) -+.|++.....-...+ T Consensus 336 --~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 336 --VVTFDDYEIPPAPVAA 351 (351) T ss_pred --eeeeChhhhccccccC Confidence 1566654211111111 No 274 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=88.66 E-value=0.031 Score=28.86 Aligned_cols=419 Identities=12% Similarity=0.002 Sum_probs=151.0 Q ss_pred CCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc----- Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRID--DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP----- 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~--~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~----- 73 (456) |- ..+..+..++. ....+.+.+.+|..-..-.. ... .. +....++--+-+...++.+++.|.+ T Consensus 1 mk-----~~~~~~~~~lkR~~~e~~w~e~a~~tlP~~~~~---~~~-~~-~~~~~~~~dstg~~a~~~LAa~l~~~ltpp 70 (510) T protein:vir:63 1 MK-----TTAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVD---PMS-GS-RGVVEHDFQSAGALLVNNLAAKLARSLFPT 70 (510) T ss_pred Ch-----hHHHHHHHHHhccchHHHHHHHHHhhccccCCC---CCC-cc-ccccCCCccchHHHHHHHHHHHHHhhhcCC Confidence 21 11222222221 12234445555544321000 000 00 1111223345667788888877753 Q ss_pred -CC-eecCCCC--------ccc----HHH-------HHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEE Q lcl|NC_021301. 74 -NG-ITVGGSA--------DSD----LAL-------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITA 132 (456) Q Consensus 74 -~~-~~~~~~~--------d~~----~~~-------~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~ 132 (456) .| |++...+ +.. ... .+...+..++|.....++.++...+|.+.++ .++++. +++. 
T Consensus 71 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a~l~--~~~~~~-~~~~ 147 (510) T protein:vir:63 71 GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLY--RDSDAA-TVVA 147 (510) T ss_pred CCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEE--EcCCCc-EEEE Confidence 23 3332211 111 111 2445666788999999999999999998555 455553 4666 Q ss_pred EccceeEEEEeCCCCceEEEEEEEEEec-------CCceEEEEEE--c-CCeEEEEEEeeeeccccc-ceeecc-CCCce Q lcl|NC_021301. 133 DSPETMVVSVDPLQPWRIRSAMRWWRDL-------DAESDFAIVW--S-GDGWQKFARPCFVQSSSR-RRLVTR-ISDSW 200 (456) Q Consensus 133 ~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~ 200 (456) ++-.+.++.-| ..++... .++.+.-. .+........ . .+.+..|+.......... .+.+.. ..+.. T Consensus 148 ~pl~~y~v~~d-~~G~vd~-i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~~~~~~~sv~~e~dg~~ 225 (510) T protein:vir:63 148 WSLRSYAVRRD-ATGRWMD-IVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKGTAMEYAELYHEIDGVR 225 (510) T ss_pred EEcceeEEeeC-CCcCeeE-EEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecCCCceEEEEEEEecCce Confidence 65555444444 4343333 33332210 0000000000 0 011111111111111000 011100 11111 Q ss_pred eecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccch Q lcl|NC_021301. 201 VPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNA 274 (456) Q Consensus 201 ~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~ 274 (456) .. ......+. +|.+++ | .+.+|+|-.+..++-+..+|...-...........|...+ . ..|. 
T Consensus 226 ~~-~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv-~---------p~g~- 293 (510) T protein:vir:63 226 VG-KEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLV-D---------EAKG- 293 (510) T ss_pred ec-cccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc-C---------cccc- Confidence 11 11111122 232222 2 3468999999999988888877666655555555543221 1 1111 Q ss_pred hhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHH Q lcl|NC_021301. 275 IDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFK 352 (456) Q Consensus 275 ~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k 352 (456) .........+.|.+....+ +....++. .+++....+.++.+...|....=+. ....+..+.+|.-++.....+.+. T Consensus 294 ~~~~~~~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~--l~~~~~~rvTAtEV~~r~~E~~~~ 371 (510) T protein:vir:63 294 AVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAENT 371 (510) T ss_pred cchhhhccCCCceeecCCcccceeeecCcccchHHHHHHHHHHHHHHHHHHHhh--cccCCCCCcCHHHHHHHHHHHHHH Confidence 1111222333343332222 22333332 2334433333433333333321111 111122233554444332222221 Q ss_pred H----HHHH-HHHHHHHHHHHHHHHHhcCCCc--ccce---eEEecCCCCcCHHHHHHHHHHH------H-hcCCCc--- Q lcl|NC_021301. 353 C----EDRL-SIAKIGLEAILVKALQIEGESV--EDTV---DVSFESPDRVTLGEKYAAASLA------K-AAGESW--- 412 (456) Q Consensus 353 ~----~~~~-~~f~~~l~~~~~l~~~~~~~~~--~~~i---~v~f~~~~~~~~~e~ad~~~kl------~-~~g~~s--- 412 (456) . .+.+ ..+.+-+++.+.++.. .|... ...+ .+++. +....++-+..+ . ..|-+. 
T Consensus 372 LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~~~v~~i-----s~Laraq~~~~l~~~~q~l~~~~~~aq~~ 445 (510) T protein:vir:63 372 LGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGL-----PALSRSAAVQSMLNASQVIAGLAPIAQLD 445 (510) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCchhcccceecch-----hHHHHHHHHHHHHHHHHHHHHhcCchhhh Confidence 1 1111 2223333444444432 23211 1112 22332 322222222211 1 112111 Q ss_pred ----H----HHHHHhCCCCh-------hHHHHHHHHHHHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 413 ----A----SIRRNILNYNA-------DQIKQDDLDRAREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 413 ----~----~t~~~~~~~~~-------~~~~~~e~~~~~ee~~~~~~~~~~~~~~d~~~ 456 (456) - ......+|+++ ++++++..++.+++.. .........+-+++ T Consensus 446 ~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~--~~~~~~~~~~~a~~ 502 (510) T protein:vir:63 446 PRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQ--AQAAQETLLEGASD 502 (510) T ss_pred ccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHh Confidence 1 12234467644 3333322211111111 11111111222222 No 275 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=86.61 E-value=0.044 Score=28.00 Aligned_cols=296 Identities=13% Similarity=0.058 Sum_probs=106.5 Q ss_pred CC-------------------CCCHHHHHHHHHHHHHHHHHHHHHHHHHhc--ccCcccccCcccchhhhhhhhhhccCh Q lcl|NC_021301. 1 MT-------------------ASTPAEWLPVLTKRIDDGMSRVRLLARYSN--GDAPLPELTRNTSAAWRSFQREARTNW 59 (456) Q Consensus 1 ~~-------------------~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~--g~~~i~~~~~~~~~~~~~~~~k~~~n~ 59 (456) |+ -..|+-++. ...-++.+.-+|. |+. +.|+-....+.++.+ . 
..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~p~~~~~--------~~~~~~~~~~~~~~~~~~---~~pP~~~~~La~l~~-~-~~~ 67 (337) T protein:vir:78 1 MTKRQQQPAQAAASSPRPSVVFSMPEAIDP--------TAWMTDYTGVFYNPYGEY---YQPPIDRKGLAKVAR-A-NAH 67 (337) T ss_pred CCCcccCcccccccCceeEEEecCcccccC--------cchhHhhhhhhhccCcce---ecCCCCHHHHHHHhh-c-chh Confidence 21 111211110 0000122222233 322 111111222322221 1 234 Q ss_pred HHHHHHHHHhhhccCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEcccee Q lcl|NC_021301. 60 GLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETM 138 (456) Q Consensus 60 ~~~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~ 138 (456) +..++......+.. .|... ...+..++.+.+.+|.||+.+-++..|++ .+..++|..+ T Consensus 68 h~~~L~~k~N~~~~-~f~~~--------------------~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v 126 (337) T protein:vir:78 68 HGAILMARRNMVAG-RFTNQ--------------------RATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYL 126 (337) T ss_pred hhhHHHhhhccccc-cCcCc--------------------HHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCcee Confidence 44444444433221 11100 02344566778899999999999888875 4666776655 Q ss_pred EEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEc Q lcl|NC_021301. 139 VVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVY 218 (456) Q Consensus 139 ~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~ 218 (456) ...-|. . .+|...++.. ..|.++.+.+... ..+. T Consensus 127 ~~~~d~---~------~~~~~~~~~~---~~~~~~eIiHik~-------------------------~~~~--------- 160 (337) T protein:vir:78 127 RRREDG---C------FVYLQQGKPN---LIYRPDDVIWLAQ-------------------------YDPE--------- 160 (337) T ss_pred EeeeCC---e------EEEEEcCCce---EEECCccEEEECC-------------------------CCCC--------- Confidence 433221 0 1111111111 1122222222110 0000 Q ss_pred cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchh-hhhhhhh--hhccceecc- Q lcl|NC_021301. 
219 QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAI-DYASIFE--AAPGALWEL- 291 (456) Q Consensus 219 ~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~~- 291 (456) ..-.|.|.+... +.++....+-..-...++. .|-.++.-.+.. ..++....+ ....... ...+.++.. T Consensus 161 ~~~~Gls~~~~a---~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~--l~~e~~~~lk~~~~~~~G~~n~~~~~v~~ 235 (337) T protein:vir:78 161 QQVYGMPDYLGG---LQSALLNQDATLFRRRYFLNGAHMGFIFYATDPN--MDDDTEEEMKEMIANSKGVGNFRSMFVNI 235 (337) T ss_pred CCcccccHHHHH---HHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCC--CCHHHHHHHHHHHHHhcCcccccceEEEc Confidence 011355654433 3333322221111223332 233332211111 111111111 1111111 111233333 Q ss_pred ----CCCceeEeecccc-hHHHHHHHHHHHHHHHhhcCCChhhhcccccCc-HH--HH----HHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 292 ----PPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQ-SA--EG----AHNIEKGFLFKCEDRLSI 359 (456) Q Consensus 292 ----~~d~~~~~~~~~~-~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~-Sg--~A----l~~~~~~l~~k~~~~~~~ 359 (456) +.+.++..+.... -..|++..+....+|+++-++|+..+|....|. +| .+ +.+....|.=.+ +. T Consensus 236 ~~g~~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~~~f~~~~L~P~~----~~ 311 (337) T protein:vir:78 236 PDGKPDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYDATYARNEVLPLC----EL 311 (337) T ss_pred CCCCccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHHHHHHHHHHHHHH----HH Confidence 2334666654322 123778777888899999999999988533221 11 11 112212221111 11 Q ss_pred HHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCH Q lcl|NC_021301. 360 AKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTL 395 (456) Q Consensus 360 f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~ 395 (456) +++.+.. 
.+.....- ..|..+...-+ T Consensus 312 ----ie~~~n~----~ll~~~~~--~~f~~~~~~~~ 337 (337) T protein:vir:78 312 ----VQDAINS----AGLPRALW--VTFRETIGAAV 337 (337) T ss_pred ----HHHHHhh----hcCChhhc--eeccccccccC Confidence 2222211 11222111 22333221111 No 276 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=86.23 E-value=0.047 Score=27.86 Aligned_cols=423 Identities=12% Similarity=0.035 Sum_probs=156.9 Q ss_pred CCC-------CCHHHHHHHHHHH-------HHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHH Q lcl|NC_021301. 1 MTA-------STPAEWLPVLTKR-------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDS 66 (456) Q Consensus 1 ~~~-------~t~~~~~~~l~~~-------~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~ 66 (456) |+. +||+-+.++-.+. +.....|-+..++-|.+...-. .+.. ...|.+..|.-. . T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~~~------~~~~-~r~nl~~sni~~----i 69 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERDSA------HDAE-TRWNLFSTNIQT----Q 69 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhccccCC------Cccc-cccchhhhhHHH----H Confidence 766 4454332222222 2222333444555566543211 1111 122333333221 1 Q ss_pred HHhhhccCCee----cCCCCcccHHHHH----HHHH------HhcChhHHHHHHHHHHhhCCeEEEEEee---------- Q lcl|NC_021301. 67 VADRIIPNGIT----VGGSADSDLALRA----RRIW------RDNRMDSVCKQWVKYGLDFGESYLTCWR---------- 122 (456) Q Consensus 67 ~a~~l~~~~~~----~~~~~d~~~~~~l----~~~~------~~n~~~~~~~~~~~~a~~~G~a~~~v~~---------- 122 (456) .-+--..+|.. -..+.|....+.. 
.+.+ ++++|+..+....++++.||++-+.+.+ T Consensus 70 ~P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye~~~~~~~~~ 149 (663) T protein:vir:34 70 MASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYEVEWEEVAGV 149 (663) T ss_pred hhhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEeecccchhccc Confidence 11111122311 1112232223322 2322 3355888899999999998876554432 Q ss_pred ----CCC--------C---------ceEEEEEccceeEEEEeCCCCce---EEEEEEEEEecCC---------------- Q lcl|NC_021301. 123 ----RDD--------G---------TATITADSPETMVVSVDPLQPWR---IRSAMRWWRDLDA---------------- 162 (456) Q Consensus 123 ----d~d--------g---------~~~i~~~~p~~~~~~~d~~~~~~---~~~~~~~~~~~d~---------------- 162 (456) |+. + ..+|..+.=+.+ ++||...+. .++...+.+...- T Consensus 150 ~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~df--l~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~~~~~~~a~~ 227 (663) T protein:vir:34 150 DAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDV--LWSPARVWHEVRWLAFRNLLDMREFNARFDADGSRNLWASV 227 (663) T ss_pred cccCCCccccchhcccccchhhcccceeeeeechhhc--ccchhhccccccceeeeccCCHHHHHHhhcCChhhhhhhhc Confidence 110 0 111222211111 122221111 1111111110000 Q ss_pred -------------------ceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeeccccc----ccCceeE-EEEc Q lcl|NC_021301. 163 -------------------ESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAV----VTGSPPP-VVVY 218 (456) Q Consensus 163 -------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~p-vv~~ 218 (456) +.....+|....-. .+|..+ +.+.|.-+..+| .|+.+|- +.++ T Consensus 228 ~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~------------V~w~~e-g~~~~L~~~~p~lgl~~ffPcPrpl~~~ 294 (663) T protein:vir:34 228 PKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRK------------VDWYVE-GYSAVLDTQPDPLGLESFFPCPKPLLAN 294 (663) T ss_pred cCcCCccccCCCCCcchhcCcceeEEEecCCcE------------EEEEEc-CcceecccCCCCCCCCCCCCCcccccce Confidence 00011112111100 011111 111222222222 2333231 1122 Q ss_pred ---cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhhhhcc---ceeccC Q lcl|NC_021301. 
219 ---QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFEAAPG---ALWELP 292 (456) Q Consensus 219 ---~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 292 (456) .+-...++|.-...+++.+|.+.-. -|.....- ...++.-.+.+. ..|..+.........+- ..+... T Consensus 295 ~~~ds~ipvpd~~~y~~~~~E~n~~t~R-in~l~d~i-kv~gvy~~~~g~----~i~~~l~~a~~n~lvpV~~~~~~~~~ 368 (663) T protein:vir:34 295 WTTDKVVPRPDFVLAQDLYKEIDLVSTR-ITLLERAI-RVVGVYDKSSGL----TIGRLLSEAAQNDLIPVENWLTFADK 368 (663) T ss_pred ecCCCeecCCcHHHHHHHHHHHHHHHHH-HHHHHhhh-hhceeeccccch----hHHHHHHHhhCCCceecchhhhhhhh Confidence 2345668898888999999855443 23222111 111221100110 01111111000000000 011111 Q ss_pred CCc-eeE-eec--c--cchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 293 PGV-DIW-ESQ--T--NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEA 366 (456) Q Consensus 293 ~d~-~~~-~~~--~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~ 366 (456) .+. +.. .++ + ..+.++.+.-+.+...++.+||+.+..=|..-.|-.+.|-..+-+.+..++.+++.......+. T Consensus 369 gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qdevqR~arD 448 (663) T protein:vir:34 369 GGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQDEVARFASD 448 (663) T ss_pred cCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHHHHHHHHHH Confidence 111 111 111 1 1233444555778888999999987666654445577777888888889999999998888888 Q ss_pred HHHHHHHh-------------cCCC-------------------cccceeEEecCCCCcCHHHHHHHHHHHHh------- Q lcl|NC_021301. 367 ILVKALQI-------------EGES-------------------VEDTVDVSFESPDRVTLGEKYAAASLAKA------- 407 (456) Q Consensus 367 ~~~l~~~~-------------~~~~-------------------~~~~i~v~f~~~~~~~~~e~ad~~~kl~~------- 407 (456) +.++...+ .|.. 
..+.++|.=..+.-.|..+.-+..+++.+ T Consensus 449 i~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~l~~i~~~~q 528 (663) T protein:vir:34 449 IQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEVLSGIASFMQ 528 (663) T ss_pred HHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHHHHHHHHHHH Confidence 88776422 1110 11123333333444555433333332221 Q ss_pred -------cCCCcHHHHHHhC-----CCChh-HHHHHHHHHHHHHHHHHh---hhhhhhcccccCC Q lcl|NC_021301. 408 -------AGESWASIRRNIL-----NYNAD-QIKQDDLDRAREQITLFA---GNSVQRPQEDGSR 456 (456) Q Consensus 408 -------~g~~s~~t~~~~~-----~~~~~-~~~~~e~~~~~ee~~~~~---~~~~~~~~~d~~~ 456 (456) .+-.....+.+++ +|... +++. -.++........+ ......++....| T Consensus 529 q~~pl~~q~p~~~p~l~Ellk~~~~~f~~~~qie~-ai~~~~~~~e~aa~~~~~~~pa~~~~~~k 592 (663) T protein:vir:34 529 GVAPLAQQVPGSAPFLLQMLKWSVSGLRGSSTIEG-VLDKAIAAAEEAQKQAAQQSPAPQQPDPK 592 (663) T ss_pred HHHHHHHhhhhhHHHHHHHHHHHhhcCChhhhHHH-HHHHHHhhhHHHhhccCCCCcccchhhHH Confidence 1111222233332 33221 1110 0111111111111 1111111111111 No 277 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=85.61 E-value=0.052 Score=27.65 Aligned_cols=294 Identities=12% Similarity=0.057 Sum_probs=109.6 Q ss_pred CCC-------------------------CCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhh Q lcl|NC_021301. 1 MTA-------------------------STPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREA 55 (456) Q Consensus 1 ~~~-------------------------~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~ 55 (456) |+. .+|.-++. ...-++.+.-+++|+. 
+-|+=....+.+..+ + T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~f~~p~~v~~--------~~~~~~~~~~~~~~~~---~~pp~~~~~la~~~~-a 68 (344) T protein:vir:60 1 MSKKKGKTLQPAAKKMTASAPKMEAFTFGEPVPVLD--------RRDILDYVECISNGRW---YEPPISFTGLAKSLR-A 68 (344) T ss_pred CCcccCCCCCchHHhhcCCcCcEEEEEcCCceeecC--------CcchhHHHHhhhcCcc---ccCCCCHHHHHHHHH-h Confidence 221 11111100 0111233344555543 112212222332221 1 Q ss_pred ccChHHHHHHHHHhhhc----cCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EE Q lcl|NC_021301. 56 RTNWGLMVRDSVADRII----PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TI 130 (456) Q Consensus 56 ~~n~~~~iVd~~a~~l~----~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i 130 (456) ..+..-++......+. ++|... . ..+..++.+.+.+|.||+.+-++..|++ .+ T Consensus 69 -~~~h~~~i~~k~n~l~~~~~Pn~~~t--------~-------------~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L 126 (344) T protein:vir:60 69 -AVHHSSPIYVKRNILASTFIPHPWLS--------Q-------------QDFSRFVLDFLVFGNAFLEKRYSTTGKVIRL 126 (344) T ss_pred -hhhhccchhhhhhHHHhhccCCCCCC--------H-------------HHHHHHHHHHHhcCCeEEEEEECCCCcEEEE Confidence 1222333333333332 222110 0 1133456677889999999988888876 47 Q ss_pred EEEccceeEEEEeCCCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccC Q lcl|NC_021301. 131 TADSPETMVVSVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTG 210 (456) Q Consensus 131 ~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (456) ..++|..+.+..+.. ++|....+... ..|..+.+.++.. ..+. T Consensus 127 ~~l~~~~vr~~~~~~---------~~~~v~~~~~~--~~~~~~eIiHir~-------------------------~~~~- 169 (344) T protein:vir:60 127 ETSPAKYTRRGVEED---------VYWWVPSFNEP--TAFAPGSVFHLLE-------------------------PDIN- 169 (344) T ss_pred EEcCcceEEEeecCC---------eEEEEccCCeE--EEEcCccEEEEcC-------------------------CCCC- Confidence 777777665543321 01111111110 1122222222110 0000 Q ss_pred ceeEEEEccCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhhhhhhhhhh--- Q lcl|NC_021301. 
211 SPPPVVVYQNPDGMGEVEPHIDIINRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAIDYASIFEAA--- 284 (456) Q Consensus 211 ~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~--- 284 (456) ..-.|.|.+.. .+.++....+-..-...++. .|-.+++-.+. ...++.-+.+ ...++.. T Consensus 170 --------~~~yGlsp~~~---a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~--~ls~e~~~~i--k~~~~~~~g~ 234 (344) T protein:vir:60 170 --------QELYGLPEYLS---ALNSAWLNESATLFRRKYYENGAHAGYIMYVTDA--VQDRNDIEML--RENMVKSKGR 234 (344) T ss_pred --------CCcccccHHHH---HHHHHHHHHHHHHHHHHHHhccCCCceEEEecCc--CCCHHHHHHH--HHHHHHhcCC Confidence 01145565543 33333322221111222322 23333321111 1111111111 1112111 Q ss_pred -cc-ceecc-----CCCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCc------HHHHHHHHHHHHH Q lcl|NC_021301. 285 -PG-ALWEL-----PPGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQ------SAEGAHNIEKGFL 350 (456) Q Consensus 285 -~~-~~~~~-----~~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~------Sg~Al~~~~~~l~ 350 (456) .+ .++.. .++.++..+..... ..|++..+....+|+++-++|+..+|....|. ...++.+....|. T Consensus 235 ~~~r~~~l~~p~g~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~L~ 314 (344) T protein:vir:60 235 NNFKNLFLYAPQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELI 314 (344) T ss_pred CCCcceEEecCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHH Confidence 11 12222 23445665543222 23888888899999999999999987432222 1112222222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcccceeEEecCCCCcCHHH Q lcl|NC_021301. 351 FKCEDRLSIAKIGLEAILVKALQIEGESVEDTVDVSFESPDRVTLGE 397 (456) Q Consensus 351 ~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e 397 (456) =.++ .|+++ ..+.|.. .+.|.++.....-. 
T Consensus 315 Pl~~--------~~e~l----n~~lg~~-----~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 315 PLQD--------RIREI----NGWLGQE-----VIRFKNYSLDTDNG 344 (344) T ss_pred HHHH--------HHHHH----HHhcCCc-----ccccCccccCCCCC Confidence 1111 11111 1122221 13455442221111 No 278 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=80.29 E-value=0.096 Score=26.16 Aligned_cols=419 Identities=11% Similarity=-0.022 Sum_probs=152.8 Q ss_pred CCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc----- Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRID--DGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP----- 73 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~--~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~----- 73 (456) |. ..+..+..++. ....+.+.+.+|..-..-..+.. .. -+. ..++--+-+...++.+++.|.+ T Consensus 1 mk-----~~~~~~~~~lkr~~~e~~w~e~a~~tlP~~~~~~~~-~~---~~~-~~~~~dstg~~a~~~LAa~l~~~ltpp 70 (510) T protein:vir:78 1 MK-----STAAMLWEKLRDGSVEQRAIEFAKTTLPYLMVDPMS-GS---RGV-VEHDFQSAGALLVNNLAAKLARSLFPT 70 (510) T ss_pred Ch-----hHHHHHHHHHhccchHHHHHHHHHhhccccccCCCC-cc---ccc-ccCcccchHHHHHHHHHHHHHHhhcCC Confidence 22 22222333321 12334455555544321100000 00 011 1123345567778888777753 Q ss_pred -CC-eecCCCC--------cccHH-----------HHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEE Q lcl|NC_021301. 74 -NG-ITVGGSA--------DSDLA-----------LRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITA 132 (456) Q Consensus 74 -~~-~~~~~~~--------d~~~~-----------~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~ 132 (456) .| |++...+ +.... ..+...+..++|.....++.++...+|.+.+++. +++. +++. 
T Consensus 71 ~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~--~~~~-~~~~ 147 (510) T protein:vir:78 71 GIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALLYRN--SDEA-TVVA 147 (510) T ss_pred CCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEEEEe--CCCC-eEEE Confidence 23 3332211 01111 1234456677888999999999999999866554 4433 4556 Q ss_pred EccceeEEEEeCCCCceEEEEEEEEEec-------CCc-----eEEEEEEcCCeEEEEEEeeeecccc-cceeecc-CCC Q lcl|NC_021301. 133 DSPETMVVSVDPLQPWRIRSAMRWWRDL-------DAE-----SDFAIVWSGDGWQKFARPCFVQSSS-RRRLVTR-ISD 198 (456) Q Consensus 133 ~~p~~~~~~~d~~~~~~~~~~~~~~~~~-------d~~-----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~ 198 (456) ++-.+.++.-| ..++ +...++.++-. -+. .....-+..-.+++.+. ...... ..+.+.. ..+ T Consensus 148 ~pl~~y~v~~d-~~G~-vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~V~--~~~~~~~~~~sv~~e~dg 223 (510) T protein:vir:78 148 WSLRSYAVRRD-ATGR-WMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTHVQ--RRKGTAMDYAEMYHEIDG 223 (510) T ss_pred EEcceeEEeeC-CCcC-eeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEEEE--eecCCCCcEEEEEEEecC Confidence 65555444434 3444 33333333210 000 00000000001111111 000000 0011000 111 Q ss_pred ceeecccccccCc-eeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCccccccccc Q lcl|NC_021301. 199 SWVPVGDAVVTGS-PPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENG 272 (456) Q Consensus 199 ~~~~~~~~~~~~~-~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~ 272 (456) .... .....++. +|.+++ | .+.+|+|-.+..++-+..+|...-...........+...+ . 
.+| T Consensus 224 ~~i~-~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv-~---------p~g 292 (510) T protein:vir:78 224 VRVG-ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLV-D---------EAK 292 (510) T ss_pred eeec-cccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccc-C---------Ccc Confidence 1111 11111112 222222 2 3468999999999988888876665555555544443221 1 111 Q ss_pred chhhhhhhhhhhccceeccCC-CceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHH Q lcl|NC_021301. 273 NAIDYASIFEAAPGALWELPP-GVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFL 350 (456) Q Consensus 273 ~~~~~~~~~~~~~~~~~~~~~-d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~ 350 (456) . .........+.|.+....+ +....++. .+++..-.+.++.+...|....=+. ....+..+.+|.-++.....+. T Consensus 293 ~-~~~~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~~--l~~~~~~rvTAtEV~~r~~E~~ 369 (510) T protein:vir:78 293 G-AVVDDYQDAEMGDYVPGGAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMYG--ANQRDAERVTAEEVRITAEEAE 369 (510) T ss_pred c-cchhhhccCCCceeecCCcccccccccCcccchHHHHHHHHHHHHHHHHHHhhc--cccCCCCCcCHHHHHHHHHHHH Confidence 1 1111112222233322211 12222332 2334433333444333333321111 0011222335544443322222 Q ss_pred HHH----HHHH-HHHHHHHHHHHHHHHHhcCCC---cc--cceeEEecCCCCcCHHHHHHHHHHH------H-hcCCC-- Q lcl|NC_021301. 351 FKC----EDRL-SIAKIGLEAILVKALQIEGES---VE--DTVDVSFESPDRVTLGEKYAAASLA------K-AAGES-- 411 (456) Q Consensus 351 ~k~----~~~~-~~f~~~l~~~~~l~~~~~~~~---~~--~~i~v~f~~~~~~~~~e~ad~~~kl------~-~~g~~-- 411 (456) +.. .+.+ ..+.+-+++.+.++.. .|.. .+ ....+++..+ ...++-+.++ . 
..|-+ T Consensus 370 ~~LGpv~~rl~~E~l~Pli~r~~~il~r-~gl~p~p~~~~~~~~v~~is~-----Laraq~~~~l~~~~q~l~~~~~~~q 443 (510) T protein:vir:78 370 NTLGGTYSLLAENLQSPLAYVCLSEVDD-ALLQGLITKQHKPAIETGLPA-----LSRSAAVQSMLNASQVIAGLAPIAQ 443 (510) T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHh-ccCCCCCcccccceeeecccH-----HHHHHHHHHHHHHHHHHHHhcChhh Confidence 221 1111 2223334444444432 2321 11 1122343333 2222222211 1 11211 Q ss_pred -----cHH----HHHHhCCCCh-------hHHHHHHHHHHHHHHHH--HhhhhhhhcccccCC Q lcl|NC_021301. 412 -----WAS----IRRNILNYNA-------DQIKQDDLDRAREQITL--FAGNSVQRPQEDGSR 456 (456) Q Consensus 412 -----s~~----t~~~~~~~~~-------~~~~~~e~~~~~ee~~~--~~~~~~~~~~~d~~~ 456 (456) .-. .....+|+++ ++++++..++.+++... ..++.......-|+. T Consensus 444 ~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~ 506 (510) T protein:vir:78 444 LDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNA 506 (510) T ss_pred hhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Confidence 111 2234567643 44444433332222111 112222233333333 No 279 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=78.25 E-value=0.12 Score=25.71 Aligned_cols=432 Identities=12% Similarity=0.004 Sum_probs=157.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhcc------C Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIP------N 74 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~------~ 74 (456) |-...+.-+.+. +-.....+++.+.+|.--.--. .+...........+ .--+-+...++.+++.|.+ . 
T Consensus 1 m~~~~~~l~~k~---~R~~~e~~w~e~a~~~lP~~~~--~~~~~~~~~~~~~~-~~dstg~~a~~~LAa~l~~~ltpp~~ 74 (514) T protein:vir:80 1 MRQQASAMWAEY---RDSTAIRKAEDFAKFTIASLMV--DPLDKTHQAEVVEY-DFQSAGAFLVNNLTAKLALTLFPPGR 74 (514) T ss_pred CccchHHHHHHh---hcchHHHHHHHHHHHhcccccC--CCCCCccccccccc-ccchhHHHHHHHHHHHHHhhhcCCCC Confidence 666655443322 1122345566666664332100 01011011111111 1234456677777776653 2 Q ss_pred C-eecCCCCc---------c---cHHH-------HHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEc Q lcl|NC_021301. 75 G-ITVGGSAD---------S---DLAL-------RARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADS 134 (456) Q Consensus 75 ~-~~~~~~~d---------~---~~~~-------~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~ 134 (456) | |++..+++ . +... .+...+..++|.....++.++...+|.+.+++-.+. ..++.++ T Consensus 75 ~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~---~~~~~~p 151 (514) T protein:vir:80 75 PSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFYREPGT---GKMLVWT 151 (514) T ss_pred cccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEEEecCC---CcEEEEE Confidence 3 44432210 0 0111 133445668899999999999999999877764332 2355665 Q ss_pred cceeEEEEeCCCCceEEEEEEEEEecCC---c---eEEEEE--E-cCCeEEEEEEeeeeccccc-ceeec-cCCCceeec Q lcl|NC_021301. 135 PETMVVSVDPLQPWRIRSAMRWWRDLDA---E---SDFAIV--W-SGDGWQKFARPCFVQSSSR-RRLVT-RISDSWVPV 203 (456) Q Consensus 135 p~~~~~~~d~~~~~~~~~~~~~~~~~d~---~---~~~~~~--~-~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~ 203 (456) -.+.++.-| ..++....+.+....... . ...... . ..+.+..|........... .+.+. ...+..+. 
T Consensus 152 l~~y~v~~d-~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~~e~~g~~i~- 229 (514) T protein:vir:80 152 MQSYTVRRT-SHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVWHELEGKRVG- 229 (514) T ss_pred cCeEEEeeC-CCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEEEeccceeec- Confidence 555444444 444433332222211000 0 000000 0 0111111111111111111 01110 01111110 Q ss_pred ccccccC-ceeEEEE-c----cCCCCCCcHhHHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccchhhh Q lcl|NC_021301. 204 GDAVVTG-SPPPVVV-Y----QNPDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDY 277 (456) Q Consensus 204 ~~~~~~~-~~~pvv~-~----~n~~g~s~~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~ 277 (456) ......+ .+|.+++ | .+.+|+|-.+..++-+..+|.+.-...........|...+ +.+|. ... T Consensus 230 ~es~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~~~~~~v----------~~~g~-~~~ 298 (514) T protein:vir:80 230 PESSYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEALSLLNLV----------DEAKG-GAV 298 (514) T ss_pred ccCccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcee----------Ccccc-cch Confidence 0111111 1232222 2 3468999999999988888876666555555555443222 11111 011 Q ss_pred hhhhhhhccceeccC-CCceeEeec-ccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHHHHHHHHHHHHHHH-- Q lcl|NC_021301. 278 ASIFEAAPGALWELP-PGVDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKC-- 353 (456) Q Consensus 278 ~~~~~~~~~~~~~~~-~d~~~~~~~-~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~-- 353 (456) ......+.|.+.... .+....++. .+++....+.++.+...|....-+. .. ..+..+-+|.-++.....+.+.. 
T Consensus 299 ~~l~~~~~g~~v~g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml~-~~-~rd~~rvTAtEV~~r~~E~~~~LGp 376 (514) T protein:vir:80 299 DDYRDAETGDFVPGQVGSVASYERGDYNKIAQASASVESIVMRLNRAFMYT-GQ-VRDAERVTVEEIRTVAEEAENLLGG 376 (514) T ss_pred hhhcccCCceeecCCCccceeeecCcccchHHHHHHHHHHHHHHHHHHhhh-cc-CCCCCCCCHHHHHHHHHHHHHHhhH Confidence 111222223332222 222333332 2344444444444444443221110 01 11223335554444333222211 Q ss_pred --HHHH-HHHHHHHHHHHHHHHHh-cCC---CcccceeEEecCCC-CcCHHHH-------HHHHHHHHhcC-----CCcH Q lcl|NC_021301. 354 --EDRL-SIAKIGLEAILVKALQI-EGE---SVEDTVDVSFESPD-RVTLGEK-------YAAASLAKAAG-----ESWA 413 (456) Q Consensus 354 --~~~~-~~f~~~l~~~~~l~~~~-~~~---~~~~~i~v~f~~~~-~~~~~e~-------ad~~~kl~~~g-----~~s~ 413 (456) .+.+ ..+.+-+.+.+.++... .|. ....-+++.+.-++ .-..... ++.+..+.+.. .+.- T Consensus 377 v~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~ 456 (514) T protein:vir:80 377 VYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRNIETANILRATQEASAIVPALVQLSKRFDP 456 (514) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeecHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCH Confidence 1111 22233334444444322 232 11222344443221 1111111 22222221110 0111 Q ss_pred HHH----HHhCCCChh------HHHHHHHHHHHH-HHHHHhhhhhhhcccccCC Q lcl|NC_021301. 414 SIR----RNILNYNAD------QIKQDDLDRARE-QITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 414 ~t~----~~~~~~~~~------~~~~~e~~~~~e-e~~~~~~~~~~~~~~d~~~ 456 (456) ..+ .+.+|++.. ++.+.+++++++ +.++.........++-|.. 
T Consensus 457 d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 510 (514) T protein:vir:80 457 EKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSAG 510 (514) T ss_pred HHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 112 233565432 222233333322 2222222222223333333 No 280 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=75.99 E-value=0.14 Score=25.26 Aligned_cols=310 Identities=10% Similarity=-0.018 Sum_probs=111.8 Q ss_pred CCCCCHHHHHHHHHHH--------HHHH--HHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhh Q lcl|NC_021301. 1 MTASTPAEWLPVLTKR--------IDDG--MSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADR 70 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~--------~~~~--~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~ 70 (456) |........-...... +.+- ..-++.+.-+|.+.-+. +-|+=.+..+.++.+ + ..++.-++.....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~~~~~y~~~~~~~~~~~-~epp~~~~~la~l~~-~-~~~h~~~i~~k~n~ 77 (345) T protein:vir:37 1 MKTNVKTDNKKGIVIAPINDRTFSLNEISASPALDYVGIGFDENYNC-YLPPVNRHALAKLPH-Q-NAQHGGILHSRANM 77 (345) T ss_pred CCCCccccchhhcccCcceeEEeecCCcccccchhhhhhhhcCCccc-cCCCCCHHHHHHHhh-c-ccccccceeeechH Confidence 2221110000000000 0000 01122222233221111 111111222222211 1 12333333333222 Q ss_pred hc----cCCeecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCC Q lcl|NC_021301. 71 II----PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPL 145 (456) Q Consensus 71 l~----~~~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~ 145 (456) +. ++|.. . ...+.+++.+.+.+|.||+.+-++..|++ .+..++|..+.+..|.. 
T Consensus 78 l~~~~~Pn~~l-------t--------------~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~ 136 (345) T protein:vir:37 78 VSSLYEGGKAL-------S--------------RMDMRALCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVRKDGG 136 (345) T ss_pred HHhhccCCCCC-------C--------------HHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEEcCceeEEEEeCC Confidence 21 22211 0 01133466778899999999999888876 57888888876654432 Q ss_pred CCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCC Q lcl|NC_021301. 146 QPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMG 225 (456) Q Consensus 146 ~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s 225 (456) . ...++++....+ +.. ..|.++.+.++.. .. +...-.|.| T Consensus 137 ~----~~~~~~~~~~~~-g~~-~~~~~~dVihir~-------------------------~~---------~~~~~~Gls 176 (345) T protein:vir:37 137 Y----SYLMKKSLYDTA-QEI-YRYDAKDIIFIKL-------------------------YD---------PMQQVYGSP 176 (345) T ss_pred e----eEEEEEeEecCC-ceE-EEEccccEEEecC-------------------------CC---------CCCCccccc Confidence 1 111222211111 110 1122222222110 00 001124666 Q ss_pred cHhHHHHHHHHHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchhhhhhhhhhh-----ccceecc-----C Q lcl|NC_021301. 226 EVEPHIDIINRINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAIDYASIFEAA-----PGALWEL-----P 292 (456) Q Consensus 226 ~~~~v~~liDa~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~-----~ 292 (456) .+...+. ++....+-..-...++ +.|-.++.-.+.. ..++....+ ...++.. .+.++.. + T Consensus 177 ~~~~a~~---si~l~~~a~~~~~~~f~NG~~p~~Il~~~d~~--l~~e~~~~l--k~~~~~~~g~~n~~~~~i~~p~g~~ 249 (345) T protein:vir:37 177 DYVGGIQ---SALLNSDATVFRRRYFSNGAHMGFILYSTDPD--LTEEMEEEI--ARKISESKGVGNFRSMFVNIANGHP 249 (345) T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHhccCCcceEEEecCCC--CCHHHHHHH--HHHHHHhcCcccccceEEEcCCCcc Confidence 6554333 2221111111112222 2233333211111 111111111 1112111 1223333 2 Q ss_pred CCceeEeecccch-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcH------HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 
293 PGVDIWESQTNDF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS------AEGAHNIEKGFLFKCEDRLSIAKIGLE 365 (456) Q Consensus 293 ~d~~~~~~~~~~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~S------g~Al~~~~~~l~~k~~~~~~~f~~~l~ 365 (456) .+.++..+...+. ..|++..+....+|+++-++|+..+|....|.+ ..+..+....+ .=.++.|...+. T Consensus 250 ~G~~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~llGi~~~~~~~~~~~e~~~~~f~~~~l----~P~~~~ie~~ln 325 (345) T protein:vir:37 250 DGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEV----MPLQEIIAETIN 325 (345) T ss_pred cceEEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCccCCCCCCcccHHHHHHHHHHHHH----HHHHHHHHHHhh Confidence 3455666643321 237887788899999999999999875322221 11222222222 222222333332 Q ss_pred HHHHHHHHhcCCCcccceeEEecCCCCcCHHH Q lcl|NC_021301. 366 AILVKALQIEGESVEDTVDVSFESPDRVTLGE 397 (456) Q Consensus 366 ~~~~l~~~~~~~~~~~~i~v~f~~~~~~~~~e 397 (456) ++ ..+ .....+.|.+.. +++ T Consensus 326 ~~----~~~-----~~~~~i~F~~~~---L~~ 345 (345) T protein:vir:37 326 QD----PEI-----KNLLKIKFREQN---FAK 345 (345) T ss_pred hh----ccC-----CCcceEEecchh---hcC Confidence 21 111 122346676542 222 No 281 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=72.66 E-value=0.18 Score=24.67 Aligned_cols=299 Identities=11% Similarity=0.038 Sum_probs=106.8 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhh---c--cCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI---I--PNG 75 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l---~--~~~ 75 (456) -|-.+|+-++..- .+.....-+....+||+---+ +..+.++.+. 
..+..-++.....++ + .+| T Consensus 24 ~~~~~p~~~~~~~--~~~~~~~~~~~~~~~~~pp~~--------~~~la~l~~~--~~~h~~~i~~k~n~l~~l~~~Pn~ 91 (346) T protein:vir:10 24 FSFGDPIPVLDRA--DILNYLECSAMYEKWYNPPMS--------FDGLAKSLRS--STHHESAIITKANILLSTCEVDSR 91 (346) T ss_pred EecCCcceecCch--hHHHHHHHhhcCCceEecCCC--------HHHHHHHHHh--hhhcchhhhhhhhhHHHHHhCCCC Confidence 2222232222110 000111111112234432111 1112222111 112222222222222 1 111 Q ss_pred eecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEE Q lcl|NC_021301. 76 ITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAM 154 (456) Q Consensus 76 ~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~ 154 (456) .. . ...+.+++.+.+.+|.||+.+-.+..|++ .+..++|..+.+..+... . . T Consensus 92 ~~-------t--------------~~~f~~~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~~---~---~ 144 (346) T protein:vir:10 92 YL-------S--------------RRDLSSFVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAGQ---F---Y 144 (346) T ss_pred CC-------C--------------HHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCCe---E---E Confidence 11 0 01123456677889999999999888875 477788877765433221 1 0 Q ss_pred EEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHH Q lcl|NC_021301. 155 RWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDII 234 (456) Q Consensus 155 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~li 234 (456) ......+|.. ..|.++.+.++.. .... ..-.|.|.+...+..+ T Consensus 145 ~~~~~~~g~~---~~~~~~dIih~r~-------------------------~~~~---------~~~~G~~~~~~a~~si 187 (346) T protein:vir:10 145 YVPQRFDHQE---HEFAKGSIYHLLE-------------------------PDIN---------QDIYGLPQYLSALQSA 187 (346) T ss_pred EEEEccCCeE---EEEecccEEEecC-------------------------CCCC---------CCeeeccHHHHHHHHH Confidence 1111112211 1122222222210 0000 0113666554333222 Q ss_pred HHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhh-hhhhhh--hhccceeccCC-----CceeEeeccc Q lcl|NC_021301. 
235 NRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAID-YASIFE--AAPGALWELPP-----GVDIWESQTN 303 (456) Q Consensus 235 Da~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~~~-----d~~~~~~~~~ 303 (456) .. ..+--.-...++. .|-.++.-.+. ...++.-+.+. ...... ...+.++...+ +.++..+... T Consensus 188 ~l---~~~a~~~~~~~~~NG~~~~~il~~~d~--~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~~~~~~gi~~~pis~~ 262 (346) T protein:vir:10 188 WL---NESATLFRRKYFLNGAHAGFVFYMSDA--SQKQEDVENIRQQLKQSKGVGNFKNLFVHAPNGKKDGIQIIPIADV 262 (346) T ss_pred HH---HHHHHHHHHHHHhccCCCceEEEeCCC--CCCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEecCCC Confidence 21 1111111122222 23322221111 11111111111 111111 11233444433 2344444332 Q ss_pred ch-HHHHHHHHHHHHHHHhhcCCChhhhcccccC------cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_021301. 304 DF-TPMLSAIKEHIRQLSSATKTPLPMLMPDSAN------QSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEG 376 (456) Q Consensus 304 ~~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N------~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~ 376 (456) .. ..|++..+....+|+++-++|+..+|....| ....++.+....+.=.++..+ ++.. ..+ T Consensus 263 ~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~~~~l~P~~~~ie--------e~n~----~L~ 330 (346) T protein:vir:10 263 SAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFFITEIEPLQERLK--------EFNQ----WLG 330 (346) T ss_pred hhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHHHHHHHHHHHH--------HHHh----hcc Confidence 22 2377777788899999999999998743222 222222222222222222211 1111 111 Q ss_pred CCcccceeEEecCCCCcCHHH Q lcl|NC_021301. 377 ESVEDTVDVSFESPDRVTLGE 397 (456) Q Consensus 377 ~~~~~~i~v~f~~~~~~~~~e 397 (456) .. 
.+.|++...-...| T Consensus 331 ~e-----~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 331 QE-----VIKFKPSKLLQRTQ 346 (346) T ss_pred cc-----eeeechhhhcccCC Confidence 11 14566543332222 No 282 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=69.43 E-value=0.22 Score=24.16 Aligned_cols=198 Identities=10% Similarity=0.005 Sum_probs=68.8 Q ss_pred EEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHH Q lcl|NC_021301. 154 MRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDI 233 (456) Q Consensus 154 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~l 233 (456) +|. ..||... |.+.. ......+...... ..+.-|.-+.. +...-.|.|.+..... T Consensus 1 ~r~--~~dg~~~----------y~~~~--~~~~~~g~~~~~~-------~~eilH~r~~~---~~~~~~Glspi~~a~~- 55 (219) T protein:vir:98 1 MRV--CKDGNYK----------YLMKK--SLYDTKSEIYEYN-------KNDVIFIKLYD---PMQQVYGSPDYVGGIT- 55 (219) T ss_pred Cce--eecCeEE----------EEEec--ceecCCceeEEec-------cccEEEecCCC---CCCCcceecHHHHHHH- Confidence 111 1122111 00000 0000000000000 00001110000 0011246665544332 Q ss_pred HHHHHHHHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhhhhhhhhhh-----ccceecc-----CCCceeEee Q lcl|NC_021301. 234 INRINRAELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAIDYASIFEAA-----PGALWEL-----PPGVDIWES 300 (456) Q Consensus 234 iDa~~~~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~-----~~d~~~~~~ 300 (456) ++....+-..-...+|. .|-.++.--+ ....++....+ ...+... .+.++.. 
..+.++..+ T Consensus 56 --~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~--~~l~~e~~~~~--~~~~~~~~g~~n~~~~~l~~~gg~~~G~~~~~~ 129 (219) T protein:vir:98 56 --SALLNSDATIFRRRYYSNGAHMGFILYSTD--PDMTEEMEDEI--AERIRDSKGVGNFRSMFVNIAGGHPDGLKVIPI 129 (219) T ss_pred --HHHHHHHHHHHHHHHHhcCCCCceEEEeCC--CCCCHHHHHHH--HHHHHHhcCcccccceeEecCCCCccceeEEEc Confidence 22222221111223332 2333332101 01111111111 1112111 1222222 234566665 Q ss_pred cc--cchHHHHHHHHHHHHHHHhhcCCChhhhcccc------cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 301 QT--NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS------ANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKAL 372 (456) Q Consensus 301 ~~--~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~------~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~ 372 (456) .. .+.+ |++..+..+.+|+.+-++|++.+|... +|.....+.+....|.-.+.+.+..+.. . T Consensus 130 ~~~~~d~q-fle~rk~~~~eIa~~fgVPp~~lG~~~~~~~~~sn~eq~~~~f~~~tL~P~~~~ie~~ln~-------~-- 199 (219) T protein:vir:98 130 GDTGQKDE-FANIKNISAQDVLTSHRFPPGLSGIIPVNTAGLGDPLKIREAYQADEVLPLQEIIAESINS-------D-- 199 (219) T ss_pred cCCHHHHH-HHHHHHhhHHHHHHHhCCCHHHcccccCCCCCccCHHHHHHHHHHHHHHHHHHHHHHHhhh-------h-- Confidence 43 3343 888888889999999999999987421 1222222333333332222222222210 0 Q ss_pred HhcCCCcccceeEEecCCCCcCHH Q lcl|NC_021301. 373 QIEGESVEDTVDVSFESPDRVTLG 396 (456) Q Consensus 373 ~~~~~~~~~~i~v~f~~~~~~~~~ 396 (456) ..-....++.|..+.+.|.- T Consensus 200 ----~~~~~~~~~~F~~~~~~d~~ 219 (219) T protein:vir:98 200 ----YEIKSALKVNFKQPEKRDKN 219 (219) T ss_pred ----hcCCCccEEeecCcccccCC Confidence 01123356778877666655 No 283 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=67.77 E-value=0.25 Score=23.91 Aligned_cols=305 Identities=10% Similarity=0.022 Sum_probs=106.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhc----cCCe Q lcl|NC_021301. 
1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRII----PNGI 76 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~----~~~~ 76 (456) ++-.+|.-++..- .-++.+.-++.|.. ..++-....+....+ . ..+..-++-.....+. .+|. T Consensus 39 ~~fg~p~~~~~~~--------~~~~~~~~~~~~~~---~~~pi~~~~la~~~~-~-~~~h~~~~~~~~n~l~l~~~Pn~~ 105 (368) T protein:vir:79 39 FSFGDPVEVLDRR--------ELLDYVECMRMGQW---YEPPMPWDGLARSFR-A-AAHHSSAVYVKRNILVSTFIPHPL 105 (368) T ss_pred EEcCCceeecchh--------hHHHHHHHHhccch---hccCcCHHHHHHHHh-h-ccccchhhhhhcchhhhhcCCCcC Confidence 2222222222110 00122222233321 000000111111111 0 1111111111111111 1111 Q ss_pred ecCCCCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEE Q lcl|NC_021301. 77 TVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMR 155 (456) Q Consensus 77 ~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~ 155 (456) . . ...+.+++.+.+.+|.||+.+-.+..|++ .+..++|..+.+.-|.. .. T Consensus 106 ~----------t-----------~~~f~~l~~d~ll~Gnay~~~~r~~~G~~~~L~~l~~~~v~~~~~~~--------~~ 156 (368) T protein:vir:79 106 L----------S-----------RATFERLVLDWQVFGNAYLERRENVLGGTIRLDTPLAKYVRRGLDLN--------TY 156 (368) T ss_pred C----------C-----------HHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEeCcccceeeccCC--------EE Confidence 0 0 01123466778899999999999888885 47777777664332211 01 Q ss_pred EEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHH Q lcl|NC_021301. 156 WWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIIN 235 (456) Q Consensus 156 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liD 235 (456) ++...++.. ..|..+.+.++.. ..+. 
..-.|.|.+...+.-++ T Consensus 157 ~~~~~~~~~---~~~~~~dIihir~-------------------------~~~~---------~~~yGlsp~~~a~~si~ 199 (368) T protein:vir:79 157 FFVQNWQQP---YTFAAGSVFHLQE-------------------------PDIN---------QEVYGLPEYLSALNATW 199 (368) T ss_pred EEEecCCeE---EEEccccEEEecC-------------------------CCCC---------CCcccccHHHHHHHHHH Confidence 111111111 1122222221110 0000 01146666554433333 Q ss_pred HHHHHHHHHHHHHHHh---hchhhhhhcCCCcccccccccchhh-hhhhhh--hhccceecc-----CCCceeEeecccc Q lcl|NC_021301. 236 RINRAELQLLSTMAIQ---AFRQRALKSAGHGLPKVDENGNAID-YASIFE--AAPGALWEL-----PPGVDIWESQTND 304 (456) Q Consensus 236 a~~~~~s~~~~~~~~~---~~~~~~i~g~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~~-----~~d~~~~~~~~~~ 304 (456) .-+.+.. -...++ +.|-.++.-.+.. ..++.-..+. ...... ...+.++.+ +.+.++..+.... T Consensus 200 l~~aa~~---~~~~~~~NGa~~~gil~~~~~~--l~~e~~~~lk~~~~~~~G~~N~g~~~vl~~~g~~~g~~~~pls~~~ 274 (368) T protein:vir:79 200 LNESATL---FRRRYYKNGSHAGFILYMTDAA--QKQEDVDTLREAMKSAKGPGNFRNLFMYAPNGKKDGIQLLPVSEVA 274 (368) T ss_pred HHHHHHH---HHHHHHhccCCCceEEEeCCCC--CCHHHHHHHHHHHHHhcCCcccCceeEecCCCCccceeEEEcCCCH Confidence 2111111 112222 2333333211111 1111111111 111111 112344444 2345666654322 Q ss_pred h-HHHHHHHHHHHHHHHhhcCCChhhhcccccCcH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_021301. 305 F-TPMLSAIKEHIRQLSSATKTPLPMLMPDSANQS------AEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGE 377 (456) Q Consensus 305 ~-~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~S------g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~ 377 (456) . ..|++..+....+|+.+-++|+..+|....|.+ ...+.+....+.=.++. | +++. .+.+. T Consensus 275 ~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~e~~~~~f~~~~l~Pl~~~----i----e~ln----~~l~~ 342 (368) T protein:vir:79 275 AKDEFWNIKNVTRDDQLAAHRVPPQLMGIIPNNTGGFGDVEKAAMVFARNEVKPLQDR----L----LAIN----DWIGD 342 (368) T ss_pred HHHHHHHHHHHhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHHHHHHHHHHHHHH----H----HHHH----hccCc Confidence 1 237788888899999999999999975332221 11122221222111111 1 1111 11111 Q ss_pred CcccceeEEecCCC--CcCHHHHHHHHHHHHhc Q lcl|NC_021301. 
378 SVEDTVDVSFESPD--RVTLGEKYAAASLAKAA 408 (456) Q Consensus 378 ~~~~~i~v~f~~~~--~~~~~e~ad~~~kl~~~ 408 (456) . .+.|++.. ..|.+..++... .++ T Consensus 343 ---e--~~rF~~~~l~~~D~~a~a~~~~--rsa 368 (368) T protein:vir:79 343 ---E--VVRFAPYALGGHDQPAAAPGGQ--RSA 368 (368) T ss_pred ---c--eeeechhHhhcccccccCCccc--ccC Confidence 1 24455431 112222222111 122 No 284 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=64.48 E-value=0.3 Score=23.46 Aligned_cols=303 Identities=11% Similarity=-0.023 Sum_probs=110.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCeecCC Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGITVGG 80 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~~~~ 80 (456) .+-.+|.- ..-++.+.=+|.+.-+. +-|+-....+.++.+ ...++.-++.....++.+. |. . T Consensus 23 ~~~~~~~~------------~~~~~y~~~~~~~~~~~-~epp~~~~~la~~~~--~~~~h~~~i~~k~n~l~~~-~~--P 84 (345) T protein:vir:37 23 FSLSEITA------------SPALDYVGIGFDENYNC-YLPPVNRHALAKLPH--QNAQHGGILHSRANMVSAT-YE--G 84 (345) T ss_pred eecCCccc------------chhhcccceeeecCCcc-ccCCCCHHHHHHHhh--cchhhcchhhhhhhHHhhc-cC--C Confidence 22222210 01111111112111111 111111222332221 1234555666666665432 11 0 Q ss_pred CCcccHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEEEEEEEEEe Q lcl|NC_021301. 81 SADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIRSAMRWWRD 159 (456) Q Consensus 81 ~~d~~~~~~l~~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~~~~~~~~~ 159 (456) ...-. . ..+.+++.+.+.+|.||+.+-++..|++ .+..++|..+...-|.. . ...++.+.. 
T Consensus 85 n~~~t-~-------------~~f~~~v~d~ll~Gnay~~i~rn~~G~~~~L~pl~~~~vr~~~d~~---~-~~~~~~~~~ 146 (345) T protein:vir:37 85 GKALS-K-------------MEMRALCLNLIQFGDVGLLKVRNGFGQVVRLVPLSSLYLRVHKDGG---Y-SYLMKKSLY 146 (345) T ss_pred CCCCC-H-------------HHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCceeEEeecCC---e-eEEEeeeee Confidence 00000 0 1123456677889999999999988875 57777777665433211 1 111221111 Q ss_pred cCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHHHHHHHHHH Q lcl|NC_021301. 160 LDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHIDIINRINR 239 (456) Q Consensus 160 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~ 239 (456) ..+ .. ...|.++.+.++.. ..+ ...-.|.|.+... +.++.. T Consensus 147 ~~~-g~-~~~~~~~eViHir~-------------------------~~~---------~~~~~Gl~~~~~a---~~si~l 187 (345) T protein:vir:37 147 DTA-QE-IYRYDAKDIIFIKL-------------------------YDP---------MQQVYGSPDYVGG---IQSALL 187 (345) T ss_pred ccC-ce-EEEEccccEEEEcC-------------------------CCC---------CCCcccchHHHHH---HHHHHH Confidence 111 11 01122222222210 000 0011355544432 222221 Q ss_pred HHHHHHHHHHHhh---chhhhhhcCCCcccccccccchhh-hhhhhhh--hccceecc-----CCCceeEeecccchH-H Q lcl|NC_021301. 240 AELQLLSTMAIQA---FRQRALKSAGHGLPKVDENGNAID-YASIFEA--APGALWEL-----PPGVDIWESQTNDFT-P 307 (456) Q Consensus 240 ~~s~~~~~~~~~~---~~~~~i~g~~~~~~~~~~~~~~~~-~~~~~~~--~~~~~~~~-----~~d~~~~~~~~~~~~-~ 307 (456) ..+-..-...++. .|-.++.-.+.. ..++.-..+. ....... ..+.++.. +++.++..+.....+ . T Consensus 188 ~~~a~~~~~~~f~NGa~~~~Il~~t~~~--l~~e~~~~lk~~~~~~~g~~n~~~~~i~~~~g~~~G~~~~pl~~~~~d~q 265 (345) T protein:vir:37 188 NSDATVFRRRYFSNGAHMGFILYSTDPD--LTEEMEEEIARKISESKGVGNFRSMFVNIAGGHPDGLKVIPIGDTGTKDE 265 (345) T ss_pred HHHHHHHHHHHHhccCCcceEEEeCCCC--CCHHHHHHHHHHHHHhcCccccCceeEecCCCCccceeEEEccCChhHHH Confidence 1111111112222 222222111111 1111111111 1111110 11123332 223456666433222 3 Q ss_pred HHHHHHHHHHHHHhhcCCChhhhcccccCc------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccc Q lcl|NC_021301. 
308 MLSAIKEHIRQLSSATKTPLPMLMPDSANQ------SAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESVED 381 (456) Q Consensus 308 ~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~------Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~ 381 (456) |++..+....+|+++-++|+..+|..-.|. ...++.+....|.=.+ ..|...+.+ +..+ .. T Consensus 266 f~e~k~~~~~dI~~a~~VPp~liGi~~~~t~~~s~~e~~~~~f~~~~l~P~~----~~ie~~ln~----~~e~-----~~ 332 (345) T protein:vir:37 266 FANIKNISAQDVLTAHRFPAGLSGIIPTNTGGLGDPLKYREVYHYDEVMPLQ----EIIAETINQ----DPEI-----KN 332 (345) T ss_pred HHHHHHHhHHHHHHHhCCCHHHhccccCCCCCcccHHHHHHHHHHHHHHHHH----HHHHHHhhh----hhcc-----CC Confidence 888788888999999999999987432221 2222222222222222 222222222 1111 12 Q ss_pred ceeEEecCCC-Cc Q lcl|NC_021301. 382 TVDVSFESPD-RV 393 (456) Q Consensus 382 ~i~v~f~~~~-~~ 393 (456) ...+.|.+.. .+ T Consensus 333 ~~~i~F~~~~l~k 345 (345) T protein:vir:37 333 LLKIKFREQNFAK 345 (345) T ss_pred cceEEECchhhcC Confidence 3456777642 22 No 285 >protein:vir:4698 Length: 251 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061630;genbank:gi:9635717;genbank:GeneID:1262980 Probab=44.98 E-value=0.79 Score=21.16 Aligned_cols=241 Identities=11% Similarity=-0.058 Sum_probs=90.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHh---cccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhhccCCee Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYS---NGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRIIPNGIT 77 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~~~~YY---~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l~~~~~~ 77 (456) |- |.....++-. .+.-.....+. -+..... ...+ . ...-+...-..-+|+.+++-+..-|+. 
T Consensus 1 Mg------lF~~~~~r~~--~~~~~~~~~~~~~~~~~~~~~--~~~v----~-~~~al~~~~v~~~i~~ia~~iA~lp~~ 65 (251) T protein:vir:46 1 MG------IFYKNEKRDL--QYNEDDLQMMVQTLPSFQGTK--LRQY----K-DIEAIRHSDIFTAVMMIASDLARMPIR 65 (251) T ss_pred CC------cccccccccc--CCCccchhhhhhhhccccCcC--ccee----c-hhhhhccHHHHHHHHHHHHhHhhCceE Confidence 11 1100000000 00000000000 0000000 0000 0 011122344556888888888887877 Q ss_pred cCCCCcccHHHHHHHHHHh--cC---hhHHHHHHHHHHhhCCeEEEEEeeCCCCce-EEEEEccceeEEEEeCCCCceEE Q lcl|NC_021301. 78 VGGSADSDLALRARRIWRD--NR---MDSVCKQWVKYGLDFGESYLTCWRRDDGTA-TITADSPETMVVSVDPLQPWRIR 151 (456) Q Consensus 78 ~~~~~d~~~~~~l~~~~~~--n~---~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~-~i~~~~p~~~~~~~d~~~~~~~~ 151 (456) +....+......+.+++.. |. .......+..+.+.+|.||+++.++.+|++ .+..++|..+.+..|+.. . +. T Consensus 66 ~~~~~~~~~~~~~~~ll~~~Pn~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g-~-~~ 143 (251) T protein:vir:46 66 VTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDARG-R-LY 143 (251) T ss_pred EeeCccccccchHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCceEEEEECCCC-c-EE Confidence 6543222222234444432 32 235667788899999999999999999886 599999999998876542 2 21 Q ss_pred EEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHhHHH Q lcl|NC_021301. 152 SAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVEPHI 231 (456) Q Consensus 152 ~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~~v~ 231 (456) ..+ .....+..+. ...|.++.+.++.. ++ .+.-.|.|-++... T Consensus 144 ~~~-~~~~~~~~g~-~~~~~~~diiH~r~--------------------------------~~---~dg~~G~spi~~~~ 186 (251) T protein:vir:46 144 YFH-QRIDSNGNNI-ERNVKFEDMLDIKF--------------------------------YS---LDGINGLSLLDTLS 186 (251) T ss_pred EEE-EEeccCCcce-eEEECCccEEEecC--------------------------------cC---CCCeeecCHHHHHH Confidence 111 1111111111 12334444433321 00 01113555555443 Q ss_pred HHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccc-cchhhhhhhhhhhccceeccCCCceeEeecccchHH Q lcl|NC_021301. 
232 DIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDEN-GNAIDYASIFEAAPGALWELPPGVDIWESQTNDFTP 307 (456) Q Consensus 232 ~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 307 (456) ..+.....+..-..+...-.+.|-.+++-- .. ..++. ...+ ...+....+ ..+...+ +... ++. T Consensus 187 ~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~--l~~~e~~~~~--~~~~~~~~~---g~~n~g~---~~~g-m~~ 251 (251) T protein:vir:46 187 RTIESDNNGKDFLNNFLRNGTHAGGILKMK-GV--LDNKKARDRA--REEFPKVLV---ELNKLGK---LSYS-MNQ 251 (251) T ss_pred HHHHHHHHHHHHHHHHHHccCCCcEEEEeC-CC--CCCHHHHHHH--HHHHHHHhc---Ccccccc---cccc-cCC Confidence 333222211111111112222333333321 11 11111 0110 111111100 0000011 0000 000 No 286 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=23.47 E-value=2.3 Score=18.60 Aligned_cols=404 Identities=7% Similarity=-0.023 Sum_probs=132.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH------------------HHHHHHHHHHhcccCcccccCcccchhh-hhhhhhhccChHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG------------------MSRVRLLARYSNGDAPLPELTRNTSAAW-RSFQREARTNWGL 61 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~------------------~~r~~~~~~YY~g~~~i~~~~~~~~~~~-~~~~~k~~~n~~~ 61 (456) ++++.-..+++.-.....+. ++++..+..||.|++.... ..+..-.+ -...+.++.-.+. T Consensus 15 ~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ki~~n~~~~ivd~~~~ 93 (474) T protein:vir:10 15 ILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRL-DVSVNNKLNNSFDSEIVDTRVG 93 (474) T ss_pred CCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccc-ccCcccccccchHHHHHHhHhh Confidence 77777666665543332221 1223344567777653221 11111111 1223344444455 Q ss_pred HHHHHHHhhhccCCeecCCCCcccHHHHHHH---HHHhcChhHHHHHHHHHHhh--------CCeEEEEEeeCCCCceEE Q lcl|NC_021301. 
62 MVRDSVADRIIPNGITVGGSADSDLALRARR---IWRDNRMDSVCKQWVKYGLD--------FGESYLTCWRRDDGTATI 130 (456) Q Consensus 62 ~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~---~~~~n~~~~~~~~~~~~a~~--------~G~a~~~v~~d~dg~~~i 130 (456) .++...+.|-..++-.. ...-.+....+++ +...+ ......+...... .|.-.+.+ .++..-+.+ T Consensus 94 yl~g~pv~~~~~~~~~~-~e~~~~~l~~~~~~n~~~~~~--~~~~~~~~~~G~a~~~~~~d~~~~~~~~~-i~p~~~~~v 169 (474) T protein:vir:10 94 YLHGVPVTYDLDENAEK-NEKLKKFITNFAIRNSVDDED--SEIGKMAAICGYGARLAYIDTNGDIRIKN-IDPYNVIFV 169 (474) T ss_pred heeccceeEeeCCCCcc-hHHHHHHHHHHHhhcCHhHHH--HHHHHHHhhcCeEEEEEEeCCCCeeEEEE-EcccceEEE Confidence 55544444443221110 0000011111110 10000 0111111111111 12211111 111111111 Q ss_pred E--EEccceeEEEEeC--CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccc Q lcl|NC_021301. 131 T--ADSPETMVVSVDP--LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDA 206 (456) Q Consensus 131 ~--~~~p~~~~~~~d~--~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (456) . ...+..++..|.. .........+..|+. + . ...|..+..-.+...... ..+-+......-. T Consensus 170 ~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~-~--~--~~~~~~~~~~~~~~~~~~---------~~~~g~vPvv~~~ 235 (474) T protein:vir:10 170 GDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDN-A--Y--YYVFRGEGIDALQEVGRY---------EHLFDYNPLFGVP 235 (474) T ss_pred EcCCCceEEEEEEEEEeeCCCceEEEEEEEEcC-c--e--EEEEeecCCCcccccccc---------cCCCCccceEEec Confidence 0 0112222222321 122222223333321 1 1 112222111111100000 0000100000011 Q ss_pred cccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHH----HHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhh Q lcl|NC_021301. 207 VVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAEL----QLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFE 282 (456) Q Consensus 207 ~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s----~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~ 282 (456) ++..+.+ ....+.++++-++..-..++ ..++..-.... ..+...........|... ... 
T Consensus 236 n~~~g~s---------d~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g----~~~~~~~~~~~~~~~~i~----~~~ 298 (474) T protein:vir:10 236 NNKEMIG---------DAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG----MGMSEEMIQETQKSGAFE----LFD 298 (474) T ss_pred CCCCCCC---------chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc----CCCCchhhhhhhhcceeE----ecC Confidence 1111211 11123333333332221111 11111111000 011111111111111111 001 Q ss_pred hhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 283 AAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAE-GAHNIEKGFLFKCEDRLSIAK 361 (456) Q Consensus 283 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~-Al~~~~~~l~~k~~~~~~~f~ 361 (456) .+.+.-+. .++.+....+++++.++..++.++.++++....+++..++.+-. .+.........+.......+. T Consensus 299 ~~~~~~~l------~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 372 (474) T protein:vir:10 299 KDMDVKYL------TKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLR 372 (474) T ss_pred CCCceeEE------eccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111122 23444566799999999999999999999988887666665433 445555556677777778888 Q ss_pred HHHHHHHHHHHH-------hcCCCcccceeEEecCCCCcCHHHHHHH-------HHHHHhcCCCcHHHHHHhCCCChhHH Q lcl|NC_021301. 362 IGLEAILVKALQ-------IEGESVEDTVDVSFESPDRVTLGEKYAA-------ASLAKAAGESWASIRRNILNYNADQI 427 (456) Q Consensus 362 ~~l~~~~~l~~~-------~~~~~~~~~i~v~f~~~~~~~~~e~ad~-------~~kl~~~g~~s~~t~~~~~~~~~~~~ 427 (456) +.++-+++++.. .........+......+. ...++.+.. -+.+...+.+.. .+++. T Consensus 373 ~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~-~e~a~~~~kl~g~iS~et~~~~l~~v~d---------~~~E~ 442 (474) T protein:vir:10 373 YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNK-LEESQVLINLKGQVSERTRLGQSQLVDD---------VDYEL 442 (474) T ss_pred HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCH-HHHHHHHHHHhccCchHHHHHhCCCCCC---------HHHHH Confidence 877776665421 111111111111111110 111111100 111112233322 12344 Q ss_pred HHHHHHH---HHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 
428 KQDDLDR---AREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 428 ~~~e~~~---~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ++++.|+ .+...+...+.....++++.|- T Consensus 443 eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:10 443 DEMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 4433322 2222222222222222222222 No 287 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=23.47 E-value=2.3 Score=18.60 Aligned_cols=404 Identities=7% Similarity=-0.023 Sum_probs=132.3 Q ss_pred CCCCCHHHHHHHHHHHHHHH------------------HHHHHHHHHHhcccCcccccCcccchhh-hhhhhhhccChHH Q lcl|NC_021301. 1 MTASTPAEWLPVLTKRIDDG------------------MSRVRLLARYSNGDAPLPELTRNTSAAW-RSFQREARTNWGL 61 (456) Q Consensus 1 ~~~~t~~~~~~~l~~~~~~~------------------~~r~~~~~~YY~g~~~i~~~~~~~~~~~-~~~~~k~~~n~~~ 61 (456) ++++.-..+++.-.....+. ++++..+..||.|++.... ..+..-.+ -...+.++.-.+. T Consensus 15 ~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ki~~n~~~~ivd~~~~ 93 (474) T protein:vir:94 15 ILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRL-DVSVNNKLNNSFDSEIVDTRVG 93 (474) T ss_pred CCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccc-ccCcccccccchHHHHHHhHhh Confidence 77777666665543332221 1223344567777653221 11111111 1223344444455 Q ss_pred HHHHHHHhhhccCCeecCCCCcccHHHHHHH---HHHhcChhHHHHHHHHHHhh--------CCeEEEEEeeCCCCceEE Q lcl|NC_021301. 62 MVRDSVADRIIPNGITVGGSADSDLALRARR---IWRDNRMDSVCKQWVKYGLD--------FGESYLTCWRRDDGTATI 130 (456) Q Consensus 62 ~iVd~~a~~l~~~~~~~~~~~d~~~~~~l~~---~~~~n~~~~~~~~~~~~a~~--------~G~a~~~v~~d~dg~~~i 130 (456) .++...+.|-..++-.. ...-.+....+++ +...+ ......+...... 
.|.-.+.+ .++..-+.+ T Consensus 94 yl~g~pv~~~~~~~~~~-~e~~~~~l~~~~~~n~~~~~~--~~~~~~~~~~G~a~~~~~~d~~~~~~~~~-i~p~~~~~v 169 (474) T protein:vir:94 94 YLHGVPVTYDLDENAEK-NEKLKKFITNFAIRNSVDDED--SEIGKMAAICGYGARLAYIDTNGDIRIKN-IDPYNVIFV 169 (474) T ss_pred heeccceeEeeCCCCcc-hHHHHHHHHHHHhhcCHhHHH--HHHHHHHhhcCeEEEEEEeCCCCeeEEEE-EcccceEEE Confidence 55544444443221110 0000011111110 10000 0111111111111 12211111 111111111 Q ss_pred E--EEccceeEEEEeC--CCCceEEEEEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccc Q lcl|NC_021301. 131 T--ADSPETMVVSVDP--LQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDA 206 (456) Q Consensus 131 ~--~~~p~~~~~~~d~--~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (456) . ...+..++..|.. .........+..|+. + . ...|..+..-.+...... ..+-+......-. T Consensus 170 ~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~-~--~--~~~~~~~~~~~~~~~~~~---------~~~~g~vPvv~~~ 235 (474) T protein:vir:94 170 GDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDN-A--Y--YYVFRGEGIDALQEVGRY---------EHLFDYNPLFGVP 235 (474) T ss_pred EcCCCceEEEEEEEEEeeCCCceEEEEEEEEcC-c--e--EEEEeecCCCcccccccc---------cCCCCccceEEec Confidence 0 0112222222321 122222223333321 1 1 112222111111100000 0000100000011 Q ss_pred cccCceeEEEEccCCCCCCcHhHHHHHHHHHHHHHH----HHHHHHHHhhchhhhhhcCCCcccccccccchhhhhhhhh Q lcl|NC_021301. 207 VVTGSPPPVVVYQNPDGMGEVEPHIDIINRINRAEL----QLLSTMAIQAFRQRALKSAGHGLPKVDENGNAIDYASIFE 282 (456) Q Consensus 207 ~~~~~~~pvv~~~n~~g~s~~~~v~~liDa~~~~~s----~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~~~~~~~~~~ 282 (456) ++..+.+ ....+.++++-++..-..++ ..++..-.... ..+...........|... ... 
T Consensus 236 n~~~g~s---------d~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g----~~~~~~~~~~~~~~~~i~----~~~ 298 (474) T protein:vir:94 236 NNKEMIG---------DAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRG----MGMSEEMIQETQKSGAFE----LFD 298 (474) T ss_pred CCCCCCC---------chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcc----CCCCchhhhhhhhcceeE----ecC Confidence 1111211 11123333333332221111 11111111000 011111111111111111 001 Q ss_pred hhccceeccCCCceeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhcccccCcHHH-HHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 283 AAPGALWELPPGVDIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAE-GAHNIEKGFLFKCEDRLSIAK 361 (456) Q Consensus 283 ~~~~~~~~~~~d~~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~~~N~Sg~-Al~~~~~~l~~k~~~~~~~f~ 361 (456) .+.+.-+. .++.+....+++++.++..++.++.++++....+++..++.+-. .+.........+.......+. T Consensus 299 ~~~~~~~l------~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 372 (474) T protein:vir:94 299 KDMDVKYL------TKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLR 372 (474) T ss_pred CCCceeEE------eccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111122 23444566799999999999999999999988887666665433 445555556677777778888 Q ss_pred HHHHHHHHHHHH-------hcCCCcccceeEEecCCCCcCHHHHHHH-------HHHHHhcCCCcHHHHHHhCCCChhHH Q lcl|NC_021301. 362 IGLEAILVKALQ-------IEGESVEDTVDVSFESPDRVTLGEKYAA-------ASLAKAAGESWASIRRNILNYNADQI 427 (456) Q Consensus 362 ~~l~~~~~l~~~-------~~~~~~~~~i~v~f~~~~~~~~~e~ad~-------~~kl~~~g~~s~~t~~~~~~~~~~~~ 427 (456) +.++-+++++.. .........+......+. ...++.+.. -+.+...+.+.. .+++. T Consensus 373 ~~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~-~e~a~~~~kl~g~iS~et~~~~l~~v~d---------~~~E~ 442 (474) T protein:vir:94 373 YQFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNK-LEESQVLINLKGQVSERTRLGQSQLVDD---------VDYEL 442 (474) T ss_pred HHHHHHHHHHhhccCCCCccccccceEEeCCCCCCCH-HHHHHHHHHHhccCchHHHHHhCCCCCC---------HHHHH Confidence 877776665421 111111111111111110 111111100 111112233322 12344 Q ss_pred HHHHHHH---HHHHHHHHhhhhhhhcccccCC Q lcl|NC_021301. 
428 KQDDLDR---AREQITLFAGNSVQRPQEDGSR 456 (456) Q Consensus 428 ~~~e~~~---~~ee~~~~~~~~~~~~~~d~~~ 456 (456) ++++.|+ .+...+...+.....++++.|- T Consensus 443 eri~~E~~e~~~~~~~~~~~~~~~~~~~~~s~ 474 (474) T protein:vir:94 443 DEMEKESLEFNDKLPDIDEGDANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHHHHhhcccccCCCcCCCCccccCC Confidence 4433322 2222222222222222222222 No 288 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=21.62 E-value=2.6 Score=18.33 Aligned_cols=432 Identities=12% Similarity=0.054 Sum_probs=150.2 Q ss_pred CCCCCHHHHHH--HHHHHHHHHHHHHHHHHHHhcccCcccccCcccchhhhhhhhhhccChHHHHHHHHHhhh-ccCCee Q lcl|NC_021301. 1 MTASTPAEWLP--VLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLMVRDSVADRI-IPNGIT 77 (456) Q Consensus 1 ~~~~t~~~~~~--~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~k~~~n~~~~iVd~~a~~l-~~~~~~ 77 (456) ||.-++-..+. .+... .+.+ -++++.|+.-..--.+-... ..++ +++. +.+-=|..++-. +-.|+. T Consensus 63 ~t~~~D~~~~g~~~~~~~--~~~p-r~R~qiY~~~eeM~~~p~Ia-----~Aln--iHVt-aALggde~TGd~vfI~p~~ 131 (569) T protein:vir:10 63 SGMAGDGLVDGSRFIFDE--VQLP-EDRLQRYPLLEEMAVYSTIA-----TALN--IHIT-HALSFDKKTGQTFSIVPVH 131 (569) T ss_pred cchhhhhHHHHHHHHhhh--ccCc-hhHHHHHHHHHHHhcCchhh-----hhhh--hhhh-eeecccccccceEEEEeec Confidence 55544443222 11111 1223 24444444332100000000 0111 1100 000112222222 223444 Q ss_pred cCCCCcccHHHHHHHHHHhc---ChhHHHHHHHHHHhhCCeEEEEEeeCCCCceEEEEEccceeEEEE-eCC-CCceEEE Q lcl|NC_021301. 78 VGGSADSDLALRARRIWRDN---RMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV-DPL-QPWRIRS 152 (456) Q Consensus 78 ~~~~~d~~~~~~l~~~~~~n---~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~i~~~~p~~~~~~~-d~~-~~~~~~~ 152 (456) -..+.+.+..+.+.+-.+.+ =+......++++++.||+||..+|.++.-. .+...-.+-..|.+ -|. 
...+..+ T Consensus 132 ~~~~a~~daakai~~el~~dl~~~iNr~~~~lA~~~~aFGdsYaRiY~~~~~G-V~dl~~s~yt~PsfIqpFE~g~~tvG 210 (569) T protein:vir:10 132 NGNDSDYDAAQALCGELMNDIGRTINKEVAGWAFIMSVFGVAYVRPYAKEGIG-ITSFECSYYTLPSFIKEFEVSGNLAG 210 (569) T ss_pred CCCCCcchHHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhheeeeccCCce-eEEEEecccccccccchhhhcCceEE Confidence 34445555554443322221 123556789999999999999999875422 22222222211111 111 1122333 Q ss_pred EEEEEEecCCceEEEEEEcCCeEEEEEEeeeecccccceeeccCCCceeecccccccCceeEEEEccCCCCCCcHh---- Q lcl|NC_021301. 153 AMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQNPDGMGEVE---- 228 (456) Q Consensus 153 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~n~~g~s~~~---- 228 (456) +.-.| ..+...+ ...-++-....+..+.+.... ....+++.....-...+. ...-|+.+ ...|.|-++ T Consensus 211 F~~~~-~~~~~~t-i~~l~p~qm~rmKmPrm~~i~-q~~~v~~g~~~~~L~~d~---~~~~Pi~p--sn~GgSFL~~ae~ 282 (569) T protein:vir:10 211 FSGDY-LKDASGK-MVFADPWAIIPMKIPYWRPKS-NLMPVHTGHKAYSLLDNP---EERTPIET--QNYGTSLLEYAYE 282 (569) T ss_pred eeccc-CCccccc-eeeechhhhhhhcccceeecc-ccchhhhhhhheeecccc---cccccccc--hhhhhHHHHHHHh Confidence 33222 2222211 122222222222211111000 000000000000000000 01112222 124555543 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhchhhhhhcCCCcccccccccc-------hhhhhhhhhhhccceec-----cC--CC Q lcl|NC_021301. 229 PHIDIINRINRAELQLLSTMAIQAFRQRALKSAGHGLPKVDENGN-------AIDYASIFEAAPGALWE-----LP--PG 294 (456) Q Consensus 229 ~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~-----~~--~d 294 (456) +...|+-|+.-..++.-+++-..++--+...|+++..-. +=.+. .-+.+.....+...++. 
+| .+ T Consensus 283 pf~~l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~-~y~r~lt~~LKr~~d~ie~a~~gg~~~~~~~~H~LPv~ge 361 (569) T protein:vir:10 283 PYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAA-DYSRTITQTLKRAADLMERRARGANNMPTVTNTLLPIMGD 361 (569) T ss_pred HHHHHHHHHHhccchhhHHHHHhHHhhccccCCCHHHHh-HHHHHHHHHHHHHHHHHHHHhccCccccccceeeeeeecC Confidence 456677777766666655555544433344444321100 00000 00000000000011111 11 11 Q ss_pred c------eeEeecccchHHHHHHHHHHHHHHHhhcCCChhhhccc---ccCc-HHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_021301. 295 V------DIWESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPD---SANQ-SAEGAHNIEKGFLFKCEDRLSIAKIGL 364 (456) Q Consensus 295 ~------~~~~~~~~~~~~~~~~l~~~~~~i~~~~~~p~~~~~~~---~~N~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l 364 (456) . +.+. ..++.-+..+. =..++++++..|+...++|-. ++.. -|-+++..-+.. .++.-.+....+.+ T Consensus 362 kq~~~tvDt~~-~~A~~~gIEdv-M~~~R~LagaLGlD~SMlGwAD~LsGGLGeGG~frtSaQaa-~RS~~iRqa~~e~i 438 (569) T protein:vir:10 362 GKGQMTIDTQT-IQADINGIEDI-LTYMRQLAAALGLDYTLLGWADQMSGGLGEGGFLRTAIQAA-MRASWIQQGVEEFI 438 (569) T ss_pred ccccccccccc-cccCcccHHHH-HHHHHHHHhhhccchhHhhHHHHhcccccccHHHHHHHHHH-HHHHHHHHHHHHHH Confidence 1 1111 12333334333 334567788889988887631 1111 233455444433 33444555666666 Q ss_pred HHHHHHHHHh--cCC--CcccceeEEecCCCC-------cCHHHHHHHHHHH-------HhcCCC--cHHHHH----HhC Q lcl|NC_021301. 365 EAILVKALQI--EGE--SVEDTVDVSFESPDR-------VTLGEKYAAASLA-------KAAGES--WASIRR----NIL 420 (456) Q Consensus 365 ~~~~~l~~~~--~~~--~~~~~i~v~f~~~~~-------~~~~e~ad~~~kl-------~~~g~~--s~~t~~----~~~ 420 (456) .+++.+=+.+ .+. ..++...|.|..... .+..+.++++... ..+.++ +..... 
+.+ T Consensus 439 n~iidiH~~fKYgevf~~~drP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~la~l~e~n~Lg~de~~m~y~l~d~~ 518 (569) T protein:vir:10 439 QRAIDIHLAFKYGKVYPEGDRPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQILDAVSNNSVLANSDAFKRYLFSDVL 518 (569) T ss_pred HHHHHHHhhhhcCcccCCCCcceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccHHHHHHHHHHHh Confidence 6766554433 332 345667888876532 2222233322222 221111 111111 223 Q ss_pred CCChhHHHHHHHHH---HHHHHHHHhhhhhhhccc----------ccCC Q lcl|NC_021301. 421 NYNADQIKQDDLDR---AREQITLFAGNSVQRPQE----------DGSR 456 (456) Q Consensus 421 ~~~~~~~~~~e~~~---~~ee~~~~~~~~~~~~~~----------d~~~ 456 (456) +++...-+.+-.+. -++|.-.+..-....|+| +|+. T Consensus 519 ~~De~~~e~l~ae~~akp~DEe~~~~~~~~~~~~~~~~~~~~~~~~~~~ 567 (569) T protein:vir:10 519 EIDEKISEALVNELKAKSEDDDHLMDSIIKTPPQELAQILESVFKEGND 567 (569) T ss_pred hcchhHHHHHHhhcCCCcchhHHHHHHHhcCChHHHHHHHHHHhhccCC Confidence 44332222211111 111111111111111111 1111 Done!