Query lcl|NC_010179.2_cdsid_YP_001642372.2 [gene=LJ771_030] [protein=portal protein] [protein_id=YP_001642372.2] [location=17721..19130]
Match_columns 469
No_of_seqs 135 out of 493
Neff 9.8
Searched_HMMs 1612
Date Thu Nov 7 13:06:42 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_30 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_30_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:105461 Length: 470 100.0 2E-115 1E-118 649.1 54.0 469 1-469 1-469 (470)
2 protein:vir:102950 Length: 471 100.0 8E-106 5E-109 596.7 53.8 464 1-469 1-469 (471)
3 protein:vir:102330 Length: 451 100.0 5E-100 3E-103 564.9 52.2 438 1-466 1-451 (451)
4 protein:vir:79043 Length: 479 100.0 4.4E-99 3E-102 559.8 53.0 457 1-469 19-478 (479)
5 protein:vir:97336 Length: 492 100.0 1.1E-98 7E-102 557.6 51.8 443 1-469 40-484 (492)
6 protein:vir:94805 Length: 492 100.0 9.3E-99 6E-102 558.0 51.4 443 1-469 40-484 (492)
7 protein:vir:1236 Length: 483 # 100.0 1.6E-98 1E-101 556.8 51.7 443 1-469 31-476 (483)
8 protein:vir:96266 Length: 474 100.0 1.1E-98 7E-102 557.6 49.1 442 1-469 23-472 (474)
9 protein:vir:95899 Length: 474 100.0 1.1E-98 7E-102 557.6 49.1 442 1-469 23-472 (474)
10 protein:vir:97171 Length: 512 100.0 6.7E-98 4E-101 553.3 52.0 444 1-469 36-499 (512)
11 protein:vir:99781 Length: 511 100.0 6.7E-98 4E-101 553.3 50.6 441 1-469 40-502 (511)
12 protein:vir:96240 Length: 511 100.0 1.9E-97 1E-100 550.9 52.3 441 1-469 40-498 (511)
13 protein:vir:9306 Length: 511 # 100.0 2.2E-97 1E-100 550.5 52.0 441 1-469 40-502 (511)
14 protein:vir:96366 Length: 511 100.0 1.8E-97 1E-100 550.9 51.4 441 1-469 40-498 (511)
15 protein:vir:78805 Length: 511 100.0 1.8E-97 1E-100 550.9 51.4 441 1-469 40-498 (511)
16 protein:vir:107112 Length: 478 100.0 1.7E-97 1E-100 551.1 51.0 449 1-469 20-476 (478)
17 protein:vir:103951 Length: 511 100.0 2.6E-97 2E-100 550.1 52.0 441 1-469 40-502 (511)
18 protein:vir:96839 Length: 474 100.0 1.6E-97 1E-100 551.3 50.1 447 1-469 22-470 (474)
19 protein:vir:78083 Length: 537 100.0 2.8E-97 2E-100 549.9 51.0 460 1-469 8-516 (537)
20 protein:vir:3609 Length: 452 # 100.0 8.4E-97 5E-100 547.3 52.7 426 1-469 17-445 (452)
21 protein:vir:105292 Length: 478 100.0 3.9E-97 2E-100 549.1 50.7 449 1-469 2-471 (478)
22 protein:vir:94498 Length: 474 100.0 4.1E-97 3E-100 549.0 50.5 442 1-469 21-466 (474)
23 protein:vir:97447 Length: 474 100.0 4.1E-97 3E-100 549.0 50.5 442 1-469 21-466 (474)
24 protein:vir:106571 Length: 499 100.0 8.7E-97 5E-100 547.2 51.9 440 1-469 5-478 (499)
25 protein:vir:93747 Length: 472 100.0 1.2E-96 8E-100 546.4 51.8 443 1-469 20-466 (472)
26 protein:vir:3964 Length: 453 # 100.0 3.6E-96 2.2E-99 543.9 51.6 430 1-469 11-446 (453)
27 protein:vir:96179 Length: 468 100.0 2.8E-96 1.8E-99 544.4 50.9 444 1-469 22-466 (468)
28 protein:vir:94546 Length: 506 100.0 1.8E-96 1.1E-99 545.5 49.4 438 1-469 22-496 (506)
29 protein:vir:5961 Length: 503 # 100.0 1.6E-95 1E-98 540.3 51.8 454 1-469 25-485 (503)
30 protein:vir:9871 Length: 429 # 100.0 2.3E-95 1.4E-98 539.4 52.4 426 1-469 1-429 (429)
31 protein:vir:95113 Length: 474 100.0 1.3E-95 7.8E-99 540.9 50.3 442 1-469 23-466 (474)
32 protein:vir:4898 Length: 502 # 100.0 9.3E-95 5.8E-98 536.1 50.5 435 1-469 37-489 (502)
33 protein:vir:2732 Length: 501 # 100.0 1.9E-94 1.2E-97 534.3 51.4 435 1-469 36-497 (501)
34 protein:vir:733 Length: 453 # 100.0 4E-94 2.5E-97 532.6 50.0 426 1-469 17-451 (453)
35 protein:vir:96494 Length: 501 100.0 6.1E-94 3.8E-97 531.6 50.9 435 1-469 36-488 (501)
36 protein:vir:99522 Length: 470 100.0 5.4E-93 3.4E-96 526.4 53.2 433 1-469 25-470 (470)
37 protein:vir:106639 Length: 481 100.0 1.1E-92 6.7E-96 524.8 53.3 437 1-469 30-480 (481)
38 protein:vir:105889 Length: 474 100.0 1.2E-92 7.4E-96 524.5 49.0 439 1-469 1-467 (474)
39 protein:vir:94101 Length: 474 100.0 1.2E-92 7.4E-96 524.5 49.0 439 1-469 1-467 (474)
40 protein:vir:95806 Length: 440 100.0 2.5E-92 1.6E-95 522.7 48.3 425 9-469 1-440 (440)
41 protein:vir:9922 Length: 489 # 100.0 1.1E-90 7.1E-94 513.7 50.0 437 1-469 15-483 (489)
42 protein:vir:2500 Length: 501 # 100.0 7E-76 4.3E-79 432.6 44.6 447 1-469 23-484 (501)
43 protein:vir:78537 Length: 480 100.0 9.5E-76 5.9E-79 431.9 43.5 431 1-469 1-466 (480)
44 protein:vir:4223 Length: 486 # 100.0 3.3E-75 2E-78 428.9 44.1 426 1-469 9-476 (486)
45 protein:vir:78227 Length: 480 100.0 5.3E-75 3.3E-78 427.8 43.9 430 1-469 1-466 (480)
46 protein:vir:2427 Length: 485 # 100.0 2E-74 1.2E-77 424.7 44.2 427 1-469 8-476 (485)
47 protein:vir:104082 Length: 485 100.0 2.2E-74 1.4E-77 424.4 43.8 426 1-469 9-471 (485)
48 protein:vir:105819 Length: 456 100.0 3.9E-74 2.4E-77 423.0 42.6 444 1-464 1-456 (456)
49 protein:vir:102602 Length: 456 100.0 3.9E-74 2.4E-77 423.0 42.6 444 1-464 1-456 (456)
50 protein:vir:80680 Length: 441 100.0 4.4E-73 2.8E-76 417.2 45.4 421 1-469 1-440 (441)
51 protein:vir:99072 Length: 479 100.0 5.2E-73 3.2E-76 416.9 42.0 426 1-469 9-461 (479)
52 protein:vir:7987 Length: 456 # 100.0 2.2E-72 1.4E-75 413.4 42.8 440 1-464 1-456 (456)
53 protein:vir:2341 Length: 488 # 100.0 3.8E-72 2.4E-75 412.1 44.1 425 1-469 7-483 (488)
54 protein:vir:7768 Length: 484 # 100.0 3.2E-72 2E-75 412.6 42.4 423 1-469 11-471 (484)
55 protein:vir:38 Length: 496 # N 100.0 9.6E-67 6E-70 382.5 44.8 446 1-469 17-496 (496)
56 protein:vir:80959 Length: 499 100.0 1.6E-64 1E-67 370.3 48.6 450 2-469 1-499 (499)
57 protein:vir:99916 Length: 504 100.0 4E-65 2.5E-68 373.7 43.6 429 1-469 18-483 (504)
58 protein:vir:98444 Length: 434 100.0 5.3E-65 3.3E-68 373.0 38.7 405 37-469 1-428 (434)
59 protein:vir:9751 Length: 422 # 100.0 1.2E-62 7.2E-66 360.1 39.2 401 1-451 1-422 (422)
60 protein:vir:94742 Length: 409 100.0 3.1E-61 1.9E-64 352.3 39.9 389 1-438 1-409 (409)
61 protein:vir:8184 Length: 474 # 100.0 2.5E-60 1.6E-63 347.3 42.6 429 1-469 12-474 (474)
62 protein:vir:9568 Length: 410 # 100.0 5.9E-61 3.6E-64 350.8 38.1 390 14-453 1-410 (410)
63 protein:vir:79703 Length: 505 100.0 8.5E-59 5.2E-62 339.0 49.5 446 1-467 1-505 (505)
64 protein:vir:1587 Length: 508 # 100.0 7.1E-59 4.4E-62 339.4 45.1 449 1-469 1-508 (508)
65 protein:vir:1634 Length: 409 # 100.0 9.7E-60 6E-63 344.1 39.2 389 1-438 1-409 (409)
66 protein:vir:9815 Length: 500 # 100.0 4.1E-56 2.5E-59 324.2 44.3 447 1-464 1-500 (500)
67 protein:vir:3028 Length: 500 # 100.0 4.1E-56 2.5E-59 324.2 44.3 447 1-464 1-500 (500)
68 protein:vir:4782 Length: 522 # 100.0 4.7E-53 2.9E-56 307.5 47.7 452 1-469 1-521 (522)
69 protein:vir:78907 Length: 518 100.0 1.5E-50 9.1E-54 293.8 43.9 460 1-466 1-518 (518)
70 protein:vir:98883 Length: 517 100.0 6.1E-50 3.8E-53 290.4 43.8 456 1-469 1-517 (517)
71 protein:vir:101494 Length: 527 100.0 1.1E-45 7.1E-49 267.0 34.9 447 1-469 19-517 (527)
72 protein:vir:102239 Length: 527 100.0 1.3E-45 8.3E-49 266.6 34.8 447 1-469 19-517 (527)
73 protein:vir:7430 Length: 563 # 100.0 3.4E-41 2.1E-44 242.4 34.0 451 1-469 1-539 (563)
74 protein:vir:94956 Length: 452 100.0 4E-28 2.5E-31 170.8 35.7 431 1-468 1-452 (452)
75 protein:vir:97265 Length: 513 99.9 3.6E-26 2.2E-29 160.1 34.3 439 1-469 1-491 (513)
76 protein:vir:95149 Length: 501 99.9 1.8E-24 1.1E-27 150.7 38.0 444 1-469 1-501 (501)
77 protein:vir:78393 Length: 489 99.9 6.2E-23 3.8E-26 142.3 34.9 440 1-469 2-488 (489)
78 protein:vir:80453 Length: 535 99.9 8.1E-22 5.1E-25 136.2 38.0 435 1-469 32-532 (535)
79 protein:vir:95014 Length: 491 99.9 9.2E-22 5.7E-25 135.9 34.0 445 1-469 2-491 (491)
80 protein:vir:96783 Length: 488 99.9 1E-20 6.3E-24 130.2 37.1 425 1-455 14-488 (488)
81 protein:vir:93630 Length: 776 99.8 3.4E-20 2.1E-23 127.3 31.1 451 1-469 38-649 (776)
82 protein:vir:108295 Length: 711 99.8 4.5E-19 2.8E-22 121.1 35.5 452 1-469 26-659 (711)
83 protein:vir:9950 Length: 714 # 99.8 5E-16 3.1E-19 104.4 39.8 446 1-469 16-612 (714)
84 protein:vir:2764 Length: 714 # 99.8 5E-16 3.1E-19 104.4 39.8 446 1-469 16-612 (714)
85 protein:vir:3296 Length: 714 # 99.8 5E-16 3.1E-19 104.4 39.8 446 1-469 16-612 (714)
86 protein:vir:10117 Length: 714 99.8 5E-16 3.1E-19 104.4 39.8 446 1-469 16-612 (714)
87 protein:vir:817 Length: 714 # 99.8 5E-16 3.1E-19 104.4 39.8 446 1-469 16-612 (714)
88 protein:vir:104437 Length: 714 99.8 3.7E-16 2.3E-19 105.2 38.6 446 1-469 20-612 (714)
89 protein:vir:105619 Length: 772 99.7 1.8E-17 1.1E-20 112.4 30.8 452 1-469 22-633 (772)
90 protein:vir:8846 Length: 705 # 99.7 1.1E-14 6.5E-18 97.2 34.9 433 1-469 10-617 (705)
91 protein:vir:80040 Length: 461 99.6 3E-16 1.9E-19 105.7 25.2 408 1-464 2-461 (461)
92 protein:vir:77597 Length: 725 99.6 7E-15 4.4E-18 98.2 29.0 451 1-469 6-607 (725)
93 protein:vir:100920 Length: 725 99.6 2.1E-14 1.3E-17 95.6 30.2 452 1-469 6-607 (725)
94 protein:vir:9263 Length: 725 # 99.5 6.3E-14 3.9E-17 92.9 27.6 451 1-469 6-607 (725)
95 protein:vir:105429 Length: 708 99.5 1.5E-13 9.2E-17 90.9 27.6 458 1-469 1-640 (708)
96 protein:vir:105520 Length: 706 99.5 2.6E-12 1.6E-15 84.1 33.3 456 1-469 1-639 (706)
97 protein:vir:3520 Length: 720 # 99.5 4.5E-12 2.8E-15 82.8 33.7 455 1-469 1-627 (720)
98 protein:vir:95449 Length: 584 99.5 5.6E-13 3.5E-16 87.8 28.2 435 1-467 3-584 (584)
99 protein:vir:172 Length: 708 # 99.4 2.8E-12 1.7E-15 84.0 29.6 454 1-469 4-640 (708)
100 protein:vir:80165 Length: 651 99.4 2.4E-11 1.5E-14 78.8 38.0 432 1-469 16-624 (651)
101 protein:vir:5249 Length: 437 # 99.3 8.4E-12 5.2E-15 81.3 25.6 382 21-469 1-437 (437)
102 protein:vir:79538 Length: 502 99.3 2E-10 1.2E-13 73.8 35.3 417 1-469 1-501 (502)
103 protein:vir:95821 Length: 763 99.3 9.7E-11 6E-14 75.5 29.4 439 1-469 27-654 (763)
104 protein:vir:3139 Length: 599 # 99.3 5E-11 3.1E-14 77.0 26.5 444 1-469 1-597 (599)
105 protein:vir:3420 Length: 533 # 99.2 4.9E-10 3E-13 71.6 36.5 424 1-469 1-526 (533)
106 protein:vir:6382 Length: 553 # 99.2 5.2E-10 3.2E-13 71.5 36.4 397 1-469 46-553 (553)
107 protein:vir:107742 Length: 537 99.2 3.4E-10 2.1E-13 72.5 28.1 394 1-469 68-523 (537)
108 protein:vir:96738 Length: 505 99.2 7.4E-10 4.6E-13 70.6 36.2 427 1-468 11-505 (505)
109 protein:vir:107662 Length: 427 99.2 1.6E-10 9.9E-14 74.3 25.2 380 10-469 1-426 (427)
110 protein:vir:104338 Length: 422 99.1 3.5E-10 2.1E-13 72.5 25.6 377 21-468 1-422 (422)
111 protein:vir:96068 Length: 765 99.1 3.3E-10 2E-13 72.6 25.1 404 1-469 71-524 (765)
112 protein:vir:79647 Length: 435 99.1 2.3E-10 1.4E-13 73.4 24.1 377 1-469 5-435 (435)
113 protein:vir:94049 Length: 532 99.1 2.5E-09 1.5E-12 67.8 29.9 412 1-469 15-511 (532)
114 protein:vir:389 Length: 530 # 99.1 2.5E-09 1.6E-12 67.7 37.5 422 1-469 1-524 (530)
115 protein:vir:102668 Length: 547 99.1 3.4E-09 2.1E-12 67.0 35.8 437 1-469 1-541 (547)
116 protein:vir:99563 Length: 862 99.0 1.7E-09 1.1E-12 68.7 24.1 405 1-469 98-549 (862)
117 protein:vir:95315 Length: 559 99.0 1E-08 6.3E-12 64.4 33.3 447 1-469 1-542 (559)
118 protein:vir:95542 Length: 548 98.9 1.1E-08 7E-12 64.2 33.8 431 1-469 1-515 (548)
119 protein:vir:10321 Length: 495 98.9 1.6E-08 1E-11 63.3 36.9 426 1-469 1-495 (495)
120 protein:vir:94599 Length: 641 98.9 1.8E-08 1.1E-11 63.1 30.5 454 1-469 20-600 (641)
121 protein:vir:107822 Length: 555 98.7 9.7E-08 6E-11 59.0 38.9 443 1-469 1-541 (555)
122 protein:vir:107404 Length: 555 98.7 9.7E-08 6E-11 59.0 38.9 443 1-469 1-541 (555)
123 protein:vir:98506 Length: 555 98.7 9.7E-08 6E-11 59.0 38.9 443 1-469 1-541 (555)
124 protein:vir:7321 Length: 556 # 98.6 1.6E-07 9.6E-11 57.9 36.9 447 1-469 1-541 (556)
125 protein:vir:103765 Length: 549 98.4 7.7E-07 4.8E-10 54.1 34.1 440 1-469 1-545 (549)
126 protein:vir:3361 Length: 535 # 98.4 1E-06 6.2E-10 53.5 41.0 430 1-469 9-523 (535)
127 protein:vir:9359 Length: 348 # 98.3 1.3E-06 7.8E-10 52.9 26.9 313 79-469 1-347 (348)
128 protein:vir:2198 Length: 536 # 98.3 1.4E-06 8.6E-10 52.7 36.6 428 1-469 8-534 (536)
129 protein:vir:99232 Length: 526 98.3 1.4E-06 8.9E-10 52.6 33.2 372 1-469 40-446 (526)
130 protein:vir:1538 Length: 535 # 98.3 1.5E-06 9.4E-10 52.5 40.4 427 1-469 9-521 (535)
131 protein:vir:10447 Length: 536 98.2 2.1E-06 1.3E-09 51.7 36.4 428 1-469 8-534 (536)
132 protein:vir:94709 Length: 522 98.2 2.3E-06 1.4E-09 51.5 40.3 424 1-469 7-519 (522)
133 protein:vir:79233 Length: 526 98.2 2.6E-06 1.6E-09 51.2 34.0 390 1-469 12-446 (526)
134 protein:vir:1986 Length: 512 # 98.2 3.1E-06 1.9E-09 50.8 33.5 370 1-469 40-439 (512)
135 protein:vir:103860 Length: 528 98.1 4.5E-06 2.8E-09 49.9 33.0 373 1-469 40-448 (528)
136 protein:vir:3153 Length: 467 # 98.1 4.5E-06 2.8E-09 49.9 31.8 368 52-469 1-441 (467)
137 protein:vir:99853 Length: 488 98.1 5.3E-06 3.3E-09 49.5 29.4 376 1-469 1-408 (488)
138 protein:vir:8883 Length: 543 # 98.1 5.8E-06 3.6E-09 49.3 37.5 427 1-469 9-518 (543)
139 protein:vir:3843 Length: 397 # 98.0 7E-06 4.3E-09 48.9 28.5 369 1-469 1-397 (397)
140 protein:vir:63755 Length: 547 98.0 7.6E-06 4.7E-09 48.7 26.3 411 1-469 1-522 (547)
141 protein:vir:79063 Length: 491 97.9 1.1E-05 7.1E-09 47.7 28.5 387 1-469 13-455 (491)
142 protein:vir:107880 Length: 491 97.8 1.5E-05 9.4E-09 47.0 32.3 385 1-469 13-419 (491)
143 protein:vir:1380 Length: 422 # 97.7 2.3E-05 1.4E-08 46.0 27.9 392 1-469 1-422 (422)
144 protein:vir:102080 Length: 429 97.7 2.9E-05 1.8E-08 45.5 28.0 376 1-469 1-429 (429)
145 protein:vir:4454 Length: 414 # 97.5 5.8E-05 3.6E-08 43.8 29.6 375 1-469 1-411 (414)
146 protein:vir:105782 Length: 449 97.5 6.1E-05 3.8E-08 43.7 21.5 400 1-469 1-441 (449)
147 protein:vir:80644 Length: 551 97.4 7.7E-05 4.8E-08 43.1 29.6 411 1-469 5-525 (551)
148 protein:vir:103330 Length: 517 97.4 8.6E-05 5.3E-08 42.9 36.0 416 1-466 1-517 (517)
149 protein:vir:1785 Length: 555 # 97.3 8.9E-05 5.5E-08 42.8 33.7 425 1-469 1-533 (555)
150 protein:vir:102118 Length: 409 97.3 9.9E-05 6.1E-08 42.5 28.6 370 9-468 1-409 (409)
151 protein:vir:94426 Length: 409 97.3 9.9E-05 6.2E-08 42.5 28.1 373 1-469 1-408 (409)
152 protein:vir:93943 Length: 409 97.3 0.00011 6.6E-08 42.4 27.2 373 1-469 1-408 (409)
153 protein:vir:94572 Length: 535 97.3 0.00011 6.7E-08 42.3 38.7 426 1-469 1-521 (535)
154 protein:vir:107605 Length: 432 97.3 0.00011 7.1E-08 42.2 29.7 381 1-469 1-432 (432)
155 protein:vir:105002 Length: 432 97.3 0.00011 7.1E-08 42.2 29.7 381 1-469 1-432 (432)
156 protein:vir:102855 Length: 432 97.3 0.00011 7.1E-08 42.2 29.7 381 1-469 1-432 (432)
157 protein:vir:81152 Length: 411 97.2 0.00012 7.3E-08 42.1 29.3 371 1-468 1-411 (411)
158 protein:vir:79772 Length: 648 97.2 0.00012 7.5E-08 42.0 32.8 394 1-469 34-495 (648)
159 protein:vir:8418 Length: 409 # 97.2 0.00012 7.6E-08 42.0 29.0 376 1-469 1-409 (409)
160 protein:vir:1266 Length: 416 # 97.2 0.00014 8.7E-08 41.7 29.4 378 8-469 1-415 (416)
161 protein:vir:96980 Length: 409 97.2 0.00014 8.8E-08 41.7 27.6 378 1-469 1-408 (409)
162 protein:vir:2683 Length: 412 # 97.0 0.0002 1.2E-07 40.9 29.3 377 1-469 1-411 (412)
163 protein:vir:4598 Length: 416 # 96.9 0.00025 1.5E-07 40.4 32.0 368 32-469 1-416 (416)
164 protein:vir:81095 Length: 416 96.9 0.00025 1.5E-07 40.4 32.0 368 32-469 1-416 (416)
165 protein:vir:100039 Length: 522 96.9 0.00025 1.6E-07 40.3 37.7 417 1-469 1-515 (522)
166 protein:vir:483 Length: 413 # 96.9 0.00027 1.7E-07 40.2 28.5 375 8-469 1-410 (413)
167 protein:vir:78696 Length: 542 96.9 0.00027 1.7E-07 40.2 39.5 422 3-469 1-540 (542)
168 protein:vir:96988 Length: 516 96.9 0.00029 1.8E-07 40.0 36.5 419 1-469 11-512 (516)
169 protein:vir:7017 Length: 515 # 96.8 0.00034 2.1E-07 39.6 38.3 419 1-469 10-511 (515)
170 protein:vir:100150 Length: 437 96.8 0.00035 2.2E-07 39.5 31.0 384 13-469 1-436 (437)
171 protein:vir:4952 Length: 386 # 96.8 0.00036 2.2E-07 39.5 31.8 363 1-469 1-385 (386)
172 protein:vir:103219 Length: 201 96.7 1.4E-05 9E-09 47.1 7.1 170 271-468 1-201 (201)
173 protein:vir:106716 Length: 698 96.7 0.00039 2.4E-07 39.3 20.6 373 1-469 92-542 (698)
174 protein:vir:78589 Length: 695 96.7 0.0004 2.5E-07 39.2 20.9 382 1-469 92-545 (695)
175 protein:vir:3648 Length: 695 # 96.7 0.00042 2.6E-07 39.1 20.6 386 1-469 92-545 (695)
176 protein:vir:101541 Length: 694 96.6 0.00044 2.7E-07 39.0 20.7 386 1-469 91-544 (694)
177 protein:vir:4156 Length: 542 # 96.6 0.00045 2.8E-07 38.9 31.6 397 9-469 1-469 (542)
178 protein:vir:4509 Length: 424 # 96.6 0.00045 2.8E-07 38.9 24.7 381 9-469 1-424 (424)
179 protein:vir:1884 Length: 424 # 96.6 0.0005 3.1E-07 38.7 27.8 380 1-466 1-424 (424)
180 protein:vir:189 Length: 424 # 96.5 0.00057 3.5E-07 38.4 28.6 380 1-466 1-424 (424)
181 protein:vir:9408 Length: 441 # 96.5 0.00059 3.6E-07 38.3 32.0 386 1-469 11-441 (441)
182 protein:vir:79984 Length: 441 96.5 0.00059 3.6E-07 38.3 32.0 386 1-469 11-441 (441)
183 protein:vir:102727 Length: 945 96.3 0.00075 4.6E-07 37.7 30.1 395 1-469 65-528 (945)
184 protein:vir:104500 Length: 537 96.2 0.00085 5.3E-07 37.4 27.2 381 1-469 44-510 (537)
185 protein:vir:80796 Length: 574 96.2 0.00087 5.4E-07 37.4 31.0 413 1-469 14-525 (574)
186 protein:vir:98396 Length: 441 96.2 0.00089 5.5E-07 37.3 30.0 400 1-469 1-441 (441)
187 protein:vir:7407 Length: 392 # 96.1 0.001 6.3E-07 37.0 30.0 363 1-469 3-389 (392)
188 protein:vir:99672 Length: 532 96.1 0.001 6.3E-07 37.0 38.0 424 1-469 9-532 (532)
189 protein:vir:5737 Length: 419 # 96.0 0.0012 7.3E-07 36.6 29.2 369 25-469 1-412 (419)
190 protein:vir:80134 Length: 403 95.9 0.0013 7.8E-07 36.5 26.4 367 1-469 1-403 (403)
191 protein:vir:95378 Length: 406 95.9 0.0013 8.3E-07 36.3 26.4 363 1-469 1-406 (406)
192 protein:vir:1326 Length: 457 # 95.9 0.0013 8.3E-07 36.3 28.6 385 24-469 1-446 (457)
193 protein:vir:93610 Length: 454 95.8 0.0014 8.6E-07 36.2 31.5 379 27-469 1-440 (454)
194 protein:vir:78641 Length: 278 95.8 0.0014 8.9E-07 36.2 25.3 257 79-405 1-278 (278)
195 protein:vir:101648 Length: 518 95.7 0.0017 1E-06 35.8 31.4 383 1-469 1-432 (518)
196 protein:vir:4828 Length: 382 # 95.6 0.0017 1.1E-06 35.7 30.7 357 1-469 1-381 (382)
197 protein:vir:77981 Length: 448 95.6 0.0018 1.1E-06 35.7 29.2 357 1-469 41-428 (448)
198 protein:vir:98816 Length: 446 95.5 0.002 1.3E-06 35.3 24.6 396 1-441 1-446 (446)
199 protein:vir:4854 Length: 386 # 95.3 0.0024 1.5E-06 34.9 30.0 353 1-469 1-385 (386)
200 protein:vir:3989 Length: 392 # 95.2 0.0026 1.6E-06 34.7 31.5 363 9-469 1-390 (392)
201 protein:vir:1023 Length: 392 # 95.2 0.0026 1.6E-06 34.7 31.5 363 9-469 1-390 (392)
202 protein:vir:7853 Length: 518 # 94.8 0.0034 2.1E-06 34.1 29.9 384 1-469 1-432 (518)
203 protein:vir:100882 Length: 383 94.5 0.0043 2.7E-06 33.6 29.4 349 1-469 1-383 (383)
204 protein:vir:4194 Length: 540 # 94.4 0.0045 2.8E-06 33.5 31.2 393 1-469 6-454 (540)
205 protein:vir:6240 Length: 457 # 94.3 0.0048 2.9E-06 33.3 31.3 385 1-469 1-448 (457)
206 protein:vir:79511 Length: 448 94.0 0.0056 3.5E-06 32.9 28.2 365 1-469 41-439 (448)
207 protein:vir:105641 Length: 516 93.9 0.0061 3.8E-06 32.7 38.9 416 1-469 11-512 (516)
208 protein:vir:108215 Length: 469 93.7 0.0067 4.2E-06 32.5 32.1 396 1-469 1-453 (469)
209 protein:vir:1431 Length: 419 # 93.2 0.0083 5.1E-06 32.0 29.0 368 12-469 1-413 (419)
210 protein:vir:100691 Length: 535 93.2 0.0083 5.1E-06 32.0 33.5 368 1-469 70-515 (535)
211 protein:vir:81072 Length: 432 93.0 0.009 5.6E-06 31.8 30.4 384 1-469 1-432 (432)
212 protein:vir:80333 Length: 419 93.0 0.0093 5.8E-06 31.7 27.8 364 1-469 1-408 (419)
213 protein:vir:960 Length: 413 # 92.9 0.0095 5.9E-06 31.7 23.8 367 1-466 1-413 (413)
214 protein:vir:105064 Length: 421 92.9 0.0095 5.9E-06 31.7 30.4 376 8-469 1-416 (421)
215 protein:vir:4995 Length: 384 # 92.4 0.012 7.2E-06 31.2 33.5 360 1-469 1-383 (384)
216 protein:vir:95965 Length: 385 92.0 0.013 8.3E-06 30.8 21.8 342 1-469 1-385 (385)
217 protein:vir:103177 Length: 533 91.5 0.015 9.6E-06 30.5 27.7 387 1-469 43-517 (533)
218 protein:vir:78942 Length: 510 91.0 0.018 1.1E-05 30.2 34.4 415 6-466 1-510 (510)
219 protein:vir:104892 Length: 558 90.4 0.021 1.3E-05 29.8 27.8 409 1-469 7-526 (558)
220 protein:vir:99452 Length: 651 90.4 0.021 1.3E-05 29.8 24.7 433 1-469 14-539 (651)
221 protein:vir:95599 Length: 563 90.3 0.022 1.4E-05 29.7 27.9 400 1-469 23-526 (563)
222 protein:vir:99312 Length: 563 90.3 0.022 1.4E-05 29.7 27.9 400 1-469 23-526 (563)
223 protein:vir:9702 Length: 406 # 89.9 0.024 1.5E-05 29.5 28.7 367 1-469 1-405 (406)
224 protein:vir:100187 Length: 385 89.1 0.028 1.8E-05 29.1 29.6 352 1-468 1-385 (385)
225 protein:vir:78161 Length: 355 87.5 0.039 2.4E-05 28.3 23.1 285 122-469 1-335 (355)
226 protein:vir:10362 Length: 432 85.2 0.055 3.4E-05 27.5 31.1 383 1-469 1-432 (432)
227 protein:vir:3868 Length: 417 # 84.0 0.064 4E-05 27.1 29.7 362 28-469 1-415 (417)
228 protein:vir:5665 Length: 511 # 83.4 0.069 4.3E-05 27.0 23.5 382 1-462 53-511 (511)
229 protein:vir:6322 Length: 510 # 82.8 0.073 4.6E-05 26.8 38.1 414 6-464 1-510 (510)
230 protein:vir:4337 Length: 434 # 82.7 0.075 4.7E-05 26.7 29.9 387 1-469 1-434 (434)
231 protein:vir:345 Length: 663 # 81.7 0.083 5.2E-05 26.5 28.1 428 1-469 1-541 (663)
232 protein:vir:97060 Length: 432 81.7 0.083 5.2E-05 26.5 30.6 381 1-469 1-427 (432)
233 protein:vir:1082 Length: 359 # 80.7 0.092 5.7E-05 26.3 30.3 331 1-434 1-359 (359)
234 protein:vir:106282 Length: 521 79.3 0.11 6.6E-05 25.9 25.5 383 1-469 47-520 (521)
235 protein:vir:100598 Length: 516 79.0 0.11 6.8E-05 25.9 26.6 379 1-469 58-515 (516)
236 protein:vir:98567 Length: 340 78.5 0.11 7.1E-05 25.8 20.0 272 32-409 1-340 (340)
237 protein:vir:6896 Length: 523 # 76.9 0.13 8.1E-05 25.4 24.7 388 1-469 41-521 (523)
238 protein:vir:103458 Length: 524 75.8 0.14 8.8E-05 25.2 24.8 380 1-469 67-522 (524)
239 protein:vir:100328 Length: 346 74.5 0.16 9.7E-05 25.0 26.8 286 1-410 33-346 (346)
240 protein:vir:7208 Length: 524 # 74.5 0.16 9.8E-05 25.0 24.7 380 1-469 67-522 (524)
241 protein:vir:106999 Length: 564 73.6 0.17 0.0001 24.8 25.8 403 1-469 47-537 (564)
242 protein:vir:6596 Length: 521 # 72.2 0.19 0.00012 24.6 28.4 386 1-469 61-519 (521)
243 protein:vir:6210 Length: 394 # 70.7 0.21 0.00013 24.4 27.3 360 1-469 1-393 (394)
244 protein:vir:9507 Length: 395 # 69.1 0.23 0.00014 24.1 23.9 349 1-469 1-393 (395)
245 protein:vir:100650 Length: 395 69.1 0.23 0.00014 24.1 23.9 349 1-469 1-393 (395)
246 protein:vir:101289 Length: 395 69.1 0.23 0.00014 24.1 23.9 349 1-469 1-393 (395)
247 protein:vir:101806 Length: 516 66.1 0.27 0.00017 23.7 25.8 379 1-469 62-515 (516)
248 protein:vir:101189 Length: 516 66.1 0.27 0.00017 23.7 25.8 379 1-469 62-515 (516)
249 protein:vir:96579 Length: 576 64.4 0.3 0.00019 23.4 30.7 386 1-469 52-521 (576)
250 protein:vir:100249 Length: 431 60.3 0.38 0.00023 22.9 32.2 361 1-469 1-426 (431)
251 protein:vir:81017 Length: 521 59.9 0.38 0.00024 22.9 28.5 381 1-469 61-519 (521)
252 protein:vir:101647 Length: 460 56.9 0.45 0.00028 22.5 30.4 392 1-469 15-460 (460)
253 protein:vir:98265 Length: 524 52.5 0.55 0.00034 22.0 27.7 382 1-469 65-523 (524)
254 protein:vir:94666 Length: 723 51.4 0.58 0.00036 21.9 29.7 369 34-469 1-443 (723)
255 protein:vir:104259 Length: 403 45.4 0.77 0.00048 21.2 26.9 366 1-469 1-403 (403)
256 protein:vir:78310 Length: 376 43.9 0.83 0.00051 21.0 25.2 339 1-465 1-376 (376)
257 protein:vir:108049 Length: 524 39.6 1 0.00063 20.6 25.3 391 1-469 45-523 (524)
258 protein:vir:5691 Length: 344 # 34.3 1.3 0.00081 20.0 23.9 281 1-410 35-344 (344)
259 protein:vir:81218 Length: 423 34.2 1.3 0.00081 19.9 27.9 369 1-469 1-422 (423)
260 protein:vir:6058 Length: 344 # 33.1 1.4 0.00086 19.8 25.1 280 1-409 35-344 (344)
261 protein:vir:78749 Length: 337 31.7 1.5 0.00092 19.7 24.0 275 32-408 1-337 (337)
262 protein:vir:267 Length: 348 # 30.6 1.6 0.00097 19.5 27.7 291 1-412 30-348 (348)
263 protein:vir:8100 Length: 466 # 29.5 1.7 0.001 19.4 27.9 393 1-469 1-465 (466)
264 protein:vir:5839 Length: 533 # 28.2 1.8 0.0011 19.2 24.9 399 1-469 10-479 (533)
265 protein:vir:80211 Length: 514 25.8 2 0.0012 18.9 36.2 412 1-463 1-514 (514)
266 protein:vir:78191 Length: 351 23.0 2.4 0.0015 18.5 26.2 286 1-412 40-351 (351)
267 protein:vir:1150 Length: 350 # 22.9 2.4 0.0015 18.5 24.4 279 1-408 43-350 (350)
268 protein:vir:103971 Length: 376 22.4 2.5 0.0015 18.4 27.5 285 1-418 65-376 (376)

No 1
>protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # 
Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=2.3e-115 Score=649.06 Aligned_cols=469 Identities=94% Similarity=1.385 Sum_probs=451.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+++.++++|++++.+|.+++++|+++++||.|+|+|++++..++.....+.....+++++||++||++.||++.++||| T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 80 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRIPSNFYQLLVDQEAGYVA 80 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCcccccchHHHHHHhhhhhee Confidence 99999999999999999999999999999999999999999988888888888888899999999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+|++++++.++.+++++++++.+.+.++++.++++|++|+++|+|++|++++++++|.+++|+|+++...+++++| T Consensus 81 G~p~~~~~~d~~~~~~l~~~~~~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~a~i 160 (470) T protein:vir:10 81 SVFPDIDVGKDADNKKIIDVLGDDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPDQITPIYATTLDNKLLGIL 160 (470) T ss_pred ccceeeecCchHHHHHHHHHHhhhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999889999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 
161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+.++.+..+.+++|++..+++|...........++...+........+....+..+|+||+||||+|+||+.|+ T Consensus 161 r~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~ 240 (470) T protein:vir:10 161 RSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRL 240 (470) T ss_pred EEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCCCC Confidence 99999999998888999999999999999999988888888888888888888888889999999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.+++|++|+++++|+.+++.+++...+..++++.++..+++.+++++|++++++.+++ T Consensus 241 sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~ 320 (470) T protein:vir:10 241 PELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLHQFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEAR 320 (470) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccchhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHH Confidence 99999999999999999999999999999999999998888888999999999999999888889999999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEe Q lcl|NC_010179. 
321 DDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHW 400 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f 400 (469) ++++++|.++||.+|++|++++.++||+||+||+++++++.+||+++++.|+++|++++++|+++++.++.++.+++++| T Consensus 321 ~~~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~~~~d~~~i~i~f 400 (470) T protein:vir:10 321 DDALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHW 400 (470) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccceeeEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999889999999999 Q ss_pred CCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 401 TRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 401 ~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++++|.|+++.++++++++|+||+||+++++|+++|+++|++||++|+++..+.++++++..++|+||| T Consensus 401 ~~~~p~d~~e~~~~~~~~~g~iS~et~l~~~p~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~~~dde 469 (470) T protein:vir:10 401 TRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDKEENDPYSNQADELNGKGVNDE 469 (470) T ss_pred ccCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhccccccCCCCCCCC Confidence 999999999999999999999999999999999999999999999999999999999888888888888 No 2 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=8.1e-106 Score=596.73 Aligned_cols=464 Identities=41% Similarity=0.692 Sum_probs=409.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhc----ccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVS----KEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~----~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) |+++.+.++|.+++.+|++++++++++++||+|+|+|++++....... .........++++||++||++.||++.+ T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~ 80 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQKK 80 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHhhh Confidence 999999999999999999999999999999999999998764432221 1122234456889999999999999999 Q ss_pred HhhhcCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcC-CCceEEEEEccceeEEEEeCCCCCc Q lcl|NC_010179. 77 GYIASVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDE-DNNFRYGIIQPDQITPVYATTLDNK 155 (469) Q Consensus 77 ~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~-~~~~~i~~~~p~~~~~~~d~~~~~~ 155 (469) +|+||+||++++++++.++.++.++++++.+.+.++++.++++|++|+++|+++ +|++++.+++|.+++|+||++...+ T Consensus 81 ~yl~G~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~~~~~p~~~~~i~d~~~~~~ 160 (471) T protein:vir:10 81 AYALTYPPTFDVDDKKVNDMIVDVLGDDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRYACVDSKEVIPIYSKSLDKK 160 (471) T ss_pred hhhcccCceeccCChHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEEEEEcccceEEEEcCCCCCc Confidence 999999999999999999999999988888889999999999999999999985 6999999999999999999988889 Q ss_pred eEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC Q lcl|NC_010179. 
156 LLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK 235 (469) Q Consensus 156 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 235 (469) +++++|+|...+..+.....++++|++..+++|..............................+..+|+||+||||+|+| T Consensus 161 ~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~iPvv~~~n 240 (471) T protein:vir:10 161 SIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRSSDNSFKHDFGLVPFIPFKN 240 (471) T ss_pred eEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCCceeEEEecc Confidence 99999999988878888888999999999999988766544333222222222233345566778899999999999999 Q ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecC Q lcl|NC_010179. 236 NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDI 315 (469) Q Consensus 236 ~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 315 (469) +..|.|+|+++++|||+||.++|++++.+++|++|+++++|+.++..+++...+..++++.++..+.+.+++++|++++. T Consensus 241 ~~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~ 320 (471) T protein:vir:10 241 NEIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQDKQEFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDI 320 (471) T ss_pred CCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhHHHhhcCCeEEecCCCCccCccceEEeecC Confidence 99999999999999999999999999999999999999999988888888889999999999988888889999999999 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCccc Q lcl|NC_010179. 316 PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRH 395 (469) Q Consensus 316 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~ 395 (469) +.+++++++++|.++|+.+|++|++++.++||+||+||+++++++.+||+++++.|+++|++++++|+.+++.. 
++.+ T Consensus 321 ~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~--d~~~ 398 (471) T protein:vir:10 321 PTEARNLILERTKKQIFISGQGVNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLS--DKLK 398 (471) T ss_pred ChHHHHHHHHHHHHHHHHHhCCcCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--CCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999998865 4678 Q ss_pred ceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 396 ISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 396 i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++|+|++++|.|+++.++++++++|+||+||+++++|+++|+++|++||++|+++..... ....+.+.++| T Consensus 399 i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~D~~~E~eri~~E~~~~~~~~---~~~~~~~~~~e 469 (471) T protein:vir:10 399 IKQTWTRNSINNDTEMAQVVSTLATITSRENVAKSNPIVEDWQDELRLQKAEQEGRSEKL---YDMEEVEHESE 469 (471) T ss_pred eEEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcc---cccCCCCCccc Confidence 899999999999999999999999999999999999999999999999999987664433 23333334444 No 3 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=5.2e-100 Score=564.89 Aligned_cols=438 Identities=34% Similarity=0.621 Sum_probs=381.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+++.|.++|++ |..++++|+++++||.|+|+|+.+..... 
........++++|+++||+++||++.++||| T Consensus 1 l~~~~i~~~i~~----~~~~~~r~~~~~~YY~g~~~i~~~~~~~~----~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~ 72 (451) T protein:vir:10 1 MELEKIRAIISA----DAARRQEILQAKSYYYNKNDILKKGVVVQ----NRDENPLRNADNRISHNFHEILVDEKASYMF 72 (451) T ss_pred CCHHHHHHHHHH----HHHHHHHHHHHHHHhcccCcccccccccc----ccccccccccccccccchHHHHHHhhhhhee Confidence 999988887755 56678899999999999999998764432 1222344578899999999999999999999 Q ss_pred cCCeeeccCch-hhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCC--------CceEEEEEccceeEEEEeCC Q lcl|NC_010179. 81 SVFPDIDVGKD-ADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDED--------NNFRYGIIQPDQITPVYATT 151 (469) Q Consensus 81 g~p~~~~~~~~-~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~--------~~~~i~~~~p~~~~~~~d~~ 151 (469) |+||+|+++++ ...+.++.++++++.+.+.++++.++++|+||+++|++++ |++++++++|++++|+|+++ T Consensus 73 G~p~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~ 152 (451) T protein:vir:10 73 TYPVLFDIDNNKELNEKVTDVLGNEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNG 152 (451) T ss_pred cccceeecCCcHHHHHHHHHHhccCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCC Confidence 99999998664 4556677777767778889999999999999999999986 78899999999999999998 Q ss_pred CCCceEEEEEEEEeeecCCc----eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCc Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAG----KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGR 227 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 227 (469) ...++.++||+|...+.++. ....++++||+..+++|...+... 
.....+.+..+|+||+ T Consensus 153 ~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~g~ 216 (451) T protein:vir:10 153 IERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSC----------------CGSQIEHITVQHRFNS 216 (451) T ss_pred CCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCc----------------cccccccccccCCCCe Confidence 88899999999987766543 345688999999999887654332 1123445677899999 Q ss_pred ccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCc Q lcl|NC_010179. 228 VPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSG 307 (469) Q Consensus 228 vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (469) ||||+|+||+.|.|+|+++++|||+||.++|++++.+++|++|+++++|+.++..+++...++.++++.+...+++.+++ T Consensus 217 vPvv~~~nn~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 296 (451) T protein:vir:10 217 VPFVEFSNNIKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGGEDTSEFLKELKRYKTIKTETDSEGDSGG 296 (451) T ss_pred eeEEEeccCCCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhhHHHHhhCCeEEecCcCCccCCc Confidence 99999999999999999999999999999999999999999999999999888888888999999999999888888899 Q ss_pred ceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010179. 
308 VDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLN 387 (469) Q Consensus 308 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~ 387 (469) ++||+++.+.+++++++++|.++||.+|++|++++.++||+||+||+++++++.+||+++++.|+++|++++++|+++++ T Consensus 297 ~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~ 376 (451) T protein:vir:10 297 LKTMQIEIPTEARKIILEILKKQIYESGQGLQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFLG 376 (451) T ss_pred ceEEeecCCHHHHHHHHHHHHHHHHHHhCcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred ccCCCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCC Q lcl|NC_010179. 388 FSDADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGV 466 (469) Q Consensus 388 ~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~ 466 (469) .. ++.+++++|++++|.|+++.++++++++|++|+||+++++|+++|+++|++++++|+++...... +..++-++ T Consensus 377 ~~--d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~p~v~d~~~e~~~~~ee~~~~~~~~~--~~~~~~~~ 451 (451) T protein:vir:10 377 VT--DYKKIQQTYTRNMMSNDLEDADIATKSVGIIPTKIILRHHPWVDDVEEAEKLYLEEKKIQASKVS--DDYNNFTE 451 (451) T ss_pred CC--CccceeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH--hhcCCCCC Confidence 65 57789999999999999999999999999999999999999999999999999888766544332 22222221 No 4 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=4.4e-99 Score=559.81 Aligned_cols=457 Identities=30% Similarity=0.505 Sum_probs=397.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) -+.+++.++|.+++.+| ++++++++++||+|+|+|++++..... .........++++|+++||+++||++.++|++ T Consensus 19 ~~~~~~~~~i~~~~~~~--~~~~~~~~~~yy~g~~~i~~~~~~~~~--~~~~~~~~~~~~~ki~~~~~~~Ivd~~~~~l~ 94 (479) T protein:vir:79 19 ESTINLVKVIEHYILKH--RPEKYKQGEEYYYGNTDVNNKRRYYLL--DGAKVDDFTKVNNKAINNYHKLLVDQKVGYSV 94 (479) T ss_pred CChhHHHHHHHHHHhhh--hHHHHHHHHHHhccCCccccccccccc--ccccccccccCcceeecchHHHHHHHHHhhhh Confidence 34456778888887776 578899999999999999988754332 23445566789999999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++..++.++.|+++++.+.+.++++.++++|++|+++|++++|++++++++|.+++|+||++...++.+++ T Consensus 95 g~p~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i 174 (479) T protein:vir:79 95 GNPIVFNADDDNLTKLLNDLLGEEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEEAIPIWDSKRQRELVAFI 174 (479) T ss_pred cCCceeccCCHHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccceeEEEEeCCCCCceEEEE Confidence 99999999999998988888887778889999999999999999999999999999999999999999998888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+.++... .++++|+++.+++|...++.+..... 
....................+|+||.||||+|+|++.|+ T Consensus 175 r~y~~~~~~~~~~-~~~e~y~~~~i~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~ 252 (479) T protein:vir:79 175 RFYYIEDIDGNKI-KRVEYYTENDITYFIERGNSFIQEFL-YDEYGKMTDIQEGHFRINNKEQGWGKVPFIPFKNNEKCV 252 (479) T ss_pred EEEEEeecCCceE-EEEEEEeCCcEEEEEecCCccccccc-ccccccccccccccccccccccCCCcccEEEecCCCCCC Confidence 9999887776654 46899999999999887765432211 111122222223344566789999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.++++++|+++++|++++..+++...++..+++.++++ ++++|++++++.+++ T Consensus 253 sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~l~~~~~~~~~ 327 (479) T protein:vir:79 253 SDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTSLQEFIDNIRYYKSIKVDGG-----GGVDKLEINIPVEAK 327 (479) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccccchhhhhhccceecCCC-----CcceEEeccCCHHHH Confidence 999999999999999999999999999999999999887777777788888888887543 569999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---CCcccce Q lcl|NC_010179. 
321 DDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD---ADKRHIS 397 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~---~~~~~i~ 397 (469) ++++++|.++|+.+|++|++++.++||+||+||+++++++.+||+++++.|+++|++++++++++++..+ .+..+++ T Consensus 328 ~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~ 407 (479) T protein:vir:79 328 KELLDRLEKNIIIFGQGVNPESQNTGDKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYDYKTVQ 407 (479) T ss_pred HHHHHHHHHHHHHHhCccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccce Confidence 9999999999999999999999989999999999999999999999999999999999999999987754 5678899 Q ss_pred EEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 398 QHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 398 i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) |+|++++|.|+++.++++++++|+||+||+++++|+++|+++|++||++|+++..+..+.... ..+++.|| T Consensus 408 i~f~~~~p~~~~~~a~~~~kl~g~iS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~e 478 (479) T protein:vir:79 408 ITFNHSMIINEAEKIDMAAKSTGIVSDETIVSNHPWVEDVNDELERLKKQEDTQKEYDDLIPN-NQDGVIDE 478 (479) T ss_pred EEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhccCc-ccCCCcCc Confidence 999999999999999999999999999999999999999999999999999988877666654 55666666 No 5 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=1.1e-98 Score=557.63 Aligned_cols=443 Identities=27% Similarity=0.427 Sum_probs=381.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) -+.+...++|.+++.+|..++++++++.+||+|+|+|++++..... .......++++|+++||+++||++.++||+ T Consensus 40 ~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~----~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~ 115 (492) T protein:vir:97 40 NKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA----TGAVDPLKPDDRMITNFHANLVDQKVSYIV 115 (492) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccc----cccccccccccccccchHHHHHHHHhhhhc Confidence 3445678889999999999999999999999999999887653221 112334578899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++...+.|++++++++.+.+.+++++++++|+||+++|++++|++++++++|.+++|+||++..+++.+++ T Consensus 116 g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~v 195 (492) T protein:vir:97 116 GKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI 195 (492) T ss_pred ccCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999988888889999999999999999999999999999999999999999988888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+. ..+++|++..+++|...++..... 
.............+|+||.||||+|+|++.|+ T Consensus 196 r~~~~~~~------~~~~~y~~~~v~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~ 259 (492) T protein:vir:97 196 RMYKLENE------TKVEYWDKVTVNYYVYENGSLIPD----------YSNNLENSKTHFSTGSWGKIPFIPFKNNDLEI 259 (492) T ss_pred EEEeeccc------eeEEEEecCeEEEEEEecCeeeec----------ccccccccccccccCCCCCcceEEecCCCCCC Confidence 99986432 256899999999988765543221 11112233456779999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.++++++|+++++|+..++.+++...++.++++.++.+ ++++|++++.+.+++ T Consensus 260 sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~l~~~~~~~~~ 334 (492) T protein:vir:97 260 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSDN-----GGVDTIQVEVPVENS 334 (492) T ss_pred CchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccchhHHHHHhhccceecCCC-----CcceeEeccCCHHHH Confidence 999999999999999999999999999999999999988877788888888888887543 569999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+.+++.+. 
++.+++++ T Consensus 335 ~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~-~~~~i~v~ 413 (492) T protein:vir:97 335 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDIS 413 (492) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-ccceeeEE Confidence 9999999999999999999998887 679999999999999999999999999999999999999998754 67889999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc-ccCCCCCCCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA-DELNGKGVDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~-~~~~~~~~~de 469 (469) |++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++.....+.. +...+.+.++| T Consensus 414 f~~~~p~~~~e~a~~~~kl~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~ 484 (492) T protein:vir:97 414 FNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQTEYNKQLPNLDDGGADSAQQQE 484 (492) T ss_pred ecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCcccc Confidence 99999999999999999999999999999999999999999999999987665433221 11111111111 No 6 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=9.3e-99 Score=558.02 Aligned_cols=443 Identities=27% Similarity=0.428 Sum_probs=382.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) =+.+.+.++|.+++.+|..+++|++++.+||+|+|+|+.++..... . 
......++++|+++||+++||++.++|++ T Consensus 40 ~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~~~---~-~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~ 115 (492) T protein:vir:94 40 NKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA---T-GAVDPLKPDDRMITNFHANLVDQKVSYIV 115 (492) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc---c-ccccccccccccccchHHHHHHHHHhhhc Confidence 2335667889999999999999999999999999999887654321 1 12334578899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++...+.|++++++++.+.+.+++++++++|++|+++|.+++|++++++++|.+++|+||++..+++++++ T Consensus 116 G~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~a~~~G~a~~~v~~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~i 195 (492) T protein:vir:94 116 GKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI 195 (492) T ss_pred ccCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999988888889999999999999999999999999999999999999999998888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+. ..+++|++..+++|.......... 
.............+|+||.||||+|+||+.|+ T Consensus 196 r~~~~~~~------~~~~~y~~~~v~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~ 259 (492) T protein:vir:94 196 RMYKLENE------TKVEYWDKVTVNYYVYENGSLIPD----------YSNNLENSKTHFSTGSWGKIPFIPFKNNDLEI 259 (492) T ss_pred EEEeeccc------eeEEEEecCeEEEEEEecCeeeec----------cccccccccccccccCCCccceEEecCCCCCC Confidence 99986432 256899999999988765443221 11222334556778999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.+++|++|+++++|++.++.+++...+...+++.++. +++++|++++.+.+++ T Consensus 260 sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~l~~~~~~~~~ 334 (492) T protein:vir:94 260 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDDQELPEFKRLLRYYGAIKVSD-----NGGVDTIQVEVPVENS 334 (492) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhHHHHhhccceecCC-----CCcceeEeccCCHHHH Confidence 99999999999999999999999999999999999998887778888888888887754 3579999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 
321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) +.++++|.+.|+.+|++|++++..+ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++.++ +..+++|+ T Consensus 335 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~-~~~~i~v~ 413 (492) T protein:vir:94 335 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDIS 413 (492) T ss_pred HHHHHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc-ccceeeEE Confidence 9999999999999999999998887 679999999999999999999999999999999999999998755 67789999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhh-cccCCCCCCCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQ-ADELNGKGVDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~-~~~~~~~~~~de 469 (469) |++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++....... .+...+.+.++| T Consensus 414 f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~ 484 (492) T protein:vir:94 414 FNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADSAQQQE 484 (492) T ss_pred ecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCcccc Confidence 9999999999999999999999999999999999999999999999998766543322 222222222222 No 7 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=1.6e-98 Score=556.80 Aligned_cols=443 Identities=27% Similarity=0.437 Sum_probs=382.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) -+-+.+.++|.+++.+|..++++|+++.+||+|+|+|+.++..... .......++++|+++||+++||++.++||+ T Consensus 31 ~~~e~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~----~~~~~~~~~~~ki~~n~~k~Ivd~~~~~l~ 106 (483) T protein:vir:12 31 NKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDA----TGAVDPLKPDDRMITNFHANLVDQKVSYIV 106 (483) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccc----cccccccccccccccchHHHHHHHHhhhhc Confidence 3445678899999999999999999999999999999987653221 112345578899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++...+.|++|+++++.+.+.++++.++++|++|+++|+|++|++++++++|.+++|+||++...++++++ T Consensus 107 G~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~~v~d~~~~~~~~~~i 186 (483) T protein:vir:12 107 GKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI 186 (483) T ss_pred ccCceeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999998888889999999999999999999999999999999999999999998888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+. ..+++|++..+++|...+...... 
.............+|+||.||||+|+|++.|+ T Consensus 187 r~~~~~~~------~~~~~y~~~~v~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~ 250 (483) T protein:vir:12 187 RMYKLENE------TKVEYWDKVTVNYYVYENGSLIPD----------YSNNLENSKTHFSTGSWGKIPFIPFKNNDLEI 250 (483) T ss_pred EEEEeecc------eEEEEEecCeEEEEEEeCCeeeec----------ccccccccccccccCCCCccceEEecCCCCCC Confidence 99986432 257899999999987665433221 11112234456778999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.+++|++|+++++|++.+..+++...++.++++.++.+ ++++|++++.+.+++ T Consensus 251 sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~l~~~~~~~~~ 325 (483) T protein:vir:12 251 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDN-----GGVDTIQVEVPVENS 325 (483) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhHHHhhhhccccccCCC-----CcceEEeecCCHHHH Confidence 999999999999999999999999999999999999988877788888888888877543 579999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.+.||.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++.+. 
++.+++|+ T Consensus 326 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~-~~~~i~v~ 404 (483) T protein:vir:12 326 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDIS 404 (483) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-ccceeeEE Confidence 9999999999999999999998886 679999999999999999999999999999999999999998754 67889999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhccc--CCCCCCCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADE--LNGKGVDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~--~~~~~~~de 469 (469) |++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++.......... ..+.+.+++ T Consensus 405 f~~~~p~~~~~~a~~~~kl~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~ 476 (483) T protein:vir:12 405 FNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQER 476 (483) T ss_pred eCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccccCCcccCCC Confidence 9999999999999999999999999999999999999999999999998776544322111 111111111 No 8 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=1.1e-98 Score=557.64 Aligned_cols=442 Identities=26% Similarity=0.421 Sum_probs=381.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) .+.+.+.++|.+++.+|..++++++++.+||+|+|+|++++...... 
......++++||++||+++||++.++||| T Consensus 23 ~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~----~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 98 (474) T protein:vir:96 23 PKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLH----GNIDYTKPDWRITTNFHQNLVDQKVSYVA 98 (474) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhc----ccccccccccccccchHHHHHHhhhhhhc Confidence 44557788999999999999999999999999999999886542211 12234568899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++..++.+++|+++++.+.+.++++.++++|++|+++|++++|++++++++|.+++|+||++...++++++ T Consensus 99 g~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~i 178 (474) T protein:vir:96 99 GKPVTYAHDDDKVLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFI 178 (474) T ss_pred ccCceeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999988888899999999999999999999999999999999999999999998889999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|.... ..++++|++..+++|...++..... .............+|+||.||||+|+|++.|. 
T Consensus 179 r~~~~~~------~~~~~vy~~~~i~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~ 242 (474) T protein:vir:96 179 RIFTFNG------ETKVEYWTAETVTYYVYENGGLIPD----------FYYGDEHIQTHFSTGSWERVPFIAFKNNPEEV 242 (474) T ss_pred EEEeecC------eeEEEEEeCCeEEEEEEcCCceeec----------cccccccccCcccccCCCccceEEecCCCCCC Confidence 9987431 2467999999999998776543221 11222334556678999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.+++|++|++|++|+.+++.+++...+..++++.++.+ ++++|++++.+.+++ T Consensus 243 ~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~l~~~~~~~~~ 317 (474) T protein:vir:96 243 SDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSD-----GGVETIQVEVPVAST 317 (474) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCC-----CceeEEeccCCHHHH Confidence 999999999999999999999999999999999999988877888888888888887543 579999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.++||.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+++++. 
..++.+++++ T Consensus 318 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~-~~d~~~i~i~ 396 (474) T protein:vir:96 318 KEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI-KLDAKEIEIT 396 (474) T ss_pred HHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcccceeeEE Confidence 9999999999999999999998886 5799999999999999999999999999999999999999875 4578899999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCC-------CCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKG-------VDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~-------~~de 469 (469) |++++|.|+++.++++++ +|++|+||+++++|+++|+++|++||++|+++..+........+.++ .++| T Consensus 397 f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 472 (474) T protein:vir:96 397 FNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQ 472 (474) T ss_pred ecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccc Confidence 999999999999999876 59999999999999999999999999999887655443222111111 1111 No 9 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=1.1e-98 Score=557.64 Aligned_cols=442 Identities=26% Similarity=0.421 Sum_probs=381.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) .+.+.+.++|.+++.+|..++++++++.+||+|+|+|++++...... 
......++++||++||+++||++.++||| T Consensus 23 ~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~----~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 98 (474) T protein:vir:95 23 PKVETQEEMIIRLINNHKQKLKDINVGQKYYDKDNDINYQAYKQDLH----GNIDYTKPDWRITTNFHQNLVDQKVSYVA 98 (474) T ss_pred ccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccchhhhc----ccccccccccccccchHHHHHHhhhhhhc Confidence 44557788999999999999999999999999999999886542211 12234568899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++..++.+++|+++++.+.+.++++.++++|++|+++|++++|++++++++|.+++|+||++...++++++ T Consensus 99 g~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~a~i 178 (474) T protein:vir:95 99 GKPVTYAHDDDKVLDVIHQVLDTRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAEQAIPIWTDKEREQLNAFI 178 (474) T ss_pred ccCceeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999988888899999999999999999999999999999999999999999998889999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|.... ..++++|++..+++|...++..... .............+|+||.||||+|+|++.|. 
T Consensus 179 r~~~~~~------~~~~~vy~~~~i~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~ 242 (474) T protein:vir:95 179 RIFTFNG------ETKVEYWTAETVTYYVYENGGLIPD----------FYYGDEHIQTHFSTGSWERVPFIAFKNNPEEV 242 (474) T ss_pred EEEeecC------eeEEEEEeCCeEEEEEEcCCceeec----------cccccccccCcccccCCCccceEEecCCCCCC Confidence 9987431 2467999999999998776543221 11222334556678999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.+++|++|++|++|+.+++.+++...+..++++.++.+ ++++|++++.+.+++ T Consensus 243 ~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~l~~~~~~~~~ 317 (474) T protein:vir:95 243 SDIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLSEFMEGLKYYKAINVSSD-----GGVETIQVEVPVAST 317 (474) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCcccccchhhhhhccceeeccCC-----CceeEEeccCCHHHH Confidence 999999999999999999999999999999999999988877888888888888887543 579999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.++||.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+++++. 
..++.+++++ T Consensus 318 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~-~~d~~~i~i~ 396 (474) T protein:vir:95 318 KEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKI-KLDAKEIEIT 396 (474) T ss_pred HHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcccceeeEE Confidence 9999999999999999999998886 5799999999999999999999999999999999999999875 4578899999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCC-------CCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKG-------VDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~-------~~de 469 (469) |++++|.|+++.++++++ +|++|+||+++++|+++|+++|++||++|+++..+........+.++ .++| T Consensus 397 f~~~~p~~~~e~a~~~~~-~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 472 (474) T protein:vir:95 397 FNFNVMVNDLEQSQIGAQ-SQYLSKETLVRHHPWVDDPKAELERLDEEQLELNKQLPNLDDGGADGAQQQQQSENNQ 472 (474) T ss_pred ecCCCccCHHHHHHHHHH-cCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCCCcCCCCccc Confidence 999999999999999876 59999999999999999999999999999887655443222111111 1111 No 10 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=6.7e-98 Score=553.32 Aligned_cols=444 Identities=16% Similarity=0.157 Sum_probs=367.6 Q ss_pred CCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR-NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) -++....+.|.+++.+| ..+.++|+++.+||.|+|+|+.+.... 
....++++|+++||+++||++.++|+ T Consensus 36 ~~~~~~~~~i~~~i~~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~---------~~~~~~~~ki~~n~~k~Ivd~~~~yl 106 (512) T protein:vir:97 36 SDLLQNINEVSKYIEHHMDYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYF 106 (512) T ss_pred hhhhhhHHHHHHHHHHHHHhhHHHHHHHHHHhcccCccccccCcc---------cccccCcceeecchHHHHHHHHhhhh Confidence 01111223445555555 346789999999999999998765432 23346788999999999999999999 Q ss_pred hcCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) +|+||++++++++.++.|+++|+.| +...+.++++.++++|++|+++|++++|++++++++|.+++|+||++...++++ T Consensus 107 ~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~~~~~iyd~~~~~~~~~ 186 (512) T protein:vir:97 107 LGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIA 186 (512) T ss_pred cccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEE Confidence 9999999999999999999999887 456788999999999999999999999999999999999999999988889999 Q ss_pred EEEEEEeeecCCc--eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCC Q lcl|NC_010179. 159 VLRSYKQLDPEAG--KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN 236 (469) Q Consensus 159 ~v~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~ 236 (469) +||+|.....++. ....++++|+++.+++|...+...... 
........+|+||.||||+|+|+ T Consensus 187 ~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~---------------~~~~~~~~~~~~g~vPvv~~~nn 251 (512) T protein:vir:97 187 GVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL---------------TPRENGFESHSFERMPITEFSNN 251 (512) T ss_pred EEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccc---------------cccccccccccCcccceEeecCC Confidence 9999988766543 456788999999999988765443221 12345677899999999999999 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecc---------cCCCCCCc Q lcl|NC_010179. 237 KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINN---------AGNGDKSG 307 (469) Q Consensus 237 ~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~ 307 (469) +.|.|+|+++++|||+||.++|++++.++++++|+++++|+...+..+. ......+++.+.. .+.+++++ T Consensus 252 ~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 330 (512) T protein:vir:97 252 ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV-RKQKEANVLFLEPTVYENRDTGIETEGSVD 330 (512) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCchhh-hhhhhcccccccccchhhcccccCCCCCcc Confidence 9999999999999999999999999999999999999999765554433 2333333333321 12345688 Q ss_pred ceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010179. 
308 VDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYL 386 (469) Q Consensus 308 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~ 386 (469) ++|++++.+.+++++++++|.++|+.+|++|+++++++ ||+||+||+++++++.+||+++++.|+.+|++++++|++++ T Consensus 331 ~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~ 410 (512) T protein:vir:97 331 GGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETIL 410 (512) T ss_pred eEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999876 78999999999999999999999999999999999999988 Q ss_pred cccC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc-c Q lcl|NC_010179. 387 NFSD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD-E 460 (469) Q Consensus 387 ~~~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-~ 460 (469) +..+ .++.+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..+... . T Consensus 411 ~~~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~ 490 (512) T protein:vir:97 411 KNTRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYK 490 (512) T ss_pred HhcCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccC Confidence 6532 355689999999999999999999999999999999999999999999999999999988766554321 1 Q ss_pred CCCCCCCCC Q lcl|NC_010179. 
461 LNGKGVDDE 469 (469) Q Consensus 461 ~~~~~~~de 469 (469) .+++.+++| T Consensus 491 ~~~~~~~~~ 499 (512) T protein:vir:97 491 DPRDINDDE 499 (512) T ss_pred CCCCCCCCC Confidence 111111111 No 11 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=6.7e-98 Score=553.33 Aligned_cols=441 Identities=17% Similarity=0.174 Sum_probs=367.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.|+++|++.. ..++++|+++.+||.|+|+|+++.... ....++++||++||+++||++.++||+ T Consensus 40 ~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~i~~~~~~~---------~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:99 40 QNVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccHHHHHHHHHHHH---HhhHHHHHHHHHHhcccCccccccCcc---------cccccCcceeecchHHHHHHHHHhhhc Confidence 45556666655433 346789999999999999998765432 234467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||++++++++.++.|++||+.| +.+.+.++++.++++|++|+++|++++|++++++++|.+++|+||++...+++++ T Consensus 108 g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~~~~~~~~~~ 187 (511) T protein:vir:99 108 GNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAG 187 (511) T ss_pred ccCceeecCchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEE Confidence 999999999999999999999886 5567889999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCc--eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 160 LRSYKQLDPEAG--KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 160 v~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) ||+|.....++. ....++++|+++.+++|...+.+... .........+|+||.||||+|+|++ T Consensus 188 vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~---------------~~~~~~~~~~~~~g~vPvv~~~nn~ 252 (511) T protein:vir:99 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLK---------------LTPRENGFESHSFERMPITEFSNNE 252 (511) T ss_pred EEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCcccc---------------ccccccccccCCCCccceEEecCCC Confidence 999988765543 44568899999999998876543221 1223456778999999999999999 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeec--------ccCCCCCCcce Q lcl|NC_010179. 238 YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKIN--------NAGNGDKSGVD 309 (469) Q Consensus 238 ~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 309 (469) .|+|+|+++++|||+||.++|++++.+++|++|+++++|....+..... .....+++.+. 
..+...+++++ T Consensus 253 ~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 331 (511) T protein:vir:99 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVR-KQKEANVLFLEPTVYADSEGRETEGSVDGG 331 (511) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhc-ccccccceecccccccccccccCCCCccee Confidence 9999999999999999999999999999999999999997655443332 22223333321 12345578899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) ||+++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++. T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:99 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999998876 7899999999999999999999999999999999999999875 Q ss_pred cC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccC-- Q lcl|NC_010179. 389 SD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADEL-- 461 (469) Q Consensus 389 ~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~-- 461 (469) .+ .++..++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..+..... 
T Consensus 412 ~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:99 412 TRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDP 491 (511) T ss_pred cCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccC Confidence 43 34568999999999999999999999999999999999999999999999999999998766544332111 Q ss_pred ---CCCCCCCC Q lcl|NC_010179. 462 ---NGKGVDDE 469 (469) Q Consensus 462 ---~~~~~~de 469 (469) .++..+++ T Consensus 492 ~~~~~~~~~~~ 502 (511) T protein:vir:99 492 RNINDDEQDDS 502 (511) T ss_pred CCCCCCCCCCC Confidence 11111111 No 12 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=1.9e-97 Score=550.91 Aligned_cols=441 Identities=17% Similarity=0.169 Sum_probs=367.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.|+++|++.. ..+.+||+++.+||.|+|+|+.+.... ....++++|+++||+++||++.++||+ T Consensus 40 ~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~i~~~~~~~---------~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 40 QNVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccHHHHHHHHHHHH---HhhHHHHHHHHHHhcccCccccccCcC---------cccccCcceeecchHHHHHHHHHhhhc Confidence 55666666655543 345688999999999999998765432 234467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+|++++++.++.|+++|+.| +...+.++++.++++|++|+++|+|++|++++++++|.+++|+|+++...+++++ T Consensus 108 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd~~~~~~~~~ 187 (511) T protein:vir:96 108 GNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAG 187 (511) T ss_pred cCCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEE Confidence 999999999999999999999886 5567889999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCc--eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 160 LRSYKQLDPEAG--KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 160 v~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) ||+|.....++. ....++++|+++.+++|...+..+... ........+|+||.||||+|+|++ T Consensus 188 vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~vPvv~~~nn~ 252 (511) T protein:vir:96 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL---------------TPRENGFESHSFERMPITEFSNNE 252 (511) T ss_pred EEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccc---------------cccccccccccCCceeeEEecCCC Confidence 999988766543 445678999999999987765443221 123345678999999999999999 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeec--------ccCCCCCCcce Q lcl|NC_010179. 238 YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKIN--------NAGNGDKSGVD 309 (469) Q Consensus 238 ~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~ 309 (469) .|+|+|+++++|||+||.++|++++.++++++|+++++|....+..+... ....+++.+. 
..+.+.+++++ T Consensus 253 ~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (511) T protein:vir:96 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRK-QKEANVLFLEPTVYADSEGRETEGSVDGG 331 (511) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCchhhcc-cccccceecccccccccccccCCCCccee Confidence 99999999999999999999999999999999999999976554433322 2223333322 12345578899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) ||+++.+.+++++++++|.+.|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+++++. T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:96 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999999886 7899999999999999999999999999999999999998875 Q ss_pred cC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc-ccCC Q lcl|NC_010179. 389 SD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA-DELN 462 (469) Q Consensus 389 ~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~-~~~~ 462 (469) .+ .++.+++++|++++|.|.++.++++++++|+||+||+++++|+++|+++|++||++|+++..+..+.. .... 
T Consensus 412 ~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:96 412 TWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP 491 (511) T ss_pred hcCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC Confidence 42 35678999999999999999999999999999999999999999999999999999988765544321 1111 Q ss_pred CCCCCCC Q lcl|NC_010179. 463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) ....++| T Consensus 492 ~~~~~~~ 498 (511) T protein:vir:96 492 RDINDDE 498 (511) T ss_pred CCCCCCC Confidence 1111111 No 13 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=2.2e-97 Score=550.48 Aligned_cols=441 Identities=16% Similarity=0.165 Sum_probs=366.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.|.++|+++. ..++++|+++++||.|+|+|+.+.... ....++++|+++||+++||++.++||+ T Consensus 40 ~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~il~~~~~~---------~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:93 40 QNVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred ccHHHHHHHHHHHH---HhhHHHHHHHHHHhcccCccccccCcC---------cccccCcceeecchHHHHHHHHhhhhc Confidence 44555555544432 346789999999999999998766533 223467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||++++++++.++.|++||+.| +.+.+.++++.++++|+||++||++++|++++++++|.+++|+||++...+++++ T Consensus 108 g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~~~~~i~~~~p~~~~~vydd~~~~~~~~~ 187 (511) T protein:vir:93 108 GNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAG 187 (511) T ss_pred ccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEE Confidence 999999999999999999999887 4567889999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCc--eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 160 LRSYKQLDPEAG--KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 160 v~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) ||+|.....++. ....++++|+++.+++|...+...... .....+..+|+||.||||+|+|++ T Consensus 188 vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~---------------~~~~~~~~~~~~g~vPvv~~~nn~ 252 (511) T protein:vir:93 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL---------------TPRENGFESHSFERMPITEFSNNE 252 (511) T ss_pred EEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccc---------------ccccccccccCCCccceEEecCCC Confidence 999987665543 456788999999999988765443221 123345678999999999999999 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecc--------cCCCCCCcce Q lcl|NC_010179. 238 YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINN--------AGNGDKSGVD 309 (469) Q Consensus 238 ~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~ 309 (469) .|.|+|+++++|||+||.++|++++.+++|++|+++++|+...+..... .....+++.+.. 
.+..++++++ T Consensus 253 ~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (511) T protein:vir:93 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVR-KQKEANVLFLEPTVYADSEGRETEGSVDGG 331 (511) T ss_pred CCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccCchhhc-ccccccceecccccccccccccCCCCccee Confidence 9999999999999999999999999999999999999997655543332 222333333322 2345678999 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) ||+++.+.+++++++++|.+.|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++. T Consensus 332 ~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~ 411 (511) T protein:vir:93 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999998876 7899999999999999999999999999999999999998876 Q ss_pred cC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhh-cccCC Q lcl|NC_010179. 389 SD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQ-ADELN 462 (469) Q Consensus 389 ~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~-~~~~~ 462 (469) .+ .++.+++++|++++|.|.++.++++++++|+||+||+++++|+++|+++|++||++|+++..+..+. ..... 
T Consensus 412 ~~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:93 412 TWSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP 491 (511) T ss_pred ccCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCC Confidence 43 3456899999999999999999999999999999999999999999999999999998876554332 11111 Q ss_pred CCCCC----CC Q lcl|NC_010179. 463 GKGVD----DE 469 (469) Q Consensus 463 ~~~~~----de 469 (469) ++..+ ++ T Consensus 492 ~~~~~~~~~~~ 502 (511) T protein:vir:93 492 RDINDDEQDDD 502 (511) T ss_pred CCCCCCCCCCc Confidence 11111 11 No 14 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=1.8e-97 Score=550.94 Aligned_cols=441 Identities=16% Similarity=0.173 Sum_probs=367.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.|.++|++.. ..++++|+++.+||.|+|+|+.+.... ....++++|+++||+++||++.++||+ T Consensus 40 ~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~il~~~~~~---------~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:96 40 QNVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cCHHHHHHHHHHHH---HhhhHHHHHHHHHhhccCccccccCcc---------cccccCcceeecchHHHHHHHHhhhhc Confidence 56666666665543 335678999999999999998765432 234467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+|++++++.++.|+++|+.|. .+.+.++++.++++|++|+++|++++|++++++++|.+++|+||++...+++++ T Consensus 108 g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~ 187 (511) T protein:vir:96 108 GNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAG 187 (511) T ss_pred ccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 9999999999999999999998874 556789999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCc--eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 160 LRSYKQLDPEAG--KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 160 v~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) ||+|.....++. ....++++|+++.+++|...+...... .....+..+|+||.||||+|+|++ T Consensus 188 vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~---------------~~~~~~~~~~~~g~vPvv~~~n~~ 252 (511) T protein:vir:96 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKL---------------TPRENSFESHSFERMPITEFSNNE 252 (511) T ss_pred EEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccc---------------cccccccccCcCcccceEEecCCC Confidence 999987765543 455688999999999987765543221 123456779999999999999999 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeee--------cccCCCCCCcce Q lcl|NC_010179. 238 YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKI--------NNAGNGDKSGVD 309 (469) Q Consensus 238 ~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~ 309 (469) .|+|+|+++++|||+||.++|++++.++++++|++|++|....+.+.. 
......+++.+ ...+...+++++ T Consensus 253 ~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (511) T protein:vir:96 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV-RKQKEANVLFLEPTVYVDAEGRETEGSVDGG 331 (511) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhh-cccccccceeccccceeccccccCCCCccee Confidence 999999999999999999999999999999999999999755443322 22222222322 222345578899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) ||+++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++. T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:96 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999999886 7899999999999999999999999999999999999999875 Q ss_pred cC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc-ccCC Q lcl|NC_010179. 389 SD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA-DELN 462 (469) Q Consensus 389 ~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~-~~~~ 462 (469) .+ .++.+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..+.. .... 
T Consensus 412 ~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:96 412 TRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP 491 (511) T ss_pred cCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC Confidence 32 35678999999999999999999999999999999999999999999999999999988776654332 1111 Q ss_pred CCCCCCC Q lcl|NC_010179. 463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) .+..++| T Consensus 492 ~~~~~~~ 498 (511) T protein:vir:96 492 RDINDDE 498 (511) T ss_pred CCCCCCC Confidence 1211122 No 15 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=1.8e-97 Score=550.94 Aligned_cols=441 Identities=16% Similarity=0.173 Sum_probs=367.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.|.++|++.. ..++++|+++.+||.|+|+|+.+.... ....++++|+++||+++||++.++||+ T Consensus 40 ~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~il~~~~~~---------~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:78 40 QNVNEVSKYIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cCHHHHHHHHHHHH---HhhhHHHHHHHHHhhccCccccccCcc---------cccccCcceeecchHHHHHHHHhhhhc Confidence 56666666665543 335678999999999999998765432 234467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+|++++++.++.|+++|+.|. .+.+.++++.++++|++|+++|++++|++++++++|.+++|+||++...+++++ T Consensus 108 g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd~~~~~~~~~ 187 (511) T protein:vir:78 108 GNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDNTVERNSIAG 187 (511) T ss_pred ccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcCCCCCceEEE Confidence 9999999999999999999998874 556789999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCc--eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 160 LRSYKQLDPEAG--KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 160 v~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) ||+|.....++. ....++++|+++.+++|...+...... .....+..+|+||.||||+|+|++ T Consensus 188 vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~---------------~~~~~~~~~~~~g~vPvv~~~n~~ 252 (511) T protein:vir:78 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKL---------------TPRENSFESHSFERMPITEFSNNE 252 (511) T ss_pred EEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccc---------------cccccccccCcCcccceEEecCCC Confidence 999987765543 455688999999999987765543221 123456779999999999999999 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeee--------cccCCCCCCcce Q lcl|NC_010179. 238 YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKI--------NNAGNGDKSGVD 309 (469) Q Consensus 238 ~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~ 309 (469) .|+|+|+++++|||+||.++|++++.++++++|++|++|....+.+.. 
......+++.+ ...+...+++++ T Consensus 253 ~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 331 (511) T protein:vir:78 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEV-RKQKEANVLFLEPTVYVDAEGRETEGSVDGG 331 (511) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCchhh-cccccccceeccccceeccccccCCCCccee Confidence 999999999999999999999999999999999999999755443322 22222222322 222345578899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) ||+++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++. T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:78 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999999886 7899999999999999999999999999999999999999875 Q ss_pred cC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc-ccCC Q lcl|NC_010179. 389 SD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA-DELN 462 (469) Q Consensus 389 ~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~-~~~~ 462 (469) .+ .++.+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..+.. .... 
T Consensus 412 ~~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:78 412 TRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP 491 (511) T ss_pred cCCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCC Confidence 32 35678999999999999999999999999999999999999999999999999999988776654332 1111 Q ss_pred CCCCCCC Q lcl|NC_010179. 463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) .+..++| T Consensus 492 ~~~~~~~ 498 (511) T protein:vir:78 492 RDINDDE 498 (511) T ss_pred CCCCCCC Confidence 1211122 No 16 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=1.7e-97 Score=551.13 Aligned_cols=449 Identities=25% Similarity=0.394 Sum_probs=383.3 Q ss_pred CCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MELD--ALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~--~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) |.-+ ...++|.+++.+|..++++++++.+||.|+|+|+.++..... .......++++||++||+++||++.++| T Consensus 20 ~~~~~~~~~~~i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~----~~~~~~~~~~~ki~~n~~k~ivd~~~~y 95 (478) T protein:vir:10 20 IKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPFKRDV----NGDYDETKPDWRMYTNYHQNLVDQKVAY 95 (478) T ss_pred hhhccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccchhhhc----ccccccccccceeccchHHHHHHHHhhh Confidence 2222 346788899999999999999999999999999887654322 2233456788999999999999999999 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 
79 IASVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) |||+||++++++++.++.|++++++++.+.+.++++.++++|++|++||++++|++++++++|++++|+|+++...++.+ T Consensus 96 l~g~p~~~~~~~~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~~~p~~~~~v~d~~~~~~~~~ 175 (478) T protein:vir:10 96 AVANPVTFGVDNDKALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVPAEQAVPIWTNKERDELQA 175 (478) T ss_pred hcccCceeecCChHHHHHHHHHHhccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEE Confidence 99999999999999999999999988888999999999999999999999999999999999999999999988889999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) ++|+|...+. ..+++|+++.+++|...++..... .................+|+||+||||+|+|++. T Consensus 176 ~ir~~~~~~~------~~~~~y~~~~i~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 243 (478) T protein:vir:10 176 FIRVYELDGA------ERVEYWTKDDVTFYELKEGQLIPD------FYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQ 243 (478) T ss_pred EEEEEeeeCc------eEEEEEeCCcEEEEEecCCeeecc------ccccccccccceecccccccCCcceEEEeccCCC Confidence 9999975432 356899999999988765433211 1111111222334556789999999999999999 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHH Q lcl|NC_010179. 239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVE 318 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 318 (469) |.|+|+++++|||+||.++|++++.+++|++|+++++|+++++.+++...+...+++.+... 
++++++|++++.+.+ T Consensus 244 g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~l~~~~~~~ 320 (478) T protein:vir:10 244 EVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGE---SGSGVDTIKVEVPID 320 (478) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCcccccchhhhhhhCceeEecCC---CCCcceEEeecCCHH Confidence 99999999999999999999999999999999999999988877788888888888887543 346799999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccce Q lcl|NC_010179. 319 ARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHIS 397 (469) Q Consensus 319 ~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~ 397 (469) ++++++++|.+.|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+++++. ..++.+++ T Consensus 321 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~-~~d~~~i~ 399 (478) T protein:vir:10 321 SVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL-DVRVQDIE 399 (478) T ss_pred HHHHHHHHHHHHHHHHhCCcCcCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-Ccccccce Confidence 999999999999999999999998886 6899999999999999999999999999999999999999875 56778899 Q ss_pred EEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCC-----CCCCCCC Q lcl|NC_010179. 398 QHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELN-----GKGVDDE 469 (469) Q Consensus 398 i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~-----~~~~~de 469 (469) |+|++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++..+......... 
..+.++| T Consensus 400 i~f~~~~p~~~~e~~~~~~~~~g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~d~~ 476 (478) T protein:vir:10 400 ITFNFNVMVNELENSQIAMNSTGLLSKETILGNHSWVQDPVAEMERIEQENIELNQQLPDIEEGLNDEQQRQSEDNQ 476 (478) T ss_pred EEeCCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccccCCCCcccccccCcCCC Confidence 99999999999999999999999999999999999999999999999999988665433222111 1111111 No 17 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=2.6e-97 Score=550.09 Aligned_cols=441 Identities=17% Similarity=0.187 Sum_probs=367.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.|.++|.+.. ..+++||+++.+||.|+|+|+.+.... ....++++|+++||+++||++.++||+ T Consensus 40 ~~~~~i~~~i~~~~---~~~~~r~~~l~~Yy~g~~~i~~~~~~~---------~~~~~~~~ki~~n~~k~Iv~~~~~yl~ 107 (511) T protein:vir:10 40 QNVNEVSKCIEHHM---DYQRPRLKVLSDYYEGKTKNLVELTRR---------KEEYMADNRVAHDYASYISDFINGYFL 107 (511) T ss_pred cCHHHHHHHHHHHH---HhhHHHHHHHHHHhcccCccccccCcc---------cccccCcceeecchHHHHHHHHhhhhc Confidence 55555555554432 345789999999999999998766432 233467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+|++++++.++.|+++|+.|. 
...+.++++.++++|++|+++|++++|++++++++|.+++|+|+++...+++++ T Consensus 108 g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ 187 (511) T protein:vir:10 108 GNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDNTIERNSIAG 187 (511) T ss_pred ccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcCCCCCceEEE Confidence 9999999999999999999998875 456789999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCc--eEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 160 LRSYKQLDPEAG--KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 160 v~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) ||+|.....++. ....++++|+++.+++|...+.+.... ........+|+||.||||+|+|++ T Consensus 188 vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~---------------~~~~~~~~~~~~~~vPvv~f~nn~ 252 (511) T protein:vir:10 188 VRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKL---------------TPRENGFESHSFERMPITEFSNNE 252 (511) T ss_pred EEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccc---------------cccccccccccCcceeEEEecCCC Confidence 999988766543 456678999999999987765443221 123345678999999999999999 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecc--------cCCCCCCcce Q lcl|NC_010179. 238 YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINN--------AGNGDKSGVD 309 (469) Q Consensus 238 ~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~ 309 (469) .|.|+|+++++|||+||.++|++++.+++|++|+++++|....+..+.. 
.....+++.+.+ .+.+.+++++ T Consensus 253 ~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 331 (511) T protein:vir:10 253 RRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVR-KQKEANVLFLEPTVYADSEGRETEGSVDGG 331 (511) T ss_pred CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhc-cchhccceecccccccccccccCCCCccee Confidence 9999999999999999999999999999999999999997655443332 222333333322 2344568899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) ||+++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+++++. T Consensus 332 ~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:10 332 YIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999998876 7899999999999999999999999999999999999998875 Q ss_pred cC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc----- Q lcl|NC_010179. 389 SD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA----- 458 (469) Q Consensus 389 ~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~----- 458 (469) .+ .++.+++++|++++|.|.++.++++++++|++|+||+++++|+++|+++|++||++|+++..+..+.. 
T Consensus 412 ~~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:10 412 TRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP 491 (511) T ss_pred hCCcccccccceeeEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCC Confidence 32 35678999999999999999999999999999999999999999999999999999988765544321 Q ss_pred ccCCCCCCCCC Q lcl|NC_010179. 459 DELNGKGVDDE 469 (469) Q Consensus 459 ~~~~~~~~~de 469 (469) ...+++..++| T Consensus 492 ~~~~~~~~~~~ 502 (511) T protein:vir:10 492 RDINDDEQDDD 502 (511) T ss_pred CCCCCCCCCCc Confidence 11111112222 No 18 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=1.6e-97 Score=551.30 Aligned_cols=447 Identities=27% Similarity=0.421 Sum_probs=381.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) -.-+...++|.+++.+|..++++++++++||+|+|+|+.++..... .......++++||++||++.||++.++||| T Consensus 22 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~----~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~ 97 (474) T protein:vir:96 22 PKYETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDN----KGEIDPLKPDWRMFTNYHQNLVDQKVAYAV 97 (474) T ss_pred hccCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcc----cccccccccchhcccchHHHHHHhhhhhhc Confidence 2223456788889999999999999999999999999988754322 112334578999999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+|++++++.++.+++++++++.+.+.++++.++++|++|+++|++++|++++++++|++++|+||++...++.+++ T Consensus 98 g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~v 177 (474) T protein:vir:96 98 ANPVTFSSDDDKSLKTIQEVLNHKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTFRVPAEQAIPIWTNKERDTLKAFI 177 (474) T ss_pred ccCceeecCchHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999998888899999999999999999999999999999999999999999998888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|.... ...+++|++..+++|...++........ ..............+|+||+||||+|+|++.|+ T Consensus 178 r~~~~~~------~~~~~~yt~~~v~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~ 245 (474) T protein:vir:96 178 RYYRLDG------AERVEYWTDSDVTYYEYQDGILIPDYYH------GEEHIQSHYYVGNKRVSWGRVPFIPFKNNPQEM 245 (474) T ss_pred EEEeecC------ceEEEEEeCCeEEEEEecCCceeecccc------ccccccccccccccccCCCceeEEEeccCCCCC Confidence 9997542 1356899999999998766543322111 111112233445678999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 
241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.++++++|+++++|+++++.+++...+..++++.+++ ++++|+|++++++.+++ T Consensus 246 sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~----~~~~~~~l~~~~~~~~~ 321 (474) T protein:vir:96 246 SDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEGQDLDEFMRNLKYYKAINVDG----DGSGVDTIQIEVPVQSS 321 (474) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCcccccchhhhhhcCceEEecC----CCCceeEEeecCChHHH Confidence 99999999999999999999999999999999999998877778888888889998853 34679999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+.+++. ..++.+++|+ T Consensus 322 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~-~~~~~~i~i~ 400 (474) T protein:vir:96 322 KEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL-NIKVQDVEIT 400 (474) T ss_pred HHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcccceeeEE Confidence 9999999999999999999998886 5899999999999999999999999999999999999999874 5577889999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhh-cccCCCCCCCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQ-ADELNGKGVDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~-~~~~~~~~~~de 469 (469) |++++|.|+++.++++.+ +|++|+||+++++|+++|+++|++||++|+++..+.... 
..+..++..|++ T Consensus 401 f~~~~p~~~~e~~~~~~~-ag~iS~et~~~~~~~v~d~~~E~~ri~~E~~e~~~~~~~~~~~~~~~~~d~~ 470 (474) T protein:vir:96 401 FNFNVMVNELEQSQIGVQ-SQYLSKETVVTNHPWVDDPVAELERIEQDNIDFNKQLPPLEGDANGRAQDNE 470 (474) T ss_pred eccCCCcCHHHHHHHHHh-cCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccccccCCCc Confidence 999999999999998765 699999999999999999999999999998775544322 222233333333 No 19 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=2.8e-97 Score=549.89 Aligned_cols=460 Identities=18% Similarity=0.286 Sum_probs=373.8 Q ss_pred CCHHHHHHHHHHHHHHH--HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR--NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~--~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) |+++.+++++...+..| .+++++++++++||+|+|+|++++....... ........++|+||++||+++||++.++| T Consensus 8 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~-~~~~~d~~~~nnki~~nf~k~Ivd~~~~y 86 (537) T protein:vir:78 8 KPIDQLGGLLNTEITTYMASNHIKWAHIGENYYNQENDIEKSRIFYMNDK-GQLREDNYASNVKISHGFFTELVDQLAQY 86 (537) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhhccccccccc-ccccccccccccccccchHHHHHHHHhhh Confidence 99999999987766555 4678999999999999999999887654443 23334556799999999999999999999 Q ss_pred hhcCCeeeccCch---hhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCc Q lcl|NC_010179. 79 IASVFPDIDVGKD---ADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNK 155 (469) Q Consensus 79 l~g~p~~~~~~~~---~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~ 155 (469) |+|+||+|+++++ +..+.|++++++++.+.+.++++.++++|++|+++|++++|++++++++|.++||+||+. 
.+ T Consensus 87 l~G~Pv~~~~~d~~~~e~~~~l~~~~~~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i~p~~~~pv~d~~--~~ 164 (537) T protein:vir:78 87 LLSNGVEVKVKDEDNTQLDEILQEYFDEDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTVDGLTLIPVFDDY--GV 164 (537) T ss_pred hcccCceeecCcchhHHHHHHHHHHhhccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEEccceeEEEEcCC--CC Confidence 9999999998764 355567777776777888999999999999999999999999999999999999999974 46 Q ss_pred eEEEEEEEEeeecC----CceEEEEEEEEcCCeEEEEEeecCceeeccccccc------------ccccccccccccccc Q lcl|NC_010179. 156 LLGVLRSYKQLDPE----AGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNII------------TSYDLSAGYETGQSN 219 (469) Q Consensus 156 ~~~~v~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~ 219 (469) +.+++|+|...... ......++++||+..+++|....+........... ............... T Consensus 165 ~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 244 (537) T protein:vir:78 165 LKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKLDEAYNPNPAPHVLAIEESTDADFEDTDGYQ 244 (537) T ss_pred ceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccccccccccccceeeecccccccccccccccc Confidence 88889988765433 23456789999999999998877654332211110 001111222344556 Q ss_pred cccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecc Q lcl|NC_010179. 220 TLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINN 299 (469) Q Consensus 220 ~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 299 (469) ..+|+||+||||+|+||+.|+|+|+++++|||+||.++|+++|.+++|++|+++++|+++++.+++...++.++++.+.. 
T Consensus 245 ~~~~~~g~iPvv~f~nn~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~~~~~~~~~~~l~~~~~i~v~~ 324 (537) T protein:vir:78 245 VLGRSYSKFPFQLLYNNKDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFSGDSTDKLRQNIKAKKMIGVNG 324 (537) T ss_pred ccccCCcceeEEEeccCccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCCCccchhHHHHHhhcCceeecC Confidence 77899999999999999999999999999999999999999999999999999999998888888888899999998853 Q ss_pred cCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELV 379 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 379 (469) ++++|+|++|+.+.+++++++++|.++||.+|++|+++..++||+||+||+++++++.+||+.+++.|+++|++++ T Consensus 325 ----d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~~~~~~~gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~ 400 (537) T protein:vir:78 325 ----DNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFNSTAVGDGNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCA 400 (537) T ss_pred ----CCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCCCccccccCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH Confidence 3567999999999999999999999999999999999888889999999999999999999999999999999999 Q ss_pred HHHHHHhcccC---CCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHh-hh Q lcl|NC_010179. 380 RAIMRYLNFSD---ADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAKDREEN-DP 453 (469) Q Consensus 380 ~~i~~~~~~~~---~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~-~~ 453 (469) ++|+++++..+ .++.+++++|++++|.|+++.+++++++ +|++|+||+++++|+|+|++ .+++.+|+.+. .. 
T Consensus 401 ~~i~~~~~~~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e--~ek~~~ee~~~~~~ 478 (537) T protein:vir:78 401 DMVVSDIALRGLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDE--TLKLIAEELDLDYN 478 (537) T ss_pred HHHHHHHhhcCCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHH--HHHHHHHHHHhhhh Confidence 99999987754 5778999999999999999999999987 48999999999999999974 33444433221 11 Q ss_pred hH-----hh----c------ccC-CCCCCC------CC Q lcl|NC_010179. 454 YA-----NQ----A------DEL-NGKGVD------DE 469 (469) Q Consensus 454 ~~-----~~----~------~~~-~~~~~~------de 469 (469) .. ++ . +.. .+.+.+ |+ T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 516 (537) T protein:vir:78 479 ELKDALAEQDAQSLDVSPDVQAMLDGLPVNANQPPVDP 516 (537) T ss_pred hhhhhhhhhcccccCcCcchhhhcCCCCCCCCCCCCCc Confidence 00 00 0 000 000011 00 No 20 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=8.4e-97 Score=547.30 Aligned_cols=426 Identities=17% Similarity=0.193 Sum_probs=371.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.+. |.+++.+|..+++|++++++||+|+|+|+.++.. ...++++|+++||+++||++.++||| T Consensus 17 ~~~~~----i~~~i~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~-----------~~~~~~~ki~~n~~~~ivd~~~~~l~ 81 (452) T protein:vir:36 17 ITVEV----VTKFMEKHKLEVARYEYLKNMYLGIMAIDDEPAK-----------DSWKPDNRLAVNFTKYIVDTFTGYFN 81 (452) T ss_pred CCHHH----HHHHHHHHHHHHHHHHHHHHHhccccccccCccc-----------cccCccceeecchHHHHHHHHhhhhc Confidence 66654 4555556778889999999999999999877642 33467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+++++++..++.|+++|++| +...+.++++.++++|++|+++|++++|++++++++|.+++|+||+....+++++ T Consensus 82 g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ 161 (452) T protein:vir:36 82 GIPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVYDDTVKQEPLFA 161 (452) T ss_pred ccCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEEE Confidence 999999999999999999999876 4567889999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccc Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYR 239 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 239 (469) +|+|...+ + ..++++|++..+++|.....++. ..+..+|+||.||||+|+|++.| T Consensus 162 i~~~~~~~--~---~~~~~vyt~~~i~~~~~~~~~~~--------------------~~~~~~~~~g~iPvv~~~n~~~g 216 (452) T protein:vir:36 162 VRYGVDED--K---KLQGEVYTLLETIKISGENDEIS--------------------FGEGTYNPYPDLPVVEFYFNEER 216 (452) T ss_pred EEEEEecC--c---eEEEEEEecCeEEEEEEcCCceE--------------------EecceeccCCcccEEEecCCCCC Confidence 99986432 2 34678999999998876554322 23456899999999999999999 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHH Q lcl|NC_010179. 240 LAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEA 319 (469) Q Consensus 240 ~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 319 (469) +|+|+++++|||+||.++|++++.++++++|+++++|.... 
++....+..++++.++.++.+.+++++|++++.+.++ T Consensus 217 ~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 294 (452) T protein:vir:36 217 MSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVE--EEDLKNIRSNRVINYYADGEGKNVDVKFLEKPDSDSQ 294 (452) T ss_pred CcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcC--chhhhhhhhcceEEecCCCCccCCcceeEeecCCHHH Confidence 99999999999999999999999999999999999997543 3555667778899999988888899999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC--CCcccce Q lcl|NC_010179. 320 RDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD--ADKRHIS 397 (469) Q Consensus 320 ~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~--~~~~~i~ 397 (469) +++++++|.++|+.+|++|++++.++||+||+||+++++++.+||+++++.|+.+|++++++|+++++..+ .++.+++ T Consensus 295 ~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~ 374 (452) T protein:vir:36 295 TENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIE 374 (452) T ss_pred HHHHHHHHHHHHHHHhCccccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999988754 3567899 Q ss_pred EEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 398 QHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 398 i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) |+|++++|.|+++.++++++++|+||+||+++++|+++|+++|++||++|+++..... 
+....+..+.+++ T Consensus 375 i~f~~~~p~d~~~~a~~~~k~~g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~-~~~~~~~~~~~~~ 445 (452) T protein:vir:36 375 YTFTRNEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFD-KDKQPSEKGTDTV 445 (452) T ss_pred EEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH-hhccCCCCccccc Confidence 9999999999999999999999999999999999999999999999999988765432 2222222222222 No 21 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=3.9e-97 Score=549.13 Aligned_cols=449 Identities=24% Similarity=0.391 Sum_probs=382.6 Q ss_pred CCH--------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCc Q lcl|NC_010179. 1 MEL--------------------DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSAD 60 (469) Q Consensus 1 ~~~--------------------~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ 60 (469) |+| +...++|.+++.+|..++++++++++||+|+|+|+.++..... .......+++ T Consensus 2 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~----~~~~~~~~~~ 77 (478) T protein:vir:10 2 ISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVREHKENIDNITMGERYYNHHPDILDAPPKRDV----NGDYDETKPD 77 (478) T ss_pred ccccCCCCchhHHHHHHHHhhccCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcccccccc----cccccccccc Confidence 222 1347788899999999999999999999999999877643221 2223345788 Q ss_pred ceeccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEc Q lcl|NC_010179. 
61 NRIPSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQ 140 (469) Q Consensus 61 ~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~ 140 (469) +|+++||+++||++.++||||+||++++++++.++.|++++++++.+.+.+++++++++|++|+++|.+++|++++++++ T Consensus 78 ~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~~~~ 157 (478) T protein:vir:10 78 WRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLNHKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFRVP 157 (478) T ss_pred ceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHhcCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEc Confidence 99999999999999999999999999999999999999999988888999999999999999999999999999999999 Q ss_pred cceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccc Q lcl|NC_010179. 141 PDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNT 220 (469) Q Consensus 141 p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (469) |++++|+|+++...++.+++|+|...+ ..++++|++..+++|....+...... ............... T Consensus 158 p~~~~~i~d~~~~~~~~~~v~~~~~~~------~~~~~~y~~~~i~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~ 225 (478) T protein:vir:10 158 AEQAVPIWTNKERDELQAFIRVYELDG------AERVEYWTKDDVTYYELKEGQLIPDF------YRSDDHIQPHYYQGN 225 (478) T ss_pred ccceEEEEcCCCCCceEEEEEEEEecC------ceEEEEEeCCeEEEEEEcCCeeeccc------cccccccccceeccc Confidence 999999999988889999999997543 23578999999999887654432211 111112222334456 Q ss_pred ccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeeccc Q lcl|NC_010179. 221 LKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNA 300 (469) Q Consensus 221 ~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 300 (469) .+|+||.||||+|+|++.|+|+|+++++|||+||.++|++++.+++|++|+++++|+++++.+++...+...+++.+... 
T Consensus 226 ~~~~~~~vPvv~~~n~~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 305 (478) T protein:vir:10 226 KLMSWGRVPFIPFKNNPQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMKDFMHNLKYYKAISVAGE 305 (478) T ss_pred ccccCCccceEEeccCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccchhhhhhhhcceEEecCC Confidence 78999999999999999999999999999999999999999999999999999999988877888888888888887543 Q ss_pred CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 301 GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELV 379 (469) Q Consensus 301 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 379 (469) ++++++|++++++.+++++++++|.+.|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++ T Consensus 306 ---~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 382 (478) T protein:vir:10 306 ---SGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELL 382 (478) T ss_pred ---CCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 346799999999999999999999999999999999998886 6899999999999999999999999999999999 Q ss_pred HHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc Q lcl|NC_010179. 380 RAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD 459 (469) Q Consensus 380 ~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~ 459 (469) ++|+++++. ..+..+++|+|++++|.|+++.|+++++++|++|+||+++++|+++|+++|++||++|+++..+...... 
T Consensus 383 ~li~~~~g~-~~~~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~ 461 (478) T protein:vir:10 383 QYIIDFYRL-DVKVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILSNHAWVEDPVAEMERIEQENIELNQQLPDIE 461 (478) T ss_pred HHHHHHhCC-CcccccceEEecCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccc Confidence 999999874 5678899999999999999999999999999999999999999999999999999999877655433322 Q ss_pred cCCCCCCCCC Q lcl|NC_010179. 460 ELNGKGVDDE 469 (469) Q Consensus 460 ~~~~~~~~de 469 (469) ....++.++| T Consensus 462 ~~~~~~~~~~ 471 (478) T protein:vir:10 462 EGLNGEQQRQ 471 (478) T ss_pred cccCCCCCCC Confidence 2111111111 No 22 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=4.1e-97 Score=549.03 Aligned_cols=442 Identities=27% Similarity=0.447 Sum_probs=380.0 Q ss_pred CCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MEL--DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~--~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) |+- +...++|.+++..|..++++|+++++||+|+|+|+.+..... ........++++||++||+++||++.++| T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~----~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~ 96 (474) T protein:vir:94 21 LKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVD----VHGNIDYDKPDWRITTNFHQNLVDQKVSY 96 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhc----cccccccccCcceeecchHHHHHHHHHhh Confidence 111 144578888888999999999999999999999987754432 12234566789999999999999999999 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 
79 IASVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) |||+||+++++++..++.++.|+++++.+.+.++++.++++|++|+++|++++|++++++++|++++|+||++...++.+ T Consensus 97 l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~ 176 (474) T protein:vir:94 97 VASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKS 176 (474) T ss_pred hhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEE Confidence 99999999999999999999999888888899999999999999999999999999999999999999999988899999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) ++|+|...+. .++++|++..+++|...++..... ......+......+|+||+||||+|+|++. T Consensus 177 ~ir~~~~~~~------~~~~~yt~~~~~~y~~~~~~~~~~----------~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~ 240 (474) T protein:vir:94 177 FIRYYKFNNE------EKVEFWTDTTVTYYVLENGGLIPD----------YYYGANHVQSHFSNGNWGRVPFIAFKNNPE 240 (474) T ss_pred EEEEEEecCe------EEEEEEeCCeEEEEEEcCCccccc----------cccCcCcccccccccCCCccceEEecCCcC Confidence 9999975421 367899999999998776543322 112223344567789999999999999999 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHH Q lcl|NC_010179. 
239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVE 318 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 318 (469) |+|+|+++++|||+||.++|++++.++++++|+++++|+.+++.+++..++..++++.+..+ ++++|++++.+.+ T Consensus 241 g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~l~~~~~~~ 315 (474) T protein:vir:94 241 EVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGD-----GGVETIQVEVPVS 315 (474) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCC-----CceeEEeecCCHH Confidence 99999999999999999999999999999999999999988877888888888888887543 4699999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccce Q lcl|NC_010179. 319 ARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHIS 397 (469) Q Consensus 319 ~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~ 397 (469) ++++++++|.+.||.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++.+ .++.+++ T Consensus 316 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~-~d~~~i~ 394 (474) T protein:vir:94 316 STKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK-TDVKDIE 394 (474) T ss_pred HHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-cccceee Confidence 999999999999999999999998876 78999999999999999999999999999999999999998864 4678899 Q ss_pred EEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc-cCCCCCCCCC Q lcl|NC_010179. 398 QHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD-ELNGKGVDDE 469 (469) Q Consensus 398 i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-~~~~~~~~de 469 (469) |+|++++|.|+++.+++++++ |++|+||+++++|+++|+++|++||++|+++..+...... 
...+.+.++| T Consensus 395 v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~ 466 (474) T protein:vir:94 395 ISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQE 466 (474) T ss_pred EEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCC Confidence 999999999999999999885 8999999999999999999999999999987655443222 1111111122 No 23 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=4.1e-97 Score=549.03 Aligned_cols=442 Identities=27% Similarity=0.447 Sum_probs=380.0 Q ss_pred CCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MEL--DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~--~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) |+- +...++|.+++..|..++++|+++++||+|+|+|+.+..... ........++++||++||+++||++.++| T Consensus 21 ~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~----~~~~~~~~~~~~ki~~n~~k~Ivd~~~~~ 96 (474) T protein:vir:97 21 LKPQFETQEEMIVRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVD----VHGNIDYDKPDWRITTNFHQNLVDQKVSY 96 (474) T ss_pred hhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccchhc----cccccccccCcceeecchHHHHHHHHHhh Confidence 111 144578888888999999999999999999999987754432 12234566789999999999999999999 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 
79 IASVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) |||+||+++++++..++.++.|+++++.+.+.++++.++++|++|+++|++++|++++++++|++++|+||++...++.+ T Consensus 97 l~g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~ 176 (474) T protein:vir:97 97 VASKPVTYSCEDENVLKVIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKS 176 (474) T ss_pred hhcCCceeccCcHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEcccceEEEEcCCCCCceEE Confidence 99999999999999999999999888888899999999999999999999999999999999999999999988899999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) ++|+|...+. .++++|++..+++|...++..... ......+......+|+||+||||+|+|++. T Consensus 177 ~ir~~~~~~~------~~~~~yt~~~~~~y~~~~~~~~~~----------~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~ 240 (474) T protein:vir:97 177 FIRYYKFNNE------EKVEFWTDTTVTYYVLENGGLIPD----------YYYGANHVQSHFSNGNWGRVPFIAFKNNPE 240 (474) T ss_pred EEEEEEecCe------EEEEEEeCCeEEEEEEcCCccccc----------cccCcCcccccccccCCCccceEEecCCcC Confidence 9999975421 367899999999998776543322 112223344567789999999999999999 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHH Q lcl|NC_010179. 
239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVE 318 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 318 (469) |+|+|+++++|||+||.++|++++.++++++|+++++|+.+++.+++..++..++++.+..+ ++++|++++.+.+ T Consensus 241 g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~l~~~~~~~ 315 (474) T protein:vir:97 241 EVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYEGEDLEEFMRGLKYYKAINVDGD-----GGVETIQVEVPVS 315 (474) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCC-----CceeEEeecCCHH Confidence 99999999999999999999999999999999999999988877888888888888887543 4699999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccce Q lcl|NC_010179. 319 ARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHIS 397 (469) Q Consensus 319 ~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~ 397 (469) ++++++++|.+.||.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++.+ .++.+++ T Consensus 316 ~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~-~d~~~i~ 394 (474) T protein:vir:97 316 STKEYIDLMRVYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLK-TDVKDIE 394 (474) T ss_pred HHHHHHHHHHHHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-cccceee Confidence 999999999999999999999998876 78999999999999999999999999999999999999998864 4678899 Q ss_pred EEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc-cCCCCCCCCC Q lcl|NC_010179. 398 QHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD-ELNGKGVDDE 469 (469) Q Consensus 398 i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-~~~~~~~~de 469 (469) |+|++++|.|+++.+++++++ |++|+||+++++|+++|+++|++||++|+++..+...... 
...+.+.++| T Consensus 395 v~f~~~~p~~~~e~a~~~~~~-g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~~~ 466 (474) T protein:vir:97 395 ISFNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQE 466 (474) T ss_pred EEeccCcccCHHHHHHHHHHc-CCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCcccCC Confidence 999999999999999999885 8999999999999999999999999999987655443222 1111111122 No 24 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=8.7e-97 Score=547.22 Aligned_cols=440 Identities=17% Similarity=0.173 Sum_probs=372.3 Q ss_pred CCHH------HH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHH Q lcl|NC_010179. 1 MELD------AL-KKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~~~------~~-~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~ 73 (469) |+-+ ++ .++|.+++.+|..+.++|+++.+||+|+|+|++++.. ...++++|+++||+++||+ T Consensus 5 ~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~-----------~~~~~~~ki~~n~~~~Iv~ 73 (499) T protein:vir:10 5 IDKDLLDDVNEPNIEAINYAIRELQNRKKRLDKLSDYYNGKQEIEKHEFD-----------NATVEAANVMVNHAKYITD 73 (499) T ss_pred hhhhHHhhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcCCcC-----------cCCCCcceeecchHHHHHH Confidence 1111 11 3346666777888899999999999999999876532 3456789999999999999 Q ss_pred HHHHhhhcCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCCCc-----------------eE Q lcl|NC_010179. 74 QEAGYIASVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDEDNN-----------------FR 135 (469) Q Consensus 74 ~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~~~-----------------~~ 135 (469) +.++||||+||+|++++++.++.++++|+.|.+ ..+.++++.++++|++|+++|++++|. 
++ T Consensus 74 ~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~~~~ 153 (499) T protein:vir:10 74 MNVGFMTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNTELK 153 (499) T ss_pred HHhhhhcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEecccccccccccccccccccccceE Confidence 999999999999999999999999999988754 568899999999999999999999873 67 Q ss_pred EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccc Q lcl|NC_010179. 136 YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYET 215 (469) Q Consensus 136 i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (469) ++.++|++++|+|++....++++++|+|...+.++.+...++++|+++.+++|...+.... .... T Consensus 154 ~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~---------------~~~~ 218 (499) T protein:vir:10 154 IEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEV---------------SAND 218 (499) T ss_pred EEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccc---------------cCcc Confidence 8999999999999998888999999999998888888888999999999999987654321 1123 Q ss_pred cccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhccee Q lcl|NC_010179. 216 GQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSI 295 (469) Q Consensus 216 ~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~ 295 (469) ......+|+||.||||+|+|++.|.|+|+++++|||+||.++|++++.++++++|+++++|................++. 
T Consensus 219 ~~~~~~~~~~g~vPvv~~~n~~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~~~~~~ 298 (499) T protein:vir:10 219 PIVYDGENLFGAVPIIEFRNNEERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDDKDDIQRLKRGAIE 298 (499) T ss_pred eecccccCCCCccceEEecCCCCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccccchhhhhhhccee Confidence 34567789999999999999999999999999999999999999999999999999999998766655555555545444 Q ss_pred eecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 296 KINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 296 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) .+. ..++++++||+++.+.+++++++++|.+.|+.+|++|++++..+ ||+||+||+++++++.+||+++++.|+.+ T Consensus 299 ~~~---~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~ 375 (499) T protein:vir:10 299 APP---REEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDG 375 (499) T ss_pred ccC---CCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433 23457799999999999999999999999999999999988775 78999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccC--CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_010179. 375 INELVRAIMRYLNFSD--ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREEND 452 (469) Q Consensus 375 l~~~~~~i~~~~~~~~--~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~ 452 (469) |++++++|+.+++..+ .++..++++|++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++.. 
T Consensus 376 l~~~~~li~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~ 455 (499) T protein:vir:10 376 LRRRLKLIQTIVNIKGANDDASGCKISLVANIPSNLSDVVNNVKNADGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETI 455 (499) T ss_pred HHHHHHHHHHHHhccCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 9999999999988654 45678999999999999999999999999999999999999999999999999999988765 Q ss_pred hhHhhc------ccCCCCCCCCC Q lcl|NC_010179. 453 PYANQA------DELNGKGVDDE 469 (469) Q Consensus 453 ~~~~~~------~~~~~~~~~de 469 (469) +..+.. +..+.++..++ T Consensus 456 ~~~~~~~~~~~~~~~~~~~~~~~ 478 (499) T protein:vir:10 456 KKNQEALRGQDPDRLELEDKQDD 478 (499) T ss_pred HHHHhhhccCCCCCCCCCCCCcc Confidence 443321 11111111111 No 25 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=1.2e-96 Score=546.38 Aligned_cols=443 Identities=27% Similarity=0.426 Sum_probs=380.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) -+.+.+.++|.+++.+|..++++++++.+||+|+|+|+.++...... ......++++|+++||+++||++.++||| T Consensus 20 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~----~~~~~~~~~~ri~~n~~~~ivd~~~~~l~ 95 (472) T protein:vir:93 20 NKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDAT----GAVDPLKPDDRMITNFHANLVDQKVSYIV 95 (472) T ss_pred CchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhhcc----ccccccccccccccchHHHHHHHHhhhhc Confidence 44467899999999999999999999999999999998876532211 12234568899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+|++++++..+.|++|+++++.+.+.++++.++++|++|++||++++|++++++++|.+++|+||++...++.+++ T Consensus 96 g~~~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~~~i~d~~~~~~~~~~i 175 (472) T protein:vir:93 96 GKPIAFKHTDDEVVKRIDEVLGNRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFI 175 (472) T ss_pred ccCeeeccCChHHHHHHHHHHhccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999988888889999999999999999999999999999999999999999988888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+. ..+++|++..+++|........... ............+|+||.||||+|+|++.|+ T Consensus 176 r~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g~ 239 (472) T protein:vir:93 176 RMYKLENE------TKVEYWDKVTVNYYVYENGSLIPDY----------SNNLENSKTHFSTGSWGKIPFIPFKNNDLEI 239 (472) T ss_pred EEEEeecc------eeEEEEecCeEEEEEEecCeeeecc----------cccccccccccccCCCCCcceEEecCCCCCC Confidence 99976532 2468999999988876654332211 1112234456778999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 
241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.+++|++|+++++|++....++....++..+++.++.+ ++++|++++.+.+++ T Consensus 240 s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~l~~~~~~~~~ 314 (472) T protein:vir:93 240 SDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDDQELPEFKRLLRYYGAIKVSDN-----GGVDTIQVEVPVENS 314 (472) T ss_pred CchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCcccchhhHHHHhhccccccCCC-----CcceeEeecCCHHHH Confidence 999999999999999999999999999999999999987777777777888888776543 569999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.++|+.+|++|+++++++ ||+||+||+++++++.+||+++++.|+++|++++++|+.+++.+. ++.+++++ T Consensus 315 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~-~~~~i~v~ 393 (472) T protein:vir:93 315 KKYLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG-EHKDVDIS 393 (472) T ss_pred HHHHHHHHHHHHHHhCCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc-ccceeeEE Confidence 9999999999999999999998876 679999999999999999999999999999999999999998654 67789999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc---cCCCCCCCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD---ELNGKGVDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~---~~~~~~~~de 469 (469) |++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++......... 
..+.++++++ T Consensus 394 f~~~~p~~~~~~~~~~~k~~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~ 466 (472) T protein:vir:93 394 FNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERS 466 (472) T ss_pred eCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCC Confidence 999999999999999999999999999999999999999999999999876554432221 1111111111 No 26 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=3.6e-96 Score=543.87 Aligned_cols=430 Identities=16% Similarity=0.188 Sum_probs=370.4 Q ss_pred CCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MEL--DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~--~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) |.. +...++|.+++.+|..+++|++++++||+|+|+|+.++.. ...++++|+++||+++||++.++| T Consensus 11 ~p~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~-----------~~~~~~~ki~~n~~~~ivd~~~~~ 79 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTK-----------DLWKPDNRLTVNFTKYIVDTFTGY 79 (453) T ss_pred cCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCc-----------cccCccceeecchHHHHHHHHhhh Confidence 221 2234556666677888899999999999999999877542 345678899999999999999999 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceE Q lcl|NC_010179. 
79 IASVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLL 157 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~ 157 (469) |||+||+|++++++.++.|+++|++|.+ ..+.++++.++++|++|++||++++|++++++++|.+++|+|++....+++ T Consensus 80 l~g~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~~~~~~~ 159 (453) T protein:vir:39 80 FNGIPVKKSHSDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFMVYDDTIKQEPL 159 (453) T ss_pred hcccCceeccCChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEecCCCCCeEE Confidence 9999999999999999999999998755 567899999999999999999999999999999999999999998888899 Q ss_pred EEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 158 GVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 158 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) +++|+|... ....++++|+++.+++|...+..+ ...+..+|+||.||||+|+|++ T Consensus 160 ~~ir~~~~~-----~~~~~~~~yt~~~i~~~~~~~~~~--------------------~~~~~~~~~~g~vPvv~~~n~~ 214 (453) T protein:vir:39 160 FAVRYGYDD-----DYKLYGEVYTKETTYALNGTMGFY--------------------NMTEQAPNPFDDLPVVEFYFNE 214 (453) T ss_pred EEEEEEEeC-----CeEEEEEEEeCCeEEEEEecCCce--------------------eeecccccCCCceeEEEecCCC Confidence 999988532 234578999999999887655432 2335668999999999999999 Q ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeeccc-CCCCCCcceEEeecCC Q lcl|NC_010179. 238 YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNA-GNGDKSGVDKLQIDIP 316 (469) Q Consensus 238 ~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~~~ 316 (469) .|+|+|+++++|||+||.++|++++.+++|++|+++++|.+.+ ++....+..++++.+... 
+.+.+++++|++++.+ T Consensus 215 ~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~ 292 (453) T protein:vir:39 215 ERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAAVE--EEDLKNIRSNRVINYYGESSEAKNVDVKFLEKPDS 292 (453) T ss_pred CCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCCC--chhhhhhhhcceeeecCCCCCCCCCceeEEeecCC Confidence 9999999999999999999999999999999999999997543 344555667777776543 2345788999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC--CCcc Q lcl|NC_010179. 317 VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD--ADKR 394 (469) Q Consensus 317 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~--~~~~ 394 (469) .+++++++++|.++|+.+|++|++++.++||+||+||++++++|.+||+++++.|+.+|++++++|+++++..+ .+.. T Consensus 293 ~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~ 372 (453) T protein:vir:39 293 DSQTENLLDRLTKLIFQTTMVANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWK 372 (453) T ss_pred HHHHHHHHHHHHHHHHHHhCCcccccccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999988754 4567 Q ss_pred cceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 395 HISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 395 ~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++|+|++++|.|+++.++++++++|+||+||+++++|+++|+++|++||++|+++..+... 
.....+.+.+++ T Consensus 373 ~i~v~f~~~~p~~~~~~a~~~~kl~g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~-~~~~~~~~~~~~ 446 (453) T protein:vir:39 373 DIEYTFTRNEPKDIKEQAETANILMGITSQETALSVISVIPDVQAEMEKIKKEEASTAIFDK-DKQPSEKGTDTV 446 (453) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHH-hccCCCCCCCCC Confidence 89999999999999999999999999999999999999999999999999999987766443 223333333333 No 27 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=2.8e-96 Score=544.42 Aligned_cols=444 Identities=25% Similarity=0.402 Sum_probs=381.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) -+.+...++|.+++.+|..++++|+++++||.|+|+|+.+...... .......++++||++||++.||++.++||| T Consensus 22 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~----~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~ 97 (468) T protein:vir:96 22 PQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNV----KGEIDPFKPDWRMYTNYHQNLVDQKVAYAV 97 (468) T ss_pred ccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccc----cccccccccccccccchHHHHHHHHHhhhc Confidence 2223446678888889999999999999999999999887654322 122334567899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||++++++++.++.|+++|++|+.+.+.++++.++++|++|++||++++|++++++++|.+++|+|+++...++.+++ T Consensus 98 g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~~~~~~~~~~~~i 177 (468) T protein:vir:96 98 ANPVTYGTEDEKSLKTIQEVLNHKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTFRVPAEQAIPIWTNKERDELKAFI 177 (468) T ss_pred cCCceeccCChHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999998888999999999999999999999999999999999999999999998888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+. ..+++|++..+++|...++....... ........+......+|+||+||||+|+|++.|+ T Consensus 178 r~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~iPvv~~~n~~~g~ 245 (468) T protein:vir:96 178 RLYELDGG------ERVEYWTANDVTFYELKDGQLIPDYY------QGEEHVQAHYYVGNKSMSWNRVPFIPFKNNPQEV 245 (468) T ss_pred EEEEecCc------eEEEEEeCCeEEEEEEcCCceeeccc------ccccccccceeeccccccCCcccEEEecCCCCCC Confidence 99975432 35689999999999877655432211 1111222334556678999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 
241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.++++++|+++++|+.+++.+.+...++.++++.+...+ +++++|++++++.+++ T Consensus 246 sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~d~---~~~~~~l~~~~~~~~~ 322 (468) T protein:vir:96 246 SDLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYEGEDLEEFMYNLKYYKAINVDGDG---SGGVDTIQIDVPVQSA 322 (468) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCccccchhhhhhhcCceEEecCCC---CCcceEEeecCChHHH Confidence 9999999999999999999999999999999999999888778888888888888886543 4679999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.++||.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+++++. ..++.+++|+ T Consensus 323 ~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~-~~d~~~i~i~ 401 (468) T protein:vir:96 323 KEYLDMLRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL-SIKVQDVEIT 401 (468) T ss_pred HHHHHHHHHHHHHHhCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcccceeeEE Confidence 9999999999999999999998886 5899999999999999999999999999999999999999875 4577889999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) |++++|.|+++.|+++++ +|+||+||+++++|+++||++|++||++|+++..+.. +. 
..++.++| T Consensus 402 f~~~~p~d~~e~a~~~~~-~g~iS~et~i~~l~~v~D~~~E~~ri~~E~~~~~~~~---~~-~~~~~~~~ 466 (468) T protein:vir:96 402 FNFNVMVNELEQSQIGVN-SQYLSKETVVTNHPWVDDPVAEMERIDQEELALPSIE---EG-LNGKENNE 466 (468) T ss_pred ecCCCCcCHHHHHHHHHh-cCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHh---hc-cCCCCCCC Confidence 999999999999999876 5999999999999999999999999999987665533 22 33444555 No 28 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=1.8e-96 Score=545.50 Aligned_cols=438 Identities=16% Similarity=0.202 Sum_probs=372.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.++|.++|++... ++.++|+++.+||+|+|++..+.... .....++++|+++||+++||++.++||| T Consensus 22 l~~~~i~~li~~~~~---~~~~r~~~l~~YY~g~~~~i~~~~~~--------~~~~~~~~~ki~~n~~~~Iv~~~~~~l~ 90 (506) T protein:vir:94 22 LTPNKIMKFITHHFN---YQRPRLEMLDDYYQGYNLKILDKQSR--------RHEDGKADHRATHSFAKYIADFQTSYSV 90 (506) T ss_pred CCHHHHHHHHHHHHH---HHHHHHHHHHHHhcCCCccccccccc--------cccccCCcceeecchHHHHHHHhhhhhc Confidence 888888888777543 45678899999999999765443221 2334568899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+|+++++..++.|+++|++|.+ ..+.++++.++++|++|+++|++++|++++++++|.+++|+|++....+++++ T Consensus 91 G~p~~~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v~dd~~~~~~~~~ 170 (506) T protein:vir:94 91 GNPINVKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVIYSTDVDPKPIMA 170 (506) T ss_pred ccCceeecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEecCCCCCceEEE Confidence 99999999999999999999988655 56789999999999999999999999999999999999999999888889999 Q ss_pred EEEEEeeecCCce---EEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCC Q lcl|NC_010179. 160 LRSYKQLDPEAGK---YFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN 236 (469) Q Consensus 160 v~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~ 236 (469) ||+|...+.++.. ...+.++|+...+++|.....++ ......+|+||.||||+|+|+ T Consensus 171 v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~~--------------------~~~~~~~~~~g~vPvv~~~n~ 230 (506) T protein:vir:94 171 VRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIMG--------------------KMQVDTTKPITTFPVVEFKNS 230 (506) T ss_pred EEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCcc--------------------ceeccccccCCccceEEecCC Confidence 9999877666544 44577889998888776543322 223456799999999999999 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcc------------------------cchhhhhhhhhc Q lcl|NC_010179. 237 KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGA------------------------SLKQFMNDLREY 292 (469) Q Consensus 237 ~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~------------------------~~~~~~~~~~~~ 292 (469) +.|.|+|+++++|||+||.++|++++.++++++|+++++|.... 
...+....+..+ T Consensus 231 ~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (506) T protein:vir:94 231 NFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKDA 310 (506) T ss_pred CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhhc Confidence 99999999999999999999999999999999999999996421 223445566778 Q ss_pred ceeeecccC----CCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 293 KSIKINNAG----NGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKT 367 (469) Q Consensus 293 ~~~~~~~~~----~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~ 367 (469) +++.+.+++ .+.+++++||+++.+.+++++++++|.+.|+.+|++|++++.++ ||+||+||+++++++.+||+++ T Consensus 311 ~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Aik~~~~~l~~k~~~k 390 (506) T protein:vir:94 311 NMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDENFASNSSGVAMQYKVLGTVELASTK 390 (506) T ss_pred CeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccccccccchHHHHHHHHHHHHHHHHHH Confidence 888887664 34567899999999999999999999999999999999988875 7899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccc----CCCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHH Q lcl|NC_010179. 368 QTYFEHAINELVRAIMRYLNFS----DADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKD 443 (469) Q Consensus 368 ~~~~~~~l~~~~~~i~~~~~~~----~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~er 443 (469) ++.|+++|++++++|+++++.. 
..++.+++|+|++++|.|+++.|+++++++|+||+||+++++|+++|+++|++| T Consensus 391 ~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~lp~v~d~~~E~~r 470 (506) T protein:vir:94 391 RRMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQAGATLPQKYLYQQLPGVTNPQDIVDM 470 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHH Confidence 9999999999999999998754 345678999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 444 LAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 444 i~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) |++|+++..+..++....+..+..+| T Consensus 471 i~~E~~~~~~~~~~~~~~~~~~~~~~ 496 (506) T protein:vir:94 471 MKEQSANGDYSFDQNGVISNDGQTNT 496 (506) T ss_pred HHHHHHHHhhcchhhcCCCcccCccc Confidence 99999877666544433333333222 No 29 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=1.6e-95 Score=540.26 Aligned_cols=454 Identities=23% Similarity=0.336 Sum_probs=382.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) -..+...++|.+++.+| +.++++++++||.|+|+|+.+....+..... 
......++++|+++||+++||++.++|+| T Consensus 25 ~~~~~~~~~i~~~i~~~--~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~-~~~~~~~~~~ri~~n~~~~ivd~~~~yl~ 101 (503) T protein:vir:59 25 EIAEPDTTMIQKLIDEH--NPEPLLKGVRYYMCENDIEKKRRTYYDAAGQ-QLVDDTKTNNRTSHAWHKLFVDQKTQYLV 101 (503) T ss_pred hccchhHHHHHHHHHhh--cHHHHHHHHHHhccccchhhccchhcccccc-cccccccccceeecchHHHHHHHHHhhhh Confidence 11122345677777776 4578999999999999999887654443333 33445678999999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++..++.++.++++++.+.+.++++.++++|++|++||+|++|++++++++|.+++|+|++....++.++| T Consensus 102 g~~~~~~~~d~~~~~~l~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~~i 181 (503) T protein:vir:59 102 GEPVTFTSDNKTLLEYVNELADDDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAEEMIVVYKDNTRRDILFAL 181 (503) T ss_pred cCCeeeccCcHHHHHHHHHHHhcCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccceeEEEEeCCCCCceEEEE Confidence 99999999999999999999888888889999999999999999999999999999999999999999999889999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) |+|...+.++. ...++++|+++.+++|......+....... ..........+..+|+||+||||+|+|++.|. 
T Consensus 182 r~~~~~~~~~~-~~~~~evy~~~~i~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~vPiv~~~nn~~~~ 254 (503) T protein:vir:59 182 RYYSYKGIMGE-ETQKAELYTDTHVYYYEKIDGVYQMDYSYG------ENNPRPHMTKGGQAIGWGRVPIIPFKNNEEMV 254 (503) T ss_pred EEEEEecCCCc-eEEEEEEEeCCcEEEEEEcCCccccccccc------ccccccceeecceeccCCccceEEecCCCCCC Confidence 99987766554 446789999999999988766654332211 11122233455678999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.++++++|+++++|+++++.+++...+...+++.++.+ ++++|++++++.+++ T Consensus 255 sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~l~~~~~~~~~ 329 (503) T protein:vir:59 255 SDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPKEFTANLRYHSVIKVSGD-----GGVDTLRAEIPVDSA 329 (503) T ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccchhhhhhhcccceeccCC-----CcceeEeccCCHHHH Confidence 999999999999999999999999999999999999988888888888888888876543 568999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCC----Cccc Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDA----DKRH 395 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~----~~~~ 395 (469) +.++++|.+.|+.+|++|++++..+ |++||+||+++++++.+||+++++.|+.+|++++++|+.+++..+. 
+..+ T Consensus 330 ~~~~~~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~~~ 409 (503) T protein:vir:59 330 AKELERIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPDKE 409 (503) T ss_pred HHHHHHHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccc Confidence 9999999999999999999988765 7899999999999999999999999999999999999999876432 3457 Q ss_pred ceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 396 ISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 396 i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++++|++++|.|+++.+++++++ +|++|+||+++++|+++|+++|++||++|+++..+..........+..+++ T Consensus 410 i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 485 (503) T protein:vir:59 410 LTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGNLLDDEGGDDDLE 485 (503) T ss_pred eeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhccccCccCCCCCCC Confidence 99999999999999999999998 689999999999999999999999999998776554433222111111111 No 30 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=2.3e-95 Score=539.44 Aligned_cols=426 Identities=19% Similarity=0.206 Sum_probs=369.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.+.|.++|++ |..+++||+++++||+|+|+|+.+... 
...++++|+++||+++||++.++||| T Consensus 1 l~~~~l~~~i~~----~~~~~~r~~~l~~yy~g~~~il~~~~~-----------~~~~~~~ki~~n~~~~ivd~~~~~l~ 65 (429) T protein:vir:98 1 MTKDLLSELIQK----HRSFNLSYSAYKQLYEGDHAILQQKQK-----------EQYKPDNRLVVNFAKYIVDTFNGYFI 65 (429) T ss_pred CCHHHHHHHHHH----HHHHHHHHHHHHHHhcccccccccccc-----------ccCCCcceeecchHHHHHHHHhhhhc Confidence 888877776654 667889999999999999999876542 23467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+++++++..++.++++|+.|. ...+.+++++++++|++|+++|++++|++++++++|.+++|+||+....+++++ T Consensus 66 g~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~ 145 (429) T protein:vir:98 66 GVPVQTSHENKQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFA 145 (429) T ss_pred ccCceeecCChHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEE Confidence 9999999999999999999998875 466889999999999999999999999999999999999999998888889999 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccc Q lcl|NC_010179. 
160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYR 239 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 239 (469) +|+|...+ ...+.++|+...+++|.....++ ...+..+|+||.||||+|+|++.| T Consensus 146 i~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~~~g~vPvv~~~n~~~g 200 (429) T protein:vir:98 146 VRYFYNKG-----GVLEGSYSDASNITYFKDGEKGI--------------------EIGESEPHPFDGVPMIEYVENEER 200 (429) T ss_pred EEEEEecC-----ceEEEEEEeCceEEEEEecCCce--------------------EecccccccCCccceEEecCCCCC Confidence 99986432 23466889999888887654432 223456899999999999999999 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHH Q lcl|NC_010179. 240 LAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEA 319 (469) Q Consensus 240 ~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 319 (469) +|+|+++++|||+||.++|++++.++++++|+++++|.... +++...+..++++.++.+ ++.+++++|++++.+.++ T Consensus 201 ~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~ 277 (429) T protein:vir:98 201 QSLLASVVTLINAFNKAISEKANDVEYFADAYLKILGAELD--DETLKSLRDTRIINLKDT-DAQQLTVEFLQKPDADAT 277 (429) T ss_pred CCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCCC--cchhhhHhhCceeeccCC-CCCCcceeEEeecCCHHH Confidence 99999999999999999999999999999999999997544 355667777888888655 456788999999999999 Q ss_pred HHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC--CCcccce Q lcl|NC_010179. 
320 RDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD--ADKRHIS 397 (469) Q Consensus 320 ~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~--~~~~~i~ 397 (469) +++++++|.++|+.+|++|++++.++||+||+||+++++++.+||+++++.|+.+|++++++|+++++..+ .++.+++ T Consensus 278 ~~~~~~~l~~~i~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~ 357 (429) T protein:vir:98 278 QEHLLDRLENLIFRTAMVANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIK 357 (429) T ss_pred HHHHHHHHHHHHHHHhCccccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccce Confidence 99999999999999999999999999999999999999999999999999999999999999999988754 4567899 Q ss_pred EEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 398 QHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 398 i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++|++++|.|+++.+++++|++|++|+||+++++|+++|+++|++||++|+++..+..+..-.......+.| T Consensus 358 v~f~~~~p~~~~~~a~~~~kl~g~is~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 358 YKFTRNLPANLLEESQIAGNLAGIVSEETQVGVLSIVENPQKEIERKNSDKSTLISRQAGGLNGQNTTTILE 429 (429) T ss_pred EEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhcCCCCCCCCC Confidence 999999999999999999999999999999999999999999999999999876543322222122222222 No 31 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=1.3e-95 Score=540.86 Aligned_cols=442 Identities=27% Similarity=0.448 Sum_probs=377.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) =+.+...++|.+++.+|..++++++++.+||.|+|+|+.+..... ........++++||++||++.||++.++||| T Consensus 23 ~~~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~----~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~ 98 (474) T protein:vir:95 23 PQFETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVD----VYGNIDYDKPDWRITTNFHQNLVDQKVSYVA 98 (474) T ss_pred hccCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccc----cccccccccccceeccchHHHHHHHHHhhhc Confidence 112234568888888899999999999999999999988765432 1223345678999999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVL 160 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v 160 (469) |+||+++++++..++.++.++++++.+.+.++++.++++|++|+++|++++|++++++++|.+++|+|++....++.+++ T Consensus 99 g~p~~~~~~d~~~~~~l~~~~~n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~d~~~~~~~~~~i 178 (474) T protein:vir:95 99 SKPVTYSCEDESVLKIIHDVLDTRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAEQAIPIWVDKEREELKSFI 178 (474) T ss_pred cCCceeccCchHHHHHHHHHHhccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEE Confidence 99999999999999999999988888889999999999999999999999999999999999999999998888999999 Q ss_pred EEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccc Q lcl|NC_010179. 161 RSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRL 240 (469) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~ 240 (469) ++|...+. ..+++|++..+++|....+.+.... 
............+|+||.||||+|+|++.|+ T Consensus 179 ~~~~~~~~------~~~~~y~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~g~iPvv~~~nn~~g~ 242 (474) T protein:vir:95 179 RYYKFNNE------EKVEFWTDTTVTYYVLENGGLIPDY----------YYGANHIQSHFSNGNWGRVPFIAFKNNPEEV 242 (474) T ss_pred EEEEEcCe------eEEEEEeCCeEEEEEEcCCcccccc----------ccCcccccccccccCCCccceEeecCCCCCC Confidence 99975432 2578999999999887765543211 1112233455678999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHH Q lcl|NC_010179. 241 AELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEAR 320 (469) Q Consensus 241 ~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 320 (469) |+|+++++|||+||.++|++++.+++|++|+++++|+++++..++...+...+++.++.+ ++++|++++++.+++ T Consensus 243 sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~~~~~~~~~~~~i~~~~~-----~~~~~l~~~~~~~~~ 317 (474) T protein:vir:95 243 SDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYEGQDLEEFMRGLKYYKAINVDGD-----GGVETIQVEVPVSST 317 (474) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccchhhhhhhhccceeeccCC-----CceeEEeecCCHHHH Confidence 999999999999999999999999999999999999988877778888888888877543 569999999999999 Q ss_pred HHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEE Q lcl|NC_010179. 321 DDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQH 399 (469) Q Consensus 321 ~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~ 399 (469) ++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+++++. 
..++.+++++ T Consensus 318 ~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~-~~d~~~i~v~ 396 (474) T protein:vir:95 318 KEYIDLMRAYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNL-KMDVKDIEIS 396 (474) T ss_pred HHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC-CcccceeeEE Confidence 9999999999999999999988876 6899999999999999999999999999999999999999885 4578899999 Q ss_pred eCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCC-CCCCC Q lcl|NC_010179. 400 WTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGK-GVDDE 469 (469) Q Consensus 400 f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~-~~~de 469 (469) |++++|.|+++.+++++++ |++|+||+++++|+++|+++|++||++|+++.............. ..++| T Consensus 397 f~~~~p~d~~e~a~~~~~~-g~iS~et~i~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~ 466 (474) T protein:vir:95 397 FNFNRMMNDAEQSQIIAQS-QYLSRETLVKSSPLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQE 466 (474) T ss_pred eccCCCcCHHHHHHHHHhc-CCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCC Confidence 9999999999999999885 899999999999999999999999999987765544322221111 11111 No 32 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=9.3e-95 Score=536.09 Aligned_cols=435 Identities=18% Similarity=0.244 Sum_probs=365.8 Q ss_pred CCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccC-CcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRN-DLINNYKKSVDYYENK-TDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~-~~~~~~~~~~~Yy~g~-~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) ++.. ..++|.+++.+|. .+++||+++.+||.|+ |+|..+... 
....++++|+++||+++||++.++| T Consensus 37 ~~~~-~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~~~~~----------~~~~~~~~ki~~n~~k~Ivd~~~~y 105 (502) T protein:vir:48 37 LMVN-NWELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR----------KDNEMADKRAVHNYGRMISKFKTGY 105 (502) T ss_pred hccc-cHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccc----------cccccccceeecchHHHHHHHHhhh Confidence 1111 1345677777775 4578999999999997 466554321 3445778999999999999999999 Q ss_pred hhcCCeeeccCchh----hHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010179. 79 IASVFPDIDVGKDA----DNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLD 153 (469) Q Consensus 79 l~g~p~~~~~~~~~----~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~ 153 (469) |+|+||+++++++. .++.|+++|+.|.+ ..+.++++.++++|++|+++|++++|++++++++|.+++|+||++.. T Consensus 106 l~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~vydd~~~ 185 (502) T protein:vir:48 106 LAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLE 185 (502) T ss_pred hcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEEEEcCCCC Confidence 99999999987643 55678888887755 56789999999999999999999999999999999999999999888 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) .+++++||+|.....++. ..++++|+++.+++|...+. 
+.+....+|+||.||||+| T Consensus 186 ~~~~~~ir~~~~~~~~~~--~~~~~iyt~~~i~~~~~~~~---------------------~~~~~~~~~~~g~vPvv~~ 242 (502) T protein:vir:48 186 DNSIAAVRYYNRGTLQNA--KDVVEIYTNQHIYTLDASDS---------------------FNEISVTPHAFGTVPITEF 242 (502) T ss_pred CceEEEEEEEEEeecCCc--EEEEEEEeCCeEEEEEeCCc---------------------eeeccceecCCCccceEEe Confidence 899999999987665544 35679999999988864332 2334567899999999999 Q ss_pred cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeeccc----CCCCCCcce Q lcl|NC_010179. 234 PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNA----GNGDKSGVD 309 (469) Q Consensus 234 ~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 309 (469) +|++.|.|+|+++++|||+||.++|++++.+++|++|+++++|......++....++..+++.+... +.+.+++++ T Consensus 243 ~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 322 (502) T protein:vir:48 243 LNNADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLKPPKSADGKEGTVKAE 322 (502) T ss_pred cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeeccccccccccccCccee Confidence 9999999999999999999999999999999999999999999877777777778888888877543 334568899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) |++++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++. 
T Consensus 323 ~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 402 (502) T protein:vir:48 323 YLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSL 402 (502) T ss_pred EeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 999999999999999999999999999999998886 7899999999999999999999999999999999999999876 Q ss_pred cC----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhH--hhcccCC Q lcl|NC_010179. 389 SD----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYA--NQADELN 462 (469) Q Consensus 389 ~~----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~--~~~~~~~ 462 (469) .+ .++.+++++|++++|.|.++.++++++++|+||+||+++++|+++|+++|++||++|+++..... ...+... T Consensus 403 ~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 482 (502) T protein:vir:48 403 VNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLGGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDNV 482 (502) T ss_pred cccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhcccccccccc Confidence 43 46678999999999999999999999999999999999999999999999999999987543222 2222222 Q ss_pred CCCCCCC Q lcl|NC_010179. 463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) +.+.+++ T Consensus 483 ~~~~d~~ 489 (502) T protein:vir:48 483 GKYTDEV 489 (502) T ss_pred cccCCCc Confidence 2222222 No 33 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=1.9e-94 Score=534.34 Aligned_cols=435 Identities=18% Similarity=0.239 Sum_probs=365.7 Q ss_pred CCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCC-cccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRN-DLINNYKKSVDYYENKT-DITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~-~~~~~~~~~~~Yy~g~~-~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) ++... .++|.+++.+|. .+++||+++.+||.|+| .|..+.. .....++++|+++||+++||++.++| T Consensus 36 ~~~~~-~~~l~~~i~~~~~~~~~r~~~l~~yY~g~~~~i~~~~~----------~~~~~~~~~ki~~n~~k~Ivd~~~~y 104 (501) T protein:vir:27 36 LMVNN-WELLKNFINHHKLRQAPRIQELLDYARGENHDVLQFGR----------RKDREMADKRAVHNYGRMISKFKTGY 104 (501) T ss_pred ccccc-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccCc----------cCccccccceeccchHHHHHHHHhhh Confidence 22222 234566666664 45789999999999985 4544332 23445788999999999999999999 Q ss_pred hhcCCeeeccCchh----hHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010179. 79 IASVFPDIDVGKDA----DNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLD 153 (469) Q Consensus 79 l~g~p~~~~~~~~~----~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~ 153 (469) |+|+||+++++++. .++.++++|+.|. ...+.++++.++++|++|+++|++++|++++++++|.+++|+||++.. T Consensus 105 l~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~v~d~~~~ 184 (501) T protein:vir:27 105 LAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFVIYDNSLE 184 (501) T ss_pred hcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEEEecCCCC Confidence 99999999987644 4566788887764 556789999999999999999999999999999999999999999888 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) .+++++||+|...+.++. ..++++|+++.+++|...+. 
+.+.+..+|+||.||||+| T Consensus 185 ~~~~~~ir~~~~~~~~~~--~~~~~vyt~~~v~~~~~~~~---------------------~~~~~~~~~~~g~vPvv~~ 241 (501) T protein:vir:27 185 DNSIAAVRYYNRGTLQNA--KDVVEIYTNEHIYTLDASDD---------------------FNEISVTTHAFGTVPITEF 241 (501) T ss_pred CceEEEEEEEEeeecCCc--EEEEEEEeCCeEEEEEeCCc---------------------eeeccccccCCCcccEEEe Confidence 899999999987765554 45789999999988865432 2234567899999999999 Q ss_pred cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccC----CCCCCcce Q lcl|NC_010179. 234 PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAG----NGDKSGVD 309 (469) Q Consensus 234 ~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~ 309 (469) +|++.|.|+|+++++|||+||.++|++++.++++++|+++++|...+..+++...++..+++.+...+ ...+++++ T Consensus 242 ~nn~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (501) T protein:vir:27 242 LNNVDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAE 321 (501) T ss_pred cCCCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCceeecccccccCCCCCccee Confidence 99999999999999999999999999999999999999999998877777888888888888876543 34457899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) |++++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+.+++.|+.+|++++++|+++++. 
T Consensus 322 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~ 401 (501) T protein:vir:27 322 YLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSL 401 (501) T ss_pred eeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 999999999999999999999999999999998886 7899999999999999999999999999999999999999876 Q ss_pred cC----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCC-- Q lcl|NC_010179. 389 SD----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELN-- 462 (469) Q Consensus 389 ~~----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~-- 462 (469) .+ .++.+++++|++++|.|.++.++++++++|++|+||+++++|+++||++|++||++|+++..+...+.+-.. T Consensus 402 ~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~~~~~~~~ 481 (501) T protein:vir:27 402 VNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYSNDFNEHV 481 (501) T ss_pred cccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhcCcccccc Confidence 43 456789999999999999999999999999999999999999999999999999999876554433321111 Q ss_pred CC---------CCCCC Q lcl|NC_010179. 463 GK---------GVDDE 469 (469) Q Consensus 463 ~~---------~~~de 469 (469) ++ +++.| T Consensus 482 ~~~~d~~~~~~~d~~e 497 (501) T protein:vir:27 482 GKYTDEVKETHTDDFE 497 (501) T ss_pred ccccCCCCCCcccccc Confidence 11 11111 No 34 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=4e-94 Score=532.60 Aligned_cols=426 Identities=16% Similarity=0.165 Sum_probs=361.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-+ .|.+++.+|..++++|+++.+||+|+|+|+++.. ....++++|+++||+++||++.++||+ T Consensus 17 ~~~~----~i~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~-----------~~~~~~~~ki~~n~~~~ivd~~~~~l~ 81 (453) T protein:vir:73 17 ITDK----VVNDFMKKHQEEVERYEYLGNMYKGIMEISSQKA-----------KDSWKPDNRLTNNFAKYIVDTFVGYFN 81 (453) T ss_pred CCHH----HHHHHHHHHHHHHHHHHHHHHHhccccchhcCCC-----------CCccCccceeecchHHHHHHHhhhhhc Confidence 4444 4555666778889999999999999999987643 234567899999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+++++++..++.|+++|+.| +.+.+.+++++++++|++|+++|++++|++++++++|.+++|+|++....+++++ T Consensus 82 g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~ 161 (453) T protein:vir:73 82 GIPIKKTHDDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDSIKQKPLFA 161 (453) T ss_pred ccCceeecCChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCCCCceeEEE Confidence 999999999999999999999876 5567889999999999999999999999999999999999999999888889999 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccc Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYR 239 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 239 (469) +++|... ++. 
.++++|+++.+++|...+..+ ...+..+|+||.||||+|+|++.| T Consensus 162 i~~~~~~--~~~---~~~~vyt~~~i~~~~~~~~~~--------------------~~~~~~~~~~g~vPvv~~~n~~~g 216 (453) T protein:vir:73 162 VYYGFDE--EGN---LSGTVYTLLETISITGKAGEV--------------------KFGESTYNVYSDLPIVEYNFNEER 216 (453) T ss_pred EEEEEec--Cce---EEEEEEeCCeEEEEEecCCce--------------------EEccceeccCCceeEEEecCCCCC Confidence 9987532 332 367899999998887654432 234567899999999999999999 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecc------cCCCCCCcceEEee Q lcl|NC_010179. 240 LAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINN------AGNGDKSGVDKLQI 313 (469) Q Consensus 240 ~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~l~~ 313 (469) +|+|+++++|||+||.++|++++.+++|++|+++++|...+. +....+...+++.+.. +..+.+++++|+++ T Consensus 217 ~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~ 294 (453) T protein:vir:73 217 QSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAEVDE--EDAKNIKDNRLINFFDKNSNGQGTNAAKVDVKFLDK 294 (453) T ss_pred CcchhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCCCc--hhhhcccccccccccccccccccccccCceeEEeee Confidence 999999999999999999999999999999999999975443 3444444455544332 12344677999999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC--C Q lcl|NC_010179. 314 DIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD--A 391 (469) Q Consensus 314 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~--~ 391 (469) +.+.+++++++++|.++|+.+|++|++++.++||+||+||+++++++.+||+++++.|+.+|++++++|+++++..+ . 
T Consensus 295 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~ 374 (453) T protein:vir:73 295 PDSDVQTENLLNRLERSIFQFTMAANISDENFGNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKD 374 (453) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999887654 3 Q ss_pred CcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 392 DKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 392 ~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++.+++++|++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++.....+.... ...+.+-. T Consensus 375 ~~~~i~v~f~~~~p~~~~~~a~~~~k~~giis~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~~~~ 451 (453) T protein:vir:73 375 AWKDIEYTFTRNEPKDIKEQAETANILKGITSEETALSVISVIPDVQAEMEKIKKKKLLQLSLTRTSNL-VRMKQMRG 451 (453) T ss_pred ccccceEEeCCCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhccC-Ccchhhhc Confidence 567899999999999999999999999999999999999999999999999999999877654433211 11111111 No 35 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=6.1e-94 Score=531.61 Aligned_cols=435 Identities=18% Similarity=0.242 Sum_probs=365.5 Q ss_pred CCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhccCC-cccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRN-DLINNYKKSVDYYENKT-DITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~-~~~~~~~~~~~Yy~g~~-~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~ 78 (469) +.... .++|.+++.+|. .+.++|+++.+||.|+| .|+.+... 
....++++|+++||+++||++.++| T Consensus 36 ~~~~~-~~~i~~~i~~~~~~~~~r~~~~~~yY~g~~~~i~~~~~~----------~~~~~~~~ri~~n~~k~Ivd~~~~y 104 (501) T protein:vir:96 36 LMVNN-WELLKNFINHHKLRQAPRIQELLDYARGENHDVLKSGRR----------KDNEMADKRAVHNYGRMISKFKTGY 104 (501) T ss_pred ccCCh-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccCcccc----------CccccccceeecchHHHHHHHHhhh Confidence 21221 234566666665 45679999999999974 56544321 3345778899999999999999999 Q ss_pred hhcCCeeeccCch----hhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010179. 79 IASVFPDIDVGKD----ADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLD 153 (469) Q Consensus 79 l~g~p~~~~~~~~----~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~ 153 (469) ++|+||++++.++ ..++.|+++|+.|.+ ..+.+++++++++|++|+++|++++|++++++++|.+++|+||++.. T Consensus 105 l~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~v~d~~~~ 184 (501) T protein:vir:96 105 LAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFVIYDNSLE 184 (501) T ss_pred hcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEEEEcCCCC Confidence 9999999998653 456678889987754 56889999999999999999999999999999999999999999888 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) ++++++|++|...+.++. ..++++|+++.+++|...+. 
..+.+..+|+||.||||+| T Consensus 185 ~~~~~~v~~~~~~~~~~~--~~~~~vyt~~~i~~~~~~~~---------------------~~~~~~~~~~~g~vPvv~~ 241 (501) T protein:vir:96 185 DNSIAAVRYYNRGTLQSA--KDVVEIYTDEHIYTLDASDD---------------------FNEISVTTHAFGTVPITEY 241 (501) T ss_pred CceEEEEEEEEeecCCCc--EEEEEEEcCCcEEEEeeCCC---------------------ceeccccccCCCccceEEe Confidence 899999999987665554 35688999999988864432 2234567899999999999 Q ss_pred cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccC----CCCCCcce Q lcl|NC_010179. 234 PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAG----NGDKSGVD 309 (469) Q Consensus 234 ~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~ 309 (469) +|++.|+|+|+++++|||+||.++|++++.++++++|+++++|......+++...+...+++.+...+ ...+++++ T Consensus 242 ~nn~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (501) T protein:vir:96 242 LNNIDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLKPPKSADGKEGTVKAE 321 (501) T ss_pred cCCccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeecccccccccccCccee Confidence 99999999999999999999999999999999999999999999877777788888888888876543 34567899 Q ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 310 KLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 310 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) |++++.+.++++.++++|.+.|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++|+++++. 
T Consensus 322 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~ 401 (501) T protein:vir:96 322 YLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSL 401 (501) T ss_pred eEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999999999999999998876 7899999999999999999999999999999999999999876 Q ss_pred c----CCCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhh--cccCC Q lcl|NC_010179. 389 S----DADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQ--ADELN 462 (469) Q Consensus 389 ~----~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~--~~~~~ 462 (469) . ..++.+++|+|++++|.|+++.++++++++|+||+||+++++|+++|+++|++||++|+++....... .++.. T Consensus 402 ~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 481 (501) T protein:vir:96 402 VNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYSNDFNEHV 481 (501) T ss_pred cccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccccchhhcc Confidence 4 34567899999999999999999999999999999999999999999999999999998876543322 22222 Q ss_pred CCCCCCC Q lcl|NC_010179. 463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) +...+++ T Consensus 482 ~~~~~~~ 488 (501) T protein:vir:96 482 GKYTDEV 488 (501) T ss_pred cccCCcC Confidence 2222222 No 36 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=5.4e-93 Score=526.42 Aligned_cols=433 Identities=18% Similarity=0.205 Sum_probs=367.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.++|.++|+++ +.++.++|+++.+||+|+|+|++++. ...++++|+++||+++||++.++||+ T Consensus 25 ~~~~~i~~~i~~~---~~~~~~~~~~l~~Yy~g~~~i~~~~~------------~~~~~~~ki~~n~~~~Ivd~~~~~l~ 89 (470) T protein:vir:99 25 LTSNELLGFIAYN---ETVLKPRYRENMKLYLGKHKILTAPE------------KETGADNRIVVNSAKYVVDVYNGYFC 89 (470) T ss_pred cCHHHHHHHHHHH---HHhhHHHHHHHHHHhccccccccCcc------------cccCCcceeecchHHHHHHHHhhhhc Confidence 7777777766553 34566889999999999999987653 23467889999999999999999999 Q ss_pred cCCeeeccCch-hhHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 81 SVFPDIDVGKD-ADNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 81 g~p~~~~~~~~-~~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) |+||+++++++ ..++.++++|..|. .+.+.++++.++++|++|+++|++++|++++++++|.+++|+||+....++++ T Consensus 90 g~p~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~~i~d~~~~~~~~~ 169 (470) T protein:vir:99 90 GIEPKLALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAFIIYDDTVQRQPLA 169 (470) T ss_pred cCCeeEeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeEEEEcCCCCcceEE Confidence 99999998654 56788999998765 46788999999999999999999999999999999999999999988888999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) ++|+|.... +.....++.+|++..+++|...+.. ......+..+|+||.||||+|+|++. 
T Consensus 170 ~vr~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~g~vPvv~~~n~~~ 229 (470) T protein:vir:99 170 FVHYQIDNS--NNWTDAYGVIQYADKFYKFKGYDIE------------------EDTNAAGYAINPYGLVPAVEFFENEE 229 (470) T ss_pred EEEEEEEec--CCeeEEEEEEEecCeEEEEEecccc------------------cccccccccccCCCccceEeecCCCC Confidence 999987643 3445566788888888777654322 12233456689999999999999999 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcc--cchhhhhhhhhcceeeecccCCCCCCcceEEeecCC Q lcl|NC_010179. 239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGA--SLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIP 316 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 316 (469) |+|+|+++++|||+||.++|++++.++++++|+++++|+... ..+++...+..++++.++..+.+.+++++|++++.+ T Consensus 230 g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 309 (470) T protein:vir:99 230 RQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLPEDDEGNPKFDFKNNRVLYVSQLDPDTNPQIGFIAKPDA 309 (470) T ss_pred CCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcccccccchhhhhhhcceeeecCCCCCCCCcceEEeecCC Confidence 999999999999999999999999999999999999997543 334566677788888888877778899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---CC Q lcl|NC_010179. 
317 VEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD---AD 392 (469) Q Consensus 317 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~---~~ 392 (469) .+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++++++++.+++..+ .+ T Consensus 310 ~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~ 389 (470) T protein:vir:99 310 DQMQENLIQHLTDFIFMMAMVPNIQDKNFAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQEL 389 (470) T ss_pred hHHHHHHHHHHHHHHHHHhCCccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccc Confidence 99999999999999999999999988886 789999999999999999999999999999999999999987643 45 Q ss_pred cccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc----ccCCCCCCCC Q lcl|NC_010179. 393 KRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA----DELNGKGVDD 468 (469) Q Consensus 393 ~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~----~~~~~~~~~d 468 (469) +.+++++|++++|.|+++.++++++++|++|+||+++++|++ |+++|++||++|+++.....++. +..+..+.++ T Consensus 390 ~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~et~l~~l~~v-d~~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~e 468 (470) T protein:vir:99 390 WSELDFKFTRNLPEDMASAIDNAKNAEGIVSKKTQLGMIPDI-EPDAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAE 468 (470) T ss_pred cccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCC-CHHHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCcc Confidence 678999999999999999999999999999999999999998 79999999999987765544332 2222222222 Q ss_pred -C Q lcl|NC_010179. 
469 -E 469 (469) Q Consensus 469 -e 469 (469) | T Consensus 469 e~ 470 (470) T protein:vir:99 469 EE 470 (470) T ss_pred CC Confidence 2 No 37 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=1.1e-92 Score=524.77 Aligned_cols=437 Identities=16% Similarity=0.243 Sum_probs=371.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.+.|+++|+++. ..++++++++.+||+|+|.+.......+ .....++++|+++||++.||++.++|++ T Consensus 30 ~~~~~i~~~i~~~~---~~~~~~~~~~~~yY~g~~~~i~~~~~~~-------~~~~~~~~~ki~~n~~~~ivd~~~~~l~ 99 (481) T protein:vir:10 30 LKEENLRNFISRHQ---TEQVPRLEMLESYYLNRNTDILAGERRL-------QKYGDKADHRAVHNYAKYVSRFIVGYLT 99 (481) T ss_pred cCHHHHHHHHHHHH---HHHHHHHHHHHHHhcCCCcccccCcccc-------ccccccccceeecchHHHHHHHHHhhhc Confidence 77777777777643 4577889999999999986543332221 1234467889999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) |+||+|+++++..++.++++|++| +...+.++++.++++|++|+++|++++|++++++++|++++|+||+....+++++ T Consensus 100 g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p~~~~~v~d~~~~~~~~~~ 179 (481) T protein:vir:10 100 GNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDPKSTFVVYDQTLDKKVVAG 179 (481) T ss_pred cCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcccceEEEEcCCCCCceEEE Confidence 999999999999999999999886 4557889999999999999999999999999999999999999999888899999 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccc Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYR 239 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 239 (469) +|+|...+.++.. ..++++|+++.+++|...++.+ ..++..+|+||.||||+|+|++.| T Consensus 180 i~~~~~~~~~~~~-~~~~~~y~~~~i~~~~~~~~~~--------------------~~~~~~~~~~g~vPvv~~~n~~~g 238 (481) T protein:vir:10 180 VRYFEKQDKDKVP-VQHVEVYTTDKIYYIEIKGGTY--------------------HRVEEVEHYYNDVPIIEYLNDQFK 238 (481) T ss_pred EEEEEEeeCCCce-EEEEEEEecCeEEEEEecCCce--------------------eecccccccCCceeEEEeecCCCC Confidence 9999877665544 4578999999999987665432 234567899999999999999999 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeeccc----CCCCCCcceEEeecC Q lcl|NC_010179. 240 LAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNA----GNGDKSGVDKLQIDI 315 (469) Q Consensus 240 ~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~ 315 (469) +|+|+++++|||+||.++|++++.++++++|+++++|....+ ++....+...+.+.+... +++++++++|++++. 
T Consensus 239 ~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 317 (481) T protein:vir:10 239 QGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLD-SEDAKAFRDANMIHLEPGTNANGSEGKAEVKYVYKQY 317 (481) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCC-ccchhhhhhccceeccccccccCCCCCcceeEEeecC Confidence 999999999999999999999999999999999999965443 344555555555555433 345578899999999 Q ss_pred CHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---C Q lcl|NC_010179. 316 PVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD---A 391 (469) Q Consensus 316 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~---~ 391 (469) +.+++++++++|.+.|+.+|++|+++++.+ ||+||+||++++++|.+||+++++.|+.+|++++++++++++..+ . T Consensus 318 ~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~ 397 (481) T protein:vir:10 318 DVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSGVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVNLTGLKQH 397 (481) T ss_pred CHHHHHHHHHHHHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcc Confidence 999999999999999999999999988776 789999999999999999999999999999999999999987754 4 Q ss_pred CcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc-----ccCCCCCC Q lcl|NC_010179. 392 DKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA-----DELNGKGV 466 (469) Q Consensus 392 ~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~-----~~~~~~~~ 466 (469) +..+++++|++++|.|+++.++++++++|+||+||+++++|+++|+++|++||++|+++..+..++. 
.+.+++.+ T Consensus 398 ~~~~i~v~f~~~~~~~~~~~a~~~~kl~g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~d 477 (481) T protein:vir:10 398 NYAELTITFTPNLPKSMMESINAFNALSGGVSESTRLSLLDFIDNPKEELEKMQEEEAQREKQADKRGYGEAFENHLNVD 477 (481) T ss_pred ccceeeEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhhhccCCccCCCCCCCC Confidence 5678999999999999999999999999999999999999999999999999999998877665532 22233333 Q ss_pred CCC Q lcl|NC_010179. 467 DDE 469 (469) Q Consensus 467 ~de 469 (469) |+| T Consensus 478 d~~ 480 (481) T protein:vir:10 478 DSN 480 (481) T ss_pred CCC Confidence 333 No 38 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=1.2e-92 Score=524.53 Aligned_cols=439 Identities=19% Similarity=0.222 Sum_probs=365.2 Q ss_pred CCHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHhccCCc---ccccccchh---hhcccccccccccCcceec Q lcl|NC_010179. 1 MELDAL----------KKLIRNTSTSRNDLINNYKKSVDYYENKTD---ITTRNNGKP---KVSKEGKKDPLRSADNRIP 64 (469) Q Consensus 1 ~~~~~~----------~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~---i~~~~~~~~---~~~~~~~~~~~~~~~~ri~ 64 (469) |++-.. .++|.+++..|...+.++.++.+||+|.++ +..++.... ............++++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:10 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 433221 245677777888889999999999999764 333322111 1111223345567889999 Q ss_pred cchHHHHHHHHHHhhhcCCeeeccC-----chhhHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEE Q lcl|NC_010179. 
65 SNFYQLLVDQEAGYIASVFPDIDVG-----KDADNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGI 138 (469) Q Consensus 65 ~n~~k~iv~~~~~~l~g~p~~~~~~-----~~~~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~ 138 (469) +||+++||++.++|+||+||+|+++ ++..++.|+++|++|. .+.+.+++++++++|+||+++|++++|++++++ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~ 160 (474) T protein:vir:10 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKN 160 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEE Confidence 9999999999999999999999875 3456678899998764 567889999999999999999999999999999 Q ss_pred EccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccc Q lcl|NC_010179. 139 IQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQS 218 (469) Q Consensus 139 ~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (469) ++|.+++|+||++ .+++++|++|...+..+......+++|++..++.|...+.+ .+.+. T Consensus 161 i~p~~~~~v~d~~--~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~-------------------~~~~~ 219 (474) T protein:vir:10 161 IDPYNVIFVGDNI--LEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGID-------------------ALQEV 219 (474) T ss_pred EcccceEEEEcCC--CceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCC-------------------ccccc Confidence 9999999999864 46789999999888888888889999999999888765322 12345 Q ss_pred ccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeec Q lcl|NC_010179. 219 NTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKIN 298 (469) Q Consensus 219 ~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 298 (469) +..+|+||.||||+|+|++.|.|+|+++++|||+||.++|++++.++++++|+++++|+.... +....+...+++.+. 
T Consensus 220 ~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~ 297 (474) T protein:vir:10 220 GRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELF 297 (474) T ss_pred ccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEec Confidence 567899999999999999999999999999999999999999999999999999999976543 455566677787775 Q ss_pred ccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 299 NAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (469) Q Consensus 299 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 377 (469) +. +++++|++++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++ T Consensus 298 ~~----~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 373 (474) T protein:vir:10 298 DK----DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRY 373 (474) T ss_pred CC----CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43 46799999999999999999999999999999999998876 78999999999999999999999999999999 Q ss_pred HHHHHHHHhcccC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_010179. 378 LVRAIMRYLNFSD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREEND 452 (469) Q Consensus 378 ~~~~i~~~~~~~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~ 452 (469) ++++|+.+++..+ .++.+++++|++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++.. 
T Consensus 374 ~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~ 453 (474) T protein:vir:10 374 QFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFN 453 (474) T ss_pred HHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 9999999987643 35678999999999999999999999999999999999999999999999999999987655 Q ss_pred hhHhhcccCCCCCCCCC Q lcl|NC_010179. 453 PYANQADELNGKGVDDE 469 (469) Q Consensus 453 ~~~~~~~~~~~~~~~de 469 (469) ...... ..++.+++ T Consensus 454 ~~~~~~---~~~~~~~~ 467 (474) T protein:vir:10 454 DKLPDI---DEGDANDK 467 (474) T ss_pred hhcccc---cCCCcCCC Confidence 433222 11222222 No 39 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=1.2e-92 Score=524.53 Aligned_cols=439 Identities=19% Similarity=0.222 Sum_probs=365.2 Q ss_pred CCHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHhccCCc---ccccccchh---hhcccccccccccCcceec Q lcl|NC_010179. 1 MELDAL----------KKLIRNTSTSRNDLINNYKKSVDYYENKTD---ITTRNNGKP---KVSKEGKKDPLRSADNRIP 64 (469) Q Consensus 1 ~~~~~~----------~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~---i~~~~~~~~---~~~~~~~~~~~~~~~~ri~ 64 (469) |++-.. .++|.+++..|...+.++.++.+||+|.++ +..++.... ............++++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~ 80 (474) T protein:vir:94 1 MTLYKLIDDIEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLN 80 (474) T ss_pred CchHHHHhhccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccc Confidence 433221 245677777888889999999999999764 333322111 1111223345567889999 Q ss_pred cchHHHHHHHHHHhhhcCCeeeccC-----chhhHHHHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEE Q lcl|NC_010179. 
65 SNFYQLLVDQEAGYIASVFPDIDVG-----KDADNKKILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGI 138 (469) Q Consensus 65 ~n~~k~iv~~~~~~l~g~p~~~~~~-----~~~~~~~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~ 138 (469) +||+++||++.++|+||+||+|+++ ++..++.|+++|++|. .+.+.+++++++++|+||+++|++++|++++++ T Consensus 81 ~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~~~~ 160 (474) T protein:vir:94 81 NSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDTNGDIRIKN 160 (474) T ss_pred cchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCeeEEEE Confidence 9999999999999999999999875 3456678899998764 567889999999999999999999999999999 Q ss_pred EccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccc Q lcl|NC_010179. 139 IQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQS 218 (469) Q Consensus 139 ~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (469) ++|.+++|+||++ .+++++|++|...+..+......+++|++..++.|...+.+ .+.+. T Consensus 161 i~p~~~~~v~d~~--~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~-------------------~~~~~ 219 (474) T protein:vir:94 161 IDPYNVIFVGDNI--LEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGID-------------------ALQEV 219 (474) T ss_pred EcccceEEEEcCC--CceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCC-------------------ccccc Confidence 9999999999864 46789999999888888888889999999999888765322 12345 Q ss_pred ccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeec Q lcl|NC_010179. 219 NTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKIN 298 (469) Q Consensus 219 ~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 298 (469) +..+|+||.||||+|+|++.|.|+|+++++|||+||.++|++++.++++++|+++++|+.... +....+...+++.+. 
T Consensus 220 ~~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~~~~--~~~~~~~~~~~i~~~ 297 (474) T protein:vir:94 220 GRYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMGMSE--EMIQETQKSGAFELF 297 (474) T ss_pred ccccCCCCccceEEecCCCCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCCCCc--hhhhhhhhcceeEec Confidence 567899999999999999999999999999999999999999999999999999999976543 455566677787775 Q ss_pred ccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 299 NAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (469) Q Consensus 299 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 377 (469) +. +++++|++++.+.+++++++++|.++|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|++ T Consensus 298 ~~----~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 373 (474) T protein:vir:94 298 DK----DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRY 373 (474) T ss_pred CC----CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43 46799999999999999999999999999999999998876 78999999999999999999999999999999 Q ss_pred HHHHHHHHhcccC-----CCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_010179. 378 LVRAIMRYLNFSD-----ADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREEND 452 (469) Q Consensus 378 ~~~~i~~~~~~~~-----~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~ 452 (469) ++++|+.+++..+ .++.+++++|++++|.|+++.++++++++|++|+||+++++|+++|+++|++||++|+++.. 
T Consensus 374 ~~~li~~~l~~~~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~e~~ 453 (474) T protein:vir:94 374 QFKVILSALKRKGYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLKGQVSERTRLGQSQLVDDVDYELDEMEKESLEFN 453 (474) T ss_pred HHHHHHHHHhhccCCCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH Confidence 9999999987643 35678999999999999999999999999999999999999999999999999999987655 Q ss_pred hhHhhcccCCCCCCCCC Q lcl|NC_010179. 453 PYANQADELNGKGVDDE 469 (469) Q Consensus 453 ~~~~~~~~~~~~~~~de 469 (469) ...... ..++.+++ T Consensus 454 ~~~~~~---~~~~~~~~ 467 (474) T protein:vir:94 454 DKLPDI---DEGDANDK 467 (474) T ss_pred hhcccc---cCCCcCCC Confidence 433222 11222222 No 40 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=2.5e-92 Score=522.73 Aligned_cols=425 Identities=16% Similarity=0.178 Sum_probs=359.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeeecc Q lcl|NC_010179. 9 LIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDIDV 88 (469) Q Consensus 9 ~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~ 88 (469) ||..+ +..+.+||+++++||+|+|+++.++... ....++++|+++||+++||++.++||||+||++++ T Consensus 1 ~~~~~---~~~~~~r~~~l~~yy~g~~~~~~~~~~~---------~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~ 68 (440) T protein:vir:95 1 MLAAF---LGSQKQRLAILASYAQGDNFSILSGHRR---------LDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGV 68 (440) T ss_pred ChhhH---HHHHHHHHHHHHHHhccCCccccccccc---------ccccCCcceeecchHHHHHHhhhhheeccCceEee Confidence 54444 4556788999999999999987665432 34457889999999999999999999999999987 Q ss_pred Cch---hhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEE Q lcl|NC_010179. 
89 GKD---ADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYK 164 (469) Q Consensus 89 ~~~---~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~ 164 (469) .++ +..+.|+++|++|.+ ..+.+++++++++|++|+++|++++|++++++++|.+++|+||+....++++++++|. T Consensus 69 ~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~ 148 (440) T protein:vir:95 69 MEGGSADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPI 148 (440) T ss_pred CCCccHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEE Confidence 554 455678899987755 5678999999999999999999999999999999999999999988889999999987 Q ss_pred eeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHH Q lcl|NC_010179. 165 QLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELN 244 (469) Q Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~ 244 (469) ..+ ...+++|++..+++|...+... ......+..+|+||.||||+|+|++.|+|+|+ T Consensus 149 ~~~------~~~~~vyt~~~~~~~~~~~~~~-----------------~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e 205 (440) T protein:vir:95 149 YAD------KVNMTVYTKDKVITYKPYSNNS-----------------VRLVVDDVKKHSYNDVPVVEWWNNRFRMGDYE 205 (440) T ss_pred ecC------ceEEEEEeCCeEEEEEEecCCc-----------------cceeecceeeccCceeeEEEeeCCCCCCCchh Confidence 543 1356899999999887654321 12345567899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcc--cchhhhhhhhhcceeeeccc----CCCCCCcceEEeecCCHH Q lcl|NC_010179. 245 KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGA--SLKQFMNDLREYKSIKINNA----GNGDKSGVDKLQIDIPVE 318 (469) Q Consensus 245 ~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~--~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~l~~~~~~~ 318 (469) ++++|||+||.++|++++.+++|++|+++++|.... ..++....+...+++.+... 
+.+.+++++|++++++.+ T Consensus 206 ~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lt~~~~~~ 285 (440) T protein:vir:95 206 SEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLFLKTGISTTGQQTTADASYIYKQYDVN 285 (440) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhccceecccccccccCCCCcceeEEeecCCHH Confidence 999999999999999999999999999999996432 23445555666666655432 345678899999999999 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcc Q lcl|NC_010179. 319 ARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKR 394 (469) Q Consensus 319 ~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~ 394 (469) ++++++++|.+.|+.+|++|++++.++ ||+||+||+++++++.+||+++++.|+++|++++++|+.+++.. ..++. T Consensus 286 ~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~ 365 (440) T protein:vir:95 286 GTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAINGPVIEAN 365 (440) T ss_pred HHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccccc Confidence 999999999999999999999998886 68999999999999999999999999999999999999998754 35677 Q ss_pred cceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhh-cccCCCCCCCCC Q lcl|NC_010179. 
395 HISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQ-ADELNGKGVDDE 469 (469) Q Consensus 395 ~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~-~~~~~~~~~~de 469 (469) +++++|++++|.|+++.+++++|++|+||+||+++++|++++ ++|++||++|+++..+..++ .....+++.|+| T Consensus 366 ~v~i~f~~~~p~~~~~~ad~~~kl~g~iS~et~~~~l~~~d~-~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 366 KLTFTFHPNIPQDVWTEIKAYIEAGGEISQETLMENASFTDY-KTEHSRILKQGGSSDLEIGQIVGDADVGQADTE 440 (440) T ss_pred cceEEeCCCCCCCHHHHHHHHHHHhccCcHHHHHHhCCCCCc-HHHHHHHHHHHHHhhhhHHhhccCCCCCCcCCC Confidence 899999999999999999999999999999999999999854 67999999998887665544 455566666666 No 41 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=1.1e-90 Score=513.68 Aligned_cols=437 Identities=17% Similarity=0.173 Sum_probs=360.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.+.++|.++. .++.++|+++.+||.|+|+|++++.. ....++++|+++||+++||++.++|+| T Consensus 15 ~~~~~~~~~i~~~~---~~~~~r~~~~~~yy~g~~~i~~~~~~----------~~~~~~~~ki~~n~~~~iv~~~~~~l~ 81 (489) T protein:vir:99 15 LWIDQLKNYISRFK---AEQLERLKELKRYYLGDNNIKYRPAK----------TDKYAADNRIASDFAKYITVFEQGYML 81 (489) T ss_pred CCHHHHHHHHHHHH---HHHHHHHHHHHHHhcccCcccccccc----------ccccCCcceeecchHHHHHHHHhhhhc Confidence 77788888887763 44678899999999999999876532 233467789999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEE----cCCCceEEEEEccceeEEEEeCCCCCc Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWI----DEDNNFRYGIIQPDQITPVYATTLDNK 155 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~----d~~~~~~i~~~~p~~~~~~~d~~~~~~ 155 (469) |+||+|+++++..++.++++|+.|.+ ..+.++++.++++|++|+++|+ |+++++++++++|.+++|+|++....+ T Consensus 82 g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~v~dd~~~~~ 161 (489) T protein:vir:99 82 GVPVEYKNENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFVIYDDTYQRN 161 (489) T ss_pred cCCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEEEEcCCCCCc Confidence 99999999999999999999988755 5678999999999999999986 567899999999999999999888888 Q ss_pred eEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC Q lcl|NC_010179. 156 LLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK 235 (469) Q Consensus 156 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 235 (469) ++++|++|...+.++ ....++++|+++.+++|........ .....+..+|+||.||||+|+| T Consensus 162 ~~~~i~~~~~~~~~~-~~~~~~~~y~~~~i~~~~~~~~~~~-----------------~~~~~~~~~~~~g~vPvv~~~n 223 (489) T protein:vir:99 162 SLMAVHFYDIDYGSG-KRKQIIKAYTSDTIYTYEDYNLETK-----------------GMRLKDYEGHFFKGVPVNEYAN 223 (489) T ss_pred eEEEEEEEEEecCCC-ceEEEEEEEeCCcEEEEEecCCCcc-----------------cceecccccccCCceeEEEeec Confidence 999999998765544 4456789999999988876543211 1233456789999999999999 Q ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc--hhh--------------hhhhhhcceeeecc Q lcl|NC_010179. 236 NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL--KQF--------------MNDLREYKSIKINN 299 (469) Q Consensus 236 ~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~--~~~--------------~~~~~~~~~~~~~~ 299 (469) ++.|+|+|+++++|||+||.++|++++.++++++|+++++|...... ... .......+++.+.. 
T Consensus 224 ~~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 303 (489) T protein:vir:99 224 NEERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADENDYLDDGRLNPNGRLAISIGFKKAQVLILDD 303 (489) T ss_pred CCCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccchhhhhhcccccccccccccccccceeeeecc Confidence 99999999999999999999999999999999999999999643221 111 11122233444444 Q ss_pred cC--CCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AG--NGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAIN 376 (469) Q Consensus 300 ~~--~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 376 (469) ++ ++.+++++||+++.+.+++++++++|.+.||.+|++|++++.++ ||+||+||+++++++.+||+++++.|+.+|+ T Consensus 304 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~ 383 (489) T protein:vir:99 304 NPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLM 383 (489) T ss_pred ccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 32 34467899999999999999999999999999999999988775 7899999999999999999999999999999 Q ss_pred HHHHHHHHHhcccCC------CcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCC--CHHHHHHHHHHHH Q lcl|NC_010179. 377 ELVRAIMRYLNFSDA------DKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVD--DWQQELKDLAKDR 448 (469) Q Consensus 377 ~~~~~i~~~~~~~~~------~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~--d~~~E~eri~~E~ 448 (469) +++++|+.+++..+. 
.+.+++|+|++++|.|.++.++++++++|+||+||+++++|+++ |+++|++||++|+ T Consensus 384 ~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~giis~et~~~~l~~v~~~d~~~E~~ri~~E~ 463 (489) T protein:vir:99 384 RRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEILNTVTGVDAEAELKRLKEEA 463 (489) T ss_pred HHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhcCCCCchhHHHHHHHHHHHH Confidence 999999999876432 34679999999999999999999999999999999999999987 6889999999998 Q ss_pred HHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 449 EENDPYANQADELNGKGVDDE 469 (469) Q Consensus 449 ~~~~~~~~~~~~~~~~~~~de 469 (469) ++.....+.... ++...++| T Consensus 464 ~~~~~~~~~~~~-~~~~~~~~ 483 (489) T protein:vir:99 464 DKKQSLPEPRLV-GDASGQEE 483 (489) T ss_pred HHHhcccccccc-CCCCCCcC Confidence 766554332211 11111111 No 42 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=7e-76 Score=432.63 Aligned_cols=447 Identities=12% Similarity=0.054 Sum_probs=334.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.+++.+++.+++.+|..+++|++++.+||+|+|++........ ......+.++++|||++||++.++|++ T Consensus 23 ~~~~~~~~l~~~l~~~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~--------~~~~~~~~~~v~n~~~~ivd~~a~~l~ 94 (501) T protein:vir:25 23 MSREQLGALVADMWRLHISERQWLDRIYEYTKGLRGRPEVPEGAS--------DEVKELAKLSVKNVLSLVRDSFAQNLS 94 (501) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhccccCC--------hhhhhhHhhhhcChHHHHHHHHHhhhc Confidence 999999999999999999999999999999999998644322111 111122345678999999999999997 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEe-CCCCCceEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYA-TTLDNKLLG 158 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d-~~~~~~~~~ 158 (469) .++ |+++++..++.++++|+.|+++ ...+++++++++|+||++||.++++ +++++++|.+++++|+ +..+..+.+ T Consensus 95 ~~g--f~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~-~~i~~~sp~~~~~iy~D~~~~~~~~~ 171 (501) T protein:vir:25 95 VVG--YRNALAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEG-PVFRTRSPRQILAVYADPSVDAWPQY 171 (501) T ss_pred ccc--eecCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCC-CeEEEeccccEEEEEecCCCCcceeE Confidence 554 6666777788899999888665 4679999999999999999999887 5899999999999995 555667999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccc--cccccccccccccccccCCcccEEEecCC Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSY--DLSAGYETGQSNTLKHNFGRVPFIEFPKN 236 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~g~vPvv~~~n~ 236 (469) ++++|......+ ....+++|++..+++|......... .....++.. ......+..+....+|+||.||||+|+|+ T Consensus 172 ai~~~~~~~~~~--~~~~~~~y~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~f~N~ 248 (501) T protein:vir:25 172 ALETWVAQKDAK--PHRRGVLYDDTYMYELDLGEVVLGD-AGGGQATQQPVNVREVTDVIEHGATFEGKPVCPVVRFVNG 248 (501) T ss_pred EEEEEeeccccC--cceeEEEecCeeEEEEecCceeeee-ccccccccccccccccccccccccccCCccceeeEeccCc Confidence 999987665433 3346778988877766543221111 111011111 11112223344567899999999999995 Q ss_pred ----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe Q lcl|NC_010179. 237 ----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ 312 (469) Q Consensus 237 ----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 312 (469) +.|+|+|+++++|+|+||+++|++++..+++++|+++++|.+.+..+. ..+..++++.++. 
+++++.+ T Consensus 249 ~~~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~--~~~~~~~i~~~~~------~~~~~~q 320 (501) T protein:vir:25 249 RDADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWTGSKAEV--LKASALRVWTFED------PEVKAQA 320 (501) T ss_pred cccCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCCCCccch--hhhcccceeccCC------CCceEEE Confidence 458999999999999999999999999999999999999987655432 2334445554431 3467777 Q ss_pred ecC-CHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_010179. 313 IDI-PVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD 390 (469) Q Consensus 313 ~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~ 390 (469) ++. +.+++.+.++.+..+|+..|++|+.++.++ +|+||+||++++.+|.+||.++++.|+.+|++++++++.+.+... T Consensus 321 ~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~ 400 (501) T protein:vir:25 321 FPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMINVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPD 400 (501) T ss_pred ecccChHHHHHHHHHHHHHHHhhcCCChhhhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 665 578899999999999999999998888764 789999999999999999999999999999999999999887654 Q ss_pred -CCcccceEEeCCCCCCCHHHHHHHHHHHhcc-CChHHHHHhCCCCCCHHH-HHHHHHHHHHHhhhhHhh--cccCCCCC Q lcl|NC_010179. 391 -ADKRHISQHWTRTKVEDSLTKAQIVSTVANY-SSKEAVAKANPIVDDWQQ-ELKDLAKDREENDPYANQ--ADELNGKG 465 (469) Q Consensus 391 -~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~-iS~et~~~~l~~v~d~~~-E~eri~~E~~~~~~~~~~--~~~~~~~~ 465 (469) .+..++++.|+++.|.|.++.+++++|+.|+ +|.||++.++|++++++. ++++.++|+......... .+..++.. 
T Consensus 401 ~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gis~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~~ 480 (501) T protein:vir:25 401 TAADSGAEVLWRDTEARSFGAVVDGITKLASAGIPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVPP 480 (501) T ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCCC Confidence 3557899999999999999999999999776 899999999999986542 233333333222111111 11111111 Q ss_pred CCCC Q lcl|NC_010179. 466 VDDE 469 (469) Q Consensus 466 ~~de 469 (469) .+.| T Consensus 481 ~~~~ 484 (501) T protein:vir:25 481 PPPQ 484 (501) T ss_pred CCCC Confidence 1111 No 43 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=9.5e-76 Score=431.88 Aligned_cols=431 Identities=14% Similarity=0.067 Sum_probs=326.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.+ .++|..++..|..+.+|++++.+||+|+|+|.+.... .+....++|+++||+++||++.++||+ T Consensus 1 ~~t~--~d~i~~L~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~----------~~~~~~~~~~~~n~~~~ivd~~~~~l~ 68 (480) T protein:vir:78 1 MTTY--HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG----------APPELAYLDVQPGWVATYLRTLSDRLD 68 (480) T ss_pred CCCH--HHHHHHHHHHHHHHHHHHHHHHHHHhccccchhcccc----------cchhhhhhhhhcchHHHHHHHHHhhhc Confidence 8888 7788888888999999999999999999987543321 122234678999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEE------cCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWI------DEDNNFRYGIIQPDQITPVYATTLD 153 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~~~d~~~~ 153 (469) +++++.. 
++++.++.++++|+.|.+ ..+.+++++++++|+||++||. +++|++++++++|.+++|+||+... T Consensus 69 ~~g~~~~-~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~~i~D~~~~ 147 (480) T protein:vir:78 69 IEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNT 147 (480) T ss_pred cCceecC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceEEEEcCCCc Confidence 8887644 566778889999987755 5678999999999999999985 5678999999999999999999988 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) +++.+++++|...+..+ ....+++|+++.+++|...+..... .....+..+|+||+||||+| T Consensus 148 ~~~~~~i~~~~~~d~~~--~~~~~~~y~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~g~vPvv~f 209 (480) T protein:vir:78 148 RRVTRAVRLYTTRDDVA--VPDRATLYLPDETVPLRRNGGLNDQ----------------WVVDGDVIKHGLGVVPVVPL 209 (480) T ss_pred cceEEEEEEEEeecCCc--ceEEEEEEeCCeEEEEEecCCCccc----------------ccccccccccCCCCcceEEe Confidence 89999999997665433 4567889999998888765433211 01223567899999999999 Q ss_pred cCCc-----cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhh--hhhh-hcceeeecccCCCC Q lcl|NC_010179. 234 PKNK-----YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFM--NDLR-EYKSIKINNAGNGD 304 (469) Q Consensus 234 ~n~~-----~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~--~~~~-~~~~~~~~~~~~~~ 304 (469) +|++ .|.|+++ .|++|+|+||+++|++++.+++|++|+++++|.+.+...... ..+. ..+.+.. 
.+ T Consensus 210 ~n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 284 (480) T protein:vir:78 210 TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-----LA 284 (480) T ss_pred ecccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccchhhhhhhhhcc-----CC Confidence 9975 4889997 599999999999999999999999999999997543221111 1111 1111111 12 Q ss_pred CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC----C-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 305 KSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS----N-ASGVAIKMLYSHLELKAAKTQTYFEHAINELV 379 (469) Q Consensus 305 ~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g----~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 379 (469) ++++++.+++.. ..+.+++++...|+.++.+|++++..|| | +||+||++++++|.+||+++++.|+.+|++++ T Consensus 285 ~~~~~~~~~~~~--~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~ 362 (480) T protein:vir:78 285 SEAAKISEFKAA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) T ss_pred CCCceEEecCcc--CHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 345788886652 2344555555555555555554444433 2 69999999999999999999999999999999 Q ss_pred HHHHHHhcccC-CCcccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhh Q lcl|NC_010179. 380 RAIMRYLNFSD-ADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDREENDPY 454 (469) Q Consensus 380 ~~i~~~~~~~~-~~~~~i~i~f~~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~ 454 (469) ++++.+++... .++..++++|+++.|+|.++.+++++|+. |++|++|+++++|+++|+.+|++++++++++.... 
T Consensus 363 rl~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~e~~~~~~~~~~~~~~ 442 (480) T protein:vir:78 363 RIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDMID 442 (480) T ss_pred HHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHHHH Confidence 99999987543 45678999999999999999999999873 46999999999999999999998887766543211 Q ss_pred Hh---------hcccCCCCCCCCC Q lcl|NC_010179. 455 AN---------QADELNGKGVDDE 469 (469) Q Consensus 455 ~~---------~~~~~~~~~~~de 469 (469) .. .......++.++| T Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~~~ 466 (480) T protein:vir:78 443 TLYSTTKAQADATPKPTVTETKTE 466 (480) T ss_pred HhhccccCCCccccCCCCCCCCCc Confidence 11 1111122222222 No 44 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=3.3e-75 Score=428.92 Aligned_cols=426 Identities=13% Similarity=0.085 Sum_probs=325.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) =+.+...+++.+++.+|..+++|++++.+||+|+|+|.+..... +....+.++++||+++||+..+++|. T Consensus 9 ~e~~~~~~~~~~l~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~----------~~~~~~~~~v~n~~~~iVd~~~~~l~ 78 (486) T protein:vir:42 9 EEIEDPAVVREEMISAFEDASKDLASNTSYYDAERRPEAIGVTV----------PREMQQLLAHVGYPRLYVDSVAERQA 78 (486) T ss_pred CCcccHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcchhccccc----------chhHhhhhhccchHHHHHHHHHhhhc Confidence 23445567888899999999999999999999999886543221 11223456789999999999999997 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcC--------CCceEEEEEccceeEEEEeCC Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDE--------DNNFRYGIIQPDQITPVYATT 151 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~--------~~~~~i~~~~p~~~~~~~d~~ 151 (469) ..++++. +++..++.++++|+.|.++ ...+++++++++|+||++||.++ ++.+++++++|.+++++||+. T Consensus 79 ~~g~~~~-~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~i~d~~ 157 (486) T protein:vir:42 79 VEGFRLG-DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQNVPIIRVEPPTRMHAEIDPR 157 (486) T ss_pred ccceecC-CCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCeeEEEEecccceEEEEeCC Confidence 7666533 4455667799999877554 57799999999999999999865 456789999999999999986 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) . .++.+++++|... + ....+++++|+++.+++|...++.+. ..+..+|+||.|||| T Consensus 158 ~-~~~~~~~~~~~~~--~-~~~~~~~~~y~~~~~~~~~~~~~~~~--------------------~~~~~~h~~g~vPvv 213 (486) T protein:vir:42 158 I-NRVSKAIRVAYDK--E-GNEIQAATLYTPMETIGWFRADGEWA--------------------EWFNVPHGLGVVPVV 213 (486) T ss_pred C-CCeEEEEEEEEec--C-CCeEEEEEEEcCCcEEEEEecCCcEE--------------------eecceecCCCCceEE Confidence 5 5788999887532 2 34456789999999888876554322 234568999999999 Q ss_pred EecCCc-----cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh---hh---hhhhcceeeecc Q lcl|NC_010179. 232 EFPKNK-----YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF---MN---DLREYKSIKINN 299 (469) Q Consensus 232 ~~~n~~-----~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~---~~---~~~~~~~~~~~~ 299 (469) +|+|++ .|.|+++ .|++|||+||+++|++++.++++++|+++++|.+.+..... .. .....++. .. 
T Consensus 214 ~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~-~~- 291 (486) T protein:vir:42 214 PLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQTLFDAYLARIL-AF- 291 (486) T ss_pred EeccccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCccccccccccccchhhhhhchhc-cc- Confidence 999985 4789998 59999999999999999999999999999999754322110 00 10111111 11 Q ss_pred cCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-----ccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-----ASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-----~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) .++++++.+++ ....++++++++..|+.++.+|++++..||. +||+||++++.+|.+||+++++.|+.+ T Consensus 292 ----~~~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~ 365 (486) T protein:vir:42 292 ----EDAEGKIQQFS--AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNLMFGGA 365 (486) T ss_pred ----CCCCceEEeec--ccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 12457777664 4456788888899999988888877765542 599999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccC--CCcccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_010179. 
375 INELVRAIMRYLNFSD--ADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDR 448 (469) Q Consensus 375 l~~~~~~i~~~~~~~~--~~~~~i~i~f~~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~ 448 (469) |++++++++++.+..+ .+...++++|+++.|.|.++.+++++|++ |++|+||+++++|+++|+.+|++|+++|+ T Consensus 366 l~~~~~l~~~~~~~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~ 445 (486) T protein:vir:42 366 WEEAMRIAYRIMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEE 445 (486) T ss_pred HHHHHHHHHHHhcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHH Confidence 9999999999887543 35678999999999999999999999974 78999999999999999999999998887 Q ss_pred HHhhhhHhh-----c-----ccCCCCCCCCC Q lcl|NC_010179. 449 EENDPYANQ-----A-----DELNGKGVDDE 469 (469) Q Consensus 449 ~~~~~~~~~-----~-----~~~~~~~~~de 469 (469) .+....... . ++..++...++ T Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 476 (486) T protein:vir:42 446 AAMGLGLLGTMVDADPTVPGSPSPTAPPKPQ 476 (486) T ss_pred HHHHHHHHHHhhcCCCCCCCCCCCCCCCCCC Confidence 654322111 0 11111111111 No 45 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=5.3e-75 Score=427.78 Aligned_cols=430 Identities=13% Similarity=0.068 Sum_probs=322.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |..+ .++|+.++.+|..+++|++++.+||+|+|+|.+.... 
.+....++|+++||+++||++.++|++ T Consensus 1 ~~t~--~~~i~~L~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~----------~~~~~~~~~~~~n~~~~ivd~~~~~l~ 68 (480) T protein:vir:78 1 MTTY--HEHVERLQGLLARDLPNLLEAEAYRNGTRRLKTIGIG----------APPELAYLDVQPGWVATYLRTLSDRLD 68 (480) T ss_pred CCCH--HHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc----------cchhHhhhhhhcchHHHHHHHHHhhhc Confidence 7777 6677888888888999999999999999987543221 122334668999999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEE------cCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWI------DEDNNFRYGIIQPDQITPVYATTLD 153 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~------d~~~~~~i~~~~p~~~~~~~d~~~~ 153 (469) +++++.. ++++.++.++++|+.|.+ +.+.+++++++++|+||++||. |++|++++++++|.+++|+||+... T Consensus 69 ~~g~~~~-~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~~~~~D~~~~ 147 (480) T protein:vir:78 69 IEGFRIS-EDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYMYAELDPRNT 147 (480) T ss_pred cCceecC-CCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccceEEEEcCCCc Confidence 8887644 566778899999987755 5678999999999999999996 4678999999999999999999888 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) +++.+++++|...+..+ ....+++|+++.+++|...+..... 
.....+..+|+||+||||+| T Consensus 148 ~~~~~~i~~~~~~~~~~--~~~~~~~y~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~g~vPvv~f 209 (480) T protein:vir:78 148 RRVTRAVRLYTTRDDVA--VPDRATLYLPDETVPLRRNGGLNDQ----------------WVVDGDVIKHGLGVVPVVPL 209 (480) T ss_pred cceEEEEEEEEeecCCC--ceEEEEEEeCCeEEEEEecCCCccc----------------cccccccccCCCCCcceEEe Confidence 99999999997655443 3467789999999888765443211 01123456899999999999 Q ss_pred cCCc-----cccccHHH-HHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhh--hhh-hcceeeecccCCCC Q lcl|NC_010179. 234 PKNK-----YRLAELNK-YKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMN--DLR-EYKSIKINNAGNGD 304 (469) Q Consensus 234 ~n~~-----~g~~~~~~-v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~--~~~-~~~~~~~~~~~~~~ 304 (469) +|++ .|.|+++. |++|+|+||+++|++++.+++|++|+++++|.+.+....... .+. ..+.+.. .. T Consensus 210 ~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 284 (480) T protein:vir:78 210 TNDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENTTLDIYYGRILT-----LA 284 (480) T ss_pred ecccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccchhhhhhhhhcc-----CC Confidence 9974 58899985 999999999999999999999999999999975433221111 011 1111111 12 Q ss_pred CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC----C-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 305 KSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS----N-ASGVAIKMLYSHLELKAAKTQTYFEHAINELV 379 (469) Q Consensus 305 ~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g----~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 379 (469) ++++++++++.. 
..+.+++++...|+.++.+|++++..|| | +||+||++++.+|..||+++++.|+.+|++++ T Consensus 285 ~~~~~~~~~~~~--~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~ 362 (480) T protein:vir:78 285 SEAAKISEFKAA--ELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAERKGRIFGGAWERAM 362 (480) T ss_pred CCCceEEecCcc--CHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 345788887652 2344455555555555554444443332 2 69999999999999999999999999999999 Q ss_pred HHHHHHhcccC-CCcccceEEeCCCCCCCHHHHHHHHHHH----hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhh Q lcl|NC_010179. 380 RAIMRYLNFSD-ADKRHISQHWTRTKVEDSLTKAQIVSTV----ANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPY 454 (469) Q Consensus 380 ~~i~~~~~~~~-~~~~~i~i~f~~~~p~d~~e~~~~~~kl----~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~ 454 (469) ++|+.+.+... .++..+++.|+++.+.|.++.+++++|+ .|++|+||+++++|+++|+.+|++++++|+.+.. . T Consensus 363 ~l~~~~~g~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d~~~~~~~~~~e~~~~~-~ 441 (480) T protein:vir:78 363 RIAMQIMGREVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTATQREQMRDWDKQETEDM-I 441 (480) T ss_pred HHHHHHcCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCHhHHHHHHHHHHHHHHHH-H Confidence 99999987543 4567899999999999999999999986 2479999999999999998888888777765432 1 Q ss_pred Hhhcc---cC------CCCC-CCCC Q lcl|NC_010179. 455 ANQAD---EL------NGKG-VDDE 469 (469) Q Consensus 455 ~~~~~---~~------~~~~-~~de 469 (469) ..... .. ++.+ ...| T Consensus 442 ~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (480) T protein:vir:78 442 DTLYSTTKAQADATPKPTVTETKTE 466 (480) T ss_pred HHhhccccccCCCCCCCCCCCCCCc Confidence 11110 00 1011 1011 No 46 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=2e-74 Score=424.67 Aligned_cols=427 Identities=14% Similarity=0.086 Sum_probs=320.3 Q ss_pred CCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 
1 MELDAL-KKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) .+-.+. ...+..++.+|..+++|++++.+||+|+|+|...... .+...+++|+++||+++||++.++|| T Consensus 8 ~~~~~~~~~~~~~L~~~~~~~~~r~~~~~~YY~G~~~i~~~~~~----------~~~~~~~~~~~~n~~~~ivd~~~~~l 77 (485) T protein:vir:24 8 QEEIADPAIARDEMVSAFEDQNQNLRSNTSYYEAERRPEAIGVT----------VPVQMQSLLAHVGYPRLYVDSIAERQ 77 (485) T ss_pred CCcccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCchhhcCcc----------cchhhhhhhhccchHHHHHHHHhhhh Confidence 211122 3445667788888899999999999999987543321 12223567889999999999999999 Q ss_pred hcCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCC--------CceEEEEEccceeEEEEeC Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDED--------NNFRYGIIQPDQITPVYAT 150 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~--------~~~~i~~~~p~~~~~~~d~ 150 (469) ++++++.. +++..++.++++|+.|.++ .+.+++++++++|+||++||.+++ +.++|++++|.+++|+||+ T Consensus 78 ~~~g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~i~D~ 156 (485) T protein:vir:24 78 AVEGFRLG-DADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLIRVEPPTRMYAEIDP 156 (485) T ss_pred ccCceecC-CCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceEEEeccceeEEEeeC Confidence 99887644 4566778899999887654 578999999999999999999875 4568999999999999998 Q ss_pred CCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_010179. 151 TLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF 230 (469) Q Consensus 151 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 230 (469) .. .++.+++++|... 
.......+++|+++.+++|...++.+ ......+|+||.||| T Consensus 157 ~~-~~~~~~~~~~~~~---~~~~~~~~~~y~~~~~~~~~~~~~~~--------------------~~~~~~~h~~g~vPv 212 (485) T protein:vir:24 157 RI-GRPAKAIRVAYDA---EGNEIQAATLYTPNETFGWFRAEGEW--------------------VEWFSDPHGLGAVPV 212 (485) T ss_pred Cc-CceeEEEEEEEee---cCCeEEEEEEEcCCcEEEEEecCCce--------------------EeecccccCCCcccE Confidence 76 4567776665432 22344567889999888876554332 223456899999999 Q ss_pred EEecCCc-----cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch---hhhhhh-h-hcceeeecc Q lcl|NC_010179. 231 IEFPKNK-----YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK---QFMNDL-R-EYKSIKINN 299 (469) Q Consensus 231 v~~~n~~-----~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~---~~~~~~-~-~~~~~~~~~ 299 (469) |+|+|++ .|.|+++ .|++|||+||+++|++++.+++|++|+++++|.+..... +....+ . ..+.+...+ T Consensus 213 v~f~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 292 (485) T protein:vir:24 213 VPLPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARILAFE 292 (485) T ss_pred EEeccCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccccccchhhhcccceeccC Confidence 9999975 5888987 699999999999999999999999999999997543221 111111 1 111121111 Q ss_pred cCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC-----CccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
300 AGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS-----NASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-----~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) ++++++.+++ ....+.++++|...|+.++.+|++++..|| ++||+||++++.+|.+||+++++.|+++ T Consensus 293 -----~~~~~~~q~~--~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~~~~~f~~~ 365 (485) T protein:vir:24 293 -----DAEGKIQQFS--AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNAIFGGA 365 (485) T ss_pred -----CCCceEEeec--ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2456776654 456778889999999999888888776654 2699999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccC--CCcccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_010179. 375 INELVRAIMRYLNFSD--ADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDR 448 (469) Q Consensus 375 l~~~~~~i~~~~~~~~--~~~~~i~i~f~~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~ 448 (469) |++++++++.+.+..+ .+...++++|+++.|.|.++.+++++|+. |++|+||+++++|+++|+.+|++++++|+ T Consensus 366 l~~~~~l~~~~~~~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~~d~~~e~~~~~ee~ 445 (485) T protein:vir:24 366 WEEAMRLAYRLMKGGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYSIAEREEMRRWDEEE 445 (485) T ss_pred HHHHHHHHHHHhcCCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCCHhHHHHHHHHHHHH Confidence 9999999999876543 45678999999999999999999999973 47999999999999999888999998887 Q ss_pred HHhhhhH-h----hcccCCC-----CCCCCC Q lcl|NC_010179. 449 EENDPYA-N----QADELNG-----KGVDDE 469 (469) Q Consensus 449 ~~~~~~~-~----~~~~~~~-----~~~~de 469 (469) .+..... . 
.....++ +..+++ T Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 476 (485) T protein:vir:24 446 AAMGLGLLGTMVDADPTVPGSPNPTPAPKPQ 476 (485) T ss_pred hhhhhhHHHhhcccCCCCCCCCCCCCCCCCc Confidence 5432211 1 1111111 111111 No 47 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=2.2e-74 Score=424.41 Aligned_cols=426 Identities=14% Similarity=0.099 Sum_probs=319.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) .+.+....+++.++.+|..+++|++++.+||+|+|+|.+..... +...+++++++||+++||++.++||+ T Consensus 9 ~~~~~~~~~~~~l~~~~~~~~~r~~~~~~Yy~G~~~i~~~~~~~----------~~~~~~~~~~~n~~~~ivd~~~~~l~ 78 (485) T protein:vir:10 9 EEIEDPAIARDEMVSAFEDSTQNLKTNTSYYEAERRPEAIGVTV----------PIQMQSLLAHVGYPRLYVDSIAERQA 78 (485) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCCCC----------ChhhhhhhhhcCcHHHHHHHHHhhhc Confidence 45667778889999999999999999999999999876533221 12223557788999999999999998 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcC--------CCceEEEEEccceeEEEEeCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDE--------DNNFRYGIIQPDQITPVYATT 151 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~--------~~~~~i~~~~p~~~~~~~d~~ 151 (469) +++++. .++++.++.++++|+.|.+ ....++++.++++|+||++||.++ ++.++|++++|.+++++||+. 
T Consensus 79 ~~g~~~-~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~p~~~~~~~D~~ 157 (485) T protein:vir:10 79 VEGFRF-GDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEPPTRMYAEIDPR 157 (485) T ss_pred ccceec-CCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEccceeEEEEcCC Confidence 777654 3556677889999988765 467799999999999999999985 356789999999999999986 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) .. ++.+++++|... + .....++++|+++.+++|.....++ ...+..+|+||+|||| T Consensus 158 ~~-~~~~~~~~~~~~--~-~~~~~~~~~y~~~~~~~~~~~~~~~--------------------~~~~~~~~~~g~vPvv 213 (485) T protein:vir:10 158 IG-RVSKAIRVAYDA--E-GNEIQAATLYTPNDIFGWYRVENEW--------------------QEWFNNPHGLGVVPVV 213 (485) T ss_pred CC-ceeEEEEEEEee--C-CCeEEEEEEEeCCeEEEEEEcCCce--------------------EEeccccCCCCcccEE Confidence 64 466666655422 2 2335577899999998887654432 2234568999999999 Q ss_pred EecCCc-----cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch---hhhh---hhhhcceeeecc Q lcl|NC_010179. 232 EFPKNK-----YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK---QFMN---DLREYKSIKINN 299 (469) Q Consensus 232 ~~~n~~-----~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~---~~~~---~~~~~~~~~~~~ 299 (469) +|+|++ .|.|+++ +|++|||+||+++|++++.+++|++|+++++|...+... .... .....++. .. 
T Consensus 214 ~~~n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~i~-~~- 291 (485) T protein:vir:10 214 PIPNRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQTLFDAYLARIL-AF- 291 (485) T ss_pred EeccccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccchhhhhccccee-cc- Confidence 999985 4788997 599999999999999999999999999999997543321 1111 11111111 11 Q ss_pred cCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC-----CccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS-----NASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-----~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) .++++++.+++. ...+.++++|...|+.++.+|++++..|| ++||+||++++.+|.+||+++++.|+.+ T Consensus 292 ----~~~d~k~~q~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~~k~~~f~~~ 365 (485) T protein:vir:10 292 ----EDAEGKIQQFSA--AELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESRLIKKVERKNSIFGGA 365 (485) T ss_pred ----CCCCceEEeecc--cchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 124577877654 34566777777777777666665554432 3699999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccC--CCcccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_010179. 375 INELVRAIMRYLNFSD--ADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDR 448 (469) Q Consensus 375 l~~~~~~i~~~~~~~~--~~~~~i~i~f~~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~ 448 (469) |++++++++.+.+..+ .+...+++.|+++.|+|.++.+++++||. 
|++|+||+++++|++++..+|++++.+|+ T Consensus 366 l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~~~~~~~~~~ee~ 445 (485) T protein:vir:10 366 WEEAMRLAYRMMKGGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIAEREEMRRWDEEE 445 (485) T ss_pred HHHHHHHHHHHhCCCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHhHHHHHHHHHHHH Confidence 9999999999876543 35678999999999999999999999983 48999999999999988888999988776 Q ss_pred HHhhhhHh--h---cccCCCCCCCCC Q lcl|NC_010179. 449 EENDPYAN--Q---ADELNGKGVDDE 469 (469) Q Consensus 449 ~~~~~~~~--~---~~~~~~~~~~de 469 (469) .+...... . ....++.+..++ T Consensus 446 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (485) T protein:vir:10 446 AAMGLGLIGTMVDPNPTVPGSPSPAP 471 (485) T ss_pred HHHHHHHHHHhhccCCCCCCCCCccc Confidence 54322111 1 111111211211 No 48 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=3.9e-74 Score=423.04 Aligned_cols=444 Identities=11% Similarity=0.036 Sum_probs=327.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-..-.+++++++.+|..+++|++++.+||+|+|+|.+...... ......++|+++|||++||++.++|++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~--------~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTS--------AAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccC--------hhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 888887888899999999999999999999999998865433221 222234678999999999999999999 Q ss_pred cCCeeeccCc-hhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 
81 SVFPDIDVGK-DADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 81 g~p~~~~~~~-~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) |+|+++.+++ .+..+.++++|+.|.++ ...+++++++++|++|++||.+++|.+++++++|.+++++||+....++.+ T Consensus 73 ~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~ 152 (456) T protein:vir:10 73 PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRA 152 (456) T ss_pred cCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEE Confidence 9999997654 45567799999887655 567899999999999999999999999999999999999999999899999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) ++++|...+.. . ...+.++++....++.......... . ... ......+...+..+|++|.|||++| ||+. T Consensus 153 ~i~~~~~~d~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~---~--~~~-~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~ 222 (456) T protein:vir:10 153 AMRWWRDLDAE--S-DFAIVWSGDGWQKFARPCFVQSSSR---R--RLV-TRISDSWVPVGDAVVTGSPPPVVVY-QNPD 222 (456) T ss_pred EEEEEEecCCc--e-eEEEEEeccceeEEEEEEEEeeccc---c--eee-eecCCceeeccccCCCCCceeEEEe-cCCC Confidence 99999754322 2 2233444444444443211100000 0 000 0111222333556899999999887 5688 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc--hhhhhhhhhcceeeecccC-CCCCCcceEEeec- Q lcl|NC_010179. 239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL--KQFMNDLREYKSIKINNAG-NGDKSGVDKLQID- 314 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~- 314 (469) |.|+|+++++|||+||.++|++++..+++++|++++.|...... ++.+..+..........+. 
-...+++++.+.+ T Consensus 223 g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 302 (456) T protein:vir:10 223 GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQA 302 (456) T ss_pred CCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEecc Confidence 99999999999999999999999999999999999999643221 1111111110001110000 0011233444433 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCcCccc-cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCc Q lcl|NC_010179. 315 IPVEARDDALKITRDNIFLFGQGIDPANFE-SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADK 393 (469) Q Consensus 315 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~ 393 (469) .+.+.+.+.++.+..+|+..+++|+..+.+ .+|+||+||++++.+|.+||+.+++.|+++|++++++++.+.+.. +. T Consensus 303 ~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~--~~ 380 (456) T protein:vir:10 303 NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--VE 380 (456) T ss_pred cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--cc Confidence 467888888999999999999999877766 468999999999999999999999999999999999999876643 44 Q ss_pred ccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhh-HhhcccCCCC Q lcl|NC_010179. 394 RHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDD--WQQELKDLAKDREENDPY-ANQADELNGK 464 (469) Q Consensus 394 ~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~-~~~~~~~~~~ 464 (469) ..++++|+++.|+|.++.+|+++|+ +|++|.+++.+++|++++ .++|++|+++|....... .+..++.... 
T Consensus 381 ~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 381 DTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred cceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 5789999999999999999999998 478999999999998755 346788888776543221 1122222222 No 49 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=3.9e-74 Score=423.04 Aligned_cols=444 Identities=11% Similarity=0.036 Sum_probs=327.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-..-.+++++++.+|..+++|++++.+||+|+|+|.+...... ......++|+++|||++||++.++|++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~--------~~~~~~~~k~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:10 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTS--------AAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCchhcCcccC--------hhhhhhhhhhhcchHHHHHHHHHhhhc Confidence 888887888899999999999999999999999998865433221 222234678999999999999999999 Q ss_pred cCCeeeccCc-hhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 
81 SVFPDIDVGK-DADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 81 g~p~~~~~~~-~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) |+|+++.+++ .+..+.++++|+.|.++ ...+++++++++|++|++||.+++|.+++++++|.+++++||+....++.+ T Consensus 73 ~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~ 152 (456) T protein:vir:10 73 PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRA 152 (456) T ss_pred cCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeEEEEcCCCCcceEE Confidence 9999997654 45567799999887655 567899999999999999999999999999999999999999999899999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) ++++|...+.. . ...+.++++....++.......... . ... ......+...+..+|++|.|||++| ||+. T Consensus 153 ~i~~~~~~d~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~---~--~~~-~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~ 222 (456) T protein:vir:10 153 AMRWWRDLDAE--S-DFAIVWSGDGWQKFARPCFVQSSSR---R--RLV-TRISDSWVPVGDAVVTGSPPPVVVY-QNPD 222 (456) T ss_pred EEEEEEecCCc--e-eEEEEEeccceeEEEEEEEEeeccc---c--eee-eecCCceeeccccCCCCCceeEEEe-cCCC Confidence 99999754322 2 2233444444444443211100000 0 000 0111222333556899999999887 5688 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc--hhhhhhhhhcceeeecccC-CCCCCcceEEeec- Q lcl|NC_010179. 239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL--KQFMNDLREYKSIKINNAG-NGDKSGVDKLQID- 314 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~- 314 (469) |.|+|+++++|||+||.++|++++..+++++|++++.|...... ++.+..+..........+. 
-...+++++.+.+ T Consensus 223 g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~ 302 (456) T protein:vir:10 223 GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASIFEAAPGALWELPPGVDIWESQA 302 (456) T ss_pred CCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhhhhhhccccccCCCCcceEEecc Confidence 99999999999999999999999999999999999999643221 1111111110001110000 0011233444433 Q ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCcCccc-cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCc Q lcl|NC_010179. 315 IPVEARDDALKITRDNIFLFGQGIDPANFE-SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADK 393 (469) Q Consensus 315 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~ 393 (469) .+.+.+.+.++.+..+|+..+++|+..+.+ .+|+||+||++++.+|.+||+.+++.|+++|++++++++.+.+.. +. T Consensus 303 ~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~N~Sg~Ai~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~--~~ 380 (456) T protein:vir:10 303 NDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--VE 380 (456) T ss_pred cChhHHHHHHHHHHHHHHhccCCChHHhcccccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--cc Confidence 467888888999999999999999877766 468999999999999999999999999999999999999876643 44 Q ss_pred ccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhh-HhhcccCCCC Q lcl|NC_010179. 394 RHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDD--WQQELKDLAKDREENDPY-ANQADELNGK 464 (469) Q Consensus 394 ~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~-~~~~~~~~~~ 464 (469) ..++++|+++.|+|.++.+|+++|+ +|++|.+++.+++|++++ .++|++|+++|....... .+..++.... 
T Consensus 381 ~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:10 381 DTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred cceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhhcCCCCCCC Confidence 5789999999999999999999998 478999999999998755 346788888776543221 1122222222 No 50 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=4.4e-73 Score=417.24 Aligned_cols=421 Identities=13% Similarity=0.049 Sum_probs=313.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-++ .++|+.++.+|..+++|++++.+||+|+|+|...... .+...+++|+++|||++||++.++|+. T Consensus 1 ~~~~~-~~~i~~l~~~~~~~~~r~~~l~~Yy~G~~~i~~~~~~----------~~~~~~~~k~~~n~~~~ivd~~~~~l~ 69 (441) T protein:vir:80 1 MNSDE-LALIEGMYDRIQRLSSWHCCIEGYYEGSNRVRDLGVA----------IPPELQRVQTVVSWPGIAVDALEERLD 69 (441) T ss_pred CCccH-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCcchhcCcc----------cchhhhhhhhhcchHHHHHHHHHhhhc Confidence 76654 3456888888888999999999999999987543321 122345678999999999999999996 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) +++ |+++++ +.++++|+.|.+ +.+.+++++++++|+||++||.|++|++++++++|.+++|+||+.......++ T Consensus 70 ~~g--~~~~d~---~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~~~~~~~~~ 144 (441) T protein:vir:80 70 WLG--WTNGDG---YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSADGSRLDAGL 144 (441) T ss_pred ccc--ccCCCh---HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCCCCceeEEE Confidence 554 555543 458888887655 56789999999999999999999999999999999999999998776555555 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc-- Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK-- 237 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-- 237 (469) ++++... + . ...+++|+++.+++|...+.+ .+...+..+|+||+||||+|.|++ T Consensus 145 ~~~~~~~--~-~--~~~~~vy~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~g~vPvv~~~n~~~~ 200 (441) T protein:vir:80 145 VVQQTCD--P-E--VVEAELLLPDVIVQVERRGSR-------------------EWVEVDRIPNVLGAVPLVPIVNRRRT 200 (441) T ss_pred EEEEEec--C-c--eEEEEEEecCeEEEEEEcCCc-------------------ceeeccccccCCCceeEEEeeccccC Confidence 5555422 1 1 235677888887776544322 123345678999999999999986 Q ss_pred ---cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEee Q lcl|NC_010179. 238 ---YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQI 313 (469) Q Consensus 238 ---~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 313 (469) .|.|++. 
+|++|||+||.++|++++.+++|++|+++++|...+............+++.++.+++++ .+++.++ T Consensus 201 ~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~ 278 (441) T protein:vir:80 201 SRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQPGWVLSMASVWAVDKDDDGD--TPNVGSF 278 (441) T ss_pred CccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCccccccchhhhcccccccCCCCCCCC--cceeEec Confidence 3788885 699999999999999999999999999999998665544444445556666666554433 4566554 Q ss_pred cCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC----C-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 314 DIPVEARDDALKITRDNIFLFGQGIDPANFESS----N-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 314 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g----~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) +. ...+.++++|...|+.++.+|++++..+| | .||+||++++.+|.+||+++++.|+++|++++++++.+++. T Consensus 279 ~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~ 356 (441) T protein:vir:80 279 PV--NSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDS 356 (441) T ss_pred Cc--cchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 43 34556666666666666666555443332 2 59999999999999999999999999999999999999886 Q ss_pred cCC---CcccceEEeCCCCCCCHHHHHHHHHHHh--cc--CChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccC Q lcl|NC_010179. 389 SDA---DKRHISQHWTRTKVEDSLTKAQIVSTVA--NY--SSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQADEL 461 (469) Q Consensus 389 ~~~---~~~~i~i~f~~~~p~d~~e~~~~~~kl~--g~--iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~ 461 (469) ... ....++++|+++.|.|.++.+++++|+. |+ +|++|+++.+|++++ |++|+++|+++......+ ... 
T Consensus 357 ~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~~~---e~~~~~~e~~e~~~~~~~-~~~ 432 (441) T protein:vir:80 357 RVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLDDV---QVEAVMRHRAESSDPLAV-LAG 432 (441) T ss_pred CCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCCHH---HHHHHHHHHHHHHHHHHH-Hhh Confidence 542 3468899999999999999999999983 43 689999999998765 455555554444332211 111 Q ss_pred CCCCCCCC Q lcl|NC_010179. 462 NGKGVDDE 469 (469) Q Consensus 462 ~~~~~~de 469 (469) ..+...|| T Consensus 433 ~~~~~~~~ 440 (441) T protein:vir:80 433 AISRQTNE 440 (441) T ss_pred hhhccccc Confidence 23344444 No 51 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=5.2e-73 Score=416.87 Aligned_cols=426 Identities=12% Similarity=0.072 Sum_probs=310.2 Q ss_pred CCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLI-RNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |+.+++.+.| .+++.+|..+++|++++.+||+|+|+|+........ ...++..+++++|||++||++.++++ T Consensus 9 l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~-------~~~~~~~~~~~~n~~~~iVd~~~~~l 81 (479) T protein:vir:99 9 LSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKN-------KEREVLQQLSRKPWMGLMVNSFAQQL 81 (479) T ss_pred CChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCC-------hhHHHHHHHhhcCcHHHHHHHHHhhc Confidence 9999998866 589999999999999999999999998765432111 11122334567899999999999999 Q ss_pred hcCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEE-----cCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010179. 
80 ASVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWI-----DEDNNFRYGIIQPDQITPVYATTLD 153 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~-----d~~~~~~i~~~~p~~~~~~~d~~~~ 153 (469) +.++ |++++++.++.++++|+.|.++ ...+++++++++|++|++||. |++|.+++++++|.+++++|++... T Consensus 82 ~~~g--f~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~iydd~~~ 159 (479) T protein:vir:99 82 IVDG--YRKTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPRDAFAIWEDPYW 159 (479) T ss_pred cccc--ccCCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechhheEEEecCCcc Confidence 7554 5667777788899999887554 567899999999999999995 6678899999999999999987654 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) ... +++.+. .+ +. ....+|+...+++|...++. +...+..+|+||+||||+| T Consensus 160 ~~~--~~~~~~-~~--~~---~~~~~~~~~~~~~~~~~~~~--------------------~~~~~~~~h~~g~vPvv~f 211 (479) T protein:vir:99 160 DEW--PKYLLE-RQ--PN---GQYWWWTEEDYSIFEFKQGK--------------------FIYRETVSHDYGHIPFVRY 211 (479) T ss_pred cce--eeEEEe-ec--Cc---eeEEEEecceEEEEEecCCc--------------------eeeccccccCCCCcceEEe Confidence 332 222211 11 11 12356777766666544332 2233567899999999999 Q ss_pred cCC----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchh---hhhhhhhcceeeecccCCCCCC Q lcl|NC_010179. 234 PKN----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQ---FMNDLREYKSIKINNAGNGDKS 306 (469) Q Consensus 234 ~n~----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~ 306 (469) .|+ +.|.|+|+++++|||+||+++|++++.+++|++|++++.|........ ....+...+++... 
++ T Consensus 212 ~n~~~~~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~i~~~~------~~ 285 (479) T protein:vir:99 212 VNVMDLRGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLMLPEGANADQEKMRFAQESMLISQ------NE 285 (479) T ss_pred ecCCCcCcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCCcccccccchhccccccccceeec------CC Confidence 998 569999999999999999999999999999999999999975433221 12223334444322 23 Q ss_pred cceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 307 GVDKLQIDI-PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRY 385 (469) Q Consensus 307 ~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~ 385 (469) ++++.+.+. +.+++.+.++.+..+|+..+++|+..+...||+||+||++++.+|.+||+.+++.|+.+|++++++++.+ T Consensus 286 ~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~ 365 (479) T protein:vir:99 286 KASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQIVNVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKI 365 (479) T ss_pred CceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 567776653 4455555555555566666666665554457899999999999999999999999999999999999998 Q ss_pred hcccC-CCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHH-HHHHHHHHHHhhhhHhhcc-- Q lcl|NC_010179. 386 LNFSD-ADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQE-LKDLAKDREENDPYANQAD-- 459 (469) Q Consensus 386 ~~~~~-~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E-~eri~~E~~~~~~~~~~~~-- 459 (469) .+... .+...++++|.++.+.|.++.+++++|| +|++|+||+++++|++++++.| +++.++++.+......... 
T Consensus 366 ~~~~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 445 (479) T protein:vir:99 366 EGRTEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQNG 445 (479) T ss_pred cCCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 87654 3456799999999999999999999997 5799999999999999875422 3333333322221111110 Q ss_pred ------cCCCCCCCCC Q lcl|NC_010179. 460 ------ELNGKGVDDE 469 (469) Q Consensus 460 ------~~~~~~~~de 469 (469) ....++.+++ T Consensus 446 ~~~~~~~~~~~~~~~~ 461 (479) T protein:vir:99 446 PDPAEQRGGPNGATNM 461 (479) T ss_pred cCcccccCCCCCCCCC Confidence 0011111111 No 52 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=2.2e-72 Score=413.41 Aligned_cols=440 Identities=11% Similarity=0.039 Sum_probs=322.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-..-.+++++++.+|..+++|++++.+||+|+|+|.+...... ...+..++++++||+++||++.++|++ T Consensus 1 ~~~~t~~~~~~~l~~~~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~--------~~~~~~~~~~~~n~~~~ivd~~~~~l~ 72 (456) T protein:vir:79 1 MTASTPAEWLPVLTKRIDDGMSRVRLLARYSNGDAPLPELTRNTS--------AAWRSFQREARTNWGLMVRDSVADRII 72 (456) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCChhhcCcccC--------hhhchhhhhhhcchHHHHHHHHHhhhc Confidence 777777778889999999999999999999999998865432211 122233457889999999999999999 Q ss_pred cCCeeeccCch-hhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 
81 SVFPDIDVGKD-ADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 81 g~p~~~~~~~~-~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) |+|+++.++++ +..+.++++|++|+++ ...+++++++++|+||+++|.+++|.+++++++|.+++|+||+....++.+ T Consensus 73 ~~g~~~~~~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~d~~~~~~~~~ 152 (456) T protein:vir:79 73 PNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSVDPLQPWRIRS 152 (456) T ss_pred cCCeecCCCCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEEcCCCCCceEE Confidence 99999886554 5567899999988665 467899999999999999999999999999999999999999999889999 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) ++++|...+. . . ....+|.+..+.++............ .. .......+......+|++|+||||+| +|+. T Consensus 153 ~~~~~~~~d~--~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~pvv~~-~N~~ 222 (456) T protein:vir:79 153 AMRWWRDLDA--E-S-DFAIVWSGDGWQKFARPCFVQSSSRR----RL-VTRISDSWVPVGDAVVTGSPPPVVVY-QNPD 222 (456) T ss_pred EEEEEEecCC--c-e-eEEEEEcCCceEEEEEEEEeeccccc----ee-eeccCCceeecccccCCCCceeEEEe-cCCC Confidence 9999864432 2 2 23344554444443222110000000 00 01111122334456899999999998 5688 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc--hhhhhhhhh-----cceeeecccCCCCCCcceEE Q lcl|NC_010179. 239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL--KQFMNDLRE-----YKSIKINNAGNGDKSGVDKL 311 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~--~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~l 311 (469) |.|+|+++++|||+||.++|++++.++++++|++++.|...... ++.+..+.. ...-.+... .+++++. 
T Consensus 223 ~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~~~~~~~~~~~----~~~~~~~ 298 (456) T protein:vir:79 223 GMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIFEAAPGALWEL----PPGVDIW 298 (456) T ss_pred CCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhhhhhccccccC----CCCccee Confidence 99999999999999999999999999999999999999643221 111111100 000011111 1223333 Q ss_pred ee-cCCHHHHHHHHHHHHHHHHHHhCCCCcCccc-cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010179. 312 QI-DIPVEARDDALKITRDNIFLFGQGIDPANFE-SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS 389 (469) Q Consensus 312 ~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~ 389 (469) +. +.+.+.+...++.+.++|+..+++|+..+.+ .+|+||+||++++.+|.+||+.+++.|+++|++++++++.+.+.. T Consensus 299 q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~N~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~ 378 (456) T protein:vir:79 299 ESQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSANQSAEGAHNIEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES 378 (456) T ss_pred eecccChHHHHHHHHHHHHHHHhhcCCChhHhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 33 3466788888888888888888888877765 468999999999999999999999999999999999999887643 Q ss_pred CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHh-hcccCCCC Q lcl|NC_010179. 
390 DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYAN-QADELNGK 464 (469) Q Consensus 390 ~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~-~~~~~~~~ 464 (469) +...++++|+++.|.|.++.+|+++|+ +|++|.+++++.++++++ .++|++|+++|........- ..++...- T Consensus 379 --~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 379 --VEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRRNILNYNADQIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred --ccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 456799999999999999999999997 578999999999988654 45678888877654422211 11111111 No 53 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=3.8e-72 Score=412.11 Aligned_cols=425 Identities=12% Similarity=0.082 Sum_probs=315.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++- .++|++++.+|..+.+|++++.+||+|+|+|.+..... +...+++|+++|||++||++.+++|+ T Consensus 7 ~d~---~~~i~~L~~~~~~~~~r~~~~~~Yy~g~~~i~~~~~~~----------~~~~~~~~~~~n~~~~ivd~~a~~l~ 73 (488) T protein:vir:23 7 IDP---EKLRDQLLDAFENKQNELKSSKAYYDAERRPDAIGLAV----------PLDMRKYLAHVGYPRTYVDAIAERQE 73 (488) T ss_pred CCH---HHHHHHHHHHHHHHHHHHHHHHHHHhcccchhhcCccc----------chhhhhhhhhcchHHHHHHHHHHhhh Confidence 553 35788888899999999999999999999886544321 22234678999999999999997654 Q ss_pred ------cCCeee---ccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEc--------CCCceEEEEEccc Q lcl|NC_010179. 
81 ------SVFPDI---DVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWID--------EDNNFRYGIIQPD 142 (469) Q Consensus 81 ------g~p~~~---~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d--------~~~~~~i~~~~p~ 142 (469) |.|..+ .+++++..+.++++|+.|.+ +...+++++++++|+||++||.+ +++.++|++++|. T Consensus 74 ~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~ 153 (488) T protein:vir:23 74 LEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVPLIRVEPPT 153 (488) T ss_pred ccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcceEEEeccc Confidence 444333 23566778889999988755 56789999999999999999864 4567889999999 Q ss_pred eeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccc Q lcl|NC_010179. 143 QITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (469) Q Consensus 143 ~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (469) +++|+||+.. +++.+++++|... ++.. ..++++|+++.+++|...++++ ...+..+ T Consensus 154 ~~~~~~d~~~-~~~~~~~~~~~~~--~~~~-~~~~~~y~~~~~~~~~~~~~~~--------------------~~~~~~~ 209 (488) T protein:vir:23 154 ALYAEVDPRT-RKVLYAIRAIYGA--DGNE-IVSATLYLPDTTMTWLRAEGEW--------------------EAPTSTP 209 (488) T ss_pred eeEEEEecCC-CceEEEEEEEEec--CCCc-EEEEEEEecCcEEEEEecCCce--------------------Eeccccc Confidence 9999999765 5678888877543 2333 4567889998888877654432 2234568 Q ss_pred ccCCcccEEEecCCc-----cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch---hhhhhh---h Q lcl|NC_010179. 223 HNFGRVPFIEFPKNK-----YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK---QFMNDL---R 290 (469) Q Consensus 223 ~~~g~vPvv~~~n~~-----~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~---~~~~~~---~ 290 (469) |+||+||||+|.|++ +|+|+++ .|++|+|+||+++|++++.++++++|+++++|.+..... .....+ . 
T Consensus 210 h~~g~vPvv~f~n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~ 289 (488) T protein:vir:23 210 HGLEMVPVIPISNRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQRMFDAY 289 (488) T ss_pred cCCCCcceEEeccccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccchhhhhh Confidence 999999999999975 5889997 689999999999999999999999999999997543321 111111 1 Q ss_pred hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC-----CccHHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS-----NASGVAIKMLYSHLELKAA 365 (469) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-----~~Sg~Al~~~~~~l~~k~~ 365 (469) ..++..+. ++.++++.+++. ...++++++|...|+.++.+|++++..|| ++||+||++++++|.+||+ T Consensus 290 ~~~v~~~~-----~g~~~~~~q~~~--~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al~~~~~~l~~k~~ 362 (488) T protein:vir:23 290 MARILAFE-----GGEGAHAEQFSA--AELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAIKAAESRLVKKVE 362 (488) T ss_pred hhhhccCC-----CCCCceeEecCC--CChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHH Confidence 12222222 223567777554 34566666666666666666655544432 2699999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcccC--CCcccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHH Q lcl|NC_010179. 366 KTQTYFEHAINELVRAIMRYLNFSD--ADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQ 439 (469) Q Consensus 366 ~~~~~~~~~l~~~~~~i~~~~~~~~--~~~~~i~i~f~~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~ 439 (469) ++++.|+.+|++++++++.+++..+ .+...++++|+++.|.|.++.+++++|+. 
|++|+||+++++|+++|+.+ T Consensus 363 ~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~ 442 (488) T protein:vir:23 363 RKNKIFGGAWEQAMRLAYKMVKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVERE 442 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHH Confidence 9999999999999999999887654 34578999999999999999999999973 47999999999999999999 Q ss_pred HHHHHHHHHHHhhhhH-----------hhcccCCCCCCCCC Q lcl|NC_010179. 440 ELKDLAKDREENDPYA-----------NQADELNGKGVDDE 469 (469) Q Consensus 440 E~eri~~E~~~~~~~~-----------~~~~~~~~~~~~de 469 (469) |++++++|+.+..... .+.+..+.++.+++ T Consensus 443 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 483 (488) T protein:vir:23 443 QMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGEPPAP 483 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCCCCCC Confidence 9999876654432211 01111122222222 No 54 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=3.2e-72 Score=412.57 Aligned_cols=423 Identities=12% Similarity=0.098 Sum_probs=312.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++-+++.+.+.++ +..+.++++++.+||+|+|+|....... 
+....+.++++||+++||++.+++++ T Consensus 11 ~~~~~~~~~l~~~---~~~~~~rl~~l~~Yy~G~~~i~~~~~~~----------~~~~~~~~~~~n~~~~ivd~~~~~l~ 77 (484) T protein:vir:77 11 VDPEKAREEMLNL---FTERTQDLGDNTAYYESERRPDAVGVTV----------PQQMQKLLAHVGYPRLYIDAIAARQE 77 (484) T ss_pred CCHHHHHHHHHHH---HHHHHHHHHHHHHHHhccccchhccccc----------chhHHhhhhhcCcHHHHHHHHHhhhc Confidence 6655554444444 4445567888999999999875432211 11122446789999999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCc--------eEEEEEccceeEEEEeCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNN--------FRYGIIQPDQITPVYATT 151 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~--------~~i~~~~p~~~~~~~d~~ 151 (469) +++++.. +++..++.++++|++|+++ ...+++++++++|+||++||.+++|. ++|++++|.+++++||+. T Consensus 78 ~~g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~~~~p~~~~~~~D~~ 156 (484) T protein:vir:77 78 LEGFRLG-GADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIRVEPPTNLYAQIDPR 156 (484) T ss_pred cCceecC-CcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEEEeccceeEEEecCC Confidence 8887754 4556778899999987665 57799999999999999999998875 469999999999999986 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) .+++.+++++|...+ +. 
....+++|+.+.+++|...++.+ ...+..+|+||+|||| T Consensus 157 -~~~~~~a~~~~~~~~--~~-~~~~~~~y~~~~~~~~~~~~~~~--------------------~~~~~~~~~~g~vPvv 212 (484) T protein:vir:77 157 -TRQVMRAIRAIEDEE--GN-EVIGATLYLPNNTVIWNREDGQW--------------------VQVANVAHNLEMVPVI 212 (484) T ss_pred -CCceEEEEEEEEeec--CC-cEEEEEEEecCeEEEEEecCCce--------------------EeeccccCCCCCcceE Confidence 467899998876432 22 24556788888887776554332 2234568999999999 Q ss_pred EecCCc-----cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh-h--h-hhh--hcceeeecc Q lcl|NC_010179. 232 EFPKNK-----YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF-M--N-DLR--EYKSIKINN 299 (469) Q Consensus 232 ~~~n~~-----~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~-~--~-~~~--~~~~~~~~~ 299 (469) +|.|++ .|+|+++ .|++|+|+||+++|++++..+++++|+++++|.+.+..... . . .+. ..++..+ T Consensus 213 ~f~N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 290 (484) T protein:vir:77 213 PIPNRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETGQTLFDAYLARILAF-- 290 (484) T ss_pred EeccccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhcccccccchhhhhhhhhhccc-- Confidence 999975 4789997 69999999999999999999999999999999754432111 0 0 011 1111111 Q ss_pred cCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC----C-ccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS----N-ASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g----~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) .++++++.+++. 
.+.+.++++|+..|+.++.+|++++..|| | +||+||++++.+|.+||+++++.|+++ T Consensus 291 ----~~~~~~~~q~~~--~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~ka~~k~~~f~~~ 364 (484) T protein:vir:77 291 ----EDHESKAQQFSA--AELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSSESRLVKTVERKNKIFGGA 364 (484) T ss_pred ----CCCCceeEeecC--CChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 223567766553 34566777777777777766666554443 2 699999999999999999999999999 Q ss_pred HHHHHHHHHHHhcccC--CCcccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_010179. 375 INELVRAIMRYLNFSD--ADKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDR 448 (469) Q Consensus 375 l~~~~~~i~~~~~~~~--~~~~~i~i~f~~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~ 448 (469) |++++++++.+.+..+ .+...++++|+++.|.|.++.+++++|++ |++|++|+++++|+++|+.+|++|+++|+ T Consensus 365 l~~~~~l~~~~~~~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~~~~e~~~~~~ee 444 (484) T protein:vir:77 365 WEQAMRVAYKVMNGGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSITEREEMRKWDEEE 444 (484) T ss_pred HHHHHHHHHHHhCCCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChhHHHHHHHHHHHH Confidence 9999999999877543 35578999999999999999999999973 48999999999999999999999998887 Q ss_pred HHhhhhHh-----h-cccCCCCCCCCC Q lcl|NC_010179. 449 EENDPYAN-----Q-ADELNGKGVDDE 469 (469) Q Consensus 449 ~~~~~~~~-----~-~~~~~~~~~~de 469 (469) .+...... . 
.+..+.++..++ T Consensus 445 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (484) T protein:vir:77 445 QAQGLGLMGTMFGTDPSGGGNPDNPET 471 (484) T ss_pred HHHHHHHHhhhccccccCCCCCCCCCc Confidence 65432111 1 111111221222 No 55 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=9.6e-67 Score=382.51 Aligned_cols=446 Identities=12% Similarity=0.131 Sum_probs=321.1 Q ss_pred CCHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIR-NTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~-~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) -..+.+++++. .-+..+..++.+++++++||+|+|+++.+..... ....+.++|+++||+|.||++.++|| T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~--------~~~~~~~~~~~~n~~k~i~~~~a~~l 88 (496) T protein:vir:38 17 GLLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEH--------NGNPVNRRQLSMNLPKVTAKYMSKLL 88 (496) T ss_pred ccchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhcc--------CCCccccceeecchHHHHHHHHhhhh Confidence 12344444443 2233356677899999999999999876543221 11223356799999999999999999 Q ss_pred hcCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 
80 ASVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) +|+||+++++++..++.|++++++| +.+.+.+++..++++|.+|+++|+|++|++++.+++|.+++|++++..+-...+ T Consensus 89 ~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~~~~~~~~~~ 168 (496) T protein:vir:38 89 FNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSNDSENVDECV 168 (496) T ss_pred hCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccceEEEEecCCcEEEEE Confidence 9999999999999999999999875 778899999999999999999999999999999999999999988754322233 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEE----EeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFF----RTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) +++.|. . +..+++.+++|+.....++ .....+...+ ....+....++........+++.++|+++|+ T Consensus 169 f~~~~~---~-~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~-----g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~ 239 (496) T protein:vir:38 169 IANSFH---K-NNKYYTLLEWNEWQGDVYTVTTELYQSDDPNEL-----GTKVSLTLLFDDIEPVVPLPDFTRPTFIYIK 239 (496) T ss_pred EEEEEE---e-CCeEEEEEEEEEEeCceEEEEEEEEecCCcccc-----CccccccccccccccceeecCCCcceEEEec Confidence 333332 2 3456777788764332221 1111111110 1111111112223333445678899999997 Q ss_pred CC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEec-------CCcccchhhhhhhhhcceeeec Q lcl|NC_010179. 235 KN---------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTN-------YGGASLKQFMNDLREYKSIKIN 298 (469) Q Consensus 235 n~---------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g-------~~~~~~~~~~~~~~~~~~~~~~ 298 (469) ++ +.|.|+|+++++|||+||.++|++++.++.+..++++... ..+.....+........++... 
T Consensus 240 ~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~ 319 (496) T protein:vir:38 240 PNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQYFDSTDEAFFLYQGD 319 (496) T ss_pred CCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccccCCCCccceEEEeecC Confidence 64 4589999999999999999999999999988777776221 1111111222222222222222 Q ss_pred ccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--CccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 299 NAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAIN 376 (469) Q Consensus 299 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 376 (469) . .+.+..++.++.+++.+++.+.++.+.+.|...++.++. ++.+.|..||.+++++++.+.++|+.+++.|+.+|+ T Consensus 320 ~--~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~~~~l~ 397 (496) T protein:vir:38 320 Q--DDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLIEQGIK 397 (496) T ss_pred C--CcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 222346788888999999999999999999988888764 334456789999999999999999999999999999 Q ss_pred HHHHHHHHHhcc------cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCH--HHHHHHHHH Q lcl|NC_010179. 377 ELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDW--QQELKDLAK 446 (469) Q Consensus 377 ~~~~~i~~~~~~------~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~--~~E~eri~~ 446 (469) ++++.++.+... 
...+...++++|++++|.|..+.+++++++ +|++|++|+++.+|+++|+ ++|++|+++ T Consensus 398 ~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d~ea~~el~ri~~ 477 (496) T protein:vir:38 398 EMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITEAEADEWAEMLAK 477 (496) T ss_pred HHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHHHHHHH Confidence 999999865431 234556789999999999999999999987 6999999999999999874 458999999 Q ss_pred HHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 447 DREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 447 E~~~~~~~~~~~~~~~~~~~~de 469 (469) |+++..+ .++.++-++++| T Consensus 478 E~~~~~~----~~d~~~~~~~~e 496 (496) T protein:vir:38 478 EKQAEMP----NNDMNGIFGEEE 496 (496) T ss_pred hhhccCc----cccccCCCCCCC Confidence 9876544 223334444444 No 56 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=1.6e-64 Score=370.32 Aligned_cols=450 Identities=10% Similarity=0.117 Sum_probs=321.0 Q ss_pred CHHHHHHHHHHHHH------------------HHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCccee Q lcl|NC_010179. 2 ELDALKKLIRNTST------------------SRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRI 63 (469) Q Consensus 2 ~~~~~~~~i~~~~~------------------~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri 63 (469) =++.++..|+.++. .+...+.++.++++||+|+|+.+....... ....+.++|+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~--------~~~~~~~~~~ 72 (499) T protein:vir:80 1 MINQIIAGVKGVMRRMGLLKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEH--------NGNPVNRRQL 72 (499) T ss_pred ChhHHHHHHHHHHHHhccccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhcccccc--------CCCcccccee Confidence 23333333333332 245567889999999999997664432211 1223446789 Q ss_pred ccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccc Q lcl|NC_010179. 
64 PSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPD 142 (469) Q Consensus 64 ~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~ 142 (469) ++|+++.||++.|+||||+||+++++++..++.|++++++| +...+.+++..|+++|.+|+++|+|++|++++.+++|. T Consensus 73 s~n~~~~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~ 152 (499) T protein:vir:80 73 SMNLPKVTAKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATAD 152 (499) T ss_pred ecchHHHHHHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCC Confidence 99999999999999999999999999999999999999876 67778999999999999999999999999999999999 Q ss_pred eeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeE--EEEEeecCceeeccccccccccccccccccccccc Q lcl|NC_010179. 143 QITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEA--QFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNT 220 (469) Q Consensus 143 ~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (469) +++|++.++.+-..+++++.+. .+..+++.+|+|..... ..|.....-+............+....+...++.. T Consensus 153 ~~~Pi~~d~~~~~~~~f~~~~~----~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~ 228 (499) T protein:vir:80 153 CMYPLSNDSENVDECLIANSFH----KNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVV 228 (499) T ss_pred ceEEEEecCCCeEEEEEEEEEe----ecCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCce Confidence 9999876543323333333332 23456677777653321 11221111111111111111111111122333444 Q ss_pred ccccCCcccEEEecCC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEe-------cCCcccchh Q lcl|NC_010179. 221 LKHNFGRVPFIEFPKN---------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT-------NYGGASLKQ 284 (469) Q Consensus 221 ~~~~~g~vPvv~~~n~---------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~-------g~~~~~~~~ 284 (469) ..++++++|+++|+++ +.|.|+|+++++|||+||.++|++++.++....+++|.. +.++..... 
T Consensus 229 ~~~~~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~~~ 308 (499) T protein:vir:80 229 PLPSLTRPTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTTQY 308 (499) T ss_pred eecCCCccceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcccC Confidence 4567899999999864 458999999999999999999999999999988888732 222232233 Q ss_pred hhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--CccccCCccHHHHHHHHHHHHH Q lcl|NC_010179. 285 FMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFESSNASGVAIKMLYSHLEL 362 (469) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~Sg~Al~~~~~~l~~ 362 (469) +..+...++.+....++ .+..++.++++++.+++.+.++.+.+.|...++.++. ++.+.|..||.+++++.+.+.+ T Consensus 309 ~~~~~~~~~~~~~~~~~--~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~ 386 (499) T protein:vir:80 309 FDSTDEAFFLYQGEQDD--NGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQ 386 (499) T ss_pred CCcccceeeEeeccCCC--CcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHH Confidence 33333334444332222 2345888889999999999999999999988877753 3344567799999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcc------cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC Q lcl|NC_010179. 363 KAAKTQTYFEHAINELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIV 434 (469) Q Consensus 363 k~~~~~~~~~~~l~~~~~~i~~~~~~------~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v 434 (469) +++.+++.|+.+|+++++.|+.+... 
...+...+++.|++++|.|..+++++++++ +|++|++|+++.++++ T Consensus 387 ~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~ 466 (499) T protein:vir:80 387 TKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNI 466 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCC Confidence 99999999999999999999876432 123456899999999999999999999886 6999999999999998 Q ss_pred CCHH--HHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 435 DDWQ--QELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 435 ~d~~--~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +|++ +|++||++|+++..+. ++..+-.+++| T Consensus 467 ~d~ea~~el~~i~~E~~~~~~~----~d~~g~~ge~e 499 (499) T protein:vir:80 467 TEAEADEWAEMLAKEKQAEIPN----NDMTGIFGEEE 499 (499) T ss_pred ChHHHHHHHHHHHHHhhcCCCC----CCccccCCCCC Confidence 8754 6799999998665442 22233334444 No 57 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=4e-65 Score=373.66 Aligned_cols=429 Identities=13% Similarity=0.012 Sum_probs=315.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-++ ..+|+.|+..+..++++++++.+||+|+|.+.+.....+ ....++++++||+++||++.++++. 
T Consensus 18 l~~~e-~~~i~~L~~~~~~~~~r~~~l~~YY~G~~~i~~~~~~~p----------~~~~~~~~v~n~~~~iVd~~a~rl~ 86 (504) T protein:vir:99 18 LNDDV-VDKVNGLYQQLVDRTPRNLLRASFYDGKYAIRQIGNLIP----------PEYLRTATVLGWSAKAVDTLARRCN 86 (504) T ss_pred CCHHH-HHHHHHHHHHHHHHhHHHHHHHHHHhccccchhcccccc----------HHHHHHhhccCcHHHHHHHHHhhhc Confidence 77666 355788888888899999999999999998754332211 1122446789999999999999998 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCc--eEEEEEccceeEEEEeCCCCCceE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNN--FRYGIIQPDQITPVYATTLDNKLL 157 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~--~~i~~~~p~~~~~~~d~~~~~~~~ 157 (469) .++.+.. +++..+..++++|+.|.++ ...+++++++++|+||++||.+++|+ +.|++++|.+++++||+.. .++. T Consensus 87 ~~Gf~~~-d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d~~~~~~I~~~sP~~~~~iyD~~~-~~~~ 164 (504) T protein:vir:99 87 LESFVWP-DGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGAGEPDSLIHVKSAMQATGEWNSRR-NAMD 164 (504) T ss_pred cceeeCC-CCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceeEEEEeCCC-Ccee Confidence 8776543 4556677899999988765 56799999999999999999999886 4688999999999999865 5677 Q ss_pred EEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 158 GVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 158 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) +++++|.. +.+ .....+++|+++.++++...+.+.. 
..+..+|++| ||||+|.|++ T Consensus 165 ~a~~~~~~-d~~--g~~~~~~~y~~~~~~~~~~~~~~~~--------------------~~~~~~~~~g-vPvV~~~n~~ 220 (504) T protein:vir:99 165 SLLSITSR-DAE--GHPTGIALYEDGVTVTADMDDDGDW--------------------HADVRTHKLG-VPVEVLPYKP 220 (504) T ss_pred EEEEEEEe-cCC--CeEEEEEEEcCCcEEEEEEcCCcee--------------------eeccccCCCC-cceEEecccc Confidence 77776653 333 3455678999998888765433211 1245689998 9999999873 Q ss_pred -----cccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc----hh--hhhhhhhcceeeecccCCC-- Q lcl|NC_010179. 238 -----YRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL----KQ--FMNDLREYKSIKINNAGNG-- 303 (469) Q Consensus 238 -----~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~----~~--~~~~~~~~~~~~~~~~~~~-- 303 (469) .|.|++. +|++|+|++|+++|++.+..++|++|++++.|...++. +. ........+++.++....+ T Consensus 221 ~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~ 300 (504) T protein:vir:99 221 REDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPD 300 (504) T ss_pred cCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCcccccc Confidence 5888874 89999999999999999999999999999999765331 11 1122223455555544332 Q ss_pred -CCCcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCcc--cc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 304 -DKSGVDKLQIDI-PVEARDDALKITRDNIFLFGQGIDPANF--ES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINEL 378 (469) Q Consensus 304 -~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 378 (469) .++++++.+.+. +.+.+.+.++.+..+|+..|++|..+++ +. 
+++||+||++++.+|.+||.++++.|+.+|+++ T Consensus 301 ~~~~~~~~~q~~~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~ 380 (504) T protein:vir:99 301 AARARADVKQFPASSPQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASREDLIAEAEGATDDWSPAFRRS 380 (504) T ss_pred ccCccceeeecCCCChHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234577776554 5677777777777777777888865553 32 468999999999999999999999999999999 Q ss_pred HHHHHHHhcccC---CCcccceEEeCCCCCCCHHHHHHHHHHHhcc-----CChHHHHHhCCCCCCHHHHHHHHHHHHHH Q lcl|NC_010179. 379 VRAIMRYLNFSD---ADKRHISQHWTRTKVEDSLTKAQIVSTVANY-----SSKEAVAKANPIVDDWQQELKDLAKDREE 450 (469) Q Consensus 379 ~~~i~~~~~~~~---~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~-----iS~et~~~~l~~v~d~~~E~eri~~E~~~ 450 (469) +++++.+.+..+ .++..+++.|+++.+.|.++.+++++|+.+. .+.+++++++|+.+ .|++|+++|.+. T Consensus 381 ~rla~~~~~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg~~~---~ei~r~~~e~~~ 457 (504) T protein:vir:99 381 MIRALAIKNGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLGLTP---QQAKRALAERRR 457 (504) T ss_pred HHHHHHHhcCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcCCCH---HHHHHHHHHHHH Confidence 999999876543 3457899999999999999999999998542 34688999998753 456666555433 Q ss_pred hhh--h---HhhcccC--CCCCCCCC Q lcl|NC_010179. 451 NDP--Y---ANQADEL--NGKGVDDE 469 (469) Q Consensus 451 ~~~--~---~~~~~~~--~~~~~~de 469 (469) ... . ....+.. .+++..++ T Consensus 458 ~~~~~~~~~l~~~~~~~~~~~~~~~~ 483 (504) T protein:vir:99 458 ASSVSIIEALNRRQQEAATAGEDQDQ 483 (504) T ss_pred HhhHHHHHHHhcccCCCCCCCCCCCc Confidence 211 1 1111111 11112222 No 58 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=5.3e-65 Score=372.98 Aligned_cols=405 Identities=9% Similarity=-0.002 Sum_probs=278.0 Q ss_pred ccccccchhhhcccccccccccCcc-eeccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHH Q lcl|NC_010179. 
37 ITTRNNGKPKVSKEGKKDPLRSADN-RIPSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLV 114 (469) Q Consensus 37 i~~~~~~~~~~~~~~~~~~~~~~~~-ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~ 114 (469) ++.+.. .+.-+..+ +.++|||++||++.++++.+++ |++++.+.++.++++|+.|++ +...++++ T Consensus 1 ~l~~~~-----------~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~g--f~~~d~~~~~~~~~i~~~N~~d~~~~~~~~ 67 (434) T protein:vir:98 1 MLPKNA-----------EQAFLDFQRKARTNFCGLIANASVHRLLALG--VTGPDGEPDTRASRWWQANRLDSRQKLVWR 67 (434) T ss_pred CCCCCc-----------cHHHHHhhhhhhccchHHHHHHHHhhhccCc--eecCCCchHHHHHHHHHhcChhHHHHHHHH Confidence 111111 01111122 3578999999999999998665 557777888999999988755 45679999 Q ss_pred HHHhCCeEEEEEEEcCCC-------ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEE Q lcl|NC_010179. 115 DSSNAGRAWLHYWIDEDN-------NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQF 187 (469) Q Consensus 115 ~~~~~G~~~~~v~~d~~~-------~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (469) +++++|+||++||.++++ .+.|++++|.+++++||+..+ ++.+++++|.... ++..+. .+.+++...+++ T Consensus 68 ~a~i~G~ay~~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~-~~~~ai~~~~~~~-~~~~~~-~~~~~~~~~~~~ 144 (434) T protein:vir:98 68 MAMAQSAGYMLVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETG-EPLVGLKVWHNDI-DGFGYA-RVFFDDTSFPYR 144 (434) T ss_pred HHhhcCceEEEEecCCCcccccCCceeEEEEeccceeEEEEeCCCC-ceEEEEEEEEecc-CCceEE-EEEEeCcEEEEE Confidence 999999999999987654 467999999999999998765 6889998886433 333332 333444444443 Q ss_pred EEeecCceeecccccccccccccccccccccccccccCCcccEEEecCC----ccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 188 FRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN----KYRLAELNKYKGLIDAYDDIYNGFIND 263 (469) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~----~~g~~~~~~v~~liD~~~~~~s~~~~~ 263 (469) +........... ..............+|+||.||||+|.|| ..|.|+|+++++|||+||+++|++++. 
T Consensus 145 ~~~~~~~~~~~~--------~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~~~~~~g~sd~e~vi~liDa~~~~~s~~~~~ 216 (434) T protein:vir:98 145 TRERTGARLPWG--------PDSWVYTGTADSGDVHDLGGMQLVEFARMPDLGEDPEPEFAGVLDIQDRVNLGILNRMAA 216 (434) T ss_pred Eeeccccccccc--------cccceecccccccccCCCCccceEEeccCCCcCcCCcchhhhHHHHHHHHHHHHHHHHHH Confidence 333222111110 00111122334567899999999999998 568999999999999999999999999 Q ss_pred HHHhcCceeEEecCCcccchhhhhhhh-hcceeeeccc--CCCCCCcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCC Q lcl|NC_010179. 264 LDDVQTVILVLTNYGGASLKQFMNDLR-EYKSIKINNA--GNGDKSGVDKLQIDI-PVEARDDALKITRDNIFLFGQGID 339 (469) Q Consensus 264 ~~~~~~p~l~~~g~~~~~~~~~~~~~~-~~~~~~~~~~--~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~ 339 (469) .+++++|+++++|.............. .........+ ...+++++++.+.+. +.+++.+.++.+.++|+..+.+|+ T Consensus 217 ~~~~a~p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~ 296 (434) T protein:vir:98 217 SRFSGFRQKWIKGHKFAKRTDPATGMTVVDQPFVPSPSAVWASEGENTQFGQLDATDLSGFLKEHASDVRDMLTISQTPT 296 (434) T ss_pred HHHhcchhhhhcCCCcccccccccccchhhhhhhccccccccCCCCCceEEEecCcchHHHHHHHHHHHHHHhcccCCCH Confidence 999999999999975443221111100 0000000000 001234577777554 445555555555555555555554 Q ss_pred cCccc-cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_010179. 340 PANFE-SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV 418 (469) Q Consensus 340 ~~~~~-~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl 418 (469) ..+.+ .+|+||+||++++.+|.+||+++++.|+++|++++++++.+.+.. 
.+..+++++|+++.|+|.++.+|+++|| T Consensus 297 ~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~~-~~~~~~~v~w~~~~~~s~~~~ada~~kl 375 (434) T protein:vir:98 297 YLYATDLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQAGVP-EDYTEAEVRWANPAHVTMAVKADAATKL 375 (434) T ss_pred HHhccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC-hhheeeeEEecCCCCCCHHHHHHHHHHH Confidence 44433 257999999999999999999999999999999999999887653 3566899999999999999999999999 Q ss_pred hcc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHhhh----hHhhcccCCCCCCCCC Q lcl|NC_010179. 419 ANY-SSKEAVAKANPIVDDWQQELKDLAKDREENDP----YANQADELNGKGVDDE 469 (469) Q Consensus 419 ~g~-iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~----~~~~~~~~~~~~~~de 469 (469) .|+ +|.+++++++|+++ +|++|+.+|+++... ...+......+.++++ T Consensus 376 ~~~g~~~e~~~~~lg~~~---~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~ 428 (434) T protein:vir:98 376 KSIGYPLDVIAEELDESP---ARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDS 428 (434) T ss_pred HhcCCcHHHHHHhCCCCH---HHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcc Confidence 876 89999999999854 578887777554322 2222333333333333 No 59 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=1.2e-62 Score=360.13 Aligned_cols=401 Identities=12% Similarity=0.028 Sum_probs=304.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+...+..|++.+ ..+.++++++.+||+|+|++.+.....+. .-++.+|.++|||+++|+..++.+. 
T Consensus 1 m~~~~i~~L~~~~----~~~~~r~~~~~~yy~g~~~~~~~~~~~p~---------~~~~~~~~v~nw~~~~Vd~~a~rl~ 67 (422) T protein:vir:97 1 MNYMGMGYLRRKL----ALFKTGVDKRYRYYAMDDRDDTRSIVMPN---------NVREMYRSVLEWTAKGVDSLADRII 67 (422) T ss_pred CChHHHHHHHHHH----HHHHHHHHHHHHHHhcCCChhhcCccccH---------HHHHHHHhhcchhHHHHHHHHhccc Confidence 9988887776555 44567899999999999987544322111 1123346778999999999999764 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcC-CCceEEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDE-DNNFRYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~-~~~~~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) -+. |+++|. .++++|+.|.++ ...++++.++++|+||++||.++ +|.+.|++++|.+++++||+.. +++.+ T Consensus 68 ~~G--f~~~d~----~l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D~~~-~~~~~ 140 (422) T protein:vir:97 68 FRE--FTNDDF----NAWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILDPTT-FLLTE 140 (422) T ss_pred cce--eeCCch----hHHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEeCCC-Cccee Confidence 443 455543 378899887665 56789999999999999999986 6889999999999999999875 45666 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc- Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK- 237 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~- 237 (469) ++++|. .+.++ ..+...+|++..+.++..... 
....+|++|.||+|+|.|++ T Consensus 141 a~~~~~-~~~~~--~~~~~~~~~~~~~~~~~~~~~------------------------~~~~~~~~g~vPvv~~~n~~~ 193 (422) T protein:vir:97 141 GYAILE-SDSNG--NPTLEAYFTDKDIWYYPKKGK------------------------PYNIKNPTGHPLLVPIIHRPD 193 (422) T ss_pred eEEEEE-ecCCC--cEEEEEEEcCceEEEEcCCCc------------------------cccccCCCCCcceEEecccCC Confidence 666654 22233 334556777777766643211 12357999999999999864 Q ss_pred ----cccccH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe Q lcl|NC_010179. 238 ----YRLAEL-NKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ 312 (469) Q Consensus 238 ----~g~~~~-~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 312 (469) +|.|++ +++++|+|++|++++++....+++++|+++++|.+.+............+++.++.+.+++. +++.+ T Consensus 194 ~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~de~~~~--~~v~q 271 (422) T protein:vir:97 194 AVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWRATVSTLLEISKDEDGDK--PTVGQ 271 (422) T ss_pred CccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCcccccCchhhhhhhhhhccCCCCCCCc--ceeee Confidence 588988 78999999999999999999999999999999986433323233333446666766555444 45544 Q ss_pred ecC-CHHHHHHHHHHHHHHHHHHhCCCCcCccccC-C-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010179. 313 IDI-PVEARDDALKITRDNIFLFGQGIDPANFESS-N-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS 389 (469) Q Consensus 313 ~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~ 389 (469) .+. +.+.+.+.++.+..+++..|++|...+...+ | +||+||++++.+|.+||.++++.|+.+|++++++++.+.+.. 
T Consensus 272 ~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~ 351 (422) T protein:vir:97 272 FTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQRSFSSGFLNVAYIAVCLRDEF 351 (422) T ss_pred cCCCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 443 5667777777777777777788866665433 4 699999999999999999999999999999999999987753 Q ss_pred C---CCcccceEEeCCCCCCC---HHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_010179. 390 D---ADKRHISQHWTRTKVED---SLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDREEN 451 (469) Q Consensus 390 ~---~~~~~i~i~f~~~~p~d---~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~ 451 (469) . ..+.++++.|+++.|.| .++.+|+++|+. |+++.+++++++|+ ++++.|..|+.+++.+- T Consensus 352 ~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~-~~~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 352 PYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLTGV-KGADKPIPAITEVTTDG 422 (422) T ss_pred cccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHcCC-CchhHHHHHHHhhhccC Confidence 3 24567999999888887 778889999973 57899999999998 77888999998886544 No 60 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=3.1e-61 Score=352.32 Aligned_cols=389 Identities=10% Similarity=0.007 Sum_probs=297.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+...|.+|++.+ ..+.++++++.+||+|+|++.+.....+. .-++++|+++|||++||+..++++. 
T Consensus 1 ~~~~~i~~L~~~~----~~~~~r~~~~~~yY~g~~~~~~~~~~~p~---------~~~~~~~~v~nw~~~iVds~a~rl~ 67 (409) T protein:vir:94 1 MTEKGIGYLRFKL----SVHKRRAEMRYDQYAMKYVDRFKGITIPQ---------ALSQQYRSILGWCAKGVDSLADRLV 67 (409) T ss_pred CCHHHHHHHHHHH----HHHhHHHHHHHHHhcccCchhhcChhhhH---------HHHHHHhhhcchhHHHHHHhHhhcc Confidence 9988888877665 44567889999999999987543322111 1123457889999999999999775 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) -+. |++++. .++++|+.|.++ ...++++.++++|++|++||.+++|+++|++++|.+++.+||+.. +++.++ T Consensus 68 ~~G--f~~~d~----~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D~~~-~~~~~a 140 (409) T protein:vir:94 68 FRE--FENDDF----TVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIIDPIT-GLLTEG 140 (409) T ss_pred cCc--ccCCch----HHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEecCC-Cceeee Confidence 333 455543 478999888765 577999999999999999999999999999999999999999854 578888 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc-- Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK-- 237 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-- 237 (469) ++++... 
.........+|+++.++++......+ ...+|++|.||+|+|.|++ T Consensus 141 ~~~~~~d---~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~n~~g~vPvV~f~n~~~~ 194 (409) T protein:vir:94 141 YAVLERD---ENNNVVLEAHFLPDRTDYYYRDSRNN-----------------------ISIANPTGHPLLVPIIHRPDA 194 (409) T ss_pred EEEEEec---CCCceEEEEEEecCcEEEEEecCcee-----------------------EeeeCCCCCcceEEecccccc Confidence 8876432 22233456778888777665443221 2347999999999999874 Q ss_pred ---cccccH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEee Q lcl|NC_010179. 238 ---YRLAEL-NKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQI 313 (469) Q Consensus 238 ---~g~~~~-~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 313 (469) .|.|++ +++++|+|++|++++++.+..+++++|+++++|.+.+...........++++.++.+.+++ ++++.+. T Consensus 195 ~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~dg~--~~~v~q~ 272 (409) T protein:vir:94 195 VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGD--KPTLGQF 272 (409) T ss_pred ccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhHHHhhcCCCCCCCC--CceEEec Confidence 588988 6899999999999999999999999999999998643322222333345666676655544 4555554 Q ss_pred cC-CHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_010179. 314 DI-PVEARDDALKITRDNIFLFGQGIDPANFES-SN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD 390 (469) Q Consensus 314 ~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~ 390 (469) +. +.+.+.+.++.+..+++..|++|..++... 
.| +||+||++++.+|..|+.++++.|+.+|++++++++.+.+..+ T Consensus 273 ~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~~~fg~~~~~~~rla~~i~~~~~ 352 (409) T protein:vir:94 273 TQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDAP 352 (409) T ss_pred CCCChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC Confidence 44 567777777777777777778877666543 35 6999999999999999999999999999999999999877643 Q ss_pred ---CCcccceEEeCCCCCCC---HHHHHHHHHHHh--c--cCChHHHHHhCCCCCCHH Q lcl|NC_010179. 391 ---ADKRHISQHWTRTKVED---SLTKAQIVSTVA--N--YSSKEAVAKANPIVDDWQ 438 (469) Q Consensus 391 ---~~~~~i~i~f~~~~p~d---~~e~~~~~~kl~--g--~iS~et~~~~l~~v~d~~ 438 (469) .++..+++.|.+..|.+ .++.+|+++|+. | +.+.++++.++|+.++ + T Consensus 353 ~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~~~~lG~~~~-d 409 (409) T protein:vir:94 353 YLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIEGG-E 409 (409) T ss_pred ccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHHHHHcCCCCC-C Confidence 34578999999776665 678889999984 3 5678999999998653 2 No 61 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=2.5e-60 Score=347.32 Aligned_cols=429 Identities=16% Similarity=0.098 Sum_probs=312.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.++ ..+|..|+.++..+.++++++.+||+|+|.+.+.....+ ....+.++++||++++|+..++++. 
T Consensus 12 l~~~~-~~~~~~L~~~~~~~~~~~~~~~~Yy~G~~~~~~~~~~~p----------~~~r~~~~v~nw~~~~Vd~~a~rl~ 80 (474) T protein:vir:81 12 LSNDE-NALINGLLAQIENLRWKNLLRTSYYENKRTIQYVGTLIP----------PQYFNLGLVLGWTGKAVDALARRCN 80 (474) T ss_pred CChhH-HHHHHHHHHHHHHHhhHHHHHHHHhccCCChhhcccccc----------HHHHHHHhhcChHHHHHHHHHhhhc Confidence 88876 567888888999999999999999999998754432211 1112345789999999999999997 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHH-HHHHHHHHHhCCeEEEEEEEcCCCc--eEEEEEccceeEEEEeCCCCCceE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALT-LNSLLVDSSNAGRAWLHYWIDEDNN--FRYGIIQPDQITPVYATTLDNKLL 157 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~~--~~i~~~~p~~~~~~~d~~~~~~~~ 157 (469) -+..+.. +.+.....++++|+.|.++. ..++++.++++|+||++|+.+++|+ +.|++++|.+++++||+.. +++. T Consensus 81 ~~Gf~~~-d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~~~~D~~~-~~~~ 158 (474) T protein:vir:81 81 LEGFVWP-DGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEATGEWNRRR-RGLN 158 (474) T ss_pred ccceECC-CCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEEEEEeCCC-Ccce Confidence 7766643 44445567899999887765 6789999999999999999977765 7799999999999999876 4566 Q ss_pred EEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc Q lcl|NC_010179. 158 GVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK 237 (469) Q Consensus 158 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~ 237 (469) +++..+. .+.+|. .+...+|.++.++++......+. 
+ ..+..+|++| ||||+|.|++ T Consensus 159 ~al~~~~-~~~~g~--~~~~~ly~~~~~~~~~~~~~~~~------------------w-~~~~~~~~~g-vPvV~~~n~~ 215 (474) T protein:vir:81 159 NLLSIID-KDKEGK--VLSLALYLDNETVTAQRDKATLK------------------W-QVDRDEHVYG-VPAQVLPYKP 215 (474) T ss_pred eeeEEEE-EcCCCc--EEEEEEEeCCcEEEEEEcCccce------------------e-eeccCCCCCC-cceEEecccc Confidence 6666554 333333 34567888887777654332111 1 1245689999 7999999874 Q ss_pred -----cccccH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch----hhhhhh--hhcceeeecccCCCCC Q lcl|NC_010179. 238 -----YRLAEL-NKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK----QFMNDL--REYKSIKINNAGNGDK 305 (469) Q Consensus 238 -----~g~~~~-~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~----~~~~~~--~~~~~~~~~~~~~~~~ 305 (469) +|+|++ +++++|+|++|++++++....+++++|++++.|....... .....+ ...++..++.+.+++. T Consensus 216 ~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~~i~~~~~d~d~~~ 295 (474) T protein:vir:81 216 APKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLGRIKGLPDDADADI 295 (474) T ss_pred cccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHHHHhcCCCcccccc Confidence 688887 7999999999999999999999999999999998653321 111111 1223444554443332 Q ss_pred ---CcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCcc--ccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 306 ---SGVDKLQIDI-PVEARDDALKITRDNIFLFGQGIDPANF--ESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAINEL 378 (469) Q Consensus 306 ---~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 378 (469) +.+++.+.+. 
+.+.+.+.++.+..++...|++|...++ +..| +||+||+++...|..|+.++++.|+.+|+++ T Consensus 296 ~~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~~l~~kae~k~~~fg~~l~~~ 375 (474) T protein:vir:81 296 PQLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQYELIAEAEGAVDDFTPALRKA 375 (474) T ss_pred cccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3466666554 6677777788888888888899876553 3455 6999999999999999999999999999999 Q ss_pred HHHHHHHhcccCC-----CcccceEEeCCCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHH-- Q lcl|NC_010179. 379 VRAIMRYLNFSDA-----DKRHISQHWTRTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKD-- 447 (469) Q Consensus 379 ~~~i~~~~~~~~~-----~~~~i~i~f~~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E-- 447 (469) +++++.+.+.... ....+++.|.++..++.++.+|+++|+. |+.+.+++++++++++ .|++|++++ T Consensus 376 ~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg~t~---~~i~~~~~~~~ 452 (474) T protein:vir:81 376 FIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIGLTP---QQARRAMADKR 452 (474) T ss_pred HHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcCCCH---HHHHHHHHHHH Confidence 9999998875332 3568999999999999999999999984 3566778888887654 455555444 Q ss_pred -HHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 448 -REENDPYANQADELNGKGVDDE 469 (469) Q Consensus 448 -~~~~~~~~~~~~~~~~~~~~de 469 (469) ++.............++ ...| T Consensus 453 ~~~~~~~~~~l~~~~~~~-~~aq 474 (474) T protein:vir:81 453 RVQGRGTLQALIDRSNNG-ATAQ 474 (474) T ss_pred HHhHHHHHHHHHhcCCCC-CCCC Confidence 33333333322322222 2222 No 62 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=5.9e-61 Score=350.79 Aligned_cols=390 Identities=10% Similarity=0.009 Sum_probs=296.3 Q ss_pred HHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeeeccCchhh Q lcl|NC_010179. 
14 STSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDIDVGKDAD 93 (469) Q Consensus 14 ~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~ 93 (469) +..| .+|++++.+||+|+|++.+-....+. .-++++|+++||+++||++.++++.-+. |+.+|. T Consensus 1 l~~~---~~r~~~~~~yY~g~~~~~~~~~~~p~---------~~~~~~~~v~nw~~~~Vds~a~rl~~~G--f~~~d~-- 64 (410) T protein:vir:95 1 MNLY---QSRVNLRYKHYAMQHYEAPTGITIPA---------HIRAKYQAVLGWAAKGVDSLADRLIFRA--FANDDF-- 64 (410) T ss_pred CCcc---hhhHHHHHHHhcCCCCccccchhccH---------HHHhHHHhhcchhHHHHHHhHhhhcccc--ccCCCc-- Confidence 4444 45678889999999987543322111 1123457789999999999999886444 444443 Q ss_pred HHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCce Q lcl|NC_010179. 94 NKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGK 172 (469) Q Consensus 94 ~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~ 172 (469) .++++|+.|.++ .+.++++.++++|+||++||.+++|+++|++++|.+++++||+. .+++.++++.+... ... T Consensus 65 --~l~~i~~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~-~~~~~~al~~~~~~---~~~ 138 (410) T protein:vir:95 65 --NVTEIFDRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPI-TGLLVEGYAVLARD---DYN 138 (410) T ss_pred --hHHHHHhhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCC-CCceEEEEEEEEec---CCC Confidence 378999888665 57799999999999999999999999999999999999999985 46788888876432 222 Q ss_pred EEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc-----cccccH-HHH Q lcl|NC_010179. 173 YFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK-----YRLAEL-NKY 246 (469) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~g~~~~-~~v 246 (469) ....+.+|+++.+.++...+.. 
...+|++|.||+|+|.|++ .|.|++ ++| T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~------------------------~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v 194 (410) T protein:vir:95 139 RPTLEAYFEPNATHFIPKDGEP------------------------YSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAG 194 (410) T ss_pred eEEEEEEEeCCcEEEEeeCCcc------------------------ccccCCCCCcceEEecccccCCccCCccccchhH Confidence 3456778888888877643221 1347999999999999864 588887 789 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecC-CHHHHHHHHH Q lcl|NC_010179. 247 KGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDI-PVEARDDALK 325 (469) Q Consensus 247 ~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~ 325 (469) ++|+|++|++++++....+++++|++++.|.+.+............+++.++.+.+++ .+++.+.+. +.+.+.+.++ T Consensus 195 ~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~~~~~~--~~~v~q~~~~~l~~~~~~l~ 272 (410) T protein:vir:95 195 MYYQKYAKRTLERADITAEFYSWPQKYILGLDPDAEPMEKWKATVSSLLTISSSDKGV--KPSVGQFTTASMSPFTEQLR 272 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhhhhhheeccCCCCCC--cceEEecCCCChHHHHHHHH Confidence 9999999999999999999999999999998654333333344445667776665444 455655444 6777778888 Q ss_pred HHHHHHHHHhCCCCcCcccc-CC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC---CCcccceEEe Q lcl|NC_010179. 326 ITRDNIFLFGQGIDPANFES-SN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD---ADKRHISQHW 400 (469) Q Consensus 326 ~l~~~i~~~s~~p~~~~~~~-g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~---~~~~~i~i~f 400 (469) .+..++...|++|...+... 
.| +||+||+++..+|..|+.++++.|+.+|++++++++.+.+..+ .....+++.| T Consensus 273 ~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~v~W 352 (410) T protein:vir:95 273 TAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDEFRYTRSQFVRTAVKW 352 (410) T ss_pred HHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcccccceeeEEe Confidence 88888888888887766543 35 6999999999999999999999999999999999999876543 3456789999 Q ss_pred C---CCCCCCHHHHHHHHHHHh----ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhh Q lcl|NC_010179. 401 T---RTKVEDSLTKAQIVSTVA----NYSSKEAVAKANPIVDDWQQELKDLAKDREENDP 453 (469) Q Consensus 401 ~---~~~p~d~~e~~~~~~kl~----g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~ 453 (469) . ++...+.++.+|+++|+. |++|.+++++++|++++ .+..++.+|+++.-+ T Consensus 353 ~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~lg~~~~--~~~~~~~~e~~~~g~ 410 (410) T protein:vir:95 353 EPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLTGIAGD--MSAKPVVSEGGSNGE 410 (410) T ss_pred eecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhcCCChH--HHHHHHHHHHHhCCC Confidence 8 455568999999999972 68899999999999754 233333444332222 No 63 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=8.5e-59 Score=338.96 Aligned_cols=446 Identities=15% Similarity=0.146 Sum_probs=320.5 Q ss_pred CCHH-HHHHHHHHHHHH------------------HHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcc Q lcl|NC_010179. 1 MELD-ALKKLIRNTSTS------------------RNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADN 61 (469) Q Consensus 1 ~~~~-~~~~~i~~~~~~------------------~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 61 (469) |.+- -|+.+|+++..+ -..++.+++++++||+|+|+.+..... 
....+.++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~----------~~~~~~~~ 70 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNS----------YGDTQKHE 70 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCcccccccc----------CCCccccc Confidence 6543 344455554321 145678889999999999876543221 11223445 Q ss_pred eeccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEc Q lcl|NC_010179. 62 RIPSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQ 140 (469) Q Consensus 62 ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~ 140 (469) ++++|+++.||++.|+|+||+||+++++++..++.|++++++| +...+.+++..++..|.+++.+|+| .|+++|.+++ T Consensus 71 ~~slnl~~~i~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D-~~~~~i~~v~ 149 (505) T protein:vir:79 71 LQSVNVTKLASAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVD-SGKIKLAWAT 149 (505) T ss_pred eeecchHHHHHHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEe-CCceEEEEEc Confidence 7889999999999999999999999999999999999999876 6677889999999999999999998 4789999999 Q ss_pred cceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCC----eEEEEEeecCceeeccccccccccccccccccc Q lcl|NC_010179. 141 PDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDK----EAQFFRTSATDSTVIEPYNIITSYDLSAGYETG 216 (469) Q Consensus 141 p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (469) |++++|++.++.+...++++..|...+.....+++.+|+|+.. .|.+..........+.. .........+... 
T Consensus 150 ad~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~---~v~l~~~~~~~~l 226 (505) T protein:vir:79 150 ADQVYPLQADTNQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGI---NVPLNSLEQYEGL 226 (505) T ss_pred CCeeEEEEEcCCCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCc---ccchhhccccccc Confidence 9999998544444445555666666666667788899999743 33332222211111110 0011111112223 Q ss_pred ccccccccCCcccEEEecCC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC------C-cc Q lcl|NC_010179. 217 QSNTLKHNFGRVPFIEFPKN---------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY------G-GA 280 (469) Q Consensus 217 ~~~~~~~~~g~vPvv~~~n~---------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~------~-~~ 280 (469) ++.....++.+.|+++|+|+ +.|.|+|++++++||++|.++|++++.++....++.+...+ . +. T Consensus 227 ~~~~~~~g~~~p~f~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~ 306 (505) T protein:vir:79 227 EPQVKITGLKHPLFAFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQ 306 (505) T ss_pred CcceeecCCCcceEEEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcc Confidence 33344467788888888652 35899999999999999999999999999998888873322 1 11 Q ss_pred cchhhhhhh-hhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--CccccCCccHHHHHHHH Q lcl|NC_010179. 281 SLKQFMNDL-REYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFESSNASGVAIKMLY 357 (469) Q Consensus 281 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~Sg~Al~~~~ 357 (469) ......... ....+... ..+++.++.++.++++++.+++.+.++.+.+.|...++.+.. ++.+.|..||.++++.. 
T Consensus 307 ~~~~~~~~fd~~~~~y~~-~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~ 385 (505) T protein:vir:79 307 ASETHPPMFDPDETVYQA-MYGDASEVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNN 385 (505) T ss_pred cccccccCCCccceeeee-ccCCCCCCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHH Confidence 100000001 11222222 122333467899999999999999999999999988877653 33345667999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCC Q lcl|NC_010179. 358 SHLELKAAKTQTYFEHAINELVRAIMRYLNF------------SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSS 423 (469) Q Consensus 358 ~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~------------~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS 423 (469) +.+.++++++++.|+.+|+++++.|+.+... ...+..+++|.|+++++.|..+.++..+++ +|++| T Consensus 386 ~~l~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s 465 (505) T protein:vir:79 386 SQTYQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMP 465 (505) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCC Confidence 9999999999999999999999999876432 122346789999999999999999988876 69999 Q ss_pred hHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCC Q lcl|NC_010179. 424 KEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVD 467 (469) Q Consensus 424 ~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~ 467 (469) ++++++.+++++| +++|++||++|+.+..|.+. 
+.|+| T Consensus 466 ~e~~l~~~~~~~eeea~~el~ri~~E~~~~~p~~~------~~gg~ 505 (505) T protein:vir:79 466 KKQFLMRNYGLDEEEADEWLAQIDAENSTAEPEFN------QFGGD 505 (505) T ss_pred HHHHHHhcCCCChHHHHHHHHHHHHhccccCCCch------hccCC Confidence 9999999998877 67899999999876555432 22222 No 64 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=7.1e-59 Score=339.39 Aligned_cols=449 Identities=16% Similarity=0.155 Sum_probs=319.3 Q ss_pred CCH-HHHHHHHHHHHH------------------HHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcc Q lcl|NC_010179. 1 MEL-DALKKLIRNTST------------------SRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADN 61 (469) Q Consensus 1 ~~~-~~~~~~i~~~~~------------------~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 61 (469) |.+ +-|+.++++.+. --..++.+++.+.+||+|+|+....... ....+... T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~----------~~~~~~~~ 70 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQAS----------DGIKKKRL 70 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccC----------CCCccccc Confidence 543 233333333221 1245778999999999999864422110 11222345 Q ss_pred eeccchHHHHHHHHHHhhhcCCeeecc-CchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEE Q lcl|NC_010179. 62 RIPSNFYQLLVDQEAGYIASVFPDIDV-GKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGII 139 (469) Q Consensus 62 ri~~n~~k~iv~~~~~~l~g~p~~~~~-~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~ 139 (469) ++++|+++.|++..|+++||+|+++++ +++..++.|++++++| +...+.+.+..+++.|.+++.+|+|. 
++++|.++ T Consensus 71 ~~sln~~~~i~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~i~~v 149 (508) T protein:vir:15 71 KNTINMAKTAARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDG-NHIKIAWV 149 (508) T ss_pred eeecchHHHHHHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeC-CeeEEEEE Confidence 789999999999999999999999998 5566777899999876 56678899999999999999999985 67999999 Q ss_pred ccceeEEE-EeCCCCCceEEEEEEEEeeecCCceEEEEEEEEc-----CCeEEEEEeecCceeecccccccccccccccc Q lcl|NC_010179. 140 QPDQITPV-YATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWT-----DKEAQFFRTSATDSTVIEPYNIITSYDLSAGY 213 (469) Q Consensus 140 ~p~~~~~~-~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (469) +|++++|+ |+.+ +..-.++++.+...+..+.++++++|+|+ ...|.+..........+... ........+ T Consensus 150 ~ad~~~P~~~d~~-~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~---v~l~~~~e~ 225 (508) T protein:vir:15 150 RADQFYPLQSNTN-DISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQ---VPLSTLPVY 225 (508) T ss_pred cCCeeEEEEEcCC-CeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcc---cchhhcccc Confidence 99999997 5543 33334444555555556677889999986 34444333332221111111 111111112 Q ss_pred cccccccccccCCcccEEEecCC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC---Cccc Q lcl|NC_010179. 214 ETGQSNTLKHNFGRVPFIEFPKN---------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY---GGAS 281 (469) Q Consensus 214 ~~~~~~~~~~~~g~vPvv~~~n~---------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~---~~~~ 281 (469) ...++....+++.++|+++|+++ +.|.|+|++++++||++|.++|+++++++....++++..++ ++.. 
T Consensus 226 ~~l~~~~~~~g~~~p~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~~ 305 (508) T protein:vir:15 226 KELAPQVTISGLQRPLFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDEH 305 (508) T ss_pred cCCCcceEecCCCcceeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCCC Confidence 22233444567888999999752 45999999999999999999999999999888888874443 2221 Q ss_pred chhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcC--ccccCCccHHHHHHHHHH Q lcl|NC_010179. 282 LKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPA--NFESSNASGVAIKMLYSH 359 (469) Q Consensus 282 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~~~g~~Sg~Al~~~~~~ 359 (469) ...+.. ..+.+..-..++..+..++.++++++.+++.+.++.+.+.|...++.+... +.+.|..||.++++..+. T Consensus 306 ~~~~~~---~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~ 382 (508) T protein:vir:15 306 KPTFDT---EQNVYVGVLSDDNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSM 382 (508) T ss_pred ccccCC---CCeeEEeccCCCCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHH Confidence 111111 122233222233344568999999999999999999999998888777543 334456799999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----C---------CCcccceEEeCCCCCCCHHHHHHHHHHH--hccCC Q lcl|NC_010179. 
360 LELKAAKTQTYFEHAINELVRAIMRYLNFS-----D---------ADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSS 423 (469) Q Consensus 360 l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-----~---------~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS 423 (469) +.++++.+++.|+.+|++++++|+.+...- + .+..+++|.|+++++.|..++++..+++ +|++| T Consensus 383 ~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s 462 (508) T protein:vir:15 383 TYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALS 462 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCC Confidence 999999999999999999999998765421 1 2345689999999999999999988875 69999 Q ss_pred hHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 424 KEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 424 ~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++++++++|+++| +++|++||++|+.+..+... +..+.+|.++| T Consensus 463 ~e~~i~~~~g~~deea~~el~ri~~E~~~~~~~~~--~~~~~~g~~ge 508 (508) T protein:vir:15 463 KQTFLQRNYGMTDEQAAEELAKIQSEAPTDTFEGG--RSAILNGGDGE 508 (508) T ss_pred HHHHHHhcCCCChHHHHHHHHHHHHhccccCcccc--ccccCCCCCCC Confidence 9999999988876 56899999999877665543 33344555555 No 65 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=9.7e-60 Score=344.11 Aligned_cols=389 Identities=10% Similarity=0.007 Sum_probs=295.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+...|.+|++.+ ..+.++++++.+||+|+|.+.+-....+. .-+.++|.++|||++||++.++++. 
T Consensus 1 ~~~~~i~~L~~~~----~~~~~r~~~~~~yY~g~~~~~~~~~~~p~---------~~~~~~~~v~nw~~~iVds~a~rl~ 67 (409) T protein:vir:16 1 MTEKGIGYLRFKL----SVHKRRAEMRYEQYAMKHVDRFKGITIPQ---------ALSQQYRSILGWCAKGVDSLADRLV 67 (409) T ss_pred CCHHHHHHHHHHH----HHHhHHHHHHHHHHhccCchhhcchhhhH---------HHHHHHhhhcChhHHHHHHhHhhcc Confidence 9998888887666 34568889999999999977543222111 1123456788999999999999875 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHH-HHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRAL-TLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~-~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) -+. |++++ ..++++|+.|.++ ...++++.++++|++|++||.+++|+++|++++|.+++++||+.. +++.++ T Consensus 68 ~~G--f~~~d----~~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D~~~-~~~~~a 140 (409) T protein:vir:16 68 FRE--FENDD----FTVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIIDPIT-GLLTEG 140 (409) T ss_pred ccc--ccCcc----hHHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEeeccc-ccceee Confidence 333 45554 3478999888765 577999999999999999999999999999999999999999854 567787 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCc-- Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK-- 237 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-- 237 (469) +++|... .. 
...+...+|.++.+.++......+ ...+|++|.||+|+|.|++ T Consensus 141 ~~~~~~d-~~--~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~g~vPvV~f~n~~~~ 194 (409) T protein:vir:16 141 YAVLERD-EN--NNVVLEAHFLPDRTDYYYRDSRNN-----------------------ISIANPTGNPLLVPIIHRPDA 194 (409) T ss_pred eEEEEec-CC--CceEEEEEEecCcEEEEEecCccc-----------------------cceecCCCCcceEEecccccc Confidence 7776432 22 233455677777666554432211 2357999999999999874 Q ss_pred ---cccccH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEee Q lcl|NC_010179. 238 ---YRLAEL-NKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQI 313 (469) Q Consensus 238 ---~g~~~~-~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 313 (469) .|.|++ ++|++|+|++|++++++....+++++|++++.|.+.+...........++++.++.+.+++. +++.+. T Consensus 195 ~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~~~i~~~~~d~~g~~--~~v~q~ 272 (409) T protein:vir:16 195 VRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATVSSMLQFTKDEDGDK--PTLGQF 272 (409) T ss_pred cccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCCCCCccchhhhhhhHhhccCCCCCCCC--ceEEec Confidence 588988 68999999999999999999999999999999986433222223333456777766655544 445444 Q ss_pred cC-CHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_010179. 314 DI-PVEARDDALKITRDNIFLFGQGIDPANFES-SN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD 390 (469) Q Consensus 314 ~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~ 390 (469) +. +.+.+.+.++.+..+++..|++|...+... 
.| +||+||++++.+|..|+.++++.|+.+|++++++++.+.+..+ T Consensus 273 ~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~~~~~~~ 352 (409) T protein:vir:16 273 TQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYLAACLRDDVP 352 (409) T ss_pred CCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 43 567777777777888888888887666543 45 6999999999999999999999999999999999999877643 Q ss_pred C---CcccceEEeCCCCCCC---HHHHHHHHHHHhc----cCChHHHHHhCCCCCCHH Q lcl|NC_010179. 391 A---DKRHISQHWTRTKVED---SLTKAQIVSTVAN----YSSKEAVAKANPIVDDWQ 438 (469) Q Consensus 391 ~---~~~~i~i~f~~~~p~d---~~e~~~~~~kl~g----~iS~et~~~~l~~v~d~~ 438 (469) . ....+++.|.++.+.+ .++.+|+++|+.+ +...++++.++|+.++ + T Consensus 353 ~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v~~~~~g~~~~-d 409 (409) T protein:vir:16 353 YLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTIRDLTGIKGA-E 409 (409) T ss_pred ccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhHHHHhccCCCC-C Confidence 2 2467899999777554 7899999999854 3567899999998653 2 No 66 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=4.1e-56 Score=324.24 Aligned_cols=447 Identities=15% Similarity=0.110 Sum_probs=306.6 Q ss_pred CCH-HHHHHHHHHHH-----------------HHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcce Q lcl|NC_010179. 1 MEL-DALKKLIRNTS-----------------TSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNR 62 (469) Q Consensus 1 ~~~-~~~~~~i~~~~-----------------~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~r 62 (469) |.+ +-|+.+|++.. .--.+++.+|+.+.+||+|+|.-...... 
.+..+.+++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~----------~~~~~~~~~ 70 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNT----------DGETKKRDL 70 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccC----------CCCcccCce Confidence 432 22333333211 11256778999999999998643321111 112234557 Q ss_pred eccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEcc Q lcl|NC_010179. 63 IPSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQP 141 (469) Q Consensus 63 i~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 141 (469) +++|+++.|++..|+|+||+||+++++++..++.|++++++| |...+.+++..++..|.+++.+|+|. ++++|.+++| T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~a 149 (500) T protein:vir:98 71 NHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQA 149 (500) T ss_pred eecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcC Confidence 889999999999999999999999999999999999999886 66678899999999999999999985 6899999999 Q ss_pred ceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEc-----CCeEEEEEeecCceeeccccccccccccccccccc Q lcl|NC_010179. 142 DQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWT-----DKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETG 216 (469) Q Consensus 142 ~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (469) ++++|+..++......+++..+......+..+++++|+|+ ...|.+....+.....+ ....+....+... 
T Consensus 150 d~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~l-----G~~v~l~~~~~~l 224 (500) T protein:vir:98 150 PVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKV-----GSRVPLSEVYKDL 224 (500) T ss_pred CeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEeccccccc-----CcccccccccCCc Confidence 9999986555443333333333333334556788899886 22333322222111111 1111111112233 Q ss_pred ccccccccCCcccEEEecCC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC--------c Q lcl|NC_010179. 217 QSNTLKHNFGRVPFIEFPKN---------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG--------G 279 (469) Q Consensus 217 ~~~~~~~~~g~vPvv~~~n~---------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~--------~ 279 (469) +......++.+.|+++|+++ +.|.|+|++++++||++|.++|+++++++....++.+...+- + T Consensus 225 ~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g 304 (500) T protein:vir:98 225 KDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDG 304 (500) T ss_pred CcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCc Confidence 34444567788888888642 458999999999999999999999999999888887744321 1 Q ss_pred ccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--CccccCCccHHHHHHHH Q lcl|NC_010179. 280 ASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFESSNASGVAIKMLY 357 (469) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~Sg~Al~~~~ 357 (469) ........+.. .........+.+.+..++.++++++.+++.+.++.+.+.|...+..+.. ++.+.|..||.++++.. 
T Consensus 305 ~~~~~~~~d~~-~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~ 383 (500) T protein:vir:98 305 DVVPRPRFESD-QNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSEN 383 (500) T ss_pred cccCCcccCCC-cceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHH Confidence 11111111111 1222222223334457889999999999999999999988776665543 33344667999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHH Q lcl|NC_010179. 358 SHLELKAAKTQTYFEHAINELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAK 429 (469) Q Consensus 358 ~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~------~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~ 429 (469) +.+.++++++++.|+.+|++++++|+.+... .-....+++|.|+++++.|..++++..+++ +|+||.+++++ T Consensus 384 ~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~ 463 (500) T protein:vir:98 384 SDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQ 463 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHH Confidence 9999999999999999999999999876432 112345689999999999999999988876 78999999998 Q ss_pred hCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCC Q lcl|NC_010179. 
430 ANPIVDD--WQQELKDLAKDREENDPYANQADELNGK 464 (469) Q Consensus 430 ~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~ 464 (469) ++++++| +++|++||++|+.+....+......-|+ T Consensus 464 ~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:98 464 KVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred hcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 8755554 5567899988764333222211111111 No 67 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=4.1e-56 Score=324.24 Aligned_cols=447 Identities=15% Similarity=0.110 Sum_probs=306.6 Q ss_pred CCH-HHHHHHHHHHH-----------------HHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcce Q lcl|NC_010179. 1 MEL-DALKKLIRNTS-----------------TSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNR 62 (469) Q Consensus 1 ~~~-~~~~~~i~~~~-----------------~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~r 62 (469) |.+ +-|+.+|++.. .--.+++.+|+.+.+||+|+|.-...... .+..+.+++ T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~----------~~~~~~~~~ 70 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNT----------DGETKKRDL 70 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccC----------CCCcccCce Confidence 432 22333333211 11256778999999999998643321111 112234557 Q ss_pred eccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEcc Q lcl|NC_010179. 63 IPSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQP 141 (469) Q Consensus 63 i~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 141 (469) +++|+++.|++..|+|+||+||+++++++..++.|++++++| |...+.+++..++..|.+++.+|+|. 
++++|.+++| T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~-~~~~I~~v~a 149 (500) T protein:vir:30 71 NHLPIARTAAKKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDG-DKVRVAFVQA 149 (500) T ss_pred eecchHHHHHHHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcC Confidence 889999999999999999999999999999999999999886 66678899999999999999999985 6899999999 Q ss_pred ceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEc-----CCeEEEEEeecCceeeccccccccccccccccccc Q lcl|NC_010179. 142 DQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWT-----DKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETG 216 (469) Q Consensus 142 ~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (469) ++++|+..++......+++..+......+..+++++|+|+ ...|.+....+.....+ ....+....+... T Consensus 150 d~~~P~~~d~~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~l-----G~~v~l~~~~~~l 224 (500) T protein:vir:30 150 PVFLPLQSNTQDVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKV-----GSRVPLSEVYKDL 224 (500) T ss_pred CeeEEEEEcCCCeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEeccccccc-----CcccccccccCCc Confidence 9999986555443333333333333334556788899886 22333322222111111 1111111112233 Q ss_pred ccccccccCCcccEEEecCC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC--------c Q lcl|NC_010179. 
217 QSNTLKHNFGRVPFIEFPKN---------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG--------G 279 (469) Q Consensus 217 ~~~~~~~~~g~vPvv~~~n~---------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~--------~ 279 (469) +......++.+.|+++|+++ +.|.|+|++++++||++|.++|+++++++....++.+...+- + T Consensus 225 ~~~~~~~~~~~p~f~~~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g 304 (500) T protein:vir:30 225 KDEAKVTDVTRPIFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDG 304 (500) T ss_pred CcceEeccCCCccEEEecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCc Confidence 34444567788888888642 458999999999999999999999999999888887744321 1 Q ss_pred ccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--CccccCCccHHHHHHHH Q lcl|NC_010179. 280 ASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFESSNASGVAIKMLY 357 (469) Q Consensus 280 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g~~Sg~Al~~~~ 357 (469) ........+.. .........+.+.+..++.++++++.+++.+.++.+.+.|...+..+.. ++.+.|..||.++++.. T Consensus 305 ~~~~~~~~d~~-~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~ 383 (500) T protein:vir:30 305 DVVPRPRFESD-QNVYIRMGGRDLDSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSEN 383 (500) T ss_pred cccCCcccCCC-cceEEEcCCCCCcCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHH Confidence 11111111111 1222222223334457889999999999999999999988776665543 33344667999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHH Q lcl|NC_010179. 358 SHLELKAAKTQTYFEHAINELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAK 429 (469) Q Consensus 358 ~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~------~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~ 429 (469) +.+.++++++++.|+.+|++++++|+.+... 
.-....+++|.|+++++.|..++++..+++ +|+||.+++++ T Consensus 384 ~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~ 463 (500) T protein:vir:30 384 SDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQ 463 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHH Confidence 9999999999999999999999999876432 112345689999999999999999988876 78999999998 Q ss_pred hCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCC Q lcl|NC_010179. 430 ANPIVDD--WQQELKDLAKDREENDPYANQADELNGK 464 (469) Q Consensus 430 ~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~ 464 (469) ++++++| +++|++||++|+.+....+......-|+ T Consensus 464 ~~~g~~eeea~~~l~~i~~E~~~~~~~~~~~~~~~g~ 500 (500) T protein:vir:30 464 KVLNVTEEKAQEIAAEINTGIVDEINQQRTDTHLYGE 500 (500) T ss_pred hcCCCCHHHHHHHHHHHHHhccccCCCCCccccccCC Confidence 8755554 5567899988764333222211111111 No 68 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=4.7e-53 Score=307.46 Aligned_cols=452 Identities=15% Similarity=0.091 Sum_probs=308.1 Q ss_pred CCH-HHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcce Q lcl|NC_010179. 1 MEL-DALKKLIRNTST-----------------SRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNR 62 (469) Q Consensus 1 ~~~-~~~~~~i~~~~~-----------------~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~r 62 (469) |.+ .-++.+|+++.. .+.+++.+|.+++.||+|+++....... 
......+++ T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~----------~~~~~~~~~ 70 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNT----------DGDIKSRPM 70 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCccccccccc----------Ccchhcccc Confidence 554 334444443331 1567889999999999998643221110 111123457 Q ss_pred eccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEcc Q lcl|NC_010179. 63 IPSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQP 141 (469) Q Consensus 63 i~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p 141 (469) +++|+++.|++..|+++||+|++++++++..++.|++++++| |...+.+.+..++..|.+++.+|++ .+++++.+++| T Consensus 71 ~slnl~~~i~~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d-~~~~~i~~v~a 149 (522) T protein:vir:47 71 NHLPIARTASKKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYID-GDKVRVAFIQA 149 (522) T ss_pred eecchHHHHHHHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEc-CCceEEEEEcC Confidence 889999999999999999999999999999999999999876 5556789999999999999999997 57899999999 Q ss_pred ceeEEEEeCCCCCceEEEEEEEEe-eecCCceEEEEEEEEcC----------------CeEEEEEeecCceeeccccccc Q lcl|NC_010179. 142 DQITPVYATTLDNKLLGVLRSYKQ-LDPEAGKYFTVHEYWTD----------------KEAQFFRTSATDSTVIEPYNII 204 (469) Q Consensus 142 ~~~~~~~d~~~~~~~~~~v~~~~~-~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~ 204 (469) +.++|+..++.. ...+++.+... .......+++.+++|.- ..|.+....+.....+ . 
T Consensus 150 d~~~P~~~~~~~-~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~l-----G 223 (522) T protein:vir:47 150 PVFFPLESNTQD-VSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVL-----G 223 (522) T ss_pred CceEEEEEcCCc-eEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCccc-----C Confidence 999998443332 33333322222 23334456677777641 2222211111111111 0 Q ss_pred ccccccc--cccccccccccccCCcccEEEecCC---------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_010179. 205 TSYDLSA--GYETGQSNTLKHNFGRVPFIEFPKN---------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (469) Q Consensus 205 ~~~~~~~--~~~~~~~~~~~~~~g~vPvv~~~n~---------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~ 273 (469) ...+... .+....+...-.++.+.++++|+++ +.|.|+|+++++++|++|.++|+++++++....+++| T Consensus 224 ~~v~l~~~~e~~~l~~~~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v 303 (522) T protein:vir:47 224 QRVNLSELDKYKNLEPVTVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIV 303 (522) T ss_pred ccccccccccccCCCCceEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeec Confidence 1111111 1222233344467788888998753 4699999999999999999999999999999998888 Q ss_pred EecCC----cccchh---hhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCC--cCccc Q lcl|NC_010179. 274 LTNYG----GASLKQ---FMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGID--PANFE 344 (469) Q Consensus 274 ~~g~~----~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~ 344 (469) ...+- ....+. .........+...-..+.+++.+++.++++++.+.+.+.++.+.+.|...+.... 
+++.+ T Consensus 304 ~~~~l~~~~~~~~g~~~~~~~fd~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~ 383 (522) T protein:vir:47 304 PEHLTQRQYQRPDGTIDFRPRFDVEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDG 383 (522) T ss_pred chHHhccCCCCCCcccccccccCcccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccc Confidence 43321 111111 0000011222222222233455789999999999999999999988877665543 23333 Q ss_pred cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cCCCcccceEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_010179. 345 SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV 418 (469) Q Consensus 345 ~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~------~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl 418 (469) .|..||.++++..+.+.++++++++.|+.+|+++++.|+.+... ...+..+++|.|+++++.|..+.++..+++ T Consensus 384 ~~~kTAtEi~s~~~~~~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~ 463 (522) T protein:vir:47 384 QGMKTATEIVSENSDTYQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKM 463 (522) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHH Confidence 45579999999999999999999999999999999999977543 222456799999999999999999988875 Q ss_pred --hccCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhh---cccCCCCCCCCC Q lcl|NC_010179. 419 --ANYSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQ---ADELNGKGVDDE 469 (469) Q Consensus 419 --~g~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~---~~~~~~~~~~de 469 (469) +|+||++++++++++++| +++|++||++|+.+..+.... 
..+...+..|+| T Consensus 464 v~aG~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~~ 521 (522) T protein:vir:47 464 VAAGFSTKKRAIGKTLNISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADDK 521 (522) T ss_pred HhcCCCCHHHHHHhcCCCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCCC Confidence 699999999998876665 568999999998765443211 111222233333 No 69 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=1.5e-50 Score=293.79 Aligned_cols=460 Identities=11% Similarity=0.003 Sum_probs=303.4 Q ss_pred CCHH-HHHHHHHHHHHHH--HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELD-ALKKLIRNTSTSR--NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~-~~~~~i~~~~~~~--~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) |.|= .+++.|+.+..-. ..++.+++.+.++|.++..-..+. .+....+..+........++++|+++.|++..|+ T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~w~~~~~~~~~~~~~~~~l~~~i~~~~A~ 78 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNGSEPELIPKYLPLVPDNQKEWSKD--SYLTSLWAQGYVPTVHDKLMNSGTGNEIVVVAAE 78 (518) T ss_pred CcchhhHHHHHHHhhcCCCCccchhccHHHhhhcccchhhhhhh--hhhhhhcccCCCCccccccccCChHHHHHHHHHH Confidence 6553 4566776665332 235566777777787764321111 1111122333334455678999999999999999 Q ss_pred hhhcCCeeecc------CchhhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeC Q lcl|NC_010179. 
78 YIASVFPDIDV------GKDADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYAT 150 (469) Q Consensus 78 ~l~g~p~~~~~------~~~~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~ 150 (469) |+||+|+++++ +++.+++.|++++++| |...+.+.+..++..|.+++.+|++ +|++++.+++|+.++|++++ T Consensus 79 ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d-~~~~~i~~v~ad~~~P~~~~ 157 (518) T protein:vir:78 79 YISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINIL-NGRPSISVHSSSQFWIDFKN 157 (518) T ss_pred hhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEE-CCeeEEEEEcCCeeEEEeec Confidence 99999999875 4566788999999776 5667889999999999999999987 48899999999999999987 Q ss_pred CCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecC--------------ceeeccccccccccccccccccc Q lcl|NC_010179. 151 TLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSAT--------------DSTVIEPYNIITSYDLSAGYETG 216 (469) Q Consensus 151 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~ 216 (469) +. +..++.+......+...+++.+++|..+.+.+.....+ ...................++.. T Consensus 158 g~---~~~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~ 234 (518) T protein:vir:78 158 NE---PFRFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDI 234 (518) T ss_pred Cc---EEEEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccccccccccccccC Confidence 54 44444443333333445677888875443322211111 10000000000001111111222 Q ss_pred ccccccccCCcccEEEecCC----------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc-----cc Q lcl|NC_010179. 217 QSNTLKHNFGRVPFIEFPKN----------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG-----AS 281 (469) Q Consensus 217 ~~~~~~~~~g~vPvv~~~n~----------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~-----~~ 281 (469) .....-....+.|+++|.+| +.|.|+|++++++||++|.++|+++++++....++.|...+-. .. 
T Consensus 235 ~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~ 314 (518) T protein:vir:78 235 QLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKKVNKST 314 (518) T ss_pred ccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccCCCCCC Confidence 22222222345666666322 3499999999999999999999999999998888887543311 00 Q ss_pred c---hhhhhhhhhcceeeecccCC-CCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc-ccCCccHHHHHHH Q lcl|NC_010179. 282 L---KQFMNDLREYKSIKINNAGN-GDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF-ESSNASGVAIKML 356 (469) Q Consensus 282 ~---~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~g~~Sg~Al~~~ 356 (469) . ..+..+.+.+..+....++. +....++.++++++.+++.+.++.+.+.|...++.+...+. +.+..||.++++. T Consensus 315 ~~~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~~TATei~s~ 394 (518) T protein:vir:78 315 DKEEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNREVKATEIWSL 394 (518) T ss_pred CccccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCcccccccHHHHHHH Confidence 0 01111112222332222211 11224788899999999999999999999888877654332 2346899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--------CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHH Q lcl|NC_010179. 
357 YSHLELKAAKTQTYFEHAINELVRAIMRYLNFS--------DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEA 426 (469) Q Consensus 357 ~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~--------~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et 426 (469) .+.+.+++++++..++.+|+++++.++.+++.- ..+...++|.|++.++.|..+.+++++++ +|+||+++ T Consensus 395 ~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~~~v~aGimS~e~ 474 (518) T protein:vir:78 395 QDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLNNMNSALAMSVEE 474 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHH Confidence 999999999999999999999999999876532 12345789999999999999999999874 79999999 Q ss_pred HHHh-CCCCCC--HHHHHHHHHHHHHHhhhh-HhhcccCCCCCC Q lcl|NC_010179. 427 VAKA-NPIVDD--WQQELKDLAKDREENDPY-ANQADELNGKGV 466 (469) Q Consensus 427 ~~~~-l~~v~d--~~~E~eri~~E~~~~~~~-~~~~~~~~~~~~ 466 (469) ++++ .|.++| +++|++||++|+.+..+. ......++.+|+ T Consensus 475 ~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~~~g~~~~~g 518 (518) T protein:vir:78 475 KVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEAIGGMETKGG 518 (518) T ss_pred HHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCccccCCCCCCC Confidence 9986 455555 568999999998764321 112222222222 No 70 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=6.1e-50 Score=290.38 Aligned_cols=456 Identities=14% Similarity=0.108 Sum_probs=309.0 Q ss_pred CCHH-HHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcce Q lcl|NC_010179. 1 MELD-ALKKLIRNTSTS-----------------RNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNR 62 (469) Q Consensus 1 ~~~~-~~~~~i~~~~~~-----------------~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~r 62 (469) |.+- -|+.+++++..+ -.....++.++.+||+|+++-.... 
......+..++ T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~----------~~~~~~~~~~~ 70 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYI----------NSQGKIQERDY 70 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccc----------cccccccccce Confidence 5543 334444333211 1445678899999999998532211 11122234457 Q ss_pred eccchHHHHHHHHHHhhhcCCeeeccCch-----------hhHHHHHHHHhcc-HHHHHHHHHHHHHhCCeEEEEEEEcC Q lcl|NC_010179. 63 IPSNFYQLLVDQEAGYIASVFPDIDVGKD-----------ADNKKILDVLGDD-RALTLNSLLVDSSNAGRAWLHYWIDE 130 (469) Q Consensus 63 i~~n~~k~iv~~~~~~l~g~p~~~~~~~~-----------~~~~~l~~~~~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~ 130 (469) +++|+++.|+...|+++|++++++++++. ..++.|++++++| +...+.+.+..++..|.+++.+|+|. T Consensus 71 ~sl~~~~~i~~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~ 150 (517) T protein:vir:98 71 MTLNLRKLSADVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDN 150 (517) T ss_pred eecCcHHHHHHHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeC Confidence 89999999999999999999999987653 3578899999887 55677899999999999999999985 Q ss_pred CCceEEEEEccceeEEEEeCCCCCceEEEEEE-EEeeecCCceEEEEEEEEcCCeEEE----EEeecCceeecccccccc Q lcl|NC_010179. 131 DNNFRYGIIQPDQITPVYATTLDNKLLGVLRS-YKQLDPEAGKYFTVHEYWTDKEAQF----FRTSATDSTVIEPYNIIT 205 (469) Q Consensus 131 ~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~-~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 205 (469) +.++|.+++|+.++|+-.+. .+...+++.+ ......++..+++.+|+|..+...+ |.....-+.......... 
T Consensus 151 -~~~~I~~v~ad~~~Pl~~~~-~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~ 228 (517) T protein:vir:98 151 -GEIEFSWALANAFYPLRSNS-NGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGK 228 (517) T ss_pred -CeeEEEEEcCCeeEEEEecC-CCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccc Confidence 68999999999999963332 3344444433 2333444566788899987654321 111100000000111111 Q ss_pred cccccccccccccccccccCCcccEEEecC---------CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEec Q lcl|NC_010179. 206 SYDLSAGYETGQSNTLKHNFGRVPFIEFPK---------NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTN 276 (469) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n---------~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g 276 (469) ..+....+....+...-.++.+.++++|++ .+.|.|+|+++++++|++|.++|+++++++....++.+... T Consensus 229 ~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~ 308 (517) T protein:vir:98 229 RIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDV 308 (517) T ss_pred cccccccccCCCcceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChh Confidence 111111122233334445667777778765 25699999999999999999999999999999998887544 Q ss_pred CCcccc--hhhhhh--h-hhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCc--cccCCcc Q lcl|NC_010179. 277 YGGASL--KQFMND--L-REYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPAN--FESSNAS 349 (469) Q Consensus 277 ~~~~~~--~~~~~~--~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~--~~~g~~S 349 (469) +-..+. +..... . 
....+.....++ .+++.++.++++++.+++.+.++.+.+.|...++.+...+ .+.|..| T Consensus 309 ~l~~~~~~~g~~~~~~~d~~~~~y~~~~~~-~~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kT 387 (517) T protein:vir:98 309 MLRTVPDESGMPPPQVFDPDVNVYKSIRMG-TDEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKT 387 (517) T ss_pred hhccccCCCCcccCCCCCcccceeeeccCC-CCCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccccccc Confidence 321111 111100 0 112222222222 2345678888999999999999999999999888876433 3334568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hcc Q lcl|NC_010179. 350 GVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANY 421 (469) Q Consensus 350 g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~------~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~ 421 (469) |.+++...+.+.++++++++.|+.+|++++++|+.+... ......+++|.|.+.++.|..+.+++.+++ +|+ T Consensus 388 ATEi~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ 467 (517) T protein:vir:98 388 ATEIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGF 467 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCC Confidence 999999999999999999999999999999999865432 112346789999999999999999988875 789 Q ss_pred CChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
422 SSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 422 iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ||.++++.++..+++ +++|+.||++|+.+.++........+.-.+|+| T Consensus 468 ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 468 IPTVEAIQRIFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGDEE 517 (517) T ss_pred CCHHHHHHHhCCCChHHHHHHHHHHHHhccccCCCCccccccCCCCCCCC Confidence 999999988755554 567899999999877765433333233333344 No 71 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=1.1e-45 Score=266.98 Aligned_cols=447 Identities=11% Similarity=0.101 Sum_probs=291.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ..+.++ + ...-++|+.+|+.+.+||.+++.-+.-. ...+.. .. -.++.++..++|+.....|+. T Consensus 19 ~~~p~~---v---~~~d~~Rl~aY~l~~~~y~n~~~~~~~~------lrg~~~--~~--~r~~~~ps~~~~~~~~~~~~~ 82 (527) T protein:vir:10 19 ANFPNA---V---TDFDKARLASYRLYEDMYLTNTSDYQVI------LRGGDE--GD--QRPIYVPNGEKLIEAKMRFLG 82 (527) T ss_pred ccCccc---C---CHHHHHHHHHHHHHHHHhcCchhheeee------cCCccc--cc--cceeeehhhHHhhCCcceeec Confidence 222222 2 2334678999999999999986322110 000000 00 113556666666665554442 Q ss_pred -cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 81 -SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 81 -g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~~p~~~~~~~d~~~~~ 154 (469) |-....+..++..++.|++|++.+.. ..+.+..+++.+.|+++.++-+|++ ++++++.+||.+.||+.|+.... 
T Consensus 83 ~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~ 162 (527) T protein:vir:10 83 QGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRYPG 162 (527) T ss_pred cCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCCCC Confidence 21122233456778889888876655 4577899999999998888887753 47999999999999998887655 Q ss_pred ceEEE--EEEEEeeecCCceEE--E--E--EEEEc-----CCeEEEEEe---ecCceeeccccccccccccccccccccc Q lcl|NC_010179. 155 KLLGV--LRSYKQLDPEAGKYF--T--V--HEYWT-----DKEAQFFRT---SATDSTVIEPYNIITSYDLSAGYETGQS 218 (469) Q Consensus 155 ~~~~~--v~~~~~~~~~~~~~~--~--~--~~~~~-----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (469) .+..+ +.-|+..+.....++ + . .++-+ ..+..+|.- +.+++.-. ...-...-.........+. T Consensus 163 ~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~-~e~p~~~~~~~~~~~~~~l 241 (527) T protein:vir:10 163 QVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDR-PESPLEPDDIKKLSTLTEE 241 (527) T ss_pred ceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccc-cccccchhhhhhhcCceee Confidence 55544 333555444433222 0 0 01100 011111110 01111000 0000000011112334456 Q ss_pred ccccccCCcccEEEecCCc-----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh--hhh Q lcl|NC_010179. 219 NTLKHNFGRVPFIEFPKNK-----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND--LRE 291 (469) Q Consensus 219 ~~~~~~~g~vPvv~~~n~~-----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~--~~~ 291 (469) ...+++++.||||+|+|-+ -|+|+++++++++|++|.++|+.+.++++.+.|+.++.|...-+....... +.. 
T Consensus 242 ~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgP 321 (527) T protein:vir:10 242 EPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISP 321 (527) T ss_pred ecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCC Confidence 6789999999999997643 499999999999999999999999999999999999999754432222222 223 Q ss_pred cceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc--cc-CCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 292 YKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF--ES-SNASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~-g~~Sg~Al~~~~~~l~~k~~~~~ 368 (469) ..+..++ +++++..+.....++.++.+++.|.+.|+..|++|..... +. +++||.||+..+++|.+++.+++ T Consensus 322 G~iweL~-----e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~ 396 (527) T protein:vir:10 322 LGMVEHG-----QNNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQE 396 (527) T ss_pred ceeEecC-----CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHH Confidence 3333333 3456777766568899999999999999999999998776 33 45899999999999999999999 Q ss_pred HHHHHHHHHHHH-HHHHH------hcccCC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCC Q lcl|NC_010179. 
369 TYFEHAINELVR-AIMRY------LNFSDA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVD 435 (469) Q Consensus 369 ~~~~~~l~~~~~-~i~~~------~~~~~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~ 435 (469) ..++...+|..+ .+..+ ++..++ ....+.|+|.+++|.|.++.++.++++ +|++|.+||+++| +|++ T Consensus 397 L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGi~S~~tAv~~L~~~~g~e 476 (527) T protein:vir:10 397 LELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWEAGLIPAKKLTEELSKIMGFE 476 (527) T ss_pred HHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCC Confidence 999988887554 33222 223333 345789999999999999999999987 7999999998887 7899 Q ss_pred CHHHHHHHHHHHHHHhhhhH-h-----hcccCCCCCCCC-C Q lcl|NC_010179. 436 DWQQELKDLAKDREENDPYA-N-----QADELNGKGVDD-E 469 (469) Q Consensus 436 d~~~E~eri~~E~~~~~~~~-~-----~~~~~~~~~~~d-e 469 (469) |+++|+++|.+|+++..... + ..+....+|-++ | T Consensus 477 D~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~ 517 (527) T protein:vir:10 477 LTEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEE 517 (527) T ss_pred ChHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCC Confidence 99999999998876543211 1 111111122222 2 No 72 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=1.3e-45 Score=266.59 Aligned_cols=447 Identities=11% Similarity=0.099 Sum_probs=291.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ..+.++ + ...-++|+.+|+.+.+||.+++.-+.-. ...+.. .. -..+.++..++|+.....|+. 
T Consensus 19 ~~~p~~---v---~~~d~~Rl~aY~l~~~~y~n~~~~~~~~------lrg~~~--~~--~r~~~~ps~~~~~~~~~~~~~ 82 (527) T protein:vir:10 19 ANFPNA---V---TDFDKARLASYRLYEDMYLTNTSDYQVI------LRGGDE--GD--QRPIYVPNGEKLIEAKMRFLG 82 (527) T ss_pred ccCccc---C---CHHHHHHHHHHHHHHHHhcCchhheeee------cCCccc--cc--cceeeehhhHHhhCCcceeec Confidence 222222 2 2334678999999999999986322110 000000 00 113556666666665554442 Q ss_pred -cCCeeeccCchhhHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 81 -SVFPDIDVGKDADNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 81 -g~p~~~~~~~~~~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~~p~~~~~~~d~~~~~ 154 (469) |-....+..++..++.|++|++.+.. ..+.+..+++.+.|+++.++-+|++ ++++++.+||.+.||+.|+.... T Consensus 83 ~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~f~~ed~d~~~ 162 (527) T protein:vir:10 83 QGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTYFPYEDPRYPG 162 (527) T ss_pred cCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCcceeeeeecCCCCC Confidence 21112233456778888888876655 4577899999999998888887753 47999999999999998887655 Q ss_pred ceEEE--EEEEEeeecCCceEE--E--E--EEEEc-----CCeEEEEEe---ecCceeeccccccccccccccccccccc Q lcl|NC_010179. 155 KLLGV--LRSYKQLDPEAGKYF--T--V--HEYWT-----DKEAQFFRT---SATDSTVIEPYNIITSYDLSAGYETGQS 218 (469) Q Consensus 155 ~~~~~--v~~~~~~~~~~~~~~--~--~--~~~~~-----~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (469) .+..+ +.-|+..+.....++ + . .++-+ ..+..+|.- +.+++.-. ...-...-.........+. 
T Consensus 163 ~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~-~e~p~~~~~~~~~~~~~~l 241 (527) T protein:vir:10 163 QVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDR-PESPLEPDDIKKLSTLTEE 241 (527) T ss_pred ceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccc-cccccchhhhhhhcCceee Confidence 55544 333555444433222 0 0 01100 011111110 01111000 0000000011112334456 Q ss_pred ccccccCCcccEEEecCCc-----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh--hhh Q lcl|NC_010179. 219 NTLKHNFGRVPFIEFPKNK-----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND--LRE 291 (469) Q Consensus 219 ~~~~~~~g~vPvv~~~n~~-----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~--~~~ 291 (469) ...+++++.||||+|+|-+ -|+|+++++++++|++|.++|+.+.++++.+.|+.+++|...-+....... +.. T Consensus 242 ~~lp~pi~fiPvV~~~t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~~~VgP 321 (527) T protein:vir:10 242 EPLPEQITTLPVFHFRGHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPPRDSRGNMVPWTISP 321 (527) T ss_pred ecccCCCCccceEeecCCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeecccccccccCCcCccccCC Confidence 6789999999999997643 499999999999999999999999999999999999999754432222222 223 Q ss_pred cceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc--cc-CCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 292 YKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF--ES-SNASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~-g~~Sg~Al~~~~~~l~~k~~~~~ 368 (469) ..+..++ +++++..+.....++.++.+++.|.+.|+..|++|..... +. 
+++||.||+..+++|.+++.+++ T Consensus 322 G~iweL~-----e~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~SG~ALeL~L~PLlar~~rk~ 396 (527) T protein:vir:10 322 LGMVEHG-----QNNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAESGIALDLKLSAILSSCAEQE 396 (527) T ss_pred ceeEecC-----CCcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCcHHHHHHHHHHHHHHHHHHHH Confidence 3333333 3456777766568899999999999999999999998776 33 45899999999999999999999 Q ss_pred HHHHHHHHHHHH-HHHHH------hcccCC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCC Q lcl|NC_010179. 369 TYFEHAINELVR-AIMRY------LNFSDA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVD 435 (469) Q Consensus 369 ~~~~~~l~~~~~-~i~~~------~~~~~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~ 435 (469) ..++...+|..+ .+..+ ++..++ ....+.|+|.+++|.|.++.++.++++ +|++|.+||+++| +|++ T Consensus 397 L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~aGiiS~etAv~~L~~~~g~e 476 (527) T protein:vir:10 397 LELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWEAGLIPAKKLTEELSKIMGFE 476 (527) T ss_pred HHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHHcCchhHHHHHHHHHhccCCC Confidence 999988887554 33222 223333 345789999999999999999999987 7999999998887 7899 Q ss_pred CHHHHHHHHHHHHHHhhhhH-h-----hcccCCCCCCCC-C Q lcl|NC_010179. 436 DWQQELKDLAKDREENDPYA-N-----QADELNGKGVDD-E 469 (469) Q Consensus 436 d~~~E~eri~~E~~~~~~~~-~-----~~~~~~~~~~~d-e 469 (469) |+++|+++|.+++++..... 
+ ..+....+|-++ | T Consensus 477 D~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~ 517 (527) T protein:vir:10 477 LTEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEE 517 (527) T ss_pred chHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCCCCCC Confidence 99999999998876543211 1 111111122222 2 No 73 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=3.4e-41 Score=242.43 Aligned_cols=451 Identities=11% Similarity=0.062 Sum_probs=278.2 Q ss_pred CCH---------HHHHHHHHHHH-HHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHH Q lcl|NC_010179. 1 MEL---------DALKKLIRNTS-TSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQL 70 (469) Q Consensus 1 ~~~---------~~~~~~i~~~~-~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~ 70 (469) |-- ..+......+. ...++|+.+|+.+.+||.|+|.-+... .. ...+ .-+..|..++ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~i--l~----G~dr-------~~~~~ps~r~ 67 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLV--LR----GDDS-------VPILMPSGRK 67 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhh--cC----CCce-------eeeccchHHH Confidence 111 11222222222 334679999999999999987422110 00 0001 1144567889 Q ss_pred HHHHHHHhhhcCCeeeccCchh--------hHHHHHHHHhccHH-HHHHHHHHHHHhCCeEEEEEEEcCC----CceEEE Q lcl|NC_010179. 71 LVDQEAGYIASVFPDIDVGKDA--------DNKKILDVLGDDRA-LTLNSLLVDSSNAGRAWLHYWIDED----NNFRYG 137 (469) Q Consensus 71 iv~~~~~~l~g~p~~~~~~~~~--------~~~~l~~~~~~n~~-~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~ 137 (469) +|++.+ +++|.+..|+++..+ .+..|++|.+.+.. ..+.+..+++.+.|+++.++-+|.+ +++++. 
T Consensus 68 ~V~~~~-~~Lg~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~ 146 (563) T protein:vir:74 68 IVEAVH-RFLGVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVD 146 (563) T ss_pred HHHHHH-HhcCCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEe Confidence 999955 555999999664322 34556677766554 4567889999999998888877743 589999 Q ss_pred EEccceeEEEEeCCCCCce--EEEEEEEEeeecCCceEEEEE--E-EEcCCeE--EEEEeecCceeec--cccc-ccccc Q lcl|NC_010179. 138 IIQPDQITPVYATTLDNKL--LGVLRSYKQLDPEAGKYFTVH--E-YWTDKEA--QFFRTSATDSTVI--EPYN-IITSY 207 (469) Q Consensus 138 ~~~p~~~~~~~d~~~~~~~--~~~v~~~~~~~~~~~~~~~~~--~-~~~~~~~--~~~~~~~~~~~~~--~~~~-~~~~~ 207 (469) .+||.+.||.-|++..... +.++.-|...+........+- . .+.+++. .+|.+...-+..- +... ....+ T Consensus 147 ~vDP~~~fp~~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~dae~w~lg~wd~r~~~~~~~ 226 (563) T protein:vir:74 147 EVDPRQIFLIEDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSELTHWTLGNWDDRGAISDEQ 226 (563) T ss_pred ecCCceeeeccCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeeccchhccccccccCccchhh Confidence 9999999996666543221 222234544444333222111 1 1112221 1121111111100 0000 00000 Q ss_pred ---ccc--ccccccccccccccCCcccEEEecCCc-----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC Q lcl|NC_010179. 208 ---DLS--AGYETGQSNTLKHNFGRVPFIEFPKNK-----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY 277 (469) Q Consensus 208 ---~~~--~~~~~~~~~~~~~~~g~vPvv~~~n~~-----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~ 277 (469) ... ......++...|++++.||+|.|+|-+ -|+|++++++++++++|.++|+.+..+.++.+|+.++.|. 
T Consensus 227 ~~~~~~~~~~~~d~e~~~LP~pi~~iPiv~~~tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~~ 306 (563) T protein:vir:74 227 ARRKEQVRSAQHDEEEEELPEPISQLPLYRWRNKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNAS 306 (563) T ss_pred hcccchhhhhhhhchhhhccccccCccEEEcCCCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEeccc Confidence 000 111222556679999999999997643 4899999999999999999999999999999999999875 Q ss_pred Ccccc--hhhh-hhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHH-HHHHHhCCCCcCcc--ccC-CccH Q lcl|NC_010179. 278 GGASL--KQFM-NDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRD-NIFLFGQGIDPANF--ESS-NASG 350 (469) Q Consensus 278 ~~~~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~-~i~~~s~~p~~~~~--~~g-~~Sg 350 (469) .+-+. ++.. ..+....+..+.. +...+....+.-...++.+..|++.|.. .|+..|++|....+ ..| .+|| T Consensus 307 ~p~d~~~g~~~~w~vgpG~i~El~~--~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~vD~~~~~SG 384 (563) T protein:vir:74 307 APVDPNTGELTDWNIGPMQIVEIAG--NRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRVDVTSAESG 384 (563) T ss_pred cccccccccccccccCCceeEeccC--CccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeecccccccccch Confidence 43221 1111 1123344444432 2233445555554567899999988776 88999999998776 444 4799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHh-c------------ccCCCc-ccceEEeCCCCCCCHHHHH Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINE----LVRAIMRYL-N------------FSDADK-RHISQHWTRTKVEDSLTKA 412 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~----~~~~i~~~~-~------------~~~~~~-~~i~i~f~~~~p~d~~e~~ 412 (469) .||+..+.+|.+++++++..+..++++ ++.+.+.++ + ...... 
..++|+|.+.+|.|.++.+ T Consensus 385 iALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~~P~d~~~vv 464 (563) T protein:vir:74 385 ISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADPMPVNKTQVT 464 (563) T ss_pred hhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCCCCccHHHHH Confidence 999999999999999999988888877 344433221 1 111122 3478999999999999999 Q ss_pred HHHHHH--hccCChHHHHHhC---CCC-CCHHHHHHHHHHHHH---------HhhhhHhhcccCCCCCC-----CCC Q lcl|NC_010179. 413 QIVSTV--ANYSSKEAVAKAN---PIV-DDWQQELKDLAKDRE---------ENDPYANQADELNGKGV-----DDE 469 (469) Q Consensus 413 ~~~~kl--~g~iS~et~~~~l---~~v-~d~~~E~eri~~E~~---------~~~~~~~~~~~~~~~~~-----~de 469 (469) +.++.+ +|++|+|||+++| ||. +|++.|+++|+.++= +..+.. .+..+++|- ||+ T Consensus 465 ~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~--~~a~~~~g~~~~~~dd~ 539 (563) T protein:vir:74 465 QDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLG--LSAMDNGGAGEQQFDDQ 539 (563) T ss_pred HHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCccc--ceecccCCCCccccccc Confidence 998876 7999999998887 663 477777777654321 112221 222222222 232 No 74 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.96 E-value=4e-28 Score=170.82 Aligned_cols=431 Identities=11% Similarity=0.027 Sum_probs=263.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccc-cchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRN-NGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~-~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |++....... ....++.+.+.+-|.|...+-... 
.+.++........-..+...-+-.|+++.+++..++++ T Consensus 1 m~V~~~hp~y-------~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n~~~~t~~~~~G~v 73 (452) T protein:vir:94 1 MPIETKHPEY-------LAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYSITSKTLSALSGMV 73 (452) T ss_pred CCCCCcCHHH-------HHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCchHHHHHHHHhchh Confidence 7765433333 334566777888887764321110 00111100000000001011133699999999999999 Q ss_pred hcCCeeeccCchhhHHHHHHHHhccHHHH-HHHHHHHHHhCCeEEEEEEEcCCC-ceEEEEEccceeEEEEeCCCCCceE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILDVLGDDRALT-LNSLLVDSSNAGRAWLHYWIDEDN-NFRYGIIQPDQITPVYATTLDNKLL 157 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~-~~~i~~~~p~~~~~~~d~~~~~~~~ 157 (469) |.+||+++.++.- ..+..=..-+.++. ...+...++.+|+++++|-++..| +|.+..++|.+++ =|+......+. T Consensus 74 f~k~p~~~~p~~l--~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy~~~~~~~~Ii-~W~~~~~g~l~ 150 (452) T protein:vir:94 74 LDQPPVITHPDAM--SKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPYISVYTTENIL-NWEEDEDGRLL 150 (452) T ss_pred hcCCceecccHHH--HHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceEEEEechhhhc-CccccccCCee Confidence 9999999765322 22211112234444 557788999999999999887665 7999999999976 36544445554 Q ss_pred EEEEEEEee--ecC---CceEEEEEEEEc--CCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_010179. 158 GVLRSYKQL--DPE---AGKYFTVHEYWT--DKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF 230 (469) Q Consensus 158 ~~v~~~~~~--~~~---~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 230 (469) .++-..... +.. +.+....+.+++ ++.....+++..+.... . 
......++...|+|+.||+ T Consensus 151 ~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~---------~---~~~~~~~~~~~~~l~~IP~ 218 (452) T protein:vir:94 151 MVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVW---------E---LAKTSTIQNVGVTMDYIPF 218 (452) T ss_pred EEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCcee---------e---eccceeecCCCcccceeEE Confidence 443322222 111 122222222332 33222222221111110 0 0122344556789999999 Q ss_pred EEecCCc----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCC Q lcl|NC_010179. 231 IEFPKNK----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKS 306 (469) Q Consensus 231 v~~~n~~----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (469) |.+.... .+.|-+.++..+.-++.+..|++.+++...++|++++.|.+... ...+....++.++. .++ T Consensus 219 v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~~~~----~i~iG~~~~~~lpe----~~~ 290 (452) T protein:vir:94 219 FCITPSGLSMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPWITGAESQS----TMHIGSTKAWVIPE----VAA 290 (452) T ss_pred EEEcCCCCCCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeEeecCcCCC----ceEecccccccCCC----CCC Confidence 9987543 25667889999999999999999999999999999999964322 12334445555553 235 Q ss_pred cceEEeecCC-HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 307 GVDKLQIDIP-VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRY 385 (469) Q Consensus 307 ~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~ 385 (469) +++|++++.+ .++.+..++.++++|...+.-. 
+.....++.||+|.........+........++.++++++++++.+ T Consensus 291 ~~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~l-l~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w 369 (452) T protein:vir:94 291 KVGFLEFTGQGLQSLEKALSEKQAQLASLSARL-IDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDM 369 (452) T ss_pred cceEEccCchhHHHHHHHHHHHHHHHHHHHHHh-hccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6899998875 5788999999999998866421 1222345678888777666666677777788889999999999999 Q ss_pred hcccCCCcccceEEeCCC--CCCCHHHHHHHHHHH--hccCChHHHHHhC--CCCCCHHHHHHHHHHHHHHhhhhHhhcc Q lcl|NC_010179. 386 LNFSDADKRHISQHWTRT--KVEDSLTKAQIVSTV--ANYSSKEAVAKAN--PIVDDWQQELKDLAKDREENDPYANQAD 459 (469) Q Consensus 386 ~~~~~~~~~~i~i~f~~~--~p~d~~e~~~~~~kl--~g~iS~et~~~~l--~~v~d~~~E~eri~~E~~~~~~~~~~~~ 459 (469) ++... +..|..+.. .+.-..+.++++.++ +|.+|++|++..| ..|-|+++|.+++..|.++..+. ..+ T Consensus 370 ~g~~~----~~~v~~n~dF~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~~~gvl~~~~e~~~i~~E~~~~~~~--~~~ 443 (452) T protein:vir:94 370 ESMGG----TLNIKLNSAFLDSKLTAAELKAWVEAYLSGGISKEIYIHALKVGKVLPPPGESMGVIPDPPAPEPS--PSN 443 (452) T ss_pred cCCCC----ceEEEeccccccccCCHHHHHHHHHHHhcCCCcHHHHHHHHHhCCCCCCccCHHHHHHHhhccCcc--cCC Confidence 88632 233332222 223345666766664 7899999999887 34667888889999886654432 233 Q ss_pred cCCCCCCCC Q lcl|NC_010179. 460 ELNGKGVDD 468 (469) Q Consensus 460 ~~~~~~~~d 468 (469) ..++++..- T Consensus 444 ~~~~~~~~~ 452 (452) T protein:vir:94 444 TPPNPSSKA 452 (452) T ss_pred CCCCCccCC Confidence 333333333 No 75 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.94 E-value=3.6e-26 Score=160.07 Aligned_cols=439 Identities=11% Similarity=0.050 Sum_probs=267.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc-ccchh---hhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTR-NNGKP---KVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~-~~~~~---~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) |.-.+++. ++.--..+....++.+.+.+-|.|...+-.. ..+.+ ............ .-+-.|+++.+++..+ T Consensus 1 m~~~~~~~-v~~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~---rA~~~n~~~~tl~~l~ 76 (513) T protein:vir:97 1 MADKDPKS-PATTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLA---SAVLLNMVEQTLDTLS 76 (513) T ss_pred CCCCCCCC-CCcCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHh---cccCCChHHHHHHHHh Confidence 66555443 2222233445667788888888886432110 00000 000001111111 1244699999999999 Q ss_pred HhhhcCCeeeccCchhhH-H-HHHHHHhc-cHHHH-HHHHHHHHHhCCeEEEEEEEcCCC------------------ce Q lcl|NC_010179. 77 GYIASVFPDIDVGKDADN-K-KILDVLGD-DRALT-LNSLLVDSSNAGRAWLHYWIDEDN------------------NF 134 (469) Q Consensus 77 ~~l~g~p~~~~~~~~~~~-~-~l~~~~~~-n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~------------------~~ 134 (469) +++|.+||+++.+..... + .+.++-.. +.++. +..+...++.+|+++++|-++..+ +| T Consensus 77 G~vf~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rP 156 (513) T protein:vir:97 77 GKPFSEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLRP 156 (513) T ss_pred hhhhhcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccCc Confidence 999999998864322221 1 12233212 34444 457788899999999999765432 36 Q ss_pred EEEEEccceeEEEEeCCC-C--CceEEEEEEEEeeecCC--ceEEEEEEEEcCCeEEEEEeecCceeecccccccccccc Q lcl|NC_010179. 135 RYGIIQPDQITPVYATTL-D--NKLLGVLRSYKQLDPEA--GKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDL 209 (469) Q Consensus 135 ~i~~~~p~~~~~~~d~~~-~--~~~~~~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (469) .+..++|.+++= |+... . ..+..++..-...+.++ .+.+..+-+++++.+..|+........ 
T Consensus 157 y~~~~~~e~Iin-W~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~------------ 223 (513) T protein:vir:97 157 YWVMIKPECLLF-ARSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQ------------ 223 (513) T ss_pred eEEEecHhhhcC-cceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCcc------------ Confidence 788888888653 33211 1 12333322222223343 344455556677666655543322111 Q ss_pred cccccccccccccccCCcccEEEecCCc----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh Q lcl|NC_010179. 210 SAGYETGQSNTLKHNFGRVPFIEFPKNK----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF 285 (469) Q Consensus 210 ~~~~~~~~~~~~~~~~g~vPvv~~~n~~----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~ 285 (469) ...+..+....|+|+.||||.+.... .+.+-|.++..|..++-...|++..++...++|++++.|.+....+ T Consensus 224 --~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~~~~~~~-- 299 (513) T protein:vir:97 224 --KEEWALADEWATGLNYVPLVTFYADRQGFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILACSGASGEDSD-- 299 (513) T ss_pred --ccceEEecCCCCcCCceeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceeeeecCCcCCCC-- Confidence 11233344556899999999997543 2556688999999999999999999999999999999997544322 Q ss_pred hhhhhhcceeeecccCCCCCCcceEEeecCC-HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHH Q lcl|NC_010179. 286 MNDLREYKSIKINNAGNGDKSGVDKLQIDIP-VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKA 364 (469) Q Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~ 364 (469) ...+..++++.++. +++++.|++++.+ .++.+..++.++++|...+..+ -....++.||+|.+.......+.. 
T Consensus 300 ~i~iG~~~~~~lpe----~~~~~~yie~~g~~i~~~~~~l~~le~qm~~~Ga~l--l~~~~~~~Ta~a~~~~~~~~~S~L 373 (513) T protein:vir:97 300 PVVVGPNKVLYNPD----PAGRFYYVEHTGQAIAAGRTDLKDLEEQMAGYGAEF--LKRKTGGQTATARALDSAEATSDL 373 (513) T ss_pred ceEeeccccccCCC----CCCcceeeccCchhHHHHHHHHHHHHHHHHHHHHHh--hccCCccccHHHHHHHHHHHHHHH Confidence 12233444555542 3456899998875 5778999999999998876543 222346789999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCC-CC-HHHHHHHHHHH--hccCChHHHHHhC---CCC--- Q lcl|NC_010179. 365 AKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKV-ED-SLTKAQIVSTV--ANYSSKEAVAKAN---PIV--- 434 (469) Q Consensus 365 ~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p-~d-~~e~~~~~~kl--~g~iS~et~~~~l---~~v--- 434 (469) ......++.++++++++++.+++... + ..+|..++... .. ..+.++++.++ +|.+|++|.++.| +.+ T Consensus 374 ~~~a~~le~al~~~l~~~a~wlg~~~-~--~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~~~L~r~gvl~~d 450 (513) T protein:vir:97 374 SAMTGLFEDALAQALDITADWLRLGP-N--GGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKTYLNGLRLRGVLPED 450 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCC-C--ccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHHHHHHHHhccCCCcc Confidence 99999999999999999999998632 1 23333333222 22 34566666664 7899999998765 222 Q ss_pred CCHHHHHHHHHHHHHHhhhh----HhhcccCCC--CCCCCC Q lcl|NC_010179. 435 DDWQQELKDLAKDREENDPY----ANQADELNG--KGVDDE 469 (469) Q Consensus 435 ~d~~~E~eri~~E~~~~~~~----~~~~~~~~~--~~~~de 469 (469) .|.+++.++++++.++..-. 
......+++ ++.++| T Consensus 451 ~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 491 (513) T protein:vir:97 451 FDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGE 491 (513) T ss_pred CCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCC Confidence 13456677777664333111 111111111 122222 No 76 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.93 E-value=1.8e-24 Score=150.72 Aligned_cols=444 Identities=11% Similarity=0.017 Sum_probs=252.2 Q ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccc-cchhhhc---ccc--cccccccCcceeccchHHHHHH Q lcl|NC_010179. 1 ME-LDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRN-NGKPKVS---KEG--KKDPLRSADNRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~-~~~~~~~---~~~--~~~~~~~~~~ri~~n~~k~iv~ 73 (469) |. +.... ..+....++.+.+.+-+.|...+..+. .+.++.. ... ...-..+...-+-.|+++.+++ T Consensus 1 m~~V~~~h-------p~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t~~ 73 (501) T protein:vir:95 1 MPNVSFIR-------PELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRTLF 73 (501) T ss_pred CCCCCCCC-------HHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHHHH Confidence 44 11111 123445566777888888865321110 0010000 000 0000001111234699999999 Q ss_pred HHHHhhhcCCeeeccCchhhHHHHHHHHhc-----cHHHH-HHHHHHHHHhCCeEEEEEEEcCCC--------------- Q lcl|NC_010179. 74 QEAGYIASVFPDIDVGKDADNKKILDVLGD-----DRALT-LNSLLVDSSNAGRAWLHYWIDEDN--------------- 132 (469) Q Consensus 74 ~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~-----n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~--------------- 132 (469) ..++++|.+||++..+ ..+..++.+ +.++. 
+..+...++.+|+++++|-++..+ T Consensus 74 ~l~G~vf~k~p~~~~p-----~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~t~a~~~~~~~ 148 (501) T protein:vir:95 74 GLVGQVFMRDPVVKVP-----ALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAEGGASIADLEAGRI 148 (501) T ss_pred HHhhhhhcCCcceeCc-----HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccccHHHHHhccC Confidence 9999999999998643 334444432 34444 557788999999999999765432 Q ss_pred ceEEEEEccceeEEEEeCC-C--CCceEEEEEEEEeeecCC---ceEEEEEEEEc--CCeEEEEE-eecCceeecccccc Q lcl|NC_010179. 133 NFRYGIIQPDQITPVYATT-L--DNKLLGVLRSYKQLDPEA---GKYFTVHEYWT--DKEAQFFR-TSATDSTVIEPYNI 203 (469) Q Consensus 133 ~~~i~~~~p~~~~~~~d~~-~--~~~~~~~v~~~~~~~~~~---~~~~~~~~~~~--~~~~~~~~-~~~~~~~~~~~~~~ 203 (469) +|.+..++|.+++= |+.. + ...+..++..-.....++ ......+-+.+ .++.+.++ +........ .... T Consensus 149 rPy~~~~~~~~Iin-W~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~-~~~~ 226 (501) T protein:vir:95 149 RPTLYVYSPTEIIN-WRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKA-DGSK 226 (501) T ss_pred CcEEEEecHhhhcC-cceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCccc-Ccce Confidence 37788888887643 3321 1 113333322211122222 12222222221 22222222 111111100 0000 Q ss_pred cccccccccccccccccccccCCcccEEEecCCcc----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc Q lcl|NC_010179. 204 ITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY----RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG 279 (469) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~----g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~ 279 (469) ...........+...+...|+|+.||+|.+..... +.+-+.++..+.-+.-...|+..+.+...++|+++++|.+. 
T Consensus 227 ~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~ 306 (501) T protein:vir:95 227 IPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSENNDSNPDNPNFYDLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTE 306 (501) T ss_pred ecCCcccccceeeeeccCCCcCCeeeEEEEecCCCCCCCCccchHHHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCcc Confidence 00111111112233344468999999998854322 35557788888778888889999999999999999999765 Q ss_pred ccc---hhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHH Q lcl|NC_010179. 280 ASL---KQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKML 356 (469) Q Consensus 280 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~ 356 (469) +.. ......+.....+.++. +++++|++.+.+.- .+..++.+.++|......+ ...+.++.||+|.+.. T Consensus 307 ~~~~~~~~~~i~~G~~~~~~lP~-----~~~~~~ie~~~~~i-~~~~l~~l~~~m~~~Ga~l--l~~~~~~~Ta~~~~~~ 378 (501) T protein:vir:95 307 EWVTNVLKGSVNFGSRGGIPLPV-----GADAKLLQASENTM-LKEAMDTKERQMVALGAKL--VEQKEVQRTATEAELE 378 (501) T ss_pred cccccCCCCceeecccccccCCC-----CCceeEEecChhhH-HHHHHHHHHHHHHHHHHhh--ccCCccchhHHHHHHH Confidence 321 11122223344444443 45789999865443 3677999999998876432 2344567899999888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCC-C-HHHHHHHHHHH--hccCChHHHHHhC- Q lcl|NC_010179. 357 YSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVE-D-SLTKAQIVSTV--ANYSSKEAVAKAN- 431 (469) Q Consensus 357 ~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~-d-~~e~~~~~~kl--~g~iS~et~~~~l- 431 (469) .....+........++.++.+++++++.+++..+. .++|..++..+. . 
..+.++++.++ +|.+|++|++..| T Consensus 379 ~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~~~~---~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~~~L~ 455 (501) T protein:vir:95 379 AASEGSTLSSATKNVSAAFEWALKWAARWVGQADS---GVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMRTGLR 455 (501) T ss_pred HHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCCCC---ceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHHHHHH Confidence 88888888999999999999999999999986532 234444443332 2 45667777665 7899999997665 Q ss_pred --CCCC-CHHHHHHHHHHHHHHhhhhHhh----cccCCCCC-CCCC Q lcl|NC_010179. 432 --PIVD-DWQQELKDLAKDREENDPYANQ----ADELNGKG-VDDE 469 (469) Q Consensus 432 --~~v~-d~~~E~eri~~E~~~~~~~~~~----~~~~~~~~-~~de 469 (469) +.++ +.+.|.++|..|..+.+..... ....++++ .+.| T Consensus 456 ~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 456 KAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred hCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccccccCCC Confidence 4443 4567788888776554333222 22222222 2223 No 77 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.90 E-value=6.2e-23 Score=142.35 Aligned_cols=440 Identities=13% Similarity=0.045 Sum_probs=247.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccc-cccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEG-KKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~-~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) ++-+--..=|+.--..+....++.+.+.+-|.|......+..+........ 
...-..+...-+-.|+++.+++..++++ T Consensus 2 ~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~v 81 (489) T protein:vir:78 2 LTENGQGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSV 81 (489) T ss_pred ccCCCccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHHHHhchh Confidence 221111111111112334556778888888999542111211111100000 0000001111244799999999999999 Q ss_pred hcCCeeeccCchhhHHHHHHHHhc-cHHHH-HHHHHHHHHhCCeEEEEEEEcCCC------------ceEEEEEccceeE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILDVLGD-DRALT-LNSLLVDSSNAGRAWLHYWIDEDN------------NFRYGIIQPDQIT 145 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~-n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~------------~~~i~~~~p~~~~ 145 (469) |.+||++..++. ....+.++-.. +.++. +..+...++.+|+++++|-++..+ +|.+..++|.+++ T Consensus 82 frk~p~~~~p~~-l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~Ii 160 (489) T protein:vir:78 82 MRKEPEINIPKE-LEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENIV 160 (489) T ss_pred hcCCcceeccHH-HHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhhc Confidence 999999865422 22223333222 34444 557888999999999999876654 5788899998865 Q ss_pred EEEeCCC---CCceEEEEEEEEee--ec-C--CceEEEEEEEEcCC--e---EEEEEeecCceeeccccccccccccccc Q lcl|NC_010179. 146 PVYATTL---DNKLLGVLRSYKQL--DP-E--AGKYFTVHEYWTDK--E---AQFFRTSATDSTVIEPYNIITSYDLSAG 212 (469) Q Consensus 146 ~~~d~~~---~~~~~~~v~~~~~~--~~-~--~~~~~~~~~~~~~~--~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (469) = |.... ...+..++-..... +. + +.+....+-+++.+ + +..|+....+.... . 
T Consensus 161 n-W~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~-------------~ 226 (489) T protein:vir:78 161 N-WRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQE-------------D 226 (489) T ss_pred C-ceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccc-------------e Confidence 3 32111 12344333222211 11 1 12333334444432 1 22222222211100 0 Q ss_pred ccccccccccccCCcccEEEecCCc----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhh-- Q lcl|NC_010179. 213 YETGQSNTLKHNFGRVPFIEFPKNK----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFM-- 286 (469) Q Consensus 213 ~~~~~~~~~~~~~g~vPvv~~~n~~----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~-- 286 (469) .....+....|+|+.||+|.+.... .+.+-+.++-.|.-+.-...|+.-+++...++|++++.|.+....+... T Consensus 227 ~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~~~~~~~~~ 306 (489) T protein:vir:78 227 VVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENLTPQAFKEA 306 (489) T ss_pred eeEEeccCCCCccCeeeEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccCCccccccc Confidence 0011123345789999999996432 2455688888888899999999999999999999999996432211111 Q ss_pred ----hhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHh-CCCCcCccccCCccHHHHHHHHHHHH Q lcl|NC_010179. 287 ----NDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFG-QGIDPANFESSNASGVAIKMLYSHLE 361 (469) Q Consensus 287 ----~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s-~~p~~~~~~~g~~Sg~Al~~~~~~l~ 361 (469) .-+.....+.++ .++.++|++...+.- .+..++.++.++.... .+.. ..++.||++......... 
T Consensus 307 ~~~~i~~g~~~~~~lp-----~~~~~~~ie~~~~~~-~r~~l~~le~qm~~lGa~l~~----~~~~~Ta~~~~~~~~~~~ 376 (489) T protein:vir:78 307 NPNGIKFGSRRGHNLG-----YGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLIT----PTQQITAQSARIQRGADT 376 (489) T ss_pred CccceeeCCcccccCC-----CCCCcceeccCcchH-HHHHHHHHHHHHHHHhhhhcc----CCcchhHHHHHHHHHHhh Confidence 111122222233 235678998876543 4777888888888764 3321 235678888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcccCCCccc--ceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC--CCCC Q lcl|NC_010179. 362 LKAAKTQTYFEHAINELVRAIMRYLNFSDADKRH--ISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN--PIVD 435 (469) Q Consensus 362 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~--i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l--~~v~ 435 (469) +........++.++.+++++++.+++........ +...|... .-..+.++++.++ +|.+|++|.+..| ..|- T Consensus 377 S~L~~~a~~~e~al~~~l~~~a~w~G~~~~~~~~i~~n~dF~~~--~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv~ 454 (489) T protein:vir:78 377 SVMATIARNVSQAYTDALRWVAVMLGKPEDTEVEFRLNMDFFLE--PMTAQDRAAWMADINAGLLPATAYYAALRKAGVT 454 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccCcc--cCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCC Confidence 8888999999999999999999999875332222 23334322 2235566666665 7899999998865 2233 Q ss_pred CH--HHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
436 DW--QQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 436 d~--~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) |+ +.+.++|..|- .....+-..+.+....++| T Consensus 455 d~~~e~~~~ei~~~~--~~~~~~~~g~~~~~~q~~~ 488 (489) T protein:vir:78 455 DWTDADIKDAVADQP--LPVATEVQGEIPQSAQQQE 488 (489) T ss_pred CccHHHHHHHHhhcC--CCcccCCcccCCCCccccc Confidence 32 22333443321 1111223344444444555 No 78 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.89 E-value=8.1e-22 Score=136.20 Aligned_cols=435 Identities=9% Similarity=0.028 Sum_probs=242.0 Q ss_pred CC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccc-cchhhhccc---c--cccccccCcceeccchHHHHHH Q lcl|NC_010179. 1 ME-LDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRN-NGKPKVSKE---G--KKDPLRSADNRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~-~~~~~~~~~---~--~~~~~~~~~~ri~~n~~k~iv~ 73 (469) |. +.... ..+....++.+.+.+-+.|...+.... .+.++.... . ...-..+...-+-.|+++.+++ T Consensus 32 m~dV~~~h-------p~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~ 104 (535) T protein:vir:80 32 LPNVGYQR-------VEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLD 104 (535) T ss_pred CCCCCcCC-------HHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHH Confidence 54 21111 112334566777777787764322111 011110000 0 0000001111245699999999 Q ss_pred HHHHhhhcCCeeeccCchhhHHHHHHHHhc-----cHHHH-HHHHHHHHHhCCeEEEEEEEcCCC-------------ce Q lcl|NC_010179. 74 QEAGYIASVFPDIDVGKDADNKKILDVLGD-----DRALT-LNSLLVDSSNAGRAWLHYWIDEDN-------------NF 134 (469) Q Consensus 74 ~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~-----n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~-------------~~ 134 (469) ..++++|.+||++..+ ..+..++.+ +.++. 
+..+...++.+|+++++|-+...+ +| T Consensus 105 ~l~G~vfrk~p~~~~p-----~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~~~~rP 179 (535) T protein:vir:80 105 GMMGQVFSRDPIRQLP-----PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKLGLYRP 179 (535) T ss_pred HHhchhhcCCcceecc-----HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHhcCCCc Confidence 9999999999988644 334444432 33444 557788999999999999765544 37 Q ss_pred EEEEEccceeEEEEeCCC---CCceEEEEEEEEe-eecCC--ceEEEEEEEEcC--C---eEEEEEeecCceeecccccc Q lcl|NC_010179. 135 RYGIIQPDQITPVYATTL---DNKLLGVLRSYKQ-LDPEA--GKYFTVHEYWTD--K---EAQFFRTSATDSTVIEPYNI 203 (469) Q Consensus 135 ~i~~~~p~~~~~~~d~~~---~~~~~~~v~~~~~-~~~~~--~~~~~~~~~~~~--~---~~~~~~~~~~~~~~~~~~~~ 203 (469) .+..++|.+++= |+... ...+..++..-.. ..+++ .+....+-+... + .+..|+.+..+... T Consensus 180 y~~~y~ae~Iin-W~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~------ 252 (535) T protein:vir:80 180 TITLVHPTSIIN-WRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMY------ 252 (535) T ss_pred EEEEechhhccC-ccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccc------ Confidence 788888888653 43221 1234333221111 11222 222222222222 1 12222222111000 Q ss_pred cccccccccccccccccccccCCcccEEEecCCc----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc Q lcl|NC_010179. 204 ITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNK----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG 279 (469) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~ 279 (469) ........+....|.|+.||||.|.... .+.+-|.++..|.-++....|++.+.+...++|++++.|.+. 
T Consensus 253 ------~~~~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~ 326 (535) T protein:vir:80 253 ------YSYSKHVPTDGNGNPFKEIPFQFIGPLDNNADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTK 326 (535) T ss_pred ------cccceeecccCCCcccCeeEEEEeecCCCCCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceeeeecCch Confidence 0011223334556899999999986432 245568889999889999999999999999999999999754 Q ss_pred ccch----hhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHH Q lcl|NC_010179. 280 ASLK----QFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKM 355 (469) Q Consensus 280 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~ 355 (469) .... .....+....++.++. +++++|++...+.-+ ...++.+..+|..+....- ....++.++.+-+. T Consensus 327 ~~~~~~~~~~~i~iG~~~~~~lP~-----~~~~~~~e~~~~~~a-~~~l~~~e~qM~~lGa~ll--~~~~~~~Ta~~a~~ 398 (535) T protein:vir:80 327 DWVEDVFKDFKVHLGSRAIIPLPQ-----GATAGILQITPNSVP-FEAMTHKESQMIAMGANLL--VKSGGNRTFGEAQQ 398 (535) T ss_pred hhhhcCCCCcceEecCcccccCCC-----CCCcceeeeccchhH-HHHHHHHHHHHHHHHHHhh--ccCcccccHHHHHH Confidence 3311 1112233444554544 345788887665444 3568888888888654331 22245555544455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCC-CCCC-HHHHHHHHHHH--hccCChHHHHHhC Q lcl|NC_010179. 356 LYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRT-KVED-SLTKAQIVSTV--ANYSSKEAVAKAN 431 (469) Q Consensus 356 ~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~-~p~d-~~e~~~~~~kl--~g~iS~et~~~~l 431 (469) ..+...+........++.++.+++++++.+++... +...+.|..+.. 
...+ ..+.++++.++ +|.+|++|++..| T Consensus 399 ~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G~~~-~~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~~~L 477 (535) T protein:vir:80 399 EEASEQSILSACTKNVSMAFRKALRWANQFQTGIV-NDETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMRAGL 477 (535) T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCCcc-CCCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHHHHH Confidence 55666666777888899999999999999988532 222333333322 1222 34566666665 7899999998776 Q ss_pred ---CCCC---CHHHHHHHHHHHHHHhhhhHhhcccCCCCCC-----------CCC Q lcl|NC_010179. 432 ---PIVD---DWQQELKDLAKDREENDPYANQADELNGKGV-----------DDE 469 (469) Q Consensus 432 ---~~v~---d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~-----------~de 469 (469) +.++ +.++|+.||+.|-.+.........+...+|. -++ T Consensus 478 ~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~~ 532 (535) T protein:vir:80 478 RRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGNQ 532 (535) T ss_pred HhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCcccccc Confidence 3332 2466788888875443332221111111111 111 No 79 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.88 E-value=9.2e-22 Score=135.91 Aligned_cols=445 Identities=14% Similarity=0.062 Sum_probs=243.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcc-cccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSK-EGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~-~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) ++.+--..=|+.--..+....++.+.+.+-|.|.+....+..+.+.... 
.....-..+...-+-.|+++.+++..++++ T Consensus 2 ~~~~~~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~~l~G~v 81 (491) T protein:vir:95 2 LTANGQGSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLSGMVGSV 81 (491) T ss_pred cccCCccCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHHHHhchh Confidence 2222111111111123345567788888889885321111111111000 000000001111244699999999999999 Q ss_pred hcCCeeeccCchhhHHHHHHHHhc-cHHHH-HHHHHHHHHhCCeEEEEEEEcCCC------------ceEEEEEccceeE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILDVLGD-DRALT-LNSLLVDSSNAGRAWLHYWIDEDN------------NFRYGIIQPDQIT 145 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~-n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~------------~~~i~~~~p~~~~ 145 (469) |.+||++..++ .....+.++-.. +.++. +..+...++.+|+++++|-.+..+ +|.+..++|.+++ T Consensus 82 frk~p~~~~p~-~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~Ii 160 (491) T protein:vir:95 82 MRKEPEINIPK-ELEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTENIV 160 (491) T ss_pred hcCCceeeccH-HHHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhhhc Confidence 99999986542 222333333222 34444 557888999999999999876554 4778888998865 Q ss_pred EEEeCCC---CCceEEEEEEEEe--eecC---CceE---EEEEEEEcCC--eEEEEEeecCceeeccccccccccccccc Q lcl|NC_010179. 146 PVYATTL---DNKLLGVLRSYKQ--LDPE---AGKY---FTVHEYWTDK--EAQFFRTSATDSTVIEPYNIITSYDLSAG 212 (469) Q Consensus 146 ~~~d~~~---~~~~~~~v~~~~~--~~~~---~~~~---~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (469) = |.... ...+..++..-.. .+.. +.+. +++++..+++ .+..|+....+.... . 
T Consensus 161 n-W~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~-------------~ 226 (491) T protein:vir:95 161 N-WRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQE-------------E 226 (491) T ss_pred C-ceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCccee-------------e Confidence 3 32111 1234333322221 1111 1222 2232322222 222232221111110 0 Q ss_pred ccccccccccccCCcccEEEecCCc----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh Q lcl|NC_010179. 213 YETGQSNTLKHNFGRVPFIEFPKNK----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND 288 (469) Q Consensus 213 ~~~~~~~~~~~~~g~vPvv~~~n~~----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~ 288 (469) .....+....|+|+.||+|.+.... .+.+-+.++-.|.-+.-...|+.-+++...++|++++.|.+....+..... T Consensus 227 ~~~~~~~~g~~~l~~IPfv~~~~~~~~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~~~~~~~~~ 306 (491) T protein:vir:95 227 VVEIYPDLGESLRGVIPFTFIGATNNDATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNLTPQSFKEA 306 (491) T ss_pred eeeeeecCCCcccCeeEEEEEecCCCCCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcccCcchhhcc Confidence 0111122335789999999996432 245558888888888999999999999999999999999653222211111 Q ss_pred hhhcceeeecccC---CCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHh-CCCCcCccccCCccHHHHHHHHHHHHHHH Q lcl|NC_010179. 289 LREYKSIKINNAG---NGDKSGVDKLQIDIPVEARDDALKITRDNIFLFG-QGIDPANFESSNASGVAIKMLYSHLELKA 364 (469) Q Consensus 289 ~~~~~~~~~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s-~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~ 364 (469) . ...+.+.+.. -..+++++|++.+.+.- .+..++.++.++.... ++. . ..++.||++.........+.. 
T Consensus 307 -~-~~~i~~g~~~~~~lP~~~~~~~ie~~~~~~-~~~~l~~~e~qm~~~Ga~l~---~-~~~~~Ta~~~~~~~~~~~S~L 379 (491) T protein:vir:95 307 -N-PNGIKFGSRCGHNLGYGGSAQLIQAGENNL-ARQNMLDKEQQAIQIGAQLI---T-PSQQITAESARIQRGADTSVM 379 (491) T ss_pred -C-cceeEecCcCCcCCCCCCccceeecCcchH-HHHHHHHHHHHHHHHHHHhc---c-CCcchhHHHHHHHHHHhhHHH Confidence 1 1112221110 01346788998876543 4777888888887763 332 1 235679999888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCCCcc--cceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCC--CCC--C Q lcl|NC_010179. 365 AKTQTYFEHAINELVRAIMRYLNFSDADKR--HISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANP--IVD--D 436 (469) Q Consensus 365 ~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~--~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~--~v~--d 436 (469) ......++.++.+++++++.+++.+..... .+...|... .-..+.++++.++ +|.+|++|.+..|- .|. + T Consensus 380 ~~~a~~~e~al~~~l~~~a~w~G~~~~~~v~i~~n~dF~~~--~~~~~~~~all~~~~~G~is~~t~~~~L~~~~vl~~~ 457 (491) T protein:vir:95 380 ATIARNVSQAYTDALRWVAMMLGKPEDSEVEFQLNMDFFLQ--PMTAQDRAAWMADINAGLLPATAYYAALRKAGVTDWT 457 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCCCCceEEEeecccccc--cCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCCCCcc Confidence 999999999999999999999987532222 223334322 2235567777664 78999999987652 233 3 Q ss_pred HHHHHHHHHHHHHHhhhhHhhcccCCCCCC-CCC Q lcl|NC_010179. 437 WQQELKDLAKDREENDPYANQADELNGKGV-DDE 469 (469) Q Consensus 437 ~~~E~eri~~E~~~~~~~~~~~~~~~~~~~-~de 469 (469) .+++.++|++|.-..-....-..+.++-.. 
..| T Consensus 458 ~e~~~~~ie~~~~~~~~~~~~~~~~~~~~~~~~~ 491 (491) T protein:vir:95 458 DEDILNAIEDAPLPSGAVTQVAGEIPQAAQQQQE 491 (491) T ss_pred HHHHHHHHHhcCCCCCccccccccchhhhhhccC Confidence 456677775552110000001111111111 111 No 80 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.87 E-value=1e-20 Score=130.18 Aligned_cols=425 Identities=11% Similarity=-0.004 Sum_probs=229.0 Q ss_pred CCHHHHHHHHHHHHHHHHHH----HHHHH-HHHHHhcc--CCcccccccchhhhcccccccccccCcc---e-eccchHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDL----INNYK-KSVDYYEN--KTDITTRNNGKPKVSKEGKKDPLRSADN---R-IPSNFYQ 69 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~----~~~~~-~~~~Yy~g--~~~i~~~~~~~~~~~~~~~~~~~~~~~~---r-i~~n~~k 69 (469) |.+.........+..+-..- ....+ ....|..- .++.....+..+...... ....+.+. | +-.|+++ T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~--~~~~y~~~~~~rA~~~n~~~ 91 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAK--IEKDWEDLTWRLANYVNIVN 91 (488) T ss_pred ecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhcc--chhhhHhhhhhccccCchhH Confidence 44332222222111111000 00011 11112110 001000000000000000 00000011 1 2359999 Q ss_pred HHHHHHHHhhhcCCeeeccCch-hhHHHHHHHHhc-cHHHH-HHHHHHHHHhCCeEEEEEEEcCCC-----------ceE Q lcl|NC_010179. 70 LLVDQEAGYIASVFPDIDVGKD-ADNKKILDVLGD-DRALT-LNSLLVDSSNAGRAWLHYWIDEDN-----------NFR 135 (469) Q Consensus 70 ~iv~~~~~~l~g~p~~~~~~~~-~~~~~l~~~~~~-n~~~~-~~~~~~~~~~~G~~~~~v~~d~~~-----------~~~ 135 (469) ..++..+|++|.+||++..++. .....+.++-.. +.++. ...+...++.+|+++++|-.++.+ +|. 
T Consensus 92 ~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T~ade~~~~~rPy 171 (488) T protein:vir:96 92 PTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESATMADWNKGKKLPT 171 (488) T ss_pred HHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCCHHHHHHhcCCcE Confidence 9999999999999999986543 333333333222 34444 557888999999999999877643 478 Q ss_pred EEEEccceeEEEEeCCC-C--CceEEEEEEEEeeecCCc----eEEEEEEEEcCCeEEEEEeecCceeeccccccccccc Q lcl|NC_010179. 136 YGIIQPDQITPVYATTL-D--NKLLGVLRSYKQLDPEAG----KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYD 208 (469) Q Consensus 136 i~~~~p~~~~~~~d~~~-~--~~~~~~v~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (469) +..++|.+++= |+... . ..+..++..-...+.++. +....+-.+.+.....++...+.. T Consensus 172 ~~~~~a~~Iin-W~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~------------- 237 (488) T protein:vir:96 172 AAFYDALHIID-WEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEY------------- 237 (488) T ss_pred EEEechhhhcC-cceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCc------------- Confidence 88899988653 32211 1 134333222112222221 111111223444333333322221 Q ss_pred ccccccccccccccccCCcccEEEecCCc----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchh Q lcl|NC_010179. 209 LSAGYETGQSNTLKHNFGRVPFIEFPKNK----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQ 284 (469) Q Consensus 209 ~~~~~~~~~~~~~~~~~g~vPvv~~~n~~----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~ 284 (469) ..+........|+|+.||||+|.... .+.+-+.++..|.-+.-...|++.+.+...+.|++++.+. +...+. 
T Consensus 238 ---~~e~~~~~~g~~~l~~IP~v~~~~~~~~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~-~~~~~~ 313 (488) T protein:vir:96 238 ---SDEWTPVLINSKQSDTIPFFLASSQSNEWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAKWMVDMG-DMNKTM 313 (488) T ss_pred ---ccceEeecCCCcccCeeEEEEEecCCCCCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCceeeeccC-CCCccc Confidence 11122223345789999999996433 2456688889998899999999999999999998886432 211111 Q ss_pred hhhhhhhcceee-ecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhC-CCCcCccccCCccHHHHHHHHHHHHH Q lcl|NC_010179. 285 FMNDLREYKSIK-INNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQ-GIDPANFESSNASGVAIKMLYSHLEL 362 (469) Q Consensus 285 ~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~-~p~~~~~~~g~~Sg~Al~~~~~~l~~ 362 (469) .......++.. .........+.++|++.+.+.- .+..++.++++|...+. ++. . .++-||++.........+ T Consensus 314 -~~~~~~~g~~~~~~~~~~~~~g~~~~~e~~~~~l-~~~~l~~l~~qm~~~Ga~l~~---~-~~~~Ta~~~~~~~~~~~S 387 (488) T protein:vir:96 314 -ASEMNPLGFTLAGRMPYYVKNGDVKVIQAQFSPE-TENKVEKLFEQAVKVGASLFT---Q-QSNETATGAAIRSGSSTA 387 (488) T ss_pred -ccccccceeeecccccccccCCceeecCCchhHH-HHHHHHHHHHHHHHHhHhhcc---C-CCcchHHHHHHHHHHhhH Confidence 11111111111 0011112245688887665433 36778999999877553 221 2 245688888888888888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCCC--cccceEEeCCCC-CCC-HHHHHHHHHHH--hccCChHHHHHhC--CCC Q lcl|NC_010179. 363 KAAKTQTYFEHAINELVRAIMRYLNFSDAD--KRHISQHWTRTK-VED-SLTKAQIVSTV--ANYSSKEAVAKAN--PIV 434 (469) Q Consensus 363 k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~--~~~i~i~f~~~~-p~d-~~e~~~~~~kl--~g~iS~et~~~~l--~~v 434 (469) ........++.++++++++++.+++..+.. ..++++..++.. ... 
..+.++++.++ +|.||++|.+..| ..+ T Consensus 388 ~L~~~a~~le~al~~~l~~~A~w~g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~~~L~~~gv 467 (488) T protein:vir:96 388 SMATLGNNVEDTVRNMLRFIMRYFEGTNLYVNPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWFELLKRARV 467 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcCCCCCCcCccceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCc Confidence 888999999999999999999999976532 334444444332 222 45677777775 7899999998775 233 Q ss_pred C--C--HHHHHHHHHHHHHHhhhhH Q lcl|NC_010179. 435 D--D--WQQELKDLAKDREENDPYA 455 (469) Q Consensus 435 ~--d--~~~E~eri~~E~~~~~~~~ 455 (469) - | .++|.+||+.+ .... T Consensus 468 l~~d~~~e~~~~~ie~~----g~~~ 488 (488) T protein:vir:96 468 VRGDMSKEEFDEHIAEL----GFGM 488 (488) T ss_pred CCccCCHHHHHHHHhhc----CCCC Confidence 2 2 34455555432 1111 No 81 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.83 E-value=3.4e-20 Score=127.33 Aligned_cols=451 Identities=13% Similarity=0.078 Sum_probs=217.5 Q ss_pred CCHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR-------NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~-------~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~ 73 (469) ++-++..++.++++... ..-+....+..+||.|+|-- ...... ....++ ..+.+|..+.+|+ T Consensus 38 ~~~~~~~~~~~~l~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~----~~~~~~-----l~~~g~--p~~~~N~i~~~i~ 106 (776) T protein:vir:93 38 LDSEQAVELHSRLLSYYRQELSRQQDNRAEMAVDEDYYDNIQWS----QDEIDE-----LKERGQ--APTVYNVISQSVN 106 (776) T ss_pred CCCHHHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHhCCCCCC----HHHHHH-----HHhcCC--ceEEecchHHHHH Confidence 55555544444443322 22333445678999998731 111111 111222 2478999999999 Q ss_pred HHHHhhhcCCeee--ccCc---hhhHH----HHHHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEc Q lcl|NC_010179. 
74 QEAGYIASVFPDI--DVGK---DADNK----KILDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQ 140 (469) Q Consensus 74 ~~~~~l~g~p~~~--~~~~---~~~~~----~l~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~ 140 (469) ..+++...+.+.+ ...+ .+..+ .++.++..|. ......+..+++++|.+|+.|+++.+ +.+.+.+++ T Consensus 107 ~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~~~~ 186 (776) T protein:vir:93 107 WIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAGAES 186 (776) T ss_pred HHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEeeccC Confidence 9999987765543 3322 22223 2333444444 44577889999999999999988765 345567788 Q ss_pred cceeEEEEeCCC----CCceEEEEEE---------EEe------------------ee---------------------- Q lcl|NC_010179. 141 PDQITPVYATTL----DNKLLGVLRS---------YKQ------------------LD---------------------- 167 (469) Q Consensus 141 p~~~~~~~d~~~----~~~~~~~v~~---------~~~------------------~~---------------------- 167 (469) |.++++-.+... +.+.++..++ |.. .+ T Consensus 187 p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 266 (776) T protein:vir:93 187 WRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTAG 266 (776) T ss_pred hhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhccccccccccccccccccccccc Confidence 988765432211 1111111110 000 00 Q ss_pred --cCCceEEEEEEEEcCCeEEEEEee--cCceee--ccccc-------------------cccccccccccccccccccc Q lcl|NC_010179. 168 --PEAGKYFTVHEYWTDKEAQFFRTS--ATDSTV--IEPYN-------------------IITSYDLSAGYETGQSNTLK 222 (469) Q Consensus 168 --~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~--~~~~~-------------------~~~~~~~~~~~~~~~~~~~~ 222 (469) ......++.+|+|....+...... ..++.. ..... 
....+..-.+.........| T Consensus 267 ~~~~~~~~v~v~E~~~r~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~~~~~p 346 (776) T protein:vir:93 267 AVAYARKRVRMIEAWFRMPVRVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMWAGPSP 346 (776) T ss_pred ccccCCCeEEEEEEEEeeeeehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhhccCCC Confidence 001123344555543322221110 000000 00000 00000011111112223344 Q ss_pred ccCCcccEEEecCCc-----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh-hhhcceee Q lcl|NC_010179. 223 HNFGRVPFIEFPKNK-----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND-LREYKSIK 296 (469) Q Consensus 223 ~~~g~vPvv~~~n~~-----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~-~~~~~~~~ 296 (469) .+.+++|+|+|+..+ .|.|.+..+++.++.+|..+|.+.+.+. +.++++..|... +.++.... .+.++++. T Consensus 347 ~~~~~~Pfv~~~~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l~--~~~~~~~~gav~-~~d~~~~~~~rp~~vi~ 423 (776) T protein:vir:93 347 YRHNRYPFTPIWGFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYILS--TNKVLMEEGAVD-DIDEFRREAARPDAVMT 423 (776) T ss_pred CCCCccceEEecCceecccccccchHHhhhHHHHHHHHHHHHHHHhhc--CCceeecccccc-chHHHHHhcccCCceee Confidence 556789999886532 4789999999999999999999988763 556666666432 22333322 34566777 Q ss_pred ecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 297 INNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAI 375 (469) Q Consensus 297 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l 375 (469) +.++.. 
+.+.+.....-..++...+..+...|...|++.+...+..+| .||+|+..+..............|..++ T Consensus 424 ~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~~~~~~~~~~~~~dn~~~~~ 500 (776) T protein:vir:93 424 VKNGKL---GAVKMDVDRDLAPAHLELASRSIQMIQQVGGVTDEMLGRTTNAVSGVAIQARQEQGSVATNKLFDNLRLAF 500 (776) T ss_pred eCCccc---cccccccCcCccHHHHHHHHHHHHHHHHhhCcChHHhCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 765432 233443322224667777888899999998887765555444 7999998877776666666666666666 Q ss_pred HHHHHHHHHHh----cc------cCCC----c----------------ccceEEeCCCCCCCHHHHHHHHHHHhccCChH Q lcl|NC_010179. 376 NELVRAIMRYL----NF------SDAD----K----------------RHISQHWTRTKVEDSLTKAQIVSTVANYSSKE 425 (469) Q Consensus 376 ~~~~~~i~~~~----~~------~~~~----~----------------~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~e 425 (469) +++.++++.++ +. .+.+ + .+|.|.=.+..+.-..+..+.++.+.+.+..+ T Consensus 501 ~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~r~~~~~~l~ql~~~~~p~ 580 (776) T protein:vir:93 501 QQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATMRQAAVAELMEVIGKMPPE 580 (776) T ss_pred HHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhHHHHHHHHHHHHHhhcChh Confidence 66666555443 21 1111 1 01222222222221222333333333322222 Q ss_pred H-------HHHhCCCCCCHHHHHHHHHHHHHHhhhhH-----------------hh-cccCCCCCCCC-C Q lcl|NC_010179. 426 A-------VAKANPIVDDWQQELKDLAKDREENDPYA-----------------NQ-ADELNGKGVDD-E 469 (469) Q Consensus 426 t-------~~~~l~~v~d~~~E~eri~~E~~~~~~~~-----------------~~-~~~~~~~~~~d-e 469 (469) . +++..++ .+.++-.+++++.....++.. .+ ......-.... 
+ T Consensus 581 ~~~~~~~~~~e~~d~-p~~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~~~~~a~~~~~q 649 (776) T protein:vir:93 581 IALTMLDLLVENMDI-PNRDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYNDALAIATLEEQQ 649 (776) T ss_pred hHHHHHHHHHHhcCc-cchHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHHHHhhhhhhHhh Confidence 1 1222211 111111222221110000000 00 00000000000 0 No 82 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.82 E-value=4.5e-19 Score=121.15 Aligned_cols=452 Identities=11% Similarity=0.012 Sum_probs=227.5 Q ss_pred CCHHHH-HHHHHHHHH---HHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 MELDAL-KKLIRNTST---SRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~~~~-~~~i~~~~~---~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) .+.+.+ .++...+.. .+.+.+....+-.+||.|+|- ........ ...++ ..+.+|.++.+|+..+ T Consensus 26 ~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw----~~~~~~~l-----~~~g~--p~~~~N~i~~~v~~v~ 94 (711) T protein:vir:10 26 DDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQW----PSQVRTER-----ELEQR--PCLVNNVLPTFVDQVL 94 (711) T ss_pred chHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCC----CHHHHHHH-----HhcCC--CcEEEcchHHHHHHHh Confidence 333322 223333322 233445556678999999872 11111111 11112 2467899999999999 Q ss_pred HhhhcCCeeec--c-------------------------CchhhHHHHHH----HHhccHH-HHHHHHHHHHHhCCeEEE Q lcl|NC_010179. 77 GYIASVFPDID--V-------------------------GKDADNKKILD----VLGDDRA-LTLNSLLVDSSNAGRAWL 124 (469) Q Consensus 77 ~~l~g~p~~~~--~-------------------------~~~~~~~~l~~----~~~~n~~-~~~~~~~~~~~~~G~~~~ 124 (469) ++--.+.+.+. + ++.+..+.+.. +.+.+.. 
.....+..+++++|.+|+ T Consensus 95 g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~af~d~~~~G~G~~ 174 (711) T protein:vir:10 95 GDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEYDIAFQGAVESGMGYL 174 (711) T ss_pred hhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHHHHHHHHhhhcCcceE Confidence 99877665542 2 12233333333 3334443 456788899999999999 Q ss_pred EEEEcC------CCceEEEEE-ccceeEEEEeCCC------CCceEEEEEE---------EEeee----------cCC-- Q lcl|NC_010179. 125 HYWIDE------DNNFRYGII-QPDQITPVYATTL------DNKLLGVLRS---------YKQLD----------PEA-- 170 (469) Q Consensus 125 ~v~~d~------~~~~~i~~~-~p~~~~~~~d~~~------~~~~~~~v~~---------~~~~~----------~~~-- 170 (469) -++.|. +|++++..+ +|.+++ ||+.. +.+.++..++ |.... ..+ T Consensus 175 ev~~d~~~~d~~~~e~~i~~v~~p~~v~--~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~yp~~a~~~~~~~~~~~~~~~ 252 (711) T protein:vir:10 175 RVRSDYLADDSFEQDLIIEAIQNQFSVT--IDPDAKKRDRSDMNWCLIDDTMSKEKFKALYPDATAEPVYEDSVADYDTW 252 (711) T ss_pred EEEecccCCCCCCCCeEEeeecChhhee--eCccccccChhhhcceeeeecCCHHHHHHhCCchhhhhhhcccccccCcc Confidence 887542 378888777 688865 44422 1222222222 11000 000 Q ss_pred --ceEEEEEEEEcCCeEEEEEeecCceeeccccc-----------------------ccccccccccccccccccccccC Q lcl|NC_010179. 171 --GKYFTVHEYWTDKEAQFFRTSATDSTVIEPYN-----------------------IITSYDLSAGYETGQSNTLKHNF 225 (469) Q Consensus 171 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~ 225 (469) ...++..++|....+.+......++....... ....+..-.+ .....+..|.+. T Consensus 253 ~~~~~vrv~E~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~~~~~G-~~~L~~~~p~~~ 331 (711) T protein:vir:10 253 FTEKSVRVSEYFTREPVIREIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYWRKITG-ANVLEGPVEIPS 331 (711) T ss_pred cCcceeeEEEEEeeeeeeeEEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEEEEEec-ceeecCCCCCCC Confidence 12234556665443322221111111110000 0000000111 112223344555 Q ss_pred CcccEEEecC-------CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE-ecCCcccchhhhh--hhhhccee Q lcl|NC_010179. 
226 GRVPFIEFPK-------NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL-TNYGGASLKQFMN--DLREYKSI 295 (469) Q Consensus 226 g~vPvv~~~n-------~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~-~g~~~~~~~~~~~--~~~~~~~~ 295 (469) +++|+|+|.- ...+.|.+..+++.++.+|...|.+.+.+...+.+.+++ .|.. .+..+... ..+.++++ T Consensus 332 ~~~P~vp~~g~r~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai-~~~~~~~~e~~~~~~~vi 410 (711) T protein:vir:10 332 TTIPVIPVWGKSLIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNV-EGREDEWEQANTKNFSLL 410 (711) T ss_pred CcccEEEEeeeeeccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCccc-CChHHHHHhccccCCCee Confidence 7788887742 223567899999999999999999999999988876555 4432 23233222 24566777 Q ss_pred eecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC-CccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 296 KINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS-NASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 296 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) .+.++..+. +.++++..+.-..++...++.....|-..|+..+...+..+ +.||+|+.................+..+ T Consensus 411 ~~~~~~~~~-~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n~~Sg~ai~~~q~qg~~~l~~~~dn~~~~ 489 (711) T protein:vir:10 411 TYIPQYQGD-PGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGNETSGRAIIARQRQGDRGSFAFIDNLTKS 489 (711) T ss_pred EecccccCc-CCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777765433 45677665555677788888888999888887765554443 4799999888777666666566666666 Q ss_pred HHHHHHHHHHH----hcc------cCC----Cc-------------------------ccceEEeCCCCCCCHHHHHHHH Q lcl|NC_010179. 375 INELVRAIMRY----LNF------SDA----DK-------------------------RHISQHWTRTKVEDSLTKAQIV 415 (469) Q Consensus 375 l~~~~~~i~~~----~~~------~~~----~~-------------------------~~i~i~f~~~~p~d~~e~~~~~ 415 (469) .+++.++++.+ +.. 
.+- ++ .+|.+.=.+..+.-..+.+..+ T Consensus 490 ~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nDi~~g~~Dv~i~~~p~~~s~r~~~~~~l 569 (711) T protein:vir:10 490 IRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHDLNVQKYDVVVTTGPAFATQRIEAAEAM 569 (711) T ss_pred HHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeeccceeeeEEEEeeccCchhHHHHHHHHH Confidence 66665555544 321 110 00 1223333334444344444555 Q ss_pred HHHhccCChH------HHHHhCCCCCCHHHHHHHHH--------------------HHHHHhhhhHh---h-cccC---- Q lcl|NC_010179. 416 STVANYSSKE------AVAKANPIVDDWQQELKDLA--------------------KDREENDPYAN---Q-ADEL---- 461 (469) Q Consensus 416 ~kl~g~iS~e------t~~~~l~~v~d~~~E~eri~--------------------~E~~~~~~~~~---~-~~~~---- 461 (469) ..+.+.+|.- .+++.+++ ++.++=.++++ .|++......+ + .+.. T Consensus 570 ~ql~~~~p~~~~~~~~~il~~~d~-p~~~el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa 648 (711) T protein:vir:10 570 IQFAQAVPSAAAVMADLIAQNMDW-PGADVIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQA 648 (711) T ss_pred HHHHhhcchhhhHHHHHHHHhcCC-CCHHHHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5444444321 12333332 22211122221 11110000000 0 0000 Q ss_pred ---CCCCCCCC Q lcl|NC_010179. 462 ---NGKGVDDE 469 (469) Q Consensus 462 ---~~~~~~de 469 (469) .....-++ T Consensus 649 ~ae~~~Aqae~ 659 (711) T protein:vir:10 649 EADTAQAQADM 659 (711) T ss_pred HHHHHHHHHHH Confidence 00000000 No 83 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.75 E-value=5e-16 Score=104.45 Aligned_cols=446 Identities=13% Similarity=0.046 Sum_probs=218.0 Q ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSR---NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) .+.+.-.+++..+.... .+-+....+..+||.|.|-- ....... ...++ ..+.+|.++.+|+..++ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~----~~~~~~l-----~~~g~--p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:99 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP----PEVLQVL-----KDRGQ--PMTIHNLIAPTVDGVLG 84 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcCC--CcEEeccHHHHHHHHHh Confidence 33344444544443332 33345566788999998731 1111111 11122 24678999999999999 Q ss_pred hhhcCCeee--ccC--ch---hhHHHH----HHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccc Q lcl|NC_010179. 78 YIASVFPDI--DVG--KD---ADNKKI----LDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPD 142 (469) Q Consensus 78 ~l~g~p~~~--~~~--~~---~~~~~l----~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~ 142 (469) +--.+.+.+ .+. ++ ...+.| +.++..+. ......+..+++++|.+|+-+|.+.+ +.+++..++|. T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~ 164 (714) T protein:vir:99 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRN 164 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchh Confidence 987776554 331 21 123333 33344443 44577889999999999999998765 56889999999 Q ss_pred eeEEEEeCC----CCCceEEEEEEEEeeec-------------------------------------------------- Q lcl|NC_010179. 143 QITPVYATT----LDNKLLGVLRSYKQLDP-------------------------------------------------- 168 (469) Q Consensus 143 ~~~~~~d~~----~~~~~~~~v~~~~~~~~-------------------------------------------------- 168 (469) ++++-.+.. .+.+.++ ++.|...+. 
T Consensus 165 ~v~~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (714) T protein:vir:99 165 EVFWDWLSREADLSDCRWLM-RRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQN 243 (714) T ss_pred heeeccccccCChhhcccee-eeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccccc Confidence 976533210 1112222 111111000 Q ss_pred ----CCceEEEEEEEEcCCeEEEE--EeecCceeeccccccccc-------------------ccccccccccccccccc Q lcl|NC_010179. 169 ----EAGKYFTVHEYWTDKEAQFF--RTSATDSTVIEPYNIITS-------------------YDLSAGYETGQSNTLKH 223 (469) Q Consensus 169 ----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~ 223 (469) .....+..+++|-.....+. ...++....+........ ...-.+......+..|. T Consensus 244 ~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 323 (714) T protein:vir:99 244 EWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSA 323 (714) T ss_pred ccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCC Confidence 00012233444432221111 111111111111000000 00001111111122233 Q ss_pred cCCcccEEEecCC---cc--ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh-hhhcceeee Q lcl|NC_010179. 224 NFGRVPFIEFPKN---KY--RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND-LREYKSIKI 297 (469) Q Consensus 224 ~~g~vPvv~~~n~---~~--g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~-~~~~~~~~~ 297 (469) +.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|............ -++++++.+ T Consensus 324 p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:99 324 PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred CCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceee Confidence 3355666665432 12 34778999999999999999998876 4555555556443332333222 334566767 Q ss_pred cccCCC---CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
298 NNAGNG---DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 298 ~~~~~~---~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) .++... ..+.++......-..++...++.....|-..|+..+...+..+| .||+|+...-..........-.-+.. T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~ 481 (714) T protein:vir:99 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQF 481 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554221 12234443333335677777888888888888776655544444 69999887665544444444444555 Q ss_pred HHHHH----HHHHHHHhcc------cCC-Cc--------------------------ccceEEeCCCCCCCHHHHHHHHH Q lcl|NC_010179. 374 AINEL----VRAIMRYLNF------SDA-DK--------------------------RHISQHWTRTKVEDSLTKAQIVS 416 (469) Q Consensus 374 ~l~~~----~~~i~~~~~~------~~~-~~--------------------------~~i~i~f~~~~p~d~~e~~~~~~ 416 (469) +.+++ +.+|..++.. .+- +. .+|.|.=.+..|....+.++.+. T Consensus 482 ~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~ 561 (714) T protein:vir:99 482 ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMS 561 (714) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHH Confidence 55544 4445444431 110 00 12222223334443455555555 Q ss_pred HHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 417 TVANYSSK-------EAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 417 kl~g~iS~-------et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+.+.++. ..+++.+++ ++.++=.++|++-.-... 
......-+| T Consensus 562 ~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~~~~--------~~~~~~~e~ 612 (714) T protein:vir:99 562 EVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALGTPK--------SPDEMTPEE 612 (714) T ss_pred HHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcCCCC--------Cccccchhh Confidence 55443333 345555553 444444555543210000 000000000 No 84 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.75 E-value=5e-16 Score=104.45 Aligned_cols=446 Identities=13% Similarity=0.046 Sum_probs=218.0 Q ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR---NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) .+.+.-.+++..+.... .+-+....+..+||.|.|-- ....... ...++ ..+.+|.++.+|+..++ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~----~~~~~~l-----~~~g~--p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:27 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP----PEVLQVL-----KDRGQ--PMTIHNLIAPTVDGVLG 84 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcCC--CcEEeccHHHHHHHHHh Confidence 33344444544443332 33345566788999998731 1111111 11122 24678999999999999 Q ss_pred hhhcCCeee--ccC--ch---hhHHHH----HHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccc Q lcl|NC_010179. 78 YIASVFPDI--DVG--KD---ADNKKI----LDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPD 142 (469) Q Consensus 78 ~l~g~p~~~--~~~--~~---~~~~~l----~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~ 142 (469) +--.+.+.+ .+. ++ ...+.| +.++..+. ......+..+++++|.+|+-+|.+.+ +.+++..++|. 
T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~ 164 (714) T protein:vir:27 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRN 164 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchh Confidence 987776554 331 21 123333 33344443 44577889999999999999998765 56889999999 Q ss_pred eeEEEEeCC----CCCceEEEEEEEEeeec-------------------------------------------------- Q lcl|NC_010179. 143 QITPVYATT----LDNKLLGVLRSYKQLDP-------------------------------------------------- 168 (469) Q Consensus 143 ~~~~~~d~~----~~~~~~~~v~~~~~~~~-------------------------------------------------- 168 (469) ++++-.+.. .+.+.++ ++.|...+. T Consensus 165 ~v~~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (714) T protein:vir:27 165 EVFWDWLSREADLSDCRWLM-RRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQN 243 (714) T ss_pred heeeccccccCChhhcccee-eeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccccc Confidence 976533210 1112222 111111000 Q ss_pred ----CCceEEEEEEEEcCCeEEEE--EeecCceeeccccccccc-------------------ccccccccccccccccc Q lcl|NC_010179. 169 ----EAGKYFTVHEYWTDKEAQFF--RTSATDSTVIEPYNIITS-------------------YDLSAGYETGQSNTLKH 223 (469) Q Consensus 169 ----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~ 223 (469) .....+..+++|-.....+. ...++....+........ ...-.+......+..|. T Consensus 244 ~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 323 (714) T protein:vir:27 244 EWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSA 323 (714) T ss_pred ccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCC Confidence 00012233444432221111 111111111111000000 00001111111122233 Q ss_pred cCCcccEEEecCC---cc--ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh-hhhcceeee Q lcl|NC_010179. 
224 NFGRVPFIEFPKN---KY--RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND-LREYKSIKI 297 (469) Q Consensus 224 ~~g~vPvv~~~n~---~~--g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~-~~~~~~~~~ 297 (469) +.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|............ -++++++.+ T Consensus 324 p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:27 324 PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred CCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceee Confidence 3355666665432 12 34778999999999999999998876 4555555556443332333222 334566767 Q ss_pred cccCCC---CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 298 NNAGNG---DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 298 ~~~~~~---~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) .++... ..+.++......-..++...++.....|-..|+..+...+..+| .||+|+...-..........-.-+.. T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~ 481 (714) T protein:vir:27 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQF 481 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554221 12234443333335677777888888888888776655544444 69999887665544444444444555 Q ss_pred HHHHH----HHHHHHHhcc------cCC-Cc--------------------------ccceEEeCCCCCCCHHHHHHHHH Q lcl|NC_010179. 374 AINEL----VRAIMRYLNF------SDA-DK--------------------------RHISQHWTRTKVEDSLTKAQIVS 416 (469) Q Consensus 374 ~l~~~----~~~i~~~~~~------~~~-~~--------------------------~~i~i~f~~~~p~d~~e~~~~~~ 416 (469) +.+++ +.+|..++.. .+- +. .+|.|.=.+..|....+.++.+. 
T Consensus 482 ~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~ 561 (714) T protein:vir:27 482 ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMS 561 (714) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHH Confidence 55544 4445444431 110 00 12222223334443455555555 Q ss_pred HHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 417 TVANYSSK-------EAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 417 kl~g~iS~-------et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+.+.++. ..+++.+++ ++.++=.++|++-.-... ......-+| T Consensus 562 ~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~~~~--------~~~~~~~e~ 612 (714) T protein:vir:27 562 EVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALGTPK--------SPDEMTPEE 612 (714) T ss_pred HHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcCCCC--------Cccccchhh Confidence 55443333 345555553 444444555543210000 000000000 No 85 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.75 E-value=5e-16 Score=104.45 Aligned_cols=446 Identities=13% Similarity=0.046 Sum_probs=218.0 Q ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR---NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) .+.+.-.+++..+.... .+-+....+..+||.|.|-- ....... 
...++ ..+.+|.++.+|+..++ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~----~~~~~~l-----~~~g~--p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:32 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP----PEVLQVL-----KDRGQ--PMTIHNLIAPTVDGVLG 84 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcCC--CcEEeccHHHHHHHHHh Confidence 33344444544443332 33345566788999998731 1111111 11122 24678999999999999 Q ss_pred hhhcCCeee--ccC--ch---hhHHHH----HHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccc Q lcl|NC_010179. 78 YIASVFPDI--DVG--KD---ADNKKI----LDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPD 142 (469) Q Consensus 78 ~l~g~p~~~--~~~--~~---~~~~~l----~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~ 142 (469) +--.+.+.+ .+. ++ ...+.| +.++..+. ......+..+++++|.+|+-+|.+.+ +.+++..++|. T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~ 164 (714) T protein:vir:32 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRN 164 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchh Confidence 987776554 331 21 123333 33344443 44577889999999999999998765 56889999999 Q ss_pred eeEEEEeCC----CCCceEEEEEEEEeeec-------------------------------------------------- Q lcl|NC_010179. 143 QITPVYATT----LDNKLLGVLRSYKQLDP-------------------------------------------------- 168 (469) Q Consensus 143 ~~~~~~d~~----~~~~~~~~v~~~~~~~~-------------------------------------------------- 168 (469) ++++-.+.. .+.+.++ ++.|...+. T Consensus 165 ~v~~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (714) T protein:vir:32 165 EVFWDWLSREADLSDCRWLM-RRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQN 243 (714) T ss_pred heeeccccccCChhhcccee-eeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccccc Confidence 976533210 1112222 111111000 Q ss_pred ----CCceEEEEEEEEcCCeEEEE--EeecCceeeccccccccc-------------------ccccccccccccccccc Q lcl|NC_010179. 
169 ----EAGKYFTVHEYWTDKEAQFF--RTSATDSTVIEPYNIITS-------------------YDLSAGYETGQSNTLKH 223 (469) Q Consensus 169 ----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~ 223 (469) .....+..+++|-.....+. ...++....+........ ...-.+......+..|. T Consensus 244 ~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 323 (714) T protein:vir:32 244 EWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSA 323 (714) T ss_pred ccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCC Confidence 00012233444432221111 111111111111000000 00001111111122233 Q ss_pred cCCcccEEEecCC---cc--ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh-hhhcceeee Q lcl|NC_010179. 224 NFGRVPFIEFPKN---KY--RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND-LREYKSIKI 297 (469) Q Consensus 224 ~~g~vPvv~~~n~---~~--g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~-~~~~~~~~~ 297 (469) +.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|............ -++++++.+ T Consensus 324 p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:32 324 PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred CCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceee Confidence 3355666665432 12 34778999999999999999998876 4555555556443332333222 334566767 Q ss_pred cccCCC---CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 298 NNAGNG---DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 298 ~~~~~~---~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) .++... ..+.++......-..++...++.....|-..|+..+...+..+| .||+|+...-..........-.-+.. 
T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~ 481 (714) T protein:vir:32 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQF 481 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554221 12234443333335677777888888888888776655544444 69999887665544444444444555 Q ss_pred HHHHH----HHHHHHHhcc------cCC-Cc--------------------------ccceEEeCCCCCCCHHHHHHHHH Q lcl|NC_010179. 374 AINEL----VRAIMRYLNF------SDA-DK--------------------------RHISQHWTRTKVEDSLTKAQIVS 416 (469) Q Consensus 374 ~l~~~----~~~i~~~~~~------~~~-~~--------------------------~~i~i~f~~~~p~d~~e~~~~~~ 416 (469) +.+++ +.+|..++.. .+- +. .+|.|.=.+..|....+.++.+. T Consensus 482 ~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~ 561 (714) T protein:vir:32 482 ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMS 561 (714) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHH Confidence 55544 4445444431 110 00 12222223334443455555555 Q ss_pred HHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 417 TVANYSSK-------EAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 417 kl~g~iS~-------et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+.+.++. ..+++.+++ ++.++=.++|++-.-... 
......-+| T Consensus 562 ~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~~~~--------~~~~~~~e~ 612 (714) T protein:vir:32 562 EVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALGTPK--------SPDEMTPEE 612 (714) T ss_pred HHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcCCCC--------Cccccchhh Confidence 55443333 345555553 444444555543210000 000000000 No 86 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.75 E-value=5e-16 Score=104.45 Aligned_cols=446 Identities=13% Similarity=0.046 Sum_probs=218.0 Q ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR---NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) .+.+.-.+++..+.... .+-+....+..+||.|.|-- ....... ...++ ..+.+|.++.+|+..++ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~----~~~~~~l-----~~~g~--p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:10 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP----PEVLQVL-----KDRGQ--PMTIHNLIAPTVDGVLG 84 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcCC--CcEEeccHHHHHHHHHh Confidence 33344444544443332 33345566788999998731 1111111 11122 24678999999999999 Q ss_pred hhhcCCeee--ccC--ch---hhHHHH----HHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccc Q lcl|NC_010179. 78 YIASVFPDI--DVG--KD---ADNKKI----LDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPD 142 (469) Q Consensus 78 ~l~g~p~~~--~~~--~~---~~~~~l----~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~ 142 (469) +--.+.+.+ .+. ++ ...+.| +.++..+. ......+..+++++|.+|+-+|.+.+ +.+++..++|. 
T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~ 164 (714) T protein:vir:10 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRN 164 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchh Confidence 987776554 331 21 123333 33344443 44577889999999999999998765 56889999999 Q ss_pred eeEEEEeCC----CCCceEEEEEEEEeeec-------------------------------------------------- Q lcl|NC_010179. 143 QITPVYATT----LDNKLLGVLRSYKQLDP-------------------------------------------------- 168 (469) Q Consensus 143 ~~~~~~d~~----~~~~~~~~v~~~~~~~~-------------------------------------------------- 168 (469) ++++-.+.. .+.+.++ ++.|...+. T Consensus 165 ~v~~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (714) T protein:vir:10 165 EVFWDWLSREADLSDCRWLM-RRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQN 243 (714) T ss_pred heeeccccccCChhhcccee-eeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccccc Confidence 976533210 1112222 111111000 Q ss_pred ----CCceEEEEEEEEcCCeEEEE--EeecCceeeccccccccc-------------------ccccccccccccccccc Q lcl|NC_010179. 169 ----EAGKYFTVHEYWTDKEAQFF--RTSATDSTVIEPYNIITS-------------------YDLSAGYETGQSNTLKH 223 (469) Q Consensus 169 ----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~ 223 (469) .....+..+++|-.....+. ...++....+........ ...-.+......+..|. T Consensus 244 ~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 323 (714) T protein:vir:10 244 EWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSA 323 (714) T ss_pred ccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCC Confidence 00012233444432221111 111111111111000000 00001111111122233 Q ss_pred cCCcccEEEecCC---cc--ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh-hhhcceeee Q lcl|NC_010179. 
224 NFGRVPFIEFPKN---KY--RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND-LREYKSIKI 297 (469) Q Consensus 224 ~~g~vPvv~~~n~---~~--g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~-~~~~~~~~~ 297 (469) +.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|............ -++++++.+ T Consensus 324 p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:10 324 PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred CCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceee Confidence 3355666665432 12 34778999999999999999998876 4555555556443332333222 334566767 Q ss_pred cccCCC---CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 298 NNAGNG---DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 298 ~~~~~~---~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) .++... ..+.++......-..++...++.....|-..|+..+...+..+| .||+|+...-..........-.-+.. T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~ 481 (714) T protein:vir:10 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQF 481 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554221 12234443333335677777888888888888776655544444 69999887665544444444444555 Q ss_pred HHHHH----HHHHHHHhcc------cCC-Cc--------------------------ccceEEeCCCCCCCHHHHHHHHH Q lcl|NC_010179. 374 AINEL----VRAIMRYLNF------SDA-DK--------------------------RHISQHWTRTKVEDSLTKAQIVS 416 (469) Q Consensus 374 ~l~~~----~~~i~~~~~~------~~~-~~--------------------------~~i~i~f~~~~p~d~~e~~~~~~ 416 (469) +.+++ +.+|..++.. .+- +. .+|.|.=.+..|....+.++.+. 
T Consensus 482 ~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~ 561 (714) T protein:vir:10 482 ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMS 561 (714) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHH Confidence 55544 4445444431 110 00 12222223334443455555555 Q ss_pred HHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 417 TVANYSSK-------EAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 417 kl~g~iS~-------et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+.+.++. ..+++.+++ ++.++=.++|++-.-... ......-+| T Consensus 562 ~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~~~~--------~~~~~~~e~ 612 (714) T protein:vir:10 562 EVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALGTPK--------SPDEMTPEE 612 (714) T ss_pred HHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcCCCC--------Cccccchhh Confidence 55443333 345555553 444444555543210000 000000000 No 87 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.75 E-value=5e-16 Score=104.45 Aligned_cols=446 Identities=13% Similarity=0.046 Sum_probs=218.0 Q ss_pred CCHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR---NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~---~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) .+.+.-.+++..+.... .+-+....+..+||.|.|-- ....... 
...++ ..+.+|.++.+|+..++ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~Qw~----~~~~~~l-----~~~g~--p~~~~N~i~~~v~~v~g 84 (714) T protein:vir:81 16 ATPRFSQRQLQALCSDIDSQPKWRDAANKACAYYDGDQLP----PEVLQVL-----KDRGQ--PMTIHNLIAPTVDGVLG 84 (714) T ss_pred hhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcCC--CcEEeccHHHHHHHHHh Confidence 33344444544443332 33345566788999998731 1111111 11122 24678999999999999 Q ss_pred hhhcCCeee--ccC--ch---hhHHHH----HHHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccc Q lcl|NC_010179. 78 YIASVFPDI--DVG--KD---ADNKKI----LDVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPD 142 (469) Q Consensus 78 ~l~g~p~~~--~~~--~~---~~~~~l----~~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~ 142 (469) +--.+.+.+ .+. ++ ...+.| +.++..+. ......+..+++++|.+|+-+|.+.+ +.+++..++|. T Consensus 85 ~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~~i~i~~v~p~ 164 (714) T protein:vir:81 85 MEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGPEFKVSTVSRN 164 (714) T ss_pred HHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCCCeEEEecchh Confidence 987776554 331 21 123333 33344443 44577889999999999999998765 56889999999 Q ss_pred eeEEEEeCC----CCCceEEEEEEEEeeec-------------------------------------------------- Q lcl|NC_010179. 143 QITPVYATT----LDNKLLGVLRSYKQLDP-------------------------------------------------- 168 (469) Q Consensus 143 ~~~~~~d~~----~~~~~~~~v~~~~~~~~-------------------------------------------------- 168 (469) ++++-.+.. .+.+.++ ++.|...+. T Consensus 165 ~v~~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (714) T protein:vir:81 165 EVFWDWLSREADLSDCRWLM-RRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQN 243 (714) T ss_pred heeeccccccCChhhcccee-eeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhhhcccccccc Confidence 976533210 1112222 111111000 Q ss_pred ----CCceEEEEEEEEcCCeEEEE--EeecCceeeccccccccc-------------------ccccccccccccccccc Q lcl|NC_010179. 
169 ----EAGKYFTVHEYWTDKEAQFF--RTSATDSTVIEPYNIITS-------------------YDLSAGYETGQSNTLKH 223 (469) Q Consensus 169 ----~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~ 223 (469) .....+..+++|-.....+. ...++....+........ ...-.+......+..|. T Consensus 244 ~~~~~~~~rv~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 323 (714) T protein:vir:81 244 EWLQRERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSA 323 (714) T ss_pred ccccccccEEEEEEEEEEEEEEEEeeccCCCceEEeCccCHHHHHHHhhcchhhhccccceEEEEEEecCcccccCCCCC Confidence 00012233444432221111 111111111111000000 00001111111122233 Q ss_pred cCCcccEEEecCC---cc--ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh-hhhcceeee Q lcl|NC_010179. 224 NFGRVPFIEFPKN---KY--RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND-LREYKSIKI 297 (469) Q Consensus 224 ~~g~vPvv~~~n~---~~--g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~-~~~~~~~~~ 297 (469) +.+++|+|+|+-. .. ..|.+..+++.++.+|...|.+...+ .++..++..|............ -++++++.+ T Consensus 324 p~~~fp~vp~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a~~~~d~~~~e~~arp~~vi~~ 401 (714) T protein:vir:81 324 PQGMFPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQIERPDGIIKL 401 (714) T ss_pred CCCceeEEEEeeeeeeccCceeehhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCcccccHHHHHHhccCCCCceee Confidence 3355666665432 12 34778999999999999999998876 4555555556443332333222 334566767 Q ss_pred cccCCC---CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 298 NNAGNG---DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 298 ~~~~~~---~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) .++... ..+.++......-..++...++.....|-..|+..+...+..+| .||+|+...-..........-.-+.. 
T Consensus 402 ~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~ 481 (714) T protein:vir:81 402 NPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQF 481 (714) T ss_pred cccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCCccchhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 554221 12234443333335677777888888888888776655544444 69999887665544444444444555 Q ss_pred HHHHH----HHHHHHHhcc------cCC-Cc--------------------------ccceEEeCCCCCCCHHHHHHHHH Q lcl|NC_010179. 374 AINEL----VRAIMRYLNF------SDA-DK--------------------------RHISQHWTRTKVEDSLTKAQIVS 416 (469) Q Consensus 374 ~l~~~----~~~i~~~~~~------~~~-~~--------------------------~~i~i~f~~~~p~d~~e~~~~~~ 416 (469) +.+++ +.+|..++.. .+- +. .+|.|.=.+..|....+.++.+. T Consensus 482 ~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~ 561 (714) T protein:vir:81 482 ACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMS 561 (714) T ss_pred HHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceecccceeeeEEEEEeeccCchHHHHHHHHHHH Confidence 55544 4445444431 110 00 12222223334443455555555 Q ss_pred HHhccCCh-------HHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 417 TVANYSSK-------EAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 417 kl~g~iS~-------et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+.+.++. ..+++.+++ ++.++=.++|++-.-... 
......-+| T Consensus 562 ~l~~~~~p~~~~~~~~~~l~~~d~-p~~~el~~~ir~~~~~~~--------~~~~~~~e~ 612 (714) T protein:vir:81 562 EVIQGLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALGTPK--------SPDEMTPEE 612 (714) T ss_pred HHHhhcCchhhhhHHHHHHHhcCC-CCHHHHHHHHHHHcCCCC--------Cccccchhh Confidence 55443333 345555553 444444555543210000 000000000 No 88 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.75 E-value=3.7e-16 Score=105.19 Aligned_cols=446 Identities=13% Similarity=0.053 Sum_probs=221.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) +..+.+..+...... +..-+....+-.+||.|.|- ........ ...++ ..+.+|.++.+|+..+++.- T Consensus 20 ~~~~~l~~~~~~~~~-~~~~r~~a~~d~~fy~G~Qw----~~~~~~~l-----~~~g~--p~~~~N~i~~~v~~v~g~~~ 87 (714) T protein:vir:10 20 FSQRQLLSLCSDIDS-QPLWRDAANKACAYYDGDQL----APEVIQVL-----KDRGQ--PMTIHNLIAPTVDGVLGMEA 87 (714) T ss_pred hhHHHHHHHHHHHhh-hHHHHHHHHHHHHhhcCCCC----CHHHHHHH-----HhcCC--CcEEeccHHHHHHHHHHHHH Confidence 566666666555333 33344667788999999872 11111111 11122 24678999999999999987 Q ss_pred cCCeeec--cC--ch---hhHHHHH----HHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccceeE Q lcl|NC_010179. 81 SVFPDID--VG--KD---ADNKKIL----DVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPDQIT 145 (469) Q Consensus 81 g~p~~~~--~~--~~---~~~~~l~----~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~~~~ 145 (469) .+.+.+. +. ++ +..+.+. .++..+. 
......+..+++++|.+|+-++.+.+ +.+++..++|.+++ T Consensus 88 ~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~d~~~~~i~i~~v~p~~v~ 167 (714) T protein:vir:10 88 KTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSEPFGPEFKVSTVSRNEVF 167 (714) T ss_pred hCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeeccCCCCCCeEEEecChhhee Confidence 7766543 31 12 1233333 3333444 34577889999999999999998766 67999999999976 Q ss_pred EEEeCCC----CCceEEEEEE---------EEe--------------e-----------------------e-------c Q lcl|NC_010179. 146 PVYATTL----DNKLLGVLRS---------YKQ--------------L-----------------------D-------P 168 (469) Q Consensus 146 ~~~d~~~----~~~~~~~v~~---------~~~--------------~-----------------------~-------~ 168 (469) +-.+... +.+.++..++ |.. . + . T Consensus 168 ~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (714) T protein:vir:10 168 WDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEEYQSWDRQQNEWLQ 247 (714) T ss_pred eccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhcccccccchhhcccccccccccc Confidence 5432110 1111111111 100 0 0 0 Q ss_pred CCceEEEEEEEEcCCeEEEEEeec--Cceeecccccccc-------------------cccccccccccccccccccCCc Q lcl|NC_010179. 169 EAGKYFTVHEYWTDKEAQFFRTSA--TDSTVIEPYNIIT-------------------SYDLSAGYETGQSNTLKHNFGR 227 (469) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~g~ 227 (469) .....+.++++|-...+....... +............ .+..-.+......+..|.+.++ T Consensus 248 ~~~~rV~v~E~w~k~~~~~~~~~~~~g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~p~~~ 327 (714) T protein:vir:10 248 RERRRVLLQVVYYRTFERLPVIELSNGRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIREAWFVGPHFIVDRPCSAPQGM 327 (714) T ss_pred cCcceEEEEEEEEeEEEEEEeecCCCCCeeeeCccCHHHHHHHHhccceecccceeeEEEEEEecchhhhcCCCCCCCCc Confidence 011224455555433322222111 1111111100000 0000011111112233444456 Q ss_pred ccEEEecCC---c--cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh-hhcceeeecccC Q lcl|NC_010179. 
228 VPFIEFPKN---K--YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL-REYKSIKINNAG 301 (469) Q Consensus 228 vPvv~~~n~---~--~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~-~~~~~~~~~~~~ 301 (469) +|+|+|+-. . ...|.+..+++.++.+|...|.+...+ .++..++..|............. ++++++.+.+.. T Consensus 328 fp~vP~~g~~~~~~g~~~G~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~gav~~~d~~~~e~~~rp~~vi~~~~~~ 405 (714) T protein:vir:10 328 FPLVPFWGYRKDKTGEPYGLISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDATQLSDNDLMEQLERPDGIIKLNPVR 405 (714) T ss_pred eeeEEecceeeeccCccceehhhhhhHHHHHHHHHHHHHHHH--hCCceeeccccccccHHHHHHhccCCCCeEEecccc Confidence 777766532 1 245788999999999999999998876 34455555554433323333322 345677775542 Q ss_pred C-C--CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 302 N-G--DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (469) Q Consensus 302 ~-~--~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 377 (469) . + ..+.++......-..++...++.....|-..|+..+...+..+| .||+|+.................+..+.++ T Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~~na~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~ 485 (714) T protein:vir:10 406 KNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQDSGATSGVAISNLVEQGATTLAEINDNYQFACQQ 485 (714) T ss_pred cccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 1 12233433322234577777888888898888876655544444 699998877665555555555555556655 Q ss_pred HHHHHHH----Hhccc------C--CC---cc----------------------cceEEeCCCCCCCHHHHHHHHHHHhc Q lcl|NC_010179. 378 LVRAIMR----YLNFS------D--AD---KR----------------------HISQHWTRTKVEDSLTKAQIVSTVAN 420 (469) Q Consensus 378 ~~~~i~~----~~~~~------~--~~---~~----------------------~i~i~f~~~~p~d~~e~~~~~~kl~g 420 (469) +.++++. ++... + .. .. 
+|.+.=.+..+.-..+.++.+..+.+ T Consensus 486 ~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~ 565 (714) T protein:vir:10 486 VGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQ 565 (714) T ss_pred HHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCccccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHh Confidence 5555544 43211 0 00 00 11111122222323344444444433 Q ss_pred cCC-------hHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 421 YSS-------KEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 421 ~iS-------~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .++ ...+++.+.+ +..++-+++|.+-.-...+ .....-+| T Consensus 566 ~~~p~~~~~~~~~~le~~d~-p~~~ei~~~ir~~~~~~~~--------~~~~~~e~ 612 (714) T protein:vir:10 566 GLPPQVQAVVLDLWVNLLDV-PQKQEFVERIRAALGTPKS--------PDEMTPEE 612 (714) T ss_pred hcCchhhhhHHHHHHHhcCC-cCHHHHHHHHHHHcCCCCC--------ccccCcch Confidence 222 2334455543 3444445555432110000 00000000 No 89 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.75 E-value=1.8e-17 Score=112.37 Aligned_cols=452 Identities=11% Similarity=0.063 Sum_probs=217.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.+..+...... +.+-+....+..+||.|.|-- ....... 
...+++ .+.+|.++.+|+..+++.- T Consensus 22 ~~~~~~~~~~~~~~~-q~~~r~~a~~d~~fy~G~QW~----~~~~~~l-----~~~g~p--~~~~N~i~~~v~~v~g~~~ 89 (772) T protein:vir:10 22 LTVDEYADINYEIED-QPAWRAVADKEMDYADGNQLD----TELLRRQ-----QALGIP--PAVEDLIGPALLSLQGYEA 89 (772) T ss_pred cCHHHHHHHHHHHhc-cHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcCCC--cEEEcchHHHHHHHHHHHH Confidence 777777666555443 344455667888999998731 1111111 112222 4778999999999999987 Q ss_pred cCCeeec--cCc----hhhHHHHH----HHHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccceeEE Q lcl|NC_010179. 81 SVFPDID--VGK----DADNKKIL----DVLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPDQITP 146 (469) Q Consensus 81 g~p~~~~--~~~----~~~~~~l~----~~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~~~~~ 146 (469) .+.+.+. ... .+..+.|. .+++.+. ...+.....+++++|.+|+-++++.+ +.+++..++|..++. T Consensus 90 ~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i~i~~v~p~~v~~ 169 (772) T protein:vir:10 90 VTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPYRCRPIRRDEIHW 169 (772) T ss_pred hcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCeEEEeeCccccee Confidence 7765543 321 12233333 3333443 45677889999999999999998766 468899999999654 Q ss_pred EEeCCCCCce---EEEE-EE----------EEee---------------------------------------------- Q lcl|NC_010179. 147 VYATTLDNKL---LGVL-RS----------YKQL---------------------------------------------- 166 (469) Q Consensus 147 ~~d~~~~~~~---~~~v-~~----------~~~~---------------------------------------------- 166 (469) |+.....+ .++. +. |... T Consensus 170 --Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (772) T protein:vir:10 170 --DMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWNEARAWTVQED 247 (772) T ss_pred --cCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccchhhccccccc Confidence 54332211 1111 00 1000 Q ss_pred --ecCCceEEEEEEEEcCCeEEEEEeecCceeec--cccc-------------------ccccccccccccccccccccc Q lcl|NC_010179. 
167 --DPEAGKYFTVHEYWTDKEAQFFRTSATDSTVI--EPYN-------------------IITSYDLSAGYETGQSNTLKH 223 (469) Q Consensus 167 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-------------------~~~~~~~~~~~~~~~~~~~~~ 223 (469) .....+.++.+|+|-...+.+.......+..+ .... ....+..-.+......+..|. T Consensus 248 ~~~~~~~~rVrv~E~w~r~~~~~~~~~~~~g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv~~~~~~g~~~L~~~~~p~ 327 (772) T protein:vir:10 248 HWYNPTSKEICLVELWYRRWVQVHVLKSPDGRVVEYDPNNLAHNIALASGRISPKKVTVSRVRRSYWLGPHCLHDGPTPY 327 (772) T ss_pred cccccCCceEEEEEEeeeeeeeeeeeccCCCceEeeCcccHHHHHHHhhcccchheeeeeEEEEEEEecceeeccCCCCC Confidence 00001234455554333222221111111111 0000 000011111111222233344 Q ss_pred cCCcccEEEecCC-----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh-hhcceeee Q lcl|NC_010179. 224 NFGRVPFIEFPKN-----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL-REYKSIKI 297 (469) Q Consensus 224 ~~g~vPvv~~~n~-----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~-~~~~~~~~ 297 (469) +.+.+|+|+|+-. ....|.+.++++.++.+|...|.+...+...+ +..-.|........+...+ +.++++.+ T Consensus 328 ~~~~fP~vP~~g~r~~~~g~~~G~vr~~kd~Qr~~N~~~S~~~~~l~~~~--~~~~~gav~~~d~~~~e~~arp~~vi~~ 405 (772) T protein:vir:10 328 THRHFPYVPFFGFREDATGIPYGYVRGMKYAQDSLNSGVSKLRWGMSVAR--VERTKGAVAMTDAQFRRQIARPDADIVL 405 (772) T ss_pred CCCccceEEEeeeEeccCCcccchhhhhhhHHHHHHHHHHHHHHHHhccc--ccccCCCccchhHHHHHhccCCCCeEEe Confidence 5566777776532 22457889999999999999999988876544 3333443322222333333 34566677 Q ss_pred cccCCCC-CCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 298 NNAGNGD-KSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAI 375 (469) Q Consensus 298 ~~~~~~~-~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l 375 (469) .++..+. ++.+++.....-..++...++.....|-.+|+..+...+..|| .||+|+...-..........-.-+..+. 
T Consensus 406 ~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~ 485 (772) T protein:vir:10 406 DENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKGTATSGIQEQQQIEQSNQSIGRIMDNFRAGR 485 (772) T ss_pred CCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 6553222 2334444333334677777888888888888776554444454 6999987665554444444444455555 Q ss_pred HHH----HHHHHHHhcc------cCCCcc------------------------cceE-EeC---CCCCCCHH---HHHHH Q lcl|NC_010179. 376 NEL----VRAIMRYLNF------SDADKR------------------------HISQ-HWT---RTKVEDSL---TKAQI 414 (469) Q Consensus 376 ~~~----~~~i~~~~~~------~~~~~~------------------------~i~i-~f~---~~~p~d~~---e~~~~ 414 (469) ++. +.+|..++.. .+-+.. +|++ .+. ...|...+ +.++. T Consensus 486 ~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~ 565 (772) T protein:vir:10 486 TLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAYLSNDLLRTRIKVALEDVPSTNSYRGQQLNA 565 (772) T ss_pred HHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccceeccceeeeEEEEeeccccchHHHHHHHHH Confidence 554 4444444421 111100 1100 010 11222222 33334 Q ss_pred HHHHhccCChHHHH-------HhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc-------cCCCCCCCCC Q lcl|NC_010179. 415 VSTVANYSSKEAVA-------KANPIVDDWQQELKDLAKDREENDPYANQAD-------ELNGKGVDDE 469 (469) Q Consensus 415 ~~kl~g~iS~et~~-------~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-------~~~~~~~~de 469 (469) +..+.+.++.+... +.+. 
.+..++-.++|++-....++...+.+ .......+-+ T Consensus 566 m~ql~~~~~P~~~~~~~~~~le~~D-~p~~~ei~~~ir~~~~~~~peq~~~~~~q~~qq~~~~~~~el~ 633 (772) T protein:vir:10 566 MSEAVKSMPPQYQAAVLPFLVSLMD-VPFKRDVVEAIRAVDQQQTPEQIQQQIDQAVQDALAKAGNDIK 633 (772) T ss_pred HHHHHhccChhHHHHHHHHHHhhcC-CCChHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHH Confidence 44444434444322 2222 12222223333322111111000000 0000000000 No 90 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.66 E-value=1.1e-14 Score=97.22 Aligned_cols=433 Identities=10% Similarity=0.042 Sum_probs=202.4 Q ss_pred CCHHHHHHHHHHHHHHHHH----HHH-HHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRND----LIN-NYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQE 75 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~----~~~-~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~ 75 (469) |+-+++..++...+..-.. .+. ...+..+||.|+..-. ... -..+++.+.....|+.. T Consensus 10 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~y~g~~~~~---------------~~~--~~s~~~~~~v~~~v~~~ 72 (705) T protein:vir:88 10 MDDEQVLRHLDQLVNDALDFNSSELSKQRSEALKYYFGEPFGN---------------ERP--GKSGIVSRDVQETVDWI 72 (705) T ss_pred CCHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCCCCCc---------------ccC--CCCccccHHHHHHHHHH Confidence 9999888887766544332 332 4466778999974210 001 12457778888778877 Q ss_pred HHhh----hcCC--eeecc---CchhhHHHHHHHH-----hcc-HHHHHHHHHHHHHhCCeEEEEEEEcCC--------- Q lcl|NC_010179. 76 AGYI----ASVF--PDIDV---GKDADNKKILDVL-----GDD-RALTLNSLLVDSSNAGRAWLHYWIDED--------- 131 (469) Q Consensus 76 ~~~l----~g~p--~~~~~---~~~~~~~~l~~~~-----~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~~--------- 131 (469) ...| ||.+ +.+.+ +|....+.+..++ +.| ..+.+...+++++++|.+++.|||+.. 
T Consensus 73 ~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~~~e~~~ 152 (705) T protein:vir:88 73 MPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKPTFERFS 152 (705) T ss_pred HHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccchhhhhhc Confidence 7754 3433 34433 3333333333332 233 345677899999999999999988432 Q ss_pred ---------------------------------------CceEEEEEccceeEEEEeCCCCCceEEEEEEEEee--ec-- Q lcl|NC_010179. 132 ---------------------------------------NNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQL--DP-- 168 (469) Q Consensus 132 ---------------------------------------~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~--~~-- 168 (469) |++++..++|..+++-.+........+.++.+... +. T Consensus 153 ~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~dp~a~~~~d~~~~~~~~~~t~~dl~~ 232 (705) T protein:vir:88 153 GLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVDRLATCIDDARFLCHREKYTVSDLRL 232 (705) T ss_pred cCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceecCCCCCcccCcEEEEEEeccHHHHHh Confidence 56788889999987644322222222222221111 00 Q ss_pred CCc---e--EEE--------------------------EEEEEcCC---eEEEEEee----cCceeeccccccccccccc Q lcl|NC_010179. 169 EAG---K--YFT--------------------------VHEYWTDK---EAQFFRTS----ATDSTVIEPYNIITSYDLS 210 (469) Q Consensus 169 ~~~---~--~~~--------------------------~~~~~~~~---~~~~~~~~----~~~~~~~~~~~~~~~~~~~ 210 (469) .|. . -.. ....+.+. .+.+|.+. ..+.....++... + T Consensus 233 ~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~~--~--- 307 (705) T protein:vir:88 233 LGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRIL--Y--- 307 (705) T ss_pred hcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEEE--E--- Confidence 000 0 000 00000000 01111100 0000000000000 0 Q ss_pred ccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE-ecCCcccchh Q lcl|NC_010179. 
211 AGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL-TNYGGASLKQ 284 (469) Q Consensus 211 ~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~-~g~~~~~~~~ 284 (469) .. ..... .-++|.+|++.++ ....|.|.++.+.++++.+|..++.+.+.+..+.+|...+ .|.. +..+ T Consensus 308 -~g-~~il~--~~~~~~~PF~~~~~~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v--~~~d 381 (705) T protein:vir:88 308 -VG-DYIIS--NEPWDCRPFADLNAYRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQV--NLED 381 (705) T ss_pred -eC-ccccc--cccCCCCCEEEecceeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceecccccc--Cccc Confidence 00 00001 1245667777643 4456899999999999999999999999999999986554 2321 1111 Q ss_pred hhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc----c-CCccHHHHHHHHHH Q lcl|NC_010179. 285 FMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE----S-SNASGVAIKMLYSH 359 (469) Q Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~----~-g~~Sg~Al~~~~~~ 359 (469) ......++++.+... +.+.++..+.-..+....++.+...+...|++++...+. . ++.|+.|+..+... T Consensus 382 -~~~~~pg~vv~~~~~-----~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~Ta~~i~~~~~~ 455 (705) T protein:vir:88 382 -LLTNEAAGIVRVKSM-----NSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQAAMSVNQLMTA 455 (705) T ss_pred -ccccCCCeeEEecCC-----CccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchhhHHHHHHHHHH Confidence 122334555554432 346676555555667777889999999999999876542 1 34577777777777 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHHH----HHhccc------C----CC------cccceEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_010179. 360 LELKAAKTQTYFE-HAINELVRAIM----RYLNFS------D----AD------KRHISQHWTRTKVEDSLTKAQIVSTV 418 (469) Q Consensus 360 l~~k~~~~~~~~~-~~l~~~~~~i~----~~~~~~------~----~~------~~~i~i~f~~~~p~d~~e~~~~~~kl 418 (469) .........+.|. .++++++++++ .++... + .+ ..++.+.-... 
..+..+....+..+ T Consensus 456 ~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~-~~~~eq~~a~l~~l 534 (705) T protein:vir:88 456 AEQQIDLIARMFAETGVKRLFQLLHDHAIKYQNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIG-NMNKDQQMLHLMRI 534 (705) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCceEEeeccchhccchHhhccCCceEEeeccc-cchHHHHHHHHHHH Confidence 6666777777764 35555555444 443221 1 00 11222222211 11111222111111 Q ss_pred ----hc---------cCChHHH-------HHhCCCCCCHHH------HHHHHHHHHHHhhhhHh------hcccCCCCC- Q lcl|NC_010179. 419 ----AN---------YSSKEAV-------AKANPIVDDWQQ------ELKDLAKDREENDPYAN------QADELNGKG- 465 (469) Q Consensus 419 ----~g---------~iS~et~-------~~~l~~v~d~~~------E~eri~~E~~~~~~~~~------~~~~~~~~~- 465 (469) .. +++.... .+.++ +.++.. .++..+.+........+ +.+..-... T Consensus 535 l~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~-~k~~~~~~~~~~~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q 613 (705) T protein:vir:88 535 WEMAQAVVGGGGLGVLVSEQNLYNILKEVTENAG-YKDPDRFWTNPNSPEALQAKAIREQKEAQPKPEDIKAQADAQRAQ 613 (705) T ss_pred HHHHHHhhcccchhhhcChHHHHHHHHHHHHhhh-hhhHHHHhhhhhhHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHH Confidence 01 1111111 00111 011000 00111000000000000 000000000 Q ss_pred CCCC Q lcl|NC_010179. 466 VDDE 469 (469) Q Consensus 466 ~~de 469 (469) .+-+ T Consensus 614 ~e~~ 617 (705) T protein:vir:88 614 SDAL 617 (705) T ss_pred HHHH Confidence 0000 No 91 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.65 E-value=3e-16 Score=105.70 Aligned_cols=408 Identities=14% Similarity=0.051 Sum_probs=206.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----C-CcccccccchhhhcccccccccccC---cceeccchHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYEN----K-TDITTRNNGKPKVSKEGKKDPLRSA---DNRIPSNFYQLLV 72 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g----~-~~i~~~~~~~~~~~~~~~~~~~~~~---~~ri~~n~~k~iv 72 (469) =++++.++.. . ..+. ..+..|..| . ++...+... +.....++. 
..--.+.+++.|| T Consensus 2 ~~~~~a~~~~--~-~~~a------~~~~~~~~~~g~~~~~d~~~~~~~-------~~~~~~~~~~l~~lY~~~~l~r~iV 65 (461) T protein:vir:80 2 YSIDKAKQAK--I-DSKI------VNRNDFMVGHGKANSRDKLTRQTP-------GNGQKLDLKACENLYASNSIAMNIV 65 (461) T ss_pred ccchhhhhhh--h-hhhh------hhhhHHHhhcCCcchhhhhhcccc-------CcccccCHHHHHHHHHhCCccchhh Confidence 1222222111 0 0000 011122211 1 111110000 000000000 0001357888899 Q ss_pred HHHHHhhhcCCeeeccCchhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCC Q lcl|NC_010179. 73 DQEAGYIASVFPDIDVGKDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATT 151 (469) Q Consensus 73 ~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~ 151 (469) +..+..++.+++.+++++++..+.+..+|++ +....+.+..+++..+|.+++++-..+... ..|...-|+. +. T Consensus 66 d~~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~-----~~~~~~~pl~-~~ 139 (461) T protein:vir:80 66 DIISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNR-----EQADLSTAID-PK 139 (461) T ss_pred ccchHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCc-----cccCccCCcc-cc Confidence 9999999999999999988888888888875 455678899999999999998886533211 0111111111 00 Q ss_pred CCCceEEEEEEEEeeec----CCceEEEEEEEEcCCeEEEEEeecC-ceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDP----EAGKYFTVHEYWTDKEAQFFRTSAT-DSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) ....+....-+|..... ..+. ..-.++.+ .+|..... ...... ...........+. 
T Consensus 140 ~~~~~~~l~~~~~~~i~~~~~~~dp--~sp~fg~P---~~y~i~~~~~~~~~~--------------~~~~~~~~~~~iH 200 (461) T protein:vir:80 140 TIKSIPYINTFNTQKVTQLYLNQDM--FSEHFGEV---EFFEVNRVSQLGEEI--------------LSGTTASTSEQIH 200 (461) T ss_pred cccceeEEEeccccccchhhhcccC--cCcccccc---eEEEEeccccccccc--------------cccccCccceEEc Confidence 00111111001110000 0000 00001111 01110000 000000 0000000011122 Q ss_pred cccEEEecCCc-----cccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc---ccchhhhhhhh----hcce Q lcl|NC_010179. 227 RVPFIEFPKNK-----YRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG---ASLKQFMNDLR----EYKS 294 (469) Q Consensus 227 ~vPvv~~~n~~-----~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~---~~~~~~~~~~~----~~~~ 294 (469) .-++++|.+.+ .|.|.++.+.+.+.+++.+.-..+..+..+..+.+.+.|... +........+. ..++ T Consensus 201 ~SRii~~~~~~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~~~~~~~~~~~~~~~~g~ 280 (461) T protein:vir:80 201 RSRIIHEQGLRFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDDKANLTAMLDFMFRTEAL 280 (461) T ss_pred cccEEEecCCCCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchHHHHHHHHHHHhcCCceE Confidence 34667775543 489999999999999999998888888888888777665422 11111111111 1223 Q ss_pred eeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc---cCCccHHHHHHHHHHHHHHHHHHH-HH Q lcl|NC_010179. 295 IKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE---SSNASGVAIKMLYSHLELKAAKTQ-TY 370 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g~~Sg~Al~~~~~~l~~k~~~~~-~~ 370 (469) +.+... .+++ +.+.+.+.....++.+...|...+.+|-.-..+ .+++||..=...| ..+++.++ .. 
T Consensus 281 ~~~d~~-----e~~e--~~~~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~~~y---yd~i~~~qe~~ 350 (461) T protein:vir:80 281 AIIKGD-----EQLT--KESTNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDVMNY---YARVSSIQENR 350 (461) T ss_pred EEEcCC-----cceE--EEecCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHHHHH---HHHHHHHHHHH Confidence 222211 2233 445677788899999999999999999754433 3456777532233 33444444 56 Q ss_pred HHHHHHHHHHHHHHHhccc----CCCcccceEEeCCCCCCCHHHHHHHHHH-------H--hccCChHHHHHhC----CC Q lcl|NC_010179. 371 FEHAINELVRAIMRYLNFS----DADKRHISQHWTRTKVEDSLTKAQIVST-------V--ANYSSKEAVAKAN----PI 433 (469) Q Consensus 371 ~~~~l~~~~~~i~~~~~~~----~~~~~~i~i~f~~~~p~d~~e~~~~~~k-------l--~g~iS~et~~~~l----~~ 433 (469) ++..+++++.+++...... +.+..++++.|++-.+.+++|.|++..+ + +|++|.+++.+.+ +. T Consensus 351 l~p~le~l~~~i~~s~~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~ 430 (461) T protein:vir:80 351 LRPQLEYLTRLLMWASDDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGL 430 (461) T ss_pred HHHHHHHHHHHHHHHhcccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCC Confidence 7899999999888754332 2345688999999999999999886543 2 5789988876533 11 Q ss_pred CCC-----HHHHHHHHHHHHHHhhhhHhhcccCCCC Q lcl|NC_010179. 434 VDD-----WQQELKDLAKDREENDPYANQADELNGK 464 (469) Q Consensus 434 v~d-----~~~E~eri~~E~~~~~~~~~~~~~~~~~ 464 (469) .++ ...|.+.+.++.. 
.+ ..++.+++ T Consensus 431 ~~~~~~~~~~~~~~~~~~~~~--~~---~~~e~~~g 461 (461) T protein:vir:80 431 ENSSKFSGDSAEIDKLAKLVY--DA---YAKKNADG 461 (461) T ss_pred CCCccCCCCCchhhhhhhhcc--cc---ccccCCCC Confidence 111 1123333333211 00 01111111 No 92 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.61 E-value=7e-15 Score=98.17 Aligned_cols=451 Identities=7% Similarity=-0.038 Sum_probs=202.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ..++.+..-+........+-.....+-.+||.|.|-- ....... ...+ |..+|.++.+|+..+|+-- T Consensus 6 ~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~----~~~~~~l-----~~q~----rp~~N~i~~~i~~v~g~~~ 72 (725) T protein:vir:77 6 NRLESILSRFDADWTASDEARREAKNDLFFSRVSQWD----DWLSQYT-----TLQY----RGQFDVVRPVVRKLVSEMR 72 (725) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhCCCCCC----HHHHHHH-----HhcC----CCccccHHHHHHHHHhhHH Confidence 4444455445455555555556777889999998731 1111111 1112 2257999999999999876 Q ss_pred cCCeee--ccCc---hhhHHHHHHHH----hccH-HHHHHHHHHHHHhCCeEEEEEEEc---CC---CceEEEEE----c Q lcl|NC_010179. 81 SVFPDI--DVGK---DADNKKILDVL----GDDR-ALTLNSLLVDSSNAGRAWLHYWID---ED---NNFRYGII----Q 140 (469) Q Consensus 81 g~p~~~--~~~~---~~~~~~l~~~~----~~n~-~~~~~~~~~~~~~~G~~~~~v~~d---~~---~~~~i~~~----~ 140 (469) -+.+.+ .+.+ ....+.+..++ ..+. ......+..+++++|.+|+-|+.| ++ +.+.|... 
+ T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~~~~~ 152 (725) T protein:vir:77 73 QNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) T ss_pred hCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeecccC Confidence 665444 3322 22333333333 3333 445778899999999999888644 22 34444433 4 Q ss_pred cceeEEEEeCCCC------CceEEEEEEEEeee----------------------------cCCceEEEEEEEEcCCeEE Q lcl|NC_010179. 141 PDQITPVYATTLD------NKLLGVLRSYKQLD----------------------------PEAGKYFTVHEYWTDKEAQ 186 (469) Q Consensus 141 p~~~~~~~d~~~~------~~~~~~v~~~~~~~----------------------------~~~~~~~~~~~~~~~~~~~ 186 (469) |.+++ ||+... .+.++ ++.|...+ ......++.+++|....+. T Consensus 153 ~~~v~--~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~E~~~r~~~~ 229 (725) T protein:vir:77 153 CSHVI--WDSNSKLMDKSDARHCT-VIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred hhhce--eCchhhccChhhHHHHH-HHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCeeEEEEEEEEEEEe Confidence 44444 343221 11111 11111100 0011234566666544332 Q ss_pred EEE--eec---Cceeeccccccc-----------------------ccccccccccccccccccccCCcccEEEecC--- Q lcl|NC_010179. 187 FFR--TSA---TDSTVIEPYNII-----------------------TSYDLSAGYETGQSNTLKHNFGRVPFIEFPK--- 235 (469) Q Consensus 187 ~~~--~~~---~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n--- 235 (469) ... ... +.....+..... ..|....+... ..++.+.+-+.+|+|+|.- T Consensus 230 ~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~-l~~~~~~~~~~~P~vP~~g~r~ 308 (725) T protein:vir:77 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAV-LKDKQLIAGEHIPIVPVFGEWG 308 (725) T ss_pred eEEEEecCCCCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCcee-eccCCcCCCCccceEEEeeeee Confidence 211 111 111111111100 00111111111 1122233335566665432 Q ss_pred ----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE-ecCCcccchhhhhhhhhcceeee----cccCCCCCC Q lcl|NC_010179. 
236 ----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL-TNYGGASLKQFMNDLREYKSIKI----NNAGNGDKS 306 (469) Q Consensus 236 ----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~-~g~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~ 306 (469) .+.+.|.+.++++.++.+|...|.+...+...+....++ .|.. +..............+.. ...|....+ T Consensus 309 ~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 387 (725) T protein:vir:77 309 FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENSGDLPTQ 387 (725) T ss_pred ccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhh-hHHHHHHHhccCCceecccccccCCCccccc Confidence 123448889999999999999999998887665443222 2211 111111111111111100 111111122 Q ss_pred cceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q lcl|NC_010179. 307 GVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIM-- 383 (469) Q Consensus 307 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~-- 383 (469) .+.....+.=..++...++.....|-..|+..+...+..+| .||+|+...-......+...-.-+..+.++..++++ T Consensus 388 ~i~~~~~~~lp~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~~~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:77 388 PLAYYENPEVPQANAYMLEAATSAVKEVATLGVDTEAVNGGQVAFDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred CccccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 34444333223456667888888888888766554544444 799999887766666666566666666666555444 Q ss_pred --HHhcc------cCCC----c------------------------ccceEEeCCCCCCCHHHHHHHHHHHhccCCh--- Q lcl|NC_010179. 384 --RYLNF------SDAD----K------------------------RHISQHWTRTKVEDSLTKAQIVSTVANYSSK--- 424 (469) Q Consensus 384 --~~~~~------~~~~----~------------------------~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~--- 424 (469) .++.. .+-+ + .++.|.=.+..+.=..+.++.++.+...++. 
T Consensus 468 I~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~ 547 (725) T protein:vir:77 468 VNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) T ss_pred HHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeeccchHHHHHHHHHHHHHHHHhccccch Confidence 44321 1100 0 1222222222222222334444443322221 Q ss_pred ---HHHHHhCCCC--CCHHHHHHHHHHHHHHhhhhHhhc----------ccCCCCCCCCC Q lcl|NC_010179. 425 ---EAVAKANPIV--DDWQQELKDLAKDREENDPYANQA----------DELNGKGVDDE 469 (469) Q Consensus 425 ---et~~~~l~~v--~d~~~E~eri~~E~~~~~~~~~~~----------~~~~~~~~~de 469 (469) -++...++.. +..++.+++++++........... ........+-| T Consensus 548 ~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~e~q~~~~~qq~~~~q~~~e 607 (725) T protein:vir:77 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPA 607 (725) T ss_pred hHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChhhHHHHHHHHHHHHHhHHHH Confidence 1122222221 122334555554322211100000 00000000000 No 93 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=99.59 E-value=2.1e-14 Score=95.62 Aligned_cols=452 Identities=8% Similarity=-0.039 Sum_probs=203.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ..++.+..-+........+-.....+-.+||.|.|-- ....... 
...++ ..+|.++.+|+..+++-- T Consensus 6 ~~~~~~~~~~~~~~~~~~~~R~~a~~d~~fy~G~QW~----~~~~~~l-----~~q~r----p~~N~i~~~v~~v~g~e~ 72 (725) T protein:vir:10 6 NRLESILSRFDADWTASDEARREAKNDLFFSRVSQWD----DWLSQYT-----TLQYR----GQFDVVRPVVRKLVSEMR 72 (725) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcCC----CcccchHHHHHHHHhhHH Confidence 3444444444444455555556777889999998731 1111111 11122 247999999999999976 Q ss_pred cCCeee--ccCc---hhhHHHHHHHH----hccH-HHHHHHHHHHHHhCCeEEEEEEEc---CC---CceEEEEE----c Q lcl|NC_010179. 81 SVFPDI--DVGK---DADNKKILDVL----GDDR-ALTLNSLLVDSSNAGRAWLHYWID---ED---NNFRYGII----Q 140 (469) Q Consensus 81 g~p~~~--~~~~---~~~~~~l~~~~----~~n~-~~~~~~~~~~~~~~G~~~~~v~~d---~~---~~~~i~~~----~ 140 (469) -+.+.+ .+.+ ....+.+..++ ..+. ......+..+++++|.+|+-|..| ++ +.+.|..+ + T Consensus 73 ~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~ 152 (725) T protein:vir:10 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) T ss_pred hCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCCceeeeeeecccC Confidence 655444 3322 22333333333 3333 345778889999999999888533 22 33444433 3 Q ss_pred cceeEEEEeCCCC------CceEEEEEEEEee----------ec------------------CCceEEEEEEEEcCCeEE Q lcl|NC_010179. 141 PDQITPVYATTLD------NKLLGVLRSYKQL----------DP------------------EAGKYFTVHEYWTDKEAQ 186 (469) Q Consensus 141 p~~~~~~~d~~~~------~~~~~~v~~~~~~----------~~------------------~~~~~~~~~~~~~~~~~~ 186 (469) +.+++ ||+... .+.++ ++.|... +. -....++.+++|....+. 
T Consensus 153 ~~~v~--~Dp~a~~~D~sDar~~~-~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~ 229 (725) T protein:vir:10 153 CSHVI--WDSNSKLMDKSDARHCT-VIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred HhHcc--cCchhhccChhhhhhhh-hhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeEEEEEEEEEEEEe Confidence 44454 443221 11111 1111110 00 011223445555433222 Q ss_pred E--EEeec---Cceeecccccc-----------------------cccccccccccccccccccccCCcccEEEecC--- Q lcl|NC_010179. 187 F--FRTSA---TDSTVIEPYNI-----------------------ITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK--- 235 (469) Q Consensus 187 ~--~~~~~---~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n--- 235 (469) . +...+ +.......... ...|....+.. ...++.+.+-+.+|+|+|.- T Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~-~l~~~~~~~~~~fP~vP~~g~r~ 308 (725) T protein:vir:10 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTA-VLKDKQLIAGEHIPIVPVFGEWG 308 (725) T ss_pred eEEEEeccCCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchh-hhcCCCCCCCCceeEEEEEeeee Confidence 1 11111 11111111110 00011111111 11222233335566665532 Q ss_pred ----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeeccc----CCCCCCc Q lcl|NC_010179. 236 ----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNA----GNGDKSG 307 (469) Q Consensus 236 ----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 307 (469) .+.+.|.+.++++.++.+|...|.+...+...+...........+..............+.+... |.-..+. T Consensus 309 ~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~~~~~~~~~~~~~~~~~~~~~g~~~~~~ 388 (725) T protein:vir:10 309 FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEHMYDGNDDYPYYLLNRTDENNGEMPTQP 388 (725) T ss_pred ccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHHHHhccCCceeeecccccccCccccccc Confidence 12344889999999999999999999888765554333221111111111111111111111111 1111123 Q ss_pred ceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----H Q lcl|NC_010179. 
308 VDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA----I 382 (469) Q Consensus 308 ~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~----i 382 (469) +.+...+.-..++...++.....|-..|+..+...+..|| .||+|+...-............-+..+.++..++ | T Consensus 389 i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~lI 468 (725) T protein:vir:10 389 LAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSIV 468 (725) T ss_pred CcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhCcCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4444433334566778888888998888876655544444 7999998877666555555555555565555444 4 Q ss_pred HHHhccc------CCC----------------------------cccceEEeCCCCCCCHHHHHHHHHHHhccCCh---- Q lcl|NC_010179. 383 MRYLNFS------DAD----------------------------KRHISQHWTRTKVEDSLTKAQIVSTVANYSSK---- 424 (469) Q Consensus 383 ~~~~~~~------~~~----------------------------~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~---- 424 (469) ..+++.. +-+ ..++.|.=.+..+.-..+.++.++.+...++. T Consensus 469 ~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~qll~~~~~~~~~ 548 (725) T protein:vir:10 469 NDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRSEILELLGKTPQGTPE 548 (725) T ss_pred HHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeeccCcHHHHHHHHHHHHHHHHhccccchh Confidence 4444211 100 01223333333332233444444444333321 Q ss_pred --HHHHHhCCC--CCCHHHHHHHHHHHHHHhhhhHhh----------cccCCCCCCCCC Q lcl|NC_010179. 425 --EAVAKANPI--VDDWQQELKDLAKDREENDPYANQ----------ADELNGKGVDDE 469 (469) Q Consensus 425 --et~~~~l~~--v~d~~~E~eri~~E~~~~~~~~~~----------~~~~~~~~~~de 469 (469) .+++..++. .+..++-.++++++.......... 
.........+.| T Consensus 549 ~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~q~~~e 607 (725) T protein:vir:10 549 YQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPA 607 (725) T ss_pred HHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHhhhHHH Confidence 222232322 222333455555432221110000 000000000000 No 94 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.53 E-value=6.3e-14 Score=92.94 Aligned_cols=451 Identities=8% Similarity=-0.027 Sum_probs=198.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ..++.+..-+........+-.....+-.+||.|.|-- ....... ...+ |..+|.++.+|+..+++-- T Consensus 6 ~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~----~~~~~~l-----~~q~----rp~~N~i~~~i~~v~g~e~ 72 (725) T protein:vir:92 6 NRLESILSRFDADWTASDEARREAKNDLFFSRISQWD----DWLSQYT-----TLQY----RGQFDVVRPVVRKLVSEMR 72 (725) T ss_pred HHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCC----HHHHHHH-----HhcC----CCcccchHHHHHHHHhhHH Confidence 3344444444444444455555677889999998731 1111111 1112 2247999999999999876 Q ss_pred cCCee--eccCc---hhhHHHHHHHH----hccH-HHHHHHHHHHHHhCCeEEEEEEEc---CC---CceEEEEE---cc Q lcl|NC_010179. 81 SVFPD--IDVGK---DADNKKILDVL----GDDR-ALTLNSLLVDSSNAGRAWLHYWID---ED---NNFRYGII---QP 141 (469) Q Consensus 81 g~p~~--~~~~~---~~~~~~l~~~~----~~n~-~~~~~~~~~~~~~~G~~~~~v~~d---~~---~~~~i~~~---~p 141 (469) -+.+. +.+.+ ....+.+..++ ..+. ......+..+++++|.+|+-|+.| ++ +.+.|... 
+| T Consensus 73 ~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~~~~i~~~~i~~~ 152 (725) T protein:vir:92 73 QNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSNNQVIRREPIHSA 152 (725) T ss_pred hCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCCceeeEEeeccCC Confidence 55544 33322 22333333333 3333 455778899999999999888643 22 34444432 23 Q ss_pred c-eeEEEEeCCCC------CceEEEEEEEEeee---------------------c-------CCceEEEEEEEEcCCeEE Q lcl|NC_010179. 142 D-QITPVYATTLD------NKLLGVLRSYKQLD---------------------P-------EAGKYFTVHEYWTDKEAQ 186 (469) Q Consensus 142 ~-~~~~~~d~~~~------~~~~~~v~~~~~~~---------------------~-------~~~~~~~~~~~~~~~~~~ 186 (469) . +++ ||+... .+.++ ++.|...+ . -....++.+++|....+. T Consensus 153 ~~~V~--~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~~~~ 229 (725) T protein:vir:92 153 CSHVI--WDSNSKLMDKSDSRHCT-VIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTIQIAEFYEVVEKK 229 (725) T ss_pred hhhcc--cCchhhccChhhHHHHH-HHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeEEEEEEEEEEEEe Confidence 2 233 343221 11111 11111100 0 011234455665543322 Q ss_pred E--EEee---cCceeecccccc-----------------------cccccccccccccccccccccCCcccEEEecC--- Q lcl|NC_010179. 187 F--FRTS---ATDSTVIEPYNI-----------------------ITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK--- 235 (469) Q Consensus 187 ~--~~~~---~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n--- 235 (469) . +... ++.......... ...|....+. ....++.+.+-+.+|+|+|.- T Consensus 230 ~~~~~~~d~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~-~~l~~~~~~~~~~~P~vP~~g~r~ 308 (725) T protein:vir:92 230 ETAFIYQDPVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCT-AVLKDKQLIAGEHIPIVPVFGEWG 308 (725) T ss_pred eeEEeecCCCCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecch-hhhcCCCCCCCCceeeEEEEeeee Confidence 1 1111 111111111110 0011111111 111222233335566665532 Q ss_pred ----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE-ecCCcccchhhhhhhhhcceeeeccc----CCCCCC Q lcl|NC_010179. 
236 ----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL-TNYGGASLKQFMNDLREYKSIKINNA----GNGDKS 306 (469) Q Consensus 236 ----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 306 (469) .+.+.|.+.++++.++.+|...|.+...+...+....++ .|.. +..............+..... |.-... T Consensus 309 ~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~ 387 (725) T protein:vir:92 309 FVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQI-AGFEHMYDGNDDYPYYLLNRTDENNGEMPTQ 387 (725) T ss_pred ccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhh-hHHHHHHhccCccceeecccccccccccccc Confidence 123448899999999999999999998887665543322 1211 111111111111111111111 111112 Q ss_pred cceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHH Q lcl|NC_010179. 307 GVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS-NASGVAIKMLYSHLELKAAKTQTYFEHAINEL----VRA 381 (469) Q Consensus 307 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~----~~~ 381 (469) .+++...+.-..++...++.....|-..|+..+...+..+ +.||+|+...-............-+..+.++. +.+ T Consensus 388 ~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~~SG~ai~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~l 467 (725) T protein:vir:92 388 PLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQVAYDTVNQLNMRADLETYVFQDNLATAMRRDGEIYQSI 467 (725) T ss_pred CCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3444443333456777888888899888887654444333 47999998776555555454555555555554 444 Q ss_pred HHHHhccc------CCC----------------------------cccceEEeCCCCCCCHHHHHHHHHHHhccCCh--- Q lcl|NC_010179. 382 IMRYLNFS------DAD----------------------------KRHISQHWTRTKVEDSLTKAQIVSTVANYSSK--- 424 (469) Q Consensus 382 i~~~~~~~------~~~----------------------------~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~--- 424 (469) |..++... +-+ ..++.|.=.+..+.-..+.+..++.+...++. 
T Consensus 468 I~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~~~s~r~~~~~~l~ql~~~~~~~~~ 547 (725) T protein:vir:92 468 VNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPSFQSMKQQNRAEILELLGKTPQGTP 547 (725) T ss_pred HHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccChHHHHHHHHHHHHHHHHhcccchh Confidence 44444211 100 01222222232222223334444444333321 Q ss_pred ---HHHHHhCCC--CCCHHHHHHHHHHHHHHhhhh----------HhhcccCCCCCCCCC Q lcl|NC_010179. 425 ---EAVAKANPI--VDDWQQELKDLAKDREENDPY----------ANQADELNGKGVDDE 469 (469) Q Consensus 425 ---et~~~~l~~--v~d~~~E~eri~~E~~~~~~~----------~~~~~~~~~~~~~de 469 (469) -++...++. .+...+..++++++....... ..+.........+.| T Consensus 548 ~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~~~qqa~~~q~~~e 607 (725) T protein:vir:92 548 EYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQGQQDPA 607 (725) T ss_pred HHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccchhhhHHHHHHHHHHHhhhHHH Confidence 111222221 122233345554332111100 000000000000000 No 95 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.50 E-value=1.5e-13 Score=90.92 Aligned_cols=458 Identities=8% Similarity=0.012 Sum_probs=198.4 Q ss_pred CCHH---HHHHHHHHHHHHH---HHHHHHHHHHHHHh--ccCCcccccccchhhhcccccccccccCcceeccchHHHHH Q lcl|NC_010179. 1 MELD---ALKKLIRNTSTSR---NDLINNYKKSVDYY--ENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLV 72 (469) Q Consensus 1 ~~~~---~~~~~i~~~~~~~---~~~~~~~~~~~~Yy--~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv 72 (469) |.-+ -+.++...+.... .+-+.....-++|| .|+|-- ... ....... 
....++ ..+.+|.++.+| T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~D~~f~~~~G~QW~--~~~--~~~l~~~-~q~~gr--P~~~~N~i~~~v 73 (708) T protein:vir:10 1 MAETLEKKHERIMLRFDRAYSPQKEVREKCIEATRFARVPGGQWE--GAT--AAGTKLD-EQFEKY--PKFEINKVATEL 73 (708) T ss_pred CchhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcCCCCCCC--HHH--HHHHHHh-hhhcCC--CceEEcchHHHH Confidence 4322 2234443333322 22333343444566 476621 000 0000000 000011 246789999999 Q ss_pred HHHHHhhhcCCeee--ccCc----hhhHHHHHH----HHhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---------C Q lcl|NC_010179. 73 DQEAGYIASVFPDI--DVGK----DADNKKILD----VLGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---------N 132 (469) Q Consensus 73 ~~~~~~l~g~p~~~--~~~~----~~~~~~l~~----~~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---------~ 132 (469) +..+++--.+.+.+ .+.+ .+..+.|.. +.+.+. ......+..+++++|.+|+.++.|.. . T Consensus 74 ~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~~~~~~ 153 (708) T protein:vir:10 74 NRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQ 153 (708) T ss_pred HHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCCCCCcc Confidence 99999987776554 3322 222333333 333443 44677889999999999998865421 1 Q ss_pred ceEEE-EEcc-ceeEEEEeCCCC------CceEEEEEE---------EEeee---------------cCCceEEEEEEEE Q lcl|NC_010179. 133 NFRYG-IIQP-DQITPVYATTLD------NKLLGVLRS---------YKQLD---------------PEAGKYFTVHEYW 180 (469) Q Consensus 133 ~~~i~-~~~p-~~~~~~~d~~~~------~~~~~~v~~---------~~~~~---------------~~~~~~~~~~~~~ 180 (469) ++.+. +.+| ..++ ||+... .+.++-.++ |.... 
.-+.......+|| T Consensus 154 ~i~i~~~~~p~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v~ey~ 231 (708) T protein:vir:10 154 RIAIEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYIAKYY 231 (708) T ss_pred ccceEEeecchhhcc--cCccccccChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEEEEee Confidence 23332 2334 3333 333211 111111111 11000 0011223445555 Q ss_pred cCCeEEE----EEe-ecCceeeccccccc-------------------c----cccccccccccccccccccCCcccEEE Q lcl|NC_010179. 181 TDKEAQF----FRT-SATDSTVIEPYNII-------------------T----SYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 181 ~~~~~~~----~~~-~~~~~~~~~~~~~~-------------------~----~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) ....+.. +.. .++........... . .+.... ......+..+-+++.+|+|+ T Consensus 232 ~r~~~~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~-g~~~le~~~~~p~~~fP~vP 310 (708) T protein:vir:10 232 EVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVD-GDGFLEKPRRIPGEHIPLIP 310 (708) T ss_pred eEEEEEEEEEEEecCCCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeec-chhhhccCCCCCCCceeeEE Confidence 4332211 111 11111111110000 0 000101 11112234455667788888 Q ss_pred ecC-------CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh-hh-hhhhcceeeeccc--- Q lcl|NC_010179. 233 FPK-------NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF-MN-DLREYKSIKINNA--- 300 (469) Q Consensus 233 ~~n-------~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~-~~-~~~~~~~~~~~~~--- 300 (469) |.- .+...|.+.++++.++.+|...|.+.+.+......+.++....-...... .. +........+... 
T Consensus 311 ~~g~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 390 (708) T protein:vir:10 311 VYGKRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDK 390 (708) T ss_pred EeeeeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHHHHhhccccchhhhcccccccc Confidence 752 12336889999999999999999999988877665444321110000000 00 0000000000000 Q ss_pred -CC-CCCC-cceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 301 -GN-GDKS-GVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (469) Q Consensus 301 -~~-~~~~-~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 377 (469) |. ...+ ....+..+.-..++...++.....|-.+|+..+...+.-+|.||+|+...-..........-.-+..+.++ T Consensus 391 ~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~sn~SG~aI~~rq~qg~~~l~~~~Dnl~~~~~~ 470 (708) T protein:vir:10 391 SGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLKR 470 (708) T ss_pred ccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 12233333345567777888888888888776655544567899999887766666666666666666666 Q ss_pred HHHHHHH----Hhcc------cCCC-----------------------------cccceEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_010179. 378 LVRAIMR----YLNF------SDAD-----------------------------KRHISQHWTRTKVEDSLTKAQIVSTV 418 (469) Q Consensus 378 ~~~~i~~----~~~~------~~~~-----------------------------~~~i~i~f~~~~p~d~~e~~~~~~kl 418 (469) ..++++. ++.. 
.+-+ ..+|.|.=.+..+.-..+.++.++.+ T Consensus 471 ~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~i~~~p~~~s~r~~~~~~l~ql 550 (708) T protein:vir:10 471 AGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTNV 550 (708) T ss_pred HHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEEEecccCchhHHHHHHHHHHHH Confidence 5555544 4321 1100 00222322334444444556666655 Q ss_pred hccCCh---HH------HHHhCCCCCCHHHHHHHHHHHHHHh-------------------------hhhHhh--ccc-- Q lcl|NC_010179. 419 ANYSSK---EA------VAKANPIVDDWQQELKDLAKDREEN-------------------------DPYANQ--ADE-- 460 (469) Q Consensus 419 ~g~iS~---et------~~~~l~~v~d~~~E~eri~~E~~~~-------------------------~~~~~~--~~~-- 460 (469) .+.++. .+ +++.+. .+..++=.+||++..... .+...+ ... T Consensus 551 l~~~~p~~~~~~~~~~~~l~~~D-~p~~~ei~erir~~~~~~~~~~~~~~ee~q~~~~~q~~~q~q~~~~~~e~qa~~~~ 629 (708) T protein:vir:10 551 LSSMLPTDPMRPAIQGIILDNID-GEGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMVA 629 (708) T ss_pred HHhcCCCchhhHHHHHHHHHhcC-CcChHHHHHHHHHhhcccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433321 11 222222 122223344443321000 000000 000 Q ss_pred --CCCCCCCCC Q lcl|NC_010179. 461 --LNGKGVDDE 469 (469) Q Consensus 461 --~~~~~~~de 469 (469) ........+ T Consensus 630 ~qAe~~ka~a~ 640 (708) T protein:vir:10 630 AQAEAQKATNE 640 (708) T ss_pred HHHHHHHHHHH Confidence 000000000 No 96 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=99.49 E-value=2.6e-12 Score=84.14 Aligned_cols=456 Identities=8% Similarity=0.004 Sum_probs=203.0 Q ss_pred CCHH---HHHHHHHHHHHHH---HHHHHHHHHHHHHh--ccCCcccccccchhhhcccccccccccCcceeccchHHHHH Q lcl|NC_010179. 
1 MELD---ALKKLIRNTSTSR---NDLINNYKKSVDYY--ENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLV 72 (469) Q Consensus 1 ~~~~---~~~~~i~~~~~~~---~~~~~~~~~~~~Yy--~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv 72 (469) |.-+ -+.++...+.... .+...+...-++|| .|.|--. . ........ ....++| .+.+|..+.+| T Consensus 1 m~e~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW~~--~--~~~~l~~~-~q~~grP--~~~~N~i~~~v 73 (706) T protein:vir:10 1 MAESRQKQHERVMLRFDRAWSPQQVVREKCIEATRFVRVPGGQWEG--A--TVAGTKLD-EQFEKYP--KFEINKVATEL 73 (706) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCccCCH--H--HHHHHHhh-hhhcCCC--ceEecchHHHH Confidence 6553 3344444443322 34444455566777 4665211 1 11111000 0001112 57789999999 Q ss_pred HHHHHhhhcCCeeec--cCc----hhhHHHHHHHH----hccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---------C Q lcl|NC_010179. 73 DQEAGYIASVFPDID--VGK----DADNKKILDVL----GDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---------N 132 (469) Q Consensus 73 ~~~~~~l~g~p~~~~--~~~----~~~~~~l~~~~----~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---------~ 132 (469) +..+++.--+.+.+. +.. .+..+.+..++ ..+. ......+..+++++|.+|+-+..|.. + T Consensus 74 ~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~~~~~~ 153 (706) T protein:vir:10 74 NRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDPMDERQ 153 (706) T ss_pred HHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCCCCCCc Confidence 999999876665543 211 22333333333 3343 44577889999999999998865421 2 Q ss_pred ceEEEEE-ccceeEEEEeCCC------CCceEEEEEE---------EEeee--------------cCCceEEEEEEEEcC Q lcl|NC_010179. 133 NFRYGII-QPDQITPVYATTL------DNKLLGVLRS---------YKQLD--------------PEAGKYFTVHEYWTD 182 (469) Q Consensus 133 ~~~i~~~-~p~~~~~~~d~~~------~~~~~~~v~~---------~~~~~--------------~~~~~~~~~~~~~~~ 182 (469) ++.+..+ +|... +.||+.. +.+.++..++ |.... ..........++|.. 
T Consensus 154 ~i~i~~v~~p~~~-v~~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~~~eyy~~ 232 (706) T protein:vir:10 154 RIAVEPIYDPARS-VWFDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVYIAKYYEV 232 (706) T ss_pred cceeeeeccchhc-eecCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcceecccccc Confidence 3444433 56532 2345421 1111221111 11100 000112344555654 Q ss_pred CeE----EEEEee-cCceeeccccccccccc-----------------------ccccccccccccccccCCcccEEEec Q lcl|NC_010179. 183 KEA----QFFRTS-ATDSTVIEPYNIITSYD-----------------------LSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 183 ~~~----~~~~~~-~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) ... .+|... +......+......... ...+ .....+..|-+.+++|+|+|. T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g-~~~l~~~~p~~~~~~P~vP~~ 311 (706) T protein:vir:10 233 RKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDG-DGFLEKPRRIPGEHIPLIPVY 311 (706) T ss_pred cceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeecc-ccccccCCCCCCCccceEEEe Confidence 422 122211 11111111111111100 0011 111112233444788888775 Q ss_pred CC-------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhc-----ceeeecccCC Q lcl|NC_010179. 235 KN-------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREY-----KSIKINNAGN 302 (469) Q Consensus 235 n~-------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~-----~~~~~~~~~~ 302 (469) -. ....|.+.++++.++.+|..+|.+.+.+...... ...|.. ++.......+... ..+.+.+.+. T Consensus 312 g~r~~~d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~--~~~~~~-~~i~~~~~~~~~~~~~~~~~l~~~~~~~ 388 (706) T protein:vir:10 312 GKRWFIDDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQ--TPIVDM-EQIRGLEQHWEGRNRKRPAFLPLRTVTD 388 (706) T ss_pred eccccccccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCc--ccccch-hHHHHHHHHhhhcccccccchhcccccC Confidence 32 2246888999999999999999999987554442 223321 1111111111100 0111111111 Q ss_pred CCC------CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
303 GDK------SGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAIN 376 (469) Q Consensus 303 ~~~------~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 376 (469) ..+ ....++..+.-..++...++.....|-.+|+..+...+.-||.||+|+...-..........-.-+..+.+ T Consensus 389 ~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~ 468 (706) T protein:vir:10 389 KTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSNVARETVNSLLNRSDMASFIYLDNMAKSLK 468 (706) T ss_pred CCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 12233333333355666777878888888887665555557899999988776666666666666666666 Q ss_pred HHHHHHHHH----hcc------cCCCc-----------------------------ccceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_010179. 377 ELVRAIMRY----LNF------SDADK-----------------------------RHISQHWTRTKVEDSLTKAQIVST 417 (469) Q Consensus 377 ~~~~~i~~~----~~~------~~~~~-----------------------------~~i~i~f~~~~p~d~~e~~~~~~k 417 (469) +..++++.+ +.. .+-+. .+|.|.=.+..+.-..+.++.++. T Consensus 469 ~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t~r~~~~~~m~e 548 (706) T protein:vir:10 469 RAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSARRDATVNALTQ 548 (706) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcchHHHHHHHHHHH Confidence 665554444 321 11000 112222233344434455555555 Q ss_pred Hhcc-CC--hHH------HHHhCCCCCCHHHHHHHHHHHHHHhhh-------------hHhhcc--------------cC Q lcl|NC_010179. 418 VANY-SS--KEA------VAKANPIVDDWQQELKDLAKDREENDP-------------YANQAD--------------EL 461 (469) Q Consensus 418 l~g~-iS--~et------~~~~l~~v~d~~~E~eri~~E~~~~~~-------------~~~~~~--------------~~ 461 (469) +.+. .+ ..+ +++.+++ +..++-.++|++....... ..++.+ .. 
T Consensus 549 l~~~~~p~~~~~~~l~~~~~~~~d~-p~~~e~~e~irk~~~~q~~~~~~~~~eq~~~~q~qq~q~~q~~~~~~~~~aq~~ 627 (706) T protein:vir:10 549 LLQGMLPQDPMRPALMGIIIDNMEG-EGLDDFKAFNRRQLLTQGIVKPRNQQEQAIVQQAQQAQATQPDPNMLLAQAQMV 627 (706) T ss_pred HHHhcCCcchhhHHHHHHHHhhcCc-cchHHHHHHHHHhhcccCCccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4332 21 122 2333322 2222234444332110000 000000 00 Q ss_pred CC----CCCCCC Q lcl|NC_010179. 462 NG----KGVDDE 469 (469) Q Consensus 462 ~~----~~~~de 469 (469) .. .....+ T Consensus 628 ~~qA~~~k~~a~ 639 (706) T protein:vir:10 628 VAQAEAQKSQNE 639 (706) T ss_pred HHHHHHHHHHHH Confidence 00 000000 No 97 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=99.47 E-value=4.5e-12 Score=82.81 Aligned_cols=455 Identities=8% Similarity=-0.003 Sum_probs=198.9 Q ss_pred CCHH---HHHHHHHHHHHHH---HHHHHHHHHHHHHhc--cCCcccccccchhhhcccccccccccCcceeccchHHHHH Q lcl|NC_010179. 1 MELD---ALKKLIRNTSTSR---NDLINNYKKSVDYYE--NKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLV 72 (469) Q Consensus 1 ~~~~---~~~~~i~~~~~~~---~~~~~~~~~~~~Yy~--g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv 72 (469) |.-+ -+.++...+...+ .+-+.....-++||. |+|- ...... ..+. .....++| .+.+|.++.+| T Consensus 1 ma~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~d~~f~~~~G~QW--~~~~~~--~~~~-~l~~~~~P--~~~~N~i~~~v 73 (720) T protein:vir:35 1 MAETLQKRHEQIMRKFDRAHSPQEAVREKCLEATRFARVPGGQW--EGATAA--GSEL-GKHFEKYP--KFEINKISTEL 73 (720) T ss_pred CchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhccCCCCC--CHHHHH--HHHH-HHhhCCCC--eEEEccHHHHH Confidence 6543 2344444444333 222333445667775 6552 111100 0000 00111122 36689999999 Q ss_pred HHHHHhhhcCCeee--ccCc----hhhHHHHHHH----HhccH-HHHHHHHHHHHHhCCeEEEEEEEcCC---------C Q lcl|NC_010179. 
73 DQEAGYIASVFPDI--DVGK----DADNKKILDV----LGDDR-ALTLNSLLVDSSNAGRAWLHYWIDED---------N 132 (469) Q Consensus 73 ~~~~~~l~g~p~~~--~~~~----~~~~~~l~~~----~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d~~---------~ 132 (469) +..+++---+.+.+ .+.+ ....+.|..+ .+.+. ......++.+++++|.+|.-|+.|.+ + T Consensus 74 ~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~~~~~~ 153 (720) T protein:vir:35 74 NRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDPMDERQ 153 (720) T ss_pred HHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCCCcccc Confidence 99999986665544 3322 2223333333 33343 34577889999999999999976521 1 Q ss_pred ceEEEEE--ccceeEEEEeCCCC------CceEEEEEEEEeee------------------------cCCceEEEEEEEE Q lcl|NC_010179. 133 NFRYGII--QPDQITPVYATTLD------NKLLGVLRSYKQLD------------------------PEAGKYFTVHEYW 180 (469) Q Consensus 133 ~~~i~~~--~p~~~~~~~d~~~~------~~~~~~v~~~~~~~------------------------~~~~~~~~~~~~~ 180 (469) .+++..+ ++..++ ||+... .+..+ ++.|...+ .-....++.+++| T Consensus 154 ~i~i~~v~~~~~~v~--~Dp~a~~~D~sDar~~~-~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~E~~ 230 (720) T protein:vir:35 154 RICLEPIYDPARSVW--FDPDAKKYDKSDAEWAF-CMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIAKYY 230 (720) T ss_pred eeeEecccCchhhee--ecccccccChhhhhhhh-hhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEEEee Confidence 2333332 223443 333221 11111 11111000 0011234455555 Q ss_pred cCCeEE----EEEe-ecCceeecccccc-----------------c--ccccc---cccccccccccccccCCcccEEEe Q lcl|NC_010179. 181 TDKEAQ----FFRT-SATDSTVIEPYNI-----------------I--TSYDL---SAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 181 ~~~~~~----~~~~-~~~~~~~~~~~~~-----------------~--~~~~~---~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) ....+. .+.. .++....+..... . ..+.. 
..+......+..+-+++.+|+|+| T Consensus 231 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~~~fP~vP~ 310 (720) T protein:vir:35 231 EVKKESVDVVSFQNPLTSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPGEHIPLIPV 310 (720) T ss_pred EEEEEEEEEEEeecCCCCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCCCccceEEE Confidence 443321 1111 1111111111100 0 00000 001111222334455677888877 Q ss_pred cCC-------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcc---eee--ecccC Q lcl|NC_010179. 234 PKN-------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYK---SIK--INNAG 301 (469) Q Consensus 234 ~n~-------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~---~~~--~~~~~ 301 (469) .-. +...|.+.++++.++.+|...|.+.+.+.. .+...-.|... +...+...+.... ... +.... T Consensus 311 ~g~r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~--~~~~~~~~a~~-~~~~~~~~~a~~~~~~~~~l~~~~~~ 387 (720) T protein:vir:35 311 YGKRWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQ--DTGSIPIVGKS-QIKTLEKYWANRNKNRPAFLPLNEIV 387 (720) T ss_pred EeeeeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHc--CCccccccCcc-hHHHHHHHhhcccccccccccccccc Confidence 532 223588899999999999999999999854 44444444322 2222222221111 111 11111 Q ss_pred C--C----CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 302 N--G----DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAI 375 (469) Q Consensus 302 ~--~----~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l 375 (469) . | ....+.+...+.-..+....++.-...|-..|+..+-..+.-+|.||+|+...-............-+..+. 
T Consensus 388 ~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~ 467 (720) T protein:vir:35 388 DKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSNIAKETVNHLMHRSDMSSFIYLDNMAKSL 467 (720) T ss_pred ccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 0 112344544433445666777777788888887766555545778999998865554444444555555555 Q ss_pred HHHHH----HHHHHhcc------cCCCc-----------------------------ccceEEeCCCCCCCHHHHHHHHH Q lcl|NC_010179. 376 NELVR----AIMRYLNF------SDADK-----------------------------RHISQHWTRTKVEDSLTKAQIVS 416 (469) Q Consensus 376 ~~~~~----~i~~~~~~------~~~~~-----------------------------~~i~i~f~~~~p~d~~e~~~~~~ 416 (469) ++..+ +|..++.. .+-+. .+|.+.=.+..+.-..+.++.++ T Consensus 468 ~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~~p~~~s~req~~~~m~ 547 (720) T protein:vir:35 468 KRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDVGPSYTARRDATVSVLT 547 (720) T ss_pred HHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEecccCcccHHHHHHHHHH Confidence 55444 44444421 11100 11222223333333344455555 Q ss_pred HHhccCChHH---------HHHhCCCCCCHHHHHHHHHHHHHHhhhh-----------HhhcccCCCCCCCC-------- Q lcl|NC_010179. 417 TVANYSSKEA---------VAKANPIVDDWQQELKDLAKDREENDPY-----------ANQADELNGKGVDD-------- 468 (469) Q Consensus 417 kl~g~iS~et---------~~~~l~~v~d~~~E~eri~~E~~~~~~~-----------~~~~~~~~~~~~~d-------- 468 (469) .+.+.++.+. .++.+++ +..++-.+|+++........ .+..+....-.... T Consensus 548 qll~~~~p~~~~~~~~~~~ile~~d~-p~~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~qq~~qq~~~e~~~aqa~l~ 626 (720) T protein:vir:35 548 NLLAGMLPQDPMRQVLQGIILDNMEG-EGLDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQMIQQAQQPNAELVAAQGVLM 626 (720) T ss_pred HHHHhcCCCchhHHHHHHHHHHhcCc-hhHHHHHHHHHhhcchhcccCccChhHHHHHHHHHHHHHhHhHHHHHHHHHHH Confidence 5444333221 2333322 22233344443321100000 00000000000000 Q ss_pred C Q lcl|NC_010179. 
469 E 469 (469) Q Consensus 469 e 469 (469) + T Consensus 627 q 627 (720) T protein:vir:35 627 Q 627 (720) T ss_pred H Confidence 0 No 98 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.47 E-value=5.6e-13 Score=87.76 Aligned_cols=435 Identities=14% Similarity=0.115 Sum_probs=218.8 Q ss_pred CCHHHHHHHH----------HHHHHHHHHHHHHH---HHHHHHhccCCcccccccchhhhcccccccccccCcceeccch Q lcl|NC_010179. 1 MELDALKKLI----------RNTSTSRNDLINNY---KKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNF 67 (469) Q Consensus 1 ~~~~~~~~~i----------~~~~~~~~~~~~~~---~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~ 67 (469) -+++++.+++ .++....+.|.+.. +++++||.+... ++.. .. .. .-.+++.+|. T Consensus 3 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~el~~y~~a~~~---~~~~------~~--~~--~~r~~~~~~k 69 (584) T protein:vir:95 3 VKVAELNSLLVRDSSAQWVAYLWDRFNNQRRQKIEEWKELRNYVFATDT---TTTS------NQ--GL--PWKNSTTLPK 69 (584) T ss_pred cchhhhhhhccccchHHHHHHHHHHHHhhhchhhccCHHHHHHHHhhhh---hhhh------hc--cc--ccccccchhH Confidence 2333444333 33333333333322 568888887432 1100 00 00 1134778888 Q ss_pred HHHHHHHHHHhh----hcCC-----eeeccCchh--hHHHHHHHHhc-----cHHHHHHHHHHHHHhCCeEEEEEEEcCC Q lcl|NC_010179. 68 YQLLVDQEAGYI----ASVF-----PDIDVGKDA--DNKKILDVLGD-----DRALTLNSLLVDSSNAGRAWLHYWIDED 131 (469) Q Consensus 68 ~k~iv~~~~~~l----~g~p-----~~~~~~~~~--~~~~l~~~~~~-----n~~~~~~~~~~~~~~~G~~~~~v~~d~~ 131 (469) +.-+++..++++ |++. +.+..++.+ .++.++.+..+ ++......+..++.++|.|+..+++... 
T Consensus 70 ~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~~k~~~~~~ 149 (584) T protein:vir:95 70 LCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAFATVSFEAK 149 (584) T ss_pred HHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceEEEEeEeec Confidence 888888887765 3322 112222222 24556666533 3445677888999999999999987543 Q ss_pred -------------CceEEEEEccceeEEEEeCCC--CCceEEEEEEEE-------------------------------- Q lcl|NC_010179. 132 -------------NNFRYGIIQPDQITPVYATTL--DNKLLGVLRSYK-------------------------------- 164 (469) Q Consensus 132 -------------~~~~i~~~~p~~~~~~~d~~~--~~~~~~~v~~~~-------------------------------- 164 (469) .++++..++|..+|| |++. ....-+.+|.+. T Consensus 150 ~~e~~e~~~v~~~~~prieriSP~d~~~--Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~~~~~~~~ 227 (584) T protein:vir:95 150 YKEMTDGTLVPDYIGPRLVRISPLDIVF--NPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRREEICRHL 227 (584) T ss_pred ceeeeccccccccccceEEeeChhheee--cCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHHHHhccCC Confidence 268899999999884 5544 222222222211 Q ss_pred ----eeecCCc------eEEEEEEEEcCCeEEEEEeecC----ceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_010179. 165 ----QLDPEAG------KYFTVHEYWTDKEAQFFRTSAT----DSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF 230 (469) Q Consensus 165 ----~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 230 (469) ..+.++- +.....++|....+.....++. .......+...+.+ .+.........+.++|.+|+ T Consensus 228 ~~~~~~~~~~~~~~~~d~~~~~~ey~~~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~---~g~~iIR~~~np~~~~~~PF 304 (584) T protein:vir:95 228 GGYSVEDFDKAAGFDVDGFGNLYEYYMSDWVEILEFYGDYHDKETGELQTNRIITVV---DRSTEVRNESIPTWFGSAPI 304 (584) T ss_pred CCCcccccccccccccccccccccccCCceeEEEeecccccccccCCCcccceEEEE---eccEEEEeeecCCCCCCCCE Confidence 1111110 0011233333333333222110 00000111111111 11111112244567799999 Q ss_pred EEecCC-----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCC Q lcl|NC_010179. 
231 IEFPKN-----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDK 305 (469) Q Consensus 231 v~~~n~-----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (469) +.+.-- -.|.|+...+.++++.+|.+.-.+.+.+..+.+|.+...+...+. ..++++.+.. +.. T Consensus 305 ~~~~~~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~pv~k~~~~~~~~------~~~pg~~~~~-----~~~ 373 (584) T protein:vir:95 305 YHVGWRFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQPPLKIIGEVEEF------VWGPGAEIHL-----DQG 373 (584) T ss_pred EEEcceeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCcceeeccccchh------cccCCceeec-----CCC Confidence 887643 359999999999999999999999999999999977666532221 1223444443 334 Q ss_pred CcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_010179. 306 SGVDKLQIDI-PVEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFEHAI-NELVRA 381 (469) Q Consensus 306 ~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l-~~~~~~ 381 (469) +.++++.++. +..+.-+.+..+...+-..|+.|..+.+. .++.++..+.+++.++-.-...+.+.|..++ ++++.+ T Consensus 374 ~~~q~~~p~a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~naa~~~~r~~~~~f~~~ll~~l~~l 453 (584) T protein:vir:95 374 GDVQEIAKNVNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGNAAGRIFQEKVTTFEVELLEPVLNA 453 (584) T ss_pred CCcceecCchhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5678888764 34445566777888888899999876543 2344555566677777777788888888876 777777 Q ss_pred HHHHhcc--cCCCc------------------ccceEEeC--CCCCCCHHHHHHHHHHHhc------------cCChHHH Q lcl|NC_010179. 382 IMRYLNF--SDADK------------------RHISQHWT--RTKVEDSLTKAQIVSTVAN------------YSSKEAV 427 (469) Q Consensus 382 i~~~~~~--~~~~~------------------~~i~i~f~--~~~p~d~~e~~~~~~kl~g------------~iS~et~ 427 (469) +.++... ...+. .++.-.|. .--..-..+.++.++.+.. .++.... 
T Consensus 454 l~~~~~~nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~~q~l~~ilq~~~~~~i~p~~~~~~l 533 (584) T protein:vir:95 454 MLETATRNMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQDLQNLVGIFNSQIGQMILPHTSGKAL 533 (584) T ss_pred HHHHHHhhccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHHHHHHHHHHHhhhhhhccccchHHHH Confidence 7765311 00011 11222221 1111112223332222211 1233222 Q ss_pred HH------hCCC----CCC----HHHHHHHHHHHHHHhhhhHhhcccCCCCCCC Q lcl|NC_010179. 428 AK------ANPI----VDD----WQQELKDLAKDREENDPYANQADELNGKGVD 467 (469) Q Consensus 428 ~~------~l~~----v~d----~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~ 467 (469) .. .+|. ..+ .+.|.+....+..+.. ....+++.+|.- T Consensus 534 ~~~ladl~~~p~~~~~~~~~~~~~Q~~~q~~~~~~q~~~---~~~~~~~~~~~~ 584 (584) T protein:vir:95 534 ATFVDDVTGLQGYEIFRPNVAVAEQAETQSLVAQAQEDL---QLQAQMPAEGAI 584 (584) T ss_pred HHHHHHHhCCCcccccCCCcccchhHHHHhhhHHHHHHH---HHHHhhhhccCC Confidence 11 1331 112 1223333222211111 112222222222 No 99 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.43 E-value=2.8e-12 Score=83.96 Aligned_cols=454 Identities=7% Similarity=-0.008 Sum_probs=192.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHH---HHHHHH--HHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLI---NNYKKS--VDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQE 75 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~---~~~~~~--~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~ 75 (469) ..-+.+.++...+........ .+...- .+||.|.|- ........... ....++ ..+.+|.++.+|+.. 
T Consensus 4 ~~~~~~~~~~~r~~~~~~~~~~~r~~~~~d~~f~~y~G~Qw----~~~~~~~l~~~-~q~~~r--P~~~~N~i~~~i~~v 76 (708) T protein:vir:17 4 TLEKKHERIMLRFDRAYSPQQEVREKCIEATRFARVPGGQW----EGATAAGTKLD-EQFEKY--PKFEINKVATELNRI 76 (708) T ss_pred hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhhccCCCCC----CHHHHHHHHhh-hhhcCC--CceEEcchHHHHHHH Confidence 222345555554444332221 122112 368999872 11111111000 000111 246789999999999 Q ss_pred HHhhhcCCee--eccCc----hhhHHHHHHH----HhccH-HHHHHHHHHHHHhCCeEEEEEEEc---CC------CceE Q lcl|NC_010179. 76 AGYIASVFPD--IDVGK----DADNKKILDV----LGDDR-ALTLNSLLVDSSNAGRAWLHYWID---ED------NNFR 135 (469) Q Consensus 76 ~~~l~g~p~~--~~~~~----~~~~~~l~~~----~~~n~-~~~~~~~~~~~~~~G~~~~~v~~d---~~------~~~~ 135 (469) +|+---+.+. +.+.+ .+..+.+..+ .+.+. ......+..+++++|.+|+.+..| +. .++. T Consensus 77 ~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~~~~~~~i~ 156 (708) T protein:vir:17 77 IAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDPMDDRQRIA 156 (708) T ss_pred HhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCCCCCccccc Confidence 9997655544 33332 2223333333 33343 445778889999999999877532 21 2344 Q ss_pred EEEE--ccceeEEEEeCCCC------CceEEEEEE---------EEeee---------------cCCceEEEEEEEEcCC Q lcl|NC_010179. 136 YGII--QPDQITPVYATTLD------NKLLGVLRS---------YKQLD---------------PEAGKYFTVHEYWTDK 183 (469) Q Consensus 136 i~~~--~p~~~~~~~d~~~~------~~~~~~v~~---------~~~~~---------------~~~~~~~~~~~~~~~~ 183 (469) +..+ ++..++ ||+... .+.++-.++ |.... .-+...++.+++|... T Consensus 157 i~~~~~~~~~v~--~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~vrv~e~~~r~ 234 (708) T protein:vir:17 157 IEPIYDPSRSVW--FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVIYIAKYYEVR 234 (708) T ss_pred eEeeccchhhee--cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeEEEEEEEEEe Confidence 4443 345554 454321 111111111 11100 0011234455555322 Q ss_pred e--EEEEEee---cCceeecccccc-------------------cc----cccccccccccccccccccCCcccEEEecC Q lcl|NC_010179. 
184 E--AQFFRTS---ATDSTVIEPYNI-------------------IT----SYDLSAGYETGQSNTLKHNFGRVPFIEFPK 235 (469) Q Consensus 184 ~--~~~~~~~---~~~~~~~~~~~~-------------------~~----~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 235 (469) . ..++... ++...++..... .. .|... .......+..+-+++.+|+|+|.- T Consensus 235 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~-~g~~~l~~~~~~p~~~fP~vP~~g 313 (708) T protein:vir:17 235 KESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVV-DGDGFLEKPRRIPGEHIPLIPVYG 313 (708) T ss_pred eeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEee-cccccccCCCCCCCCccceEEEec Confidence 1 1111111 111111111000 00 00000 111122234445567788887753 Q ss_pred Cc---c----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEe-----cCCcc----cchhh--hhhhh-hcceee Q lcl|NC_010179. 236 NK---Y----RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT-----NYGGA----SLKQF--MNDLR-EYKSIK 296 (469) Q Consensus 236 ~~---~----g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~-----g~~~~----~~~~~--~~~~~-~~~~~~ 296 (469) .+ + ..|.+.++++.++.+|...|.+.+.+........++. |.... ..... ..... ...... T Consensus 314 ~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~~~~~~~~~g~ 393 (708) T protein:vir:17 314 KRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLPLREVRDKYGN 393 (708) T ss_pred ccccccCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhhhhccCCcccc Confidence 21 1 2577889999999999999999988877665544332 11100 00000 00000 011111 Q ss_pred ecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 297 INNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAIN 376 (469) Q Consensus 297 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 376 (469) +..+. 
.+ ......+.-..++...++.....|-..|++.+...+..+|.||+|+...-............-+..+.+ T Consensus 394 v~~~a---~~-~~~~~~~~~~~~~~~llq~~~~~i~~~tGi~d~~~G~~sn~SG~Ai~~rq~qg~~~~~~~~Dnl~~~~~ 469 (708) T protein:vir:17 394 IIAGA---TP-AGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPSNIAQETVNNLMNRADMASFIYLDNMAKSLK 469 (708) T ss_pred ccccc---CC-cccCCCccccHHHHHHHHHHHHHHHHhcCCChHHccCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 11 112222223356777788888888888887766555567889999987766555555555555555555 Q ss_pred HHH----HHHHHHhcc------cCCC----c-------------------------ccceEEeCCCCCCCHHHHHHHHHH Q lcl|NC_010179. 377 ELV----RAIMRYLNF------SDAD----K-------------------------RHISQHWTRTKVEDSLTKAQIVST 417 (469) Q Consensus 377 ~~~----~~i~~~~~~------~~~~----~-------------------------~~i~i~f~~~~p~d~~e~~~~~~k 417 (469) +.. .+|..++.. .+-+ + .+|.+.=.+..+.-..+..+.++. T Consensus 470 ~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r~~~~~~l~q 549 (708) T protein:vir:17 470 RAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARRDATVSVLTN 549 (708) T ss_pred HHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHHHHHHHHHHH Confidence 544 444444421 1100 0 011221122222223344444554 Q ss_pred HhccCChH---H------HHHhCCCCCCHHHHHHHHHHHHHHhh-------------------------hhHh--hcccC Q lcl|NC_010179. 418 VANYSSKE---A------VAKANPIVDDWQQELKDLAKDREEND-------------------------PYAN--QADEL 461 (469) Q Consensus 418 l~g~iS~e---t------~~~~l~~v~d~~~E~eri~~E~~~~~-------------------------~~~~--~~~~~ 461 (469) +.+.++.. + +++.+++ +..++-.++|++...... +... +.+.. T Consensus 550 ll~~~~~~~~~~~~~~~l~l~~~D~-p~~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~~~~eaqa~~~ 628 (708) T protein:vir:17 550 VLSSMLPADPMRPAIQGIILDNIDG-EGLDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNPEMVLAQAQMV 628 (708) T ss_pred HHHhcCCccchhHHHHHHHHHhcCC-CChHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43322211 1 2233321 222222333332211000 0000 00000 Q ss_pred C----CCCCCCC Q lcl|NC_010179. 
462 N----GKGVDDE 469 (469) Q Consensus 462 ~----~~~~~de 469 (469) . .....-| T Consensus 629 ~~qAe~~ka~ae 640 (708) T protein:vir:17 629 AAQAEAQKATNE 640 (708) T ss_pred HHHHHHHHHHHH Confidence 0 0000000 No 100 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.41 E-value=2.4e-11 Score=78.79 Aligned_cols=432 Identities=10% Similarity=0.047 Sum_probs=195.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHH----HHH----------HHHHHhccCCcccccccchhhhcccccccccccCcceeccc Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLIN----NYK----------KSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSN 66 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~----~~~----------~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n 66 (469) .+-+.|-.+|.+...++.+.+. +-+ ++.+||.|... +. .......-.++++.+ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~r~~~~~~w~~~~~~~~~~~~~~~y~~~~~~---~~----------~~~~~~~~rs~~~~~ 82 (651) T protein:vir:80 16 DETHDVSSYVKKEYKRFCDARQVCEETWLEAWGMYLSTPEAQDYLRDQVL---RS----------VGDVNADWRHKITTG 82 (651) T ss_pred hhhHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHhhcccHHHHHhhccccc---cc----------cCCCCCCCCccccCh Confidence 3444555555544444433321 111 33455555321 00 000000112468889 Q ss_pred hHHHHHHHHHHhhh----cCCeeec--c-Cchh----hHHHHHHHHh----cc-HHHHHHHHHHHHHhCCeEEEEEEEcC Q lcl|NC_010179. 67 FYQLLVDQEAGYIA----SVFPDID--V-GKDA----DNKKILDVLG----DD-RALTLNSLLVDSSNAGRAWLHYWIDE 130 (469) Q Consensus 67 ~~k~iv~~~~~~l~----g~p~~~~--~-~~~~----~~~~l~~~~~----~n-~~~~~~~~~~~~~~~G~~~~~v~~d~ 130 (469) .....|+...+.|+ +.+.-+. . ++.+ ..+.+..++. +. +......+..+++++|.+++.||++. 
T Consensus 83 ~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~i~kv~we~ 162 (651) T protein:vir:80 83 KAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNSVLALPWRV 162 (651) T ss_pred hHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCceEEEEeecc Confidence 99999988777653 3322222 1 2222 2333555544 22 33445567789999999999998763 Q ss_pred C-------------------------------CceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeec------CCc-- Q lcl|NC_010179. 131 D-------------------------------NNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDP------EAG-- 171 (469) Q Consensus 131 ~-------------------------------~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~------~~~-- 171 (469) . |.+++..++|.++++-.+........++++.+..... +|. T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~dp~a~~~~d~~~v~~~~~t~~~l~~l~~~g~~~ 242 (651) T protein:vir:80 163 ETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFYDPNVTDPNRGAFIRKLTKTKADILNLLSEGYYY 242 (651) T ss_pred eeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeeecCCCcCccccceeeeeeeeHHHHHHHHhccccc Confidence 2 5678999999998864332222223333333321100 000 Q ss_pred -------------------------------------eEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccc Q lcl|NC_010179. 172 -------------------------------------KYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYE 214 (469) Q Consensus 172 -------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (469) ..+..+|+|.. +..++.....+ ..... T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~-----~d~e~~~~~~~-----------~v~~~ 306 (651) T protein:vir:80 243 GVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGD-----IHLENKTYHDV-----------VVTIM 306 (651) T ss_pred chhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEE-----eeccCCceEEE-----------EEEEc Confidence 00011111110 01011110000 00000 Q ss_pred c-cccccccccC-CcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhh Q lcl|NC_010179. 
215 T-GQSNTLKHNF-GRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMN 287 (469) Q Consensus 215 ~-~~~~~~~~~~-g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~ 287 (469) + .......+++ ..+|++.++- ...|.|..+.+.+.+..+|.+...+.+.+..+++|.+.+...+..+..+. T Consensus 307 g~~il~~~~~~~~~~~Pf~~~~~~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~~~~~~~l-- 384 (651) T protein:vir:80 307 GNEVLRFEQNPYWCGRPFVIGTYIPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDGLLQPEDV-- 384 (651) T ss_pred CcEEecccccCCCCCCCeeeecceecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCccccHHHh-- Confidence 0 0011112222 2357766543 34699999999999999999999999999999999877643222222222 Q ss_pred hhhhcceeeecccCCCCCCcceEEeecC-CHHHHHHHHHHHHHHHHHHhCCCCcCcc----ccCCccHHHHHHHHHHHHH Q lcl|NC_010179. 288 DLREYKSIKINNAGNGDKSGVDKLQIDI-PVEARDDALKITRDNIFLFGQGIDPANF----ESSNASGVAIKMLYSHLEL 362 (469) Q Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~~g~~Sg~Al~~~~~~l~~ 362 (469) ....++++.+... +++..++... +.......++.+...+-..++++++... ..++.++.++......+.. T Consensus 385 ~~~pg~vi~~~~~-----~~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~~~~~~~~~TAteI~~~~~~~~~ 459 (651) T protein:vir:80 385 YTEPGKVFLVSDH-----GDLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGANAARSGERVTAAEVAAVREAGGN 459 (651) T ss_pred hcCCCceEEecCC-----CCceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCCCccchhhccHHHHHHHHHHHHH Confidence 2335566654432 3466776543 4456667788888999889888875442 2344566666555555555 Q ss_pred HHHHHHHHHHH-HHHHH----HHHHHHHhccc------C----------CCcccceEEeCC--CCCCCHHHHHHHHHHH- Q lcl|NC_010179. 363 KAAKTQTYFEH-AINEL----VRAIMRYLNFS------D----------ADKRHISQHWTR--TKVEDSLTKAQIVSTV- 418 (469) Q Consensus 363 k~~~~~~~~~~-~l~~~----~~~i~~~~~~~------~----------~~~~~i~i~f~~--~~p~d~~e~~~~~~kl- 418 (469) .....-+.|.. +++.+ ++++.++.... 
+ ....++++.|.- .-+....+..+.++++ T Consensus 460 ~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~iv~~g~~~~~~r~~~~~~l~ 539 (651) T protein:vir:80 460 RLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRLVPIGSDHVIERKQYIEDRL 539 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeeeeeeeeccHHHHHHHHHHHHHHH Confidence 55555555544 44444 44444432211 0 001123333321 1111122222222221 Q ss_pred -----hccCC---h-----HH---HHHhCCCCCCHHH--------------HHHHHHHHH--HHh--hhhHhhcccCCCC Q lcl|NC_010179. 419 -----ANYSS---K-----EA---VAKANPIVDDWQQ--------------ELKDLAKDR--EEN--DPYANQADELNGK 464 (469) Q Consensus 419 -----~g~iS---~-----et---~~~~l~~v~d~~~--------------E~eri~~E~--~~~--~~~~~~~~~~~~~ 464 (469) .+..+ . .. .++.++ +.++.. +....+.+. .+. .....+.+...+. T Consensus 540 ~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g-~~~~~~~l~~~~q~~~~~~~~~~~~q~~~~~~~a~~~~~~~~~~~~~~~ 618 (651) T protein:vir:80 540 TFIQAVAQVPEMGQLVDYKRILVDLLQHWG-FEEPEAYLKQQDQQAPANPQEALLSQAKDVGGQAMSNMLQNQLQADGGT 618 (651) T ss_pred HHHHhhccCCccchhhhHHHHHHHHHHHcC-CCCcHHhcCCCccchhhhhhHHHHhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111 1 11 223333 222211 000001110 000 0000111000000 Q ss_pred CCC-CC Q lcl|NC_010179. 465 GVD-DE 469 (469) Q Consensus 465 ~~~-de 469 (469) ... |+ T Consensus 619 ~~~~~~ 624 (651) T protein:vir:80 619 QMMSEM 624 (651) T ss_pred HHHHHH Confidence 000 00 No 101 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.32 E-value=8.4e-12 Score=81.30 Aligned_cols=382 Identities=10% Similarity=0.042 Sum_probs=190.4 Q ss_pred HHHHHHHHHHhccC-Ccccccccchhhhcccccccccc--cC-cceeccchHHHHHHHHHHhhhcCCeeeccCchh--hH Q lcl|NC_010179. 
21 INNYKKSVDYYENK-TDITTRNNGKPKVSKEGKKDPLR--SA-DNRIPSNFYQLLVDQEAGYIASVFPDIDVGKDA--DN 94 (469) Q Consensus 21 ~~~~~~~~~Yy~g~-~~i~~~~~~~~~~~~~~~~~~~~--~~-~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~~--~~ 94 (469) ....+-+...-.|. +. +....+ ..+...... .- ..--.+.+++.+|+..+.-++-+++.+++++.+ .. T Consensus 1 ~~~~D~~~~~~~~~g~~---~~~~~~---~~~~~~~~~~~~l~a~Y~~~~l~~~~vd~~a~d~~r~~~~i~~~d~~~~~~ 74 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSK---QEQTYY---SPSLSLTDDLVQLEALWRDNWIANKVCIKRPEDMVRNWREIYSNDLNSKQL 74 (437) T ss_pred CchhhhhHhHHhcCCCc---ccccee---ecCccccccHHHHHHHHHhCchhhHHhhcchHHhhcCCceEecCCCCHHHH Confidence 11111211122111 10 000000 000000000 00 000136888999999999999999999886433 33 Q ss_pred HHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC---------CceE-EEEEccceeEEEEeCC-CCCceEEE-EE Q lcl|NC_010179. 95 KKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED---------NNFR-YGIIQPDQITPVYATT-LDNKLLGV-LR 161 (469) Q Consensus 95 ~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~---------~~~~-i~~~~p~~~~~~~d~~-~~~~~~~~-v~ 161 (469) +.++..|++ +....+.++.+.+-.+|.+++++-.+.. +.++ +.++++.++.|..-.. +...+-++ .. T Consensus 75 ~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~ 154 (437) T protein:vir:52 75 DLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYS 154 (437) T ss_pred HHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEechhhccccccccccccccccCcce Confidence 457777765 5667788999999999999988866543 2222 4445554444322111 11110000 01 Q ss_pred EEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccc Q lcl|NC_010179. 162 SYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLA 241 (469) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~ 241 (469) +|.... +.. ...+ .+..+.+|.. 
..+| ...++-.|.| T Consensus 155 ~y~v~~--~~~---~~~i-H~SRii~~~~-----------------------------------~~~~--~~~~~~~G~s 191 (437) T protein:vir:52 155 EYSILG--GSQ---SITV-HHSRLIILNA-----------------------------------NDAP--LSDNDIWGVS 191 (437) T ss_pred EEEEec--CCc---ceeE-ccceeEEecC-----------------------------------ccCC--CccccccCCc Confidence 111100 000 0000 0111111100 0112 1112335899 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCccc---chh----hhhh---hh-hcceeeecccCCCCCCcceE Q lcl|NC_010179. 242 ELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGAS---LKQ----FMND---LR-EYKSIKINNAGNGDKSGVDK 310 (469) Q Consensus 242 ~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~---~~~----~~~~---~~-~~~~~~~~~~~~~~~~~~~~ 310 (469) .++.+.+-+.+++.+.-..+..+..+..+.+.+.|..... ..+ .... .+ ..+++.++.+ .+| T Consensus 192 ~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~-------~~~ 264 (437) T protein:vir:52 192 DLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAGMENEVASVISAVQEIKSATNSLLLDAE-------NEY 264 (437) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCCcHHHHHHHHHHHHHhcCCCceEEEcCC-------cce Confidence 9999999999999988888888877777777776642111 011 1111 11 2233333221 233 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CC-ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHh Q lcl|NC_010179. 311 LQIDIPVEARDDALKITRDNIFLFGQGIDPANFES--SN-ASGVAIKMLYSHLELKAAKTQ-TYFEHAINELVRAIMRYL 386 (469) Q Consensus 311 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~-~Sg~Al~~~~~~l~~k~~~~~-~~~~~~l~~~~~~i~~~~ 386 (469) -+.+.+.+.....++....+|...+++|-.-..+. ++ .||..=...|.. .++..+ ..+...+++++.+++... 
T Consensus 265 e~~~~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~yyd---~i~~~Qe~~l~p~le~l~~~i~~~~ 341 (437) T protein:vir:52 265 DRKELTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQNYHE---AIRRLQETRLRPIFEIIDPLICNEL 341 (437) T ss_pred EEEecCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 34445677788888999999999999997544332 22 466654334433 333443 568889999999887643 Q ss_pred cccCCCcccceEEeCCCCCCCHHHHHHHHHH-------H--hccCChHHHHHhC------CCCCCHHHHHHHHHHHHHHh Q lcl|NC_010179. 387 NFSDADKRHISQHWTRTKVEDSLTKAQIVST-------V--ANYSSKEAVAKAN------PIVDDWQQELKDLAKDREEN 451 (469) Q Consensus 387 ~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k-------l--~g~iS~et~~~~l------~~v~d~~~E~eri~~E~~~~ 451 (469) .. .. ..+++++|++-...++++.+++..+ + +|++|.+.+.+.| |.+++ ++++... .. T Consensus 342 ~g-~~-~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~g~~~~i~~--~~~~~~~----~~ 413 (437) T protein:vir:52 342 FG-GL-PADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRESGLFANISA--EHIEELK----NA 413 (437) T ss_pred cC-CC-CCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCCCCCCc--ccccccc----CC Confidence 22 12 2368899999988999988876543 2 5678877776653 22332 1111110 00 Q ss_pred hhhHh------hcccCCCCCCCCC Q lcl|NC_010179. 452 DPYAN------QADELNGKGVDDE 469 (469) Q Consensus 452 ~~~~~------~~~~~~~~~~~de 469 (469) .+..+ ...........+| T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~ 437 (437) T protein:vir:52 414 DEFAGNFEEPEKMEGAQVQNSEDQ 437 (437) T ss_pred CCCCCccCCCCCCCCCCCCCCCCC Confidence 11000 0000111111111 No 102 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.28 E-value=2e-10 Score=73.81 Aligned_cols=417 Identities=10% Similarity=0.002 Sum_probs=220.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcc-----c-ccccccccCc-ceeccchHHHHHH Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSK-----E-GKKDPLRSAD-NRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~-----~-~~~~~~~~~~-~ri~~n~~k~iv~ 73 (469) |++ +-++|.-+..+..-+..+.....+-|.|-..-. .....+.... . ....-..++. .-...++++-.|+ T Consensus 1 mn~--~dr~i~~~sP~~~~~R~~ar~~~~~y~aa~~~r-~~~~~~~~~s~~~~~~~~~~~lr~RaRdl~rNn~~a~~av~ 77 (502) T protein:vir:79 1 MAI--LDDVIGVFSPGWKAARLRSRAVIQAYEAVKTTR-THKARRENRTADQLSQYGAVSLREQARYLDNNHDLVIGVFD 77 (502) T ss_pred Cch--HhhHHhhcChHHHHHHHhhHHHHhhccccCccc-ccCCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHH Confidence 654 456665555444333333333334466532100 0000000000 0 0000000000 0123578999999 Q ss_pred HHHHhhhcC-Ceeecc----Cc----hhhHHHHHHHHhc-----------cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 74 QEAGYIASV-FPDIDV----GK----DADNKKILDVLGD-----------DRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 74 ~~~~~l~g~-p~~~~~----~~----~~~~~~l~~~~~~-----------n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) ..+..+.|. .+++.+ .+ ++.++.+...|.. +|......+.+.....|.+++++.+++.+. T Consensus 78 ~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~~~~~~~~ 157 (502) T protein:vir:79 78 KLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLRTWLRDGEVFAQMVSGRINS 157 (502) T ss_pred HHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHHHHHhCCceEEEEeecccCc Confidence 999999985 444322 22 3344545555431 223333345677888999999987766432 Q ss_pred --------eEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccc Q lcl|NC_010179. 134 --------FRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIIT 205 (469) Q Consensus 134 --------~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (469) +++..++|+.+---++ ....+..+|.+ +..|....+++.-..+.. . 
T Consensus 158 ~~~g~~~~l~lq~iepd~l~~~~~--~~~~i~~GVe~----d~~Gr~~aY~i~~~hPgd----------~---------- 211 (502) T protein:vir:79 158 LTPSAGVHFWLEALEPDFIPMTSD--ESNRLNQGVFV----DDWGRPEKYLVYKSRPVS----------G---------- 211 (502) T ss_pred cCCCcccceEEEEecchhcCCCCC--CCCeeEeeeEE----CCCCceEEEEEeecCCCC----------C---------- Confidence 5789999988632122 12345555542 333433322211100000 0 Q ss_pred cccccccccccccccccccCCccc---EEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC Q lcl|NC_010179. 206 SYDLSAGYETGQSNTLKHNFGRVP---FIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY 277 (469) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~ 277 (469) ....+.+|| |+|+.. ...|.|+|..++..+..++.............+.-..+++.. T Consensus 212 ---------------~~~~~~rvpA~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~~~dael~~a~i~A~~~~fi~~~ 276 (502) T protein:vir:79 212 ---------------RQMETKEVDAERMLHLKFVRRLHQMRGTSLLSGVLIRLSALKEYEDSELTAARIAAALGMYIRKG 276 (502) T ss_pred ---------------cccceeEechhheEEeecccCCccccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecC Confidence 001122344 555543 246999999999999888877666555555444433444432 Q ss_pred Cccc-c--------hhhhhhhhhcceee-ecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc-ccC Q lcl|NC_010179. 278 GGAS-L--------KQFMNDLREYKSIK-INNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF-ESS 346 (469) Q Consensus 278 ~~~~-~--------~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~g 346 (469) .++. . ......+....++. +.+ +-++++++++.+...+..++..+.+.|..-.++|--... 
.++ T Consensus 277 ~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~p-----Ge~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s 351 (502) T protein:vir:79 277 DGQSYEPDGNGSKENERELTIQPGIIYDDLKP-----GEEIGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYN 351 (502) T ss_pred CCcccccccCCCCCccccccccCCccccccCC-----CceeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc Confidence 2111 0 11111222222222 232 246889888878889999999999999998888843222 243 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH--Hhcc----cCC--CcccceEEeCCCCC--CCHHHHHHHH Q lcl|NC_010179. 347 NASGVAIKMLYSHLELKAAKTQTYFEHAINE-LVRAIMR--YLNF----SDA--DKRHISQHWTRTKV--EDSLTKAQIV 415 (469) Q Consensus 347 ~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~-~~~~i~~--~~~~----~~~--~~~~i~i~f~~~~p--~d~~e~~~~~ 415 (469) .|-.+++..+......+...+..|...+-+ +++..+. ++.. .++ ......+.|..+-. .|....+++. T Consensus 352 -~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~ 430 (502) T protein:vir:79 352 -GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAW 430 (502) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHH Confidence 366777777777777777777666554333 3332222 2211 111 12234677754433 5777777777 Q ss_pred HHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc-----------------ccCCCCCCCCC Q lcl|NC_010179. 
416 STV--ANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA-----------------DELNGKGVDDE 469 (469) Q Consensus 416 ~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~-----------------~~~~~~~~~de 469 (469) ..+ +|+.|.+..+...| .|+++.++++.+|++......-.+ .+....+.+.| T Consensus 431 ~~~i~~Gl~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e 501 (502) T protein:vir:79 431 KIQIRGGAATESDWVRAGG--RNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSE 501 (502) T ss_pred HHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 664 79999999999987 488888999998877654332211 11111111111 No 103 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.28 E-value=9.7e-11 Score=75.49 Aligned_cols=439 Identities=11% Similarity=0.073 Sum_probs=186.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHh-- Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGY-- 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~-- 78 (469) -.+..+++-+......|...+.+...+.+||.+.-+...+. ..+ ..+++.+.....|+..... T Consensus 27 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~g--rs~vv~~~v~~~ve~~~~~l~ 91 (763) T protein:vir:95 27 LSLQALKADLDAAKPSHTAMMIKVKEWNDLMRIEGKAKPPK-------------VKG--RSQVQPKLVRRQAEWRYSALT 91 (763) T ss_pred HHHHHHHHHHHhhhcchhHHHHHHHHHHHhhhccccCcccc-------------cCC--CccccCHHHHHHHHHHHHHHH Confidence 33445555556666677777777777888766543221111 011 2346667666666665543 Q ss_pred --hhcCC--eeecc---Cchhh----HHHHHHHH-hcc-HHHHHHHHHHHHHhCCeEEEEEEEcC--------------- Q lcl|NC_010179. 79 --IASVF--PDIDV---GKDAD----NKKILDVL-GDD-RALTLNSLLVDSSNAGRAWLHYWIDE--------------- 130 (469) Q Consensus 79 --l~g~p--~~~~~---~~~~~----~~~l~~~~-~~n-~~~~~~~~~~~~~~~G~~~~~v~~d~--------------- 130 (469) ++|.+ +.+.. +|.+. 
...++-++ ..| ..+.+...+++++.+|.+++.|||+. T Consensus 92 ~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~~~~~~~~~ 171 (763) T protein:vir:95 92 EPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQEVPVFSLF 171 (763) T ss_pred HhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeeeeehhhhhc Confidence 34433 34443 22222 22333333 333 45667899999999999999998741 Q ss_pred ---------------------------------------------------------------CCceEEEEEccceeEEE Q lcl|NC_010179. 131 ---------------------------------------------------------------DNNFRYGIIQPDQITPV 147 (469) Q Consensus 131 ---------------------------------------------------------------~~~~~i~~~~p~~~~~~ 147 (469) .++++|..++|.++++- T Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V~p~d~~iD 251 (763) T protein:vir:95 172 PIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEMLNPENIIID 251 (763) T ss_pred cccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEeecHHHheec Confidence 13456777999998864 Q ss_pred EeCCCC-CceEEEE-EEEEeee-c--CCceEEE-----------------------EEEEEc--CCeEEEEEeec----C Q lcl|NC_010179. 148 YATTLD-NKLLGVL-RSYKQLD-P--EAGKYFT-----------------------VHEYWT--DKEAQFFRTSA----T 193 (469) Q Consensus 148 ~d~~~~-~~~~~~v-~~~~~~~-~--~~~~~~~-----------------------~~~~~~--~~~~~~~~~~~----~ 193 (469) .+...+ ....+++ +.+.... . .+..+.. ...+.+ ...+..+.+.. . T Consensus 252 p~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~E~y~~~d~~ 331 (763) T protein:vir:95 252 PSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAYEYWGFWDIE 331 (763) T ss_pred CCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEEEeeeeeccC Confidence 432111 1222222 2222110 0 0100000 000000 01111111100 0 Q ss_pred ceeecccccccccccccccccccccccccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010179. 
194 DSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQ 268 (469) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~ 268 (469) +......+ ...+ .+.........|.+.|++|++.|+- ...|.|.++.++++++.+|..++.+.+.+...+ T Consensus 332 gdg~~~~~--~v~~---~g~~iL~~~~~p~~~~~~PFv~~~~~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~~d~l~~~~ 406 (763) T protein:vir:95 332 GNGVLEPI--VATW---IGSTLIRLEKNPYPDGKLPFVLIPYMPVKRDMYGEPDAELLGDNQAVLGAVMRGMIDLLGRSA 406 (763) T ss_pred CcceeEEE--EEEE---EcCeeeecccccccCCCcCEEEecceeecCcccCCchHHHhhHHHHHHHHHHHHHHHHHHhhc Confidence 00000000 0000 0111112223344557778776543 346889999999999999999999999999999 Q ss_pred CceeEE-ecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-- Q lcl|NC_010179. 269 TVILVL-TNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-- 345 (469) Q Consensus 269 ~p~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-- 345 (469) +|.+.+ .|.. ... .....+..+++.+.++..... .+.++..+.........+..+...+-..|+.++.+..-. T Consensus 407 ~~~~~v~~gav-~~~--d~~~~~pg~v~~v~~g~~~~~-~~~~~~~p~~~~~~~~~l~~~~~~~e~~TGv~~~~~G~~~~ 482 (763) T protein:vir:95 407 NGQRGMPKGML-DAL--NSRRYREGEDYEYNPTQNPAQ-MIIEHKFPELPQSALTMATLQNQEAESLTGVKAFAGGVTGE 482 (763) T ss_pred CCcEEeecccc-cch--hhhcccCCceEEeeCCCChhh-hcccccCCCCcchHHHHHHHHHHHHHHhhCcchhhcCcCcc Confidence 885543 3331 111 111234556666665543322 233333333334445555555556666777766543311 Q ss_pred --C-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hcc------cCCCc-----------ccceEEeC Q lcl|NC_010179. 346 --S-NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRY----LNF------SDADK-----------RHISQHWT 401 (469) Q Consensus 346 --g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~----~~~------~~~~~-----------~~i~i~f~ 401 (469) | ..||++ .+............+.|..+++.+++.++.+ +.. .+..+ .++.+.-. 
T Consensus 483 ~~~~tat~v~--~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~~~~~~~~~DV~V~~~ 560 (763) T protein:vir:95 483 SYGDVAAGIR--GVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIKREDLKGNFDLEVDIS 560 (763) T ss_pred cccchhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCccccccHHHhcCCcceEEecc Confidence 2 223333 2333333333444455555655555554444 321 11111 12222222 Q ss_pred CCCCCCH-HHHHHHHHH----HhccCChHHH---H----HhCC---CCC---------CHH----H--HHHHHHHHHHHh Q lcl|NC_010179. 402 RTKVEDS-LTKAQIVST----VANYSSKEAV---A----KANP---IVD---------DWQ----Q--ELKDLAKDREEN 451 (469) Q Consensus 402 ~~~p~d~-~e~~~~~~k----l~g~iS~et~---~----~~l~---~v~---------d~~----~--E~eri~~E~~~~ 451 (469) . .+. .+.+..+.. +...++.... + +... ++. ++. + +.++++.+.++. T Consensus 561 ~---as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~~~~~~~~lr~~q~~~d~~~q~qaqle~~~~q~e~~~~ 637 (763) T protein:vir:95 561 T---AEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKRMPKLAHDLRTWQPQPDPVQEQLKQLAVEKAQLENEEL 637 (763) T ss_pred c---chHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhchhhhHHHHHhcCCCccchhhhHHHHHHHHHHHHHHHH Confidence 1 121 222222222 2112222111 1 1110 000 000 0 000000000000 Q ss_pred hhhHhhcccCCCCCCCCC Q lcl|NC_010179. 452 DPYANQADELNGKGVDDE 469 (469) Q Consensus 452 ~~~~~~~~~~~~~~~~de 469 (469) ....+..+. .......| T Consensus 638 ~akaq~~qa-qa~~~~aq 654 (763) T protein:vir:95 638 RSKIRLNDA-QAQKAMAE 654 (763) T ss_pred HHHHHHHHH-HHHHHHHH Confidence 000000000 00000000 No 104 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=99.25 E-value=5e-11 Score=77.03 Aligned_cols=444 Identities=11% Similarity=0.032 Sum_probs=215.9 Q ss_pred CC--HHHHHHHH--------------HHHHHHHHHHHH---HHHHHHHHhccCCcccccccchhhhcccccccccccCcc Q lcl|NC_010179. 
1 ME--LDALKKLI--------------RNTSTSRNDLIN---NYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADN 61 (469) Q Consensus 1 ~~--~~~~~~~i--------------~~~~~~~~~~~~---~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 61 (469) |. +..+.+++ ..+....+.|.. .-+++.+|-.. .+.+. .+..+... .+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~r~~~~~~w~e~~~yi~~-------~~tr~--t~~~~~~w----~~ 67 (599) T protein:vir:31 1 MSTDIKTLQKMLEGRDDDRAFIDELVVLFTNMENARAQKDREDKELMDYIDA-------TDTRK--TSNSKLPF----KN 67 (599) T ss_pred CccchHHHHHHhhccCchHHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhh-------hcccc--cccCCCCc----cc Confidence 22 22333332 233222222222 22344444221 11111 00101010 14 Q ss_pred eeccchHHHHHHHHHHhhhcCC------eee---ccC--chhhHHHHHHHHhc-----cHHHHHHHHHHHHHhCCeEEEE Q lcl|NC_010179. 62 RIPSNFYQLLVDQEAGYIASVF------PDI---DVG--KDADNKKILDVLGD-----DRALTLNSLLVDSSNAGRAWLH 125 (469) Q Consensus 62 ri~~n~~k~iv~~~~~~l~g~p------~~~---~~~--~~~~~~~l~~~~~~-----n~~~~~~~~~~~~~~~G~~~~~ 125 (469) ++.+|-.--+++....++++-- ..+ ..+ .....+.++.+.++ ++......+..+.+.+|.++.. T Consensus 68 s~t~~k~~~~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat 147 (599) T protein:vir:31 68 STTINKLAHLHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAH 147 (599) T ss_pred ccchHHHHHHHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEe Confidence 5667777778888888776521 111 112 12233445555443 3444556677889999988887 Q ss_pred EEEc------CCC-------ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC------C---------------- Q lcl|NC_010179. 126 YWID------EDN-------NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE------A---------------- 170 (469) Q Consensus 126 v~~d------~~~-------~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~------~---------------- 170 (469) +-+. 
++| .|++..++|..++|--+.+......+.+|.+.....- + T Consensus 148 ~~~er~~~~~~d~~v~~~~~~P~~ervsP~Di~~Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~ 227 (599) T protein:vir:31 148 TRHVKRMTVTAENQVIKNYSGTVTERLSPSDVFWDVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREE 227 (599) T ss_pred eeEEEcceeecccccccccccceEEeecccceeeCCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhh Confidence 7532 122 4789999999988754444444444445544311000 0 Q ss_pred -----------------------ceEEEEEEEEcCCeEEEEEeecCceeecccccccccc-cccccc---cccccccccc Q lcl|NC_010179. 171 -----------------------GKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSY-DLSAGY---ETGQSNTLKH 223 (469) Q Consensus 171 -----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~---~~~~~~~~~~ 223 (469) .......+||.+..+..+..++. ..+........ -+.+.. .....+..|. T Consensus 228 ~~~~~~~~~d~~~~~~g~D~~~~d~~~~~~eY~~~~~VevLeywGd---~ydee~d~~~~~~ViTi~g~~~liR~e~np~ 304 (599) T protein:vir:31 228 RRTIREALADGYNGRRKFDSLHKKGYGSMMNYINEGVVEVLTFMGD---FYDEENDELWNNYEITVIDRKIIGRKQSKDT 304 (599) T ss_pred ccCCCccccchhhhhhhccccccccccchhhhcccchhhhhhhhhh---hhcccCCccccceEEEEecCcEEeecccCCC Confidence 00001111222111111111100 00000000000 011111 1112334456 Q ss_pred cCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeec Q lcl|NC_010179. 224 NFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKIN 298 (469) Q Consensus 224 ~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 298 (469) +.|..|++...- +..|.|.+..+.++++.+|.+.-.+.+.++.+..|+++..|... .......+++++.+. 
T Consensus 305 ~~g~~Pyvv~~~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~~~~dl~----~eD~~~~P~~v~~~~ 380 (599) T protein:vir:31 305 WDGSQNLHIAVYEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLKKVGDVR----EKGMRGGPNHVFEVE 380 (599) T ss_pred CCCCCCeEEEEeeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccccccccccc----ccCccCCCCcceeec Confidence 668889886643 34588999999999999999999999999999999888777411 111112245555543 Q ss_pred ccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_010179. 299 NAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFEHAI- 375 (469) Q Consensus 299 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l- 375 (469) +.+++.++.++.+.......+..+...+-..|+.|..+.+. .|..++..+..+..+.-....++.+.|..++ T Consensus 381 -----d~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e~~~l 455 (599) T protein:vir:31 381 -----ETGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFERELL 455 (599) T ss_pred -----CCCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHHHHHH Confidence 34568888888887777788888888888899999876653 3456777777777777777777778887765 Q ss_pred HHHHHHHHHHh----cccC------CC-----ccc-------ceEEeCCCCCCCHHHHHHHHHHHhcc------------ Q lcl|NC_010179. 376 NELVRAIMRYL----NFSD------AD-----KRH-------ISQHWTRTKVEDSLTKAQIVSTVANY------------ 421 (469) Q Consensus 376 ~~~~~~i~~~~----~~~~------~~-----~~~-------i~i~f~~~~p~d~~e~~~~~~kl~g~------------ 421 (469) +.+++-+..+. ...+ .+ ..+ -...+.+--..-..+..+.++++.++ T Consensus 456 epll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~~~P~ 535 (599) T protein:vir:31 456 TPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAALAPH 535 (599) T ss_pred HHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCccchh Confidence 33555333322 1110 00 011 11222221122345555655554322 Q ss_pred CChHHH---HHh---C--CCC--CCH---HHH-HHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
422 SSKEAV---AKA---N--PIV--DDW---QQE-LKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 422 iS~et~---~~~---l--~~v--~d~---~~E-~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +|++.. +.. + +.+ ..+ +++ +-++.+.+-+....-+..++.-++...|+ T Consensus 536 ~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~ 597 (599) T protein:vir:31 536 MSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTTDT 597 (599) T ss_pred hHHHHHHHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCccc Confidence 334222 221 1 111 111 111 22222221111221222333334444444 No 105 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=99.22 E-value=4.9e-10 Score=71.65 Aligned_cols=424 Identities=8% Similarity=-0.022 Sum_probs=208.1 Q ss_pred CCHHHHHHHH--HHHHHHHHHHHHHHHHHH----HHhccCCcccccccchhhhcccccccccccCc-ceeccchHHHHHH Q lcl|NC_010179. 1 MELDALKKLI--RNTSTSRNDLINNYKKSV----DYYENKTDITTRNNGKPKVSKEGKKDPLRSAD-NRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~~~~~~~~i--~~~~~~~~~~~~~~~~~~----~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~-~ri~~n~~k~iv~ 73 (469) |.-.-+..+. ...... .....|...- +-..|-+......+.. ...... .-..++. .-...+|++-+|+ T Consensus 1 ~~~p~~~~~~~~~~~~~~--~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~--~~~~~~-~lr~RaRdl~rNn~~a~~av~ 75 (533) T protein:vir:34 1 MKTPTIPTLLGPDGMTSL--REYAGYHGGGSGFGGQLRSWNPPSESVDAA--LLPNFT-RGNARADDLVRNNGYAANAIQ 75 (533) T ss_pred CCCchhhhhhcccccchH--HHHHhhhhccCCCCCcccccccCCCCHHHH--HHHHHH-HHHHHHHHHHhcChHHHHHHH Confidence 1111111110 000000 0111111100 0001100000000000 000000 0000000 0013589999999 Q ss_pred HHHHhhhcCCeeeccC------------chhhHHHHHHHHhc---------------cHHHHHHHHHHHHHhCCeEEEEE Q lcl|NC_010179. 74 QEAGYIASVFPDIDVG------------KDADNKKILDVLGD---------------DRALTLNSLLVDSSNAGRAWLHY 126 (469) Q Consensus 74 ~~~~~l~g~p~~~~~~------------~~~~~~~l~~~~~~---------------n~~~~~~~~~~~~~~~G~~~~~v 126 (469) ..+.++.|..++..+. ++..++.+...|.. +|...+..+.+.....|.+++.. 
T Consensus 76 ~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~f~~~ 155 (533) T protein:vir:34 76 LHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQA 155 (533) T ss_pred HHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHHHHHHHHHHHHhCCceEEEe Confidence 9999999988877652 22333444444421 12222334567788999999988 Q ss_pred EEcCCC----ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccc Q lcl|NC_010179. 127 WIDEDN----NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYN 202 (469) Q Consensus 127 ~~d~~~----~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (469) .+.+.+ .+++..++|+.+---++......+..+|.+ +..|..+.+++-...+.+...+. T Consensus 156 ~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~----d~~Gr~~aY~i~~~~~~~~~~~~------------- 218 (533) T protein:vir:34 156 TWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGVQI----NDSGAALGYYVSEDGYPGWMPQK------------- 218 (533) T ss_pred eeccCCCCccceEEEEechhhcCCCCCCCCCCceEeeeEE----CCCCCeEEEEEeecCCCCccccc------------- Confidence 776553 357899999886533332223445555643 33343333322111111000000 Q ss_pred ccccccccccccccccccccccCCccc---EEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE Q lcl|NC_010179. 203 IITSYDLSAGYETGQSNTLKHNFGRVP---FIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL 274 (469) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~ 274 (469) +. ..+ .+..+| |+|+.. ...|.|+|..++..+..++.............+.=..++ T Consensus 219 ------------~~---~~~-~~~~v~a~~VlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi 282 (533) T protein:vir:34 219 ------------WT---WIP-RELPGGRASFIHVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATI 282 (533) T ss_pred ------------cc---eee-eeeccChhHeeeeccccCCCcccCCchHHHHHHHHHHHHHHHHHHHHHHHHhhhheeee Confidence 00 000 011122 444433 246999999988888887776555444444333222333 Q ss_pred ecCCcc-------------cch-hh--------------hhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHH Q lcl|NC_010179. 
275 TNYGGA-------------SLK-QF--------------MNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKI 326 (469) Q Consensus 275 ~g~~~~-------------~~~-~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 326 (469) +...+. ... .. ...+....+..+.+ +.++++++++.+...+..+... T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p-----Ge~i~~~~~~~p~~~~~~f~~~ 357 (533) T protein:vir:34 283 ESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMP-----GDSLNLQTAQDTDNGYSVFEQS 357 (533) T ss_pred ecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCC-----CCeeeecCCCCCCCCHHHHHHH Confidence 321100 000 00 00123333333333 3468898888888899999999 Q ss_pred HHHHHHHHhCCCCcCc-cccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHH--HHhc----ccCC---Cc-- Q lcl|NC_010179. 327 TRDNIFLFGQGIDPAN-FESSNASGVAIKMLYSHLELKAAKTQTYFEHAIN-ELVRAIM--RYLN----FSDA---DK-- 393 (469) Q Consensus 327 l~~~i~~~s~~p~~~~-~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~--~~~~----~~~~---~~-- 393 (469) +.+.|..-.++|--.. ..+++.|-.+.+..+......+...+..|...+- .+++..+ .++. .+.. ++ T Consensus 358 ~lr~iAaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~ 437 (533) T protein:vir:34 358 LLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQE 437 (533) T ss_pred HHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchh Confidence 9999999888774333 3456777777777777777777776666655432 2222222 1222 1111 11 Q ss_pred ---ccceEEeCCC--CCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc-----cC Q lcl|NC_010179. 394 ---RHISQHWTRT--KVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD-----EL 461 (469) Q Consensus 394 ---~~i~i~f~~~--~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-----~~ 461 (469) ....+.|..+ .-.|....+++...+ +|+.|.+..+...| .|+++.++++.+|++......-... .. 
T Consensus 438 ~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G--~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~ 515 (533) T protein:vir:34 438 ARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRETMERRAAGLKPPAWAAAAF 515 (533) T ss_pred hHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCc Confidence 1245677544 334777777777664 79999999999987 4888999999988877654422111 11 Q ss_pred CCC---CCCCC Q lcl|NC_010179. 462 NGK---GVDDE 469 (469) Q Consensus 462 ~~~---~~~de 469 (469) ..+ ..+++ T Consensus 516 ~s~~~~~~~~~ 526 (533) T protein:vir:34 516 ESGLRQSTEEE 526 (533) T ss_pred cCCCCCCCCCC Confidence 111 11111 No 106 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=99.21 E-value=5.2e-10 Score=71.48 Aligned_cols=397 Identities=9% Similarity=-0.002 Sum_probs=206.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) .+.+ ..+..-..... .+..+.++ ..+|++-+|+..+..+. T Consensus 46 ~s~~---~~~~~~~~~lr------~RaRdL~r-------------------------------Nn~~a~~av~~~~~nvV 85 (553) T protein:vir:63 46 RSPD---ALINPLKRIAD------ARGRDMAD-------------------------------NDGFTNGAVGYQRDSIV 85 (553) T ss_pred CChH---HHHHHHHHHHH------HHHHHHHh-------------------------------cChHHHHHHHHHHHhhc Confidence 1111 11111000000 01111122 24889999999999999 Q ss_pred cCCeeeccC-------------chhhHHHHHHHHh---c------------cHHHHHHHHHHHHHhCCeEEEEEEEcCC- Q lcl|NC_010179. 81 SVFPDIDVG-------------KDADNKKILDVLG---D------------DRALTLNSLLVDSSNAGRAWLHYWIDED- 131 (469) Q Consensus 81 g~p~~~~~~-------------~~~~~~~l~~~~~---~------------n~~~~~~~~~~~~~~~G~~~~~v~~d~~- 131 (469) |..++..+. 
++..++.+...|+ + +|...+..+.+.....|.++++..+.+. T Consensus 86 G~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~~~~~~~~~ 165 (553) T protein:vir:63 86 GAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLGVVGYVKTGEVLATAEWDRAA 165 (553) T ss_pred cCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHHHHHHHhCCceEEEeeeccCC Confidence 988776542 1223333433332 1 1222233456778899999988766544 Q ss_pred C---ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccc Q lcl|NC_010179. 132 N---NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYD 208 (469) Q Consensus 132 ~---~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (469) | .+++..++|+.+-.-++......+..+|.+ +..|....+++--..+............+..+..+ T Consensus 166 ~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GVE~----d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~------- 234 (553) T protein:vir:63 166 NRPYATCFQMVSTDRLSNPYQQLDTPTLRRGVQY----DKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQS------- 234 (553) T ss_pred CCcccceEEEechhhcCCCCCCCCCCeeEeeeEE----CCCCceEEEEeeccCCCccccccccccceeeeccc------- Confidence 2 357889999876544433333445666643 33444443332221222211111111110000000 Q ss_pred ccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC-cccc Q lcl|NC_010179. 209 LSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG-GASL 282 (469) Q Consensus 209 ~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~-~~~~ 282 (469) ...+.-=|+|+. ....|.|+|..++..+..++.....-.......+.=..+++... .+.. 
T Consensus 235 --------------~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~daeL~~a~i~A~~a~fi~~~~~~~~~ 300 (553) T protein:vir:63 235 --------------KPWGRRQVIHILEPREPDQSRGIADIVSGLKDMRMAKRFKEMSLQNAVINASYAAAIESELPPEFI 300 (553) T ss_pred --------------cccChhHheecccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCChhhh Confidence 000111123322 23469999999998888887765554444444333223333211 1000 Q ss_pred h-----------------------------hhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 283 K-----------------------------QFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFL 333 (469) Q Consensus 283 ~-----------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~ 333 (469) . .....+....+..+.++ -++++++++.+...+..+...+.+.|.. T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG-----e~i~~~~p~~p~~~~~~F~~~~lr~iaa 375 (553) T protein:vir:63 301 HSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPG-----TKLNLKPMGTPGGVGSEFEASLNRHLAS 375 (553) T ss_pred hhhcccccccccccccccccccccccccccccceeecCceeeecCCC-----CeeeecCCCCCCCCHHHHHHHHHHHHHh Confidence 0 00112233333333333 4688988887888999999999999999 Q ss_pred HhCCCCcCc-cccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH--HHhc----ccCCC-----------cc Q lcl|NC_010179. 334 FGQGIDPAN-FESSNASGVAIKMLYSHLELKAAKTQTYFEHAINE-LVRAIM--RYLN----FSDAD-----------KR 394 (469) Q Consensus 334 ~s~~p~~~~-~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~-~~~~i~--~~~~----~~~~~-----------~~ 394 (469) -.++|--.. ..++++|-.+.+..+..........+..|...+-+ +++..+ .++. ..+.. .. T Consensus 376 glGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a 455 (553) T protein:vir:63 376 AFGMSYEEFTRDFSKANYSSIQAGIAMTRRFLEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEA 455 (553) T ss_pred hcCCCHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhh Confidence 888774333 34666777777777777766666666666554333 333222 2222 11110 11 Q ss_pred cceEEeCCCCC--CCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc----------- Q lcl|NC_010179. 
395 HISQHWTRTKV--EDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD----------- 459 (469) Q Consensus 395 ~i~i~f~~~~p--~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~----------- 459 (469) .+.+.|..+-. .|....+++.... +|+.|.+..+...| .|+++-++++.+|++.........+ T Consensus 456 ~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~ 533 (553) T protein:vir:63 456 LSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARLG--GDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGR 533 (553) T ss_pred hhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCc Confidence 24577765544 4777777777664 78999999999997 4888888888888776544322111 Q ss_pred ----------cCCCCCCCCC Q lcl|NC_010179. 460 ----------ELNGKGVDDE 469 (469) Q Consensus 460 ----------~~~~~~~~de 469 (469) ....++.+.| T Consensus 534 ~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 534 DAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred ccCCCCCCCCCCCCcccccC Confidence 1111111111 No 107 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.19 E-value=3.4e-10 Score=72.49 Aligned_cols=394 Identities=13% Similarity=0.060 Sum_probs=192.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |..+-.-.....+-...... +-..+..||.... ...++- .. 
-+ -.+.+++.+|+..+.-++ T Consensus 68 ~a~d~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~l--~a-------------~Y-~~~~l~r~iVd~~A~d~~ 128 (537) T protein:vir:10 68 MAMDGLDVEGGTFSAYANPN--LSEGLVLWYAQQA-FIGHQM--CA-------------LI-ATHWLVNKACSQMPRDAM 128 (537) T ss_pred hhccccccchhhhhhhcccc--ccchhhhhccccC-CccHHH--HH-------------HH-HhCchhhhhhhhhhHHhh Confidence 22221111111111110000 0011112222211 111100 00 00 136889999999999999 Q ss_pred cCCeeeccCchh-----hHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcC-CCceE----------------EE Q lcl|NC_010179. 81 SVFPDIDVGKDA-----DNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDE-DNNFR----------------YG 137 (469) Q Consensus 81 g~p~~~~~~~~~-----~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~-~~~~~----------------i~ 137 (469) -+++.+++++++ ..+.+...|++ +....+.++.+.+..+|.+++++..+. ++... +. T Consensus 129 r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i~v~~~D~~~~~~Pl~~~~i~kg~~k~l~ 208 (537) T protein:vir:10 129 RKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALFKVDSPDPYYYEKPFNIDGVMPGAYKGIV 208 (537) T ss_pred cCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEeecCcCCcccccccccccccccceeEEE Confidence 999999886532 22345555544 345668888899899999988876643 22211 22 Q ss_pred EEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccc Q lcl|NC_010179. 138 IIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQ 217 (469) Q Consensus 138 ~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (469) +++|..+.|...... ..+..... +...+.|.-....+ .. T Consensus 209 vidp~~~~~~~~~~~------------~~dp~sp~-fg~P~~y~v~g~~i----------------------------H~ 247 (537) T protein:vir:10 209 QIDPYWCAPLLDAQA------------SSNPVSMH-FYEPTYWLINGKKY----------------------------HR 247 (537) T ss_pred Eechhhcccccchhh------------hccCCccc-cCCceeeeecCeEe----------------------------cc Confidence 333333332210000 00000000 00001110000000 00 Q ss_pred cccccccCC-cccEEEec-CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchh-hhhhh----- Q lcl|NC_010179. 
218 SNTLKHNFG-RVPFIEFP-KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQ-FMNDL----- 289 (469) Q Consensus 218 ~~~~~~~~g-~vPvv~~~-n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~-~~~~~----- 289 (469) .... |.-| .+|-..-+ ++-.|.|.++.+.+-+..++.+.-..+..+..+..+.+.+.|.......+ ....+ T Consensus 248 SRli-~f~g~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~~~~~~~~r~~~~~~ 326 (537) T protein:vir:10 248 SHLA-IYINDEVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLANKQQFDETMSWWTA 326 (537) T ss_pred eeEE-EecCCCCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhcCHHHHHHHHHHHHh Confidence 0000 0000 01111111 12248999999999999999999888888888888887776642211111 11111 Q ss_pred --hhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C-CccHHHHHHHHHHHHHH Q lcl|NC_010179. 290 --REYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S-NASGVAIKMLYSHLELK 363 (469) Q Consensus 290 --~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-~~Sg~Al~~~~~~l~~k 363 (469) ...+++.+... .-+|-+.+.+.+.....++.+...|...+++|-.-..+. | |.||..=...|...+ T Consensus 327 ~r~n~g~~~id~e------~e~~e~~~~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I-- 398 (537) T protein:vir:10 327 TRDNYQVRVVDKD------NEDVVQIDTTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEEC-- 398 (537) T ss_pred hcCCcceeEecCC------CceeEEEeccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHH-- Confidence 11233333321 124444556777788899999999999999997644332 2 356776554555444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHH-------HH--hccCChHHHHHhCCCC Q lcl|NC_010179. 364 AAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVS-------TV--ANYSSKEAVAKANPIV 434 (469) Q Consensus 364 ~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~-------kl--~g~iS~et~~~~l~~v 434 (469) +.++..++..+++++.+++..... ...+++++|++-...|++|.|++.. 
++ +|++|.+++...|..- T Consensus 399 -~~~Qe~l~p~l~~l~~ll~~~~~~---~~~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~ 474 (537) T protein:vir:10 399 -ESTQDDMRPLIDRHHQLVCRSHLR---KRIRVKVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMD 474 (537) T ss_pred -HHHHHHHHHHHHHHHHHHHHhcCC---CCcceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhcc Confidence 344445789999999998875432 2446899999999999999887643 33 5789998887775321 Q ss_pred ---------CCHH-HHHHHHHHHHHHhhhhHhhcccC------CCCCCCCC Q lcl|NC_010179. 435 ---------DDWQ-QELKDLAKDREENDPYANQADEL------NGKGVDDE 469 (469) Q Consensus 435 ---------~d~~-~E~eri~~E~~~~~~~~~~~~~~------~~~~~~de 469 (469) .... +..+....+.+ ..+ .....+. .+....++ T Consensus 475 ~~~g~~~l~~~~~~ed~e~~~~~~~-~~~-~~~~~~~~~~~~~~~~~~~~~ 523 (537) T protein:vir:10 475 PTLGFTSITPAMRPTDAEDIDVDDE-GKP-VRIIEDQPAPSEMFGATSSGE 523 (537) T ss_pred CccccccccCCCChhhhhcccCCcc-CCc-CCCCCCCCCccccCCCCcccc Confidence 1111 11111111100 000 0000000 00000000 No 108 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.19 E-value=7.4e-10 Score=70.64 Aligned_cols=427 Identities=9% Similarity=0.005 Sum_probs=213.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhh-cccccccccccCc-ceeccchHHHHHHHHHHh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKV-SKEGKKDPLRSAD-NRIPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~-~~~~~~~~~~~~~-~ri~~n~~k~iv~~~~~~ 78 (469) ++-= +.-...............|+-...=..+..-.......-... .......-..++. .-...++++-+|+..+.. 
T Consensus 11 ~dr~-i~~~~~~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~~~~n 89 (505) T protein:vir:96 11 AQRM-VNWAWYRYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLASLVQRAREQSINNPYAKRFYQLLKNN 89 (505) T ss_pred hhcc-cchhhhhhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH Confidence 0000 000000000000111112221110000000000000000000 0000000000000 001257889999999999 Q ss_pred hhc-CCeeeccC--------chhhHHHHHHHHhc-----c--------HHHHHHHHHHHHHhCCeEEEEEEEcCCC--ce Q lcl|NC_010179. 79 IAS-VFPDIDVG--------KDADNKKILDVLGD-----D--------RALTLNSLLVDSSNAGRAWLHYWIDEDN--NF 134 (469) Q Consensus 79 l~g-~p~~~~~~--------~~~~~~~l~~~~~~-----n--------~~~~~~~~~~~~~~~G~~~~~v~~d~~~--~~ 134 (469) +.| ..+++.+. +++.++.+...|.. + |...+..+.+.....|.+++.+.....+ .+ T Consensus 90 vVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~~~~ 169 (505) T protein:vir:96 90 VIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHLWMETLARDGEVLVREHRGYPNKWGY 169 (505) T ss_pred hcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHHHHHHHhhCCceEEEEeecCCCCcce Confidence 998 57766542 45555665555542 1 2222334567788899999887665444 25 Q ss_pred EEEEEccceeEEEEeC--CCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccc Q lcl|NC_010179. 135 RYGIIQPDQITPVYAT--TLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAG 212 (469) Q Consensus 135 ~i~~~~p~~~~~~~d~--~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (469) ++..++|+.+---++. .....+..+|++ +..|..+.+++---.+....... T Consensus 170 ~lqliepd~l~~~~n~~~~~~~~i~~GIe~----d~~Gr~~aY~i~~~hPgd~~~~~----------------------- 222 (505) T protein:vir:96 170 ALQILECDRLDLNYNADLQNGNRIRMSIEL----DAWERPVAYHLLVNHPGDNSYCY----------------------- 222 (505) T ss_pred EEEEechhhcCCCCCcccCCcCeEEeceEE----CCCCceEEEEEeecCCCcccccc----------------------- Confidence 7889999886422211 112234555543 33344333222111111110000 Q ss_pred ccccccccccccCCccc---EEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCccc--- Q lcl|NC_010179. 
213 YETGQSNTLKHNFGRVP---FIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGAS--- 281 (469) Q Consensus 213 ~~~~~~~~~~~~~g~vP---vv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~--- 281 (469) ......+.+|| |+|+.. ...|.|+|..++..+..++.............+.=..+++...+.. T Consensus 223 ------~~~~~~~~rvpa~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~ 296 (505) T protein:vir:96 223 ------HYAGQTYERVPADEIIHTFVPWRPHQNRGIPWTHASMVELHHIGEYRKSEMIAAELGAKKVGFYEQDPEAYDQP 296 (505) T ss_pred ------ccccccccccCHhHhhhhhcccCCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCccCCCc Confidence 00001233344 344432 2369999999999888888776665555555444334455422111 Q ss_pred ----chhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCc-cccCCccHHHHHHH Q lcl|NC_010179. 282 ----LKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPAN-FESSNASGVAIKML 356 (469) Q Consensus 282 ----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~g~~Sg~Al~~~ 356 (469) .+.....+....+..+.++ -++++++++.+...+..++..+.+.|..-..+|--.. ..+++.|-.+.+.. T Consensus 297 ~~~~~~~~~~~l~pG~i~~L~pG-----e~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYSS~R~~ 371 (505) T protein:vir:96 297 PEDDQGEIVEEVEAGTYQLLPYG-----IRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFSSLRSG 371 (505) T ss_pred cccccCccccccCCceeeecCCC-----CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHH Confidence 1111223333333334433 4689998888889999999999999999888874333 34566777788888 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHH--hc----ccCCC-cccceEEeCCCCC--CCHHHHHHHHHHH--hccCCh Q lcl|NC_010179. 357 YSHLELKAAKTQTYFEHAI-NELVRAIMRY--LN----FSDAD-KRHISQHWTRTKV--EDSLTKAQIVSTV--ANYSSK 424 (469) Q Consensus 357 ~~~l~~k~~~~~~~~~~~l-~~~~~~i~~~--~~----~~~~~-~~~i~i~f~~~~p--~d~~e~~~~~~kl--~g~iS~ 424 (469) +......+...+..|...+ +.+++..+.. +. ..+.+ .....+.|..+-- .|....+++...+ +|+.|. 
T Consensus 372 ~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~ 451 (505) T protein:vir:96 372 ELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKNRTRSR 451 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHcCCCCH Confidence 8877777777777666533 3344433322 11 11111 1224577755432 4777777777664 799999 Q ss_pred HHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhh------------cccCCCCCCCC Q lcl|NC_010179. 425 EAVAKANPIVDDWQQELKDLAKDREENDPYANQ------------ADELNGKGVDD 468 (469) Q Consensus 425 et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~------------~~~~~~~~~~d 468 (469) +..+...| .|+++.++++.+|++......-. .++..+...|| T Consensus 452 ~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 452 SSIIRAAG--DDPEDVFDEIAWEEQLMRDKGVNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred HHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 99999987 48888899999888765443211 01111111111 No 109 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.17 E-value=1.6e-10 Score=74.30 Aligned_cols=380 Identities=13% Similarity=0.027 Sum_probs=185.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeeeccC Q lcl|NC_010179. 10 IRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDIDVG 89 (469) Q Consensus 10 i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~ 89 (469) ++.+. -+-+....-|.++-..+.... ...+.. .. .-. 
-.+.+++.+|+..+.-++.+.+.++++ T Consensus 1 ~~~~~---------~d~~~~~~~~~~~~~~~~~~~---~~~~~~--l~-a~Y-~~~~l~~~~Vd~~aed~~r~g~~i~g~ 64 (427) T protein:vir:10 1 MKIVK---------HDGYNDIFNGGADGSPKPFFM---SDASYH--VG-SFY-NDNATAKRIVDVIPEEMVTAGFKMSGV 64 (427) T ss_pred CCccc---------cchHHHHhhcCCCCcccCccc---cCchHH--HH-HHH-HcCchhhhhhccchHHhhcCCccccCc Confidence 11111 111122223322211111100 000000 00 000 136778899999999999999999875 Q ss_pred chhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc----------e-EEEEEccceeEEEEeCCCCCceE Q lcl|NC_010179. 90 KDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN----------F-RYGIIQPDQITPVYATTLDNKLL 157 (469) Q Consensus 90 ~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~----------~-~i~~~~p~~~~~~~d~~~~~~~~ 157 (469) ++. +.+...|++ +....+.++.+.+..+|.+++++-++.... + .+.++++.++.|-.-..++..+- T Consensus 65 ~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~ 142 (427) T protein:vir:10 65 KDE--KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPR 142 (427) T ss_pred cHH--HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccccccCCCcceeEEEEechhcccccccccCccccc Confidence 432 445555554 556678899999999999998886643321 1 13333333322211100000000 Q ss_pred EE-EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE-ecC Q lcl|NC_010179. 158 GV-LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE-FPK 235 (469) Q Consensus 158 ~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~-~~n 235 (469) ++ ..+|......+. ....+|.. .+..+.. ..+|-.. 
..+ T Consensus 143 fg~P~~y~v~~~~~~---~~~~iH~S-Rli~~~g-----------------------------------~~~p~~~~~~~ 183 (427) T protein:vir:10 143 YGEPEIYKVSPGDNM---QPYLIHHS-RVFIADG-----------------------------------ERVAQQARKQN 183 (427) T ss_pred cCcceEEEEecCCCC---cceEEccc-cEEEecC-----------------------------------CCchhhhcccC Confidence 00 001111000000 00000000 0000000 0011110 012 Q ss_pred CccccccHH-HHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc-----ccchhhhhhh-------hhcceeeecccCC Q lcl|NC_010179. 236 NKYRLAELN-KYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG-----ASLKQFMNDL-------REYKSIKINNAGN 302 (469) Q Consensus 236 ~~~g~~~~~-~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~-----~~~~~~~~~~-------~~~~~~~~~~~~~ 302 (469) +..|.|.+. .+.+-+..++.+.-.....+..+....+.+.|... .......... ..++.+.+.. T Consensus 184 ~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~--- 260 (427) T protein:vir:10 184 QGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA--- 260 (427) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeec--- Confidence 334677664 46677778888887777777777777777666421 1111111111 1122333321 Q ss_pred CCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc--C--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 303 GDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES--S--NASGVAIKMLYSHLELKAAKTQTYFEHAINEL 378 (469) Q Consensus 303 ~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g--~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~ 378 (469) .+.+++ +.+.+.+.....++....+|...+++|-.-..+. 
+ |.||..=..-|...+.- ..+..+...++++ T Consensus 261 -~~e~~e--~~~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~--~Qe~~l~p~l~~l 335 (427) T protein:vir:10 261 -ETEEYD--VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDR--KREEDYRPLLEFL 335 (427) T ss_pred -CCCcee--EEecccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHH--HHHHHHHHHHHHH Confidence 112233 4456777888889999999999999997544332 2 35677544444444332 2345688888988 Q ss_pred HHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHH-------H--hccCChHHHHHhCC----C--CCCHHHHHHH Q lcl|NC_010179. 379 VRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST-------V--ANYSSKEAVAKANP----I--VDDWQQELKD 443 (469) Q Consensus 379 ~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k-------l--~g~iS~et~~~~l~----~--v~d~~~E~er 443 (469) +.+++.- .+++++|++-...+++|.+++..+ + +|+++.+++.+.|- . +.+.. . T Consensus 336 ~~~i~~s--------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~----~ 403 (427) T protein:vir:10 336 LPFIVDE--------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGN----N 403 (427) T ss_pred HHHhhcC--------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCc----c Confidence 8887632 367899999999999999876543 2 46788887766541 1 11100 0 Q ss_pred HHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 444 LAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 444 i~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +..|. .+.....++..+++.+|| T Consensus 404 ~~~e~---~~~~~e~~p~~~e~~~d~ 426 (427) T protein:vir:10 404 INIRE---PEETTEPEPGLGEKLEDE 426 (427) T ss_pred ccccc---cchhcCCCCCCCCCCCCC Confidence 11110 001111233445555555 No 110 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.13 E-value=3.5e-10 Score=72.45 Aligned_cols=377 Identities=11% Similarity=0.049 Sum_probs=181.5 Q ss_pred HHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeeeccCchhhHHHHHHH Q lcl|NC_010179. 
21 INNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDIDVGKDADNKKILDV 100 (469) Q Consensus 21 ~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~ 100 (469) +.+.+-+...+-|-++--..... +..... .... .-+ -.+.+++.+|+..+.-++.+.+.+++++++ +.+..- T Consensus 1 ~~~~D~~~n~~~gg~~~~~~~~~-~~~~~~---~~l~-a~Y-~~~~l~~~~Vd~~aed~~r~g~~i~~~~~~--~~~~~~ 72 (422) T protein:vir:10 1 MVKTDSYANIFLGGSDGSEIYGS-LQNQAP---TILA-SLY-ADNALVRRIIDTIPETALAAGFHIDGIDDE--PAFWSR 72 (422) T ss_pred CccchhhHHHHcCCCCCccccCc-ccccCH---HHHH-HHH-HhChhhHHHHhhhhHHHhcCCccccCCCHH--HHHHHH Confidence 22222222233342210000000 000000 0000 001 246888999999999999999999876543 234444 Q ss_pred Hhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCC----------ce-EEEEEccceeEEEEeCCCCCceEEE-EEEEEeee Q lcl|NC_010179. 101 LGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDN----------NF-RYGIIQPDQITPVYATTLDNKLLGV-LRSYKQLD 167 (469) Q Consensus 101 ~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~----------~~-~i~~~~p~~~~~~~d~~~~~~~~~~-v~~~~~~~ 167 (469) |.+ +....+.++.+.+..+|.+++++-..+.. .+ .+.++++.++.|..-+.+...+-++ ..+|.... T Consensus 73 ~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~ 152 (422) T protein:vir:10 73 WDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITT 152 (422) T ss_pred HHHhhHHHHHHHHHHhhccccceEEEEEecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEec Confidence 433 45567888999999999999888663221 11 1333333333322100000000000 00111000 Q ss_pred cCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccE-EEecCCccccccHHH- Q lcl|NC_010179. 168 PEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF-IEFPKNKYRLAELNK- 245 (469) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv-v~~~n~~~g~~~~~~- 245 (469) ..+.. ...+|. ..+..+. + ..+|- ....++..|.|.+.. 
T Consensus 153 ~~~~~---~~~iH~-SRli~~~----------------------------------g-~~~p~~~~~~~~~~G~S~l~~~ 193 (422) T protein:vir:10 153 NESDM---FYDVHY-SRIHIID----------------------------------G-ERIPNVMRRQNDGWGRSVLSSD 193 (422) T ss_pred CCCCc---ceeecc-ceeEEeC----------------------------------C-CCchhhhcccCCcccchhHHHH Confidence 00000 000000 0000000 0 00111 112233457787875 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcc---c--chhhhhhh-------hhcceeeecccCCCCCCcceEEee Q lcl|NC_010179. 246 YKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGA---S--LKQFMNDL-------REYKSIKINNAGNGDKSGVDKLQI 313 (469) Q Consensus 246 v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~---~--~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~l~~ 313 (469) +.+-+..++.+.-.....+..+....+.+.|.... . ........ ...+.+.+.. .+.+++ +. T Consensus 194 ~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~----~~e~~e--~~ 267 (422) T protein:vir:10 194 ILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDA----ESEEYS--VL 267 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEec----CCcceE--EE Confidence 56888888888888888787777777776663211 1 11101111 1122232321 112233 34 Q ss_pred cCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc--C--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc Q lcl|NC_010179. 314 DIPVEARDDALKITRDNIFLFGQGIDPANFES--S--NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS 389 (469) Q Consensus 314 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g--~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~ 389 (469) +.+.+.....++.....|...+++|-.-..+. + |.||..-..-|...+.- ..+..++..+++++.+++. 
T Consensus 268 ~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~--~Qe~~l~p~l~~l~~~i~~----- 340 (422) T protein:vir:10 268 NSDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDR--KRNAELLPILEFLIPFIVN----- 340 (422) T ss_pred ecccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhcc----- Confidence 56677888889999999999999997544332 2 24666544444444332 2345678899999888764 Q ss_pred CCCcccceEEeCCCCCCCHHHHHHHHHHH---------hccCChHHHHHhCCCC-CC--HHHHH--HHHHHHHHHhhhhH Q lcl|NC_010179. 390 DADKRHISQHWTRTKVEDSLTKAQIVSTV---------ANYSSKEAVAKANPIV-DD--WQQEL--KDLAKDREENDPYA 455 (469) Q Consensus 390 ~~~~~~i~i~f~~~~p~d~~e~~~~~~kl---------~g~iS~et~~~~l~~v-~d--~~~E~--eri~~E~~~~~~~~ 455 (469) ..+++++|++-...+++|.|++..+. +|++|.+++.+.|-.. ++ ....+ +.+..++.. T Consensus 341 ---s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~~~~----- 412 (422) T protein:vir:10 341 ---AEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKINDGSVETEVTISETS----- 412 (422) T ss_pred ---cCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCCCCCccccchhhcC----- Confidence 13678999999999999988765442 4678887776655110 00 00001 001111100 Q ss_pred hhcccCCCCCCCC Q lcl|NC_010179. 456 NQADELNGKGVDD 468 (469) Q Consensus 456 ~~~~~~~~~~~~d 468 (469) .+...+..+| T Consensus 413 ---~~~~~~~~~d 422 (422) T protein:vir:10 413 ---NDPLEVPTDD 422 (422) T ss_pred ---CCCCCCCCCC Confidence 1111222222 No 111 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.12 E-value=3.3e-10 Score=72.58 Aligned_cols=404 Identities=12% Similarity=0.037 Sum_probs=186.1 Q ss_pred CCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 
1 MEL---DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~---~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) |+= +-...-..-+..-....... ..+..||....-+- .+ ... . + -.+.+++.||+..+. T Consensus 71 ~ds~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~f~g-yq-----l~a------l----Y-~~~~l~rkiVd~pAe 132 (765) T protein:vir:96 71 MDSAYGDGPTPAAKAAAGGQNPYVVP-TMLQDWYNSQGFIG-YQ-----ACA------I----I-SQHWLVDKACSMSGE 132 (765) T ss_pred ccccccccccchHHHhhhccCccchh-hHHHhhhcccCCcc-HH-----HHH------H----H-HhCchhhhhhhcchH Confidence 110 00000000000000000001 11222322211000 00 000 0 0 136888999999999 Q ss_pred hhhcCCeeeccCchhh----HHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC-CceEEEEEccce-------e Q lcl|NC_010179. 78 YIASVFPDIDVGKDAD----NKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED-NNFRYGIIQPDQ-------I 144 (469) Q Consensus 78 ~l~g~p~~~~~~~~~~----~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~-~~~~i~~~~p~~-------~ 144 (469) -++.+++.+++++++. .+.+...|++ +....+.++.+.+-.+|.+|+++-++.+ +...-..+++.. . T Consensus 133 Da~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~~~D~~~l~~PL~~~~I~kg~~kg 212 (765) T protein:vir:96 133 DAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVESDDPDYYEKPFNPDGIAPGSYKG 212 (765) T ss_pred HhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEecccCcchhhccccccccccceeeE Confidence 9999999998765433 2344444443 4456788899999999999987765432 211111122211 1 Q ss_pred EEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccccc Q lcl|NC_010179. 145 TPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (469) Q Consensus 145 ~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (469) +.++|+.--.. . .+..+ ..+.. .-.|+.+.. |... +......... 
- T Consensus 213 l~vldp~~~~~-~-~v~e~-~~Dp~------sp~fg~P~~---y~i~--------------------g~~IH~SRli--~ 258 (765) T protein:vir:96 213 ISQIDPYWAMP-Q-LTAES-TADPS------AEHFYEPDF---WIIS--------------------GKKYHRSHLV--V 258 (765) T ss_pred EEEechhhccc-c-cchhc-ccccc------ccccCccee---eeec--------------------CceeccceEE--E Confidence 11111100000 0 00000 00000 000111100 0000 0000000000 0 Q ss_pred CC--cccEEEec-CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcc-cchhhhhhh------h-hcc Q lcl|NC_010179. 225 FG--RVPFIEFP-KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGA-SLKQFMNDL------R-EYK 293 (469) Q Consensus 225 ~g--~vPvv~~~-n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~-~~~~~~~~~------~-~~~ 293 (469) |. .+|-..-+ ++-.|.|.++.+.+-+.+++.+.-.....+..+....+.+.+.... +.......+ + ..+ T Consensus 259 ~~g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n~g 338 (765) T protein:vir:96 259 VRGPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIANEDAFNARLAFWIANRDNHG 338 (765) T ss_pred ecCCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhccHHHHHHHHHHHHHhcCCce Confidence 00 01111101 1234899999999999999999888888888888777766553211 111111111 1 122 Q ss_pred eeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C-CccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 294 SIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S-NASGVAIKMLYSHLELKAAKTQT 369 (469) Q Consensus 294 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-~~Sg~Al~~~~~~l~~k~~~~~~ 369 (469) ++.+... -+|=+.+.+.+.+...++.+...|...+.+|-.-..+. | |.||..=..-|...+.- ..+. 
T Consensus 339 ~~~id~e-------e~~e~~s~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s--~Qe~ 409 (765) T protein:vir:96 339 VKVIGID-------ETMEQFDTNLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELES--IQEH 409 (765) T ss_pred eEEecCC-------cceeEEecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHH--HHHH Confidence 3333221 23334456777888999999999999999997544332 2 46777543344443332 3346 Q ss_pred HHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHH-------H--hccCChHHHHHhCC------CC Q lcl|NC_010179. 370 YFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST-------V--ANYSSKEAVAKANP------IV 434 (469) Q Consensus 370 ~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k-------l--~g~iS~et~~~~l~------~v 434 (469) .+...+++++.+++....+ + .+++++|++-...+++|.|++..+ + +|++|..++.+.+. +- T Consensus 410 ~l~p~le~L~~li~~s~~i---~-~d~~i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~ 485 (765) T protein:vir:96 410 IFDPLLERHYLLLAKSESI---D-VQLEIVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYN 485 (765) T ss_pred HHHHHHHHHHHHHHHhcCC---C-CcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCC Confidence 6788999999998875322 2 368999999999999998886543 2 57899888887652 11 Q ss_pred CCHHHHHHH---HHHHHHHhhhhHh-hcccCCCCCCCCC Q lcl|NC_010179. 435 DDWQQELKD---LAKDREENDPYAN-QADELNGKGVDDE 469 (469) Q Consensus 435 ~d~~~E~er---i~~E~~~~~~~~~-~~~~~~~~~~~de 469 (469) .....+.+. +..|..+..+... 
...+..+++.-++ T Consensus 486 ~l~d~~~e~~~~~~pe~~~~~~~~~~~~~~~~~e~~~~~ 524 (765) T protein:vir:96 486 RLTDDQAETEPGMSPENLAELEKAGAQSAKAKGEAERAE 524 (765) T ss_pred CCCccccccccCCCccccccccCCCcccccccCcccccc Confidence 101111110 1000000000000 0000000000000 No 112 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.12 E-value=2.3e-10 Score=73.41 Aligned_cols=377 Identities=14% Similarity=0.011 Sum_probs=186.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |.-+ ++....-+-+...+.+.-.. .++...+...-... .. ..-+ -.+.+++.+|+..+.-++ T Consensus 5 m~~~-------------~~~~~~~D~~~~~~~~~~g~-~~~~~~~~~~~~~~--~l-~~~Y-~~~~l~~~~Vd~~aed~~ 66 (435) T protein:vir:79 5 MSDK-------------VKAITKEDGYNEIFGSKDGT-FRPNAFYMQRAAFK--AL-SQFY-EEDGMARRIVDVIPEEMV 66 (435) T ss_pred cccc-------------cccchhhcchhhhhcccccc-cccCcccCCcCCHH--HH-HHHH-hcCchhhhhhccchHHhh Confidence 4444 11111112222223322111 00000000000000 00 0001 136888999999999999 Q ss_pred cCCeeeccCchhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc----------e-EEEEEccceeEEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN----------F-RYGIIQPDQITPVY 148 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~----------~-~i~~~~p~~~~~~~ 148 (469) .+.+.++++++. +.+...|++ +....+.++.+.+..+|.+++++-..+... + .+.+++|.++.|-. 
T Consensus 67 r~g~~i~g~~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~ 144 (435) T protein:vir:79 67 TPGFKVDGVKNE--KSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKMLKSPVKPGAQLEDIRVYDRYQITIHE 144 (435) T ss_pred cCCceecCCChH--HHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCcccccccCCceeeEEeechhhccchh Confidence 999998865432 345555554 455678899999999999988886532211 1 13333333322211 Q ss_pred eCCCCCceEEE-EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCc Q lcl|NC_010179. 149 ATTLDNKLLGV-LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGR 227 (469) Q Consensus 149 d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 227 (469) -+.++..+-++ ..+|......+. ....-|+-.. T Consensus 145 ~~~dp~sp~fg~P~~y~v~~~~~~----------------------------------------------~~~~iH~SRl 178 (435) T protein:vir:79 145 RETNARSVRYGEPKLYKISPGGDI----------------------------------------------PEFFVHYSRI 178 (435) T ss_pred hccCCcccccCcceEEEEecCCCC----------------------------------------------CceEEcceeE Confidence 00000000000 001111000000 0000111111 Q ss_pred c-------cEEE-ecCCccccccH-HHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcc-----cchhhhhh----- Q lcl|NC_010179. 228 V-------PFIE-FPKNKYRLAEL-NKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGA-----SLKQFMND----- 288 (469) Q Consensus 228 v-------Pvv~-~~n~~~g~~~~-~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~-----~~~~~~~~----- 288 (469) | |-.. ..++..|.|.+ +.+.+-+..++.+....+..+..+..+.+.+.|.... ........ T Consensus 179 i~~~g~~~p~~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~ 258 (435) T protein:vir:79 179 CIIDGERVSNEKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVD 258 (435) T ss_pred EEecCCcchhhhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHH Confidence 1 1111 11233467765 6788888888888888888887777777766653211 11111111 Q ss_pred --hhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C-CccHHHHHHHHHHHHH Q lcl|NC_010179. 
289 --LREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S-NASGVAIKMLYSHLEL 362 (469) Q Consensus 289 --~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-~~Sg~Al~~~~~~l~~ 362 (469) ...++.+.+... ...++.+ +.+.+.....++.....|...+++|-.-..+. | |.||..-..-|...+. T Consensus 259 ~~~~~~~~~~i~~~----~e~~e~~--~~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~ 332 (435) T protein:vir:79 259 DESGVGKAIGIDAT----DEEYEVL--NSDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLID 332 (435) T ss_pred HhcCCCCceeEecC----CcceEEE--ecccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHH Confidence 111333434321 1234443 45677888889999999999999997544332 2 3577765555555444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH---------hccCChHHHHHhC-- Q lcl|NC_010179. 363 KAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV---------ANYSSKEAVAKAN-- 431 (469) Q Consensus 363 k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl---------~g~iS~et~~~~l-- 431 (469) . ..+..++..+++++.+++.- .+++++|++-...|++|.|++..+. .|++|.+++...+ T Consensus 333 ~--~Qe~~l~p~l~~l~~li~~s--------~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~ 402 (435) T protein:vir:79 333 R--KRVEDYKPILEFLLPFMISE--------TEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRS 402 (435) T ss_pred H--HHHHHHHHHHHHHHHHhhcC--------CCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHH Confidence 3 23466788888888877642 3678999999999999888765442 4677776665433 Q ss_pred --CC---CCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 432 --PI---VDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 432 --~~---v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +. 
-.+...+++ | ......+...++.++| T Consensus 403 ~~~~~~~~~~~~~~~~----~------~~d~~~~~~~e~g~~~ 435 (435) T protein:vir:79 403 ICPDLKIMDNDNIELP----E------PEDLDPEPGQEGGLNK 435 (435) T ss_pred hccccCCCCcccccCC----c------cccCCCCCCCCCCCCC Confidence 11 011011110 0 0111112333444445 No 113 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=99.09 E-value=2.5e-09 Score=67.78 Aligned_cols=412 Identities=10% Similarity=0.050 Sum_probs=191.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHH--Hh----ccCCccccc------ccchhhhcccccccccccCcce-----e Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVD--YY----ENKTDITTR------NNGKPKVSKEGKKDPLRSADNR-----I 63 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~--Yy----~g~~~i~~~------~~~~~~~~~~~~~~~~~~~~~r-----i 63 (469) -++-+...+-.+-..+....+.|-..... |- .+-+..+.- ..........+....+ ..+. - T Consensus 15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~~~~~~~--~~~~l~a~Y~ 92 (532) T protein:vir:94 15 ATLQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSFVEATSW--PGFPTLALLA 92 (532) T ss_pred hhhhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccccccccccccccccc--chHHHHHHHH Confidence 23333333332222222222222111100 00 000000000 0000000000000000 0000 1 Q ss_pred ccchHHHHHHHHHHhhhcCCeeeccCchh-----hHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc---- Q lcl|NC_010179. 64 PSNFYQLLVDQEAGYIASVFPDIDVGKDA-----DNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN---- 133 (469) Q Consensus 64 ~~n~~k~iv~~~~~~l~g~p~~~~~~~~~-----~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~---- 133 (469) .+.+++.+|+..+.=++-+.++++++++. ..+.+...|+. +....+.++.+.+..+|.+++++-++.++. 
T Consensus 93 ~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~~~ 172 (532) T protein:vir:94 93 QLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSVPA 172 (532) T ss_pred cCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccccc Confidence 35667889999999898889999875432 22233333332 445677888899999999998876654331 Q ss_pred ---------------e-EEEEEccceeEEEE-eCCCCCceEEE-EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCce Q lcl|NC_010179. 134 ---------------F-RYGIIQPDQITPVY-ATTLDNKLLGV-LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDS 195 (469) Q Consensus 134 ---------------~-~i~~~~p~~~~~~~-d~~~~~~~~~~-v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (469) + .+.+++|..+.|-. +..+...+-++ ..+|... + ...+| +..+.+|.. T Consensus 173 ~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~---~-----g~~iH-~SRli~f~g----- 238 (532) T protein:vir:94 173 DAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIAT---S-----GKKIH-SSRIHTVVG----- 238 (532) T ss_pred cccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEc---c-----Ceeec-cceEEEecC----- Confidence 1 13344444443321 11111110000 0000000 0 00011 111111100 Q ss_pred eecccccccccccccccccccccccccccCCcccEEEec-CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE Q lcl|NC_010179. 196 TVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL 274 (469) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~ 274 (469) ..+|-...+ ++-.|.|.++.+.+-+..++.+.-..+..+..+....+.+ T Consensus 239 ------------------------------~~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~ 288 (532) T protein:vir:94 239 ------------------------------RPVGDMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLAT 288 (532) T ss_pred ------------------------------CCchhhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee Confidence 011211111 1224889999999999999998888887777777766554 Q ss_pred ecCCcc----cchhhhhhh------h-hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc Q lcl|NC_010179. 
275 TNYGGA----SLKQFMNDL------R-EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF 343 (469) Q Consensus 275 ~g~~~~----~~~~~~~~~------~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 343 (469) +.... ........+ + ..+++.+..+ ..+++ +...+.+.....++.+.+.|...+.+|-.-.. T Consensus 289 -~~a~~ls~~~~~~~~~r~~~~~~~~~n~g~~~id~~----~e~~e--~~~~~lsgl~~~l~~~~~~iAaa~~IP~t~Lf 361 (532) T protein:vir:94 289 -DMAQLLAPGGAQSLDARLQLFNLYRDNRNIGALDKG----TEEIQ--QTNTPLSGLDSLQAQSQEQMAAVSHIPLVKLL 361 (532) T ss_pred -chHHhhcchhHHHHHHHHHHHHhhcCCccceEEcCC----CceeE--EEecccCCHHHHHHHHHHHHHhHhCCCeeeee Confidence 32111 111111111 1 1123333221 12233 34466777888899999999999999976443 Q ss_pred cc---C-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHH-- Q lcl|NC_010179. 344 ES---S-NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST-- 417 (469) Q Consensus 344 ~~---g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k-- 417 (469) +. | |.||..=..-|...+. ...+..+...+++++.+++..... ....+++++|++-...+++|.+++..+ T Consensus 362 G~sp~GlnstGe~D~~~yyd~I~--s~Qe~~l~p~le~l~~~l~~s~~g--~~~~d~~~~f~pL~~~s~kEkAei~~~~a 437 (532) T protein:vir:94 362 GITPNGLNASSDGEIRVWYDFIA--GYQATNLTPLMEWIIDLIQLSEYG--QIDPGLAWEWSPLMELDDKELAEVRQLNA 437 (532) T ss_pred cCCcccccccchHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhcC--CCCCCceEEeCCCCCCCHHHHHHHHHHHH Confidence 32 1 3466654444544433 233456788899999888754321 122368899999888899988876533 Q ss_pred -----H--hccCChHHHHHhCCCC------CC-H-HHHH---HHHHHHHHHh---hhhHh-hcccCCCCCCCCC Q lcl|NC_010179. 418 -----V--ANYSSKEAVAKANPIV------DD-W-QQEL---KDLAKDREEN---DPYAN-QADELNGKGVDDE 469 (469) Q Consensus 418 -----l--~g~iS~et~~~~l~~v------~d-~-~~E~---eri~~E~~~~---~~~~~-~~~~~~~~~~~de 469 (469) + .|++|.+.+..++..- .+ + ..++ +.+.+|.+.. .+... 
...+...+..+|+ T Consensus 438 ~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 511 (532) T protein:vir:94 438 STDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELDDVEEIAKQLMAAALNPPATAPQTPNPQPDSEDDQ 511 (532) T ss_pred HHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccccccchhhhhcccccCCCCCCCCCCCCCCCCCCCC Confidence 2 5789998887765321 11 0 1112 2222221111 11111 1112222333344 No 114 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.08 E-value=2.5e-09 Score=67.71 Aligned_cols=422 Identities=9% Similarity=-0.022 Sum_probs=207.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHhccCCcccccccchhhhcccccccccccCc-ceeccchHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKK--------SVDYYENKTDITTRNNGKPKVSKEGKKDPLRSAD-NRIPSNFYQLL 71 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~--------~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~-~ri~~n~~k~i 71 (469) |..-.+.-+=..... .....|.. ...+..... ..+. .+..... .-..++. .-...+|++-+ T Consensus 1 ~~~~~~~~~~~~~~~---~~~~~~~~~a~~~~~~~~~w~~~~~----s~~~--~i~~~~~-~lr~RaRdl~rNn~~a~~a 70 (530) T protein:vir:38 1 MKIPSLVGPDGKTSL---REYAGYHGGGGGFGGQLRGWNPPSE----SADA--ALLPNYS-RGNARADDLVRNNGYAANA 70 (530) T ss_pred CccceeecCccccch---HHHhhhhcccCCCCCcccccccCCC----CHHH--HHHHHHH-HHHHHHHHHHhcChHHHHH Confidence 222211110000000 00000110 000000000 0000 0000000 0000000 00125899999 Q ss_pred HHHHHHhhhcCCeeeccC------------chhhHHHHHHHHhc---------------cHHHHHHHHHHHHHhCCeEEE Q lcl|NC_010179. 72 VDQEAGYIASVFPDIDVG------------KDADNKKILDVLGD---------------DRALTLNSLLVDSSNAGRAWL 124 (469) Q Consensus 72 v~~~~~~l~g~p~~~~~~------------~~~~~~~l~~~~~~---------------n~~~~~~~~~~~~~~~G~~~~ 124 (469) |+..+..+.|..++..+. ++..++.+.+.|.. 
++......+.+...+.|.+++ T Consensus 71 v~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l~~r~~~~dGE~~~ 150 (530) T protein:vir:38 71 VQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCV 150 (530) T ss_pred HHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHHHHHHHhhCCceEE Confidence 999999999988776542 23334555555531 112223345677889999999 Q ss_pred EEEEcCCC----ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccc Q lcl|NC_010179. 125 HYWIDEDN----NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEP 200 (469) Q Consensus 125 ~v~~d~~~----~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (469) ++.+.+.+ .+++..++|+.+---++......+..+|.+ +..|....+++--..+.+..+. T Consensus 151 ~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GIe~----d~~Gr~~aY~i~~~~~~~~~~~------------ 214 (530) T protein:vir:38 151 QATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGVKI----NDSGAALGYYVSDDGYPGWMAQ------------ 214 (530) T ss_pred EeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeeeEE----CCCCceEEEEEeeccCCCcccc------------ Confidence 88766543 367899999886432322223345555543 3334333222111000000000 Q ss_pred ccccccccccccccccccccccccCCcccEEEecCC-----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEe Q lcl|NC_010179. 201 YNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN-----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT 275 (469) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~ 275 (469) .+... ......+.--|+|+... 
.+|.|+|..++..+..++.............+.=..+++ T Consensus 215 -------------~~~~~-~~~~~v~a~~vlH~f~~~r~gQ~RGis~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~ 280 (530) T protein:vir:38 215 -------------NWTYI-PRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIE 280 (530) T ss_pred -------------cccee-eeeeccChhHeEeeccccCCCcccCCchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeee Confidence 00000 00011122235665432 369999999988888877765554444444333223333 Q ss_pred cCCcc-------------cchh---------------hhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHH Q lcl|NC_010179. 276 NYGGA-------------SLKQ---------------FMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKIT 327 (469) Q Consensus 276 g~~~~-------------~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l 327 (469) ...+. .... ....+....+..+.+ +.++++.+++.+...+..++..+ T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~p-----Ge~i~~~~p~~p~~~~~~f~~~~ 355 (530) T protein:vir:38 281 SELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLP-----GDSLNLQSAQDTDNGYSTFEQSL 355 (530) T ss_pred ccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCC-----CCeeeeeCCCCCCCCHHHHHHHH Confidence 21110 0000 000122233333333 34689998888888999999999 Q ss_pred HHHHHHHhCCCCcCc-cccCCccHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH--HHhcc----cCC---Cc--- Q lcl|NC_010179. 328 RDNIFLFGQGIDPAN-FESSNASGVAIKMLYSHLELKAAKTQTYFEHAI-NELVRAIM--RYLNF----SDA---DK--- 393 (469) Q Consensus 328 ~~~i~~~s~~p~~~~-~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l-~~~~~~i~--~~~~~----~~~---~~--- 393 (469) .+.|..-..+|--.. ..++++|-.+.+..+......+...+..|...+ ..+++..+ .++.. ... +. 
T Consensus 356 lr~iaaglGi~ye~lt~D~s~~nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~wl~~av~~G~i~~p~~~~~~~~~~ 435 (530) T protein:vir:38 356 LRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGRRKFVASRQACQMFLCWLEEAIVRRVVTLPSKARFSFQEA 435 (530) T ss_pred HHHHHhhcCCCHHHHhcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCccCCCCCCCCchhh Confidence 999999888875433 345667777777777777777777777665543 32333222 12221 110 11 Q ss_pred --ccceEEeCCC--CCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcc-----cC- Q lcl|NC_010179. 394 --RHISQHWTRT--KVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQAD-----EL- 461 (469) Q Consensus 394 --~~i~i~f~~~--~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~-----~~- 461 (469) ....+.|..+ .-.|....+++.... +|+.|.+.++...| .|+++.++++.+|++......-... .. T Consensus 436 ~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~ 513 (530) T protein:vir:38 436 RTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKRG--DDYQEIFAQQVRESMERRAAGLNPPAWAAAAFE 513 (530) T ss_pred HHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCCCcccccC Confidence 1235667544 334777777777654 79999999999987 4888889999988876654322111 11 Q ss_pred ---CCCCCCCC Q lcl|NC_010179. 462 ---NGKGVDDE 469 (469) Q Consensus 462 ---~~~~~~de 469 (469) .....+++ T Consensus 514 ~~~~~~~~~~~ 524 (530) T protein:vir:38 514 AGVKKSNEEEQ 524 (530) T ss_pred CCCCCCCCCCC Confidence 11111111 No 115 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=99.06 E-value=3.4e-09 Score=66.99 Aligned_cols=437 Identities=11% Similarity=0.066 Sum_probs=214.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+.+.+++-.+.+..++..-..+.+.+.+|..- .+...... .........+.+.++-.+-+...++..++.|+ T Consensus 1 ~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~lP-----~~~~~~~~--~~~~~~~~~~~~~~i~dst~~~a~~~Las~L~ 73 (547) T protein:vir:10 1 MENSKIVKRLDFLKTDRKNVEQIWDCIRKYIMP-----MRSDFFSD--LRSEGSINWNQNREVFDSTAGDGLETLSSSLH 73 (547) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcc-----cccccccC--CCCCcccccccccccccchHHHHHHHHHHHHH Confidence 999999988888877766656666666666431 11110000 00000111123456777888888888887665 Q ss_pred cC--Ce-----eeccCch------hhHHH-------HHHHHh-ccHHHHHHHHHHHHHhCCeEEEEEEEcCC--CceEEE Q lcl|NC_010179. 81 SV--FP-----DIDVGKD------ADNKK-------ILDVLG-DDRALTLNSLLVDSSNAGRAWLHYWIDED--NNFRYG 137 (469) Q Consensus 81 g~--p~-----~~~~~~~------~~~~~-------l~~~~~-~n~~~~~~~~~~~~~~~G~~~~~v~~d~~--~~~~i~ 137 (469) +- || ++...+. ...+. +...+. .||...+.++.++..++|.+.+++-.+++ +.++++ T Consensus 74 ~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~d~~~~~~~r~~ 153 (547) T protein:vir:10 74 GSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMVEEEDEDEEGSVVFQ 153 (547) T ss_pred HhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEeccCCCCCCceeEE Confidence 42 22 1222222 12222 223332 35666778889999999999776654443 567899 Q ss_pred EEccceeEEEEeCCCCCceEEEEEEEEeeec---------------------CCceEEEEEEEEcCCeEEEEEeecCce- Q lcl|NC_010179. 138 IIQPDQITPVYATTLDNKLLGVLRSYKQLDP---------------------EAGKYFTVHEYWTDKEAQFFRTSATDS- 195 (469) Q Consensus 138 ~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~---------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~- 195 (469) .++..++++.-|. .+++...+|.++..-. +.+.....+++++.- |....... 
T Consensus 154 ~~pl~~~~v~~d~--~G~v~~i~r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~~~v~~~v----~~~~~~~~~ 227 (547) T protein:vir:10 154 SSPIQDSYFEEDS--RGQVVNFYRVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALKQEVVMCV----FTRYDKKQN 227 (547) T ss_pred EeecceEEEeeCC--CcCeeeeeeeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccceEEEEEEE----eeccCCCCC Confidence 9999998887665 4556666665443111 111111122221100 00000000 Q ss_pred ----eeccccc--ccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 196 ----TVIEPYN--IITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDL 264 (469) Q Consensus 196 ----~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~ 264 (469) ..+.... ....|....+.... ..+.+|..+|++.++ ++.+|.|-.++..+-+..+|.+.-...... T Consensus 228 ~~~~~~~~~~~~p~~s~~~e~~~~~~~---l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~ 304 (547) T protein:vir:10 228 RNAGTVLAPTERPFGKKWILKEGAVQL---GEEGGYYEMPAYAIRWRKSAGSQWGFGPSHLALPDVLTANRYVELVLRSS 304 (547) T ss_pred ccccceeeccccceeEEEEEecCceee---eecCCcccCCeeeeeeeecCCcccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 0000000 00001111111101 112345667777765 345799999999999999999999999999 Q ss_pred HHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc Q lcl|NC_010179. 265 DDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE 344 (469) Q Consensus 265 ~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 344 (469) +...+|.+.+.-.+.. . ..+...++++.. 
+...+++-++...+.......++.++..|-..-....+...+ T Consensus 305 ~~~~~pp~~v~~~g~~---~-~~~~~pgg~~~~-----~~~~~v~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~ 375 (547) T protein:vir:10 305 EKVIDPAIMVTERGLI---S-DIDLGASGLTVV-----RDMESMKPFESRARFDVSSIQLTDLRSAVRRIYYVDQLQMKD 375 (547) T ss_pred HHHhcCceeccccccc---c-cceecCCeeeec-----CCcccceeeecccchHHHHHHHHHHHHHHHHHhhhhhhhcCC Confidence 9999998765311111 1 122334444432 123456667777777777888888887776533222222222 Q ss_pred cCCccHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHhcccCC-------CcccceEEeCCCCC Q lcl|NC_010179. 345 SSNASGVAIKMLYSHLELKAAKTQTYFEHAI------------NELVRAIMRYLNFSDA-------DKRHISQHWTRTKV 405 (469) Q Consensus 345 ~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l------------~~~~~~i~~~~~~~~~-------~~~~i~i~f~~~~p 405 (469) ....|+..+... ++++...++..+ .+.+.++.+.--.... ....+.|++..++- T Consensus 376 ~~~~TAtEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~~~~~~v~~is~La 448 (547) T protein:vir:10 376 SPAMTATEVQVR-------YELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGKLGELPSKLLESGKAAMDIVYTGPLS 448 (547) T ss_pred CccccHHHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCcceEEEEeccHHH Confidence 334455544332 233334444433 3334433332111111 22356677776665 Q ss_pred CCHH-HHH-------HHHHHHhcc-------CChHHHHHhC---CCCCC----HHHHHHHHHHHHHHhhhhHhh------ Q lcl|NC_010179. 406 EDSL-TKA-------QIVSTVANY-------SSKEAVAKAN---PIVDD----WQQELKDLAKDREENDPYANQ------ 457 (469) Q Consensus 406 ~d~~-e~~-------~~~~kl~g~-------iS~et~~~~l---~~v~d----~~~E~eri~~E~~~~~~~~~~------ 457 (469) +... +.+ +.+..++++ +....++..+ -+++- .++|++.+.+++++..+...+ T Consensus 449 raq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~qaa~~~~ 528 (547) T protein:vir:10 449 RAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPKAKVTSIRKNRSQTQQKAEQAAIAEA 528 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5321 112 222223332 3334444332 12321 246777776665444332221 Q ss_pred -cccCCCCCCCCC Q lcl|NC_010179. 
458 -ADELNGKGVDDE 469 (469) Q Consensus 458 -~~~~~~~~~~de 469 (469) .+.++.-|..+- T Consensus 529 ~g~~m~~~~~~~a 541 (547) T protein:vir:10 529 EGNAMEAQGKGQA 541 (547) T ss_pred HHHHHHhhcCccc Confidence 111111111111 No 116 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.00 E-value=1.7e-09 Score=68.66 Aligned_cols=405 Identities=8% Similarity=-0.021 Sum_probs=184.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLI-NNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~-~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |..+.+..++..+-..+.... ..-..+.++|... ....+.. .+.+ -.+.+++.||+..+.=+ T Consensus 98 ~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~------~f~gyql----------~alY-~~~~larkiVd~pAeDa 160 (862) T protein:vir:99 98 FAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQ------GFIGHQA----------CALI-AQHWLVDKACSLAGEDA 160 (862) T ss_pred hhhhcchhhhhhccccccccccccchhcccccccc------CcccHHH----------HHHH-HhCchhhhhhhhhhHHH Confidence 111111111111100000000 0000001111100 0000000 0001 13688899999999999 Q ss_pred hcCCeeeccCch------hhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC-CceEEEEEccce-------e Q lcl|NC_010179. 80 ASVFPDIDVGKD------ADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED-NNFRYGIIQPDQ-------I 144 (469) Q Consensus 80 ~g~p~~~~~~~~------~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~-~~~~i~~~~p~~-------~ 144 (469) +-+.+.+.+.++ +..+.+...+.+ +....+.++.+.+-.+|.+++++-++.+ +...-..+++.. . 
T Consensus 161 tR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~~LsqPLn~e~I~kG~lkg 240 (862) T protein:vir:99 161 IRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPDYYEKPFNPDGITPGSYRG 240 (862) T ss_pred hhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCchhhhcCcCcccccccceeE Confidence 999999987432 233445555544 3445677888888889988877655432 211111222211 1 Q ss_pred EEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccccc Q lcl|NC_010179. 145 TPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (469) Q Consensus 145 ~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (469) +.++|+.-... .....+.. +. ...-+...+.|.-... .+.. .....- . + T Consensus 241 l~vlDp~w~~p-~~v~~~~~--Dp-~sp~yGkP~~y~I~g~-----------~IH~------SRliif-~---------g 289 (862) T protein:vir:99 241 ISQIDPYWMMP-MLTAESTA--DP-SSQFFYEPEFWIISGQ-----------KYHR------SHLIIA-R---------G 289 (862) T ss_pred EEEechhhhcc-cccccccc--cc-cccccCCceeeeecCe-----------eecc------ceeEEe-c---------C Confidence 22222110000 00000000 00 0000000011100000 0000 000000 0 0 Q ss_pred CCcccEEEe-cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc-hhhhh------hhhh-ccee Q lcl|NC_010179. 225 FGRVPFIEF-PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL-KQFMN------DLRE-YKSI 295 (469) Q Consensus 225 ~g~vPvv~~-~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~-~~~~~------~~~~-~~~~ 295 (469) ..+|-..- .++-.|.|.++.+.+.+..++.+.......+..+....+.+.+...... ..... ..+. 
.+++ T Consensus 290 -~~vpd~lk~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN~Gi~ 368 (862) T protein:vir:99 290 -PQPADILKPTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIANEDKFIQRLMFWVRYRDNHAVK 368 (862) T ss_pred -CCchhhhhccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhccHHHHHHHHHHHHhccCcceeE Confidence 00111100 1123489999999999999999988888888888877777666432111 11111 1111 2233 Q ss_pred eecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C-CccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 296 KINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S-NASGVAIKMLYSHLELKAAKTQTYF 371 (469) Q Consensus 296 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-~~Sg~Al~~~~~~l~~k~~~~~~~~ 371 (469) .+..+ .+++ +.+.+.+.+...++.+..+|...+.+|-.-..+. | |+||..=..-|...+.-. .+..+ T Consensus 369 liD~e-----Ee~e--~ls~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~--QE~~L 439 (862) T protein:vir:99 369 VLGTD-----ETME--QFDTSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESI--QEHVY 439 (862) T ss_pred EecCC-----Ccee--EEecccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHH--HHHHH Confidence 33221 2233 3446777888899999999999999997644332 3 357775334444443322 24568 Q ss_pred HHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHH-------HH--hccCChHHHHHhC------C--CC Q lcl|NC_010179. 372 EHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVS-------TV--ANYSSKEAVAKAN------P--IV 434 (469) Q Consensus 372 ~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~-------kl--~g~iS~et~~~~l------~--~v 434 (469) +..|++++.++..-++. ..+++++|++-...|++|.|++.. 
++ +|++|.+++..+| + .+ T Consensus 440 ~P~LerL~~li~~~lg~----~~d~~ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l 515 (862) T protein:vir:99 440 MPFLQRHYLISRLSLGI----QHEIDVVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRL 515 (862) T ss_pred HHHHHHHHHHHHHhcCC----CCcceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCC Confidence 88888888776554432 346899999999999999987653 33 5789988887753 2 23 Q ss_pred CCHHHHHHH-HHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 435 DDWQQELKD-LAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 435 ~d~~~E~er-i~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +|.+.|-.. +..|..+.. .+..........+++ T Consensus 516 ~ded~E~d~~~~~e~~~~~--e~~g~a~~~ap~de~ 549 (862) T protein:vir:99 516 TKEDAEETPGASPENLAAY--QKAGAAQETASAKET 549 (862) T ss_pred CcccccccCCCCccccccc--ccCCccccccccccc Confidence 322211100 111111100 000000000000000 No 117 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.96 E-value=1e-08 Score=64.41 Aligned_cols=447 Identities=10% Similarity=0.034 Sum_probs=202.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |.....+++.+.+-.-...|.+....++++|. +-++.+.... .. 
......+.+.++..+-+...++..++.|+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~---~~--~~~~~~~~~~~~~dst~~~a~~~Las~l~ 73 (559) T protein:vir:95 1 MAETTKERLNKQFAQLESERQSFEPHWRELSD--YINPRGSRFL---TS--EVNRNDRRNTRIIDSTGTMAARTLASGMM 73 (559) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhccccCCcC---CC--CCCcccccccccccchHHHHHHHHHHHHH Confidence 88877777766554444444444444444443 1111111100 00 00111223445667777788888877665 Q ss_pred cC--Ce-----eeccCchh------hHHH-------HHHHHh-ccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEE Q lcl|NC_010179. 81 SV--FP-----DIDVGKDA------DNKK-------ILDVLG-DDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGII 139 (469) Q Consensus 81 g~--p~-----~~~~~~~~------~~~~-------l~~~~~-~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~ 139 (469) +- || ++...++. ..+. +.+.+. .||...+.++.++..++|.+.+++-.+..+.+++..+ T Consensus 74 ~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~d~~~~~r~~~~ 153 (559) T protein:vir:95 74 SGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLDDDEDIIRTMPF 153 (559) T ss_pred HhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeecCCCceeEEEEe Confidence 32 21 22222221 1111 222332 3566678888999999999987665555566789999 Q ss_pred ccceeEEEEeCCCCCceEEEEEEEEeeec-------------------CCceEEEEEEEEcCCeEEEEEeecCceeeccc Q lcl|NC_010179. 140 QPDQITPVYATTLDNKLLGVLRSYKQLDP-------------------EAGKYFTVHEYWTDKEAQFFRTSATDSTVIEP 200 (469) Q Consensus 140 ~p~~~~~~~d~~~~~~~~~~v~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (469) +..++++.-|. .+++...+|.++..-. +....-..+++++. .|.....+...... 
T Consensus 154 ~l~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~~ 227 (559) T protein:vir:95 154 PIGSYYLANSP--RGSVDTCFRKFSMTVRQLVQEFGLNNVSESVKSMWESGTYEKWIEVMHS----VYPNIDRDTSKLDS 227 (559) T ss_pred ecCeEEEeeCC--CCCeEEEEEeEecCHHHHHHHcCcccCCHHHHHHHhcCCCCCeEEEEEE----Eecccccccccccc Confidence 99998887765 4566666665543221 00010011121110 00000000000000 Q ss_pred ccc--c-ccccccccccccccccccccCCcccEEEec-----CCcccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010179. 201 YNI--I-TSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAE-LNKYKGLIDAYDDIYNGFINDLDDVQTVI 271 (469) Q Consensus 201 ~~~--~-~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~-~~~v~~liD~~~~~~s~~~~~~~~~~~p~ 271 (469) ... . ..|......... ..+.+|..+|++.++ ++.+|+|- ..+..+-+..+|.+.-......+...+|. T Consensus 228 ~~~pf~s~~~e~~~~~~~~---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l~~~~~~~~pp 304 (559) T protein:vir:95 228 KNKPFKSVYYEVGGDNDKL---LRESGFDEFPIMAPRWEVNGEDVYGSSCPGMLALGPVKALQLLQKRKSQLIDKATNPP 304 (559) T ss_pred ccceEEEEEEEecCCCcee---eecCCcccCCccceeeeecCCccccccchHHHhhHHHHHHHHHHHHHHHHHHHHhcCc Confidence 000 0 000110000000 112345556666554 34578884 88899999999999999999999999997 Q ss_pred eEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe-ecCCHHHHHHHHHHHHHHHHHHhCCCC---cCccccCC Q lcl|NC_010179. 272 LVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ-IDIPVEARDDALKITRDNIFLFGQGID---PANFESSN 347 (469) Q Consensus 272 l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~---~~~~~~g~ 347 (469) +.+.+.. .........++...+...+.. ..++.+. .+.+.......++.++..|-..-..-. 
+...+..+ T Consensus 305 ~~v~~~~----~~~~~~l~pgg~~~~~~~~~~--~~i~p~~~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~l~~r~~~r 378 (559) T protein:vir:95 305 MVAPTSL----KNQRASLLPGDITYIDQITGQ--DGFRPAYLVNPSTADLVADIQDTRQIINSAYFVDLFMMLQNINTRS 378 (559) T ss_pred eeccccc----cccceeeeccceeeeCCCCCc--ccceeecccccchHHHHHHHHHHHHHHHHHhhhhhHHHhhcCCCCC Confidence 7764321 111122334444444333221 1234332 233455556667776666644332211 12223344 Q ss_pred ccHHHHHHHHHHHHHHHH----H-HHHHHHHHHHHHHHHHHHHhccc----CCCcccceEEeCCCCCCCH-HHH------ Q lcl|NC_010179. 348 ASGVAIKMLYSHLELKAA----K-TQTYFEHAINELVRAIMRYLNFS----DADKRHISQHWTRTKVEDS-LTK------ 411 (469) Q Consensus 348 ~Sg~Al~~~~~~l~~k~~----~-~~~~~~~~l~~~~~~i~~~~~~~----~~~~~~i~i~f~~~~p~d~-~e~------ 411 (469) .|+..+.....-...... + ....+..-+.+.+.++.+.--+. ......++|++..++..-. ... T Consensus 379 vTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~v~~is~La~aqk~~~~~~i~~ 458 (559) T protein:vir:95 379 MPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLKVEYISVMAQAQKSIGLSSLAS 458 (559) T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceEEEeecHHHHHHHHHHHHHHHH Confidence 566554433221111111 1 11223333344444443321111 1123456777776665411 111 Q ss_pred -HHHHHHHhcc-------CChHHHHHhC---CCCC-C---HHHHHHHHHHHHHHhhhhHhh----------cccCCCCCC Q lcl|NC_010179. 412 -AQIVSTVANY-------SSKEAVAKAN---PIVD-D---WQQELKDLAKDREENDPYANQ----------ADELNGKGV 466 (469) Q Consensus 412 -~~~~~kl~g~-------iS~et~~~~l---~~v~-d---~~~E~eri~~E~~~~~~~~~~----------~~~~~~~~~ 466 (469) ++.+..++++ +....++..+ -+|+ + .++|++++++++++.....++ ...+..-+. T Consensus 459 ~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~q~~~~~~~aa~~~~~~~~~~~ 538 (559) T protein:vir:95 459 TVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQQMMAMGMAAAQGVKTLSEAKT 538 (559) T ss_pred HHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccC Confidence 2222233332 3334444432 1222 1 246777666554444332211 111111111 Q ss_pred CC-C Q lcl|NC_010179. 
467 DD-E 469 (469) Q Consensus 467 ~d-e 469 (469) .. + T Consensus 539 ~~~~ 542 (559) T protein:vir:95 539 SDPS 542 (559) T ss_pred CChh Confidence 11 1 No 118 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=98.94 E-value=1.1e-08 Score=64.16 Aligned_cols=431 Identities=10% Similarity=-0.002 Sum_probs=213.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-Ccccc---cccchh-hhcccccccccccCc-ceeccchHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENK-TDITT---RNNGKP-KVSKEGKKDPLRSAD-NRIPSNFYQLLVDQ 74 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~-~~i~~---~~~~~~-~~~~~~~~~~~~~~~-~ri~~n~~k~iv~~ 74 (469) |++ |.++|.-+.....-+........+-|+|- +.-.. +...-. .........-..++. .-...+|++-+|+. T Consensus 1 Mn~--iDr~i~~~sP~~a~~R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~RaRdL~rNn~~a~~av~~ 78 (548) T protein:vir:95 1 MNL--IDRLLEPLAPELVARRLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMREQCRKLDEDHDLVTGLLDR 78 (548) T ss_pred Cch--HHhHhhhcchHHHHHHHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH Confidence 665 34666655443322222222333446553 21000 000000 000000000000000 01235788889999 Q ss_pred HHHhhhcC-Ceeecc----Cchh----hHHHHHHHHhc-----------cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc- Q lcl|NC_010179. 75 EAGYIASV-FPDIDV----GKDA----DNKKILDVLGD-----------DRALTLNSLLVDSSNAGRAWLHYWIDEDNN- 133 (469) Q Consensus 75 ~~~~l~g~-p~~~~~----~~~~----~~~~l~~~~~~-----------n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~- 133 (469) .+..+.|. .+.+.+ .+.+ .++.+...|.. +|......+.+.....|.+++...++..+. 
T Consensus 79 ~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~~~dGE~f~~~~~~~~~~~ 158 (548) T protein:vir:95 79 LEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTWLRDGEGLAQKLMGRVPNY 158 (548) T ss_pred HHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHHHhCCceEEEeeecccccc Confidence 99998883 333322 2222 23333333321 123333446678899999999887765431 Q ss_pred -------eEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccc Q lcl|NC_010179. 134 -------FRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITS 206 (469) Q Consensus 134 -------~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (469) +++..++|+.+---++. ....+..+|.+ +..|....+++--..+...... .....+. T Consensus 159 ~~g~~~~~~lqliepd~l~~~~~~-~~~~i~~GIE~----D~~Grp~aY~i~~~hPgd~~~~-~~~~~~~---------- 222 (548) T protein:vir:95 159 TFATSVPFALELLEPDYLPFSYNN-LSKGIVQGIER----DTWRRKRAYHLLKDHPGNLQTL-GGSLAVK---------- 222 (548) T ss_pred cCCcccceEEEEechhhcCCCCCC-CCCceeeeeEE----CCCCceEEEEEeecCCCccccc-cccccee---------- Confidence 47889999886322222 22345555543 3344443333221111111100 0000000 Q ss_pred ccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCccc----- Q lcl|NC_010179. 207 YDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGAS----- 281 (469) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~----- 281 (469) -.....+-|.|... .....+|.|.|..++..+..++.....-.......+.-..+++...++. 
T Consensus 223 --------rvpA~~VlHif~~~----r~gQ~RGvs~lapvl~~l~~l~~y~dael~~aki~A~~a~fi~~~~~~~~~~~~ 290 (548) T protein:vir:95 223 --------RVEAERIIHIAYRK----RIGQNRGVPMLHAVLIRLADLKDYEESERVAARISAALAMYIKKGNPDSYTVEP 290 (548) T ss_pred --------eechhHheeccccc----CCccccCcchHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeecCCCccccCCC Confidence 00001112222110 0123469999999998888887766665555554443334444321111 Q ss_pred ---chhhhhhhhhcceee-ecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc-ccCCccHHHHHHH Q lcl|NC_010179. 282 ---LKQFMNDLREYKSIK-INNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF-ESSNASGVAIKML 356 (469) Q Consensus 282 ---~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~g~~Sg~Al~~~ 356 (469) .......+....++. +.+ +-++++++++-+...+..+...+.+.|..-..+|--... .++ .|-.+.+.. T Consensus 291 ~~~~~~~~~~~~pG~iv~~L~p-----Ge~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s-~nYSS~R~~ 364 (548) T protein:vir:95 291 GKDRKNRTIPIAPGMVFDDLEP-----GEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYD-GTYSAQRQE 364 (548) T ss_pred CcccccccccccCCccccccCC-----CceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccc-hhHHHHHHH Confidence 011111122222222 232 236889888777889999999999999998888743222 233 366677777 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-HHHHHHH--Hhc----ccCC--CcccceEEeCCCC--CCCHHHHHHHHHHH--hccCC Q lcl|NC_010179. 357 YSHLELKAAKTQTYFEHAINE-LVRAIMR--YLN----FSDA--DKRHISQHWTRTK--VEDSLTKAQIVSTV--ANYSS 423 (469) Q Consensus 357 ~~~l~~k~~~~~~~~~~~l~~-~~~~i~~--~~~----~~~~--~~~~i~i~f~~~~--p~d~~e~~~~~~kl--~g~iS 423 (469) +..........+..|...+-+ +++..+. ++. 
...+ ....+.+.|..+- -.|....+++...+ +|+.| T Consensus 365 l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T 444 (548) T protein:vir:95 365 LVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKAGFAD 444 (548) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHcCCCC Confidence 776666666666655544333 3333332 221 1111 1123578885543 25888888777665 78999 Q ss_pred hHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCC---------------------------CCCCCCC Q lcl|NC_010179. 424 KEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELN---------------------------GKGVDDE 469 (469) Q Consensus 424 ~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~---------------------------~~~~~de 469 (469) .+..+...| .|+++.++++.+|.+......-.++..+ +...++| T Consensus 445 ~~~~~a~~G--~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 515 (548) T protein:vir:95 445 EAEVARARG--RDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEAREL 515 (548) T ss_pred HHHHHHHhC--CCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHh Confidence 999999987 4888889999988876654332211100 0111111 No 119 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.91 E-value=1.6e-08 Score=63.29 Aligned_cols=426 Identities=10% Similarity=-0.014 Sum_probs=205.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccc---cchhh-hcccccccccccCc-ceeccchHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRN---NGKPK-VSKEGKKDPLRSAD-NRIPSNFYQLLVDQE 75 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~---~~~~~-~~~~~~~~~~~~~~-~ri~~n~~k~iv~~~ 75 (469) |++-.. -++.---.-..+ ....-|+|-..-.... ..-.. ........-..++. .-...+|++-+|+.. 
T Consensus 1 m~~~~~-~~~a~~~~~~~~------~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~RaRdl~rNn~~a~~av~~~ 73 (495) T protein:vir:10 1 MNMTPS-GYQSLASGLLVP------VGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRARSHHNVRNNPWATNAVATW 73 (495) T ss_pred CCcccc-cccccchhhhhH------HHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHHHHHHHhcChHHHHHHHHH Confidence 444332 121100000000 0011233311000000 00000 00000000000000 012358899999999 Q ss_pred HHhhhcCCeeecc--CchhhHHHHHHHHhc-----------cHHHHHHHHHHHHHhCCeEEEEEEEcCC--C---ceEEE Q lcl|NC_010179. 76 AGYIASVFPDIDV--GKDADNKKILDVLGD-----------DRALTLNSLLVDSSNAGRAWLHYWIDED--N---NFRYG 137 (469) Q Consensus 76 ~~~l~g~p~~~~~--~~~~~~~~l~~~~~~-----------n~~~~~~~~~~~~~~~G~~~~~v~~d~~--~---~~~i~ 137 (469) +.++.|..++..+ +++...+.+...|.. +|......+.+.....|.+|+.+.+.+. | .+++. T Consensus 74 ~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~~~~~~~~~~g~~~~~~lq 153 (495) T protein:vir:10 74 VAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFVIKKPRPLSEGLSVPLQLQ 153 (495) T ss_pred HHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEEEEeecccCCCCccceEEE Confidence 9999988776553 566666666665542 1223333466788999999987765433 3 36899 Q ss_pred EEccceeE-EEEeC--CCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccc Q lcl|NC_010179. 138 IIQPDQIT-PVYAT--TLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYE 214 (469) Q Consensus 138 ~~~p~~~~-~~~d~--~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (469) .++|+.+- |.-+. .....+..+|.+ +..|....+++--..+..... ...... .. 
T Consensus 154 liepd~l~~~~~~~~~~~g~~i~~GIe~----d~~Gr~vaY~i~~~hpgd~~~-~~~~~~------------------~~ 210 (495) T protein:vir:10 154 IIEPDMLASDIPDETLPSGGYVKGGIRF----SNGGKRKAYCFYRNHPAESSL-IGDPVD------------------TV 210 (495) T ss_pred EechhhcCCCCCCCCCCCCCEEEeceEE----CCCCceEEEEEeecCCCcccc-cccccc------------------ee Confidence 99999864 32211 112235566653 333443333321111111000 000000 00 Q ss_pred ccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCccc-c----------- Q lcl|NC_010179. 215 TGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGAS-L----------- 282 (469) Q Consensus 215 ~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~-~----------- 282 (469) ......+-|.| |. .+....|.|.+..++.|-|.-+...+.+....-...+. .+++...+.. . T Consensus 211 rvpA~~vlH~f---~~--r~gQ~RGis~la~i~~l~~l~~y~dael~~a~i~A~~~-~fi~~~~~~~~~~~~~~~~~~~~ 284 (495) T protein:vir:10 211 WIKAEHVLHVT---VL--TVRSDAGAPWFQLLLRLNELDQYEDAELVRKKTAALFA-AFIQEATADSTGGPTIGQPKRSK 284 (495) T ss_pred eechhheEecc---cc--CCCcccCcchhHHHHHHHHhhHHHHHHHHHHHHhhhhe-eeeecCCCccccccccCcccccc Confidence 00001122332 10 12344688988877765433333333333333333333 3343211111 0 Q ss_pred -hhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCc-cccCCccHHHHHHHHHHH Q lcl|NC_010179. 283 -KQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPAN-FESSNASGVAIKMLYSHL 360 (469) Q Consensus 283 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~g~~Sg~Al~~~~~~l 360 (469) ......+....+..+.++ -++++++++.+...+..++..+.+.|..-.++|--.. ..++++|-.+++..+... 
T Consensus 285 ~~~~~~~l~pG~i~~L~pG-----e~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYSS~R~~~~e~ 359 (495) T protein:vir:10 285 GGKRITGLNPGTLQYLQPG-----QEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYSSIRAGLLEF 359 (495) T ss_pred CcccceecCCceeeecCCC-----CeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHHHHHH Confidence 001112333333333333 4689988887888999999999999998887774333 235666667777777777 Q ss_pred HHHHHHHHH-HHHHHH-HHHHHHHHH--Hhc----ccCC-C--cccceEEeCCCC--CCCHHHHHHHHHHH--hccCChH Q lcl|NC_010179. 361 ELKAAKTQT-YFEHAI-NELVRAIMR--YLN----FSDA-D--KRHISQHWTRTK--VEDSLTKAQIVSTV--ANYSSKE 425 (469) Q Consensus 361 ~~k~~~~~~-~~~~~l-~~~~~~i~~--~~~----~~~~-~--~~~i~i~f~~~~--p~d~~e~~~~~~kl--~g~iS~e 425 (469) ...+...+. .+...+ +.+++..+. ++. ..++ + .....+.|..+- -.|....+++...+ +|+.|.+ T Consensus 360 ~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~ 439 (495) T protein:vir:10 360 RRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVRAGFAPIS 439 (495) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHHcCCCCHH Confidence 666665443 344333 333333222 221 1111 1 112456775543 34778888777665 7999999 Q ss_pred HHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc--------------ccCCCCCCCCC Q lcl|NC_010179. 
426 AVAKANPIVDDWQQELKDLAKDREENDPYANQA--------------DELNGKGVDDE 469 (469) Q Consensus 426 t~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~--------------~~~~~~~~~de 469 (469) ..+...| .|+++.++++.+|++......-.+ ........++| T Consensus 440 ~~~a~~G--~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 440 DKQAERG--YDMEELFDMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHcC--CCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCCCCCCCCC Confidence 9999987 488888888988887754432211 11111222222 No 120 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.90 E-value=1.8e-08 Score=63.08 Aligned_cols=454 Identities=10% Similarity=0.086 Sum_probs=184.1 Q ss_pred CCHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSR----NDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~----~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) |.-+.+-..|.+.+... +.--++.+.+.+||....+.. ...................+|+..+.+...++..+ T Consensus 20 ~~~~~~~~~l~~~~~~~~~~R~~~e~~W~e~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~r~ki~~~~~~~~~~~l~ 96 (641) T protein:vir:94 20 LSTDRIGGVVISKWQESRDKRNTVENNWDETYELYRASAIDR---QNTRARNFQTTGADDADWRHRINTGHTFEVVETLV 96 (641) T ss_pred CCchhHHHHHHHHHHHHHHhhcchHHHHHHHHHHhhcchhhh---hhcccccccccccchhcccccccchhHHHHHHHHh Confidence 66665544443333222 222234456666665432111 00000000000000111134788888888888877 Q ss_pred Hhhhc----CCe--eec---cCchhhHHHHHHHHh----c-cHHHHHHHHHHHHHhCCeEEEEEEEcC------------ Q lcl|NC_010179. 77 GYIAS----VFP--DID---VGKDADNKKILDVLG----D-DRALTLNSLLVDSSNAGRAWLHYWIDE------------ 130 (469) Q Consensus 77 ~~l~g----~p~--~~~---~~~~~~~~~l~~~~~----~-n~~~~~~~~~~~~~~~G~~~~~v~~d~------------ 130 (469) +-|.+ .+. .+. .++.+..+.++.+|. + ++.+...+..++++.+|.+++.++++. 
T Consensus 97 s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~~~w~~~~~~~~~~~~~~ 176 (641) T protein:vir:94 97 AYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYRLGWDTSMERQFKRTFVE 176 (641) T ss_pred hHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEEeehhhHHHHhhhhhccc Confidence 65543 322 222 233333333444443 2 345666678889999999999887541 Q ss_pred ----------------CCceEEEEEccceeEEEEeCCCCCceEEEEEEE--Eee----ecCCc---eEE---EEEEE-Ec Q lcl|NC_010179. 131 ----------------DNNFRYGIIQPDQITPVYATTLDNKLLGVLRSY--KQL----DPEAG---KYF---TVHEY-WT 181 (469) Q Consensus 131 ----------------~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~--~~~----~~~~~---~~~---~~~~~-~~ 181 (469) ...+++..++|..+++ |++....-..++++. ... ..+|. ... ...++ +. T Consensus 177 ~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~--dps~~~~~~~f~~~r~t~~t~~~l~~eg~~~~d~v~~~~~~~~~~~ 254 (641) T protein:vir:94 177 TGDIFGGWEDVAVNRQRSELRIEPLSPYDVWL--DTSGGKNTGTFVRLRHTREELHELVTSGYYDLDLTQVEQYVDYKFA 254 (641) T ss_pred chhhcccccccceecccceeeEEecchhheee--cCCCCcccccceehhhhHHHHHHHHhcCCCChhhcchhhccccccc Confidence 1224556666666553 443322111112211 100 00000 000 00000 00 Q ss_pred -CCeEEEEEe-ecCceeecccc-----ccccccccccccc-ccccccccc-cCCcccEEEecC-----CccccccHHHHH Q lcl|NC_010179. 182 -DKEAQFFRT-SATDSTVIEPY-----NIITSYDLSAGYE-TGQSNTLKH-NFGRVPFIEFPK-----NKYRLAELNKYK 247 (469) Q Consensus 182 -~~~~~~~~~-~~~~~~~~~~~-----~~~~~~~~~~~~~-~~~~~~~~~-~~g~vPvv~~~n-----~~~g~~~~~~v~ 247 (469) .+....... ....+..++.+ .....+....... ........+ .|...|++.++- .-+|.|....+. 
T Consensus 255 ~~d~~~d~~~~~~~~~~~~e~~gd~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~r~~~~~~~~YG~gp~~~~l 334 (641) T protein:vir:94 255 DPDTPKDVNGTDTSGWDIIEYYGPLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTTTLLPDRDSVYGMSVLHPNL 334 (641) T ss_pred ccccccccccccccccceeeeeeeeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEecceecCCcccCCChHHHHH Confidence 000000000 00000000000 0000000000000 001111112 345668876653 346999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeec-CCHHHHHHHHHH Q lcl|NC_010179. 248 GLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQID-IPVEARDDALKI 326 (469) Q Consensus 248 ~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~ 326 (469) +.+..+|.+.-...+.+..+.+|.+.+......+... ....+++++.+.. .+.++++... .+.......++. T Consensus 335 ~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~~~--l~~~PG~ii~~~~-----~~~v~pl~~~~~~~~~~~~~~~~ 407 (641) T protein:vir:94 335 GALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKRED--VKAKPGAVFKVAQ-----HGSLQPIDMGRQDFVVTYQEAQV 407 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCeeeeccccccccce--eeccCCcceeeCC-----CCcceeecCCccccchhHHHHHH Confidence 9999999999999999999999987654321111111 1222344444322 2346666433 233333445555 Q ss_pred HHHHHHHHhCCCCcCcc---ccC-CccHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhccc------------ Q lcl|NC_010179. 327 TRDNIFLFGQGIDPANF---ESS-NASGVAIKMLYSHLELKAAKTQTYFE-HAINELVRAIMRYLNFS------------ 389 (469) Q Consensus 327 l~~~i~~~s~~p~~~~~---~~g-~~Sg~Al~~~~~~l~~k~~~~~~~~~-~~l~~~~~~i~~~~~~~------------ 389 (469) +...|-....+..+... ..| +.++..+..+......+.....+.|. +++..+++-++.++... 
T Consensus 408 ~~~~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~ 487 (641) T protein:vir:94 408 QESSVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHTHIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVP 487 (641) T ss_pred HHHHHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhhhhhhch Confidence 55555443333222111 112 23555555555555555565556665 35555555544433211 Q ss_pred -----C---CCcccceEEeCCCCCCCHH---HHHHHHHHHhcc------CC-----------hHHHHHhCCCCCCH---- Q lcl|NC_010179. 390 -----D---ADKRHISQHWTRTKVEDSL---TKAQIVSTVANY------SS-----------KEAVAKANPIVDDW---- 437 (469) Q Consensus 390 -----~---~~~~~i~i~f~~~~p~d~~---e~~~~~~kl~g~------iS-----------~et~~~~l~~v~d~---- 437 (469) + ....++...|.- +|...+ +.++.+..+.++ .| .+.+++.++. .++ T Consensus 488 ~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~a~~P~v~d~~d~~~~~~~~~~~~g~-~~p~~~i 565 (641) T protein:vir:94 488 EEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDISGRVPQIGQSLDYALILEDLLRQMRF-TDPMRYI 565 (641) T ss_pred hhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHhhcChhhhhcCCHHHHHHHHHHHhCC-CCchhhc Confidence 1 122233333322 233322 233333332221 12 0222222221 111 Q ss_pred ---HHHHHHHHHHHHHh-hhhHhhcccCCCCCCCCC Q lcl|NC_010179. 438 ---QQELKDLAKDREEN-DPYANQADELNGKGVDDE 469 (469) Q Consensus 438 ---~~E~eri~~E~~~~-~~~~~~~~~~~~~~~~de 469 (469) +.+-+....++++. ....++.+.. 
+++..++ T Consensus 566 r~~~~~~~~~~~~~~~~q~~~~~~a~~~-~~~~~~~ 600 (641) T protein:vir:94 566 KKAEAPPAAPPIAPAEPGALPPEMMNSV-GGGLNDQ 600 (641) T ss_pred cCccCchhHHHHHHHHHHHHHHHHHHHH-HhhhHHH Confidence 11111111111111 1111222221 2233444 No 121 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.70 E-value=9.7e-08 Score=59.03 Aligned_cols=443 Identities=9% Similarity=0.003 Sum_probs=202.3 Q ss_pred CCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDAL-KKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |.-... +++.+.+-.-..+|.+....++++|. +-++.+...... . .......+.++..+-+...++..++.| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~-~----~~~~~~~~~~~~dst~~~a~~~LAa~L 73 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQ-D----RNRGEKRHNNILDNTGTRALRVLAAGM 73 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCC-C----CCcchhcccccccccHHHHHHHHHHHH Confidence 655533 33444444333444444444444433 111211110000 0 011122344566777777888887766 Q ss_pred hcC--Ce-----eeccCchh------hHH-------HHHHHHh-ccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEE Q lcl|NC_010179. 80 ASV--FP-----DIDVGKDA------DNK-------KILDVLG-DDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGI 138 (469) Q Consensus 80 ~g~--p~-----~~~~~~~~------~~~-------~l~~~~~-~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~ 138 (469) ++- || ++...+.+ ..+ .+...+. .||...+.++.++..++|.+.+++-.+..+.+++.. 
T Consensus 74 ~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~ 153 (555) T protein:vir:10 74 MAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHS 153 (555) T ss_pred HHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEE Confidence 532 22 12222211 111 1222332 356667788889999999998877666667788888 Q ss_pred EccceeEEEEeCCCCCceEEEEEEEEeeec-------------------CCceEEEEEEEEcCCeEEEEEeecCceeecc Q lcl|NC_010179. 139 IQPDQITPVYATTLDNKLLGVLRSYKQLDP-------------------EAGKYFTVHEYWTDKEAQFFRTSATDSTVIE 199 (469) Q Consensus 139 ~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (469) ++..++++.-|. .+++...+|.++..-. +....-..+++++. .|........... T Consensus 154 ~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~ 227 (555) T protein:vir:10 154 LTAGEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKRD 227 (555) T ss_pred eecceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCCC Confidence 999998886665 4566666665443211 00011111121110 0000000000000 Q ss_pred cc-cccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_010179. 200 PY-NIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (469) Q Consensus 200 ~~-~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~ 273 (469) .. ..+.++-...+.+.... ..+-+|..+|++.++ .+.+|+|-.++..+-+..+|...-..........+|.+. 
T Consensus 228 ~~~~p~~s~~~~~~~d~~~v-l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 228 DRNMAWKSVYFEPGADETRT-LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ccccceEEEEEEeccCCccc-cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 00 00000000000011000 112345567777664 345799999999999999999877788888888888766 Q ss_pred EecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCC---CcCccccCCccH Q lcl|NC_010179. 274 LTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGI---DPANFESSNASG 350 (469) Q Consensus 274 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p---~~~~~~~g~~Sg 350 (469) +..... .......+++...+..+..++.. .-.+....+.....+.++.++..|-..-... .+...+....|+ T Consensus 307 v~~~~~----~~~~~~~pgg~~~v~~g~~~d~~-~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TA 381 (555) T protein:vir:10 307 LPVSAK----NQDISTVPGGLSYVDAAAPNGGI-RTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTA 381 (555) T ss_pred eccccc----cccceeccccccccccCCCCcce-ecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccH Confidence 532111 11123334444333333222211 1222334466777788888888775433221 122233344566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcccC--------CCcccceEEeCCCCCCCH-HHH-- Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNFSD--------ADKRHISQHWTRTKVEDS-LTK-- 411 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~~~--------~~~~~i~i~f~~~~p~d~-~e~-- 411 (469) ..+... +.++...++..+.++ ++-++.++...+ .....++|.|..++-... .+. 
T Consensus 382 tEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~ 454 (555) T protein:vir:10 382 TEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATN 454 (555) T ss_pred HHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHH Confidence 555332 233444444443332 222223332221 123456777777665421 111 Q ss_pred -----HHHHHHHhcc-------CChHHHHHhC---CCCC-C---HHHHHHHHHHHHHHhhhhH----------hhcccCC Q lcl|NC_010179. 412 -----AQIVSTVANY-------SSKEAVAKAN---PIVD-D---WQQELKDLAKDREENDPYA----------NQADELN 462 (469) Q Consensus 412 -----~~~~~kl~g~-------iS~et~~~~l---~~v~-d---~~~E~eri~~E~~~~~~~~----------~~~~~~~ 462 (469) ++.+..++++ +....++..+ -+++ . .++|++++++++++..... +..+..+ T Consensus 455 ~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~ 534 (555) T protein:vir:10 455 SVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLG 534 (555) T ss_pred HHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 2222223332 2233333332 1222 1 2466666665544332211 1123444 Q ss_pred CCCCCCC Q lcl|NC_010179. 463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) +...+.+ T Consensus 535 ~~~~~~~ 541 (555) T protein:vir:10 535 SVDTSKQ 541 (555) T ss_pred ccccCcc Confidence 4444444 No 122 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.70 E-value=9.7e-08 Score=59.03 Aligned_cols=443 Identities=9% Similarity=0.003 Sum_probs=202.3 Q ss_pred CCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDAL-KKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |.-... +++.+.+-.-..+|.+....++++|. +-++.+...... . 
.......+.++..+-+...++..++.| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~-~----~~~~~~~~~~~~dst~~~a~~~LAa~L 73 (555) T protein:vir:10 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQ-D----RNRGEKRHNNILDNTGTRALRVLAAGM 73 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCC-C----CCcchhcccccccccHHHHHHHHHHHH Confidence 655533 33444444333444444444444433 111211110000 0 011122344566777777888887766 Q ss_pred hcC--Ce-----eeccCchh------hHH-------HHHHHHh-ccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEE Q lcl|NC_010179. 80 ASV--FP-----DIDVGKDA------DNK-------KILDVLG-DDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGI 138 (469) Q Consensus 80 ~g~--p~-----~~~~~~~~------~~~-------~l~~~~~-~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~ 138 (469) ++- || ++...+.+ ..+ .+...+. .||...+.++.++..++|.+.+++-.+..+.+++.. T Consensus 74 ~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~ 153 (555) T protein:vir:10 74 MAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHS 153 (555) T ss_pred HHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEE Confidence 532 22 12222211 111 1222332 356667788889999999998877666667788888 Q ss_pred EccceeEEEEeCCCCCceEEEEEEEEeeec-------------------CCceEEEEEEEEcCCeEEEEEeecCceeecc Q lcl|NC_010179. 139 IQPDQITPVYATTLDNKLLGVLRSYKQLDP-------------------EAGKYFTVHEYWTDKEAQFFRTSATDSTVIE 199 (469) Q Consensus 139 ~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (469) ++..++++.-|. .+++...+|.++..-. +....-..+++++. .|........... 
T Consensus 154 ~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~ 227 (555) T protein:vir:10 154 LTAGEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKRD 227 (555) T ss_pred eecceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCCC Confidence 999998886665 4566666665443211 00011111121110 0000000000000 Q ss_pred cc-cccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_010179. 200 PY-NIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (469) Q Consensus 200 ~~-~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~ 273 (469) .. ..+.++-...+.+.... ..+-+|..+|++.++ .+.+|+|-.++..+-+..+|...-..........+|.+. T Consensus 228 ~~~~p~~s~~~~~~~d~~~v-l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:10 228 DRNMAWKSVYFEPGADETRT-LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ccccceEEEEEEeccCCccc-cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 00 00000000000011000 112345567777664 345799999999999999999877788888888888766 Q ss_pred EecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCC---CcCccccCCccH Q lcl|NC_010179. 274 LTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGI---DPANFESSNASG 350 (469) Q Consensus 274 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p---~~~~~~~g~~Sg 350 (469) +..... .......+++...+..+..++.. .-.+....+.....+.++.++..|-..-... 
.+...+....|+ T Consensus 307 v~~~~~----~~~~~~~pgg~~~v~~g~~~d~~-~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TA 381 (555) T protein:vir:10 307 LPVSAK----NQDISTVPGGLSYVDAAAPNGGI-RTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTA 381 (555) T ss_pred eccccc----cccceeccccccccccCCCCcce-ecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccH Confidence 532111 11123334444333333222211 1222334466777788888888775433221 122233344566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcccC--------CCcccceEEeCCCCCCCH-HHH-- Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNFSD--------ADKRHISQHWTRTKVEDS-LTK-- 411 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~~~--------~~~~~i~i~f~~~~p~d~-~e~-- 411 (469) ..+... +.++...++..+.++ ++-++.++...+ .....++|.|..++-... .+. T Consensus 382 tEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~ 454 (555) T protein:vir:10 382 TEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATN 454 (555) T ss_pred HHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHH Confidence 555332 233444444443332 222223332221 123456777777665421 111 Q ss_pred -----HHHHHHHhcc-------CChHHHHHhC---CCCC-C---HHHHHHHHHHHHHHhhhhH----------hhcccCC Q lcl|NC_010179. 412 -----AQIVSTVANY-------SSKEAVAKAN---PIVD-D---WQQELKDLAKDREENDPYA----------NQADELN 462 (469) Q Consensus 412 -----~~~~~kl~g~-------iS~et~~~~l---~~v~-d---~~~E~eri~~E~~~~~~~~----------~~~~~~~ 462 (469) ++.+..++++ +....++..+ -+++ . .++|++++++++++..... +..+..+ T Consensus 455 ~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~ 534 (555) T protein:vir:10 455 SVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLG 534 (555) T ss_pred HHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 2222223332 2233333332 1222 1 2466666665544332211 1123444 Q ss_pred CCCCCCC Q lcl|NC_010179. 
463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) +...+.+ T Consensus 535 ~~~~~~~ 541 (555) T protein:vir:10 535 SVDTSKQ 541 (555) T ss_pred ccccCcc Confidence 4444444 No 123 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.70 E-value=9.7e-08 Score=59.03 Aligned_cols=443 Identities=9% Similarity=0.003 Sum_probs=202.3 Q ss_pred CCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDAL-KKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |.-... +++.+.+-.-..+|.+....++++|. +-++.+...... . .......+.++..+-+...++..++.| T Consensus 1 M~~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~~~-~----~~~~~~~~~~~~dst~~~a~~~LAa~L 73 (555) T protein:vir:98 1 MAEQTERKLLLSRWGQLRTERESWMSHWKEISD--YLLPRAGRFFVQ-D----RNRGEKRHNNILDNTGTRALRVLAAGM 73 (555) T ss_pred CCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhCcccccccCC-C----CCcchhcccccccccHHHHHHHHHHHH Confidence 655533 33444444333444444444444433 111211110000 0 011122344566777777888887766 Q ss_pred hcC--Ce-----eeccCchh------hHH-------HHHHHHh-ccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEE Q lcl|NC_010179. 80 ASV--FP-----DIDVGKDA------DNK-------KILDVLG-DDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGI 138 (469) Q Consensus 80 ~g~--p~-----~~~~~~~~------~~~-------~l~~~~~-~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~ 138 (469) ++- || ++...+.+ ..+ .+...+. .||...+.++.++..++|.+.+++-.+..+.+++.. 
T Consensus 74 ~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~d~~~~~rf~~ 153 (555) T protein:vir:98 74 MAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLPDFDAVVYHHS 153 (555) T ss_pred HHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEecCCCceEEEEE Confidence 532 22 12222211 111 1222332 356667788889999999998877666667788888 Q ss_pred EccceeEEEEeCCCCCceEEEEEEEEeeec-------------------CCceEEEEEEEEcCCeEEEEEeecCceeecc Q lcl|NC_010179. 139 IQPDQITPVYATTLDNKLLGVLRSYKQLDP-------------------EAGKYFTVHEYWTDKEAQFFRTSATDSTVIE 199 (469) Q Consensus 139 ~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (469) ++..++++.-|. .+++...+|.++..-. +....-..+++++. .|........... T Consensus 154 ~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~v~v~~~----V~pr~~~~~~~~~ 227 (555) T protein:vir:98 154 LTAGEYAIAADN--QGRVNTLYREFQITVAQMVREFGKDKCSTTVQSLFDRGALEQWVTVIHA----IEPRADRDPSKRD 227 (555) T ss_pred eecceeEEeeCC--CCCEEEEEEEEeccHHHHHHhcCcccCCHHHHHHHhcCCCCceEEEEEE----EeeccCcCcCCCC Confidence 999998886665 4566666665443211 00011111121110 0000000000000 Q ss_pred cc-cccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_010179. 200 PY-NIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (469) Q Consensus 200 ~~-~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~ 273 (469) .. ..+.++-...+.+.... ..+-+|..+|++.++ .+.+|+|-.++..+-+..+|...-..........+|.+. 
T Consensus 228 ~~~~p~~s~~~~~~~d~~~v-l~esgy~e~P~i~~Rw~~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~~~~~~~pp~~ 306 (555) T protein:vir:98 228 DRNMAWKSVYFEPGADETRT-LRESGYRSFRALCPRWALVGGDIYGNSPAMEALGDVRQLQHEQLRKAQAIDYKSNPPLQ 306 (555) T ss_pred ccccceEEEEEEeccCCccc-cccCCcccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Confidence 00 00000000000011000 112345567777664 345799999999999999999877788888888888766 Q ss_pred EecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCC---CcCccccCCccH Q lcl|NC_010179. 274 LTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGI---DPANFESSNASG 350 (469) Q Consensus 274 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p---~~~~~~~g~~Sg 350 (469) +..... .......+++...+..+..++.. .-.+....+.....+.++.++..|-..-... .+...+....|+ T Consensus 307 v~~~~~----~~~~~~~pgg~~~v~~g~~~d~~-~~~~~~~~d~~~~~~~i~~~~~rI~~af~~dlf~~l~~~~~~~~TA 381 (555) T protein:vir:98 307 LPVSAK----NQDISTVPGGLSYVDAAAPNGGI-RTAFEVNLDLSHLLADIVDVRERIKASFYADLFLMLANGTNPQMTA 381 (555) T ss_pred eccccc----cccceeccccccccccCCCCcce-ecccccccchHHHHHHHHHHHHHHHHHhhcchhhhccCCCCCcccH Confidence 532111 11123334444333333222211 1222334466777788888888775433221 122233344566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcccC--------CCcccceEEeCCCCCCCH-HHH-- Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNFSD--------ADKRHISQHWTRTKVEDS-LTK-- 411 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~~~--------~~~~~i~i~f~~~~p~d~-~e~-- 411 (469) ..+... +.++...++..+.++ ++-++.++...+ .....++|.|..++-... .+. 
T Consensus 382 tEV~~r-------~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aq~~~~~~ 454 (555) T protein:vir:98 382 TEVAER-------HEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFVSMLAQAQRAIATN 454 (555) T ss_pred HHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeccHHHHHHHHHHHH Confidence 555332 233444444443332 222223332221 123456777777665421 111 Q ss_pred -----HHHHHHHhcc-------CChHHHHHhC---CCCC-C---HHHHHHHHHHHHHHhhhhH----------hhcccCC Q lcl|NC_010179. 412 -----AQIVSTVANY-------SSKEAVAKAN---PIVD-D---WQQELKDLAKDREENDPYA----------NQADELN 462 (469) Q Consensus 412 -----~~~~~kl~g~-------iS~et~~~~l---~~v~-d---~~~E~eri~~E~~~~~~~~----------~~~~~~~ 462 (469) ++.+..++++ +....++..+ -+++ . .++|++++++++++..... +..+..+ T Consensus 455 ~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~~~~q~~~~~~~~~ 534 (555) T protein:vir:98 455 SVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAALLNQGADTAAKLG 534 (555) T ss_pred HHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 2222223332 2233333332 1222 1 2466666665544332211 1123444 Q ss_pred CCCCCCC Q lcl|NC_010179. 463 GKGVDDE 469 (469) Q Consensus 463 ~~~~~de 469 (469) +...+.+ T Consensus 535 ~~~~~~~ 541 (555) T protein:vir:98 535 SVDTSKQ 541 (555) T ss_pred ccccCcc Confidence 4444444 No 124 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=98.64 E-value=1.6e-07 Score=57.92 Aligned_cols=447 Identities=11% Similarity=0.048 Sum_probs=203.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |.-.+.+++.+.+-.-...|.+....++++|. 
+-++.+.... .. ......+.+.++..+-+...++..++.|+ T Consensus 1 m~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~--~~lP~~~~~~---~~--~~~~~~~~~~~~~dst~~~a~~~Las~l~ 73 (556) T protein:vir:73 1 MAETEKERLLKQLAQLKNERTSFESHWLDLSD--FINPRGSRFL---TS--DVNRDDRRNTKIVDPTGSMAQRILSSGMM 73 (556) T ss_pred CChhhHHHHHHHHHHHHHHhhHHHHHHHHHHH--HhccccCCcC---CC--CCCcchhhcCccccchHHHHHHHHHHHHH Confidence 87777777766655444555444444444443 1111111100 00 00111122345667777777887777664 Q ss_pred cC--Ce-----eeccCchh------hH-------HHHHHHHh-ccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEE Q lcl|NC_010179. 81 SV--FP-----DIDVGKDA------DN-------KKILDVLG-DDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGII 139 (469) Q Consensus 81 g~--p~-----~~~~~~~~------~~-------~~l~~~~~-~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~ 139 (469) +- || ++...+.. .. +.+.+.+. .||...+.++.++..++|.+.+++-.+..+.+++..+ T Consensus 74 ~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~~r~~~~ 153 (556) T protein:vir:73 74 SGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVMEDDQDVIRTMPF 153 (556) T ss_pred HhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeeecCCceEEEEEe Confidence 31 22 22222211 11 12223332 3566677888999999999988776666677888899 Q ss_pred ccceeEEEEeCCCCCceEEEEEEEEeeec-------------------CCceEEEEEEEEcCCeEEEEEeecCceeeccc Q lcl|NC_010179. 140 QPDQITPVYATTLDNKLLGVLRSYKQLDP-------------------EAGKYFTVHEYWTDKEAQFFRTSATDSTVIEP 200 (469) Q Consensus 140 ~p~~~~~~~d~~~~~~~~~~v~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (469) +..++++.-|. .+++...+|.++..-. +.......++++.. .|.....+...... 
T Consensus 154 ~l~~~~~~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~~v~~~----V~pr~~~~~~~~~~ 227 (556) T protein:vir:73 154 PIGSYYLANSP--RGSVDTCIRQFSMTVRQMVQEFGLDNVSTSVKGMWENGTYETWVEVNHC----ITPNVNRDSGKMDS 227 (556) T ss_pred ecceeEEeeCC--CCCeEEEEEEEeccHHHHHHHcCcccCCHHHHHHHhcCCccceEEEEEE----EeccccccccccCc Confidence 99998876665 4566666766544310 00111111121100 00000000000000 Q ss_pred cc-cc--ccccccccccccccccccccCCcccEEEec-----CCcccccc-HHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010179. 201 YN-II--TSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAE-LNKYKGLIDAYDDIYNGFINDLDDVQTVI 271 (469) Q Consensus 201 ~~-~~--~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~-~~~v~~liD~~~~~~s~~~~~~~~~~~p~ 271 (469) .. .+ ..|......... ..+-+|..+|++.++ ++.+|+|- ..+..+-+..+|.+.-......+...+|. T Consensus 228 ~~~p~~s~~~~~~~~~~~v---l~esg~~e~P~~~~Rw~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~~~~~~~~pp 304 (556) T protein:vir:73 228 KNKPYRSVYFESGGDSDKL---LRESGFDEFPILAPRWEVNGEDVYASSCPGMLALGQVKALQVEQKRKAQLIDKATNPP 304 (556) T ss_pred ccceEEEEEEEecCCCcee---cccCCcccCCceeeeeeecCCcccccCccHHHhHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 00 00 001100000000 112345566776654 34578985 88999999999999999999999999997 Q ss_pred eEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe-ecCCHHHHHHHHHHHHHHHHHHhCCCC---cCccccCC Q lcl|NC_010179. 272 LVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ-IDIPVEARDDALKITRDNIFLFGQGID---PANFESSN 347 (469) Q Consensus 272 l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~---~~~~~~g~ 347 (469) +.+.... .........+++......++ ...++.+. 
.+.+.....+.++.++..|-..-...- +...+..+ T Consensus 305 ~~v~~~~----~~~~~~~~pgg~~~~~~~~~--~~~i~p~~~~~~d~~~~~~~i~~~~~rI~~af~~d~~~~l~~~~~~r 378 (556) T protein:vir:73 305 MVAPTSL----KNQRVSLLPGDVTYLDVISG--QDGFKPAYLVNPNTADLLADIQDTRQTINSAYFVDLFMMLQNINTRS 378 (556) T ss_pred eeccccc----cccceeeccCccccccCCCC--ccceeeeccccccHHHHHHHHHHHHHHHHHHhhcchhhhhccCCCCC Confidence 7654321 11112233344333322222 22355442 223556666677777776644322211 12223344 Q ss_pred ccHHHHHHHHHHHHHHHH----H-HHHHHHHHHHHHHHHHHHHhcccC----CCcccceEEeCCCCCCCH-HHHH----- Q lcl|NC_010179. 348 ASGVAIKMLYSHLELKAA----K-TQTYFEHAINELVRAIMRYLNFSD----ADKRHISQHWTRTKVEDS-LTKA----- 412 (469) Q Consensus 348 ~Sg~Al~~~~~~l~~k~~----~-~~~~~~~~l~~~~~~i~~~~~~~~----~~~~~i~i~f~~~~p~d~-~e~~----- 412 (469) .|+..+...-.-...... + ....+..-+.+.+.++.+.--+.. .....++|++..++-... ...+ T Consensus 379 ~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yis~La~aqk~~~~~~i~~ 458 (556) T protein:vir:73 379 MPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRIEYISVMAQAQKSIGLTSLSQ 458 (556) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEeecHHHHHHHHHHHHHHHH Confidence 466554333221111111 1 112223333344443333211111 123357777776665421 1111 Q ss_pred --HHHHHHhcc-------CChHHHHHhC---CCCC----CHHHHHHHHHHHHHHhhhhHhh----------cccCCCCCC Q lcl|NC_010179. 413 --QIVSTVANY-------SSKEAVAKAN---PIVD----DWQQELKDLAKDREENDPYANQ----------ADELNGKGV 466 (469) Q Consensus 413 --~~~~kl~g~-------iS~et~~~~l---~~v~----d~~~E~eri~~E~~~~~~~~~~----------~~~~~~~~~ 466 (469) +.+..++++ +....++..+ -+|+ -.++|++.+++++++.....++ ...+..-+. T Consensus 459 ~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~~~~~~~~a~~~~~~~~~~~~ 538 (556) T protein:vir:73 459 TVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQAMAMGQAAAQGAKTLSETQT 538 (556) T ss_pred HHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccC Confidence 222223332 3334444432 1122 1245666665554333322221 111222222 Q ss_pred CCC Q lcl|NC_010179. 
467 DDE 469 (469) Q Consensus 467 ~de 469 (469) .+. T Consensus 539 ~~~ 541 (556) T protein:vir:73 539 SDP 541 (556) T ss_pred CCH Confidence 122 No 125 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.41 E-value=7.7e-07 Score=54.09 Aligned_cols=440 Identities=11% Similarity=0.032 Sum_probs=196.2 Q ss_pred CCHH------HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHH Q lcl|NC_010179. 1 MELD------ALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQ 74 (469) Q Consensus 1 ~~~~------~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~ 74 (469) |.-+ ++++..+.+..++..-..+.+.+.+|-. +.+.... ... ........+.+.++-.+-+...++. T Consensus 1 m~~d~~~~~~~l~~r~~~l~~~R~~~e~~w~e~~~~~l-----P~~~~~~-~~~-~~~~~~~~~~~~~~~dstg~~a~~~ 73 (549) T protein:vir:10 1 MTNDDAKILQALNADHGRMKEKRQSYEAVWNDVIDYLM-----PRLDKFG-QLP-RPDSEKGRERSQKMFDSTAPLALRN 73 (549) T ss_pred CCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhc-----ccccccc-ccC-CCCCCcccccccccccchHHHHHHH Confidence 6553 2233333333333444444444444422 1111000 000 0000111123345666777777887 Q ss_pred HHHhhhc--CCee-----eccCchhh------HHHH-------HHHH---hccHHHHHHHHHHHHHhCCeEEEEEEEcCC Q lcl|NC_010179. 75 EAGYIAS--VFPD-----IDVGKDAD------NKKI-------LDVL---GDDRALTLNSLLVDSSNAGRAWLHYWIDED 131 (469) Q Consensus 75 ~~~~l~g--~p~~-----~~~~~~~~------~~~l-------~~~~---~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~ 131 (469) .++.|++ -||. +...++.. ...| ...+ ..||.....++.++...+|.+.+++-.+.. 
T Consensus 74 LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta~l~~~~~~~ 153 (549) T protein:vir:10 74 FVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPGALMIEHDVG 153 (549) T ss_pred HHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcceeeEEeecCC Confidence 7776653 2222 23333221 1111 1222 245666777888999999999887766665 Q ss_pred CceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC--------C----------ceEEEEEEEEcCCeEEEEEeecC Q lcl|NC_010179. 132 NNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE--------A----------GKYFTVHEYWTDKEAQFFRTSAT 193 (469) Q Consensus 132 ~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~--------~----------~~~~~~~~~~~~~~~~~~~~~~~ 193 (469) +.++++.++-.++++.-|. .+++...+|.++..-.. . ......+++|+. .+..... T Consensus 154 ~~~~f~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~l~~~v~~~~~~~~~~~~~v~~~----V~pr~~~ 227 (549) T protein:vir:10 154 KGIVYRNVPMQRLWFAENN--SGLIDKTHVQWELTLRQAAQRFGRENLSPSMQSTLEKDPEKSAIFYHA----VEPRADR 227 (549) T ss_pred CeeEEEEEEcCeEEEeeCC--CCCeEEEEEEeecCHHHHHHhcCcccCCHHHHHHhhcCCCceEEEEEE----eecCCCC Confidence 6678888888888877775 45666666654431110 0 000112222211 0000000 Q ss_pred ceeeccccc--ccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 194 DSTVIEPYN--IITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDD 266 (469) Q Consensus 194 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~ 266 (469) ......... ....|-.. +.... ....+|..+|++.++ ++.+|.|-.++..+-+..+|.+.-......+. 
T Consensus 228 ~~~~~~~~~~pf~sv~~e~-~~~~i---l~esg~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~ 303 (549) T protein:vir:10 228 DPRKLDGRNMQFASYWLDE-GRDRI---VQNSGFRTFPFAIGRFYVGTDDVYGGSPAYDAMPDVRMANDMAKTNIRGAQK 303 (549) T ss_pred CccccccccCceEEEEEEe-cCCEe---eccCCcccCCcceeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 00000000 11111 112345567777654 34579999999999999999999999999999 Q ss_pred hcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCc-ccc Q lcl|NC_010179. 267 VQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPAN-FES 345 (469) Q Consensus 267 ~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~ 345 (469) ..+|.+.+.-.+..+ ...+..++......+. +....+.-+....+.......++.++..|-..-....+.. .+. T Consensus 304 ~~~p~~~v~~~g~~~----~~~l~pgg~~~~~~~~-~~~~~~~pl~~~~~~~~~~~~i~~~~~rI~~af~~d~~~~~~~~ 378 (549) T protein:vir:10 304 LVDPPLLANEDGVLD----GFDLRSGALNWGGLND-KGEEMVKPLLTGKQAQIGIEFAQDTRQTINQWFYVTLFQILVDS 378 (549) T ss_pred HhcCceeeccccccc----cceeccCCccccccCC-CCccceeeeccccchhHHHHHHHHHHHHHHHHHhhhhhhhhcCC Confidence 999987753211111 1122333332222221 1223455555555667777777777776655332211111 122 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHhcccCC------CcccceEEeCCCCCCC Q lcl|NC_010179. 346 SNASGVAIKMLYSHLELKAAKTQTYFEH------------AINELVRAIMRYLNFSDA------DKRHISQHWTRTKVED 407 (469) Q Consensus 346 g~~Sg~Al~~~~~~l~~k~~~~~~~~~~------------~l~~~~~~i~~~~~~~~~------~~~~i~i~f~~~~p~d 407 (469) ...|+..+...- .++...++. -+.+.+.++.+.--+... ....+.++|..++-+. 
T Consensus 379 ~~~TAtEV~~r~-------~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~~i~yis~La~a 451 (549) T protein:vir:10 379 GDMTATEVLQRA-------QEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADVDVEYDSPLNKA 451 (549) T ss_pred CCccHHHHHHHH-------HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCceeEEEeecHHHHH Confidence 334555443322 233333333 334444443332111111 2234667776555542 Q ss_pred -HHHHH-------HHHHHHhcc-------CChHHHHHhC---CCCCC----HHHHHHHHHHHHHHhhhhHhhc------- Q lcl|NC_010179. 408 -SLTKA-------QIVSTVANY-------SSKEAVAKAN---PIVDD----WQQELKDLAKDREENDPYANQA------- 458 (469) Q Consensus 408 -~~e~~-------~~~~kl~g~-------iS~et~~~~l---~~v~d----~~~E~eri~~E~~~~~~~~~~~------- 458 (469) ..+.+ +.+..++++ +....++..+ -+++- .++|++++.++.++.....+.. T Consensus 452 q~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~~~~~~~a~~a~ 531 (549) T protein:vir:10 452 MRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQMQQMLAAAPVAA 531 (549) T ss_pred HHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11112 222222222 3333444332 11221 2466666655433332221110 Q ss_pred ---ccCCCCCCCCC Q lcl|NC_010179. 459 ---DELNGKGVDDE 469 (469) Q Consensus 459 ---~~~~~~~~~de 469 (469) .+......-.+ T Consensus 532 ~~a~~~~~~~ta~~ 545 (549) T protein:vir:10 532 GAIKDLSDAQTAAQ 545 (549) T ss_pred HHHHhhhhhcCCCc Confidence 11111111111 No 126 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.37 E-value=1e-06 Score=53.47 Aligned_cols=430 Identities=10% Similarity=0.007 Sum_probs=205.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) +.-+.+++..+.+..++..-..+.+.+.+|....- . .... ........++-.+-+...++..++.|+ T Consensus 9 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~--~-~~~~----------~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (535) T protein:vir:33 9 LGEDGAKATYDRLTNDRRAYETRAENCAQYTIPSL--F-PKES----------DNESTDYTTPWQAVGARGLNNLASKLM 75 (535) T ss_pred cChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc--c-CCCC----------CcccccccccccccHHHHHHHHHHHHH Confidence 56677778888887777766777777777765421 1 1000 001111123445556666777766554 Q ss_pred cC--Cee----eccCch---------hhHHHHHHH-----------H-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 81 SV--FPD----IDVGKD---------ADNKKILDV-----------L-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 81 g~--p~~----~~~~~~---------~~~~~l~~~-----------~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) +- |.+ +...+. .....++.| + ..||...+.++.++..++|.+.+++-.+..+. T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 155 (535) T protein:vir:33 76 LALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSY 155 (535) T ss_pred HhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCc Confidence 31 221 111211 111122222 3 24666778889999999999977766555566 Q ss_pred eEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC-----CceE-EEEE--EEEcCCeEEEEEeec---Cceeeccccc Q lcl|NC_010179. 134 FRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE-----AGKY-FTVH--EYWTDKEAQFFRTSA---TDSTVIEPYN 202 (469) Q Consensus 134 ~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~-----~~~~-~~~~--~~~~~~~~~~~~~~~---~~~~~~~~~~ 202 (469) ++++.++-.++++.-|. .+++...+|.++..... +... .... ..+.+-.++.+.... ..+.. 
T Consensus 156 ~~f~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~~~~~~~~~----- 228 (535) T protein:vir:33 156 NPMKLYRLSSYVVQRDA--YGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLDEESGDYLK----- 228 (535) T ss_pred eeeEEEEcCeeEEeeCC--CCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEeeCCCCcEEE----- Confidence 77888877776666554 45677777766543110 0000 0000 001111111111111 11110 Q ss_pred ccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC Q lcl|NC_010179. 203 IITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY 277 (469) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~ 277 (469) +....+.... ......+|+.+|++.++ ++.+|.|-.++..+-+..+|.+.-..........+|.+.+.-. T Consensus 229 ----~~~~~~~~~~-~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv~~~ 303 (535) T protein:vir:33 229 ----YEEVEDVEID-GSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLVNPA 303 (535) T ss_pred ----EEEEeCcccc-ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeccc Confidence 0000011100 01112356677887765 3457999999999999999999999999999999998665311 Q ss_pred CcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHH Q lcl|NC_010179. 278 GGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKM 355 (469) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~ 355 (469) +... ...+...+...+..+ ...++..+. ...+.......++.++..|-..-..-.+...+....|+..+.. 
T Consensus 304 g~~~----~~~~~~~~~g~~v~g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtEV~~ 376 (535) T protein:vir:33 304 GITQ----PRRLTKAQTGDFVPG---RREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEEIRY 376 (535) T ss_pred cccc----hhhcccCCceeeecC---CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHHHHH Confidence 1111 111112221111111 223344443 3346777788888887777553221112122233345544432 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcc----cCCCcccceEEeCCCCCCC-HHHHHHHHH----HH Q lcl|NC_010179. 356 LYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNF----SDADKRHISQHWTRTKVED-SLTKAQIVS----TV 418 (469) Q Consensus 356 ~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~----~~~~~~~i~i~f~~~~p~d-~~e~~~~~~----kl 418 (469) ++.++...++..+.++ ++.++.++.. ...+...++++|..++..- ..+.++.+. .+ T Consensus 377 -------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~~~l 449 (535) T protein:vir:33 377 -------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCISAW 449 (535) T ss_pred -------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHHHHH Confidence 3344455555544442 3333333322 2344456788887766542 112222221 22 Q ss_pred hcc--------CChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHhhhhHhhcccCC-------CCCCCCC Q lcl|NC_010179. 419 ANY--------SSKEAVAKAN---PIVD-----DWQQELKDLAKDREENDPYANQADELN-------GKGVDDE 469 (469) Q Consensus 419 ~g~--------iS~et~~~~l---~~v~-----d~~~E~eri~~E~~~~~~~~~~~~~~~-------~~~~~de 469 (469) +++ +....++..+ -+|+ -.++|++++.+++.+.....++....+ .++-++. 
T Consensus 450 a~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 523 (535) T protein:vir:33 450 AALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVENAAAAGGAGVGALATSSPEAM 523 (535) T ss_pred HhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhhcCChhH Confidence 221 2233333332 1222 124566666655544333322222211 1111111 No 127 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.33 E-value=1.3e-06 Score=52.93 Aligned_cols=313 Identities=15% Similarity=0.140 Sum_probs=144.2 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhc--c-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCC Q lcl|NC_010179. 79 IASVFPDIDVGKDADNKKILDVLGD--D-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATT 151 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~--n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~ 151 (469) +-.-|+.+.-.++.....+.+++.. | . .+....+...+..+|.+|+++-.+..|++ .+.+++|..+-++.+.. T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 3334555433344444556666642 3 1 22234566788999999999988888987 48888998888776653 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) . ..+ +|......+... .+....+.+++.-. 
+ T Consensus 81 ~-~~~-----~y~~~~~~g~~~-----~~~~~eiih~r~~~-------------------------------~------- 111 (348) T protein:vir:93 81 S-REL-----YYSIHAATGNKL-----IVHNMDMLHFKHIV-------------------------------A------- 111 (348) T ss_pred C-cEE-----EEEEEcCCCeEE-----EEccccEEEecCCC-------------------------------C------- Confidence 2 111 121111112110 12222222221100 0 Q ss_pred EecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCc-eeEEecCCcccchhhhhhh--------hh-cceeeecccC Q lcl|NC_010179. 232 EFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTV-ILVLTNYGGASLKQFMNDL--------RE-YKSIKINNAG 301 (469) Q Consensus 232 ~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p-~l~~~g~~~~~~~~~~~~~--------~~-~~~~~~~~~~ 301 (469) .+.-.|.|-++.+...++..+.+... .+..+..+ -.++. .+....++....+ .. .+++.++ T Consensus 112 --~~~~~G~s~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~-~~~~l~~e~~~~~~~~~~~~~~n~~~~~vl~--- 182 (348) T protein:vir:93 112 --SNMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLK-YGSNVSTEKRQQVLEDFKQYYEENGGILFQE--- 182 (348) T ss_pred --CCceeeccHHHHHHHHHHHHHHHHHH---HHHhcCCCceeEEe-cCCCCCHHHHHHHHHHHHHHhhcCCCeeecC--- Confidence 00113666666665555544333221 23444443 22222 1111111111111 11 1122221 Q ss_pred CCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 302 NGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELV 379 (469) Q Consensus 302 ~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 379 (469) ++.+|..... ....+.+..+.....|+..-++|+.-....++.+...++... ...+..+|.-++ T Consensus 183 ----~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~----------~~~~~~~l~P~~ 248 (348) T protein:vir:93 183 ----PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELN----------RFYLQHTLLPIV 248 (348) T ss_pred ----CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHH Confidence 2344444333 333555666777888999889987543322222222222111 122334454455 Q ss_pred HHHHHHhccc---CCC-cccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-----HHHHH Q lcl|NC_010179. 
380 RAIMRYLNFS---DAD-KRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ-----ELKDL 444 (469) Q Consensus 380 ~~i~~~~~~~---~~~-~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~-----E~eri 444 (469) +.|...++.+ ..+ .....+.| ..-+..|.++.++++.++ +|+++.-++.+.++. +++-+. -+-.+ T Consensus 249 ~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~ 328 (348) T protein:vir:93 249 KQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPI 328 (348) T ss_pred HHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcCeEeeccccccc Confidence 5454444321 111 11233444 455567889999999887 689999888888754 222111 11111 Q ss_pred HHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 445 AKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 445 ~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) . .+...+....+++.+++| T Consensus 329 ~------~~~~~~~~~~gg~~n~~~ 347 (348) T protein:vir:93 329 D------TPLELRKSLKGGDKNVNE 347 (348) T ss_pred c------cchhhcccccCCCCCcCC Confidence 1 111111112223333333 No 128 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=98.32 E-value=1.4e-06 Score=52.69 Aligned_cols=428 Identities=10% Similarity=0.040 Sum_probs=192.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ..-+.+++..+.+..++..-..+.+.+.+|....- .+... 
........++..+-+...++..++.|+ T Consensus 8 ~~~~~~~~r~~~lk~~R~~~e~~w~e~~~~~lP~~---~~~~~----------~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) T protein:vir:21 8 LAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDS----------DNASTDYQTPWQAVGARGLNNLASKLM 74 (536) T ss_pred hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCC----------CcccccccccccccHHHHHHHHHHHHH Confidence 55667777777777666655666777777765321 11110 011112235566677777777776654 Q ss_pred cC--Cee----eccCchh---------hH-----------HHHHHHH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 81 SV--FPD----IDVGKDA---------DN-----------KKILDVL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 81 g~--p~~----~~~~~~~---------~~-----------~~l~~~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) +- |.+ +...+.. .. +.+...+ ..||...+.++.++..++|.+.+++--+..+. T Consensus 75 ~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~ 154 (536) T protein:vir:21 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) T ss_pred HhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCc Confidence 31 211 1111111 11 1222233 23566677888999999998876553333333 Q ss_pred e-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC------------C--ceEEEEEEEEcCCeEEEEEeecCceeec Q lcl|NC_010179. 134 F-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE------------A--GKYFTVHEYWTDKEAQFFRTSATDSTVI 198 (469) Q Consensus 134 ~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~------------~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (469) + .++.++-.++++.-|. .+++...+|.++..... + ......+++|+. 
.+....+....+ T Consensus 155 ~~~f~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~----v~~~~~~~~~~~ 228 (536) T protein:vir:21 155 YNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTH----IYLDEDSGEYLR 228 (536) T ss_pred eeeEEEEEcCeEEEeeCC--CCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEE----EEEecCCCcEEE Confidence 3 3667776777665564 45677777665542210 0 111122222211 011111111110 Q ss_pred ccccccccccccccccccccccccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_010179. 199 EPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (469) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~ 273 (469) +....+ ..........+|..+|++.++- +.+|.|-.++..+-+..+|.+.-...........|.+. T Consensus 229 --------~~e~~g-~~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~l 299 (536) T protein:vir:21 229 --------YEEVEG-MEVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) T ss_pred --------EeccCC-eeeccccCccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc Confidence 011111 1111122234577888887753 45799999999999999998877777766666665443 Q ss_pred E-ecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHH Q lcl|NC_010179. 274 L-TNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVA 352 (469) Q Consensus 274 ~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~A 352 (469) + .+. ... ...+...+...+-++.. +...+..+....+.......++.++..|-..-..-.+...+....|+.. 
T Consensus 300 v~p~g-~~~----~~~~~~~~~g~~v~g~~-~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtE 373 (536) T protein:vir:21 300 VNPAG-ITQ----PRRLTKAQTGDFVTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEE 373 (536) T ss_pred cCccc-ccc----hhhhccCCCcceecCCc-ccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHH Confidence 3 211 111 11111111111111111 1112222334446666777777777777543222122222233345554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhccc----CCCcccceEEeCCCCCC-CHHHHHHHH---- Q lcl|NC_010179. 353 IKMLYSHLELKAAKTQTYFEHAINE--------LVRAIMRYLNFS----DADKRHISQHWTRTKVE-DSLTKAQIV---- 415 (469) Q Consensus 353 l~~~~~~l~~k~~~~~~~~~~~l~~--------~~~~i~~~~~~~----~~~~~~i~i~f~~~~p~-d~~e~~~~~---- 415 (469) +.. ++.++...++..+.+ +++.++.++... ..+...+++.+..++.. ...+.++.+ T Consensus 374 V~~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~ 446 (536) T protein:vir:21 374 IRY-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCV 446 (536) T ss_pred HHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHH Confidence 433 233444444444433 233333333222 22333355666544432 111222222 Q ss_pred HHHhcc--------CChHHHHHh----CCCCC-C---HHHHHHHHHHHHHHhhhhHhhcccCC----------------- Q lcl|NC_010179. 416 STVANY--------SSKEAVAKA----NPIVD-D---WQQELKDLAKDREENDPYANQADELN----------------- 462 (469) Q Consensus 416 ~kl~g~--------iS~et~~~~----l~~v~-d---~~~E~eri~~E~~~~~~~~~~~~~~~----------------- 462 (469) +.++++ +....++.. +|..+ . .++|++++.+++++.....++..... T Consensus 447 ~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 526 (536) T protein:vir:21 447 TAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAA 526 (536) T ss_pred HHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChhhHHhh Confidence 112221 233333332 33211 1 25677777766544433222111100 Q ss_pred -CCCCCCC Q lcl|NC_010179. 
463 -GKGVDDE 469 (469) Q Consensus 463 -~~~~~de 469 (469) +..+-++ T Consensus 527 ~~~~g~~~ 534 (536) T protein:vir:21 527 ADSVGLQP 534 (536) T ss_pred hhccccCC Confidence 0000001 No 129 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=98.31 E-value=1.4e-06 Score=52.63 Aligned_cols=372 Identities=12% Similarity=0.042 Sum_probs=181.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ||...+..+++.... ..+.++-.+.+...- ......-.+.+...-+. T Consensus 40 ltp~~l~~iLr~a~~---gd~~~~~~L~e~m~e------------------------------~D~~i~s~l~~Rk~av~ 86 (526) T protein:vir:99 40 LTPAKLARILVEAEQ---GNLQAQAELFMDMEE------------------------------RDAHLFAEMSKRKRAIL 86 (526) T ss_pred CCHHHHHHHHHhhhC---CCHHHHHHHHHHHHh------------------------------hChHHHHHHHHHHHHHh Confidence 333333333322110 001111111111000 02344445555566677 Q ss_pred cCCeeeccCc------hhhHHHHHHHHhc--cHHHHHHHHHHHHHhCCeEEE-EEEEcCCCceE---EEEEccceeEEEE Q lcl|NC_010179. 81 SVFPDIDVGK------DADNKKILDVLGD--DRALTLNSLLVDSSNAGRAWL-HYWIDEDNNFR---YGIIQPDQITPVY 148 (469) Q Consensus 81 g~p~~~~~~~------~~~~~~l~~~~~~--n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~~---i~~~~p~~~~~~~ 148 (469) |.+..+.... ....+.+.+++.+ ++.+.+..+. ++.-+|.++. ++|...+|... +.+.+|+.+. 
| T Consensus 87 ~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~~~~l~~r~~~~f~--~ 163 (526) T protein:vir:99 87 GLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWMPLAFHHRPQSWFQ--L 163 (526) T ss_pred CCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCceeEEEeeeeccccee--e Confidence 8887776432 2334567777754 3444444444 6788897554 55554445433 3444443321 2 Q ss_pred eCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcc Q lcl|NC_010179. 149 ATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRV 228 (469) Q Consensus 149 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 228 (469) ++.....+ +. . +.... ...+ .+++.| T Consensus 164 ~~~~~~~l----~~-~--~~~~~----g~~l-------------------------------------------~~~k~i 189 (526) T protein:vir:99 164 NPEDQNEL----RL-R--DNSPA----GEAL-------------------------------------------QPFGWI 189 (526) T ss_pred ccCCCcEE----Ee-c--CCCCC----ceee-------------------------------------------cCCCeE Confidence 22111000 00 0 00000 0000 122222 Q ss_pred cEEEec--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchh------hhhhhhhcceeeeccc Q lcl|NC_010179. 229 PFIEFP--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQ------FMNDLREYKSIKINNA 300 (469) Q Consensus 229 Pvv~~~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~------~~~~~~~~~~~~~~~~ 300 (469) -.++-. .++.|.|.+..+-...--=+..+.+++.-++.++.|+++.+-..+...++ ....+.......++.+ T Consensus 190 ~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~d~~~iiP~~ 269 (526) T protein:vir:99 190 IHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAAGIIPET 269 (526) T ss_pred EEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCC Confidence 222221 25678888887777776667788999999999999998877432222211 1233444555556554 Q ss_pred CCCCCCcceEEeec-CCHHHHHHHHHHHHHHHHHHhCCCCcCcc-ccCCccHHHH-HHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_010179. 
301 GNGDKSGVDKLQID-IPVEARDDALKITRDNIFLFGQGIDPANF-ESSNASGVAI-KMLYSHLELKAAKTQTYFEHAIN- 376 (469) Q Consensus 301 ~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~g~~Sg~Al-~~~~~~l~~k~~~~~~~~~~~l~- 376 (469) ..+++++.. ...+.++..++.+.+.|.+.--+-.++.+ +.|+.+.-|+ +....-....+..-.+.+...+. T Consensus 270 -----~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~i~~tln~ 344 (526) T protein:vir:99 270 -----MAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQLAATLSR 344 (526) T ss_pred -----ceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 358898853 45678899999999999886544444332 1122111121 11112223334455566777774 Q ss_pred HHHHHHHHHhcccCC-C-cccceEEeCCCCCCCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHh Q lcl|NC_010179. 377 ELVRAIMRYLNFSDA-D-KRHISQHWTRTKVEDSLTKAQIVSTVA--NY-SSKEAVAKANPIVDDWQQELKDLAKDREEN 451 (469) Q Consensus 377 ~~~~~i~~~~~~~~~-~-~~~i~i~f~~~~p~d~~e~~~~~~kl~--g~-iS~et~~~~l~~v~d~~~E~eri~~E~~~~ 451 (469) +++..++.+ |.... + .....+.|...-+.|.+..++.+.+++ |+ +|.+.+.+.++. +.+...-.-+....... T Consensus 345 ~Li~~l~~~-N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~Gi-p~~~~~e~~l~~~~~~~ 422 (526) T protein:vir:99 345 DLLWPLLVL-NRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI-PQPAKNEPVLRSAAQPA 422 (526) T ss_pred HHHHHHHHh-CCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCccCHHHHHHHhCC-CCCCCcccccCCCCCCc Confidence 477777765 33222 2 234678999999999999999999884 66 999999998864 32221100010000000 Q ss_pred hhh-Hhh-----cccCCCCCCCCC Q lcl|NC_010179. 452 DPY-ANQ-----ADELNGKGVDDE 469 (469) Q Consensus 452 ~~~-~~~-----~~~~~~~~~~de 469 (469) .+. ... 
..........++ T Consensus 423 ~~~~~~~~~~~~~~~~~~~~~~~~ 446 (526) T protein:vir:99 423 ILSRQHGQRVAALATIVGPRYGDQ 446 (526) T ss_pred ccccccccccccccccccccCcch Confidence 000 000 000000000111 No 130 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.30 E-value=1.5e-06 Score=52.48 Aligned_cols=427 Identities=11% Similarity=0.036 Sum_probs=205.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) +--+.+++..+.+..++..-..+.+.+.+|.... +. .... ........++-.+-+...++..++.|+ T Consensus 9 ~~~~~~k~r~~~l~~~R~~~e~~w~e~~~~~lP~--~~-~~~~----------~~~~~~~~~~~dst~~~a~~~Laa~l~ 75 (535) T protein:vir:15 9 LGEDGAKATYDRLTNDRRAYETRAENCAQYTIPS--LF-PKES----------DNESTDYTTPWQAVGARGLNNLASKLM 75 (535) T ss_pred cchHHHHHHHHHHHHHhhHHHHHHHHHHHHhccc--cc-CCCC----------CcccccccccccccHHHHHHHHHHHHH Confidence 5666777777888777766677777777776542 11 1000 001111224445666667777776654 Q ss_pred cC--Cee----eccCch---------hhHHHHHH-----------HH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 81 SV--FPD----IDVGKD---------ADNKKILD-----------VL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 81 g~--p~~----~~~~~~---------~~~~~l~~-----------~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) +- |.+ +...+. .....++. .+ ..||...+.++.++..++|.+.+++-.+..+. 
T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 155 (535) T protein:vir:15 76 LALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPEPEGSY 155 (535) T ss_pred HhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeecCCCCc Confidence 31 221 122211 11112222 23 23666778889999999999977665555566 Q ss_pred eEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC--------------CceEEEEEEEEcCCeEEEEEeecCceeecc Q lcl|NC_010179. 134 FRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE--------------AGKYFTVHEYWTDKEAQFFRTSATDSTVIE 199 (469) Q Consensus 134 ~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 199 (469) ++++.++-.++++.-|. .+++...+|.++..... .......+++|+.- +....++.... T Consensus 156 ~~f~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v----~~~~~~~~~~~- 228 (535) T protein:vir:15 156 NPMKLYRLSSYVVQRDA--YGNVLQIVTRDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHV----YLDEESGDYLK- 228 (535) T ss_pred eeeEEEEcCeeEEeeCC--CCCeeEEEEeEeecHHHHHHHHhHhhhccccccCCCCceeEEEEE----EEecCCCcEEE- Confidence 78888887777766554 45677777766543110 00011122222210 11111111100 Q ss_pred cccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE Q lcl|NC_010179. 200 PYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL 274 (469) Q Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~ 274 (469) +....+.... 
......+|..+|++.++ ++.+|.|-.++..+-+..+|.+.-..........+|.+.+ T Consensus 229 -------~~e~~g~~~~-~~~~~~~~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~lv 300 (535) T protein:vir:15 229 -------YEEVEDVEID-GSDATYPTDAMPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSMISAKVIGLV 300 (535) T ss_pred -------EEEeeCcccc-ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCceee Confidence 0001111110 01123456677887765 3457999999999999999999999999999999998665 Q ss_pred ecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHH Q lcl|NC_010179. 275 TNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVA 352 (469) Q Consensus 275 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~A 352 (469) .-.+... ...+...+...+..+ ...++..+. ...+.......++.++..|-..-..-.+...+....|+.. T Consensus 301 ~~~g~~~----~~~l~~~~~g~~v~g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~af~~~~~~~~~~~r~TAtE 373 (535) T protein:vir:15 301 NPAGITQ----PRRLTKAQTGDFVPG---RREDIDFLQLEKQADFTVAKAVSDQIEARLSYAFMLNSAVQRTGERVTAEE 373 (535) T ss_pred ccccccc----chhcccCCceeeecC---CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCccccHHH Confidence 3111111 111111221111111 223344443 3346777788888877777553221112122233345544 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcc----cCCCcccceEEeCCCCCCC-HHHHHHHHH--- Q lcl|NC_010179. 353 IKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNF----SDADKRHISQHWTRTKVED-SLTKAQIVS--- 416 (469) Q Consensus 353 l~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~----~~~~~~~i~i~f~~~~p~d-~~e~~~~~~--- 416 (469) +.. ++.++...++..+.++ ++.++.++.. ...+...++++|..++..- ..+.++.+. 
T Consensus 374 V~~-------r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~yis~La~aqr~~~~~~l~~~~ 446 (535) T protein:vir:15 374 IRY-------VASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCI 446 (535) T ss_pred HHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccceeEEEecHHHHHHHHHHHHHHHHHH Confidence 432 3344455555544442 3333333322 2344455788887766542 112222222 Q ss_pred -HHhcc--------CChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHhhhhHhhcccCCCCCCC-----CC Q lcl|NC_010179. 417 -TVANY--------SSKEAVAKAN---PIVD-----DWQQELKDLAKDREENDPYANQADELNGKGVD-----DE 469 (469) Q Consensus 417 -kl~g~--------iS~et~~~~l---~~v~-----d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~-----de 469 (469) .++++ +....++..+ -+|+ -.++|++++.+++.+.....++....+++-.. -| T Consensus 447 ~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a~~~g~~~~~~~~~~p~ 521 (535) T protein:vir:15 447 SAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAAATGGAGVGALATSSPE 521 (535) T ss_pred HHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhhccchhccChH Confidence 22221 2233333332 1222 12456666655443333222222221111111 11 No 131 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.24 E-value=2.1e-06 Score=51.66 Aligned_cols=428 Identities=10% Similarity=0.049 Sum_probs=191.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ..-+.+++..+.+..++..-..+.+.+.+|....- .+... 
........++..+-+...++..++.|+ T Consensus 8 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~---~~~~~----------~~~~~~~~~~~dst~~~a~~~Laa~l~ 74 (536) T protein:vir:10 8 LAEDGAKSVYERLKNDRAPYETRAQNCAQYTIPSL---FPKDS----------DNASTDYQTPWQAVGARGLNNLASKLM 74 (536) T ss_pred hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCC----------CcccccccccccccHHHHHHHHHHHHH Confidence 55567777777777666655666777777765321 11100 011112234556667777777776654 Q ss_pred cC--Cee----eccCchh---------hH-----------HHHHHHH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 81 SV--FPD----IDVGKDA---------DN-----------KKILDVL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 81 g~--p~~----~~~~~~~---------~~-----------~~l~~~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) +- |.+ +...+.. .. +.+...+ ..||...+.++.++..++|.+.+++--+..+. T Consensus 75 ~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly~~e~~~~~ 154 (536) T protein:vir:10 75 LALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLYLPEPEGSN 154 (536) T ss_pred hhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEEeeCCCCc Confidence 31 211 1111111 11 1222233 23566677888999999998876553333333 Q ss_pred e-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeec------------C--CceEEEEEEEEcCCeEEEEEeecCceeec Q lcl|NC_010179. 134 F-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDP------------E--AGKYFTVHEYWTDKEAQFFRTSATDSTVI 198 (469) Q Consensus 134 ~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~------------~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 198 (469) + .++.++-.++++.-|. .+++...+|.++.... . .......+++|+. .+......... 
T Consensus 155 ~~~~~~~pl~~~~v~~d~--~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~----V~~~~~~~~~~- 227 (536) T protein:vir:10 155 YNPMKLYRLSSYVVQRDA--FGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTH----IYLDEASGEYL- 227 (536) T ss_pred eeeEEEEEcCeEEEeeCC--CCCeeEEeeeeeccHHHHHHhhhhhhcccccccCcccceEEEEE----EEEecCCCcEE- Confidence 3 3667776777665564 4567777766554211 0 0111122222211 01111111111 Q ss_pred ccccccccccccccccccccccccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_010179. 199 EPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (469) Q Consensus 199 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~ 273 (469) .+....+. .........+|..+|++.++- +.+|.|-.++..+-+..+|.+.-...........|.+. T Consensus 228 -------~~~e~~g~-~v~~~~g~~~f~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~~~l 299 (536) T protein:vir:10 228 -------RYEEVEGM-EVQGSDGTYPKEACPYIPIRMVRLDGESYGRSYIEEYLGDLRSLENLQEAIVKMSMISSKVIGL 299 (536) T ss_pred -------EEEeecCc-cccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc Confidence 01111111 111112234567788887653 45799999999999999998877777766666665443 Q ss_pred E-ecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHH Q lcl|NC_010179. 274 L-TNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVA 352 (469) Q Consensus 274 ~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~A 352 (469) + .+. ... ...+...+...+-++.. +...+.-+....+.......++.++..|-..-..-.+...+....|+.. 
T Consensus 300 v~p~g-~~~----~~~~~~~~~g~~v~g~~-~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~l~~~~~~r~TAtE 373 (536) T protein:vir:10 300 VNPAG-ITQ----PRRLTKAQTGDFVTGRP-EDISFLQLEKQADFTVAKAVSDAIEARLSFAFMLNSAVQRTGERVTAEE 373 (536) T ss_pred cCccc-ccc----hhhhccCCCcceecCCc-ccceeeeccccccchHHHHHHHHHHHHHHHHHhhhhcccCCCCCccHHH Confidence 3 211 111 11111111111111111 1112222334446666777777777777543222122222233345554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhccc----CCCcccceEEeCCCCCC-CHHHHHHHH---- Q lcl|NC_010179. 353 IKMLYSHLELKAAKTQTYFEHAINE--------LVRAIMRYLNFS----DADKRHISQHWTRTKVE-DSLTKAQIV---- 415 (469) Q Consensus 353 l~~~~~~l~~k~~~~~~~~~~~l~~--------~~~~i~~~~~~~----~~~~~~i~i~f~~~~p~-d~~e~~~~~---- 415 (469) +... +.++...++..+.+ +++.++.++... ..+...+++.+..++.. ...+.++.+ T Consensus 374 V~~r-------~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~~~ 446 (536) T protein:vir:10 374 IRYV-------ASELEDTLGGVYSILSQELQLPLVRVLLKQLQATQQIPELPKEAVEPTISTGLEAIGRGQDLDKLERCV 446 (536) T ss_pred HHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEEecHHHHHHHHHHHHHHHHH Confidence 4332 33444444444433 233333333222 22333355666544432 111222222 Q ss_pred HHHhcc--------CChHHHHHh----CCCCC-C---HHHHHHHHHHHHHHhhhhHhhcccCC---------C------- Q lcl|NC_010179. 416 STVANY--------SSKEAVAKA----NPIVD-D---WQQELKDLAKDREENDPYANQADELN---------G------- 463 (469) Q Consensus 416 ~kl~g~--------iS~et~~~~----l~~v~-d---~~~E~eri~~E~~~~~~~~~~~~~~~---------~------- 463 (469) +.++++ +....++.. +|..+ . .++|++++.+++++.....++..... + T Consensus 447 ~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 526 (536) T protein:vir:10 447 TAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGMDNGAAALAQGMAAQATASPEAMAAA 526 (536) T ss_pred HHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHhh Confidence 122221 233334433 33211 1 25677777766544433222111100 0 Q ss_pred --CCCCCC Q lcl|NC_010179. 
464 --KGVDDE 469 (469) Q Consensus 464 --~~~~de 469 (469) ..+-++ T Consensus 527 ~~~~g~~~ 534 (536) T protein:vir:10 527 ADSVGLQP 534 (536) T ss_pred hhccccCC Confidence 000000 No 132 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.23 E-value=2.3e-06 Score=51.55 Aligned_cols=424 Identities=11% Similarity=0.069 Sum_probs=196.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ...+.+++..+.+..++..-..+.+.+.+|....- ..... . .......++..+-+...++..++.|+ T Consensus 7 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~------~----~~~~~~~~~~dst~~~a~~~Las~l~ 73 (522) T protein:vir:94 7 FAAEGAKAVYDRLKNGRQPYETRAQNCAAVTIPSL---FPKES------D----NSSTEYTTPWQAVGARCLNNLAAKLM 73 (522) T ss_pred hhHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---cCCCC------C----cccccccccccccHHHHHHHHHHHHH Confidence 44556666677776665555666667777755321 11100 0 01111224556667777777777654 Q ss_pred c-CCee-----eccC---------chhhHHHHHHHH------------hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 81 S-VFPD-----IDVG---------KDADNKKILDVL------------GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 81 g-~p~~-----~~~~---------~~~~~~~l~~~~------------~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) + -.|. +... +......++.|+ ..||...+.++.++..++|.+.+++--+..+. 
T Consensus 74 ~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~~ 153 (522) T protein:vir:94 74 LALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLYIPEPEQGT 153 (522) T ss_pred hhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEeeeccCCCc Confidence 3 2221 1111 111122233332 24666778888999999999987655444444 Q ss_pred e-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeec------------CCceEEEEEEEEcCCeEEEEEeecCceeeccc Q lcl|NC_010179. 134 F-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDP------------EAGKYFTVHEYWTDKEAQFFRTSATDSTVIEP 200 (469) Q Consensus 134 ~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (469) . .++.++-.++++.-|. .+++...+|.++.... +.......+++|+. .+. ....+.. T Consensus 154 ~~~~~~~pl~~y~v~~d~--~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~----v~~-~~~~~~~--- 223 (522) T protein:vir:94 154 YSPMRMYRLVSYVVQRDA--FGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTH----IYR-QDDEYLR--- 223 (522) T ss_pred eeeEEEEEcceEEEeeCC--CcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEE----EEe-eCCceeE--- Confidence 3 4666776665555553 4567666666554211 00011122333321 011 1111100 Q ss_pred ccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEe Q lcl|NC_010179. 201 YNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT 275 (469) Q Consensus 201 ~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~ 275 (469) +....+.... ......+|..+|++.++ ++.+|.|-.++..+-+..+|.+.-..........+|.+.+. 
T Consensus 224 ------~~~~~g~~~~-~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~p~~~v~ 296 (522) T protein:vir:94 224 ------YEEVEGIEVT-GTDGSYPLTACPYIPVRMVRLDGEDYGRSYCEEYLGDLNSLETITEAITKMAKVASKVVGLVN 296 (522) T ss_pred ------EeeccCceec-ccCCCCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhCCceeec Confidence 0000011110 11112356778877765 34579999999999999999999999999999999987653 Q ss_pred cCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHH Q lcl|NC_010179. 276 NYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAI 353 (469) Q Consensus 276 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al 353 (469) -.+..... .+...+.-.+..+ ...+++.+. ...+.......++.++..|...-..-.+...+..+.|+..+ T Consensus 297 ~~g~~~~~----~~~~~~~g~~v~g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TAtEV 369 (522) T protein:vir:94 297 PNGITQPR----RLNKAATGEFVAG---RVEDINFLQLTKGQDFTIAKSVADAIEQRLGWAFLLNSAVQRNAERVTAEEI 369 (522) T ss_pred ccccccch----heeccCCceeecC---CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCccccHHHH Confidence 11111111 1111111111111 122344333 33466777778888777776543222222223334556544 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhc----ccCCCcccceEEeCCCCCCC-HHHHHHHHHH--- Q lcl|NC_010179. 354 KMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLN----FSDADKRHISQHWTRTKVED-SLTKAQIVST--- 417 (469) Q Consensus 354 ~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~----~~~~~~~~i~i~f~~~~p~d-~~e~~~~~~k--- 417 (469) .. ++.++...++..+.++ ++.++.++. ....+...+++++..++..- ..+.++.+.. 
T Consensus 370 ~~-------r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~v~~~s~La~~qr~~~~~~l~~~~~ 442 (522) T protein:vir:94 370 RY-------VAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDLPKEAVEPTVSTGLEALGRGQDLEKLTQAVN 442 (522) T ss_pred HH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcccEEeeEecHHHHHHHHHHHHHHHHHHH Confidence 33 2334444444444332 222233332 22334445777776665541 1122222211 Q ss_pred -Hhcc--------CChHHHHHh----CCCCC--C---HHHHHHHHHHHHHHhhhhH---hhc-----ccCCCCCCCCC Q lcl|NC_010179. 418 -VANY--------SSKEAVAKA----NPIVD--D---WQQELKDLAKDREENDPYA---NQA-----DELNGKGVDDE 469 (469) Q Consensus 418 -l~g~--------iS~et~~~~----l~~v~--d---~~~E~eri~~E~~~~~~~~---~~~-----~~~~~~~~~de 469 (469) ++++ +....++.. +| |+ . .++|++.+.+++.+..... +.. ...+.+-..|. T Consensus 443 ~ia~l~P~~~~~~id~d~~~~~~a~~~G-v~~~~ivr~~ee~~~~~~q~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 519 (522) T protein:vir:94 443 MMTGLQPLSQDPDINLPTLKLRLLNALG-IDTAGLLLTQDEKIQRMAEQSSQQAVVQGASAAGANMGAAVGQGAGEDM 519 (522) T ss_pred HHHhccchhhhhcCCHHHHHHHHHHHcC-CChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccchhh Confidence 1211 222333322 23 31 1 1456666655533322211 111 11111111111 No 133 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=98.21 E-value=2.6e-06 Score=51.22 Aligned_cols=390 Identities=11% Similarity=0.032 Sum_probs=179.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchh-hhcccccc-ccccc-Ccc-ee--ccchHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKP-KVSKEGKK-DPLRS-ADN-RI--PSNFYQLLVDQ 74 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~-~~~~~~~~-~~~~~-~~~-ri--~~n~~k~iv~~ 74 (469) +..+.+++.-. .++--..+-+.+ |....-.+.+. .+...... +...+ .-. 
.+ ......-.+.+ T Consensus 12 ~~~~~~~~~~~----------~~~~~~~~~~~~-~~~~gltp~~l~~il~~a~~gd~~~~~~L~edm~e~D~~i~s~l~~ 80 (526) T protein:vir:79 12 IRPQQLREPQT----------SRLAGLAKEFAQ-HPAKGLTPAKLARILVEAEQGNLQAQAELFMDMEERDAHLFAEMSK 80 (526) T ss_pred cCccccchhhh----------hhhhhhhhhccc-CCCCCcCHHHHHHHHHHhhCCCHHHHHHHHHHHHhhChHHHHHHHH Confidence 32222221100 000000111111 00000000000 00000000 00000 000 00 12444455666 Q ss_pred HHHhhhcCCeeeccCc------hhhHHHHHHHHhc--cHHHHHHHHHHHHHhCCeEEE-EEEEcCCCceE---EEEEccc Q lcl|NC_010179. 75 EAGYIASVFPDIDVGK------DADNKKILDVLGD--DRALTLNSLLVDSSNAGRAWL-HYWIDEDNNFR---YGIIQPD 142 (469) Q Consensus 75 ~~~~l~g~p~~~~~~~------~~~~~~l~~~~~~--n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~~---i~~~~p~ 142 (469) ...-+.|.+..+.... ....+.+.+++.+ ++.+ +..-..++.-+|.++. ++|-..+|.+. +.+.+|+ T Consensus 81 Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~-~i~~~ldA~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~ 159 (526) T protein:vir:79 81 RKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLED-LLLDALDGIGHGYSCIELEWALQGREWMPLAFHHRPQS 159 (526) T ss_pred HHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHH-HHHHHHhhhhhcceeEEEEEeecCCceeEEEeeeeccc Confidence 6667788888876432 2334457777755 3444 3334445778887554 55554445433 3333443 Q ss_pred eeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccc Q lcl|NC_010179. 143 QITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (469) Q Consensus 143 ~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (469) .+. |++.....+ +. .....+| ..+ T Consensus 160 ~F~--~~~~~~~~l----~~-~~~~~~g------~~l------------------------------------------- 183 (526) T protein:vir:79 160 WFQ--LNPEDQNEL----RL-RDNSPAG------EAL------------------------------------------- 183 (526) T ss_pred ceE--eccCCCcEE----Ee-cCCCCCc------eee------------------------------------------- Confidence 221 232211110 00 0000000 000 Q ss_pred ccCCcccEEEec--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh------hhhhhhcce Q lcl|NC_010179. 
223 HNFGRVPFIEFP--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF------MNDLREYKS 294 (469) Q Consensus 223 ~~~g~vPvv~~~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~------~~~~~~~~~ 294 (469) .+++.|-.++-. .++.|.|.+..+-...--=+..+.+++.-++.++.|+++.+-..+....+- ...+....+ T Consensus 184 ~~~k~iv~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~~a~~~ek~~L~~av~~i~~da~ 263 (526) T protein:vir:79 184 QPFGWIIHRPRARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPPGTADEEKATLLRAVTGLGHAAA 263 (526) T ss_pred cCCceEEEeecCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCCCCCHHHHHHHHHHHHHHhcCcE Confidence 122222222211 256688888877776666677888999999999999988774322222211 233444555 Q ss_pred eeecccCCCCCCcceEEeec-CCHHHHHHHHHHHHHHHHHHhCCCCcCcc----ccC-CccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 295 IKINNAGNGDKSGVDKLQID-IPVEARDDALKITRDNIFLFGQGIDPANF----ESS-NASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~~g-~~Sg~Al~~~~~~l~~k~~~~~ 368 (469) ..++.+ ..+++++.. ...+.++..++.+.+.|.+.--+-.++.+ +.| +..|..- ..-....+..-. T Consensus 264 ~iiP~~-----~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh---~~v~~di~~aDa 335 (526) T protein:vir:79 264 GIIPET-----MAIDFQQAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVH---NEVRHDILASDA 335 (526) T ss_pred EEecCC-----ceeEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHH---HHHHHHHHHHHH Confidence 556554 358899853 45678899999999999886544334332 111 1222211 111222334445 Q ss_pred HHHHHHHH-HHHHHHHHHhcccCC-C-cccceEEeCCCCCCCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHH Q lcl|NC_010179. 369 TYFEHAIN-ELVRAIMRYLNFSDA-D-KRHISQHWTRTKVEDSLTKAQIVSTVA--NY-SSKEAVAKANPIVDDWQQELK 442 (469) Q Consensus 369 ~~~~~~l~-~~~~~i~~~~~~~~~-~-~~~i~i~f~~~~p~d~~e~~~~~~kl~--g~-iS~et~~~~l~~v~d~~~E~e 442 (469) +.+...+. ++++.++.+ |.... + .....+.|...-+.|.+..++.+.+++ |+ +|.+.+.+.++. +.++. 
-+ T Consensus 336 ~~i~~tln~~Li~~l~~~-N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi-p~~~~-~e 412 (526) T protein:vir:79 336 RQLAATLSRDLLWPLLVL-NRPGSPDVRRAPRLVFDLREQADITSMAQSIPALVNVGLEIPSAWVYDKLGI-PQPAK-NE 412 (526) T ss_pred HHHHHHHHHHHHHHHHHh-CCCCcCCccccceEEeCCCCcccHHHHHHHHHHHHhCCCcCCHHHHHHHhCC-CCCCC-ch Confidence 66667774 477766664 32221 2 234578999999999999999999885 55 999999998874 33221 11 Q ss_pred HHHHHHHHhhhhHh-------hcccCCCCCCCCC Q lcl|NC_010179. 443 DLAKDREENDPYAN-------QADELNGKGVDDE 469 (469) Q Consensus 443 ri~~E~~~~~~~~~-------~~~~~~~~~~~de 469 (469) .+........+... ...........++ T Consensus 413 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 446 (526) T protein:vir:79 413 PVLRPAAQPAILSRQHGQRVAALATIVGPRYGDQ 446 (526) T ss_pred hhccccCCccccccccccccccccccccccCchh Confidence 11111100000000 0000011111111 No 134 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=98.17 E-value=3.1e-06 Score=50.76 Aligned_cols=370 Identities=11% Similarity=-0.020 Sum_probs=184.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ||...+..+++....- .+.++-.+ ||.-. . ......-.+.+...-+. T Consensus 40 ltp~~l~~iL~~a~~g---d~~~~~~L--~~dm~----~------------------------~D~hi~s~l~~Rk~av~ 86 (512) T protein:vir:19 40 VTPNRAAQMLRDAERG---DLTAQADL--AFDME----E------------------------KDTHLFSELSKRRLAIQ 86 (512) T ss_pred CCHHHHHHHHHHhhCC---CHHHHHHH--HHHHH----h------------------------hChHHHHHHHHHHHHHh Confidence 5555444444332111 11111111 11110 0 12444455666667778 Q ss_pred cCCeeeccCc--h----hhHHHHHHHHhc--cHHHHHHHHHHHHHhCCeEEE-EEEEcCCCceE---EEEEccceeEEEE Q lcl|NC_010179. 
81 SVFPDIDVGK--D----ADNKKILDVLGD--DRALTLNSLLVDSSNAGRAWL-HYWIDEDNNFR---YGIIQPDQITPVY 148 (469) Q Consensus 81 g~p~~~~~~~--~----~~~~~l~~~~~~--n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~~---i~~~~p~~~~~~~ 148 (469) |.+..+.... + ...+.+++++.+ ++.+.+.. ..++.-+|.++. ++|.-.+|... +.+.+|+.+ .| T Consensus 87 ~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~-lldA~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f--~~ 163 (512) T protein:vir:19 87 ALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFD-AGDAILKGYSMQEIEWGWLGKMRVPVALHHRDPALF--CA 163 (512) T ss_pred CCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHH-HHhhhhhcceeeeeEeeeeCCceeeeeeeeeccccc--ee Confidence 8888876432 1 234456777754 34443444 446888897654 55643344332 444555433 22 Q ss_pred eCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcc Q lcl|NC_010179. 149 ATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRV 228 (469) Q Consensus 149 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 228 (469) ++.....+ +.. ++.. ....+ .+++.| T Consensus 164 ~~~~~~~l----r~~-----~~~~--~G~~l-------------------------------------------~~~k~i 189 (512) T protein:vir:19 164 NPDNLNEL----RLR-----DASY--HGLEL-------------------------------------------QPFGWF 189 (512) T ss_pred ccCCCcEE----Eec-----CCCC--Cceee-------------------------------------------cCCceE Confidence 32211111 000 0000 00000 112222 Q ss_pred cEEEe--cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch------hhhhhhhhcceeeeccc Q lcl|NC_010179. 229 PFIEF--PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK------QFMNDLREYKSIKINNA 300 (469) Q Consensus 229 Pvv~~--~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~------~~~~~~~~~~~~~~~~~ 300 (469) -.++- ..++.|.|.+..+-...-.-+..+.+++.-++.++.|+++.+-..+.... 
.....+.......++.+ T Consensus 190 ~~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~~~~~a~~iiP~~ 269 (512) T protein:vir:19 190 MHRAKSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPTGSTNREKATLMQAVMDIGRRAGGIIPMG 269 (512) T ss_pred EEeccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCC Confidence 22221 13567888888887877788888899999999999999887643222211 11233445555555544 Q ss_pred CCCCCCcceEEeec-CCHHHHHHHHHHHHHHHHHHhCCCCcCccc--cC-CccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 301 GNGDKSGVDKLQID-IPVEARDDALKITRDNIFLFGQGIDPANFE--SS-NASGVAIKMLYSHLELKAAKTQTYFEHAIN 376 (469) Q Consensus 301 ~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 376 (469) ..+++++.. .....++..++.+.+.|.+.--+-.++.+. .| +..|.. ...-....+..-.+.+...+. T Consensus 270 -----~~ie~~ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g~~Gs~a~~~v---h~ev~~di~~aDa~~i~~tln 341 (512) T protein:vir:19 270 -----MTLDFQSAADGQSDPFMAMIGWAEKAISKAILGGTLTTEAGDKGARSLGEV---HDEVRREIRNADVGQLARSIN 341 (512) T ss_pred -----ceEEEeecCCCCHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHH---HHHHHHHHHHHHHHHHHHHHH Confidence 368888753 355678999999999998854443333332 12 222221 112233334455566777774 Q ss_pred -HHHHHHHHHhcccCCC-cccceEEeCCCCCCCHHHHHHHHHHH-hcc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHhh Q lcl|NC_010179. 377 -ELVRAIMRYLNFSDAD-KRHISQHWTRTKVEDSLTKAQIVSTV-ANY-SSKEAVAKANPIVDDWQQELKDLAKDREEND 452 (469) Q Consensus 377 -~~~~~i~~~~~~~~~~-~~~i~i~f~~~~p~d~~e~~~~~~kl-~g~-iS~et~~~~l~~v~d~~~E~eri~~E~~~~~ 452 (469) ++++-++.+-.....+ .....+.|...-+.|.+..++.+.++ .|+ +|.+.+.+.++. +.++.+-.-+........ 
T Consensus 342 ~~li~~l~~~N~~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~~G~~i~~~~i~e~~Gi-p~~~~~e~~~~~~~~~~~ 420 (512) T protein:vir:19 342 RDLIYPLLALNSDSTIDINRLPGIVFDTSEAGDITALSDAIPKLAAGMRIPVSWIQEKLHI-PQPVGDEAVFTIQPVVPD 420 (512) T ss_pred HHHHHHHHHhCCCCCCCccccceEEecCCChhhHHHHHHHHHHHhcCCCCCHHHHHHHhCC-CCCCCccccccCCCcccc Confidence 5777666542222222 23467889999999999999988876 466 898888888863 322211000110000000 Q ss_pred hhHhhcccCCCCCCCC--C Q lcl|NC_010179. 453 PYANQADELNGKGVDD--E 469 (469) Q Consensus 453 ~~~~~~~~~~~~~~~d--e 469 (469) ...............+ + T Consensus 421 ~~~~~~~~~~~~~~~~~~~ 439 (512) T protein:vir:19 421 NGSQKEAALSAEDIPQEDD 439 (512) T ss_pred ccccccccccccCCCchhh Confidence 0000000000001101 1 No 135 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=98.11 E-value=4.5e-06 Score=49.91 Aligned_cols=373 Identities=10% Similarity=-0.006 Sum_probs=182.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ||...+..+++.... ..+.++-.+.+...- ......-.+.+...-+. T Consensus 40 ltp~~l~~il~~a~~---gd~~~~~~L~~~m~e------------------------------~D~~i~s~l~~Rk~av~ 86 (528) T protein:vir:10 40 LTPAKLAHILIEAEQ---GHLQAQAELFMDMEE------------------------------RDAHLFAEMSKRKRAVL 86 (528) T ss_pred CCHHHHHHHHHhhhC---CCHHHHHHHHHHHHh------------------------------hChHHHHHHHHHHHHHh Confidence 333333333322110 011111111111110 12445556666677788 Q ss_pred cCCeeeccCc--h----hhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEE-EEEEcCCCceE---EEEEccceeEEEEe Q lcl|NC_010179. 
81 SVFPDIDVGK--D----ADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWL-HYWIDEDNNFR---YGIIQPDQITPVYA 149 (469) Q Consensus 81 g~p~~~~~~~--~----~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~~---i~~~~p~~~~~~~d 149 (469) |.+..+...+ + ...+++.+++.+ .....+..-..++.-+|.++. ++|...+|... +.+.+++.+ .|+ T Consensus 87 ~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~lda~~~G~s~~Ei~w~~~~g~~~~~~~~~r~~~~f--~~~ 164 (528) T protein:vir:10 87 GLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLDCMDGVGHGYSAIELDWSLQGREWLPQAFDHRPQSWF--QLN 164 (528) T ss_pred cCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHHHhhhhhcceeEEEEEeecCCceeEEEeeeecccce--eec Confidence 8888886532 1 233456666654 223334444556788897654 55644444433 333333322 122 Q ss_pred CCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_010179. 150 TTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVP 229 (469) Q Consensus 150 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 229 (469) +... ...-++ +.... ...+ .+++.+= T Consensus 165 ~~~~--~~l~~~-----~~~~~----g~~l-------------------------------------------~~~k~iv 190 (528) T protein:vir:10 165 PDDQ--DELRLR-----DNSIA----GEVL-------------------------------------------QPFGWIM 190 (528) T ss_pred cCCC--cEEecc-----CCCCC----ceee-------------------------------------------cCCCeEE Confidence 2111 000000 00000 0000 0122222 Q ss_pred EEEec--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchh------hhhhhhhcceeeecccC Q lcl|NC_010179. 230 FIEFP--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQ------FMNDLREYKSIKINNAG 301 (469) Q Consensus 230 vv~~~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~------~~~~~~~~~~~~~~~~~ 301 (469) .++-. 
.++.|.|.+..+-...---+..+.+++.-++.++.|+++.+-..+...++ ....+.......++.+ T Consensus 191 ~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~L~~al~~i~~~~~~iiP~~- 269 (528) T protein:vir:10 191 HKPRSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPPGTPDEEKVTLLRAVTGLGHAAAGIIPES- 269 (528) T ss_pred EeecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhhCcEEEecCC- Confidence 22211 25668888888877777778888999999999999998876432222111 1223444455555544 Q ss_pred CCCCCcceEEeec-CCHHHHHHHHHHHHHHHHHHhCCCCcCccc-cCCccHHHH-HHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_010179. 302 NGDKSGVDKLQID-IPVEARDDALKITRDNIFLFGQGIDPANFE-SSNASGVAI-KMLYSHLELKAAKTQTYFEHAIN-E 377 (469) Q Consensus 302 ~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Al-~~~~~~l~~k~~~~~~~~~~~l~-~ 377 (469) ..+++++.. ...+.++..++.+.+.|.+.--+-.++..+ .|..+.-|+ +....-....+..-.+.+...+. + T Consensus 270 ----~~ie~~ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~i~~tln~~ 345 (528) T protein:vir:10 270 ----MSIDFQEASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQLAATLSRD 345 (528) T ss_pred ----ceeEEeecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 358899854 456788999999999998865444443321 111111121 11112223344455566777775 4 Q ss_pred HHHHHHHHhcccCCC-cccceEEeCCCCCCCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHH--HHHHHHHHHHHHh Q lcl|NC_010179. 378 LVRAIMRYLNFSDAD-KRHISQHWTRTKVEDSLTKAQIVSTVA--NY-SSKEAVAKANPIVDDWQ--QELKDLAKDREEN 451 (469) Q Consensus 378 ~~~~i~~~~~~~~~~-~~~i~i~f~~~~p~d~~e~~~~~~kl~--g~-iS~et~~~~l~~v~d~~--~E~eri~~E~~~~ 451 (469) ++..++.+-.....+ .....+.|...-+.|.++.++.+.+++ |+ +|.+.+.+.++. +.++ +++..-+... .. 
T Consensus 346 li~~l~~~N~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~~~G~~i~~~~i~e~~gi-p~p~~~e~~~~~~~~~-~~ 423 (528) T protein:vir:10 346 LLWPLLVLNRSGNLDARRAPRLVFDLKDRADLAAMATSLPPLVKLGVQVPVNWVQEQLGI-PLPANGEAVLGDQAGA-GI 423 (528) T ss_pred HHHHHHHhCCCCCCCccccceEEecCCCcccHHHHHHHHHHHHhCCCCCCHHHHHHHhCC-CCCCCCcccccCCCcc-cc Confidence 777776643222222 344678999999999999999999884 66 999999888874 3222 1111100000 00 Q ss_pred hh-------hHhhcccCCCCCCCCC Q lcl|NC_010179. 452 DP-------YANQADELNGKGVDDE 469 (469) Q Consensus 452 ~~-------~~~~~~~~~~~~~~de 469 (469) .+ ....+.........++ T Consensus 424 ~~~~~~~~~~~~~~~~~~~~~~~~~ 448 (528) T protein:vir:10 424 AQLSRRPGPRIAALAQVIGPRYRDQ 448 (528) T ss_pred cccCccccccccccccccccccccc Confidence 00 0000001111111111 No 136 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.11 E-value=4.5e-06 Score=49.91 Aligned_cols=368 Identities=10% Similarity=0.005 Sum_probs=156.9 Q ss_pred cccccccCcceeccchHHHHHHHHHHhhhcCCeeeccCc--------hhhHHHHHHHHh-c--c------------HHHH Q lcl|NC_010179. 52 KKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDIDVGK--------DADNKKILDVLG-D--D------------RALT 108 (469) Q Consensus 52 ~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~--------~~~~~~l~~~~~-~--n------------~~~~ 108 (469) .+ ...-..+.....|+..++-+.|-|+.+.... ....+.+.+++. . | ..+. T Consensus 1 l~------~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~ 74 (467) T protein:vir:31 1 MA------ELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNV 74 (467) T ss_pred Ch------hhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHH Confidence 00 0001246777888888888888887663211 112223333332 1 1 1122 Q ss_pred HHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEE Q lcl|NC_010179. 
109 LNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQF 187 (469) Q Consensus 109 ~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (469) +..+..+...+|.+|+.+..+.+|++ .+.+++|..+.+.-+... . .... .+...+ +.++.+..... T Consensus 75 ~~~~~~~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~~d~~~---~------~~~~--~~~~~~--~~~~~~~~~~~ 141 (467) T protein:vir:31 75 LQTAWTDYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKRMDERG---F------VQLL--EEKEKY--FGVAGDRYQTN 141 (467) T ss_pred HHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEeeeecce---e------Eeec--CCceee--EEeccccceee Confidence 34566788999999999888988886 488899988877655321 1 0000 011110 01111111100 Q ss_pred EEeecCceeecccccccccccccccccccccccccccCCcccEEEecCC-----ccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 188 FRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN-----KYRLAELNKYKGLIDAYDDIYNGFIN 262 (469) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~-----~~g~~~~~~v~~liD~~~~~~s~~~~ 262 (469) ......... ...... .......+..=-|+||+.. -.|.|.+......++....+..-..+ T Consensus 142 ~~~~~~~~~----------~~~~~~-----~~~~~~~~~~~diih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~ 206 (467) T protein:vir:31 142 GNGDLDPVF----------VDADDG-----STGTSVSNPANELIFKRNHSPLYPHYGAPDIIPAVKTIRGDSAAQDYNID 206 (467) T ss_pred cccceeeee----------eeeccc-----cccceeEeccccEEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 000000000 000000 0000001111124555432 24777776666555544444433444 Q ss_pred HHHHhcCceeEE--ecCCcccchhhhhhhhh-----------------------cceeeecccCCCCCCcceEEee--cC Q lcl|NC_010179. 263 DLDDVQTVILVL--TNYGGASLKQFMNDLRE-----------------------YKSIKINNAGNGDKSGVDKLQI--DI 315 (469) Q Consensus 263 ~~~~~~~p~l~~--~g~~~~~~~~~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~l~~--~~ 315 (469) .+...+.|-.++ +|.. ..++....++. .+.+.+..+......++++... .. 
T Consensus 207 ~f~ng~~p~gil~~~~~~--l~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~ 284 (467) T protein:vir:31 207 FFENDGVPRIAIIVKGAE--LTEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGI 284 (467) T ss_pred HHhccCCCceEEEecCcC--CCHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEeccccC Confidence 445555564444 3421 11111111110 0111122221111222333221 11 Q ss_pred -CHHHHHHHHHHHHHHHHHHhCCCCcCcc--ccCCc-cH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc- Q lcl|NC_010179. 316 -PVEARDDALKITRDNIFLFGQGIDPANF--ESSNA-SG-VAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS- 389 (469) Q Consensus 316 -~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~g~~-Sg-~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~- 389 (469) ....+....+...+.|...-++|+.-.. ..++. |. ++.. ...+..++.-+++.+...++.+ T Consensus 285 ~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~-------------~~f~~~~l~P~~~~ie~~ln~~l 351 (467) T protein:vir:31 285 DEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQR-------------KEFAEETIQPKQHDFGELLYELV 351 (467) T ss_pred hhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHH-------------HHHHHHHHHHHHHHHHHHHHHhh Confidence 2345566677778889888888864221 11221 21 1111 1112233333333333333321 Q ss_pred ---C--CCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHH---HHHHHHHH-hhhhHhhc Q lcl|NC_010179. 390 ---D--ADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELK---DLAKDREE-NDPYANQA 458 (469) Q Consensus 390 ---~--~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~e---ri~~E~~~-~~~~~~~~ 458 (469) . .....+++.+......|.++.++++.++ +|+++.-.+.+++++-+-++.++- -+..+... ..+. ... T Consensus 352 ~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~-~~~ 430 (467) T protein:vir:31 352 HKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYGGETLVAEVTGGSGPG-GGI 430 (467) T ss_pred cchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccCCcccccccccccCCC-Ccc Confidence 1 1223456666777888999999998876 689999999998765221111110 00000000 0000 001 Q ss_pred ccCCCCCCCCC Q lcl|NC_010179. 
459 DELNGKGVDDE 469 (469) Q Consensus 459 ~~~~~~~~~de 469 (469) ++....+.+++ T Consensus 431 ~~~~~~~~~~~ 441 (467) T protein:vir:31 431 GDQIEQLVEDR 441 (467) T ss_pred cCcCCCCCCCc Confidence 11111111111 No 137 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=98.07 E-value=5.3e-06 Score=49.52 Aligned_cols=376 Identities=11% Similarity=0.013 Sum_probs=181.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hccCCcccccccchhhhcccc-cccccccCcceeccchHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDY----YENKTDITTRNNGKPKVSKEG-KKDPLRSADNRIPSNFYQLLVDQE 75 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Y----y~g~~~i~~~~~~~~~~~~~~-~~~~~~~~~~ri~~n~~k~iv~~~ 75 (469) +.-..+.+ ++.......+. ..|-.. +.+........+.. ..... .......-.+.+. T Consensus 1 v~~~~l~~-----------e~at~~~~~d~~~~~~~~l~~-~~~~il~~a~~g~~~~y~~l------~~D~~i~s~l~~r 62 (488) T protein:vir:99 1 MEKPALGR-----------EIATSGDGRDITRPFISGLQV-PNDSILQRRGGNDLRVYEEI------LSDAQVKTVWGQR 62 (488) T ss_pred CCccchhH-----------HHHHHHhhhhhhccccCCCCC-CChHHHHhhccCCHHHHHHH------hhChHHHHHHHHH Confidence 11111111 11111112222 222110 00000000000000 00000 1235556667777 Q ss_pred HHhhhcCCeeeccCchh-----hHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEE-EEEEcCCCceE---EEEEccceeE Q lcl|NC_010179. 76 AGYIASVFPDIDVGKDA-----DNKKILDVLGD-DRALTLNSLLVDSSNAGRAWL-HYWIDEDNNFR---YGIIQPDQIT 145 (469) Q Consensus 76 ~~~l~g~p~~~~~~~~~-----~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~~---i~~~~p~~~~ 145 (469) ...+.|.+..+.+.++. ..+++.++++. ++.+.+.++. ++.-+|.++. ++|...+|.+. 
+.+.+|+.+ T Consensus 63 k~av~~~~w~i~p~~~~~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f- 140 (488) T protein:vir:99 63 QLAVVSREWKVEAGGDRPIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRF- 140 (488) T ss_pred HHHHhcCCceEEcCCCChHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccce- Confidence 77888999888754432 33567777765 4444444544 6788997654 55654445543 344444432 Q ss_pred EEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccC Q lcl|NC_010179. 146 PVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNF 225 (469) Q Consensus 146 ~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (469) .|++.. .+. +...... ......+.++ T Consensus 141 -~~d~~~--~l~------------------------------~~~~~~~---------------------~~g~~lp~~~ 166 (488) T protein:vir:99 141 -RYDQDG--GLR------------------------------LLTPNNM---------------------FEGEPCPAPY 166 (488) T ss_pred -eecCCC--ceE------------------------------EeccCCC---------------------CCccccccCc Confidence 223211 110 0000000 0000111122 Q ss_pred CcccEEEec--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC-cccchh------hhhhhhhcceee Q lcl|NC_010179. 226 GRVPFIEFP--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG-GASLKQ------FMNDLREYKSIK 296 (469) Q Consensus 226 g~vPvv~~~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~-~~~~~~------~~~~~~~~~~~~ 296 (469) +.+-.++.. .++.|.|.+..+-...--=+..+..++.-++.++.|+++.+-.. +.+..+ ....+....... T Consensus 167 ~~i~~~~~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~~a~~~ek~~l~~av~~~~~~~~~v 246 (488) T protein:vir:99 167 FWHFSTGADNDDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDDKTATPEDKAKLLAALHAIQTDSAII 246 (488) T ss_pred eEEEEeecCCCCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCCCCCCHHHHHHHHHHHHHHhcCcEEE Confidence 222112211 25678888888877777777778899999999999998877432 111111 123444555555 Q ss_pred ecccCCCCCCcceEEeec-CCHHHHHHHHHHHHHHHHHHhCCCCcCccc-cC-CccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
297 INNAGNGDKSGVDKLQID-IPVEARDDALKITRDNIFLFGQGIDPANFE-SS-NASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 297 ~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) ++.+ ..+++++.. .+.+.++..++.+.+.|.+.--+-.++.++ .| ...|..-. .-....++.-.+.+.. T Consensus 247 iP~~-----~~ie~~ea~~~~~~~~~~li~~~d~~Isk~iLGqtlts~~~~Gs~a~~~vh~---~v~~d~~~aDa~~i~~ 318 (488) T protein:vir:99 247 MPAG-----MQAELLEAGRSGTADYKTLHDTMDATIAKVGLGQVASTQGTPGRLGNDDLQA---DVRLDLVKADADLICE 318 (488) T ss_pred ecCC-----ceeEEeecCCCChHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhHHHHH---HHHHHHHHHHHHHHHH Confidence 5544 458898854 355678999999999998754332332222 22 22332211 2233344455667777 Q ss_pred HHH-HHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH---hcc-CChHHHHHhCCCCCCHHHHHHHHHHHH Q lcl|NC_010179. 374 AIN-ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV---ANY-SSKEAVAKANPIVDDWQQELKDLAKDR 448 (469) Q Consensus 374 ~l~-~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl---~g~-iS~et~~~~l~~v~d~~~E~eri~~E~ 448 (469) .+. +++..++.+ |..+. ....+.|....+.|.++.++.+.++ +|+ ++.+.+.+.++. +.++.+- +. T Consensus 319 tln~~li~~l~~~-N~~~~--~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gi-p~~~~~~-----~~ 389 (488) T protein:vir:99 319 SFNLGPARWLTEW-NFPGA--QPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGV-EVESTQA-----EA 389 (488) T ss_pred HHHHHHHHHHHHh-CcCCc--CCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCC-CCccccc-----cc Confidence 774 477766664 43332 2346788888889999999999886 366 888888888764 2211100 00 Q ss_pred HHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 449 EENDPYANQADELNGKGVDDE 469 (469) Q Consensus 449 ~~~~~~~~~~~~~~~~~~~de 469 (469) ....+...... .....+.. 
T Consensus 390 ~~~~~~~~~~~--~~~~~~~~ 408 (488) T protein:vir:99 390 TAPTPSTEFAE--GDQPSDPA 408 (488) T ss_pred ccCCCcccCCC--CCCCCCch Confidence 00111111110 00011111 No 138 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.06 E-value=5.8e-06 Score=49.31 Aligned_cols=427 Identities=9% Similarity=0.027 Sum_probs=195.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ...+.+++..+.+..++..-..+.+.+.+|....- ..... .. ......++-.+-+...++..++.|+ T Consensus 9 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~------~~----~~~~~~~~~dst~~~a~~~Laa~l~ 75 (543) T protein:vir:88 9 LAEEGAKAVYERLKNDRVPYETRAENCAKVTIPSL---FPKDS------DN----SSTDYTTPWQAVGARGLNNLSAKVM 75 (543) T ss_pred chHHHHHHHHHHHHHHHhHHHHHHHHHHHHhcccc---CCCCC------Cc----ccccccccccchHHHHHHHHHHHHH Confidence 34556666667777776666677777777766421 11000 00 0111123455666667777776654 Q ss_pred cC--Cee----eccCch---------hhHHHHHHHH------------hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 81 SV--FPD----IDVGKD---------ADNKKILDVL------------GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 81 g~--p~~----~~~~~~---------~~~~~l~~~~------------~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) +- |.+ +...+. .....++.|+ ..||...+.++.++..++|.+.+++-.+.... 
T Consensus 76 ~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~ly~~~~~~~~ 155 (543) T protein:vir:88 76 LALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTALIYLPPPDASS 155 (543) T ss_pred HhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeccCcccc Confidence 31 221 112221 1112233333 23666778888999999999976543333222 Q ss_pred eE---EEEEccceeEEEEeCCCCCceEEEEEEEEeeecC-------------CceEEEEEEEEcCCeEEEEEeecCceee Q lcl|NC_010179. 134 FR---YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE-------------AGKYFTVHEYWTDKEAQFFRTSATDSTV 197 (469) Q Consensus 134 ~~---i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (469) ++ ++.++-.++++..|. .+++...+|.++..... .......+++|+. .+.....+... T Consensus 156 ~~~~~~~~~pl~~y~v~~d~--~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~----V~pr~~~~~~~ 229 (543) T protein:vir:88 156 NSYNPMKLYTLHNHVVQRDA--FGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTH----IYIDDESGDFL 229 (543) T ss_pred ceecceEEeEcceEEEeeCC--CCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEE----EEeecCCCccc Confidence 22 333433443333343 45677777766542110 0011122333321 11111111100 Q ss_pred cccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_010179. 198 IEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVIL 272 (469) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l 272 (469) . 
+....+ ..........++..+|++.++ ++.+|.|-.++..+-+..+|.+.-..........+|.+ T Consensus 230 ~--------~~~~~~-~~v~~~~~~~~~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~~~~pp~ 300 (543) T protein:vir:88 230 S--------YQEIEG-VEVDGSDGQYPQDALPWIAVRWTKRDGEHYGRSHVEEYLGDLNSLESLNEAMIKFAMISSKVVG 300 (543) T ss_pred c--------cccccC-eeeecCCCccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHhcCce Confidence 0 000000 011111122345667877665 34579999999999999999999999999999999987 Q ss_pred EEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccH Q lcl|NC_010179. 273 VLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASG 350 (469) Q Consensus 273 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg 350 (469) .+.-..... ...+...+.-.+..+ ..+++..+. ...+.......++.++..|-..-..-.+...+....|+ T Consensus 301 ~v~~~g~~~----~~~~~~~~~g~~v~g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~~~~r~TA 373 (543) T protein:vir:88 301 LVNPNGITQ----VRRLVKAQTGDFVAG---RKADIEFLQLEKTADFTVAKSVADAIEARLSYVFMLNSAVQRSGERVTA 373 (543) T ss_pred eeccccccc----hhhcccCCCceeecC---CCCcceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhccCCCCcccH Confidence 653211111 111111211111111 123344443 33467778888888888775432221222222333455 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcc----cCCCcccceEEeCCCCCC-CHHHHHHHHHH Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNF----SDADKRHISQHWTRTKVE-DSLTKAQIVST 417 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~----~~~~~~~i~i~f~~~~p~-d~~e~~~~~~k 417 (469) ..+.. ++.++...++..+.++ ++.++.++.. ...+...+++.+..++.. ...+.++.+.. 
T Consensus 374 tEV~~-------r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~l~~l~r~~~~~~l~~ 446 (543) T protein:vir:88 374 EEIRY-------VASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQEAVEPTVTTGAEALGRGQDLDKLTQ 446 (543) T ss_pred HHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeEEecHHHHHHHHHHHHHHH Confidence 54433 3344455555544442 2222233322 223334556666543221 22222222222 Q ss_pred H---hcc---------CChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 418 V---ANY---------SSKEAVAKAN---PIVD-----DWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 418 l---~g~---------iS~et~~~~l---~~v~-----d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) . .+. +....++..+ -+|+ -.++|++++++++++......+....+++-.-+. T Consensus 447 ~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~~~~~~~~~~~~~~~~ 518 (543) T protein:vir:88 447 FLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGGLNAAAGIGSGVAAQA 518 (543) T ss_pred HHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhchhhhh Confidence 1 122 2333334332 1231 1256777777665554433333222111111111 No 139 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.02 E-value=7e-06 Score=48.85 Aligned_cols=369 Identities=14% Similarity=0.066 Sum_probs=157.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |.+= ++.....+..-.....+..++.+.. .+..... 
..-+..+-....|+..+.-+- T Consensus 1 M~~f------~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~v~~---~~al~~~~V~~~v~~ia~~ia 57 (397) T protein:vir:38 1 MPLL------KLNKSHSQGFSLNDPDWVNFLTGGE--------------AQKYVSA---DTALKNSDIFSLIMQLSGDLA 57 (397) T ss_pred Ccch------hhhhcccCcccCCchhhhhhhcCCc--------------CCceech---HHhhccHHHHHHHHHHHHHHh Confidence 3321 0000000000000000111111100 0000000 000111112223444444443 Q ss_pred cCCeeeccCchhhHHHHHHHHhc-c-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD-D-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~-n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~ 154 (469) +-|+.. .+. .+..++.+ | . .+....+..+.+.+|.+|+.+-.+.+|++ .+.+++|..+-+..+.+. . T Consensus 58 ~~p~~~--~~~----~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~-~ 130 (397) T protein:vir:38 58 MVRYTS--ESD----RSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQDG-S 130 (397) T ss_pred hCcccc--ccc----HHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-c Confidence 434432 222 23334432 2 1 23345667788999999998888888876 588899999888765432 1 Q ss_pred ceEEEEEEEEeee-cCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 155 KLLGVLRSYKQLD-PEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 155 ~~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) .+. |.... ..+... ...+....+.+++... . T Consensus 131 ~~~-----y~~~~~~~~~~~---~~~~~~~eiih~~~~~----------------------------------------~ 162 (397) T protein:vir:38 131 GLI-----YNINFDEPAIGY---MENVPAADVIHIRLLS----------------------------------------K 162 (397) T ss_pred eEE-----EEEEeccccccc---eeEecCccEEEecCCC----------------------------------------C Confidence 111 11110 000000 0112222222221100 0 Q ss_pred cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh---hhhhh-------hcceeeecccCCC Q lcl|NC_010179. 
234 PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF---MNDLR-------EYKSIKINNAGNG 303 (469) Q Consensus 234 ~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~---~~~~~-------~~~~~~~~~~~~~ 303 (469) .....|.|.+..+...++....+..-..+.+...+.|-.+++-......+.. ...+. ..+.+.++ T Consensus 163 ~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~----- 237 (397) T protein:vir:38 163 NGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKGGLLDAETRIARSKEISKQIHNSDGPVVID----- 237 (397) T ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecC----- Confidence 0012478888888877777776666666777777777766654322221111 11111 11112121 Q ss_pred CCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 304 DKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNA-SGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (469) Q Consensus 304 ~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~-Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (469) .+++|..... ....+.+..+...+.|+..-++|+.-..+.++. |..+ .....+..+|.-++. T Consensus 238 --~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e-------------~~~~~~~~~l~P~~~ 302 (397) T protein:vir:38 238 --ALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSIT-------------QISGQYAKSLNRYVQ 302 (397) T ss_pred --CCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHH-------------HHHHHHHHHHHHHHH Confidence 2345544333 445567777888889998888887544332211 1111 011233445555555 Q ss_pred HHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCC---HHHHHHHHHHHHHHhh- Q lcl|NC_010179. 381 AIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDD---WQQELKDLAKDREEND- 452 (469) Q Consensus 381 ~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d---~~~E~eri~~E~~~~~- 452 (469) .+...++.+=....+ +.+...+-.|..+.++.+.++ +|+++.-++.+.++. ++. +..+..-......... 
T Consensus 303 ~ie~~ln~~l~~~~~--~~~~~~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~~~~~~~~~~~~~~~~~~ 380 (397) T protein:vir:38 303 AIVGELNDKLHANIS--ANIRFAIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKDLPDPEKEPQQAIQLIQQE 380 (397) T ss_pred HHHHHHHHhccChhc--ccccccccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccccccccccccccccccc Confidence 555444432222222 223333445788888888776 679999998887643 211 1111111111100000 Q ss_pred hhHhhcccCCCCCCCCC Q lcl|NC_010179. 453 PYANQADELNGKGVDDE 469 (469) Q Consensus 453 ~~~~~~~~~~~~~~~de 469 (469) ...+...+....+.|+| T Consensus 381 ~g~~~~~~~~e~~~~~~ 397 (397) T protein:vir:38 381 GGENDGNNSDERGSDPE 397 (397) T ss_pred cCCCCCCCCCCCCCCCC Confidence 00011111122222233 No 140 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.00 E-value=7.6e-06 Score=48.66 Aligned_cols=411 Identities=10% Similarity=0.044 Sum_probs=158.0 Q ss_pred CCHHHHHHHHHHHH-----HHHHHH---------HHHHHHHHHHhccCCcccccccchhhhcccc-cccccccCcc---- Q lcl|NC_010179. 1 MELDALKKLIRNTS-----TSRNDL---------INNYKKSVDYYENKTDITTRNNGKPKVSKEG-KKDPLRSADN---- 61 (469) Q Consensus 1 ~~~~~~~~~i~~~~-----~~~~~~---------~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~-~~~~~~~~~~---- 61 (469) |.|- +.+...+. .+|.+. --.-..+..+-.++......+.........+ ...+.-++.. T Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~~ 78 (547) T protein:vir:63 1 MGLF--ESIRLAGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHG 78 (547) T ss_pred Cchh--hhhhhhcCCccccccccccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHHH Confidence 4433 11111111 000000 0000112222222211110000000000000 0000000000 Q ss_pred ---e-eccchHHHHHHHHHHhhh--cC-----------Ceeec-------cCchhhHHHHHHHHhc----------cHHH Q lcl|NC_010179. 
62 ---R-IPSNFYQLLVDQEAGYIA--SV-----------FPDID-------VGKDADNKKILDVLGD----------DRAL 107 (469) Q Consensus 62 ---r-i~~n~~k~iv~~~~~~l~--g~-----------p~~~~-------~~~~~~~~~l~~~~~~----------n~~~ 107 (469) . ...+....+|+..+.-+. +. .+++. ..+....+.+.+++.. ++.. T Consensus 79 l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~~ 158 (547) T protein:vir:63 79 VLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSS 158 (547) T ss_pred HHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHHH Confidence 0 011333444444433221 11 11111 1112222344555432 1222 Q ss_pred HHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEE Q lcl|NC_010179. 108 TLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQ 186 (469) Q Consensus 108 ~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (469) .+..+..+.+.+|.+|+.+-++.+|++. +.+++|..+.++.+..... ....++++... .+... ..+....+. T Consensus 159 f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~-~~~~~~y~~~~--~~~~~----~~~~~~eii 231 (547) T protein:vir:63 159 FVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKI-PDNGNRFVQVI--DQKIV----ATFNAREMA 231 (547) T ss_pred HHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCcccc-ccCceEEEEEc--CCcEE----EEeccccEE Confidence 3345667889999999998889999875 8899999998886653210 11111221111 11110 112222222 Q ss_pred EEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 187 FFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDD 266 (469) Q Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~ 266 (469) +++.. |.........|.|-++.+...+.....+..-....+.. 
T Consensus 232 h~r~n-------------------------------------~~~~~~~~~~G~Spi~~~~~~i~~~~~a~~~~~~~f~N 274 (547) T protein:vir:63 232 FAVRN-------------------------------------PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSH 274 (547) T ss_pred Eeccc-------------------------------------CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 22110 00000001247777777777777666665556666666 Q ss_pred hcCce--eEEecCCcccc---hhhhhhhhh-------c-ceeeecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHH Q lcl|NC_010179. 267 VQTVI--LVLTNYGGASL---KQFMNDLRE-------Y-KSIKINNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNI 331 (469) Q Consensus 267 ~~~p~--l~~~g~~~~~~---~~~~~~~~~-------~-~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i 331 (469) .+.|- +.+.|...... ......+.. . ++..+. ..+++|....++ ...+.+..+...+.| T Consensus 275 g~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~G~~nagk~~vl~------~~g~~~~~l~~~~~d~qfle~~~~~~~~I 348 (547) T protein:vir:63 275 GGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS------AEDVKFVNMTPSARDMEFEKWLNYLINVI 348 (547) T ss_pred CCCcceEEEecCCCCCCHHHHHHHHHHHHHHhcCccccccccccc------CCCceEEEcCCChhHHHHHHHHHHHHHHH Confidence 66664 34444221111 112222211 1 111111 123555554443 345566677778888 Q ss_pred HHHhCCCCcCcccc--CC---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCC Q lcl|NC_010179. 332 FLFGQGIDPANFES--SN---ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRT 403 (469) Q Consensus 332 ~~~s~~p~~~~~~~--g~---~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~ 403 (469) +..-++|+.-..-. +. .++..+-. +... ......+..+|.-+++.|...++.. ... ..+.+.|... 
T Consensus 349 a~afgVPP~~lG~~~~~~~~~~~~~s~t~--sn~e---~~~~~~~~~tL~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~ 422 (547) T protein:vir:63 349 SALYGIDPAEINIPNNGGATGSKGGSLNE--GNSA---EKNQASKNKGLQPLLGFIEDFINKHIVAEFG-DKYTFQFVGG 422 (547) T ss_pred HHHhCCCHHHcCcccccccccccccccch--hhHH---HHHHHHHHHHHHHHHHHHHHHHHhhcccccC-CceEEEeecc Confidence 88888887432211 10 01111110 0000 1112334445555555444443321 111 3467888887 Q ss_pred CCCCHHHHHHHHHH-HhccCChHHHHHhCCC---CCCHHHHH-----HH----HHHHH-------HHhhhhHhhcccC-- Q lcl|NC_010179. 404 KVEDSLTKAQIVST-VANYSSKEAVAKANPI---VDDWQQEL-----KD----LAKDR-------EENDPYANQADEL-- 461 (469) Q Consensus 404 ~p~d~~e~~~~~~k-l~g~iS~et~~~~l~~---v~d~~~E~-----er----i~~E~-------~~~~~~~~~~~~~-- 461 (469) ...+.++.+..... .+|+++.-.+.++++. ++.-+.-+ .. .++++ +......+..... T Consensus 423 ~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 502 (547) T protein:vir:63 423 DIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVS 502 (547) T ss_pred ccccHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCCccccchhhccccccccCCCCC Confidence 77887777664433 3588998888887643 22111000 00 00000 0000000000000 Q ss_pred -----------CCCC-CCCC Q lcl|NC_010179. 462 -----------NGKG-VDDE 469 (469) Q Consensus 462 -----------~~~~-~~de 469 (469) ..++ .+|. T Consensus 503 ~~~~~~~~~~~~~~~~~~d~ 522 (547) T protein:vir:63 503 TDVEDIPDGKDTTGDIGKDG 522 (547) T ss_pred CCCCCCCCCcccCCCcCccc Confidence 0000 0000 No 141 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=97.91 E-value=1.1e-05 Score=47.66 Aligned_cols=387 Identities=14% Similarity=0.038 Sum_probs=176.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) +.....++-+...+. ...+..+.+.+.. +............ +.. .-+... .......-.+.+...-+. T Consensus 13 ~~~~~~~~~~~~~ia-------~~~~~~~~~~~~~-~~p~~~~il~~~~-~~~--~~y~~m-~~D~~i~s~l~~Rk~av~ 80 (491) T protein:vir:79 13 VKFGEPDKSLSSQIA-------TRARSIDFFALGM-YLPNPDPVLKALG-KDI--RVYREL-RADAHVGGCVRRRKAAVK 80 (491) T ss_pred ccccccchhHHHHHh-------hhccccccccccc-cCcchhHHHhhcc-CCH--HHHHHH-hhChHHHHHHHHHHHHHh Confidence 222222221111111 1111111111111 1111110000000 000 000000 124555556666677788 Q ss_pred cCCeeeccCc--hhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEE-EEEEcCCCce---EEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 81 SVFPDIDVGK--DADNKKILDVLGDDRALTLNSLLVDSSNAGRAWL-HYWIDEDNNF---RYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 81 g~p~~~~~~~--~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~---~i~~~~p~~~~~~~d~~~~~ 154 (469) |.+..+.+.+ +...+++.++++.-....+..-..++.-+|.++. ++|...+|.+ ++.+.+|+.+. |++.. T Consensus 81 ~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~--~d~~~-- 156 (491) T protein:vir:79 81 ALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEMLDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFV--YDPEN-- 156 (491) T ss_pred CCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHHHhhhhcceeEEEEEeecCCeeeEEeeeeeccccee--eccCC-- Confidence 8888887543 3445778888766333333333356888997654 5554445554 35555554432 33211 Q ss_pred ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 155 KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 155 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) .+. +...... .. +..-.+++.|-..+-. 
T Consensus 157 ~l~------------------------------l~~~~~~---------------------~~-g~~lp~~k~i~~~~~~ 184 (491) T protein:vir:79 157 QLR------------------------------FRSKEHW---------------------VQ-GEELPARKFLVPRQEA 184 (491) T ss_pred ceE------------------------------EeecCCC---------------------CC-ceeecCCCeEEEEecC Confidence 111 0000000 00 0000122223222221 Q ss_pred --CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh------hhhhhhcceeeecccCCCCCC Q lcl|NC_010179. 235 --KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF------MNDLREYKSIKINNAGNGDKS 306 (469) Q Consensus 235 --n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~ 306 (469) .++.|.|.+..+-...---+..+.+++.-++.++.|+++.+-..+...++. ...+.......++.+ . T Consensus 185 ~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~-----~ 259 (491) T protein:vir:79 185 TYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDAETNLLLDRLEDMVQDAVAVIPDD-----S 259 (491) T ss_pred CCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCeEEEecCC-----c Confidence 356788988888887777788889999999999999988774322221111 223444445555543 4 Q ss_pred cceEEeec---CCHHHHHHHHHHHHHHHHHHhCCCCcCccc-cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 307 GVDKLQID---IPVEARDDALKITRDNIFLFGQGIDPANFE-SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAI 382 (469) Q Consensus 307 ~~~~l~~~---~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i 382 (469) .+++++.. .+...++..++.+.+.|.+..-+-.++.++ .|...|..-. . 
-....++.-.+.+...+.++++-+ T Consensus 260 ~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~vh~-~--v~~~i~~~D~~~i~~tln~li~~l 336 (491) T protein:vir:79 260 SIEIKEAAGKSGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQAGL-E--VTDDIRDGDKAIVVEAMNMLIRWI 336 (491) T ss_pred eeEEEeccCCCCChhHHHHHHHHHHHHHHHHHhhhhhccCcccchhhHHHHH-H--HHHHHHHHHHHHHHHHHHHHHHHH Confidence 68888754 245678899999988888754332232222 2333333211 1 123334455667777888877777 Q ss_pred HHHhcccCCCcccceEEeCCCCCCCH-HHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc Q lcl|NC_010179. 383 MRYLNFSDADKRHISQHWTRTKVEDS-LTKAQIVSTVA--NY-SSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA 458 (469) Q Consensus 383 ~~~~~~~~~~~~~i~i~f~~~~p~d~-~e~~~~~~kl~--g~-iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~ 458 (469) +.+-. .+ ...+.+.|. -+.+. +..++.+.+++ |+ +|.+.+.+.++. +.+..+.+-.........+..... T Consensus 337 ~~~N~-~~--~~~p~f~~~--e~ee~~~~~a~~~~~L~~~G~~i~~~~~~e~~Gi-p~~~~~e~~~~~~~~~~~~~~~~~ 410 (491) T protein:vir:79 337 CDLNF-DG--AARPVFDMW--EQEQVDEIQAGRDEKLTRAGARFTPAYFKRAYNL-QDGDLDERPLPVSAVDAVGAASFA 410 (491) T ss_pred HHhcC-CC--CCcceEeec--CcCchhHHHHHHHHHHHhCCCccCHHHHHHHhCC-CCCCCCccccCcCccccccccccc Confidence 66532 22 222334443 34443 45678888774 66 888888888763 322211000000000000000000 Q ss_pred ccCC----------------------------------CCCCCCC Q lcl|NC_010179. 459 DELN----------------------------------GKGVDDE 469 (469) Q Consensus 459 ~~~~----------------------------------~~~~~de 469 (469) .... 
..+.-+| T Consensus 411 ~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~s~~e 455 (491) T protein:vir:79 411 EFEAPDQDALDAALNALSARDLNADAQALVAPLLKRIANGASADE 455 (491) T ss_pred ccCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHH Confidence 0000 0000111 No 142 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.85 E-value=1.5e-05 Score=47.01 Aligned_cols=385 Identities=14% Similarity=0.048 Sum_probs=179.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) .+....++-+...+.- .....++..+-. .+.+...-....+ + ...-+... .......-.+.+...-+. T Consensus 13 ~~~~~~~~~~~~~ia~-------~~~~~~~~~~~~-~~~~~~~iLr~~~-~--~~~~y~~m-~~D~~i~s~l~~Rk~av~ 80 (491) T protein:vir:10 13 VTFGEPDKSLSSQIAT-------RARSIDFFALGM-YLPNPDPVLKALG-K--DIRVYREL-RADAHVGGCVRRRKAAVK 80 (491) T ss_pred cCcccCChHHHHHHHh-------hhcccccccccC-CccchHHHHHhcC-C--CHHHHHHH-hhChHHHHHHHHHHHHHh Confidence 2222222211111110 001111111110 0000000000000 0 00000000 124555666667777788 Q ss_pred cCCeeeccC--chhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEE-EEEEcCCCceE---EEEEccceeEEEEeCCCC Q lcl|NC_010179. 81 SVFPDIDVG--KDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWL-HYWIDEDNNFR---YGIIQPDQITPVYATTLD 153 (469) Q Consensus 81 g~p~~~~~~--~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~~---i~~~~p~~~~~~~d~~~~ 153 (469) |.+..+.+. ++...+++.++++. ++.+.+.++ .++.-+|.++. ++|...+|.+. +.+++|+.+. |++.. 
T Consensus 81 ~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~-lda~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~--~d~~~- 156 (491) T protein:vir:10 81 ALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEM-LDAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFV--YDPEN- 156 (491) T ss_pred CCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHH-HHhhhhcceeEEEEEeecCCeeEEEEeeeeccccee--eccCC- Confidence 889888753 34456778888866 444445555 47888997654 55654445443 4444554332 33211 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) .+.. ... ++.. ....+ .+++.|-..+- T Consensus 157 -~l~~-----~~~--~~~~--~g~~l-------------------------------------------~~~k~i~~~~~ 183 (491) T protein:vir:10 157 -QLRF-----RSK--DHWM--QGEEL-------------------------------------------PARKFLVPRQE 183 (491) T ss_pred -ceEE-----ecC--CCCC--Cccee-------------------------------------------cCCCEEEEEec Confidence 1110 000 0000 00000 11122222211 Q ss_pred c--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh------hhhhhhcceeeecccCCCCC Q lcl|NC_010179. 234 P--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF------MNDLREYKSIKINNAGNGDK 305 (469) Q Consensus 234 ~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~ 305 (469) . .++.|.|.+..+-...-.-+..+.+++.-++.++.|+++.+-..+...++. ...+.......++.+ T Consensus 184 ~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~~a~~~ek~~l~~al~~~~~~a~~viP~~----- 258 (491) T protein:vir:10 184 ATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRSASDGEKNLLLDCLEDMVQDAVAVVPDD----- 258 (491) T ss_pred CCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCCCCCHHHHHHHHHHHHHHhcCcEEEecCC----- Confidence 1 256788999888888888888999999999999999988775332222111 233444555555544 Q ss_pred CcceEEeecC---CHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
306 SGVDKLQIDI---PVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA 381 (469) Q Consensus 306 ~~~~~l~~~~---~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 381 (469) ..+++++... +...++..++.+.+.|.+.--+-.++.++. |...|..-... ....++.-.+.+...+.++++- T Consensus 259 ~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlTt~~~gs~a~~~vh~~v---~~di~~~D~~~i~~tln~li~~ 335 (491) T protein:vir:10 259 SSIEIKEAAGKTGSADVYERLLHFCRGEVSIALLGQNQTTEATSTRASAQAGLEV---TDDIRDGDKAVVSEAMNMLIRW 335 (491) T ss_pred ceeEEEecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcccCcccchhHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH Confidence 4688987543 345788999999998887543323333332 22333221111 2223334456667778777776 Q ss_pred HHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHHh--cc-CChHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhc Q lcl|NC_010179. 382 IMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVA--NY-SSKEAVAKANPIVDDWQQELKDLAKDREENDPYANQA 458 (469) Q Consensus 382 i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~--g~-iS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~ 458 (469) ++.+- ..+.+ ...+.|... ..+.+..++.+.+++ |+ +|.+.+.+.++. +.+..+.+-.. .+.........+ T Consensus 336 l~~~N-~~~~~--~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~~~i~e~~Gi-p~~~~~~~~~~-~~~~~~~~~~~~ 409 (491) T protein:vir:10 336 ICDLN-FDGAD--RPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTPAYFKRAYNL-QDGDLDERPLP-VSAVDTVGAASF 409 (491) T ss_pred HHHhc-CCCCC--cceEEecCc-CchhHHHHHHHHHHHhCCCcCCHHHHHHHhCC-CCCCcCccccc-cCCCCCcccccc Confidence 66543 22322 345566543 233467788888874 66 888888888763 32221100000 000000000000 Q ss_pred ccCCCCCCCCC Q lcl|NC_010179. 459 DELNGKGVDDE 469 (469) Q Consensus 459 ~~~~~~~~~de 469 (469) .. 
......++ T Consensus 410 ~~-~~~~~~~~ 419 (491) T protein:vir:10 410 AE-FEAPDQDA 419 (491) T ss_pred cc-cCCCCCCc Confidence 00 01111111 No 143 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=97.74 E-value=2.3e-05 Score=45.99 Aligned_cols=392 Identities=10% Similarity=0.020 Sum_probs=167.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccc-cCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLR-SADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~-~~~~ri~~n~~k~iv~~~~~~l 79 (469) |-+ ++.+..+..... .....+.....+.-........ .+...... ....-+..+-....|+..++-+ T Consensus 1 MG~------f~~lf~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~--~g~~~~~~v~~~~al~~~~v~~ci~~ia~~i 68 (422) T protein:vir:13 1 MGF------LRGLFNKKNNND----EKRSNYDEDIGIDISDSNFWEK--FGIKLNFSVRGKRALKENTVYVCTKIRAESI 68 (422) T ss_pred Cch------hhhhhhccCCcc----chhhhhhhccccccCcchhhhh--ccccCCcccchhhhhccHHHHHHHHHHHHhh Confidence 332 122211110000 0000000000000000000000 00000000 0000012233334455666666 Q ss_pred hcCCeeeccCc-hhhHHHHHHHHhc--c-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCC Q lcl|NC_010179. 80 ASVFPDIDVGK-DADNKKILDVLGD--D-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATT 151 (469) Q Consensus 80 ~g~p~~~~~~~-~~~~~~l~~~~~~--n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~ 151 (469) -+-|+.+--.. +.....+..++.. | . 
.+....+..+.+.+|.+|+.+-.+..|++ .+.+++|..+.++.+.+ T Consensus 69 A~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~~~~~~~ 148 (422) T protein:vir:13 69 GKLSLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDNVTKIIDDD 148 (422) T ss_pred hhCceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCC Confidence 66676652222 2222234455542 3 2 23345677888999999999988888886 58899999999988754 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) .... ...-.+|...+..|... .+....+.+++.. . T Consensus 149 ~~~~-~~~~~~y~~~~~~g~~~-----~~~~~eiih~~~~------------------------------------~--- 183 (422) T protein:vir:13 149 NFLS-SLSKVWYVVTDKNGKEH-----KLLPDEMLHFIGD------------------------------------I--- 183 (422) T ss_pred ccee-ccceEEEEEEeCCCeEE-----EEcccceEEEcCC------------------------------------C--- Confidence 3111 01111122121121110 0111111111100 0 Q ss_pred EecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhh--------hcceeeeccc Q lcl|NC_010179. 232 EFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLR--------EYKSIKINNA 300 (469) Q Consensus 232 ~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~--------~~~~~~~~~~ 300 (469) ..+.-.|.|.++.+...++.......-..+.++..+.|-.+++-....+. ......+. ..+++.++. 
T Consensus 184 -~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~- 261 (422) T protein:vir:13 184 -TLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVGDLDEKAKKIFKKEFESMSNGLENAHSISLLPF- 261 (422) T ss_pred -CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhcCccccCCceecCC- Confidence 00112477888877777777666666666667777777666654222111 11111111 111222221 Q ss_pred CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 301 GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (469) Q Consensus 301 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (469) +.+++-++.......+.+..+.....|+..-++|+.-.....+.+...++.. ....+...|.-+++ T Consensus 262 ----g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~----------~~~f~~~~l~P~~~ 327 (422) T protein:vir:13 262 ----GYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQ----------QKDFYVTTLQSSLT 327 (422) T ss_pred ----CceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHHHH Confidence 1233334333334455566667788899988998754433222222222111 12233444555555 Q ss_pred HHHHHhcccC---CC-cccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHH Q lcl|NC_010179. 381 AIMRYLNFSD---AD-KRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKDREE 450 (469) Q Consensus 381 ~i~~~~~~~~---~~-~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E~~~ 450 (469) .|...++.+= .+ .....+.| ..-+..|.++.++.+.++ +|+++.-++.++++. +++-+.-+....-- . T Consensus 328 ~ie~~l~~~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~--~ 405 (422) T protein:vir:13 328 VYEQEIQDKLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRRENLPPVEGGDRLLVNGNMI--P 405 (422) T ss_pred HHHHHHHHhhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccCcc--c Confidence 5544443211 11 11234555 444556889999999887 579999888888754 22211111110000 0 Q ss_pred hhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
451 NDPYANQADELNGKGVDDE 469 (469) Q Consensus 451 ~~~~~~~~~~~~~~~~~de 469 (469) -+...++.. .+++...+ T Consensus 406 l~~~~~~~~--~~g~~~g~ 422 (422) T protein:vir:13 406 IEMAGEQYK--KGGEKGGK 422 (422) T ss_pred hhhcccccc--cCCCcCCC Confidence 000011111 11222222 No 144 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=97.68 E-value=2.9e-05 Score=45.45 Aligned_cols=376 Identities=8% Similarity=0.041 Sum_probs=161.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--CCccccc----ccchhhhcccccccccccCcce--eccchHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYEN--KTDITTR----NNGKPKVSKEGKKDPLRSADNR--IPSNFYQLLV 72 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g--~~~i~~~----~~~~~~~~~~~~~~~~~~~~~r--i~~n~~k~iv 72 (469) |. .+.+.+.. ....... ........ .+.....-..+.+ +...-....| T Consensus 1 M~-----------------------~~~~~f~~~~r~~~~~~~~~~~~~~~~~~-~g~~~~~~~v~~~~al~~~~v~~~i 56 (429) T protein:vir:10 1 MD-----------------------SVKKFFNFEKRQTSQVIELNKDDEKLLEW-LGISPSTISVKGKNALKVATVFACI 56 (429) T ss_pred Cc-----------------------hhhhhhcccccCcccccccCCChHHHHHH-hcCCCCcceechhhhhccHHHHHHH Confidence 21 11111110 0000000 00000000 0000000000000 1122223345 Q ss_pred HHHHHhhhcCCeeecc-C-c---hhhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEc Q lcl|NC_010179. 73 DQEAGYIASVFPDIDV-G-K---DADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQ 140 (469) Q Consensus 73 ~~~~~~l~g~p~~~~~-~-~---~~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 140 (469) +..+.-+-+-|+.+-- . + ......+..++.. 
| ..+....+..+.+.+|.+|+++-.+..|++ .+.+++ T Consensus 57 ~~ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 136 (429) T protein:vir:10 57 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 136 (429) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 5555555555655311 1 1 1122335555542 2 233345677788999999999999998886 588899 Q ss_pred cceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccc Q lcl|NC_010179. 141 PDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNT 220 (469) Q Consensus 141 p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (469) |..+.+..+.... +....+.|......|.. ..+.... T Consensus 137 ~~~v~v~~~~~~~--~~~~~~~~~~~~~~g~~-----~~~~~~e------------------------------------ 173 (429) T protein:vir:10 137 ASKVTVYIDDVGL--LNSKTKMWYVVNTGGQQ-----RVLKPEE------------------------------------ 173 (429) T ss_pred CceeEEEEcCccc--ccccceEEEEEccCCeE-----EEEcccc------------------------------------ Confidence 9998887764321 11111111111111110 0112222 Q ss_pred ccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhh-- Q lcl|NC_010179. 221 LKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLR-- 290 (469) Q Consensus 221 ~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~-- 290 (469) |+|++. ...|.|.++.+...++.......-....++..+.|-.+++.....+. ......+. 
T Consensus 174 ---------vih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~ 244 (429) T protein:vir:10 174 ---------ILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESM 244 (429) T ss_pred ---------EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHH Confidence 333331 22477777777777776666655566666776677666654221111 11111111 Q ss_pred ------hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHHH Q lcl|NC_010179. 291 ------EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLEL 362 (469) Q Consensus 291 ------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~~ 362 (469) ..+++.++. +.+++.+........+.+..+...+.|+..-++|+.-.... ++-|+ ++. T Consensus 245 ~~g~~n~~~~~vl~~-----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn--~e~------- 310 (429) T protein:vir:10 245 SSGLQNSHRIALMPV-----GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNN--IEQ------- 310 (429) T ss_pred hccccccCceeecCC-----CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHH------- Confidence 112222221 12333333322334455566777888999889987533222 22222 111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCC---C-cccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC- Q lcl|NC_010179. 363 KAAKTQTYFEHAINELVRAIMRYLNFSDA---D-KRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI- 433 (469) Q Consensus 363 k~~~~~~~~~~~l~~~~~~i~~~~~~~~~---~-~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~- 433 (469) .....+..+|.-+++.|...++.+=+ . .....+.| +.-+..|..+.++.+.++ +|+++.-.+.++++. T Consensus 311 ---~~~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~ 387 (429) T protein:vir:10 311 ---QQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLP 387 (429) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCC Confidence 11223344555555555554442211 1 11223444 455567899999999887 689999888888753 Q ss_pred -CCCHHHH-----HHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
434 -VDDWQQE-----LKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 434 -v~d~~~E-----~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.+.- +..+..=.+...+..+...+...++++.. T Consensus 388 p~~ggD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~~~~e~~ 429 (429) T protein:vir:10 388 PEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 429 (429) T ss_pred CCCCcCeeeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 2221111 11111000000000111111111111111 No 145 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=97.48 E-value=5.8e-05 Score=43.80 Aligned_cols=375 Identities=11% Similarity=0.038 Sum_probs=163.7 Q ss_pred CCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRND-LINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~-~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |-+ ++.+..++.. .......+.+.+.+-..- . .+..... ..-+..+-....|+..+.-+ T Consensus 1 Mg~------f~~lf~r~~~~~~~~~~~~~~~~~~~~~~--~---------~g~~v~~---~~al~~~~v~~~i~~Ia~~i 60 (414) T protein:vir:44 1 MVF------FSGLFQRKSDAPVTTPAELADAIGLSYDT--Y---------TGKQISS---QRAMRLTAVFSCVRVLAESV 60 (414) T ss_pred Cch------hhhhhccCccCcccchhhHhHhhccCccc--c---------CCceech---hhhhccHHHHHHHHHHHHHh Confidence 221 1111110000 000001111111110000 0 0000000 00011222334455555555 Q ss_pred hcCCeeeccCc-----hhhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEE Q lcl|NC_010179. 80 ASVFPDIDVGK-----DADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPV 147 (469) Q Consensus 80 ~g~p~~~~~~~-----~~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~ 147 (469) -+-|+.+--.+ ......+..++.. | ..+....+...++.+|.+|+++..+ .|++ .+.+++|..+.+. 
T Consensus 61 a~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~-~g~~~~L~~l~~~~v~~~ 139 (414) T protein:vir:44 61 GMLPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKA-FGEVAELLPVDPGCVVPK 139 (414) T ss_pred ccCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCceEEEE Confidence 55665542111 1112334444432 2 2233446677888999999988665 5766 5888999999888 Q ss_pred EeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCc Q lcl|NC_010179. 148 YATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGR 227 (469) Q Consensus 148 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 227 (469) ++.+. ++. |.....+|.. ..+....+.+++.- T Consensus 140 ~~~~~--~~~-----y~~~~~~g~~-----~~~~~~evih~~~~------------------------------------ 171 (414) T protein:vir:44 140 LNSSW--EPV-----YQVTFPDGST-----DVLSQEDIWHVRTL------------------------------------ 171 (414) T ss_pred ECCCC--cEE-----EEEEecCceE-----EEEccccEEEecCC------------------------------------ Confidence 77532 222 2222222211 11222333322110 Q ss_pred ccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhhh--------cceee Q lcl|NC_010179. 228 VPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLRE--------YKSIK 296 (469) Q Consensus 228 vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~~--------~~~~~ 296 (469) + .+...|.|-+.-+...++....+..-..+.+...+.|-.++......+. ......+.. .+++. T Consensus 172 -~----~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~v 246 (414) T protein:vir:44 172 -T----LDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQTLSDQAYERLKKDFEERHTGLGNAHRPMI 246 (414) T ss_pred -C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCccee Confidence 0 0112477777777777776666666666667777777666654322111 111111110 11222 Q ss_pred ecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
297 INNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 297 ~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) ++ ++++|.....+ ...+.+..+.....|+..-++|+.-....++.+...++.+. ...+..+ T Consensus 247 l~-------~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~----------~~~~~~~ 309 (414) T protein:vir:44 247 LE-------MGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG----------LGFINYS 309 (414) T ss_pred cC-------CCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHH Confidence 22 23444443333 33455566677788888888887533322222211111111 2233445 Q ss_pred HHHHHHHHHHHhccc---CCCcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHH Q lcl|NC_010179. 375 INELVRAIMRYLNFS---DADKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLA 445 (469) Q Consensus 375 l~~~~~~i~~~~~~~---~~~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~ 445 (469) |.-+++.|...++.+ ..+.....+.| ..-+..|.++.++++.++ +|+++.-++.+.++. ++.-+.-+-.. T Consensus 310 l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~- 388 (414) T protein:vir:44 310 LVPYLTRIEQRINTGLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYLTPM- 388 (414) T ss_pred HHHHHHHHHHHHHhhcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceecccc- Confidence 555555554444321 11222334455 455567899999999887 679999888888754 22211111000 Q ss_pred HHHHHhhhh-HhhcccCCCCCCCCC Q lcl|NC_010179. 446 KDREENDPY-ANQADELNGKGVDDE 469 (469) Q Consensus 446 ~E~~~~~~~-~~~~~~~~~~~~~de 469 (469) +-...+. 
..+..+.++.+..|| T Consensus 389 --n~~~~~~~~~~~~~~~~~~~~d~ 411 (414) T protein:vir:44 389 --NMTTKPSDGSKAGKQKDNANADE 411 (414) T ss_pred --cccccCCccccCCCCCCCCCCCC Confidence 0001111 112223333444444 No 146 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.47 E-value=6.1e-05 Score=43.70 Aligned_cols=400 Identities=12% Similarity=0.050 Sum_probs=148.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cC-Ccccccccchhhhccccccccccc----CcceeccchHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYE----NK-TDITTRNNGKPKVSKEGKKDPLRS----ADNRIPSNFYQLL 71 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~----g~-~~i~~~~~~~~~~~~~~~~~~~~~----~~~ri~~n~~k~i 71 (469) |+-+ +.-.++..+.. .++.+..+-|. |. + .|. ..... .+.+..... .-+| ....++.| T Consensus 1 ~~~~-~~~~~~~~~~~-----~~~~~~rd~l~~~~~glg~---~r~-~~~~~--~g~~~~~~~~~l~~~Yr-~~~ia~~i 67 (449) T protein:vir:10 1 MTDK-LTLAVNHALND-----ARMARARMGLMVPTMGLDN---KRH-SAWCE--YGFPELVTYENLYSLYR-RGGIAHGA 67 (449) T ss_pred Cchh-hHHHHhhhcch-----hHHHHHHHHHHHHHhcCCc---ccc-hhhhh--cCCcccCCHHHHHHHHh-cCchhHHH Confidence 7766 22222322222 22222222221 11 1 011 00110 111111110 0011 23456678 Q ss_pred HHHHHHhhh-cCCeeeccCchhhH-------HHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccce Q lcl|NC_010179. 72 VDQEAGYIA-SVFPDIDVGKDADN-------KKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQ 143 (469) Q Consensus 72 v~~~~~~l~-g~p~~~~~~~~~~~-------~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~ 143 (469) |+..++-.. ..|..+...+.+.. ..+.+++.......+.++.+++..+|.+++++-++ +|+..-..+++. 
T Consensus 68 Vd~~~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~~~~~~~l~ea~~~~rl~Gga~i~i~v~-d~~~l~~Pl~~~- 145 (449) T protein:vir:10 68 VEKLVGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFTNRLWRSFAEADRRRLVGRYAGILLHIR-DEKDWNLPATKG- 145 (449) T ss_pred HHhhhhhhhhcCcccccCccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhccCcEEEEEEec-CCCCCCcccccC- Confidence 888887654 22333322221111 12223333334456778888888899888877654 333221111111 Q ss_pred eEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccc-cccccc Q lcl|NC_010179. 144 ITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETG-QSNTLK 222 (469) Q Consensus 144 ~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 222 (469) ..+....-+|...- ...+++++- ....+.....|.......+. .....- T Consensus 146 ----------~~i~~i~v~~~~~i-------~~~~~~~dp-------------~sp~yg~P~~y~v~~~~~g~~~~~~~i 195 (449) T protein:vir:10 146 ----------RGLQKVSVSWAGSL-------KVAEWDTGI-------------NSKTYGQPKLWKYTERLPNGSSRRVDI 195 (449) T ss_pred ----------cceeeEEeeccccC-------ChhhhhcCC-------------CCCCCCCceEEEEeeeccCCCccceee Confidence 11111111111000 000000000 00000000111100000000 000112 Q ss_pred ccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHH-----HHHHHHHhcCc---eeEEecCCc---ccchh----hhh Q lcl|NC_010179. 223 HNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNG-----FINDLDDVQTV---ILVLTNYGG---ASLKQ----FMN 287 (469) Q Consensus 223 ~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~-----~~~~~~~~~~p---~l~~~g~~~---~~~~~----~~~ 287 (469) |.-..+.+...+ ..|.|.++.+-.-+-.++.+.-. +.+..+..... ..-+.|... ....+ +.. 
T Consensus 196 H~SRl~~~~~~~--~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~~e~~~~~~~~ 273 (449) T protein:vir:10 196 HPDRVFILGDYS--EDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVSIDELQDKFNE 273 (449) T ss_pred ccceeEeecCCC--CCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCCchHHHHHHHH Confidence 332222222111 12445554443222222222111 11111111100 001111110 01111 110 Q ss_pred hhh-hcc---eeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C-CccHHHHHHHHHH Q lcl|NC_010179. 288 DLR-EYK---SIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S-NASGVAIKMLYSH 359 (469) Q Consensus 288 ~~~-~~~---~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-~~Sg~Al~~~~~~ 359 (469) .+. -++ .+.+. .+. +|-+.+.+.......++.....+...+++|-.-..|. | |.+|. ++. |.. T Consensus 274 ~~~~~~~~~~~~~i~-----~~~--d~~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~D-~~n-yyd 344 (449) T protein:vir:10 274 VAGEINRGNDVLMTT-----QGA--TVTPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTED-QKY-FNA 344 (449) T ss_pred HHHHHhccchheeec-----CCc--ceEEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccchh-HHH-HHH Confidence 111 011 11111 112 3444556677778888888899999999997544432 2 22332 322 333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHH Q lcl|NC_010179. 360 LELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQ 439 (469) Q Consensus 360 l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~ 439 (469) .++.++..++..|++++.+++... ..+. ..+++|+|++-...+++|.|++..+.+...+..-.....+-+ ++ . 
T Consensus 345 ---~i~~~Q~~l~p~le~l~~~l~~s~-~g~~-~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~-~~-~ 417 (449) T protein:vir:10 345 ---RCQSRRVDLSFEIEDFCDKLIELK-IIDA-VAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAF-SR-E 417 (449) T ss_pred ---HHHHHHHhhhHHHHHHHHHHHHhh-cCCC-CCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCc-CH-H Confidence 334445568999999999877642 2222 247999999999999999998876654322211100011111 11 2 Q ss_pred HHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 440 ELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 440 E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) |+.... ...+.. ......++.+++ T Consensus 418 EiR~~~----~~~~~~--~~~~~~e~~de~ 441 (449) T protein:vir:10 418 EIRTAA----GYDNDD--EEPLGEEDGDEE 441 (449) T ss_pred HHHHHh----cccCCC--CCCCCCCCCccc Confidence 222111 111110 111111111111 No 147 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=97.39 E-value=7.7e-05 Score=43.13 Aligned_cols=411 Identities=9% Similarity=0.053 Sum_probs=159.1 Q ss_pred CCHHHHHHHHH-----HHHHHHHHH---------HHHHHHHHHHhccCCcccccccch-hhhcccccccccccCccee-- Q lcl|NC_010179. 1 MELDALKKLIR-----NTSTSRNDL---------INNYKKSVDYYENKTDITTRNNGK-PKVSKEGKKDPLRSADNRI-- 63 (469) Q Consensus 1 ~~~~~~~~~i~-----~~~~~~~~~---------~~~~~~~~~Yy~g~~~i~~~~~~~-~~~~~~~~~~~~~~~~~ri-- 63 (469) |.|- +.+.. ....+|.+. --.-+.+..+-.|+.....++... ..........+.-++...+ T Consensus 5 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l~~ 82 (551) T protein:vir:80 5 LGLF--ESIRLVGVNKSDAVKHIEVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLHG 82 (551) T ss_pred hhhH--HHhhhccCChhhcccccccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHHHH Confidence 2221 11110 000011000 001112333334433221111100 0000000001111111000 Q ss_pred ------ccchHHHHHHHHHHhhh--c---------CCeeecc---------CchhhHHHHHHHHhc-c---------HHH Q lcl|NC_010179. 
64 ------PSNFYQLLVDQEAGYIA--S---------VFPDIDV---------GKDADNKKILDVLGD-D---------RAL 107 (469) Q Consensus 64 ------~~n~~k~iv~~~~~~l~--g---------~p~~~~~---------~~~~~~~~l~~~~~~-n---------~~~ 107 (469) ..+....+|+..+.-+. + -+..+.. .+....+.+.+++.. | +.. T Consensus 83 ~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~~ 162 (551) T protein:vir:80 83 VLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFSS 162 (551) T ss_pred HHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHHH Confidence 01333344444443321 1 1112211 111222334555432 1 112 Q ss_pred HHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEE Q lcl|NC_010179. 108 TLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQ 186 (469) Q Consensus 108 ~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 186 (469) .+..+..+.+.+|.+|+.+-.+.+|++. +.+++|..+.++.+++.. .....++++... .+... ..+....+. T Consensus 163 f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~-~~~~~~~y~~~~--~g~~~----~~~~~~eii 235 (551) T protein:vir:80 163 FVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGK-IPDNGNRFVQVI--DQKIV----ATFNAREMA 235 (551) T ss_pred HHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccc-cccCceEEEEEe--CCcEE----EEEcccceE Confidence 2334567788999999988889999875 899999999888765321 111111222111 11111 012222232 Q ss_pred EEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 187 FFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDD 266 (469) Q Consensus 187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~ 266 (469) +++.. |.........|.|-++.+...+.....+..-..+.+.. 
T Consensus 236 H~~~n-------------------------------------~~~~~~~~~~G~spi~~a~~~i~~~~a~~~~~~~~f~N 278 (551) T protein:vir:80 236 FAVRN-------------------------------------PRSDIYATGYGYPELEIALKQFIAHENTEAFNDRFFSH 278 (551) T ss_pred Eeccc-------------------------------------CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 22210 00000001246777777777776666665556666666 Q ss_pred hcCceeE--EecCCcccc---hhhhhhhhhc--------ceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHH Q lcl|NC_010179. 267 VQTVILV--LTNYGGASL---KQFMNDLREY--------KSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNI 331 (469) Q Consensus 267 ~~~p~l~--~~g~~~~~~---~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i 331 (469) .+.|-.+ +.|...... ......+... ++..+. ..+++|..... ....+.+..+...+.| T Consensus 279 g~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~~nag~~~vl~------~~g~~~~~l~~~~~D~qfle~~~~~~~~I 352 (551) T protein:vir:80 279 GGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGINGSWQIPVVS------AEDVKFVNMTPSARDMEFEKWLNYLINVI 352 (551) T ss_pred CCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccCcccccc------CCCceEEEccCChhHHHHHHHHHHHHHHH Confidence 6667543 344221111 1112222110 111111 12355555444 3345666677788889 Q ss_pred HHHhCCCCcCcccc--C---CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCC Q lcl|NC_010179. 332 FLFGQGIDPANFES--S---NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRT 403 (469) Q Consensus 332 ~~~s~~p~~~~~~~--g---~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~ 403 (469) ...-++|+.-..-. + +..+..+-+ +... ......+..+|.-+++.|...++.. .. ...+.+.|... 
T Consensus 353 a~aFgVPp~~lG~~~~~~~~~~~~~s~t~--sn~e---~~~~~f~~~tL~P~~~~ie~~ln~~L~~~~-~~~~~f~f~~~ 426 (551) T protein:vir:80 353 SALYGIDPAEINIPNNGGATGSKGGSLNE--GNSA---EKNQASKNKGLQPLLGFIEDFINKHIVAEF-GDKYTFQFVGG 426 (551) T ss_pred HHHhcCCHHHcCcccccccccccccccch--hhHH---HHHHHHHHHHHHHHHHHHHHHHHhhhcccc-CCceEEEeecc Confidence 88888886432210 0 001111100 0000 1112333344444444444333321 11 13467888877 Q ss_pred CCCCHHHHHHHHHHH-hccCChHHHHHhCCC---CCCHHHH------------H--HHHHHHHHHhh-h-hHh-----hc Q lcl|NC_010179. 404 KVEDSLTKAQIVSTV-ANYSSKEAVAKANPI---VDDWQQE------------L--KDLAKDREEND-P-YAN-----QA 458 (469) Q Consensus 404 ~p~d~~e~~~~~~kl-~g~iS~et~~~~l~~---v~d~~~E------------~--eri~~E~~~~~-~-~~~-----~~ 458 (469) ...+.++.+...... .|+++.-.+.++++. ++.-+.- . +....+++... . ..+ .. T Consensus 427 ~~~~~~~~~~~~~~~~~g~lT~NE~R~~~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 506 (551) T protein:vir:80 427 DIKSELESVKILAEKAKVAMTVNEVRKELNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVS 506 (551) T ss_pred ChhhHHHHHHHHHHHhcCCcCHHHHHHHhCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCC Confidence 777777776644332 588999888887753 2210100 0 00001100000 0 000 00 Q ss_pred ccCCC--------CCCCCC Q lcl|NC_010179. 459 DELNG--------KGVDDE 469 (469) Q Consensus 459 ~~~~~--------~~~~de 469 (469) .+.++ +++++. T Consensus 507 ~~~~~~p~~~~~~~~~~~~ 525 (551) T protein:vir:80 507 TDVEDIPDGKDTTGDIGKD 525 (551) T ss_pred CCCCCCCCccccCCCcccc Confidence 00000 000000 No 148 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=97.36 E-value=8.6e-05 Score=42.88 Aligned_cols=416 Identities=9% Similarity=0.000 Sum_probs=177.2 Q ss_pred CCHH---HHHHHH---HHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHH Q lcl|NC_010179. 
1 MELD---ALKKLI---RNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQ 74 (469) Q Consensus 1 ~~~~---~~~~~i---~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~ 74 (469) |+.+ +-+++. +.+..++..-..+.+.+.+|... .+-... ..+....|+-.+-+...++. T Consensus 1 ~~~~~~~e~~~l~~r~~~Lk~~R~~~e~~w~e~~~~~lP-----~~~~~~----------~~~~~~~~~~dstg~~a~~~ 65 (517) T protein:vir:10 1 MDMRFAGNKSKIPKLYEQLVGKRSPFLSRAENYSRFTLP-----YLMADV----------NDDLSSQNAWQDDGASATNF 65 (517) T ss_pred CcccccccHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcc-----ccccCC----------CCCccccccccchHHHHHHH Confidence 6666 444444 44444444334455555555543 111000 01112234556666667777 Q ss_pred HHHhhhcC--Cee-----eccCchh---------hHHHHHH-----------HH-hccHHHHHHHHHHHHHhCCeEEEEE Q lcl|NC_010179. 75 EAGYIASV--FPD-----IDVGKDA---------DNKKILD-----------VL-GDDRALTLNSLLVDSSNAGRAWLHY 126 (469) Q Consensus 75 ~~~~l~g~--p~~-----~~~~~~~---------~~~~l~~-----------~~-~~n~~~~~~~~~~~~~~~G~~~~~v 126 (469) .++.|.+- ||. +...+.. ....++. .+ ..||...+.++.++...+|.+.+ T Consensus 66 LAa~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l-- 143 (517) T protein:vir:10 66 LSNKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMM-- 143 (517) T ss_pred HHHHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE-- Confidence 77665431 222 2222211 1111222 22 23667778889999999999854 Q ss_pred EEcCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC-----Cc-----------eEEEEEEEEcCCeEEEEEe Q lcl|NC_010179. 127 WIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE-----AG-----------KYFTVHEYWTDKEAQFFRT 190 (469) Q Consensus 127 ~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~-----~~-----------~~~~~~~~~~~~~~~~~~~ 190 (469) |.++ +...++.++-.++++.-|. .+++...+|..+..... +. .....+++|+. .+.. 
T Consensus 144 y~~~-~~~~~~~~pl~~y~v~~d~--~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~ 216 (517) T protein:vir:10 144 YHPD-KTSPIQAVPLHHYCVRRDN--NGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTH----AKRT 216 (517) T ss_pred EEeC-CCCcEEEEEcCeEEEeeCC--CcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEE----EEEe Confidence 4544 3334556655665554444 34565555554431110 00 00111222221 0011 Q ss_pred ecCceeecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 191 SATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLD 265 (469) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~ 265 (469) ..+... .+....+..... ....+|..+|++.++ .+.+|.|-.++..+-+..+|.+.-....... T Consensus 217 -~~~~~~--------~~~~~d~~~~~~--~s~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~~~~~~ 285 (517) T protein:vir:10 217 -KDGKYL--------IRQSADDVPVGK--ESTVTEDKSPFLILTWKRSYGEDYGRGMAEDHAGAFFVIQFLSEALARGMA 285 (517) T ss_pred -CCCceE--------EEEEeCceeecc--ccccccccCCeeeeeeeecCCCCcccchHHHhHHHHHHHHHHHHHHHHHHH Confidence 111000 011111111111 112235667777665 3457999899999999999988777777777 Q ss_pred HhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc Q lcl|NC_010179. 266 DVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANF 343 (469) Q Consensus 266 ~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 343 (469) ....|.+.+.-..... ...+.....-.+.++ ...++..+. ...+.......++.++..|-..-..-.+... 
T Consensus 286 ~a~~~~~lv~~~~~~~----~~~l~~~~~g~~~~g---~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~~ 358 (517) T protein:vir:10 286 LMADVKYLVKPGSYTD----INQFVEGGSGAVLHG---VEGDIHIVQLGKYADYTPIQAVLNDYRQRIGRVFMMEAMTRR 358 (517) T ss_pred HhccCCcccCcccccc----hhhccCCCccccccC---CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhhhcc Confidence 7776655442101111 111111111111111 122344443 3335666777777777766654322111111 Q ss_pred ccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcccCCCcccceEEeCCCCCC-CHHHHHHH Q lcl|NC_010179. 344 ESSNASGVAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNFSDADKRHISQHWTRTKVE-DSLTKAQI 414 (469) Q Consensus 344 ~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~~~~~~~~i~i~f~~~~p~-d~~e~~~~ 414 (469) +....|+..+. .++.+++..++..+.++ +..++..+.. ......+++.+..++.. ...+.++. T Consensus 359 ~~~rvTAtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~l~~-~l~~~~v~~~~~s~la~l~r~~~~~~ 430 (517) T protein:vir:10 359 DAERVTAYEIQ-------RDAMLVEQSLGGVYSLFATTFQGPLARWFMNGISS-ILTSKNVSPTILTGIEALGRMAELDK 430 (517) T ss_pred CCccccHHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhh-hcCCCCccceeeccHHHHHHHHHHHH Confidence 22234554443 34456666677665552 2222222211 11112334444333221 11112222 Q ss_pred HHH-------Hhc-------cCChHHHHHhC---CCCCC----HHHHHHHHHHHHHHhhhh--------------Hhhcc Q lcl|NC_010179. 415 VST-------VAN-------YSSKEAVAKAN---PIVDD----WQQELKDLAKDREENDPY--------------ANQAD 459 (469) Q Consensus 415 ~~k-------l~g-------~iS~et~~~~l---~~v~d----~~~E~eri~~E~~~~~~~--------------~~~~~ 459 (469) +.. ++. .+....++..+ -+++- .++|+++.++++.+.... ....+ T Consensus 431 i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~~~~~~~~~~~~ag~~~~~~~~~~~ 510 (517) T protein:vir:10 431 LGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQEQEATKYAAEQAGKAIPDMVKNGQ 510 (517) T ss_pred HHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Confidence 221 111 12223333322 11221 235555554443332211 11223 Q ss_pred cCCCCCC Q lcl|NC_010179. 460 ELNGKGV 466 (469) Q Consensus 460 ~~~~~~~ 466 (469) ..++++. 
T Consensus 511 ~~~~~~~ 517 (517) T protein:vir:10 511 INPQGGQ 517 (517) T ss_pred CCCCCCC Confidence 4444444 No 149 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=97.35 E-value=8.9e-05 Score=42.80 Aligned_cols=425 Identities=9% Similarity=0.062 Sum_probs=180.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |. +.+++..+.+..++..-..+.+.+.+|.... +-.. .+ ......+.++..+-+...++..++.|+ T Consensus 1 m~-~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~-----~~~~------~~--~~~~~~~~~~~dst~~~a~~~Laa~l~ 66 (555) T protein:vir:17 1 MK-HSAQAKYMMLRADREDYLDSGRQSARLTLPY-----ILTD------EG--HVQGGYLPTPWQSVGSKGVNVLASKLM 66 (555) T ss_pred Ch-hHHHHHHHHHHHHhhHHHHHHHHHHHHhccc-----ccCC------CC--CcccccccccccccHHHHHHHHHHHHH Confidence 43 2244555555555554455566666665431 1100 00 011122335666777777887777665 Q ss_pred cC--Cee-----eccCch---------hhHHHHHH-----------HH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCC Q lcl|NC_010179. 81 SV--FPD-----IDVGKD---------ADNKKILD-----------VL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDN 132 (469) Q Consensus 81 g~--p~~-----~~~~~~---------~~~~~l~~-----------~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~ 132 (469) +- ||. +...+. .....++. .+ ..||...+.++.++..++|.+. 
+|.++++ T Consensus 67 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--ly~~~~~ 144 (555) T protein:vir:17 67 LSLFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNAL--LYQGKKN 144 (555) T ss_pred HhhcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEE--EEecCCc Confidence 32 222 222221 11222222 22 2466677888899999999986 4566653 Q ss_pred ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC-----Cce------------------EEEE----EEEEcCCeE Q lcl|NC_010179. 133 NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE-----AGK------------------YFTV----HEYWTDKEA 185 (469) Q Consensus 133 ~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~-----~~~------------------~~~~----~~~~~~~~~ 185 (469) +++++-.++++..|. .+++..++|.++..... +.. ...+ .+.+....+ T Consensus 145 ---~~~~pl~~y~v~~d~--~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 219 (555) T protein:vir:17 145 ---LKLYPLDRFVVSRDG--EGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNAVGEDGPKMGVTAPGGRDKGKSNDA 219 (555) T ss_pred ---eeEEEcCeEEEeeCC--CcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhhccccchhhhhhhhcccccCCCcce Confidence 344444555444443 45666677665532211 000 0000 000001111 Q ss_pred EEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 186 QFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGF 260 (469) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~ 260 (469) ..|......... ..|-...............+|..+|++.++ ++.+|.|-.++..+-+..+|.+.-.. 
T Consensus 220 ~v~t~~~~~~~~-------~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~ 292 (555) T protein:vir:17 220 LVYTYVCRKDGQ-------VKWHQECDGKVIPGSNSSAPYTHNPWIPLRFNIVDGEAYGRGRVEEFMGDLKSLEALSQAM 292 (555) T ss_pred eEeecccccCCe-------eEEEEecCceeccccccccCcccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHH Confidence 111100000000 000000000000001124567778888765 34579999999999999999998889 Q ss_pred HHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEeec--CCHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_010179. 261 INDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQID--IPVEARDDALKITRDNIFLFGQGI 338 (469) Q Consensus 261 ~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p 338 (469) ........+|.+.+.-..... ...+...+.-.+..+ ...+++.+... .+.......++.++..|-..-.. T Consensus 293 l~~~~~~~~pp~lv~~~g~~~----~~~l~~~~~g~v~~g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~~I~~aFm~- 364 (555) T protein:vir:17 293 VEGSAASAKVVFMVSPSATTK----PQNLALAANGAIIQG---RPDDVSVVQANKAADFRTVLEMIQKLEQRISDAFLM- 364 (555) T ss_pred HHHHHHHhCCceeeccccccC----cceeecCCCceeecC---CcccceeeeccccchhhHHHHHHHHHHHHHHHHHhh- Confidence 999999999986653111111 111111111111111 12335554432 35666677777776666443221 Q ss_pred CcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhccc----CCCcccceEEeCCCCCC Q lcl|NC_010179. 339 DPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNFS----DADKRHISQHWTRTKVE 406 (469) Q Consensus 339 ~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~~----~~~~~~i~i~f~~~~p~ 406 (469) .+..+....|+..+.. ++.++...++..+.++ ++-++.++... ..+..-+.+.+.-++.. 
T Consensus 365 -~~~~d~~r~TAtEV~~-------r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i~~~l~~ 436 (555) T protein:vir:17 365 -LQVRQSERTTATEVQA-------TVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTVVAGLWG 436 (555) T ss_pred -cCCCCcccchHHHHHH-------HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccceeehHHH Confidence 1122233445544332 3344455555544443 22222222222 22222223333222211 Q ss_pred -CHHHHH----HHHHHHhcc---------CChHHHHHh----CCCCC----CHHHHHHHHHHHHHHhhhh---HhhcccC Q lcl|NC_010179. 407 -DSLTKA----QIVSTVANY---------SSKEAVAKA----NPIVD----DWQQELKDLAKDREENDPY---ANQADEL 461 (469) Q Consensus 407 -d~~e~~----~~~~kl~g~---------iS~et~~~~----l~~v~----d~~~E~eri~~E~~~~~~~---~~~~~~~ 461 (469) ...+.+ +.++.++++ +....++.. +|... ..++|++++++++++.... .++.... T Consensus 437 l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~~~qa~~~ 516 (555) T protein:vir:17 437 VGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASLINQAGQL 516 (555) T ss_pred HHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 011111 122222222 333333332 33201 2345666665443332211 1111111 Q ss_pred CCCC---------CCCC Q lcl|NC_010179. 462 NGKG---------VDDE 469 (469) Q Consensus 462 ~~~~---------~~de 469 (469) .+.. ..++ T Consensus 517 ~~~~~~~~~~~~~~~~~ 533 (555) T protein:vir:17 517 AKTPMAEQAMQLIQQQQ 533 (555) T ss_pred HhhhhhhhHHhccccch Confidence 1111 1111 No 150 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=97.31 E-value=9.9e-05 Score=42.54 Aligned_cols=370 Identities=9% Similarity=0.029 Sum_probs=160.3 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCc--ceeccchHHHHHHHHHHhhhcCCee Q lcl|NC_010179. 
9 LI-RNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSAD--NRIPSNFYQLLVDQEAGYIASVFPD 85 (469) Q Consensus 9 ~i-~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~--~ri~~n~~k~iv~~~~~~l~g~p~~ 85 (469) |+ ++....+.. .+. ..... .....+........+ +-+...-....|+..+.-+.+-|+. T Consensus 1 m~f~~~~~~~~~----------------~~~-~~~~~-~~~~~g~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~ 62 (409) T protein:vir:10 1 MLFRKGFKNQSQ----------------EIS-IDDKK-ILEWLGINPSETYVNGKSCLKQATVFGCIRILSDNISKLPIK 62 (409) T ss_pred CcccccccCcCC----------------CCC-CChHH-HHHHhcCCcCcceechhhhhccHHHHHHHHHHHHhhhhCceE Confidence 11 000000000 000 00000 000000000000000 0011222233445555555555654 Q ss_pred ec-cCc---hhhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 86 ID-VGK---DADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 86 ~~-~~~---~~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~ 154 (469) +- ..+ ......+..++.. | ..+....+..+.+.+|.+|+++-.+..|++ .+.+++|..+-++.++.... T Consensus 63 ~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~ 142 (409) T protein:vir:10 63 IYQKKDGIKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLL 142 (409) T ss_pred EEEecCCeeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCccc Confidence 41 111 1112234444432 3 223345677788999999999988999886 48889999988887653211 Q ss_pred ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 155 KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 155 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) ....-+. |......|.. 
..+....+ +|++ T Consensus 143 ~~~~~~~-y~~~~~~g~~-----~~~~~~ev---------------------------------------------ih~r 171 (409) T protein:vir:10 143 NSENNVW-YLYTDDLGQR-----HKFMSDEI---------------------------------------------LHFK 171 (409) T ss_pred cccceEE-EEEEeCCcee-----EEeccccE---------------------------------------------EEec Confidence 1100011 1111111110 01111222 2222 Q ss_pred C----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCccc---chhhhhhhh--------hcceeeecc Q lcl|NC_010179. 235 K----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGAS---LKQFMNDLR--------EYKSIKINN 299 (469) Q Consensus 235 n----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~---~~~~~~~~~--------~~~~~~~~~ 299 (469) + ...|.|.++.+...++....+.......++..+.|-.+++.....+ .+.....+. ..+++.++. T Consensus 172 ~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~ 251 (409) T protein:vir:10 172 GLTADGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAGDLNPEAEEVFKENFERMSSGLKNAHRIAMLPI 251 (409) T ss_pred CcCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCHHHHHHHHHHHHHHhccccccCCceecCC Confidence 1 1247777777777777766666666666777777766665422111 111111111 111222221 Q ss_pred cCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELV 379 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 379 (469) +.+++-+..+.....+.+..+...+.|+..-++|+.-....++.++..++.. 
....+..+|.-++ T Consensus 252 -----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~----------~~~f~~~~l~P~~ 316 (409) T protein:vir:10 252 -----GYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQ----------NREFYIDTLQSIL 316 (409) T ss_pred -----CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHH----------HHHHHHHHHHHHH Confidence 1233433333334455666777888999999998754432222222222211 1233344555555 Q ss_pred HHHHHHhccc-----CC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHH Q lcl|NC_010179. 380 RAIMRYLNFS-----DA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKDRE 449 (469) Q Consensus 380 ~~i~~~~~~~-----~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E~~ 449 (469) +.|...++.+ +. ....+++.+..-+-.|.++.++++.++ +|+++.-.+.+.++. +++-+.=+-... T Consensus 317 ~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~lgl~p~~ggD~~~~~~n---- 392 (409) T protein:vir:10 317 NMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELEEDEPLEGGDVLLINGN---- 392 (409) T ss_pred HHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccC---- Confidence 5554444321 11 112234444455567899999999887 689998888887753 232110000000 Q ss_pred HhhhhHhhcccCCCCCCCC Q lcl|NC_010179. 450 ENDPYANQADELNGKGVDD 468 (469) Q Consensus 450 ~~~~~~~~~~~~~~~~~~d 468 (469) ..+.....+. ..+|++. T Consensus 393 -~~~~~~~~~~-~~kgGe~ 409 (409) T protein:vir:10 393 -MIPVKMAGEQ-YSKGGEK 409 (409) T ss_pred -ccchhhcccc-ccccCCC Confidence 1111111111 1122111 No 151 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=97.31 E-value=9.9e-05 Score=42.53 Aligned_cols=373 Identities=14% Similarity=0.108 Sum_probs=156.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccC-cceeccchHHHHHHHHHHhh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSA-DNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~-~~ri~~n~~k~iv~~~~~~l 79 (469) |--+.|..-|+.-+.. + ....-+.+-++..... ........ +.-+..+-....|+..++-+ T Consensus 1 ~~~~~~~~~~k~~~~~------~--~~~~~~~~~~~~~~~~----------~~~~~~v~~~~a~~~~~v~~~i~~Ia~~i 62 (409) T protein:vir:94 1 MAKENIVTRIKKKLID------N--WIDQSASKLYDFSPWK----------NKSFWGVINNTLETNETIFSAITKLSNSM 62 (409) T ss_pred CcccccchhhhhHHhh------h--hhcCCccccccccccc----------CccccccchhhhhccHHHHHHHHHHHHhh Confidence 5555444433332100 0 0000011111100000 00000000 00011122233344444444 Q ss_pred hcCCeeeccCchhhHHHHHHHHhc--cH-H--HHH-HHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILDVLGD--DR-A--LTL-NSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTL 152 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~--n~-~--~~~-~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~ 152 (469) -.-|+.+--..+.....+..++.. |. + ..+ ..+...++.+|.+|+++..+.+|++ .+.+++|..+-++.++.. T Consensus 63 a~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~ 142 (409) T protein:vir:94 63 ASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS 142 (409) T ss_pred hhCceeEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEeCCC Confidence 455555432333333445555542 32 2 223 4567788999999999988888886 588899999888876532 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) ..+ +|......|... 
.+....+.+++.- ++ T Consensus 143 -~~~-----~y~~~~~~g~~~-----~~~~~dvih~r~~-------------------------------~~-------- 172 (409) T protein:vir:94 143 -REL-----YYSIHAATGNKL-----IVHNMDMLHFKHI-------------------------------VA-------- 172 (409) T ss_pred -cEE-----EEEEEcCCceEE-----EEccccEEEecCC-------------------------------CC-------- Confidence 111 122222122111 1222223322110 00 Q ss_pred ecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC-ceeEEecCCcccchhhhh----hhh-----hcceeeecccCC Q lcl|NC_010179. 233 FPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQT-VILVLTNYGGASLKQFMN----DLR-----EYKSIKINNAGN 302 (469) Q Consensus 233 ~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~-p~l~~~g~~~~~~~~~~~----~~~-----~~~~~~~~~~~~ 302 (469) .+.-.|.|.+..+...++..+.+... .+..+.. +-.++.... ...++... .+. ..+++.++ T Consensus 173 -~~~~~G~s~l~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~~-~l~~e~~~~~~~~~~~~~~~~g~~~vl~---- 243 (409) T protein:vir:94 173 -SNMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYGS-NVGKEKRQQVLEDFKQYYEENGGILFQE---- 243 (409) T ss_pred -CCccccccHHHHHHHHHHHHHHHHHH---HHHhcCCCCeeEEecCC-CCCHHHHHHHHHHHHHHhhcCCCeeecC---- Confidence 01123667666666656544433221 2333333 323332211 11111111 111 11222222 Q ss_pred CCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 303 GDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (469) Q Consensus 303 ~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (469) ++++|.....+ ...+.+..+....+|+..-++|+.-....++.+...++-.. 
...+..+|.-+++ T Consensus 244 ---~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~----------~~f~~~~l~P~~~ 310 (409) T protein:vir:94 244 ---PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN----------RFYLQHTLLPIVK 310 (409) T ss_pred ---CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHH Confidence 23455444433 33455556667788988888887544332222222222111 1233334444444 Q ss_pred HHHHHhccc---CCCc-ccceEEeC--CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-----HHHHHH Q lcl|NC_010179. 381 AIMRYLNFS---DADK-RHISQHWT--RTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ-----ELKDLA 445 (469) Q Consensus 381 ~i~~~~~~~---~~~~-~~i~i~f~--~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~-----E~eri~ 445 (469) .|...++.+ ..+. ....+.|+ .-+-.|.++.++++.++ +|+++.-++.+.++. +++-+. -+..+. T Consensus 311 ~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~ 390 (409) T protein:vir:94 311 QYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPID 390 (409) T ss_pred HHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeEeecccccccc Confidence 444443321 1111 12345554 44567889999999887 688988888777653 222110 000110 Q ss_pred HHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 446 KDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 446 ~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+...+....+++.+.+| T Consensus 391 ------~~~~~~~~~kGG~~n~~e 408 (409) T protein:vir:94 391 ------TPLELRKSLKGGDKNVNE 408 (409) T ss_pred ------cchhhcccccCCCCCcCC Confidence 011111111222233333 No 152 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.28 E-value=0.00011 Score=42.36 Aligned_cols=373 Identities=14% Similarity=0.111 Sum_probs=155.8 Q ss_pred CCHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 
1 MELDALKKLIRN-TSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~-~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |--+.+..-++. +..+..+ .--.+-.+ ....... ...+. . .+.-+...-....|+..++-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~---~~~~~~~-~~~~v----~-~~~~~~~~~V~~ci~~Ia~~i 62 (409) T protein:vir:93 1 MAKENIVTRIKKKLIDNWID---------QSTSKLYD---FSPWKNR-SFWGV----I-NNTLETNETIFSAITKLSNSM 62 (409) T ss_pred CCccchhhhhhhhhhhhhhc---------cccccccc---cccccCc-ccccc----c-hhhhhccHHHHHHHHHHHHhh Confidence 444433222211 1110000 00001010 0000000 00000 0 000011222333445555555 Q ss_pred hcCCeeeccCchhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTL 152 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~ 152 (469) -.-|+.+--..+.....+..++.. | .+ +....+..+++.+|.+|+++..+.+|++ .+.+++|..+-+..++.. T Consensus 63 a~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~~~~~~~~ 142 (409) T protein:vir:93 63 ASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS 142 (409) T ss_pred hhCceeEeeccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC Confidence 555665533333444455566542 3 22 2234667788999999999988888875 588899998888776532 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) ..+ +|......|... .+....+.+++... . 
T Consensus 143 -~~~-----~y~~~~~~g~~~-----~~~~~eVih~r~~~-------------------------------~-------- 172 (409) T protein:vir:93 143 -REL-----YYSIHAATGNKL-----IVHNMDMLHFKHIV-------------------------------A-------- 172 (409) T ss_pred -cEE-----EEEEEcCCceEE-----EEccccEEEeCCCC-------------------------------C-------- Confidence 111 122222122111 12223333221100 0 Q ss_pred ecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCc-eeEEecCCcccchhhhh----hh----h-hcceeeecccCC Q lcl|NC_010179. 233 FPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTV-ILVLTNYGGASLKQFMN----DL----R-EYKSIKINNAGN 302 (469) Q Consensus 233 ~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p-~l~~~g~~~~~~~~~~~----~~----~-~~~~~~~~~~~~ 302 (469) .+.-.|.|.++.+...++..+.+... .+..+..+ -.++.. +....++... .+ . ..+++.++ T Consensus 173 -~~~~~G~s~i~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~~~~~~g~~~vl~---- 243 (409) T protein:vir:93 173 -SNMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKY-GSNVGKEKRQQVLEDFKQYYEENGGILFQE---- 243 (409) T ss_pred -CCccccccHHHHHHHHHHHHHHHHHH---HHHhcCCCCceEEec-CCCCCHHHHHHHHHHHHHHhhcCCCeeecC---- Confidence 00113666666655555544332211 23333333 222222 1111111111 11 1 11222221 Q ss_pred CCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 303 GDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (469) Q Consensus 303 ~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (469) ++++|.....+ ...+.+..+.....|+..-++|+.-....++.+...++... ...+..+|.-+++ T Consensus 244 ---~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~----------~~f~~~~l~P~~~ 310 (409) T protein:vir:93 244 ---PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELN----------RFYLQHTLLPIVK 310 (409) T ss_pred ---CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHH----------HHHHHHHHHHHHH Confidence 23455444433 33455556667788999889987544332222221111111 1233344555555 Q ss_pred HHHHHhccc---CCCc-ccceEEeC--CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-----HHHHHH Q lcl|NC_010179. 
381 AIMRYLNFS---DADK-RHISQHWT--RTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ-----ELKDLA 445 (469) Q Consensus 381 ~i~~~~~~~---~~~~-~~i~i~f~--~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~-----E~eri~ 445 (469) .|...++.+ ..+. ....+.|+ .-+-.|.++.++++.++ +|+++.-++.+.++. ++.-+. -+..+. T Consensus 311 ~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~ggD~~~~~~n~~~~~ 390 (409) T protein:vir:93 311 QYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPID 390 (409) T ss_pred HHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccc Confidence 554433321 1111 12345553 44456889999998887 678999888888754 221110 011110 Q ss_pred HHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 446 KDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 446 ~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+...+....+++.+.+| T Consensus 391 ------~~~~~~~~~~gG~~n~~e 408 (409) T protein:vir:93 391 ------TPLELRKSLKGGDKNVNE 408 (409) T ss_pred ------cchhhcccccCCCCCcCC Confidence 011111111222233333 No 153 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=97.28 E-value=0.00011 Score=42.33 Aligned_cols=426 Identities=11% Similarity=0.066 Sum_probs=186.1 Q ss_pred CCH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHH Q lcl|NC_010179. 1 MEL---------DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLL 71 (469) Q Consensus 1 ~~~---------~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~i 71 (469) |.. +..++..+.+..++..-..+.+.+.+|....- ..... ........++..+-+... 
T Consensus 1 ~~~~~~~~~~~~~~~~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~----------~~~~~~~~~~~dst~~~a 67 (535) T protein:vir:94 1 MASSQKREGFAENGAKAVYDALKNDRNSYETRAENCAKYTIPSL---FPKDS----------DNASTDYTTPWQAVGARG 67 (535) T ss_pred CCchhhhhhHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---CCCCC----------CccccccCCcccccHHHH Confidence 333 33444455555554444555566666654310 00000 001112234556666777 Q ss_pred HHHHHHhhhcC--Cee----eccCc---------hhhHHHHHHHH------------hccHHHHHHHHHHHHHhCCeEEE Q lcl|NC_010179. 72 VDQEAGYIASV--FPD----IDVGK---------DADNKKILDVL------------GDDRALTLNSLLVDSSNAGRAWL 124 (469) Q Consensus 72 v~~~~~~l~g~--p~~----~~~~~---------~~~~~~l~~~~------------~~n~~~~~~~~~~~~~~~G~~~~ 124 (469) ++..++.|++- |++ +...+ ......+++|+ ..||...+.++.++..++|.+.+ T Consensus 68 ~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l 147 (535) T protein:vir:94 68 LNNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALL 147 (535) T ss_pred HHHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeE Confidence 77777665431 221 11121 11112233333 24666778888999999999976 Q ss_pred EEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC-------------CceEEEEEEEEcCCeEEEEEee Q lcl|NC_010179. 125 HYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE-------------AGKYFTVHEYWTDKEAQFFRTS 191 (469) Q Consensus 125 ~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~ 191 (469) ++-.+.....+++.++-.++++.-|. .+++...+|.++..... .......+++|+. .+... T Consensus 148 ~~~~~~~~~~~f~~~pl~~y~v~~d~--~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~~ 221 (535) T protein:vir:94 148 YIPEPEGTYNPMKLYRLSSYVVQRDA--FGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTH----IYLDE 221 (535) T ss_pred eeccCcCcccceEEEEcCeEEEeeCC--CCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEE----EEeeC Confidence 55444433456677766665554443 45676777665542110 0011122233321 01111 Q ss_pred c-CceeecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
192 A-TDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLD 265 (469) Q Consensus 192 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~ 265 (469) . ..+.+ +....+.... ......+|..+|++.++ .+.+|+|-.++..+-+..+|.+.-....... T Consensus 222 ~~~~~~~---------~~e~~g~~~~-~~~~~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~ 291 (535) T protein:vir:94 222 ESGEYLK---------YEEIDGVEVE-GTDASYPVDACPYIPVRMVRIDGESYGRSYCEEYLGDLRSLENLQEAIVKMSM 291 (535) T ss_pred CCCcEEE---------EEEecCeeec-cccccCccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 11110 0001111110 01123467788888775 3457999999999999999988777777777 Q ss_pred HhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEE--eecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc Q lcl|NC_010179. 266 DVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKL--QIDIPVEARDDALKITRDNIFLFGQGIDPANF 343 (469) Q Consensus 266 ~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~ 343 (469) ....|.+.+.-.+..+.... .. ...+.+ ++ + ...++..+ ....+.......++.++..|...-..-.+... T Consensus 292 ~a~~~~~lv~p~g~~~~~~~-~~-~~~g~~-v~-g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~ 364 (535) T protein:vir:94 292 ISAKVIGLVNPAGITQVRRL-TK-AQTGDF-VS-G---RPEDISFLQLEKAADFSVARAVSEQIEGRLSYAFMLNSAVQR 364 (535) T ss_pred HhccCCcccccccccchhhc-cc-CCCcee-ec-C---CcccceeeecccccchhHHHHHHHHHHHHHHHHHhHhhhccC Confidence 77776644421011111100 00 011111 11 1 11233333 33346677777777777777543322122222 Q ss_pred ccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcc----cCCCcccceEEeCCCCCC-CHHH Q lcl|NC_010179. 344 ESSNASGVAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNF----SDADKRHISQHWTRTKVE-DSLT 410 (469) Q Consensus 344 ~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~----~~~~~~~i~i~f~~~~p~-d~~e 410 (469) +....|+..+.. ++.++...++..+.++ ++.++.++.. ...+..-+++.+..++.. 
...+ T Consensus 365 d~~rvTAtEV~~-------r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~v~~~~vs~la~l~r~~ 437 (535) T protein:vir:94 365 TGERVTAEEIRY-------VASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEAVEPTISTGMEALGRGQ 437 (535) T ss_pred CCCCccHHHHHH-------HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhhccceEeehHHHHHHHH Confidence 233345654433 2334445555544432 2222333322 222333355555444332 1111 Q ss_pred HHHHH----HHHhcc--------CChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHhhhhHh---hc-ccCC-CCC Q lcl|NC_010179. 411 KAQIV----STVANY--------SSKEAVAKAN---PIVD-----DWQQELKDLAKDREENDPYAN---QA-DELN-GKG 465 (469) Q Consensus 411 ~~~~~----~kl~g~--------iS~et~~~~l---~~v~-----d~~~E~eri~~E~~~~~~~~~---~~-~~~~-~~~ 465 (469) .++.+ +.++++ +....++..+ -+|+ -.++|++++.+++++...... +. +... ... T Consensus 438 ~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~~~~~g~~~~~~~~ 517 (535) T protein:vir:94 438 DLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNAAASAGAGAGTMAT 517 (535) T ss_pred HHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 12211 122222 2333333332 1232 124566665555443332111 11 1111 112 Q ss_pred CCCC Q lcl|NC_010179. 466 VDDE 469 (469) Q Consensus 466 ~~de 469 (469) .+-+ T Consensus 518 ~~~~ 521 (535) T protein:vir:94 518 ASPE 521 (535) T ss_pred cChH Confidence 2222 No 154 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=97.26 E-value=0.00011 Score=42.19 Aligned_cols=381 Identities=8% Similarity=0.024 Sum_probs=165.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--C---cc-cccccchhhhcccccccccccCcce--eccchHHHHH Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENK--T---DI-TTRNNGKPKVSKEGKKDPLRSADNR--IPSNFYQLLV 72 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~--~---~i-~~~~~~~~~~~~~~~~~~~~~~~~r--i~~n~~k~iv 72 (469) |.|= .| +..++... . .+ ................... ..+.+ +.++-....| T Consensus 1 M~~~-----------------~r---~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~-~v~~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIV-----------------DS---VKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTI-SVKGKNALKVATVFACI 59 (432) T ss_pred CChH-----------------HH---HHHhcCccccCcccccccCCchHHHHHHhCCCcCcc-ccchhhhhccHHHHHHH Confidence 3332 11 11111100 0 00 0000000000000000000 00000 1122222345 Q ss_pred HHHHHhhhcCCeeecc--Cc---hhhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEc Q lcl|NC_010179. 73 DQEAGYIASVFPDIDV--GK---DADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQ 140 (469) Q Consensus 73 ~~~~~~l~g~p~~~~~--~~---~~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 140 (469) +..++-+-+-|+.+-- ++ ......+..++.. | ..+....+....+.+|.+|+++-.+..|++ .+.+++ T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 139 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 139 (432) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 5555555556665421 11 1123345555542 2 223345677788999999999999988986 588899 Q ss_pred cceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccc Q lcl|NC_010179. 141 PDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNT 220 (469) Q Consensus 141 p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (469) |..+.+..++... +..-.+.|......|. .. .+.... 
T Consensus 140 ~~~v~v~~d~~~~--~~~~~~~~y~~~~~g~-~~----~~~~~e------------------------------------ 176 (432) T protein:vir:10 140 ASKVTVYIDDVGL--LNSKTKMWYVVNTGGQ-QR----VLKPEE------------------------------------ 176 (432) T ss_pred CceeEEEEcCccc--ccccceEEEEEecCCe-EE----EEcccc------------------------------------ Confidence 9998887764321 1110111111111111 00 112222 Q ss_pred ccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch---hhhhhhh-- Q lcl|NC_010179. 221 LKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK---QFMNDLR-- 290 (469) Q Consensus 221 ~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~---~~~~~~~-- 290 (469) |+|+++ .-.|.|.+..+...++....+..-....+...+.|-.+++.....+.+ .....+. T Consensus 177 ---------iih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~ 247 (432) T protein:vir:10 177 ---------ILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESM 247 (432) T ss_pred ---------EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHH Confidence 233321 124777787777777777666666666677777787666542221111 1111111 Q ss_pred ------hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 ------EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKA 364 (469) Q Consensus 291 ------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~ 364 (469) ..+++.++. +.+++.+..+.....+....+...+.|+..-++|+.-....+..+...++. T Consensus 248 ~~g~~n~~~~~vl~~-----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~--------- 313 (432) T protein:vir:10 248 SSGLQNSHRIALMPV-----GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ--------- 313 (432) T ss_pred hcccccCCcceecCC-----CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH--------- Confidence 112222221 123444433333344556667778899998899875443222111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccc-----CC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--C Q lcl|NC_010179. 
365 AKTQTYFEHAINELVRAIMRYLNFS-----DA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--V 434 (469) Q Consensus 365 ~~~~~~~~~~l~~~~~~i~~~~~~~-----~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v 434 (469) .....+..+|.-+++.|...++.+ +. ....+++.++.-+..|.++.++++.++ +|+++.-.+.+.+++ + T Consensus 314 -~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi 392 (432) T protein:vir:10 314 -QQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE 392 (432) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 112233445555555555544421 11 112234444555677999999999887 578999888887754 2 Q ss_pred CCHHHH-----HHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 435 DDWQQE-----LKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 435 ~d~~~E-----~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++-+.- +..+..-.+...+..+...+...++++.- T Consensus 393 ~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 393 AGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 221110 11111000000111111111111111111 No 155 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=97.26 E-value=0.00011 Score=42.19 Aligned_cols=381 Identities=8% Similarity=0.024 Sum_probs=165.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--C---cc-cccccchhhhcccccccccccCcce--eccchHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENK--T---DI-TTRNNGKPKVSKEGKKDPLRSADNR--IPSNFYQLLV 72 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~--~---~i-~~~~~~~~~~~~~~~~~~~~~~~~r--i~~n~~k~iv 72 (469) |.|= .| +..++... . .+ ................... 
..+.+ +.++-....| T Consensus 1 M~~~-----------------~r---~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~-~v~~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIV-----------------DS---VKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTI-SVKGKNALKVATVFACI 59 (432) T ss_pred CChH-----------------HH---HHHhcCccccCcccccccCCchHHHHHHhCCCcCcc-ccchhhhhccHHHHHHH Confidence 3332 11 11111100 0 00 0000000000000000000 00000 1122222345 Q ss_pred HHHHHhhhcCCeeecc--Cc---hhhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEc Q lcl|NC_010179. 73 DQEAGYIASVFPDIDV--GK---DADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQ 140 (469) Q Consensus 73 ~~~~~~l~g~p~~~~~--~~---~~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 140 (469) +..++-+-+-|+.+-- ++ ......+..++.. | ..+....+....+.+|.+|+++-.+..|++ .+.+++ T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 139 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 139 (432) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 5555555556665421 11 1123345555542 2 223345677788999999999999988986 588899 Q ss_pred cceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccc Q lcl|NC_010179. 141 PDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNT 220 (469) Q Consensus 141 p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (469) |..+.+..++... +..-.+.|......|. .. .+.... T Consensus 140 ~~~v~v~~d~~~~--~~~~~~~~y~~~~~g~-~~----~~~~~e------------------------------------ 176 (432) T protein:vir:10 140 ASKVTVYIDDVGL--LNSKTKMWYVVNTGGQ-QR----VLKPEE------------------------------------ 176 (432) T ss_pred CceeEEEEcCccc--ccccceEEEEEecCCe-EE----EEcccc------------------------------------ Confidence 9998887764321 1110111111111111 00 112222 Q ss_pred ccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch---hhhhhhh-- Q lcl|NC_010179. 
221 LKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK---QFMNDLR-- 290 (469) Q Consensus 221 ~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~---~~~~~~~-- 290 (469) |+|+++ .-.|.|.+..+...++....+..-....+...+.|-.+++.....+.+ .....+. T Consensus 177 ---------iih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~ 247 (432) T protein:vir:10 177 ---------ILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESM 247 (432) T ss_pred ---------EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHH Confidence 233321 124777787777777777666666666677777787666542221111 1111111 Q ss_pred ------hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 ------EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKA 364 (469) Q Consensus 291 ------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~ 364 (469) ..+++.++. +.+++.+..+.....+....+...+.|+..-++|+.-....+..+...++. T Consensus 248 ~~g~~n~~~~~vl~~-----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~--------- 313 (432) T protein:vir:10 248 SSGLQNSHRIALMPV-----GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ--------- 313 (432) T ss_pred hcccccCCcceecCC-----CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH--------- Confidence 112222221 123444433333344556667778899998899875443222111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccc-----CC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--C Q lcl|NC_010179. 365 AKTQTYFEHAINELVRAIMRYLNFS-----DA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--V 434 (469) Q Consensus 365 ~~~~~~~~~~l~~~~~~i~~~~~~~-----~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v 434 (469) .....+..+|.-+++.|...++.+ +. 
....+++.++.-+..|.++.++++.++ +|+++.-.+.+.+++ + T Consensus 314 -~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi 392 (432) T protein:vir:10 314 -QQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE 392 (432) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 112233445555555555544421 11 112234444555677999999999887 578999888887754 2 Q ss_pred CCHHHH-----HHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 435 DDWQQE-----LKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 435 ~d~~~E-----~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++-+.- +..+..-.+...+..+...+...++++.- T Consensus 393 ~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 393 AGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 221110 11111000000111111111111111111 No 156 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=97.26 E-value=0.00011 Score=42.19 Aligned_cols=381 Identities=8% Similarity=0.024 Sum_probs=165.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--C---cc-cccccchhhhcccccccccccCcce--eccchHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENK--T---DI-TTRNNGKPKVSKEGKKDPLRSADNR--IPSNFYQLLV 72 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~--~---~i-~~~~~~~~~~~~~~~~~~~~~~~~r--i~~n~~k~iv 72 (469) |.|= .| +..++... . .+ ................... 
..+.+ +.++-....| T Consensus 1 M~~~-----------------~r---~~~~~~~~~r~~~~~~~~~~~~~~~~~~~g~~~~~~-~v~~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIV-----------------DS---VKKFFNFEKRQTSQVIELNKDDEKLLEWLGISPSTI-SVKGKNALKVATVFACI 59 (432) T ss_pred CChH-----------------HH---HHHhcCccccCcccccccCCchHHHHHHhCCCcCcc-ccchhhhhccHHHHHHH Confidence 3332 11 11111100 0 00 0000000000000000000 00000 1122222345 Q ss_pred HHHHHhhhcCCeeecc--Cc---hhhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEc Q lcl|NC_010179. 73 DQEAGYIASVFPDIDV--GK---DADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQ 140 (469) Q Consensus 73 ~~~~~~l~g~p~~~~~--~~---~~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~ 140 (469) +..++-+-+-|+.+-- ++ ......+..++.. | ..+....+....+.+|.+|+++-.+..|++ .+.+++ T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 139 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 139 (432) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 5555555556665421 11 1123345555542 2 223345677788999999999999988986 588899 Q ss_pred cceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccc Q lcl|NC_010179. 141 PDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNT 220 (469) Q Consensus 141 p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 220 (469) |..+.+..++... +..-.+.|......|. .. .+.... T Consensus 140 ~~~v~v~~d~~~~--~~~~~~~~y~~~~~g~-~~----~~~~~e------------------------------------ 176 (432) T protein:vir:10 140 ASKVTVYIDDVGL--LNSKTKMWYVVNTGGQ-QR----VLKPEE------------------------------------ 176 (432) T ss_pred CceeEEEEcCccc--ccccceEEEEEecCCe-EE----EEcccc------------------------------------ Confidence 9998887764321 1110111111111111 00 112222 Q ss_pred ccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch---hhhhhhh-- Q lcl|NC_010179. 
221 LKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK---QFMNDLR-- 290 (469) Q Consensus 221 ~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~---~~~~~~~-- 290 (469) |+|+++ .-.|.|.+..+...++....+..-....+...+.|-.+++.....+.+ .....+. T Consensus 177 ---------iih~r~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~ 247 (432) T protein:vir:10 177 ---------ILHFKNGITLDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVGDLNEDAKKVFRENFESM 247 (432) T ss_pred ---------EEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHH Confidence 233321 124777787777777777666666666677777787666542221111 1111111 Q ss_pred ------hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 ------EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKA 364 (469) Q Consensus 291 ------~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~ 364 (469) ..+++.++. +.+++.+..+.....+....+...+.|+..-++|+.-....+..+...++. T Consensus 248 ~~g~~n~~~~~vl~~-----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~--------- 313 (432) T protein:vir:10 248 SSGLQNSHRIALMPV-----GYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQ--------- 313 (432) T ss_pred hcccccCCcceecCC-----CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH--------- Confidence 112222221 123444433333344556667778899998899875443222111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccc-----CC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--C Q lcl|NC_010179. 365 AKTQTYFEHAINELVRAIMRYLNFS-----DA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--V 434 (469) Q Consensus 365 ~~~~~~~~~~l~~~~~~i~~~~~~~-----~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v 434 (469) .....+..+|.-+++.|...++.+ +. 
....+++.++.-+..|.++.++++.++ +|+++.-.+.+.+++ + T Consensus 314 -~~~~~~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~pi 392 (432) T protein:vir:10 314 -QQQQFYTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEARSKEDLPPE 392 (432) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 112233445555555555544421 11 112234444555677999999999887 578999888887754 2 Q ss_pred CCHHHH-----HHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 435 DDWQQE-----LKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 435 ~d~~~E-----~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++-+.- +..+..-.+...+..+...+...++++.- T Consensus 393 ~ggD~~~~~~n~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 393 AGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred CCCCeEeecccccchhhccccccCCCCCCCCCCCCCCCCC Confidence 221110 11111000000111111111111111111 No 157 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=97.25 E-value=0.00012 Score=42.13 Aligned_cols=371 Identities=12% Similarity=0.084 Sum_probs=164.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+= .++...++...+-........... .+.... 
.+.+-+..+-....|+..++-+- T Consensus 1 MG~~--------------------~~~~~~~~~~~~~~~~~~~~~~~~-~g~~~~--~~~~al~~~~V~~~v~~Ia~~iA 57 (411) T protein:vir:81 1 MGWW--------------------SRLTRFFRPRNETVDMTNPLLLQW-LGVDPD--TPRNQLSEATYFACLKILSESLG 57 (411) T ss_pred CchH--------------------HHHHhhccCcccccccchHHHHHH-hcCccc--ChhhhhccHHHHHHHHHHHHhHh Confidence 3332 112222222211000000000000 000000 00111112223345566666666 Q ss_pred cCCeeec--cCc---hhhHHHHHHHHhc--cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEE Q lcl|NC_010179. 81 SVFPDID--VGK---DADNKKILDVLGD--DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVY 148 (469) Q Consensus 81 g~p~~~~--~~~---~~~~~~l~~~~~~--n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~ 148 (469) +-|+.+- .++ +.....+..++.. |. .+....+...++.+|.+|+++..+. |++ .+.+++|..+-++. T Consensus 58 ~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~-g~~~~l~~l~~~~v~~~~ 136 (411) T protein:vir:81 58 KLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSG-PQLQALWILPSQYVTIVV 136 (411) T ss_pred hCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CceEEEEEECCceEEEEE Confidence 6666542 111 1122334555542 32 2334456778899999999887774 554 48889999999888 Q ss_pred eCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcc Q lcl|NC_010179. 149 ATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRV 228 (469) Q Consensus 149 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 228 (469) ++........ ..+|......+.... .+.... T Consensus 137 ~~~~~~~~~~-~~~~~~~~~~~g~~~----~~~~~e-------------------------------------------- 167 (411) T protein:vir:81 137 DDRGLLGEKN-AIWYRYNDPYDGKMY----VFRNDE-------------------------------------------- 167 (411) T ss_pred cCcccccccc-eEEEEEEecCCceEE----EEcccc-------------------------------------------- Confidence 7532111001 111111111111100 112222 Q ss_pred cEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhhh--------c Q lcl|NC_010179. 
229 PFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLRE--------Y 292 (469) Q Consensus 229 Pvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~~--------~ 292 (469) |+||+. .-.|.|.+.-+...++.......-..+.+...+.|-.+++....... ......+.. . T Consensus 168 -iih~k~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~n~g 246 (411) T protein:vir:81 168 -ILHFKTSVTFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTGDLNQEARDRLVKGFEQFANGSKNAG 246 (411) T ss_pred -EEEEcCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCHHHHHHHHHHHHHHhcCccccC Confidence 233321 12467777777777777666666666666777778777655322111 111111111 1 Q ss_pred ceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 293 KSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTY 370 (469) Q Consensus 293 ~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~ 370 (469) +++.++ .+.+|..... ....+.+..+...+.|+..-++|+.-.....+.+-..++. ..... T Consensus 247 ~~~vl~-------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~----------~~~~f 309 (411) T protein:vir:81 247 KIIPVP-------LGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEA----------QNLAF 309 (411) T ss_pred CceecC-------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHH----------HHHHH Confidence 122221 2234443333 3345556667778899998899875332221111111110 11233 Q ss_pred HHHHHHHHHHHHHHHhcccC-----C-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCH--HHH Q lcl|NC_010179. 371 FEHAINELVRAIMRYLNFSD-----A-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDW--QQE 440 (469) Q Consensus 371 ~~~~l~~~~~~i~~~~~~~~-----~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~--~~E 440 (469) +..+|.-+++.+...++.+= . 
....+++.++.-+-.|..+.++++.++ +|+++.-.+.+.++.-..+ +.- T Consensus 310 ~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~~~gl~p~~ggD~~ 389 (411) T protein:vir:81 310 YVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARDYLDMPADDYGNNL 389 (411) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCee Confidence 44555555555555444321 1 112344444555667899999999887 6899998888887642211 111 Q ss_pred HHHHHHHHHHhhhhHhhcccCCCCCCCC Q lcl|NC_010179. 441 LKDLAKDREENDPYANQADELNGKGVDD 468 (469) Q Consensus 441 ~eri~~E~~~~~~~~~~~~~~~~~~~~d 468 (469) +-... ..|.....+. ..+|+|. T Consensus 390 ~~~~n-----~~pl~~~~~~-~~kgGd~ 411 (411) T protein:vir:81 390 MANGN-----YIPLSMLGAN-YGKGGDS 411 (411) T ss_pred eeccC-----ccchhhhhhh-hccCCCC Confidence 10000 0111110111 1122222 No 158 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=97.23 E-value=0.00012 Score=42.05 Aligned_cols=394 Identities=9% Similarity=0.024 Sum_probs=157.5 Q ss_pred CCHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHH Q lcl|NC_010179. 1 MELDALKKL-----------IRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQ 69 (469) Q Consensus 1 ~~~~~~~~~-----------i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k 69 (469) |-|-+.-.. .++.....-.|.. ......-+ |.+++...+. .+...... .-..++.. T Consensus 34 ~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g-~~~~~~~~-g~~~~~epp~-d~~~l~~l----------~~~np~V~ 100 (648) T protein:vir:79 34 MQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIG-LAIMDGGG-GGRDFEEPEF-DFNEITSA----------YNTEGYVR 100 (648) T ss_pred cccCCCccccCCCCcccccccccchhHHHHHhH-HHHHhhcC-CccccccCCc-CHHHHHHH----------HhcChHHH Confidence 111111000 0111000011110 00000111 3333322211 11100000 00235556 Q ss_pred HHHHHHHHhhhcCCeeeccCchhhHHH--HHHH-Hhcc----HHHHHHHHHHHHHhCCeEEEEEEEcCCCceE------- Q lcl|NC_010179. 
70 LLVDQEAGYIASVFPDIDVGKDADNKK--ILDV-LGDD----RALTLNSLLVDSSNAGRAWLHYWIDEDNNFR------- 135 (469) Q Consensus 70 ~iv~~~~~~l~g~p~~~~~~~~~~~~~--l~~~-~~~n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~------- 135 (469) ..|+..+.-+.+-|..+...++...+. .... ..-| ....+..+..+.+.+|.+|+.+-.+.+|.+- T Consensus 101 ~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiRd~~G~~~~~l~~~~ 180 (648) T protein:vir:79 101 QAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSRAKDALPFQGMNVMG 180 (648) T ss_pred HHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCCCccchhhhhhh Confidence 677777777777777666544322111 1111 2222 2333456778889999999998888877421 Q ss_pred ---------EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccc Q lcl|NC_010179. 136 ---------YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITS 206 (469) Q Consensus 136 ---------i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (469) +..++|..+.+..+... .+ ..|......+.... .+.. T Consensus 181 ~~~~~~v~~l~pl~p~~v~v~~d~~g--~~----~~Y~y~~~g~~~~~----~~~~------------------------ 226 (648) T protein:vir:79 181 VGDSMPVAGYFPLNLASMKVKRDKFG--MI----KGWQQEQEGQDKPQ----KFKP------------------------ 226 (648) T ss_pred hccccceeeeEeecCceeEEEEcCCC--ce----eeeEEEecCCceeE----EecC------------------------ Confidence 22233333333322110 00 00100000000000 0011 Q ss_pred ccccccccccccccccccCCcccEEEecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEec-CCcc Q lcl|NC_010179. 207 YDLSAGYETGQSNTLKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTN-YGGA 280 (469) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g-~~~~ 280 (469) =.|+||+. ...|.|.+..+...|+....+.....+.++..+.|-.+++- .+.. 
T Consensus 227 ---------------------~dIIHik~~~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~~~~ 285 (648) T protein:vir:79 227 ---------------------EDIVHIYYKREKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGLEQE 285 (648) T ss_pred ---------------------ccEEEEccCCCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCcc Confidence 12444431 23578888777777776666666666667777888666652 1111 Q ss_pred cchhhhhhhh----hcceeeecccCCCCCCcceEEeec--CC--HHHHHHHHHHHHHHHHHHhCCCCcCcccc--CC-cc Q lcl|NC_010179. 281 SLKQFMNDLR----EYKSIKINNAGNGDKSGVDKLQID--IP--VEARDDALKITRDNIFLFGQGIDPANFES--SN-AS 349 (469) Q Consensus 281 ~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~l~~~--~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~-~S 349 (469) ..+.....++ ..+.+.+..+ ..+.+.+..+ .. .-.+.+..+...+.|...-++|+.-..-. ++ .+ T Consensus 286 ~~e~~k~~~e~~~~~~~~~~i~gg----~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~st 361 (648) T protein:vir:79 286 GFGAEEGEVDLVRGEVENMDVEGG----MVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRST 361 (648) T ss_pred chHHHHHHHHHHHHhccccccccc----ccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchH Confidence 1111111111 1111111111 1122222211 11 12345556777889999989997533211 22 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHH--HHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCCh Q lcl|NC_010179. 350 GVAIKMLYSHLELKAAKTQTYFEHAINELV-RAIM--RYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSK 424 (469) Q Consensus 350 g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~-~~i~--~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~ 424 (469) +.+....+... +.-.+..+...+...+ +.++ ..+...-.....+++.|+.-...|.+..++.+.++ +|++|. 
T Consensus 362 ae~~~~~~~~~---i~~l~~~i~~~le~~~~~~ll~e~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~~~~l~~~GilT~ 438 (648) T protein:vir:79 362 GDNLSSDFKDR---IKALQKVMATFINEFMVKEILMEGGFDPVLNPDDKVEFRFNEIDMDSKIKLENQAVFLYEHNAISE 438 (648) T ss_pred HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHhhhhhccccccccceEEEeecccchhhHHHHHHHHHHHHhCCCcCH Confidence 33333223211 1122222222222211 1111 11111001123467777777777888888888775 689999 Q ss_pred HHHHHhCCC--CCCHHH--HHH----HHHHHHHHh----hhhHhhcccCCCCCCCCC Q lcl|NC_010179. 425 EAVAKANPI--VDDWQQ--ELK----DLAKDREEN----DPYANQADELNGKGVDDE 469 (469) Q Consensus 425 et~~~~l~~--v~d~~~--E~e----ri~~E~~~~----~~~~~~~~~~~~~~~~de 469 (469) -++.++++. +++... .+. ....+..+. .+........++++.-.| T Consensus 439 NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e 495 (648) T protein:vir:79 439 DEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGDKKKKA 495 (648) T ss_pred HHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCccccccccc Confidence 999988754 332111 110 011111000 111100001111111111 No 159 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=97.23 E-value=0.00012 Score=42.02 Aligned_cols=376 Identities=13% Similarity=0.046 Sum_probs=156.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |.| ++++....... ..... +.+- ..... .. ...+.... 
.+.-+.++-....|+..++-+- T Consensus 1 Mgl------~~~~f~~~~~~-~~~~~----~~~~---~~~~~-~~--~~~g~~v~---~~~al~~~~v~~~v~~ia~~iA 60 (409) T protein:vir:84 1 MSL------FTRIFSGPSEE-RTLTK----ISGI---PSPAE-DW--AMHGDRPG---ANSAMTLGAFYACVTLLADTVA 60 (409) T ss_pred Cch------hhhhhcCCCcc-ccccc----cccc---ccccc-hh--hccCcccc---hhhhhccHHHHHHHHHHHHhhh Confidence 333 11111110000 00000 0000 00000 00 00000000 0000112233445666666665 Q ss_pred cCCeeeccCch---hhHHHHHHHHhc--c-H---HHHHHHHHHHHHhCCeEEEEE-EEcCCCce-EEEEEccceeEEEEe Q lcl|NC_010179. 81 SVFPDIDVGKD---ADNKKILDVLGD--D-R---ALTLNSLLVDSSNAGRAWLHY-WIDEDNNF-RYGIIQPDQITPVYA 149 (469) Q Consensus 81 g~p~~~~~~~~---~~~~~l~~~~~~--n-~---~~~~~~~~~~~~~~G~~~~~v-~~d~~~~~-~i~~~~p~~~~~~~d 149 (469) +-|+.+--.++ .....+.+++.. | . .+....+....+.+|.+|+++ +.+..|++ .+.+++|..+.+... T Consensus 61 ~lp~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~ 140 (409) T protein:vir:84 61 SLSIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIHPDCIHVTDA 140 (409) T ss_pred hCceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEcCceeEEEEc Confidence 65664321111 112234455532 2 2 233446677889999999876 45667775 488899998877654 Q ss_pred CCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_010179. 150 TTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVP 229 (469) Q Consensus 150 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 229 (469) ........ +.....+| . .++...+.+++.-. T Consensus 141 ~~~~~~~~-----~~~~~~~g-~------~~~~~dvih~~~~~------------------------------------- 171 (409) T protein:vir:84 141 KDEDGDWI-----EPVYRIDG-K------VVPNHRIMHIKRYP------------------------------------- 171 (409) T ss_pred CCCcceEE-----EEEecCCc-e------EEchhhEEEecCCC------------------------------------- Confidence 32221111 10011111 0 01222222221100 Q ss_pred EEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh--------h-cceeeeccc Q lcl|NC_010179. 
230 FIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR--------E-YKSIKINNA 300 (469) Q Consensus 230 vv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~--------~-~~~~~~~~~ 300 (469) ......|.|.++.+...++....+..-..+.++..+.|-.+++...... ++....++ . .+.+.++ T Consensus 172 ---~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~-~e~~~~~~~~~~~~~~n~g~~~vl~-- 245 (409) T protein:vir:84 172 ---VAGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDADLT-PDQVKQTQKQWIQSHHNRRLPAVMS-- 245 (409) T ss_pred ---CCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCCCC-HHHHHHHHHHHHHHhccCCCeeecC-- Confidence 0001247777777777777666665556666677777766665422211 11111111 1 1122222 Q ss_pred CCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 301 GNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFEHAIN 376 (469) Q Consensus 301 ~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 376 (469) ++.+|.....+ ...+.+..+...+.|+..-++|+.-... .++.++..++..... .+..+|. T Consensus 246 -----~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~----------f~~~~l~ 310 (409) T protein:vir:84 246 -----AGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGIN----------FVRHTLL 310 (409) T ss_pred -----CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH----------HHHHHHH Confidence 23455444443 3445555667788999988898753322 122222222222111 1122233 Q ss_pred HHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHH-----HHHHHHH Q lcl|NC_010179. 377 ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQE-----LKDLAKD 447 (469) Q Consensus 377 ~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E-----~eri~~E 447 (469) -.++.|...++.+=.....+++.++.-+-.|.++.++++.++ +|+++.-++.+.++. +++-+.- +..+.. 
T Consensus 311 P~~~~ie~~l~~~L~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~~ggD~~~~~~n~~~~~~- 389 (409) T protein:vir:84 311 PWLRCIEQALDTFLPRGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVRAWEDAPPIPEGDIHLQPMNFVPLGY- 389 (409) T ss_pred HHHHHHHHHHHHhccCCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccc- Confidence 333333332221111123355555666667999999999887 689998888887754 2221110 111110 Q ss_pred HHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 448 REENDPYANQADELNGKGVDDE 469 (469) Q Consensus 448 ~~~~~~~~~~~~~~~~~~~~de 469 (469) ....++. ...++ ++....++ T Consensus 390 ~~~~~~~-~~~~~-~~~~~gn~ 409 (409) T protein:vir:84 390 VPPEEPA-QEPQP-NSATEGNK 409 (409) T ss_pred CCccccC-cCCCC-CCccCCCC Confidence 0000111 00111 11111111 No 160 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=97.18 E-value=0.00014 Score=41.71 Aligned_cols=378 Identities=11% Similarity=0.033 Sum_probs=173.9 Q ss_pred HHHHHHHHHHHHH----HHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCC Q lcl|NC_010179. 8 KLIRNTSTSRNDL----INNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVF 83 (469) Q Consensus 8 ~~i~~~~~~~~~~----~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p 83 (469) -+++++..++... ..-...+.++|-|..... +.... ...-+..+.....|+..+.-+-+-| T Consensus 1 m~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~v~---~~~al~~~~v~~~i~~Ia~~ia~l~ 65 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHEDGFNNILLNMFGGRKTAS------------GERVS---ESNSLVQPDIFACVNVLSDDIAKLP 65 (416) T ss_pred CccchhcccccCccccCccchhHHHHhhcCccccc------------Cceec---hhhhhccHHHHHHHHHHHHhhhhCc Confidence 2233332222110 001122333443321000 00000 0001122333445666666666666 Q ss_pred eee-ccCch---h-hHHHHHHHHh-c-c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCC Q lcl|NC_010179. 
84 PDI-DVGKD---A-DNKKILDVLG-D-D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATT 151 (469) Q Consensus 84 ~~~-~~~~~---~-~~~~l~~~~~-~-n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~ 151 (469) +.+ ...++ . ....+..++. . | ..+....+..+.+.+|.+|+++-.+..|.+ .+.+++|..+-++.++. T Consensus 66 ~~~~~~~~~~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~ 145 (416) T protein:vir:12 66 IHTYKRTDGGIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPT 145 (416) T ss_pred eEEEEecCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCC Confidence 653 21111 1 1112333332 1 3 123345667788999999999988888876 48889999998877653 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) .. . .+|... .+|.. ++ +....+.+++.- + T Consensus 146 ~~-~-----~~~~~~-~~g~~----~~-~~~~eiih~~~~-------------------------------------~-- 174 (416) T protein:vir:12 146 TG-M-----LWYQTV-LNGKA----IE-LYDYEVLHFKGL-------------------------------------S-- 174 (416) T ss_pred Cc-E-----EEEEEe-cCCeE----EE-ecCccEEEecCc-------------------------------------C-- Confidence 31 1 122211 11211 11 222222222110 0 Q ss_pred EecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh--------hcceeeecccCCC Q lcl|NC_010179. 232 EFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR--------EYKSIKINNAGNG 303 (469) Q Consensus 232 ~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~--------~~~~~~~~~~~~~ 303 (469) .+...|.|.++.+...++....+..-..+.++..+.|-.++.-.. ...++....++ ..+++.++. 
T Consensus 175 --~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~-~~~~e~~~~~~~~~~~~~~~~~~~vl~~---- 247 (416) T protein:vir:12 175 --TDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA-FLDEKPKENVRKEWKRVNKVENIAIIDY---- 247 (416) T ss_pred --CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC-CCCHHHHHHHHHHHHHHhcCCCeeecCC---- Confidence 011247777777777777766666666677777777866665422 11122211111 122222221 Q ss_pred CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 304 DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIM 383 (469) Q Consensus 304 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~ 383 (469) +.+++.++.......+.+..+...+.|+..-++|+.-....++.+....+.. ....+..+|.-+++.|. T Consensus 248 -g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~----------~~~f~~~~l~P~~~~ie 316 (416) T protein:vir:12 248 -GLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQ----------SIEYVRNTLQPWIVNFE 316 (416) T ss_pred -CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHH----------HHHHHHHHHHHHHHHHH Confidence 2234444433334455666777788998888888754432222111111111 11233445555555555 Q ss_pred HHhcccCC------CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHH---HHHHHH Q lcl|NC_010179. 384 RYLNFSDA------DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDL---AKDREE 450 (469) Q Consensus 384 ~~~~~~~~------~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri---~~E~~~ 450 (469) ..++.+=. ....+++.++.-+..|.++.++.+.++ +|+++.-++.+.++. +++-+.-+... ..+... T Consensus 317 ~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~Pi~ggd~~~~~~n~~~~~~~~ 396 (416) T protein:vir:12 317 QELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELLERNPIENGDKYISSLNYVFLDFLE 396 (416) T ss_pred HHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccccccccccc Confidence 54443211 112344445555778999999999887 688999888887643 33221111000 000000 Q ss_pred hhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
451 NDPYANQADELNGKGVDDE 469 (469) Q Consensus 451 ~~~~~~~~~~~~~~~~~de 469 (469) ........+..++++..+| T Consensus 397 ~~~~~~~~~~~~gge~~~~ 415 (416) T protein:vir:12 397 EYQRLKAGGAMKGGDNKNE 415 (416) T ss_pred hhhccccccccCCCCCcCC Confidence 1111112223344444444 No 161 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.17 E-value=0.00014 Score=41.67 Aligned_cols=378 Identities=13% Similarity=0.098 Sum_probs=151.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC--cccccccchhhhcccccccccccCc-ceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKT--DITTRNNGKPKVSKEGKKDPLRSAD-NRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~--~i~~~~~~~~~~~~~~~~~~~~~~~-~ri~~n~~k~iv~~~~~ 77 (469) |--+.|..=++..+ +.. +.+.. ....... +.......... .-+..+-....|+..+. T Consensus 1 ~~~~~~~~~~k~~~---------~~~----~~~~~~~~~~~~~~-------~~~~~~~~v~~~~a~~~~~V~~ci~~ia~ 60 (409) T protein:vir:96 1 MAKENIVTRIKKKL---------IDN----WIDQSASKLYDFSP-------WKNKSFWGVINNTLETNETIFSAITKLSN 60 (409) T ss_pred CccccchhhhhhHH---------hhh----hhcccccccccccc-------ccCccccccchhhHhhhHHHHHHHHHHHH Confidence 33332211111100 001 11110 0000000 00000000000 00111222233444444 Q ss_pred hhhcCCeeeccCchhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeC Q lcl|NC_010179. 78 YIASVFPDIDVGKDADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYAT 150 (469) Q Consensus 78 ~l~g~p~~~~~~~~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~ 150 (469) -+-.-|+.+--..+.....+.+++.. 
| .+ +....+..+++.+|.+|+++-.+.+|++ .+.+++|..+-++.++ T Consensus 61 ~ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~ 140 (409) T protein:vir:96 61 SMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIEN 140 (409) T ss_pred hhhhCceEEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeC Confidence 44444555432333333445555542 3 22 2234567788999999999988888875 5888899988887765 Q ss_pred CCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_010179. 151 TLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF 230 (469) Q Consensus 151 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 230 (469) ... .+ +|......+... .+....+.+++.. + | T Consensus 141 ~~~-~~-----~y~~~~~~g~~~-----~~~~~evih~r~~-------------------------------~-----~- 172 (409) T protein:vir:96 141 QSR-EL-----YYSIHAATGNKL-----IVHNMDMLHFKHI-------------------------------V-----A- 172 (409) T ss_pred CCc-EE-----EEEEEcCCceEE-----EEccccEEEeCCC-------------------------------C-----C- Confidence 321 11 122221112110 1222222222100 0 0 Q ss_pred EEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCc-eeEEecCCcccchhhhhh----h----hh-cceeeeccc Q lcl|NC_010179. 231 IEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTV-ILVLTNYGGASLKQFMND----L----RE-YKSIKINNA 300 (469) Q Consensus 231 v~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p-~l~~~g~~~~~~~~~~~~----~----~~-~~~~~~~~~ 300 (469) .+.-.|.|.+..+...++..+.+.. . .+..++.+ -.++.- +....++.... + .. .+++.++. 
T Consensus 173 ---~~~~~G~s~l~~~~~~i~~~~~~~~-~--~~~~~~~~~~~i~~~-~~~l~~e~~~~~~~~~~~~~~n~g~~~vl~~- 244 (409) T protein:vir:96 173 ---SNMVQGISPIDVLKNTTDFDNAVRT-F--NLTEMQKPDSFMLKY-GSNVSTEKRQQVLEDFKQYYEENGGILFQEP- 244 (409) T ss_pred ---CCccccccHHHHHHHHHHHHHHHHH-H--HHHhcCCCceeEEec-CCCCCHHHHHHHHHHHHHHhhcCCCeeecCC- Confidence 0112366766665555554333221 1 22333333 222221 11111111111 1 11 12222221 Q ss_pred CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 301 GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVR 380 (469) Q Consensus 301 ~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~ 380 (469) +.+++-++.+.....+.+..+...++|+..-++|+.-....++.+...++.. ....+...|.-++. T Consensus 245 ----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~----------~~~f~~~~l~P~~~ 310 (409) T protein:vir:96 245 ----GVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEEL----------NRFYLQHTLLPIVK 310 (409) T ss_pred ----CceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHHHH Confidence 1233333332233345555666778899988888754432222222111111 12333344555555 Q ss_pred HHHHHhcccC---CC-cccceEEeC--CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHH Q lcl|NC_010179. 381 AIMRYLNFSD---AD-KRHISQHWT--RTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKDREE 450 (469) Q Consensus 381 ~i~~~~~~~~---~~-~~~i~i~f~--~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E~~~ 450 (469) .|...++.+= .+ .....+.|+ .-+-.|.++.++++.++ +|+++.-++.+.++. +++-+.=+-...- ..- T Consensus 311 ~ie~~l~~~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~g~~pi~ggD~~~~~~n~-~~~ 389 (409) T protein:vir:96 311 QYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDL-YPI 389 (409) T ss_pred HHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCcceeeecccc-ccc Confidence 5544443221 11 112345554 44556889999999887 688998888887753 2221110100000 000 Q ss_pred hhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
451 NDPYANQADELNGKGVDDE 469 (469) Q Consensus 451 ~~~~~~~~~~~~~~~~~de 469 (469) ..+...+....+++...+| T Consensus 390 ~~~~~~~~~~~gG~~n~~e 408 (409) T protein:vir:96 390 DTPLELRKSLKGGDKNVNE 408 (409) T ss_pred ccchhhcccccCCCCCcCC Confidence 0011111111122222333 No 162 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.03 E-value=0.0002 Score=40.85 Aligned_cols=377 Identities=14% Similarity=0.110 Sum_probs=158.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |++=.-..++..+...... +.. ..-..+-.+..... ...+.... +..-+..+-....|+..+.=+- T Consensus 1 m~~~~~~~~~~~~~~~~~~---~~~--~~~~~~~~~~~~~~------~~~~~~v~---~~~a~~~~~v~~~i~~ia~~iA 66 (412) T protein:vir:26 1 MNVIAKENIVTRIKKKLID---NWI--DQSTSKLYDFSPWK------NRSFWGVI---NNTLETNETIFSAITKLSNSMA 66 (412) T ss_pred CccchhhhhhhhhhhhHhh---hhh--cccccccccccccC------Cccccccc---hhhhhccHHHHHHHHHHHHhHh Confidence 6654333333322211100 000 00011111100000 00000000 0011122333334555555555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc--cH-H---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD--DR-A---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLD 153 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~--n~-~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~ 153 (469) +-|+.+--..+.....+..++.. |. + +....+..+++.+|.+|+++..+.+|++ .+.+++|..+.+..++.. 
T Consensus 67 ~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~- 145 (412) T protein:vir:26 67 SLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQS- 145 (412) T ss_pred hCceeEeeccccccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeEEEEeCCC- Confidence 55665433333344445555542 32 2 2234577889999999999989988986 588899999988877543 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) ..+ +|......+.. ..+....+.+++.-. . T Consensus 146 ~~~-----~y~~~~~~g~~-----~~~~~~evih~~~~~-------------------------------~--------- 175 (412) T protein:vir:26 146 REL-----YYSIHAATGNK-----LIVHNMDMLHFKHIV-------------------------------A--------- 175 (412) T ss_pred cEE-----EEEEEcCCceE-----EEEccccEEEeCCCC-------------------------------C--------- Confidence 111 12222112211 112233333322100 0 Q ss_pred cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC-ceeEEecCCcccchhhhh----hh----h-hcceeeecccCCC Q lcl|NC_010179. 234 PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQT-VILVLTNYGGASLKQFMN----DL----R-EYKSIKINNAGNG 303 (469) Q Consensus 234 ~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~-p~l~~~g~~~~~~~~~~~----~~----~-~~~~~~~~~~~~~ 303 (469) .+.-.|.|.++-+...++..+.+.. . .+..+.. +-.++.... ...++... .+ . ..+++.++ T Consensus 176 ~~~~~G~s~i~~~~~~i~~~~a~~~-~--~~~~~~~~~~~i~~~~~-~l~~e~~~~~~~~~~~~~~~~g~~~vl~----- 246 (412) T protein:vir:26 176 SNMVQGISPIDVLKNTTDFDNAVRT-F--NLTEMQKPDSFMLKYGS-NVGKEKRQQVLEDFKQYYEENGGILFQE----- 246 (412) T ss_pred CCCcccccHHHHHHHHHHHHHHHHH-H--HHHhcCCCCceEEecCC-CCCHHHHHHHHHHHHHHhhcCCCeeecC----- Confidence 0112366666655555554333321 1 1333333 333333211 11112111 11 1 11122221 Q ss_pred CCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
304 DKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA 381 (469) Q Consensus 304 ~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 381 (469) ++.+|.....+ ...+.+..+....+|+..-++|+.-....++.+...++.. ....+..+|.-++.. T Consensus 247 --~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~----------~~~f~~~~l~P~~~~ 314 (412) T protein:vir:26 247 --PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEEL----------NRFYLQHTLLPIVKQ 314 (412) T ss_pred --CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHH----------HHHHHHHHHHHHHHH Confidence 23455444433 3345555666778898888888753332222111111111 112333345555555 Q ss_pred HHHHhccc---CCCc-ccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-----HHHHHHH Q lcl|NC_010179. 382 IMRYLNFS---DADK-RHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ-----ELKDLAK 446 (469) Q Consensus 382 i~~~~~~~---~~~~-~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~-----E~eri~~ 446 (469) |...++.+ ..+. ....+.| ..-+..|.++.++++.++ +|+++.-++.+.++. +++-+. -+..+. T Consensus 315 ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~~gl~p~~ggD~~~~~~n~~~~~- 393 (412) T protein:vir:26 315 YEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWEDLPPVEGGDKPLISGDLYPID- 393 (412) T ss_pred HHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeecccccccc- Confidence 54444321 1111 1233554 444667899999999887 678999888888754 221110 000110 Q ss_pred HHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
447 DREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 447 E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+...+....+++.+.+| T Consensus 394 -----~~~~~~~~~~gG~~n~~e 411 (412) T protein:vir:26 394 -----TPLELRKSLKGGDKNVNE 411 (412) T ss_pred -----cchhhcccccCCCCCcCC Confidence 011111111122222333 No 163 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=96.94 E-value=0.00025 Score=40.36 Aligned_cols=368 Identities=11% Similarity=0.034 Sum_probs=162.6 Q ss_pred ccCC------cccccccc---hhhhc-ccccccccccCcce-eccchHHHHHHHHHHhhhcCCeeeccCch-hhHHHHHH Q lcl|NC_010179. 32 ENKT------DITTRNNG---KPKVS-KEGKKDPLRSADNR-IPSNFYQLLVDQEAGYIASVFPDIDVGKD-ADNKKILD 99 (469) Q Consensus 32 ~g~~------~i~~~~~~---~~~~~-~~~~~~~~~~~~~r-i~~n~~k~iv~~~~~~l~g~p~~~~~~~~-~~~~~l~~ 99 (469) .|-- .+...... ..... .............. +.+.=.-..|+..++-+.+-|+.+..+.. .....+.. T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~ 80 (416) T protein:vir:45 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 80 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccccchHHH Confidence 2210 00000000 00000 00000000000000 00111111355555555566766543222 12233444 Q ss_pred HHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCce Q lcl|NC_010179. 100 VLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGK 172 (469) Q Consensus 100 ~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~ 172 (469) ++.. | .+ +....+....+.+|.+|+++..+.+|++ .+.+++|..+.++.+.. ..+.+. +...+..+.. 
T Consensus 81 lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~~~---~~~~~~~~~~ 155 (416) T protein:vir:45 81 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDAR--GRLYYF---HQRIDSNGNN 155 (416) T ss_pred HHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCC--ccEEEE---EEEecCCCce Confidence 5532 3 22 2234566778899999999999988886 48899999998887643 222211 1111111111 Q ss_pred EEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHH Q lcl|NC_010179. 173 YFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDA 252 (469) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~ 252 (469) . ...+....+.+++.. | .+.-.|.|.++.+...++. T Consensus 156 ~---~~~~~~~evihir~~-------------------------------------~----~d~~~G~s~i~~~~~~i~~ 191 (416) T protein:vir:45 156 I---ERNVKFEDMLDIKFY-------------------------------------S----LDGINGLSLLDTLSRTIES 191 (416) T ss_pred e---EEEEccccEEEeccC-------------------------------------C----CCCccccCHHHHHHHHHHH Confidence 1 111222223222110 0 0112477777777777776 Q ss_pred HHHHHHHHHHHHHHhcCceeEEecCCcccchhhh----hhhh--------hcceeeecccCCCCCCcceEEeecC--CHH Q lcl|NC_010179. 253 YDDIYNGFINDLDDVQTVILVLTNYGGASLKQFM----NDLR--------EYKSIKINNAGNGDKSGVDKLQIDI--PVE 318 (469) Q Consensus 253 ~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~----~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~~--~~~ 318 (469) ......-..+.++..+.|-.+++-......++.. ..+. ..+++.++ .+.+|..... ... T Consensus 192 ~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-------~g~~~~~l~~~~~d~ 264 (416) T protein:vir:45 192 DNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-------ESMTFDQLEVDTEVL 264 (416) T ss_pred HHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC-------CCceeEeccCCHHHH Confidence 6666555666667777776665432221111111 1111 11222222 1234443333 334 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcc Q lcl|NC_010179. 
319 ARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKR 394 (469) Q Consensus 319 ~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~ 394 (469) .+.+..+...+.|+..-++|+.-.... ++.|-+... ..|..+|.-++..|...++.+ ..... T Consensus 265 q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~--------------~~~~~~l~P~~~~ie~~ln~~l~~~~~~~ 330 (416) T protein:vir:45 265 KLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKPYITCVCAELNFKFNDEYVNR 330 (416) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHHHHHHHHHHHhhhccccccCc Confidence 456666777888999888886433211 111211111 112234444444444443322 11122 Q ss_pred cceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHH-----HHHHHHhhh--hHhhcccCCC Q lcl|NC_010179. 395 HISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDL-----AKDREENDP--YANQADELNG 463 (469) Q Consensus 395 ~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri-----~~E~~~~~~--~~~~~~~~~~ 463 (469) .+++.+..-+-.|.++.++.+.++ +|+++.-++.+.++. +++.+..+-.+ ..+.....+ .......... T Consensus 331 ~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~k 410 (416) T protein:vir:45 331 EFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLK 410 (416) T ss_pred eEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccC Confidence 344444454566899999998887 689999998888743 33333221111 111111111 0111112223 Q ss_pred CCCCCC Q lcl|NC_010179. 
464 KGVDDE 469 (469) Q Consensus 464 ~~~~de 469 (469) +|.++| T Consensus 411 gGe~n~ 416 (416) T protein:vir:45 411 GGEENE 416 (416) T ss_pred CCCCCC Confidence 333444 No 164 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=96.94 E-value=0.00025 Score=40.36 Aligned_cols=368 Identities=11% Similarity=0.034 Sum_probs=162.6 Q ss_pred ccCC------cccccccc---hhhhc-ccccccccccCcce-eccchHHHHHHHHHHhhhcCCeeeccCch-hhHHHHHH Q lcl|NC_010179. 32 ENKT------DITTRNNG---KPKVS-KEGKKDPLRSADNR-IPSNFYQLLVDQEAGYIASVFPDIDVGKD-ADNKKILD 99 (469) Q Consensus 32 ~g~~------~i~~~~~~---~~~~~-~~~~~~~~~~~~~r-i~~n~~k~iv~~~~~~l~g~p~~~~~~~~-~~~~~l~~ 99 (469) .|-- .+...... ..... .............. +.+.=.-..|+..++-+.+-|+.+..+.. .....+.. T Consensus 1 Mg~f~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~v~~cv~~Ia~~iA~~p~~~~~~~~~~~~~~~~~ 80 (416) T protein:vir:81 1 MGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVN 80 (416) T ss_pred CCcccccccccccCCCcchhHHHHHhccccccCccccchhhhhcchHHHHHHHHHHHhhccCceEEecCccccccchHHH Confidence 2210 00000000 00000 00000000000000 00111111355555555566766543222 12233444 Q ss_pred HHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCce Q lcl|NC_010179. 100 VLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGK 172 (469) Q Consensus 100 ~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~ 172 (469) ++.. | .+ +....+....+.+|.+|+++..+.+|++ .+.+++|..+.++.+.. ..+.+. +...+..+.. 
T Consensus 81 lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~~~---~~~~~~~~~~ 155 (416) T protein:vir:81 81 LLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIELKSDAR--GRLYYF---HQRIDSNGNN 155 (416) T ss_pred HHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEECCC--ccEEEE---EEEecCCCce Confidence 5532 3 22 2234566778899999999999988886 48899999998887643 222211 1111111111 Q ss_pred EEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHH Q lcl|NC_010179. 173 YFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDA 252 (469) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~ 252 (469) . ...+....+.+++.. | .+.-.|.|.++.+...++. T Consensus 156 ~---~~~~~~~evihir~~-------------------------------------~----~d~~~G~s~i~~~~~~i~~ 191 (416) T protein:vir:81 156 I---ERNVKFEDMLDIKFY-------------------------------------S----LDGINGLSLLDTLSRTIES 191 (416) T ss_pred e---EEEEccccEEEeccC-------------------------------------C----CCCccccCHHHHHHHHHHH Confidence 1 111222223222110 0 0112477777777777776 Q ss_pred HHHHHHHHHHHHHHhcCceeEEecCCcccchhhh----hhhh--------hcceeeecccCCCCCCcceEEeecC--CHH Q lcl|NC_010179. 253 YDDIYNGFINDLDDVQTVILVLTNYGGASLKQFM----NDLR--------EYKSIKINNAGNGDKSGVDKLQIDI--PVE 318 (469) Q Consensus 253 ~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~----~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~~--~~~ 318 (469) ......-..+.++..+.|-.+++-......++.. ..+. ..+++.++ .+.+|..... ... T Consensus 192 ~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~-------~g~~~~~l~~~~~d~ 264 (416) T protein:vir:81 192 DNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKVVVLD-------ESMTFDQLEVDTEVL 264 (416) T ss_pred HHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCceeecC-------CCceeEeccCCHHHH Confidence 6666555666667777776665432221111111 1111 11222222 1234443333 334 Q ss_pred HHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcc Q lcl|NC_010179. 
319 ARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKR 394 (469) Q Consensus 319 ~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~ 394 (469) .+.+..+...+.|+..-++|+.-.... ++.|-+... ..|..+|.-++..|...++.+ ..... T Consensus 265 q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~--------------~~~~~~l~P~~~~ie~~ln~~l~~~~~~~ 330 (416) T protein:vir:81 265 KLIRENKSSTREIAGVFGIPLHKFGIETANMSITDAN--------------LDYLSTLKPYITCVCAELNFKFNDEYVNR 330 (416) T ss_pred HHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccHHHHH--------------HHHHHHHHHHHHHHHHHHhhhccccccCc Confidence 456666777888999888886433211 111211111 112234444444444443322 11122 Q ss_pred cceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHH-----HHHHHHhhh--hHhhcccCCC Q lcl|NC_010179. 395 HISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDL-----AKDREENDP--YANQADELNG 463 (469) Q Consensus 395 ~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri-----~~E~~~~~~--~~~~~~~~~~ 463 (469) .+++.+..-+-.|.++.++.+.++ +|+++.-++.+.++. +++.+..+-.+ ..+.....+ .......... T Consensus 331 ~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~k 410 (416) T protein:vir:81 331 EFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLK 410 (416) T ss_pred eEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecccccccccccccCcccccccccccC Confidence 344444454566899999998887 689999998888743 33333221111 111111111 0111112223 Q ss_pred CCCCCC Q lcl|NC_010179. 
464 KGVDDE 469 (469) Q Consensus 464 ~~~~de 469 (469) +|.++| T Consensus 411 gGe~n~ 416 (416) T protein:vir:81 411 GGEENE 416 (416) T ss_pred CCCCCC Confidence 333444 No 165 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=96.93 E-value=0.00025 Score=40.29 Aligned_cols=417 Identities=10% Similarity=0.057 Sum_probs=183.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+++ +..+.+..++..-..+.+.+.+|.... +-... ...........++-.+-+...++..++.|. T Consensus 1 m~~~---~r~~~L~~~R~~~e~~w~e~~~~tlP~-----~~~~~------~~~~~~~~~~~~~~dstg~~a~~~LAa~l~ 66 (522) T protein:vir:10 1 MKAR---ERYNQLTTARQMFLDKAVECSELTLPY-----LIDDD------ISSRPNHKSLTVPWQSVGAKCCVTLAAKLM 66 (522) T ss_pred CchH---HHHHHHHHHhhHHHHHHHHHHHHhhhc-----ccCCC------CCCCcccccccccccchHHHHHHHHHHHHH Confidence 8855 556666666655566666777775421 11000 000011112234556666777777776654 Q ss_pred cC--Ce-----eeccCchh--------hHHHHHHH-----------H-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCc Q lcl|NC_010179. 81 SV--FP-----DIDVGKDA--------DNKKILDV-----------L-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNN 133 (469) Q Consensus 81 g~--p~-----~~~~~~~~--------~~~~l~~~-----------~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~ 133 (469) +- || ++...+.. ....++.| + ..||...+.++.++..++|.+. 
+|.++++ T Consensus 67 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--ly~~~~~- 143 (522) T protein:vir:10 67 LAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNAL--IFMGKDG- 143 (522) T ss_pred HhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcee--EEEcCCC- Confidence 31 22 12222211 11112222 2 2466677888999999999987 4666664 Q ss_pred eEEEEEccceeEEEEeCCCCCceEEEEEEEEeeec--------C--------CceEEEEEEEEcCCeEEEEEeecCceee Q lcl|NC_010179. 134 FRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDP--------E--------AGKYFTVHEYWTDKEAQFFRTSATDSTV 197 (469) Q Consensus 134 ~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~--------~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 197 (469) +++++-.++++.-|. .+++...+|.++..-. + .......+++|+. . +.....+... T Consensus 144 --~~~~pl~~y~v~~d~--~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~---v-~p~~~~~~~~ 215 (522) T protein:vir:10 144 --LKTFPLTRYVINRDG--DGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTY---V-KLDKSSGRWV 215 (522) T ss_pred --ceEEEcceEEEeeCC--CCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEE---E-EeeccCCceE Confidence 445555555544443 4567767766554210 0 0001111222210 0 1111111100 Q ss_pred cccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCcee Q lcl|NC_010179. 198 IEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVIL 272 (469) Q Consensus 198 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l 272 (469) +. ....+.... 
.....-++..+|++.++ .+.+|+|-.++..+-+..+|.+.-..........+|.+ T Consensus 216 ~~--------~~~~~~~~~-~~~s~~g~~~~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~~~~~~~a~~p~~ 286 (522) T protein:vir:10 216 WH--------QEAFDKIIP-DSRSTAPKNASPWLPLRFNTVDGEDYGRGRVEEFLGDLKSLDGLSQSLIEGAAAASKVVF 286 (522) T ss_pred EE--------EccCCcccc-ccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhcCCce Confidence 00 000000000 00113467778887765 34579999999999999999998888889999999987 Q ss_pred EEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccH Q lcl|NC_010179. 273 VLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASG 350 (469) Q Consensus 273 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg 350 (469) .+.-.+..... .+...+...+..+ ..+++..+. ...+.......++.++..|...-.. .+..+....|+ T Consensus 287 lv~~~~~~~~~----~l~~~~~~~~v~g---~~~~v~~~~~~~~~d~~~~~~~i~~~~~ri~~aFl~--~~~~d~~rvTA 357 (522) T protein:vir:10 287 LVSPSSTTKPA----TIAKAGNGAIVQG---RPEDVAVIQVGKTADFSTAANMATAIEKRLLEAFLV--MNVRNAERVTA 357 (522) T ss_pred eeccccccccc----cccCCCCcceecC---CCccceeecccccccchHHHHHHHHHHHHHHHHHhh--ccCCCCCCCCH Confidence 65321111111 1111111111111 223344443 2345667777788888877664221 12222344566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHhcccCCCc---ccceEEeCCCCCCCHHHHHHHH Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINE------------LVRAIMRYLNFSDADK---RHISQHWTRTKVEDSLTKAQIV 415 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~------------~~~~i~~~~~~~~~~~---~~i~i~f~~~~p~d~~e~~~~~ 415 (469) ..+... +.++...++..+.+ .+.++.+.--+...+. 
....|++..++-+ ++.++.+ T Consensus 358 tEV~~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~~v~~is~Lar--aq~~~~l 428 (522) T protein:vir:10 358 EEVRLT-------QLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPTIVAGVNALGR--GQDRESL 428 (522) T ss_pred HHHHHH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccccccchhHHHH--HHHHHHH Confidence 555433 23444444444433 3333332111111111 1122344444433 2222222 Q ss_pred ----HHHhccCChHH---------HHHhC---CCCC-----CHHHHHHHHHHHHHHhhhh---HhhcccCCCCCCC---- Q lcl|NC_010179. 416 ----STVANYSSKEA---------VAKAN---PIVD-----DWQQELKDLAKDREENDPY---ANQADELNGKGVD---- 467 (469) Q Consensus 416 ----~kl~g~iS~et---------~~~~l---~~v~-----d~~~E~eri~~E~~~~~~~---~~~~~~~~~~~~~---- 467 (469) +.++.++..+. +++.+ -+|+ -.++|++.++++.++.... ..+.....+...- T Consensus 429 ~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~~~~~~a~~~~~~~~~~~~~ 508 (522) T protein:vir:10 429 TAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQSLVDQAGQMTGSPLMDPTK 508 (522) T ss_pred HHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccCccc Confidence 12222222233 33322 1232 1234555554443332211 1111111111111 Q ss_pred -----CC Q lcl|NC_010179. 468 -----DE 469 (469) Q Consensus 468 -----de 469 (469) ++ T Consensus 509 ~~~~~~~ 515 (522) T protein:vir:10 509 NPQLMDE 515 (522) T ss_pred cHHHHHH Confidence 11 No 166 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=96.90 E-value=0.00027 Score=40.18 Aligned_cols=375 Identities=10% Similarity=0.021 Sum_probs=163.6 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeee Q lcl|NC_010179. 
8 KLIRNTSTSRND-LINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDI 86 (469) Q Consensus 8 ~~i~~~~~~~~~-~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~ 86 (469) -+++.+..++.. .......+...+-+... ...+.... +..-+..+-.-..|+..+.-+-+-|+.+ T Consensus 1 ~~f~~~f~r~~~~~~~~~~~~~~~~~~~~~-----------~~~g~~v~---~~~~l~~~~v~~~i~~Ia~~iA~~p~~~ 66 (413) T protein:vir:48 1 MFFSGLFQRKSDAPVTTPAELAEAIGLSYD-----------TYTGKRIS---SQRAMRLTAVYSCVRVLAESVGMLPCSL 66 (413) T ss_pred CccchhhccCccCCccchHHHHHhhhcCcc-----------cccCceec---hhhhhccHHHHHHHHHHHHhhhhCceEE Confidence 111111111000 00000011111110000 00000000 0000112223334555555555556553 Q ss_pred ccCc-h----hhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 87 DVGK-D----ADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 87 ~~~~-~----~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~ 154 (469) -..+ + .....+..++.. | ..+....+....+.+|.+|+++..+ .|++ .+.+++|..+.+..+... T Consensus 67 ~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~-~g~~~~L~~l~~~~v~~~~~~~~-- 143 (413) T protein:vir:48 67 YKISGTLKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKA-LGEVVELLPIDPGCVEPKLNSQW-- 143 (413) T ss_pred EEecCCcceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeC-CCcEEEEEEEcCceEEEEEcCCc-- Confidence 2111 1 112335555542 2 2233456778889999999888665 4665 488899999888776432 Q ss_pred ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 155 KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 155 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) .+. |......|.. ..+....+.+++.- . . 
T Consensus 144 ~~~-----y~~~~~~g~~-----~~~~~~evih~~~~--------------------------------~---------~ 172 (413) T protein:vir:48 144 QPV-----YQVTFPDGSV-----DVLTQDEIWHVRTL--------------------------------T---------L 172 (413) T ss_pred eEE-----EEEEecCceE-----EEEccccEEEecCc--------------------------------C---------C Confidence 221 2112112211 11223333332110 0 0 Q ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhhh--------cceeeecccCCC Q lcl|NC_010179. 235 KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLRE--------YKSIKINNAGNG 303 (469) Q Consensus 235 n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~~--------~~~~~~~~~~~~ 303 (469) +...|.|-+..+...++.......-..+.++..+.|-.+++....... +.....+.. .+.+.++ T Consensus 173 d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~~~~e~~~~~~~~~~~~~~g~~n~g~~~vl~----- 247 (413) T protein:vir:48 173 DGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQKLTPDAYERLKKDFEERHTGLGNAHRPMILE----- 247 (413) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcCccccCcceecC----- Confidence 122477777777777776666666666666767777666654322111 111221111 1122221 Q ss_pred CCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 304 DKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA 381 (469) Q Consensus 304 ~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 381 (469) ++++|..... ....+.+..+.....|+..-++|+.-....++.+...++... ...+..+|.-+++. T Consensus 248 --~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~----------~~f~~~~i~P~~~~ 315 (413) T protein:vir:48 248 --MGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG----------LGFINYSLVPYLTR 315 (413) T ss_pred --CCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH----------HHHHHHHHHHHHHH Confidence 2344444333 334455667777888999888887433322211111111111 12233344444444 Q ss_pred HHHHhccc---CC--CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHhh Q lcl|NC_010179. 
382 IMRYLNFS---DA--DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKDREEND 452 (469) Q Consensus 382 i~~~~~~~---~~--~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E~~~~~ 452 (469) +...++.+ .. ....+++.+..-+-.|.++.++++.++ +|+++.-.+.++++. ++.-+.=+ +........ T Consensus 316 ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~ggD~~~--~~~n~~~~~ 393 (413) T protein:vir:48 316 IEQRINTGLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDLEDMNPRPGGDVYL--TPMNMTTSP 393 (413) T ss_pred HHHHHHhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceee--ccccccccc Confidence 44333321 11 122344444455556899999999887 678998888887754 22211101 011101111 Q ss_pred hhHhhcccCCCCCCCCC Q lcl|NC_010179. 453 PYANQADELNGKGVDDE 469 (469) Q Consensus 453 ~~~~~~~~~~~~~~~de 469 (469) ...++.+...+++..|| T Consensus 394 ~~~~~~~~~~~~~~~~~ 410 (413) T protein:vir:48 394 SAGDDNGKKKESGDADK 410 (413) T ss_pred cccccCCCCCCCCCccc Confidence 22223333344444444 No 167 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=96.90 E-value=0.00027 Score=40.16 Aligned_cols=422 Identities=9% Similarity=0.064 Sum_probs=185.8 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhc Q lcl|NC_010179. 3 LDA-LKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIAS 81 (469) Q Consensus 3 ~~~-~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g 81 (469) .+. +++..+.+..++..-..+.+.+.+|....- ..... 
........++-.+-+...++..++.|++ T Consensus 1 mk~~a~~r~~~l~~~R~~~e~~w~e~~~y~lP~~---~~~~~----------~~~~~~~~~~~dstg~~a~~~Laa~l~~ 67 (542) T protein:vir:78 1 MKGLAQARYSAMRADREDFLDMARRCAALTLPYL---LTEDG----------HASGGRLQQPYQSLGSKGVNALSSKLML 67 (542) T ss_pred ChhHHHHHHHHHHHHhhHHHHHHHHHHHHhcccc---CCCCC----------CcccccccccccchHHHHHHHHHHHHHH Confidence 332 345556665555555566666666654310 00000 0011112345556667777777776653 Q ss_pred C--Ce-----eeccCc----------hhhHHHHH-----------HHH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCC Q lcl|NC_010179. 82 V--FP-----DIDVGK----------DADNKKIL-----------DVL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDN 132 (469) Q Consensus 82 ~--p~-----~~~~~~----------~~~~~~l~-----------~~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~ 132 (469) - || ++...+ ......++ ..+ ..||...+.++.++..++|.+. +|.+++ T Consensus 68 ~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~~~~- 144 (542) T protein:vir:78 68 SLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVL--VFAGKK- 144 (542) T ss_pred hhcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEE--EEecCC- Confidence 1 22 122222 11111222 222 3466777888999999999985 456665 Q ss_pred ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC---------------------CceEEEEE-EEEcCCeEEEEEe Q lcl|NC_010179. 133 NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE---------------------AGKYFTVH-EYWTDKEAQFFRT 190 (469) Q Consensus 133 ~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~---------------------~~~~~~~~-~~~~~~~~~~~~~ 190 (469) . ++.++-.++++.-|. .+++...+|.++..... .+..+..+ -++.......|.. 
T Consensus 145 ~--~~~~pl~~y~v~~d~--~G~vd~v~r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~~~ 220 (542) T protein:vir:78 145 T--LKVYPLDRYVIERDG--DGNVIEIITRELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVFTC 220 (542) T ss_pred C--ceEEecceeEEeeCC--CCCeEEEeeeeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccccc Confidence 2 445555554444443 45666677766543110 00111111 1111111111111 Q ss_pred ecCceeeccccccccccccccccccccc--ccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 191 SATDSTVIEPYNIITSYDLSAGYETGQS--NTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFIND 263 (469) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~ 263 (469) ..... ..+......++... .....+|..+|++.++ .+.+|+|-.++..+-+..+|.+.-..... T Consensus 221 ~~~~~---------~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~ 291 (542) T protein:vir:78 221 CKLVD---------GQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFNVVDGESYGRGRVEEFFGDLSSLDALTRSLIEG 291 (542) T ss_pred cccCC---------CeEEEEEEeccccccccccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 10000 00111111111111 1223467778887765 34579999999999999999999999999 Q ss_pred HHHhcCceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEE--eecCCHHHHHHHHHHHHHHHHHHhCCCCcC Q lcl|NC_010179. 264 LDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKL--QIDIPVEARDDALKITRDNIFLFGQGIDPA 341 (469) Q Consensus 264 ~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~ 341 (469) .....+|.+.+.-.+....... .. ...+.+. .+ ...+++.+ ....+.......++.++..|-..-.. . 
+ T Consensus 292 ~~~a~~pp~lv~~~g~~~~~~~-~~-~~~g~iv--~g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~rI~~aFl~-~-~ 362 (542) T protein:vir:78 292 SAAAAKVVFMVSPSATTKPQSL-AR-AGTGAII--QG---RAEDVSVVQANKGADFRTVQEMIRDLSQRISDAFLI-L-N 362 (542) T ss_pred HHHHhcCceeeccccccchhhc-cc-CCCceee--cC---CccceeeeecccccchhHHHHHHHHHHHHHHHHhcc-c-c Confidence 9999999866531111111111 01 1111221 11 12234433 34446777788888888877543221 1 1 Q ss_pred ccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHhcccCCCcccceEEeCCCCCCC-H Q lcl|NC_010179. 342 NFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINEL------------VRAIMRYLNFSDADKRHISQHWTRTKVED-S 408 (469) Q Consensus 342 ~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~------------~~~i~~~~~~~~~~~~~i~i~f~~~~p~d-~ 408 (469) ..+....|+..+.. ++.++...++..+.++ +.++.+.--+...+..-+++++..++..- . T Consensus 363 ~~d~~rvTAtEV~~-------r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~lv~~~~~s~La~~~r 435 (542) T protein:vir:78 363 VRQSERTTATEVRE-------VQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLPKGLVMPTVVAGLGGVGR 435 (542) T ss_pred cCCcccccHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhceeeeeechHHHHHH Confidence 11223345544332 3344555555554443 33322211122233334677777665431 1 Q ss_pred HHHHHHH----HHHhcc---------CChHHHHHhC---CCCC-----CHHHHHHHHHHHHHHhhhhH---hhcccCCC- Q lcl|NC_010179. 409 LTKAQIV----STVANY---------SSKEAVAKAN---PIVD-----DWQQELKDLAKDREENDPYA---NQADELNG- 463 (469) Q Consensus 409 ~e~~~~~----~kl~g~---------iS~et~~~~l---~~v~-----d~~~E~eri~~E~~~~~~~~---~~~~~~~~- 463 (469) .+.++.+ +.++.+ +....++..+ -+|+ ..++|+++++++..+..... ++...... T Consensus 436 ~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~~ 515 (542) T protein:vir:78 436 GEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAKS 515 (542) T ss_pred HHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 1222222 111222 2233333322 1232 11345555444422221111 11111010 Q ss_pred ---C----------------CCCCC Q lcl|NC_010179. 
464 ---K----------------GVDDE 469 (469) Q Consensus 464 ---~----------------~~~de 469 (469) + .+.+| T Consensus 516 ~~~~~~~~~~~a~~~~~~~~~~~~~ 540 (542) T protein:vir:78 516 PIGEKMMQQINAPGQEAPAGPQTGE 540 (542) T ss_pred ccccchhhhcCCCCcCCCCCCcccc Confidence 0 01111 No 168 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=96.86 E-value=0.00029 Score=39.96 Aligned_cols=419 Identities=9% Similarity=0.016 Sum_probs=184.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.+++..+.+..++..-..+.+.+.+|.... ++... .......|+-.+-+..-++..++-|. T Consensus 11 ~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~--~~~~~-------------~~~~~~~~~~dstg~~a~~~LAa~l~ 75 (516) T protein:vir:96 11 GKRSKIPKLWEKFSNKRSSFLDRAKHYSKLTLPY--LMNDK-------------GDNETSQNGWQGVGAQATNHLANKLA 75 (516) T ss_pred hhHHHHHHHHHHHHHHhhHHHHHHHHHHHhhccc--ccCCC-------------CCccccCCcccchHHHHHHHHHHHHH Confidence 5555666666666666655566667777776541 11110 01111223445666666777766554 Q ss_pred cC--Ce-----eeccCchh---------hHHHHHH-----------HH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCC Q lcl|NC_010179. 81 SV--FP-----DIDVGKDA---------DNKKILD-----------VL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDN 132 (469) Q Consensus 81 g~--p~-----~~~~~~~~---------~~~~l~~-----------~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~ 132 (469) |- || ++...++. ....++. .+ ..||...+.++.++...+|.+. 
+|.++++ T Consensus 76 ~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~d~~~ 153 (516) T protein:vir:96 76 QVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM--LYKPSKG 153 (516) T ss_pred hhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe--EEecCCC Confidence 31 22 22222211 1112222 23 2366677888889999999975 5667777 Q ss_pred ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeec--------C--------CceEEEEEEEEcCCeEEEEEeecCcee Q lcl|NC_010179. 133 NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDP--------E--------AGKYFTVHEYWTDKEAQFFRTSATDST 196 (469) Q Consensus 133 ~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~--------~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (469) .++ .++-.++++.-|. .+++...++..+.... . ..+....+++|+.- ........ T Consensus 154 ~~~--~~pl~~y~v~~d~--~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v-----~~~~~~~~ 224 (516) T protein:vir:96 154 AIS--AIPMHHYVVNRDT--NGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHA-----KYLGDGFW 224 (516) T ss_pred CEE--EEEcCeEEEeeCC--CCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEee-----eeeCCcee Confidence 644 4444554444443 3345444443221000 0 00000112222210 00111111 Q ss_pred ecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010179. 197 VIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVI 271 (469) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~ 271 (469) .+ +....+..... ....+|..+|++.++ .+.+|.|-.++..+-+..+|.+.-...........|. 
T Consensus 225 ~~--------~~~~d~~~~~~--es~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~ 294 (516) T protein:vir:96 225 EL--------KQSADDIPVGK--VSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIK 294 (516) T ss_pred EE--------EEEeCceeecc--ccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCc Confidence 00 00001111111 112345567777665 3457999889999999999988888888888888776 Q ss_pred eEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCcc Q lcl|NC_010179. 272 LVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNAS 349 (469) Q Consensus 272 l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~S 349 (469) +.+.-.+... ...+...+.-.+.++ ...++..++ ...+.......++.++..|-..-....+...+....| T Consensus 295 ~lv~p~g~~~----~~~l~~~~~g~i~~g---~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~r~~~rvT 367 (516) T protein:vir:96 295 YLIRPGAQTD----VDHFVNSGTGEVVTG---VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVT 367 (516) T ss_pred cccCcccccc----hhhhccCCCceeecC---CcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhccCCCcccc Confidence 5543111111 111111111111121 122345544 3335677777788777777553222112122223345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhcc--cCCCcccceEEeCCCCCC-CHHHHHHHH------ Q lcl|NC_010179. 350 GVAIKMLYSHLELKAAKTQTYFEHAINELVR-----AIMRYLNF--SDADKRHISQHWTRTKVE-DSLTKAQIV------ 415 (469) Q Consensus 350 g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~-----~i~~~~~~--~~~~~~~i~i~f~~~~p~-d~~e~~~~~------ 415 (469) +..+. .++.+++..++..+.++-. +|.+.+.. .......+.+.+..++.. 
-..+.++.+ T Consensus 368 AtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~l~~~~p~lp~~~v~~~~vs~l~~l~r~~~~~~i~~~~~~ 440 (516) T protein:vir:96 368 AVEIQ-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGESFTSDLVDPVIITGIEALGRMAELDKLANFAQY 440 (516) T ss_pred HHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhcCCCCccccccceeechHHHHHHHHHHHHHHHHHHH Confidence 55443 3456667777777766422 11122211 122222234443322211 111111111 Q ss_pred -HHHhc-------cCChHHHHHhC---CCCC----CHHHHHHHHHHHHHHhhhhHhhcc---cCCCCCCCCC Q lcl|NC_010179. 416 -STVAN-------YSSKEAVAKAN---PIVD----DWQQELKDLAKDREENDPYANQAD---ELNGKGVDDE 469 (469) Q Consensus 416 -~kl~g-------~iS~et~~~~l---~~v~----d~~~E~eri~~E~~~~~~~~~~~~---~~~~~~~~de 469 (469) ..+++ .+....++..+ -+|+ -.++|++++++++.+..+...... ..-.+...+| T Consensus 441 i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~~~~q~~~~~a~~~~~~~~~~~~~~ 512 (516) T protein:vir:96 441 MSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQMQAQQAQMLEEGVAKAVPGVIQQE 512 (516) T ss_pred HHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHhhcc Confidence 11221 12233444332 1122 124566666655554443332221 1111222222 No 169 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=96.78 E-value=0.00034 Score=39.59 Aligned_cols=419 Identities=8% Similarity=0.025 Sum_probs=192.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) .+.+.+++..+.+..++..-..+.+.+.+|...- +.... ...+...++-.+-...-++..++.|. 
T Consensus 10 ~~~~~l~~r~~~Lk~~R~~~e~~w~e~~~~tlP~--~~~~~-------------~~~~~~~~~~dstg~~a~~~LAa~l~ 74 (515) T protein:vir:70 10 GQRSKIPKLWEKFSKKRSPYLDRAKHFAKLTLPY--LMNNK-------------GDNETSQNGWQGVGAQATNHLANKLA 74 (515) T ss_pred CCHHHHHHHHHHHHHhhhHHHHHHHHHHHHhccc--ccCCC-------------CCcccccccccchHHHHHHHHHHHHH Confidence 7788888888888777766666777777776641 11110 00111123445555666666666554 Q ss_pred cC--Cee-----eccCch---------hhHHHHHHH-----------H-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCC Q lcl|NC_010179. 81 SV--FPD-----IDVGKD---------ADNKKILDV-----------L-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDN 132 (469) Q Consensus 81 g~--p~~-----~~~~~~---------~~~~~l~~~-----------~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~ 132 (469) +- ||. +...+. .....++.| + ..||...+.++.++...+|.+. +|.++++ T Consensus 75 ~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~d~~~ 152 (515) T protein:vir:70 75 QVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCL--LYKPSKG 152 (515) T ss_pred HhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEE--EEEeCCC Confidence 31 221 222111 111122222 2 2366677888889999999985 4567776 Q ss_pred ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC-----C-----------ceEEEEEEEEcCCeEEEEEeecCcee Q lcl|NC_010179. 133 NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE-----A-----------GKYFTVHEYWTDKEAQFFRTSATDST 196 (469) Q Consensus 133 ~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~-----~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (469) .++ .++-.++++.-|. .+++...+|.++..... + ......+++|+. ......+.. T Consensus 153 ~~~--~~pl~~y~v~~d~--~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~-----v~~~~~~~~ 223 (515) T protein:vir:70 153 AMS--AVPMHHYVVNRDT--NGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTH-----AQYAGEGFW 223 (515) T ss_pred CeE--EEEcCeEEEeeCC--CcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEE-----EEecCCCce Confidence 654 4444554444443 45566666655432110 0 000111222211 001111111 Q ss_pred ecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010179. 
197 VIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVI 271 (469) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~ 271 (469) . .+....+..... ....+|..+|++.++ ++.+|+|-.++..+-+..+|.+.-..........+|. T Consensus 224 ~--------~~~e~d~~~~~~--es~y~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~p~ 293 (515) T protein:vir:70 224 K--------INQSADDIPVGK--ESRIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAMARGAALMADIK 293 (515) T ss_pred E--------EEEecCceeecc--ccccccccCCceeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 1 111111111111 112235667777665 3457999999999999999998888888888888887 Q ss_pred eEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCcc Q lcl|NC_010179. 272 LVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNAS 349 (469) Q Consensus 272 l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~S 349 (469) +.+.-.+... ...+.....-.+.++ ...++..+. ...+.......++.++..|-..-....+......+.| T Consensus 294 ~lv~~~g~~~----~~~l~~~~~g~iv~g---~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvT 366 (515) T protein:vir:70 294 YLIRPGSQTD----VDHFVNSGTGEVITG---VAEDIHIVQLGKYADLTPISAVLEVYTRRIGVIFMMETMTRRDAERVT 366 (515) T ss_pred eeeCcccccc----hhhccccCCceeecC---CcccceeeecCcccchhHHHHHHHHHHHHHHHHHhhhhhhccCCcccc Confidence 6653111111 111111111111111 223344444 3345677777788877777543222111111122345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhcc--cCCCcccceEEeCCCCCC-CHHHHHHHHHH---- Q lcl|NC_010179. 350 GVAIKMLYSHLELKAAKTQTYFEHAINELVRA-----IMRYLNF--SDADKRHISQHWTRTKVE-DSLTKAQIVST---- 417 (469) Q Consensus 350 g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~-----i~~~~~~--~~~~~~~i~i~f~~~~p~-d~~e~~~~~~k---- 417 (469) +..+. .+..+++..++..+.++-.- +...+.. .......+.+.+..++.. ...+.++.+.. 
T Consensus 367 AtEV~-------~r~~E~~~~LGpv~srL~~Ell~Pli~r~~~~~~p~~P~~~v~~~~vs~l~~L~r~q~~~~i~~~~q~ 439 (515) T protein:vir:70 367 AVEIQ-------RDALEIEQNMGGVYSLFAMTMQTPIAMWGLQEAGDSFTSELVDPVIVTGIEALGRMAELDKLANFAQY 439 (515) T ss_pred HHHHH-------HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhCCCCChhhcccceehhHHHHHHHHHHHHHHHHHHH Confidence 55443 34567777788777764221 1111111 111222233333222111 11111222211 Q ss_pred ---Hhc-------cCChHHHHH----hCCC---CCCHHHHHHHHHHHHHHhh---hhHhhcccCCCCCCCCC Q lcl|NC_010179. 418 ---VAN-------YSSKEAVAK----ANPI---VDDWQQELKDLAKDREEND---PYANQADELNGKGVDDE 469 (469) Q Consensus 418 ---l~g-------~iS~et~~~----~l~~---v~d~~~E~eri~~E~~~~~---~~~~~~~~~~~~~~~de 469 (469) +++ .+....+++ .++- +--.++|++.+.++.++.. ...++..+..+++.-|+ T Consensus 440 i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~~~~~~~~~a~~~~~~~~ 511 (515) T protein:vir:70 440 MSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEAMLNEGVAKAVPGVIQQE 511 (515) T ss_pred HHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccchhhh Confidence 111 122222222 2221 1112466666665544332 23344444444444455 No 170 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=96.77 E-value=0.00035 Score=39.54 Aligned_cols=384 Identities=13% Similarity=0.061 Sum_probs=165.0 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCcccccccchhhhccccccccccc---CcceeccchHHHHHHHHHHhhhcCCeeec-c Q lcl|NC_010179. 13 TSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRS---ADNRIPSNFYQLLVDQEAGYIASVFPDID-V 88 (469) Q Consensus 13 ~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~---~~~ri~~n~~k~iv~~~~~~l~g~p~~~~-~ 88 (469) +.....+++.+.+....=+.|.. +. ..+........+....... +..-+..+-....|+..++-+.+-|+.+- . 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~g~~-~s-~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ci~~Ia~~ia~lp~~~~~~ 78 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLKWLGVP-IS-LTDGSFWSAWGGMGSSSGETVTADSALQLSAVWSCVRLIAETIATLPLNLYQT 78 (437) T ss_pred CCcchhhhhhhhHHhhhhhcCCc-cc-CCchhHHHhhcccccCCCceechHhhhccHHHHHHHHHHHHHHhhCceeEEEE Confidence 22223334444433211122321 10 0000000000000000000 00011122233345555555555565431 1 Q ss_pred C-c----hhhHHHHHHHHhc--cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCce Q lcl|NC_010179. 89 G-K----DADNKKILDVLGD--DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKL 156 (469) Q Consensus 89 ~-~----~~~~~~l~~~~~~--n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~ 156 (469) . + ......+..+|.. |. .+....+...++.+|.+|+++-.+. |++. +.+++|..+.+..+.+ ..+ T Consensus 79 ~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~-g~~~~L~~l~p~~v~i~~~~~--g~~ 155 (437) T protein:vir:10 79 KPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSA-GVLIGLELMLPQRTTVKRLTS--GAL 155 (437) T ss_pred cCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecC-CcEEEEEEEcCcceEEEECCC--CeE Confidence 1 1 1122334455542 32 2234456778899999999887774 7654 8889999988876542 221 Q ss_pred EEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCC Q lcl|NC_010179. 157 LGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN 236 (469) Q Consensus 157 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~ 236 (469) +|......|.. ..+....+.+++.-. .+. T Consensus 156 -----~y~~~~~~g~~-----~~~~~~dIih~r~~~-----------------------------------------~d~ 184 (437) T protein:vir:10 156 -----QYTYRNVDGTV-----STLAEDDVFHVRGFS-----------------------------------------LDG 184 (437) T ss_pred -----EEEEEecCceE-----EEEccccEEEecCcC-----------------------------------------CCC Confidence 12212222211 112223333221100 011 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhhh--------cceeeecccCCCCC Q lcl|NC_010179. 
237 KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLRE--------YKSIKINNAGNGDK 305 (469) Q Consensus 237 ~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~~--------~~~~~~~~~~~~~~ 305 (469) ..|.|-++.+...++.......-..+.+...+.|-.++........ ......+.. .+++.++ T Consensus 185 ~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~nag~~~vl~------- 257 (437) T protein:vir:10 185 LMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQILQKEKRAEIRTDLAEQFGGAMQAGKTMVLE------- 257 (437) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCCCCCHHHHHHHHHHHHHHhcCccccCcceecc------- Confidence 2466767766666666665555566666777777666654322111 111111111 1222222 Q ss_pred CcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 306 SGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA 381 (469) Q Consensus 306 ~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 381 (469) ++.+|..... ....+.+..+.....|+..-++|+.-.... ++..+..++... ...+..+|.-.+.. T Consensus 258 ~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~----------~~f~~~tl~P~~~~ 327 (437) T protein:vir:10 258 AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQT----------LGFLTFTLRPWLTR 327 (437) T ss_pred CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHH----------HHHHHHHHHHHHHH Confidence 2244444333 334456666777788999888887433221 222222222111 22333444444444 Q ss_pred HHHHhcc-----cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH------HHHHHHH Q lcl|NC_010179. 382 IMRYLNF-----SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ------ELKDLAK 446 (469) Q Consensus 382 i~~~~~~-----~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~------E~eri~~ 446 (469) |...++. ...+...+++.+..-+..|..+.++++.++ +|+++.-.+.+.++. ++.-.. 
-+..+.+ T Consensus 328 ie~~l~~kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~ 407 (437) T protein:vir:10 328 IEQAARRSLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLMTRDECRAKENLPPMGGNAAVLTVQSALLPIDK 407 (437) T ss_pred HHHHHHhhccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcceEeecCcccchhh Confidence 4444332 111222344555566777899999998876 688999888887643 221110 1111211 Q ss_pred HHHHhhhhHhhc------ccCCCCCCCCC Q lcl|NC_010179. 447 DREENDPYANQA------DELNGKGVDDE 469 (469) Q Consensus 447 E~~~~~~~~~~~------~~~~~~~~~de 469 (469) --+...+...+. ...+....++| T Consensus 408 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 436 (437) T protein:vir:10 408 LGEHTTATAAQDALKAWLYQEEKTRATQE 436 (437) T ss_pred ccCcCCCcchhccccccCCCCCCCCcccc Confidence 111111111110 11111112222 No 171 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=96.75 E-value=0.00036 Score=39.45 Aligned_cols=363 Identities=12% Similarity=0.102 Sum_probs=154.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+ .+.+..... +. .....++.+--.-..... ...+.... 
+..-+..+-.-..|+..+.-+- T Consensus 1 M~~------f~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~-----~~~~~~v~---~~~al~~~~v~~~i~~ia~~ia 62 (386) T protein:vir:49 1 MPI------FNITNLATE---SP-PINQESFFDIADSDFLAS-----LNSSEWVS---AENALKNSDLFSIISQLSNDLA 62 (386) T ss_pred Cch------hhhhccCCC---Cc-ccchhhhhhhhhcccccc-----ccCCceec---hhhhhccHHHHHHHHHHHHHhh Confidence 222 011100000 00 000011100000000000 00000000 0000111222234455555555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc-c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD-D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~-n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~ 154 (469) +-|+.+.-. .. ..++.. | ..+....+..+.+.+|.+|+.+-.+.+|++ .+.+++|..+-+..++.. . T Consensus 63 ~~p~~~~~~--~~----~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~~~~-~ 135 (386) T protein:vir:49 63 TAKITTSRK--QL----QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRLDNQ-N 135 (386) T ss_pred hCceeeccc--hh----hhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEcCCC-c Confidence 556654321 11 122221 2 223345667788899999999888888886 588899999887765432 1 Q ss_pred ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 155 KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 155 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) .+.+ .+...+..+... ..+....+.+++.-. +. T Consensus 136 ~~~y---~~~~~~~~~~~~----~~~~~~evih~~~~~----------------------------------------~~ 168 (386) T protein:vir:49 136 GLYY---NITFDDPHIAPK----QHVPQNDILHFRLLS----------------------------------------VD 168 (386) T ss_pred eEEE---EEEEcCccccce----eEEccccEEEecCCC----------------------------------------CC Confidence 1111 111111111110 111222222221100 00 Q ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh---------hcceeeecccCCCCC Q lcl|NC_010179. 
235 KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR---------EYKSIKINNAGNGDK 305 (469) Q Consensus 235 n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~ 305 (469) ..-.|.|.+..+...++....+..-..+.+...+.|-.+++-......+ ....+. ..+++.++. + T Consensus 169 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~-~~~~~~~~~~~~~~n~g~~~vl~~-----g 242 (386) T protein:vir:49 169 GGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGILKIKGGGLLD-FKTKVSRSRQAMKQMQGGPLVLDD-----L 242 (386) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCCCCChH-HHHHHHHHHHHhccCCCCceecCC-----C Confidence 0124778787777777766666666666677777776666532222211 111111 112222211 1 Q ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 306 SGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIM 383 (469) Q Consensus 306 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~ 383 (469) .+++-+..+.....+.+..+.....|+..-++|+.-... .+..++..++..+ ...+...++.+. T Consensus 243 ~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~--------------~~~i~~~l~~i~ 308 (386) T protein:vir:49 243 EDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNIY--------------FKSVSRYLRPFV 308 (386) T ss_pred ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHHH--------------HHHHHHHHHHHH Confidence 234444333344456667788889999998998754432 1223443333222 222222332222 Q ss_pred HHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHhhhhHhhc Q lcl|NC_010179. 384 RYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVDDWQQELKDLAKDREENDPYANQA 458 (469) Q Consensus 384 ~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~d~~~E~eri~~E~~~~~~~~~~~ 458 (469) ..++.+= ...+.+.....+-.|..+.+..+.++ +|+++.-++.+++ |+..+ |+-+.. .+ .. 
T Consensus 309 ~~~~~~l--~~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~---~~~~~~------~~---~~ 374 (386) T protein:vir:49 309 SEMSKKL--SCEVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPK---ELPDGK------NP---NR 374 (386) T ss_pred HHHHHHh--cchhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCC---cCcchh------cc---CC Confidence 2221110 01223333333445666777777776 6789988887765 33332 111100 00 11 Q ss_pred ccCCCCCCCCC Q lcl|NC_010179. 459 DELNGKGVDDE 469 (469) Q Consensus 459 ~~~~~~~~~de 469 (469) ...++++.++| T Consensus 375 ~~~~gGd~~~~ 385 (386) T protein:vir:49 375 TSLKGGEINEQ 385 (386) T ss_pred CCCCCCCCCCC Confidence 23344444455 No 172 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=96.72 E-value=1.4e-05 Score=47.11 Aligned_cols=170 Identities=12% Similarity=0.037 Sum_probs=90.0 Q ss_pred eeEEecCCcc---cchhhhhhh---hhc----ceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc Q lcl|NC_010179. 271 ILVLTNYGGA---SLKQFMNDL---REY----KSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP 340 (469) Q Consensus 271 ~l~~~g~~~~---~~~~~~~~~---~~~----~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~ 340 (469) ++.++|.... ........+ ..+ ..+.+... .-+|-+.+.+.+.+...+......|...+++|-. T Consensus 1 V~k~~~l~~~~~~~~~~~~~r~~~~~~~~~~~~~~~ld~~------~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t 74 (201) T protein:vir:10 1 MWKAKGLADLCDDSDGAARLRLAQVDNNSGVGQAIGIDAD------SEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEI 74 (201) T ss_pred CccchHHHHHhcCChHHHHHHHHHHHHhhhhhhhheeecC------CcceeeeecCcCChHHHHHHHHHHHHhHhcCchh Confidence 1112221100 000000011 011 11112111 1245566778889999999999999999999976 Q ss_pred Ccccc--C--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHH Q lcl|NC_010179. 
341 ANFES--S--NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVS 416 (469) Q Consensus 341 ~~~~~--g--~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~ 416 (469) -..+. + |.||..=..-|...+. ...+..+...+++++.+++. ..+++++|++-...++++.|+... T Consensus 75 ~LfG~sp~Glnatge~d~~nyyd~i~--~~Qe~~l~p~le~l~~~~~~--------~~~~~~~f~pL~~~s~kekAei~~ 144 (201) T protein:vir:10 75 ILKGKNVGGVSASQNTALETFYGYVD--RKRKAELLPLLEFLLPFIVT--------EQEWSVEFNPLSQVSDKDKSEILE 144 (201) T ss_pred hhcCCCCccccccchhHHHHHHHHHH--HHHHHHHHHHHHHHHHhhcC--------CCCceEeeCCCCCCCHHHHHHHHH Confidence 55442 2 3578754444444433 23346778888888876542 246899999999999999988665 Q ss_pred HH---------hccCChHHHHHhC------CCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCC Q lcl|NC_010179. 417 TV---------ANYSSKEAVAKAN------PIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDD 468 (469) Q Consensus 417 kl---------~g~iS~et~~~~l------~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~d 468 (469) +. +|++|.+.+...| +++.+ ++.+++ +.+ ....++...|+ T Consensus 145 ~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~----~~e--------~~dp~~~~~~~ 201 (201) T protein:vir:10 145 KNVNSVAALIAAGIIDADEARDTLRAISTEVKIGEGSIQTEVV----INE--------SEDPLDVSANN 201 (201) T ss_pred HHHHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCCCCcccc----ccc--------cCCCCCCCCCC Confidence 42 4678877776553 33322 111110 100 11111122222 No 173 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=96.72 E-value=0.00039 Score=39.29 Aligned_cols=373 Identities=12% Similarity=0.146 Sum_probs=168.5 Q ss_pred CCHHHH----------HHHHH----HHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccc Q lcl|NC_010179. 1 MELDAL----------KKLIR----NTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSN 66 (469) Q Consensus 1 ~~~~~~----------~~~i~----~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n 66 (469) |+.+-. .-+|- -.+.++ +.|++ -|......- ... 
T Consensus 92 ~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~----~eyr~-------~~~~ia~e~---------------------~R~ 139 (698) T protein:vir:10 92 LDFNGTSMDALSFVTSSGFPGFPTLVLLAQL----PEYRA-------MHEVLADEC---------------------IRT 139 (698) T ss_pred hcccccccccchhhhccCcchHHHHHHHhhc----cchhh-------HHHHHHHHh---------------------hcc Confidence 332210 00110 011111 11111 110000000 001 Q ss_pred hHHHH--HHHHH---HhhhcCCeeeccC-chhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc------ Q lcl|NC_010179. 67 FYQLL--VDQEA---GYIASVFPDIDVG-KDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN------ 133 (469) Q Consensus 67 ~~k~i--v~~~~---~~l~g~p~~~~~~-~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~------ 133 (469) |.+.+ +...+ ++-.|. ..... +.+..+.|..-+++ +..+.+.+..+++-.||.+..++-++.++. T Consensus 140 w~~~~~~~~e~~~~~g~~~~~--~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l~~PL 217 (698) T protein:vir:10 140 WGEAIGGTKEKADTSGLAAGG--NAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPL 217 (698) T ss_pred cceeccccchhhhhhcccccc--cccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCcccccccc Confidence 10000 00000 111111 11111 22333455555543 556778899999999999987776644331 Q ss_pred -----------eE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccc Q lcl|NC_010179. 134 -----------FR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPY 201 (469) Q Consensus 134 -----------~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (469) ++ +.+++|..+.|-.-+.. .+.. -.+|.+...... +. .+... T Consensus 218 ~~~~~~I~kGslKGL~ViDp~~vtP~~~n~~--dP~s------------------pdfgkP~~y~V~---G~---~IH~S 271 (698) T protein:vir:10 218 VPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVA------------------DDFYKPSTWWMI---GS---EVHAT 271 (698) T ss_pred ccccccccCccceeeeeecccccccchhhhc--cchh------------------hccCCCceEEEe---cc---eecce Confidence 11 44555555554211000 0000 011111110000 00 00000 Q ss_pred cccccccccccccccccccccccCCcccEEEec--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC-- Q lcl|NC_010179. 
202 NIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY-- 277 (469) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~-- 277 (469) .. ....-..+|-. ++ .+-.|.|....+.+-+++++++.-.....+..+....+. +++ T Consensus 272 RL-----------------~~~vg~pvpd~-LKp~y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~-~dla~ 332 (698) T protein:vir:10 272 RL-----------------HTIVSRPVGDM-LKPTYSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGIL-MDLAQ 332 (698) T ss_pred eE-----------------EEecCCCchhh-hcchhccCCccHHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHH-HHHHH Confidence 00 00000001111 00 122477878888888888887766666655444433221 111 Q ss_pred ---Cccc--ch---hhhhhhh-hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc--- Q lcl|NC_010179. 278 ---GGAS--LK---QFMNDLR-EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES--- 345 (469) Q Consensus 278 ---~~~~--~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--- 345 (469) ++.. .. +.....+ .++++.++. ..=+|-+.+.+.+.+...+.+...+|...+.+|-....+. T Consensus 333 aL~~g~~~~l~~R~eli~~~Rsn~G~~llDk------~~Eefeq~st~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPk 406 (698) T protein:vir:10 333 ALTPGANVDLSMRAELINRYRDNRNILFLDK------ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPT 406 (698) T ss_pred hcCChhhHHHHHHHHHHHHhcCccceEEEec------CCcceEEEecCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCc Confidence 0111 00 1111112 233333431 1236778888999999999999999999999998766553 Q ss_pred C-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHH------- Q lcl|NC_010179. 346 S-NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST------- 417 (469) Q Consensus 346 g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k------- 417 (469) | |.||++=..-|...+. +..+..+...+++++.+|..-.- ...+. 
++.++|++-...+++|.|++-.| T Consensus 407 GlNATGE~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~rS~~-G~idp-~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~ 482 (698) T protein:vir:10 407 GLNASSEGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQLSLF-GAVDP-SIKWQWNALRELDDLEVAEARYKQAQSDVL 482 (698) T ss_pred ccCccchhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhc-CCCCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHH Confidence 2 6799876666666555 45588899999999988865431 12333 68999999999999999987644 Q ss_pred --HhccCChHHHHHhC------CCC--CCHHHHH-----HHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 418 --VANYSSKEAVAKAN------PIV--DDWQQEL-----KDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 418 --l~g~iS~et~~~~l------~~v--~d~~~E~-----eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ..|+|+......+| +|. .|+..+- ..++.+.. ..+....+|..++ T Consensus 483 ~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~-------~~~~~~~~~~~~~ 542 (698) T protein:vir:10 483 YVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLT-------YVQRMAEGGDTGA 542 (698) T ss_pred HHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHh-------hhcCCcCCCCccc Confidence 14666666555544 221 1111110 00110000 0111122222222 No 174 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=96.70 E-value=0.0004 Score=39.23 Aligned_cols=382 Identities=14% Similarity=0.158 Sum_probs=170.4 Q ss_pred CCHHHH----------HHHHH----HHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccc Q lcl|NC_010179. 1 MELDAL----------KKLIR----NTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSN 66 (469) Q Consensus 1 ~~~~~~----------~~~i~----~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n 66 (469) |+.+-. .-+|- -.+.++ +.|++. |......- ... 
T Consensus 92 ~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~----~eyr~~-------~~~ia~e~---------------------~R~ 139 (695) T protein:vir:78 92 LDFNGTSMDALSFVTSSGFPGFPTLVLLAQL----PEYRAM-------HEVLADEC---------------------IRT 139 (695) T ss_pred hcccccccccchhhhccCcchHHHHHHHhhc----cchhhH-------HHHHHHHh---------------------hcc Confidence 332210 00110 011111 111111 10000000 001 Q ss_pred hHHHH--HHHHH---HhhhcCCeeeccC-chhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc------ Q lcl|NC_010179. 67 FYQLL--VDQEA---GYIASVFPDIDVG-KDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN------ 133 (469) Q Consensus 67 ~~k~i--v~~~~---~~l~g~p~~~~~~-~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~------ 133 (469) |.+.+ +...+ ++-.| ...... +.+..+.|..-+++ +..+.+.+..+++-.||.+..++-++.++. T Consensus 140 w~~~~~~~~e~~~~~g~~~~--~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL 217 (695) T protein:vir:78 140 WGEAIGGTKEKADTSGLAAG--GNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPL 217 (695) T ss_pred cceeccccchhhhhhccccc--ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCcccccccc Confidence 10000 00000 11111 111111 22333455555543 556778899999999999987776654431 Q ss_pred -----------eE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccc Q lcl|NC_010179. 134 -----------FR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPY 201 (469) Q Consensus 134 -----------~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (469) ++ +.+++|..+.|-.-+.. .+..- .||+ -++|... . ..+.. +. T Consensus 218 ~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~--dP~sp-dfgk------P~~y~V~---G-~kIH~--SR---------- 272 (695) T protein:vir:78 218 VPRPYTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVAD-DFYK------PSTWWMI---G-TEVHA--TR---------- 272 (695) T ss_pred ccccccccCcceeeeEeecccccccchhhhc--cchhh-ccCC------CceEEEe---c-eEEee--ee---------- Confidence 11 55566666655321100 00000 0000 0011000 0 00000 00 Q ss_pred cccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC---- Q lcl|NC_010179. 
202 NIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY---- 277 (469) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~---- 277 (469) .... .....|..+ =|. .+-.|.|....+.+-+++.+++.-.....+..+....+.. ++ T Consensus 273 -L~~f----------~g~plPd~L--Kp~----y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~-dla~~L 334 (695) T protein:vir:78 273 -LHTI----------VSRPVGDML--KPT----YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILM-DLAQAL 334 (695) T ss_pred -EEEe----------cCCCchhhh--hcc----cccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHHH-HHHHhh Confidence 0000 000000000 011 1224788888888888888887777666665544432211 11 Q ss_pred -Cccc--ch---hhhhhhh-hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C- Q lcl|NC_010179. 278 -GGAS--LK---QFMNDLR-EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S- 346 (469) Q Consensus 278 -~~~~--~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g- 346 (469) ++.. .. +.....+ .++++.++. ..=+|-+.+.+.+.+...+.+...+|...+++|-....+. | T Consensus 335 ~~g~~~~l~~R~eli~~~Rsn~G~~llDk------~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGl 408 (695) T protein:vir:78 335 MPGANVDLSMRAELINRYRDNRNILFLDK------ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGL 408 (695) T ss_pred cChhHHHHHHHHHHHHHhcCccceEEEec------CCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccc Confidence 0111 00 1111112 223333431 1236778888999999999999999999999998766553 2 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHH--------- Q lcl|NC_010179. 347 NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST--------- 417 (469) Q Consensus 347 ~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k--------- 417 (469) |.||++=..-|...+. +..+..+...+++++.+|..-.- ...+. 
++.++|++-...+++|.|++..| T Consensus 409 NATGE~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~rS~~-G~idp-di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~ 484 (695) T protein:vir:78 409 NASSEGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQLSLF-GAVDP-SIKWQWNALRELDDLEVAESRYKQAQSDVLYV 484 (695) T ss_pred cccchhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhc-CCCCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHH Confidence 6799876666666655 45588899999999998865431 12233 68999999999999999987643 Q ss_pred HhccCChHHHHHhC------CCC--CCHHHHHHHHHHHH-HHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 418 VANYSSKEAVAKAN------PIV--DDWQQELKDLAKDR-EENDPYANQADELNGKGVDDE 469 (469) Q Consensus 418 l~g~iS~et~~~~l------~~v--~d~~~E~eri~~E~-~~~~~~~~~~~~~~~~~~~de 469 (469) ..|+|+......++ ++. -|+..+=-...+.. ....+.-+...+.++.|...+ T Consensus 485 ~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (695) T protein:vir:78 485 QEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG 545 (695) T ss_pred HhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCC Confidence 14666666655553 221 11110000000000 000011111111111111111 No 175 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=96.68 E-value=0.00042 Score=39.12 Aligned_cols=386 Identities=13% Similarity=0.128 Sum_probs=169.9 Q ss_pred CCHHHH----------HHHHH----HHHHHHHHHHHHHHH-HHHHhccCCcccccccchhhhcccccccccccCcceecc Q lcl|NC_010179. 1 MELDAL----------KKLIR----NTSTSRNDLINNYKK-SVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPS 65 (469) Q Consensus 1 ~~~~~~----------~~~i~----~~~~~~~~~~~~~~~-~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~ 65 (469) |+.+-. .-+|- -.+.++-+-...... ...-++--. ++.. 
T Consensus 92 ~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~--------------------------~~~~ 145 (695) T protein:vir:36 92 LDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWG--------------------------EAIG 145 (695) T ss_pred hcccccccccchhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccc--------------------------eecc Confidence 332210 00110 011111110000000 000011000 0000 Q ss_pred chHHHHHHHHHHhhhcCCeeeccC-chhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc---------- Q lcl|NC_010179. 66 NFYQLLVDQEAGYIASVFPDIDVG-KDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN---------- 133 (469) Q Consensus 66 n~~k~iv~~~~~~l~g~p~~~~~~-~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~---------- 133 (469) +-...+-+ .++-.| ...... +.+..+.|..-+++ +..+.+.+..+++-.||.+..++-++.++. T Consensus 146 ~~~e~~~~--~g~~~~--~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l~~PL~~~~ 221 (695) T protein:vir:36 146 GTKEKADT--SGLAAG--GNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRP 221 (695) T ss_pred cchhhhhh--cccccc--ccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCcccccccccccc Confidence 00000000 011111 111111 22334555555543 456778899999999999987776654431 Q ss_pred -------eE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccc Q lcl|NC_010179. 134 -------FR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIIT 205 (469) Q Consensus 134 -------~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (469) ++ +.+++|..+.|-.-+.. .+.. -.+|.+....... ..+....... T Consensus 222 ~~I~kGslKGl~ViDp~~vtP~~~n~~--dP~s------------------pdfgkP~~y~V~G------~kIH~SRL~~ 275 (695) T protein:vir:36 222 YTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVA------------------DDFYKPSTWWMIG------TEVHATRLHT 275 (695) T ss_pred ccccCcceeeeEeecccccccchhhhc--cchh------------------hccCCCceEEEec------eEEeeeeEEE Confidence 11 55566666655311100 0000 0011111000000 0000000000 Q ss_pred cccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC-----Ccc Q lcl|NC_010179. 
206 SYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY-----GGA 280 (469) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~-----~~~ 280 (469) . .....|..+ =|. .+-.|.|....+.+-+++.+++.-.....+..+....+.. ++ ++. T Consensus 276 f----------~g~plPd~L--Kp~----y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~-dla~aL~~g~ 338 (695) T protein:vir:36 276 I----------VSRPVGDML--KPT----YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILM-DLAQALMPGA 338 (695) T ss_pred e----------cCCCchhhh--hcc----cccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhHHHHHH-HHHHhhcChh Confidence 0 000000000 011 1224778788888888888887766666665444332211 10 011 Q ss_pred c--ch---hhhhhhh-hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C-CccH Q lcl|NC_010179. 281 S--LK---QFMNDLR-EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S-NASG 350 (469) Q Consensus 281 ~--~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-~~Sg 350 (469) . .. +.....+ .++++.++. ..=+|-+.+.+.+.+...+.+...+|...+++|-....+. | |.|| T Consensus 339 ~~~l~~R~eli~~~Rsn~G~~llDk------~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATG 412 (695) T protein:vir:36 339 NVDLSMRAELINRYRDNRNILFLDK------ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASS 412 (695) T ss_pred HHHHHHHHHHHHHhcCccceEEEec------CCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccc Confidence 1 00 1111112 233333431 1236778888999999999999999999999998766553 2 6799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHH---------Hhcc Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST---------VANY 421 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k---------l~g~ 421 (469) ++=..-|...+. +..+..+...+++++.+|..-.- ...+. 
++.++|++-...+++|.|+...| ..|+ T Consensus 413 E~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~rS~~-G~idp-di~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gv 488 (695) T protein:vir:36 413 EGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQLSLF-GAVDP-SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQV 488 (695) T ss_pred hhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhc-CCCCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcC Confidence 876666666655 45588899999999998865431 12333 68999999999999999987643 1466 Q ss_pred CChHHHHHhC------CCC--CCHHHHHHHHHHHH-HHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 422 SSKEAVAKAN------PIV--DDWQQELKDLAKDR-EENDPYANQADELNGKGVDDE 469 (469) Q Consensus 422 iS~et~~~~l------~~v--~d~~~E~eri~~E~-~~~~~~~~~~~~~~~~~~~de 469 (469) |+......++ ++. -|+..+=-...+.. ....+.-+...+.++.|...+ T Consensus 489 I~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (695) T protein:vir:36 489 IRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG 545 (695) T ss_pred CCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCc Confidence 6666655553 221 11110000000000 000011111111111111112 No 176 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=96.65 E-value=0.00044 Score=38.98 Aligned_cols=386 Identities=14% Similarity=0.122 Sum_probs=170.1 Q ss_pred CCHHHH----------HHHHH----HHHHHHHHHHHHHHH-HHHHhccCCcccccccchhhhcccccccccccCcceecc Q lcl|NC_010179. 1 MELDAL----------KKLIR----NTSTSRNDLINNYKK-SVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPS 65 (469) Q Consensus 1 ~~~~~~----------~~~i~----~~~~~~~~~~~~~~~-~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~ 65 (469) |+..-+ .-+|- -.+.++-+-+..... ...-++--. ++.. 
T Consensus 91 ~~~~~~~~~~l~~~~~~~F~Gy~~la~laQ~~eyr~~~~~ia~e~~R~w~--------------------------~~~~ 144 (694) T protein:vir:10 91 LDFNGTSMDALSFVTSSGFPGFPTLVLLAQLPEYRAMHEVLADECIRTWG--------------------------EAIG 144 (694) T ss_pred hccCcccccchhhhhccCcchHHHHHHHhhccchhhHHHHHHHHhhcccc--------------------------eecc Confidence 222110 00110 011111110000000 000011000 0000 Q ss_pred chHHHHHHHHHHhhhcCCeeeccC-chhhHHHHHHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc---------- Q lcl|NC_010179. 66 NFYQLLVDQEAGYIASVFPDIDVG-KDADNKKILDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN---------- 133 (469) Q Consensus 66 n~~k~iv~~~~~~l~g~p~~~~~~-~~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~---------- 133 (469) +-...+- ..++-.|. ..... +.+..+.|..-+++ +..+.+.+..+++-.||.+..++-++.++. T Consensus 145 ~~~e~~~--~~g~~~~~--~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l~~PL~~~~ 220 (694) T protein:vir:10 145 GTKEKAD--TSGLAAGG--NAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIMDTPLVPRP 220 (694) T ss_pred ccchhhh--hhcccccc--cccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCcccccccccccc Confidence 0000000 00111111 11111 22333455555543 556778899999999999987776644331 Q ss_pred -------eE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccc Q lcl|NC_010179. 134 -------FR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIIT 205 (469) Q Consensus 134 -------~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (469) ++ +.+++|..+.|-.-+.. .+..- .||+ -++|... . ..++. +. ... T Consensus 221 ~~I~kGslKGl~ViDp~~vtP~~~n~~--dP~sp-dfgk------P~~y~V~---G-~~IH~--SR-----------L~~ 274 (694) T protein:vir:10 221 YTVPKGSFQGLRVVEPYWVTPNNYNSI--NPVAD-DFYK------PSTWWMI---G-TEVHA--TR-----------LHT 274 (694) T ss_pred ccccCcceeeeEeecccccccchhhhc--cchhh-ccCC------CceEEEe---c-eEEee--ee-----------EEE Confidence 11 55566666655311100 00000 0000 0011000 0 00000 00 000 Q ss_pred cccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC-----Ccc Q lcl|NC_010179. 
206 SYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY-----GGA 280 (469) Q Consensus 206 ~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~-----~~~ 280 (469) . .....|..+ =|. .+-.|.|....+.+-+++.+++.-.....+..++...+.. ++ ++. T Consensus 275 f----------~g~plPd~L--Kp~----y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~-dla~~L~~g~ 337 (694) T protein:vir:10 275 I----------VSRPVGDML--KPT----YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILM-DLAQALMPGA 337 (694) T ss_pred e----------cCCCchhhh--hcc----cccCcccHHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHHH-HHHHhhcChh Confidence 0 000000000 011 1224778888888888888887777666665444432211 10 011 Q ss_pred c--ch---hhhhhhh-hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc---C-CccH Q lcl|NC_010179. 281 S--LK---QFMNDLR-EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES---S-NASG 350 (469) Q Consensus 281 ~--~~---~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~---g-~~Sg 350 (469) . .. +.....+ .++++.++. ..=+|-+.+.+.+.+...+.+...+|...+++|-....+. | |.|| T Consensus 338 ~~~l~~R~eli~~~Rsn~G~~llDk------~~Eefeq~stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATG 411 (694) T protein:vir:10 338 NVDLSMRAELINRYRDNRNILFLDK------ATEEFFQFNTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASS 411 (694) T ss_pred HHHHHHHHHHHHHhcCccceEEEec------CCcceEEEecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccc Confidence 1 00 1111112 233333431 1236778888999999999999999999999998766553 2 6799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHH---------Hhcc Q lcl|NC_010179. 351 VAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVST---------VANY 421 (469) Q Consensus 351 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~k---------l~g~ 421 (469) ++=..-|...+. +..+..+...+++++.+|..-.- ...+. 
++.++|++-...+++|.|++..| ..|+ T Consensus 412 E~D~rnYYD~I~--s~Qe~~L~p~L~rl~~ii~rS~~-G~idp-~i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gv 487 (694) T protein:vir:10 412 EGEIRVWYDYVR--AYQRNALQQLMNDVIVMIQLSLF-GAVDP-SIKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQV 487 (694) T ss_pred hhhHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhc-CCCCC-cceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcC Confidence 876666666655 45588899999999988865431 12333 68999999999999999987643 1466 Q ss_pred CChHHHHHhC------CCC--CCHHHHHHHHHHHH-HHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 422 SSKEAVAKAN------PIV--DDWQQELKDLAKDR-EENDPYANQADELNGKGVDDE 469 (469) Q Consensus 422 iS~et~~~~l------~~v--~d~~~E~eri~~E~-~~~~~~~~~~~~~~~~~~~de 469 (469) |+......++ ++. -|+..+=-...+.. ....+.-+...+.++.|...+ T Consensus 488 I~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 544 (694) T protein:vir:10 488 IRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG 544 (694) T ss_pred CCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCc Confidence 6666655553 221 11110000000000 000011111111111111111 No 177 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=96.63 E-value=0.00045 Score=38.92 Aligned_cols=397 Identities=10% Similarity=0.005 Sum_probs=153.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccc---cCcceeccchHHHHHHHHHHhhhcCCee Q lcl|NC_010179. 9 LIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLR---SADNRIPSNFYQLLVDQEAGYIASVFPD 85 (469) Q Consensus 9 ~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~---~~~~ri~~n~~k~iv~~~~~~l~g~p~~ 85 (469) +....+... ...+.+....=.-+.+.+.. ...... ...+.+ .+...-+.+....+|+..+.-+.+-|+. 
T Consensus 1 ~~~~~~~i~--s~~~~~~i~~~~~~s~~~~~-----~~~~~~-~~pp~~~~~la~l~~~n~~v~scI~~ia~~IA~l~~~ 72 (542) T protein:vir:41 1 MFNYHLSIR--SLEKYKAIKREEVESQALGE-----TRFEEY-VEPKVNPLVLLSLLQVNPYHASACSIKANDIIRTGYI 72 (542) T ss_pred Ccccccccc--ccccchhhhhcccccccccc-----ccCCcc-ccCCCCHHHHHHHHhhcHHHHHHHHHHHHHHhhCcee Confidence 111111000 00001110000000000000 000000 000000 0000012345566778888888888887 Q ss_pred eccCchhhHHHHHHHHhcc---HHHHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCceEEEEE Q lcl|NC_010179. 86 IDVGKDADNKKILDVLGDD---RALTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKLLGVLR 161 (469) Q Consensus 86 ~~~~~~~~~~~l~~~~~~n---~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~~~v~ 161 (469) +...+.. .+..++-+. ..+....+..+...+|.+|+.+-.+.+|++. +.+++|..+.+..|... . +. T Consensus 73 ~~~~~~~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~~d~~~---~---~~ 143 (542) T protein:vir:41 73 LEGDDEG---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVHKDGSR---Y---RQ 143 (542) T ss_pred eecccch---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEEEcCCe---e---Ee Confidence 7544332 244444332 2334456778899999999999889988865 88889888877655321 0 11 Q ss_pred EEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCC----- Q lcl|NC_010179. 162 SYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN----- 236 (469) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~----- 236 (469) . . ++...+.... |....... ...+ .....+..=-|+||++. T Consensus 144 ~---~--~~~~~~~~~~-y~~~~~~~--~~~g--------------------------~~~~~~~~~eIiHir~~~~~~~ 189 (542) T protein:vir:41 144 T---W--DGVNITHFKD-YRYEGEIN--PETG--------------------------EDQDSVGANELVFIHIPSPVCS 189 (542) T ss_pred e---e--cCCcceeEEe-eccccccc--cccc--------------------------ccccccCcccEEEecCCCCCCC Confidence 1 1 1111111111 11100000 0000 00000111124555532 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE--EecCCccc-----------chhhhhhhh---------hcce Q lcl|NC_010179. 
237 KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV--LTNYGGAS-----------LKQFMNDLR---------EYKS 294 (469) Q Consensus 237 ~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~--~~g~~~~~-----------~~~~~~~~~---------~~~~ 294 (469) ..|.|.+......+.....+..-..+.+...+.|-.+ +.|...+. .+.....+. ..+. T Consensus 190 ~~Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~ 269 (542) T protein:vir:41 190 YYGVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTP 269 (542) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCce Confidence 2467777666655555444443344444555556444 34421111 011111111 1223 Q ss_pred eeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 295 IKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLELKAAKTQTY 370 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~~k~~~~~~~ 370 (469) +.+...+ +...+++|..... ....+.+..+...+.|+..-++|+.-.... +..++.-++... ... T Consensus 270 ~vL~~~~-~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~~----------~~f 338 (542) T protein:vir:41 270 LVFSIPG-GDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVTR----------RTY 338 (542) T ss_pred eEeeccC-CcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHHH----------HHH Confidence 3332211 1223455544333 344556666777888999888887533211 111111111111 122 Q ss_pred HHHHHHHHHHHHHHHhccc---CCCcccceEEeCCC--CCCCHHHHHHHHHHHhccCChHHHHHhCCCCC---CHH---- Q lcl|NC_010179. 371 FEHAINELVRAIMRYLNFS---DADKRHISQHWTRT--KVEDSLTKAQIVSTVANYSSKEAVAKANPIVD---DWQ---- 438 (469) Q Consensus 371 ~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~--~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~---d~~---- 438 (469) +...|.-+++.+...++.. ... ..+.+.|+.. 
+..|..+.++.+ -.+|+++...+.+.|+.++ |+- T Consensus 339 ~~~tL~P~~~~ie~~ln~~L~~~~~-~~~~~~f~~~~ll~~d~~~~~~~~-v~~GilT~NE~Re~L~g~~pgdd~~l~p~ 416 (542) T protein:vir:41 339 YESVVRPQQNIISSILTDFFQVKFN-PKTRFKFNDETLLESDSVRNCALL-VQSGVLTPAEARERLFGLDGGPDIFMVPS 416 (542) T ss_pred HHHHHHHHHHHHHHHHHhhcccccC-CceEEEecchhhcchHHHHHHHHH-HhCCCCCHHHHHHhhCCCCCCCccccccc Confidence 2333333333333333321 111 2345666533 333433333322 1368999888877664332 221 Q ss_pred -HHHHHHHHHHH--------HhhhhHhh----ccc---------CCCCCCCCC Q lcl|NC_010179. 439 -QELKDLAKDRE--------ENDPYANQ----ADE---------LNGKGVDDE 469 (469) Q Consensus 439 -~E~eri~~E~~--------~~~~~~~~----~~~---------~~~~~~~de 469 (469) ...+.++..+. +..+...+ +++ ...++.++. T Consensus 417 ~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~~~~~~~~~~ 469 (542) T protein:vir:41 417 KGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAEEKKKKIDES 469 (542) T ss_pred cccccccccCCcCCCCCchhhhhhcccccCccccccccccccchhhcccccch Confidence 00011110000 00000000 000 000111111 No 178 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=96.63 E-value=0.00045 Score=38.92 Aligned_cols=381 Identities=10% Similarity=0.008 Sum_probs=159.9 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHHhccCCcccccccchhh--hccccccccccc--CcceeccchHHHHHHHHHHhhhcCC Q lcl|NC_010179. 9 LIRNTSTSRNDLI-NNYKKSVDYYENKTDITTRNNGKPK--VSKEGKKDPLRS--ADNRIPSNFYQLLVDQEAGYIASVF 83 (469) Q Consensus 9 ~i~~~~~~~~~~~-~~~~~~~~Yy~g~~~i~~~~~~~~~--~~~~~~~~~~~~--~~~ri~~n~~k~iv~~~~~~l~g~p 83 (469) ++--+. -|...- .-+-....+|+.+. ...+...... ....+....... 
+.+-+.+.=....|+..++-+-+-| T Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~lf~~~~-~~~~~~~~~~~~~~~~~~~~~~~~vs~~~al~~~~v~~cv~~Ia~~iA~lp 78 (424) T protein:vir:45 1 MLYCWW-AHWLWPEGGRVLLDALFRSKS-LENPSTPITGDAVDTDGLFRADVYVSPETAMKLAAVYSCIYVLSSSLAQMP 78 (424) T ss_pred CeeEee-eceecCcchhHHHHhhccccC-CCCCccccchhhhhhhccccCCceechHHhhccHHHHHHHHHHHHHHhhCc Confidence 110000 000000 00001111122111 0000000000 000000000000 0000111112234555555555556 Q ss_pred eeecc-Cch----hhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCC Q lcl|NC_010179. 84 PDIDV-GKD----ADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATT 151 (469) Q Consensus 84 ~~~~~-~~~----~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~ 151 (469) +.+-- .+. .....+..++.. | .+ +....+..+++.+|.+|+.+-.+..|++ .+.+++|..+.+..+.+ T Consensus 79 ~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~i~~~~~ 158 (424) T protein:vir:45 79 LHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWETTLMNTGG 158 (424) T ss_pred eEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEEcCC Confidence 65421 111 112234555532 3 22 2234566788999999999888888886 48888888876654321 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) ++. |......+. ..+.+..+.+++.-. T Consensus 159 ---~~~-----y~~~~~~~~------~~~~~~eVih~r~~~--------------------------------------- 185 (424) T protein:vir:45 159 ---RYT-----YGLYNEYGA------FAISPDDMIHIRALG--------------------------------------- 185 (424) T ss_pred ---eEE-----EEEEecCce------EEECcccEEEecCcC--------------------------------------- Confidence 111 111111110 012222222221100 Q ss_pred EecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccch---hhhhhhhh---------cceeeecc Q lcl|NC_010179. 
232 EFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLK---QFMNDLRE---------YKSIKINN 299 (469) Q Consensus 232 ~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~---~~~~~~~~---------~~~~~~~~ 299 (469) .+...|.|.++-+...|+.......-..+.+...+.|-.+++-....+.+ .....+.. .+++.++ T Consensus 186 --~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~l~~e~~~~~~~~~~~~~~g~~~n~g~~~vl~- 262 (424) T protein:vir:45 186 --NNQKMGLSPIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKSGLNKESWGWLKDQWQKASQALRRQENKTMLLP- 262 (424) T ss_pred --CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHHHHHHhccccccCCceeEcC- Confidence 01224667676666666555444444455566667776666542221111 11111110 1122222 Q ss_pred cCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLELKAAKTQTYFEHAI 375 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l 375 (469) .+++|.....+ ...+.+..+.....|.+.-++|+.-.... ++-|+ ++. .....+...| T Consensus 263 ------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn--~eq----------~~~~f~~~tL 324 (424) T protein:vir:45 263 ------ADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSN--ISA----------QAIQFVRYTM 324 (424) T ss_pred ------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCccc--HHH----------HHHHHHHHHH Confidence 23455444443 33455666777788888888887533222 22122 111 1123334455 Q ss_pred HHHHHHHHHHhcccCC---C-cccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHH Q lcl|NC_010179. 376 NELVRAIMRYLNFSDA---D-KRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLA 445 (469) Q Consensus 376 ~~~~~~i~~~~~~~~~---~-~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~ 445 (469) .-.++.|...++.+=. + .....+.| ..-+-.|.++.++.+.++ +|+++.-++.+.++. +++-+.-+.... 
T Consensus 325 ~P~~~~ie~~ln~kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R~~~gl~pi~ggD~~~~~~n 404 (424) T protein:vir:45 325 MPWVTNWEQELNRRLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEARAFEDMNPVEGLDEMLVSVN 404 (424) T ss_pred HHHHHHHHHHHHHhcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeeccc Confidence 5555555544442211 1 11223444 444567899999999887 579999998888754 232111111110 Q ss_pred HHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 446 KDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 446 ~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ...+...+..+..+++..|| T Consensus 405 ----~~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 405 ----AANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred ----ccccccccCCCCCCCCCCCC Confidence 11122333444455555555 No 179 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=96.58 E-value=0.0005 Score=38.69 Aligned_cols=380 Identities=9% Similarity=0.051 Sum_probs=165.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhc-ccccccccccCcce--eccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVS-KEGKKDPLRSADNR--IPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~-~~~~~~~~~~~~~r--i~~n~~k~iv~~~~~ 77 (469) |+-. ++......+.....+++..+.|............... ........ ..+.+ +.+.-.-..|+..++ T Consensus 1 ~~~~-------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEP-------KYTIDLRTNNGWWARLQSWFVGGRLVTPNQGSQTGPVSAHGHLGDS-SINDERILQISTVWRCVSLIST 72 (424) T ss_pred CCCC-------cceEeecCCCchHHHHHhhhcccccccccccccccccccccccccc-cccHHHhhccHHHHHHHHHHHH Confidence 2111 1111112233344445555554321110000000000 00000000 00000 111112234555555 Q ss_pred hhhcCCeee-ccC-ch-----hhHHHHHHHHhc--cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccce Q lcl|NC_010179. 
78 YIASVFPDI-DVG-KD-----ADNKKILDVLGD--DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQ 143 (469) Q Consensus 78 ~l~g~p~~~-~~~-~~-----~~~~~l~~~~~~--n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~ 143 (469) -+-+-|+.+ ..+ +. .....+..++.. |. .+....+..+++.+|.+|+++-.+.+|++ .+.+++|.. T Consensus 73 ~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~pl~~~~ 152 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) T ss_pred hhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcc Confidence 555556654 111 11 112335555542 32 22334567788999999999988888886 488889998 Q ss_pred eEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccc Q lcl|NC_010179. 144 ITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKH 223 (469) Q Consensus 144 ~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (469) +.+..+.+ .+. |... .+| .. ..+....+.+++.-. T Consensus 153 V~v~~~~~---~~~-----y~~~-~~g-~~----~~~~~~eIih~r~~~------------------------------- 187 (424) T protein:vir:18 153 MDVKLVGK---KVV-----YRYQ-RDS-EY----ADFSQKEIFHLKGFG------------------------------- 187 (424) T ss_pred eEEEEcCC---eEE-----EEEE-eCC-eE----EEeccccEEEecCcC------------------------------- Confidence 87755432 111 1111 111 11 012223333221100 Q ss_pred cCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh----h-------hc Q lcl|NC_010179. 224 NFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL----R-------EY 292 (469) Q Consensus 224 ~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~----~-------~~ 292 (469) .+...|.|-++.+...++..........+.+...+.|-.++........++....+ . .. 
T Consensus 188 ----------~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~~l~~e~~~~~~~~~~~~~~g~nag 257 (424) T protein:vir:18 188 ----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKK 257 (424) T ss_pred ----------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccC Confidence 01124667666666666655555555555567777776666543221122221111 1 11 Q ss_pred ceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 293 KSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLELKAAKTQTY 370 (469) Q Consensus 293 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~~k~~~~~~~ 370 (469) +++.++. +.+++.++.......+.+..+...+.|++.-++|+.-.... ++..|..++.... .. T Consensus 258 ~~~vl~~-----g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~----------~f 322 (424) T protein:vir:18 258 RLWILEA-----GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GF 322 (424) T ss_pred CceeccC-----CceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH----------HH Confidence 2222221 12334443333344556666777888999888887533222 2222232222221 22 Q ss_pred HHHHHHHHHHHHHHHhccc-----CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHH- Q lcl|NC_010179. 371 FEHAINELVRAIMRYLNFS-----DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQE- 440 (469) Q Consensus 371 ~~~~l~~~~~~i~~~~~~~-----~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E- 440 (469) +..+|.-.++.|...++.+ +..-..+++.+..-+..|.++.++.+.++ +|+++.-++.+.++. +++-+.- T Consensus 323 ~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGD~~~ 402 (424) T protein:vir:18 323 LQYTLQPYISRWENSIQRWLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTINEMRRTDNLPPLPGGDVAM 402 (424) T ss_pred HHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Confidence 3344444555554444321 11222344445555678999999999887 688998888877643 2221110 Q ss_pred ----HHHHHHHHHHhhhhHhhcccCCCCCC Q lcl|NC_010179. 
441 ----LKDLAKDREENDPYANQADELNGKGV 466 (469) Q Consensus 441 ----~eri~~E~~~~~~~~~~~~~~~~~~~ 466 (469) +..+.. ... ..+..+.|. T Consensus 403 ~~~n~~~l~~-------~~~-~~~p~~~ga 424 (424) T protein:vir:18 403 RQSQYVPITD-------LGT-NKEPRNNGA 424 (424) T ss_pred eccCccchHh-------hhc-cCCCccCCC Confidence 011110 000 111111111 No 180 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=96.49 E-value=0.00057 Score=38.37 Aligned_cols=380 Identities=9% Similarity=0.040 Sum_probs=164.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhccc-ccccccccCcce--eccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKE-GKKDPLRSADNR--IPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~-~~~~~~~~~~~r--i~~n~~k~iv~~~~~ 77 (469) |+-.+.. .....+.....+++..+.|................. ...-.. ..+.+ +.++-....|+..+. T Consensus 1 ~~~~~~~-------~~~~~~~g~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~al~~~~v~~cv~~Ia~ 72 (424) T protein:vir:18 1 MEEPKYT-------IDLRTNNGWWARLKSWFVGGRLVTPNQGSQTGPVSAHGYLGDS-SINDERILQISTVWRCVSLIST 72 (424) T ss_pred CCCCccc-------cccCCCCchHHHHHhhccccccccccchhhccccccccccccc-cccHHHhhccHHHHHHHHHHHH Confidence 3322111 111112233344455565542111110000000000 000000 00001 111122234555555 Q ss_pred hhhcCCeee-ccC-ch---h--hHHHHHHHHhc--cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccce Q lcl|NC_010179. 78 YIASVFPDI-DVG-KD---A--DNKKILDVLGD--DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQ 143 (469) Q Consensus 78 ~l~g~p~~~-~~~-~~---~--~~~~l~~~~~~--n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~ 143 (469) -+-+-|+.+ ... +. . ....+..++.. |. .+-...+..+++.+|.+|+++-.+..|++ .+.+++|.. 
T Consensus 73 ~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~ 152 (424) T protein:vir:18 73 LTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLPLQSAN 152 (424) T ss_pred hhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCcc Confidence 555556654 111 11 1 22345565542 32 22334567788999999999988888875 488888988 Q ss_pred eEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccc Q lcl|NC_010179. 144 ITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKH 223 (469) Q Consensus 144 ~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (469) +.+..+.+ .+. |.... +|.. . .+....+.+++.-. T Consensus 153 v~v~~~~~---~~~-----y~~~~-~g~~----~-~~~~~eVihir~~~------------------------------- 187 (424) T protein:vir:18 153 MDVKLVGK---KVV-----YRYQR-DSEY----A-DFSQKEIFHLKGFG------------------------------- 187 (424) T ss_pred eEEEEcCC---eEE-----EEEEe-CCeE----E-EeccccEEEecCcC------------------------------- Confidence 87755432 111 11111 1110 0 12222232221100 Q ss_pred cCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh----hh-------c Q lcl|NC_010179. 224 NFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL----RE-------Y 292 (469) Q Consensus 224 ~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~----~~-------~ 292 (469) .+...|.|-++.+...+...........+.+...+.|-.+++-......++....+ .. . T Consensus 188 ----------~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~nag 257 (424) T protein:vir:18 188 ----------FTGLVGLSPIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGEKVLTEQQRSQVEENFKEIAGGPVKK 257 (424) T ss_pred ----------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCcCCCHHHHHHHHHHHHHHhCCcccC Confidence 01123566666555555554444444555566666775555532221122221111 11 1 Q ss_pred ceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
293 KSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTY 370 (469) Q Consensus 293 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~ 370 (469) +++.++. +.+++-++.......+.+..+.....|+..-++|+.-... .++.+|..++.... .. T Consensus 258 ~~~vl~~-----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~eq~~~----------~f 322 (424) T protein:vir:18 258 RLWILEA-----GFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSGIEQQNL----------GF 322 (424) T ss_pred CceeccC-----CceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccccHHHHHH----------HH Confidence 1222221 1233333333334455666677778898888888643322 22333333332222 22 Q ss_pred HHHHHHHHHHHHHHHhccc-----CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHH- Q lcl|NC_010179. 371 FEHAINELVRAIMRYLNFS-----DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQE- 440 (469) Q Consensus 371 ~~~~l~~~~~~i~~~~~~~-----~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E- 440 (469) +..+|.-+++.|...++.+ +.....+++.++.-+..|.++.++.+.++ +|+++.-++.++++. +++-++- T Consensus 323 ~~~tl~P~~~~ie~~ln~~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~ 402 (424) T protein:vir:18 323 LQYTLQPYISRWENSIQRWLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRTINEMRRTDNMPPLPGGDVAM 402 (424) T ss_pred HHHHHHHHHHHHHHHHHhhcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeee Confidence 3344444444444444321 11223345555666778999999999887 689999888888753 2221110 Q ss_pred ----HHHHHHHHHHhhhhHhhcccCCCCCC Q lcl|NC_010179. 441 ----LKDLAKDREENDPYANQADELNGKGV 466 (469) Q Consensus 441 ----~eri~~E~~~~~~~~~~~~~~~~~~~ 466 (469) +..+.. .... .+....|. 
T Consensus 403 ~~~n~~~l~~-------~~~~-~~~~~n~a 424 (424) T protein:vir:18 403 RQAQYVPITD-------LGTN-KEPRNNGA 424 (424) T ss_pred eccCccchhh-------hhcc-CCccccCC Confidence 111110 0000 11111111 No 181 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=96.48 E-value=0.00059 Score=38.30 Aligned_cols=386 Identities=12% Similarity=0.069 Sum_probs=158.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-cccccccchh---hhccccccccc-ccCc---ceeccchHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKT-DITTRNNGKP---KVSKEGKKDPL-RSAD---NRIPSNFYQLLV 72 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~-~i~~~~~~~~---~~~~~~~~~~~-~~~~---~ri~~n~~k~iv 72 (469) |+.+-= +.....+...--+...+. .+........ ........... .... .|.+. .-..| T Consensus 11 ~~~~~~-----------~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~--V~~cv 77 (441) T protein:vir:94 11 VDFKSR-----------KQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSD--IFTAV 77 (441) T ss_pred cccccc-----------ccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHH--HHHHH Confidence 111100 000000010000111000 0000000000 00000000000 0000 01110 11124 Q ss_pred HHHHHhhhcCCeeeccCch-hhHHHHHHHHhc--cH-H---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEcccee Q lcl|NC_010179. 73 DQEAGYIASVFPDIDVGKD-ADNKKILDVLGD--DR-A---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQI 144 (469) Q Consensus 73 ~~~~~~l~g~p~~~~~~~~-~~~~~l~~~~~~--n~-~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~ 144 (469) +..++-+.+-|+.+.-+.. .....+..++.. |. 
+ +....+...++.+|.+|+.+-.+.+|++ .+.+++|..+ T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:94 78 MMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred HHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 4444444455655432211 122334444432 32 2 2234566778999999999988988986 4899999999 Q ss_pred EEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccccc Q lcl|NC_010179. 145 TPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (469) Q Consensus 145 ~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (469) .+..++. ..+.+.+. ..+..+... ...+....+.+++.- + T Consensus 158 ~v~~d~~--g~~~~~~~---~~~~~~~~~---~~~~~~~dvih~k~~--------------------------------~ 197 (441) T protein:vir:94 158 ELKSDAR--GRLYYFHQ---RIDSNGNNI---ERNVKFEDMLDIKFY--------------------------------S 197 (441) T ss_pred EEEECCC--ccEEEEEE---EeccCCcee---EEEEccccEEEeccC--------------------------------C Confidence 9888753 23222111 111111111 111222222222110 0 Q ss_pred CCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhh----hhhhh--------c Q lcl|NC_010179. 225 FGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFM----NDLRE--------Y 292 (469) Q Consensus 225 ~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~----~~~~~--------~ 292 (469) + +.-.|.|-++.+...++.......-..+.++..+.|-.++.-......++.. ..+.. . 
T Consensus 198 ~---------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag 268 (441) T protein:vir:94 198 L---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAG 268 (441) T ss_pred C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccC Confidence 0 0124677777666666655555555556667777776665432211111111 11111 1 Q ss_pred ceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 293 KSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQT 369 (469) Q Consensus 293 ~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~ 369 (469) +++.++ ++.+|-.... ....+.+..+...+.|+..-++|+.-.... ++.|-+.... T Consensus 269 ~~~vl~-------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~-------------- 327 (441) T protein:vir:94 269 KVVVLD-------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL-------------- 327 (441) T ss_pred cceecC-------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH-------------- Confidence 122222 2234444333 334556666777888988888887533211 1112111111 Q ss_pred HHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHH Q lcl|NC_010179. 370 YFEHAINELVRAIMRYLNFS---DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELK 442 (469) Q Consensus 370 ~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~e 442 (469) .|...|.-+++.+...++.+ ......+++.++.-+-.|.++.++.+.++ +|+++.-++.++++. +++.+..+- T Consensus 328 ~~~~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~ 407 (441) T protein:vir:94 328 DYLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIH 407 (441) T ss_pred HHHHHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceE Confidence 12223333333333333321 11122333333444556889999988886 689999998887654 333222111 Q ss_pred H-----HHHHHHHhhh--hHhhcccCCCCCCCCC Q lcl|NC_010179. 
443 D-----LAKDREENDP--YANQADELNGKGVDDE 469 (469) Q Consensus 443 r-----i~~E~~~~~~--~~~~~~~~~~~~~~de 469 (469) . +..+.....+ .....+....+|.++| T Consensus 408 ~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 408 RVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred eecccccccccccccccccccccccccCCCCCCC Confidence 1 1111111101 0111122233444444 No 182 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=96.48 E-value=0.00059 Score=38.30 Aligned_cols=386 Identities=12% Similarity=0.069 Sum_probs=158.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-cccccccchh---hhccccccccc-ccCc---ceeccchHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKT-DITTRNNGKP---KVSKEGKKDPL-RSAD---NRIPSNFYQLLV 72 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~-~i~~~~~~~~---~~~~~~~~~~~-~~~~---~ri~~n~~k~iv 72 (469) |+.+-= +.....+...--+...+. .+........ ........... .... .|.+. .-..| T Consensus 11 ~~~~~~-----------~~~~~~~~~~~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~--V~~cv 77 (441) T protein:vir:79 11 VDFKSR-----------KQSRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSD--IFTAV 77 (441) T ss_pred cccccc-----------ccchhhhhccccccccccccccCCCcchHHHHHHhcccCcccccccchhhhhccHH--HHHHH Confidence 111100 000000010000111000 0000000000 00000000000 0000 01110 11124 Q ss_pred HHHHHhhhcCCeeeccCch-hhHHHHHHHHhc--cH-H---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEcccee Q lcl|NC_010179. 73 DQEAGYIASVFPDIDVGKD-ADNKKILDVLGD--DR-A---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQI 144 (469) Q Consensus 73 ~~~~~~l~g~p~~~~~~~~-~~~~~l~~~~~~--n~-~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~ 144 (469) +..++-+.+-|+.+.-+.. .....+..++.. |. 
+ +....+...++.+|.+|+.+-.+.+|++ .+.+++|..+ T Consensus 78 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v 157 (441) T protein:vir:79 78 MMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEI 157 (441) T ss_pred HHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcee Confidence 4444444455655432211 122334444432 32 2 2234566778999999999988988986 4899999999 Q ss_pred EEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccccc Q lcl|NC_010179. 145 TPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (469) Q Consensus 145 ~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (469) .+..++. ..+.+.+. ..+..+... ...+....+.+++.- + T Consensus 158 ~v~~d~~--g~~~~~~~---~~~~~~~~~---~~~~~~~dvih~k~~--------------------------------~ 197 (441) T protein:vir:79 158 ELKSDAR--GRLYYFHQ---RIDSNGNNI---ERNVKFEDMLDIKFY--------------------------------S 197 (441) T ss_pred EEEECCC--ccEEEEEE---EeccCCcee---EEEEccccEEEeccC--------------------------------C Confidence 9888753 23222111 111111111 111222222222110 0 Q ss_pred CCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhh----hhhhh--------c Q lcl|NC_010179. 225 FGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFM----NDLRE--------Y 292 (469) Q Consensus 225 ~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~----~~~~~--------~ 292 (469) + +.-.|.|-++.+...++.......-..+.++..+.|-.++.-......++.. ..+.. . 
T Consensus 198 ~---------dg~~G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~e~~e~~r~~~~~~~~G~~nag 268 (441) T protein:vir:79 198 L---------DGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAG 268 (441) T ss_pred C---------CCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCCCCCCHHHHHHHHHHHHHHhcCccccC Confidence 0 0124677777666666655555555556667777776665432211111111 11111 1 Q ss_pred ceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 293 KSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQT 369 (469) Q Consensus 293 ~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~ 369 (469) +++.++ ++.+|-.... ....+.+..+...+.|+..-++|+.-.... ++.|-+.... T Consensus 269 ~~~vl~-------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~-------------- 327 (441) T protein:vir:79 269 KVVVLD-------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL-------------- 327 (441) T ss_pred cceecC-------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH-------------- Confidence 122222 2234444333 334556666777888988888887533211 1112111111 Q ss_pred HHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHH Q lcl|NC_010179. 370 YFEHAINELVRAIMRYLNFS---DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELK 442 (469) Q Consensus 370 ~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~e 442 (469) .|...|.-+++.+...++.+ ......+++.++.-+-.|.++.++.+.++ +|+++.-++.++++. +++.+..+- T Consensus 328 ~~~~tl~P~~~~ie~eln~kl~~~~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T~NE~R~~~gl~Pi~ggd~~~~ 407 (441) T protein:vir:79 328 DYLSTLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIH 407 (441) T ss_pred HHHHHHHHHHHHHHHHHhhhccccccCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceE Confidence 12223333333333333321 11122333333444556889999988886 689999998887654 333222111 Q ss_pred H-----HHHHHHHhhh--hHhhcccCCCCCCCCC Q lcl|NC_010179. 
443 D-----LAKDREENDP--YANQADELNGKGVDDE 469 (469) Q Consensus 443 r-----i~~E~~~~~~--~~~~~~~~~~~~~~de 469 (469) . +..+.....+ .....+....+|.++| T Consensus 408 ~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 408 RVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred eecccccccccccccccccccccccccCCCCCCC Confidence 1 1111111101 0111122233444444 No 183 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=96.32 E-value=0.00075 Score=37.73 Aligned_cols=395 Identities=13% Similarity=0.071 Sum_probs=155.9 Q ss_pred CCHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhccCCc-c-cccccchhhhcccccccccccCcceeccchHHHHHHHH Q lcl|NC_010179. 1 MELDALKK---LIRNTSTSRNDLINNYKKSVDYYENKTD-I-TTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQE 75 (469) Q Consensus 1 ~~~~~~~~---~i~~~~~~~~~~~~~~~~~~~Yy~g~~~-i-~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~ 75 (469) ..++.+.+ ++..+..+ +.++.-.+.. +.+... . ...... ...... .. ..+......-....|+.. T Consensus 65 ~~~~~~~kk~~i~~pfkkk---~~~~~~d~f~-~s~es~s~vtsls~p----daf~~v-nV-s~~~AlknsaV~scI~~I 134 (945) T protein:vir:10 65 FRKNQVLKKEKIIVPYNHQ---EPPFKFNLFE-YSPESLMYLPSISDP----DAFFLI-NL-FRKYRFNNDSKLIKVSEI 134 (945) T ss_pred ehhhhHHHhhccccccccc---ccchhhhhhh-ccCccceecccccCc----cceeee-hh-hhhhhhccHHHHHHHHHH Confidence 22332211 11111111 1111111111 222210 0 000000 000000 00 000111122333456666 Q ss_pred HHhhhcCCeeec--cCch---------hhHHHHHHHHhc-c-HH------H-HHHHHHHHHHhCCeEEEEEEEcCCCce- Q lcl|NC_010179. 76 AGYIASVFPDID--VGKD---------ADNKKILDVLGD-D-RA------L-TLNSLLVDSSNAGRAWLHYWIDEDNNF- 134 (469) Q Consensus 76 ~~~l~g~p~~~~--~~~~---------~~~~~l~~~~~~-n-~~------~-~~~~~~~~~~~~G~~~~~v~~d~~~~~- 134 (469) ++-+.+-|+.+- ..+. .....+..++.. | .+ . 
....+..+++.+|.+|+.+..+.+|++ T Consensus 135 A~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eFwqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii 214 (945) T protein:vir:10 135 PKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNSWEYLLGMVLDDILTIDRGAIVKIRDEQGNLV 214 (945) T ss_pred HhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHHHHHHHHHHHHHHhhcCCeEEEEEECCCCcEE Confidence 666666676541 1111 122345556642 2 11 1 223456788999999999988999987 Q ss_pred EEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccc Q lcl|NC_010179. 135 RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYE 214 (469) Q Consensus 135 ~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (469) .+.+++|..+.|..++... ... +|.. ..++... ..+....+.++.... T Consensus 215 ~L~pLdPs~Vti~~ddDG~--~~y---~Yv~-~idG~~~----~~v~a~DvIlhirn~---------------------- 262 (945) T protein:vir:10 215 AITPVDGTTIKPILSEDTG--IVV---GYVQ-EVDGAIV----AHFDKRDVVLFRQNL---------------------- 262 (945) T ss_pred EEEEECCcceEEEEcCCCc--EEE---EEEE-ecCCceE----EEecCCceEEEeccC---------------------- Confidence 4889999999887765321 111 1111 1111111 111112111111000 Q ss_pred ccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHH-HhcCc--eeEEecCCc-------ccchh Q lcl|NC_010179. 215 TGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLD-DVQTV--ILVLTNYGG-------ASLKQ 284 (469) Q Consensus 215 ~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~-~~~~p--~l~~~g~~~-------~~~~~ 284 (469) +..|. ....|.|.++.+...+.....+.....+.+. ..+.| ++.+.|... 
....+ T Consensus 263 --------s~DG~-------~~GyGlSPIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseE 327 (945) T protein:vir:10 263 --------TPDVY-------MYGYSLPPIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSRE 327 (945) T ss_pred --------CCCcc-------cccCCchHHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHH Confidence 00000 0012444454444444333333222333332 23455 333333211 11111 Q ss_pred ----hhhhhhh------cc-eeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHH Q lcl|NC_010179. 285 ----FMNDLRE------YK-SIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGV 351 (469) Q Consensus 285 ----~~~~~~~------~~-~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~ 351 (469) ....+.. ++ .+.+ ..+++|..... ....+.+..+.....|++.-++|+.-.....+.++. T Consensus 328 q~erlKe~wee~~sG~NnG~piVL-------deGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~S 400 (945) T protein:vir:10 328 QLESIQRQLQAIMMGDYTQVPILS-------GGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKA 400 (945) T ss_pred HHHHHHHHHHHHhCCcccccceec-------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcc Confidence 1111111 11 1111 12344444333 345556677778888999889987533222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHH Q lcl|NC_010179. 352 AIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF---SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEA 426 (469) Q Consensus 352 Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~---~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et 426 (469) .++.... ..+..+|.-++..+...++. .......+.+.|+.....+.++.++++.++ +|+++.-. 
T Consensus 401 NiEqq~~----------~Fv~~tL~Pil~~IEqeLNrkLl~~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNE 470 (945) T protein:vir:10 401 TAEVMAS----------LTKAKGLEPLMATISKGFDEVVSEFRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINE 470 (945) T ss_pred hHHHHHH----------HHHHHHHHHHHHHHHHHHHHhccccccCceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHH Confidence 2222211 12223333333333332221 112234567888877777888999988876 68999988 Q ss_pred HHHhCCC--CCCHHHHHH---HH------HHHHHHhhh-hHh---hcccCCCCCCCCC Q lcl|NC_010179. 427 VAKANPI--VDDWQQELK---DL------AKDREENDP-YAN---QADELNGKGVDDE 469 (469) Q Consensus 427 ~~~~l~~--v~d~~~E~e---ri------~~E~~~~~~-~~~---~~~~~~~~~~~de 469 (469) +.++++. +++-+.-+- .+ .+.+....+ ... ..++...+|.+|| T Consensus 471 vRe~lGLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dE 528 (945) T protein:vir:10 471 ARMEKGLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDE 528 (945) T ss_pred HHHHhCCCCCCCcceeeeccccccccccccccccCCCCcccccCCCCCCCCCCCCCCC Confidence 8887743 221111000 00 000000000 000 0111122222222 No 184 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=96.22 E-value=0.00085 Score=37.40 Aligned_cols=381 Identities=14% Similarity=0.116 Sum_probs=167.4 Q ss_pred CCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 MEL----DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~----~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) .++ ....+|| .+|+.+-.+++-.. 
=...||+..+ T Consensus 44 ~~~e~~~~~~~eLI-----------~~YR~ma~~pEvd~-------------------------------Av~eIVneai 81 (537) T protein:vir:10 44 VDFDGTIRNDHELI-----------TRYREMVLNPECDS-------------------------------AVDDVVNETI 81 (537) T ss_pred cccccccchHHHHH-----------HHHHHHhhccchhh-------------------------------HHHHhhccee Confidence 122 2222332 33444333333221 1122232222 Q ss_pred H-hhhcCCeeeccCchhhHHHHH--------HHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEEccc Q lcl|NC_010179. 77 G-YIASVFPDIDVGKDADNKKIL--------DVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGIIQPD 142 (469) Q Consensus 77 ~-~l~g~p~~~~~~~~~~~~~l~--------~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~~p~ 142 (469) - =....|+.+..++.+..+.++ .+++- +|.....+..+.+.+.|+-|.+..+|.+ |-..+..+||+ T Consensus 82 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~lDPr 161 (537) T protein:vir:10 82 CGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYVDPR 161 (537) T ss_pred EecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCc Confidence 2 233456666665544333332 22221 4556677889999999999999988755 56678889998 Q ss_pred eeEEEEeCCCCCceEEEEEEEEeeecCCceEEE---EEEEEcCCeEEEEEeecCceeecccccccccccccccccccccc Q lcl|NC_010179. 143 QITPVYATTLDNKLLGVLRSYKQLDPEAGKYFT---VHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSN 219 (469) Q Consensus 143 ~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (469) .+-.+.--.. +....++... .+..+.. .+-+|.+.+... .. ..+ T Consensus 162 ~i~~vR~i~~--~~~~~~~~~~----~~~~v~~~~~eyf~ynp~g~~~---~~-----------------~~~------- 208 (537) T protein:vir:10 162 KIRKVTEYEA--KRPEALRTQD----LNQQLTQQSASYFLYNPKGLKN---ST-----------------NQG------- 208 (537) T ss_pred cceeeEeecc--cCCccceEEe----cceeeeecccceeeeccccccc---cC-----------------CCc------- Confidence 8766543110 0011111100 0000000 001122211110 00 000 Q ss_pred cccccCCcccE--EEec-------CCccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----h Q lcl|NC_010179. 
220 TLKHNFGRVPF--IEFP-------KNKYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----K 283 (469) Q Consensus 220 ~~~~~~g~vPv--v~~~-------n~~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~ 283 (469) -+||- |.|. |.....|-+ ..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. . T Consensus 209 ------vkI~~dAI~y~hSGl~d~n~~~i~syL---hkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAe 279 (537) T protein:vir:10 209 ------MKIAPDSIAYCHSGIQDLNKNMVLSHL---HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAE 279 (537) T ss_pred ------eeccHhheeeecccceeCCCCeeeeee---hhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHH Confidence 01110 1111 112223333 233333343 344555555555555433332221111 1 Q ss_pred hh----hhhhhhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_010179. 284 QF----MNDLREYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGI 338 (469) Q Consensus 284 ~~----~~~~~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 338 (469) +. +...+ ++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+| T Consensus 280 qYlr~iM~k~K-NklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~-DV~YF~kKLy~aLnVP 357 (537) T protein:vir:10 280 QYLREVMGRYR-NKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVP 357 (537) T ss_pred HHHHHHHHhcc-ceEEEeccCceecccchhhhhhhhhcccccCCCcccceeeccccCCcChHH-HHHHHHHHHHHHhCCC Confidence 11 11111 222211111 112233455555444554443 3677778888887888 Q ss_pred C--cCccc---cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCH Q lcl|NC_010179. 
339 D--PANFE---SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDS 408 (469) Q Consensus 339 ~--~~~~~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~ 408 (469) - +..++ +|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+ T Consensus 358 ~SRl~~e~~f~~Gr~~--EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~E 435 (537) T protein:vir:10 358 SSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTE 435 (537) T ss_pred ccccCCCCcccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHH Confidence 4 22222 34433 34444555556677778888888888877544333321 1223 346777765444444 Q ss_pred HHHHH-------HHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhh-hhHhhcccCCCCCCCCC Q lcl|NC_010179. 409 LTKAQ-------IVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREEND-PYANQADELNGKGVDDE 469 (469) Q Consensus 409 ~e~~~-------~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~de 469 (469) ...++ +++.+. | .+|.+++.+.+=-.+| .+++-++|++|..+.- +..+.......+..+++ T Consensus 436 lKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~~~ 510 (537) T protein:vir:10 436 LKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGDEE 510 (537) T ss_pred HHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCCcc Confidence 44333 334442 3 4799999987533333 4566777777765421 11111111111111111 No 185 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=96.21 E-value=0.00087 Score=37.37 Aligned_cols=413 Identities=12% Similarity=0.067 Sum_probs=157.2 Q ss_pred CCHHHHHHHHHHHH-HHH-HHHHH-----HHHHHHHHhccCCccccccc-chhhhcccccccccccCcc---e----e-c Q lcl|NC_010179. 
1 MELDALKKLIRNTS-TSR-NDLIN-----NYKKSVDYYENKTDITTRNN-GKPKVSKEGKKDPLRSADN---R----I-P 64 (469) Q Consensus 1 ~~~~~~~~~i~~~~-~~~-~~~~~-----~~~~~~~Yy~g~~~i~~~~~-~~~~~~~~~~~~~~~~~~~---r----i-~ 64 (469) -++++.++....-+ .++ .++.. ..+.+.+.-+++......+. ............+..+++. + . . T Consensus 14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 93 (574) T protein:vir:80 14 SSIEETRNMENYKMHLREIDTNVVNNEPYSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIRNSQDLHKTLKKFGN 93 (574) T ss_pred hhHHHHHhhhhhccccchhhhhhhhccCCCHHHHHHhHhhhcccccchhhhhccccccccCcCccCCcccHHHHHHhhcc Confidence 22233322221100 000 00000 01112222222221110000 0000000000000000000 0 0 0 Q ss_pred cchHHHHHHHHHH----hh--h-----cCCeeecc--Cc-------hhhHHHHHHHHhc----------cHHHHHHHHHH Q lcl|NC_010179. 65 SNFYQLLVDQEAG----YI--A-----SVFPDIDV--GK-------DADNKKILDVLGD----------DRALTLNSLLV 114 (469) Q Consensus 65 ~n~~k~iv~~~~~----~l--~-----g~p~~~~~--~~-------~~~~~~l~~~~~~----------n~~~~~~~~~~ 114 (469) .+....+++..+. +. . |-|..+-. .+ ......|..++.+ .+...+..+.. T Consensus 94 ~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP~~~s~~ef~~~lv~ 173 (574) T protein:vir:80 94 NIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDPNRDNFTTFCKKLVR 173 (574) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCCccccHHHHHHHHHH Confidence 1222233333332 21 1 22333311 11 1122345555532 11223345667 Q ss_pred HHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecC Q lcl|NC_010179. 115 DSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSAT 193 (469) Q Consensus 115 ~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 193 (469) +.+.+|.+|+.+-.+.+|++. +.+++|..+.+..+..... .....+||... ++... ..+....+.+++... 
T Consensus 174 ~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~-~~~~~~y~~~~--~g~~~----~~~~~~eiih~~~~~- 245 (574) T protein:vir:80 174 ATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKL-IKNGERFVQVI--DNRIV----AKFNERELAFAVRNP- 245 (574) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccc-ccCceEEEEEe--CCceE----EEEccccEEEEeccC- Confidence 788999999988888888875 8889999998876643210 01112222211 11111 112222233222100 Q ss_pred ceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeE Q lcl|NC_010179. 194 DSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILV 273 (469) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~ 273 (469) ..+. .....|.|.++.+...|+....+..-..+.+...+.|-.+ T Consensus 246 ---------------------------~~~~---------~~~~~G~spi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gi 289 (574) T protein:vir:80 246 ---------------------------RADI---------EVGQYGYPELEIALKQFIAHENTEVFNDRFFSHGGTTRGI 289 (574) T ss_pred ---------------------------CCCc---------ccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE Confidence 0000 0012477778777777777666666666666777777644 Q ss_pred E--ecCCcccc---hhhhhhhhh-------c-ceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_010179. 274 L--TNYGGASL---KQFMNDLRE-------Y-KSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGI 338 (469) Q Consensus 274 ~--~g~~~~~~---~~~~~~~~~-------~-~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p 338 (469) + .+....+. ......+.. . ++..+.. .+++|..... ....+....+...+.|+..-++| T Consensus 290 l~~~~~~~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~------~G~~~~~l~~s~~D~qfle~~~~~~~~Ia~afgVP 363 (574) T protein:vir:80 290 LHVKTGQQQSQQALDIFRREWRSSLAGINGSWQIPVVSA------EDVKFVNMTPSANDMQFEKWLNYLINVISALYGID 363 (574) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHHhccccccccceeecC------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCC Confidence 4 33211111 112222211 1 1111211 2345544433 34455666777888898888888 Q ss_pred CcCcc--ccCCc--cH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---cCCCcccceEEeCCCCCCCHHH Q lcl|NC_010179. 
339 DPANF--ESSNA--SG-VAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF---SDADKRHISQHWTRTKVEDSLT 410 (469) Q Consensus 339 ~~~~~--~~g~~--Sg-~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~---~~~~~~~i~i~f~~~~p~d~~e 410 (469) +.-.. .-+.. || ..+-+ +.... .....+..+|.-+++.+...++. .... ..+.+.|...-..+..+ T Consensus 364 p~~lG~~~~~t~~gs~~~~~n~--sn~E~---~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~~-~~~~~~f~~~d~~~~~~ 437 (574) T protein:vir:80 364 PAEINFPNNGGATGSKGGSLNE--GNSKE---KMQASQNKGLQPLLRFIEDTVNTYIVAEFG-EKYQFQFRGGDLSAQLD 437 (574) T ss_pred HHHhcccccccccccccccccc--hhHHH---HHHHHHHHHHHHHHHHHHHHHHhhhhhhcC-CceEEEecccchhhHHH Confidence 74221 11111 11 01100 00000 11122333344444433333332 1222 34678888776666666 Q ss_pred HHHHHHH-HhccCChHHHHHhCCC--CCCHH--------HHHHHH------HHHH--HHhhhhHhhcccCCCC------- Q lcl|NC_010179. 411 KAQIVST-VANYSSKEAVAKANPI--VDDWQ--------QELKDL------AKDR--EENDPYANQADELNGK------- 464 (469) Q Consensus 411 ~~~~~~k-l~g~iS~et~~~~l~~--v~d~~--------~E~eri------~~E~--~~~~~~~~~~~~~~~~------- 464 (469) ...+... .+|+++.-.+.++++. +++-+ ..+... ..+. +...+..++......+ T Consensus 438 ~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 517 (574) T protein:vir:80 438 KLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPK 517 (574) T ss_pred HHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccchhccccccccccCCCCCCCCCCCCC Confidence 5554432 2689999888887643 22110 000000 0000 0000000001100000 Q ss_pred ---CCCCC Q lcl|NC_010179. 
465 ---GVDDE 469 (469) Q Consensus 465 ---~~~de 469 (469) .+++| T Consensus 518 ~~~~d~~~ 525 (574) T protein:vir:80 518 DSQNDTDV 525 (574) T ss_pred Cccccccc Confidence 01111 No 186 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=96.19 E-value=0.00089 Score=37.31 Aligned_cols=400 Identities=12% Similarity=0.052 Sum_probs=156.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-cccccccchhh---hc-ccccccccccCc-ceeccchHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKT-DITTRNNGKPK---VS-KEGKKDPLRSAD-NRIPSNFYQLLVDQ 74 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~-~i~~~~~~~~~---~~-~~~~~~~~~~~~-~ri~~n~~k~iv~~ 74 (469) |--=.-.-..-++..+... ...+...--+...+. .+......... .. ............ .-+.++=.-..|+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~al~~~~V~acv~~ 79 (441) T protein:vir:98 1 MHWYNTDCYFVDFKSRKQS-RKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKDIEAIRHSDIFTAVMM 79 (441) T ss_pred CceecCccceeccccccch-hhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccchhhhhccHHHHHHHHH Confidence 0000000000000000000 000000000000000 00000000000 00 000000000000 00000101123444 Q ss_pred HHHhhhcCCeeeccCc-hhhHHHHHHHHhc--cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEE Q lcl|NC_010179. 75 EAGYIASVFPDIDVGK-DADNKKILDVLGD--DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITP 146 (469) Q Consensus 75 ~~~~l~g~p~~~~~~~-~~~~~~l~~~~~~--n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~ 146 (469) .++-+.+-|+.+.-+. ......+..++.. |. .+....+...++.+|.+|+++-.+.+|++. 
+.+++|..+.+ T Consensus 80 Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~~i~~~~v~v 159 (441) T protein:vir:98 80 IASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSEIEL 159 (441) T ss_pred HHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCceeEE Confidence 4444445565543221 1122334444432 32 223345677888999999999888888864 88999999998 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) ..++. .++.+.... .+..+... ...+....+.+++.- T Consensus 160 ~~~~~--g~~~~~~~~---~~~~~~~~---~~~~~~~dviHir~~----------------------------------- 196 (441) T protein:vir:98 160 KLDAR--GRLYYFHQR---IDSNGNNI---ERNVKFEDMLDIKFY----------------------------------- 196 (441) T ss_pred EECCC--CcEEEEEEE---eccCccee---eEEEccccEEEeccC----------------------------------- Confidence 88652 333221111 11111111 011222222222110 Q ss_pred cccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh----hhhhhh--------cce Q lcl|NC_010179. 227 RVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF----MNDLRE--------YKS 294 (469) Q Consensus 227 ~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~----~~~~~~--------~~~ 294 (469) | .+.-.|.|-+..+...++..+....-..+.++..+.|-.+++-......++. ...+.. .++ T Consensus 197 --~----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~G~~nag~~ 270 (441) T protein:vir:98 197 --S----LDGINGLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKGVLDNKKARDRAREEFHKSFSGTKQAGKV 270 (441) T ss_pred --C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCCHHHHHHHHHHHHHHhcCccccCcc Confidence 0 0112366767766666666555555555666667777666542221111111 111111 112 Q ss_pred eeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
295 IKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) +.++. +.+.+-++.+.....+.+..+...+.|+..-++|+.-.... ++.|-+.... .|.. T Consensus 271 ~vl~~-----g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~s~~q~~~--------------~y~~ 331 (441) T protein:vir:98 271 VVLDE-----SMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMSITDANL--------------DYLS 331 (441) T ss_pred eecCC-----CceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCccHHHHHH--------------HHHH Confidence 22221 12333333333334455666777788988888887543211 1122111111 1112 Q ss_pred HHHHHHHHHHHHhcccCC-CcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHH----- Q lcl|NC_010179. 374 AINELVRAIMRYLNFSDA-DKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQEL----- 441 (469) Q Consensus 374 ~l~~~~~~i~~~~~~~~~-~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~----- 441 (469) .|.-++..+...++.+=. ......+.| +.-+-.|.++.++++.++ +|+++.-++.++++. +++.+..+ T Consensus 332 tl~P~~~~ie~~ln~~L~~~~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~gGd~~~~~~~~ 411 (441) T protein:vir:98 332 TLKPYITCVCAELNFKFNDEYVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQRDGLAPIPGGNGSIHRVDL 411 (441) T ss_pred HHHHHHHHHHHHHHhhccccccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCcceEeecc Confidence 333333333332222110 111223455 444667899999998886 689999998888643 33322211 Q ss_pred HHHHHHHHHhhhh--HhhcccCCCCCCCCC Q lcl|NC_010179. 442 KDLAKDREENDPY--ANQADELNGKGVDDE 469 (469) Q Consensus 442 eri~~E~~~~~~~--~~~~~~~~~~~~~de 469 (469) .-+..+.....+. 
....+....+|.++| T Consensus 412 n~~~~~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 412 NHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred cccccccccccccccccccccccCCCCCCC Confidence 1111111111110 111122223333444 No 187 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=96.10 E-value=0.001 Score=37.00 Aligned_cols=363 Identities=14% Similarity=0.085 Sum_probs=155.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcce--eccchHHHHHHHHHHh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNR--IPSNFYQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~r--i~~n~~k~iv~~~~~~ 78 (469) |-+-. .+++. . ....-.....+..... ............-...+.+ +..+-....|+..++- T Consensus 3 m~~~~---~~~~~----~-~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~g~~v~~~~al~~~~v~~~v~~ia~~ 66 (392) T protein:vir:74 3 LPILN---FINQT----N-DPPEAGSVQSYFPDGN--------DAQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSD 66 (392) T ss_pred chhhh---hhhcc----c-CcccccccccccccCc--------hhhhhhhccCCCCcccchhhhhcchHHHHHHHHHHHh Confidence 22210 00000 0 0000000000000000 0000000000000000001 1112233345555555 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhc-cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_010179. 79 IASVFPDIDVGKDADNKKILDVLGD-DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTL 152 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~-n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~ 152 (469) +-+-|+.+.-.. . ..++.. |. ......+..+++.+|.+|+.+-.+.+|++ .+.+++|..+-+..+... 
T Consensus 67 ia~lp~~~~~~~--~----~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~~ 140 (392) T protein:vir:74 67 LAIVKINAEKKK--N----QGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE 140 (392) T ss_pred hccCceeeccch--h----hhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC Confidence 555566543221 1 122221 21 22334566788999999999988988886 588899999888776432 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) ..+ +|......+.. . ....+....+.+++... T Consensus 141 -~~~-----~y~~~~~~~~~-~-~~~~~~~~evih~~~~~---------------------------------------- 172 (392) T protein:vir:74 141 -NGM-----YYNITFDDPKI-E-PILQAPQSDLIHMKLLS---------------------------------------- 172 (392) T ss_pred -ceE-----EEEEEecCCcc-c-eeEEEcCccEEEecCCC---------------------------------------- Confidence 111 12111111110 0 01112222232221100 Q ss_pred ecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc-ccchhhhhhhh--------hcceeeecccCCC Q lcl|NC_010179. 233 FPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG-ASLKQFMNDLR--------EYKSIKINNAGNG 303 (469) Q Consensus 233 ~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~-~~~~~~~~~~~--------~~~~~~~~~~~~~ 303 (469) ......|.|-++.+...|+....+..-..+.++..+.|-.+++-... ...++....+. ..+++.++. T Consensus 173 ~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~n~g~~~vl~~---- 248 (392) T protein:vir:74 173 IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLDD---- 248 (392) T ss_pred CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecCC---- Confidence 00012477878777777776666666666667777777655542111 11111111111 112222221 Q ss_pred CCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
304 DKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESS--NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA 381 (469) Q Consensus 304 ~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g--~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 381 (469) +.+++-++.+.....+.+..+...+.|+..-++|+.-....+ +.+..+. ...+..+|.-.++. T Consensus 249 -g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~--------------~~~~~~~l~p~~~~ 313 (392) T protein:vir:74 249 -LEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQI--------------SGMYASALNRYLRP 313 (392) T ss_pred -CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--------------HHHHHHHHHHHHHH Confidence 123344433333445666677788899988888875332222 1122222 22334444445444 Q ss_pred HHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHhhhhHh Q lcl|NC_010179. 382 IMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVDDWQQELKDLAKDREENDPYAN 456 (469) Q Consensus 382 i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~d~~~E~eri~~E~~~~~~~~~ 456 (469) |...++.+=.. .++..+..-+-.|..+.++.+.++ +|+++...+.+++ |+..+ |+.+. | . T Consensus 314 ie~~l~~~l~~--~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pn---e~r~~--e---n----- 378 (392) T protein:vir:74 314 AISELEYKLSD--HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPAP--E---N----- 378 (392) T ss_pred HHHHHHHhccc--hhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcc---ccchh--c---C----- Confidence 44444322111 122223333335667777777776 6799998887654 54332 32211 0 1 Q ss_pred hcccCCCCCCCCC Q lcl|NC_010179. 
457 QADELNGKGVDDE 469 (469) Q Consensus 457 ~~~~~~~~~~~de 469 (469) ....++ |+.+| T Consensus 379 -l~~~~~-Gd~~~ 389 (392) T protein:vir:74 379 -TNKKTT-GQSNE 389 (392) T ss_pred -CCCCCC-CCCCC Confidence 112222 22333 No 188 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=96.09 E-value=0.001 Score=36.99 Aligned_cols=424 Identities=10% Similarity=0.032 Sum_probs=185.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) +..+.+++..+.+..++..-..+.+.+.+|....- ..... ....+...++-.+-+...++..++.|. T Consensus 9 ~~~~~~~~r~~~l~~~R~~~e~~w~e~~~~~lP~~---~~~~~----------~~~~~~~~~~~dst~~~a~~~LAa~L~ 75 (532) T protein:vir:99 9 FAADGAAAAYNRLKNDRGAYETRAEDCATYTIPSV---FPSAT----------ADGSTSYTTPWQSIGARGLNNLASKLM 75 (532) T ss_pred ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhhhcc---cCCCC----------CcchhhccccccchHHHHHHHHHHHHH Confidence 34566677777776666555666667777765321 00000 011122345666777777777777665 Q ss_pred cC--Ce-----eeccCchh---------hHHHHHHHH------------hccHHHHHHHHHHHHHhCCeEEEEEEEcCC- Q lcl|NC_010179. 81 SV--FP-----DIDVGKDA---------DNKKILDVL------------GDDRALTLNSLLVDSSNAGRAWLHYWIDED- 131 (469) Q Consensus 81 g~--p~-----~~~~~~~~---------~~~~l~~~~------------~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~- 131 (469) +- || ++...+.. ....++.|+ ..||...+.++.++..++|.+.+++..++. 
T Consensus 76 ~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~~~~~ 155 (532) T protein:vir:99 76 LALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGNVLLYIPSTEQV 155 (532) T ss_pred HhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEEecccccc Confidence 31 22 12222211 112233332 246677788899999999999876654432 Q ss_pred --CceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC-----------C---ceEEEEEEEEcCCeEEEEEeecCce Q lcl|NC_010179. 132 --NNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE-----------A---GKYFTVHEYWTDKEAQFFRTSATDS 195 (469) Q Consensus 132 --~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~-----------~---~~~~~~~~~~~~~~~~~~~~~~~~~ 195 (469) ....++.++-.++++.-|. .+++...+|.++..... + ......+++|+.- +.......+ T Consensus 156 ~~~~~~f~~~pl~~y~v~~d~--~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v---~~~~~~~~~ 230 (532) T protein:vir:99 156 EGQSNAPKLYKLHNFVVERDA--YDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHV---YRDPEAMVF 230 (532) T ss_pred cCcccceEEEEcCeEEEeeCC--CCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEE---EecCCCCee Confidence 2345666666665554443 45666666654432110 0 0111122332210 000111001 Q ss_pred eecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCc Q lcl|NC_010179. 196 TVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTV 270 (469) Q Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p 270 (469) .. +. + ..+..... 
.....+|..+|++.++ .+.+|.|-..+..+-+..+|.+.-...........| T Consensus 231 ~~---~~----~--~~g~~~~~-~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~a~~~ 300 (532) T protein:vir:99 231 RS---YQ----E--IDGEIVAG-TEGEYPLDSCPWIPVRLIKMPNEDYGRSFVEEYLGDLKSLENLYEAIVKMSMISSKV 300 (532) T ss_pred EE---EE----e--ecCceecc-cccccccccCCceeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCC Confidence 10 00 0 00100000 0112235567877765 345799999999999999998877777777777887 Q ss_pred eeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCc Q lcl|NC_010179. 271 ILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNA 348 (469) Q Consensus 271 ~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~ 348 (469) .+.+.-.+..+ ...+...+...+.++ ...++..+. ...+.......++.++..|...-..-.+...+.... T Consensus 301 ~~lv~p~g~~~----~~~~~~~~~g~~v~g---~~~~i~~~~~~~~~~~~~~~~~i~~~~~rI~~af~~~~~~~~d~~r~ 373 (532) T protein:vir:99 301 LFFVNPNGVTQ----IRRVAKANTGDFVAG---RKQDVEVFQLEKYNDFQVAKATADDIEKRLSYAFMLNSAVQRGGDRV 373 (532) T ss_pred Cceeccccccc----hhhhccCCCcceecC---CcccceeeecccccchhHHHHHHHHHHHHHHHHHhhhhcccCCCCcc Confidence 65543111111 111111111111111 112344443 334567777778777777754322111212223334 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcccC----CCc--ccce-EEeCCCCCCCHHHHHH Q lcl|NC_010179. 349 SGVAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNFSD----ADK--RHIS-QHWTRTKVEDSLTKAQ 413 (469) Q Consensus 349 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~~~----~~~--~~i~-i~f~~~~p~d~~e~~~ 413 (469) |+..+.. ++.++...++..+.++ +..++.++...+ .+. ..+. 
+++-.++- .++.++ T Consensus 374 TAtEV~~-------r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~iv~~is~La--raq~~~ 444 (532) T protein:vir:99 374 TAEEIRY-------VAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEAVEPAIATGLEALG--RGHDLN 444 (532) T ss_pred cHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhcccceeecchHHH--HHHHHH Confidence 5554432 3344455555544442 222233332222 111 1112 22222222 222222 Q ss_pred HH----HHHhcc-------CChHHHHHh----CCCCC-----CHHHHHHHHHHHHHHhhh---hHhhcc----------c Q lcl|NC_010179. 414 IV----STVANY-------SSKEAVAKA----NPIVD-----DWQQELKDLAKDREENDP---YANQAD----------E 460 (469) Q Consensus 414 ~~----~kl~g~-------iS~et~~~~----l~~v~-----d~~~E~eri~~E~~~~~~---~~~~~~----------~ 460 (469) .+ +.++.+ +....++.. +| |+ -.++|++.++++++.... ..++.. . T Consensus 445 ~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~G-V~~~~i~r~~ee~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~~~~ 523 (532) T protein:vir:99 445 KLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLG-MDTTGLILTQQDKQAKMAEASTAAGMVTAGQQMGAAGGQAAAAMM 523 (532) T ss_pred HHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhC-CChhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhH Confidence 22 222222 333444433 22 21 124455555433222211 111111 0 Q ss_pred CCCCCCCCC Q lcl|NC_010179. 461 LNGKGVDDE 469 (469) Q Consensus 461 ~~~~~~~de 469 (469) ....|-+-| T Consensus 524 ~~~~~~~~~ 532 (532) T protein:vir:99 524 QQQAGMPTQ 532 (532) T ss_pred HhhcCCCCC Confidence 011111111 No 189 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=95.97 E-value=0.0012 Score=36.63 Aligned_cols=369 Identities=8% Similarity=0.016 Sum_probs=160.3 Q ss_pred HHHHHHhccCCcccccccchhhhccccccccccc------CcceeccchHHHHHHHHHHhhhcCCeee-c-cCch----h Q lcl|NC_010179. 
25 KKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRS------ADNRIPSNFYQLLVDQEAGYIASVFPDI-D-VGKD----A 92 (469) Q Consensus 25 ~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~------~~~ri~~n~~k~iv~~~~~~l~g~p~~~-~-~~~~----~ 92 (469) -.+.+++++.... .+... .....+....... +..-+...-....|+..+.-+-+-|+.+ . .++. . T Consensus 1 m~~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~g~~~~~ 77 (419) T protein:vir:57 1 MFIPQFWKGRPSE-NRVNW--QVVPGGMRSSSSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTENGGREIA 77 (419) T ss_pred CcchhhhccCCcc-ccccc--cccccccccccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcCCCceecc Confidence 1112222222100 00000 0000000000000 0001112223445555555555556654 1 1111 1 Q ss_pred hHHHHHHHHhc--cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEEEEEEEe Q lcl|NC_010179. 93 DNKKILDVLGD--DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQ 165 (469) Q Consensus 93 ~~~~l~~~~~~--n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~ 165 (469) ....+..++.. |. .+....+..+...+|.+|+++..+.+|++ .+.+++|..+.+..+.. .. .+|.. T Consensus 78 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~~~~--g~-----~~y~~ 150 (419) T protein:vir:57 78 FDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLKGPD--GM-----PYYDI 150 (419) T ss_pred ccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEECCC--ce-----EEEEE Confidence 22335565542 32 23334667788999999999988998886 58888998887765432 11 12211 Q ss_pred eecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHH Q lcl|NC_010179. 166 LDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNK 245 (469) Q Consensus 166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~ 245 (469) . ..+ .++....+.+++.- | .+...|.|-+.. 
T Consensus 151 ~-~~~-------~~~~~~~vih~r~~-------------------------------------~----~d~~~G~s~i~~ 181 (419) T protein:vir:57 151 P-SIG-------EILPMRMVHHIKSF-------------------------------------S----LDGYIGTSPIQT 181 (419) T ss_pred c-CCc-------eEEchhhEEEecCc-------------------------------------C----CCCcccccHHHH Confidence 1 111 01112222221100 0 011247777777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC---cccchhh----hhhhhh--------cceeeecccCCCCCCcceE Q lcl|NC_010179. 246 YKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG---GASLKQF----MNDLRE--------YKSIKINNAGNGDKSGVDK 310 (469) Q Consensus 246 v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~---~~~~~~~----~~~~~~--------~~~~~~~~~~~~~~~~~~~ 310 (469) +...++....+..-..+.+...+.|-.+++-.. ....++. ...+.. .+++.++. +.+++- T Consensus 182 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~-----g~~~~~ 256 (419) T protein:vir:57 182 NPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQE-----GMTYKQ 256 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCC-----CceEEE Confidence 777777666555555555666677755554311 1111121 111111 12333321 123333 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc- Q lcl|NC_010179. 311 LQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS- 389 (469) Q Consensus 311 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~- 389 (469) ++.+.....+.+..+...+.|+..-++|+.-....+..++..++- .....+...|.-+++.+...++.+ T Consensus 257 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~----------~~~~f~~~~l~P~~~~ie~~l~~~l 326 (419) T protein:vir:57 257 LSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEH----------QGLQYVIYTMLAILKRHESAMMRDL 326 (419) T ss_pred cCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHH----------HHHHHHHHHHHHHHHHHHHHHHhhc Confidence 333333445566677778899999899874332221111111111 112233445555555554444322 Q ss_pred --CCCcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHHHHhhhhHhhcccC Q lcl|NC_010179. 
390 --DADKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKDREENDPYANQADEL 461 (469) Q Consensus 390 --~~~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E~~~~~~~~~~~~~~ 461 (469) ........+.| ..-+..|.++.++.+.++ +|+++.-.+.+.++. +++-+.=+--+. ...........+.. T Consensus 327 l~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n--~~~~~~~~~~~~~~ 404 (419) T protein:vir:57 327 LLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLTPIPGGDKYLTPLN--MVDSKALTGIGKAT 404 (419) T ss_pred cCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccc--cccccccccccCCC Confidence 11112334454 455667899999998886 689999999888754 222110000000 00000000111111 Q ss_pred CCCCCCCC Q lcl|NC_010179. 462 NGKGVDDE 469 (469) Q Consensus 462 ~~~~~~de 469 (469) ++.-.+.| T Consensus 405 ~~~~~~~~ 412 (419) T protein:vir:57 405 PQQLKDIE 412 (419) T ss_pred cccCcchh Confidence 11112222 No 190 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=95.92 E-value=0.0013 Score=36.49 Aligned_cols=367 Identities=9% Similarity=0.017 Sum_probs=145.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceec-cchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIP-SNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~-~n~~k~iv~~~~~~l 79 (469) |-+=. -|..+ .+. .......++.+. ............++. 
.+.....|+..++-+ T Consensus 1 Mg~~~------~f~~k-~~~--~~~~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~V~~~I~~ia~~i 56 (403) T protein:vir:80 1 MGLFN------FFRRK-TRS--EPTNAISWFLTQ---------------EAYDTLAIPGYTRLSDNPEVRMAVHKIAELI 56 (403) T ss_pred Ccccc------ccccc-ccc--cccchhhhhccc---------------ccccccccchhhhhhhhHHHHHHHHHHHHhh Confidence 22211 00000 000 000000000000 000000000001111 122334556666666 Q ss_pred hcCCeeec-c-Cc--hhhHHHHHHHHhc--cH-H--HHHH-HHHHHHHhC--CeEEEEEEEcCCCce-EEEEEccceeEE Q lcl|NC_010179. 80 ASVFPDID-V-GK--DADNKKILDVLGD--DR-A--LTLN-SLLVDSSNA--GRAWLHYWIDEDNNF-RYGIIQPDQITP 146 (469) Q Consensus 80 ~g~p~~~~-~-~~--~~~~~~l~~~~~~--n~-~--~~~~-~~~~~~~~~--G~~~~~v~~d~~~~~-~i~~~~p~~~~~ 146 (469) .+-|+.+- . ++ ......+..++.. |- + ..+. .+..+++.. |.+|+++..+..|++ .+.+++|..+-+ T Consensus 57 A~~p~~~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L~~l~p~~v~~ 136 (403) T protein:vir:80 57 SSMTIHLMQNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDELIPLAPSKVSF 136 (403) T ss_pred hhCceEEEEecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEEEEEcCCeeEE Confidence 66666641 1 11 1223334555542 32 2 1233 344455554 667877777777776 477888888877 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) +.+++. .. + +|. + ..|....+.+++.. T Consensus 137 ~~~~~g--~~---~-~y~-----~-------~~~~~~eiih~~~~----------------------------------- 163 (403) T protein:vir:80 137 VDTDTG--YQ---I-WYQ-----G-------KAYNYDEVLHFIVN----------------------------------- 163 (403) T ss_pred EEcCCc--eE---E-EEe-----e-------cccchhhEEEEecc----------------------------------- Confidence 655421 00 0 110 0 00111222211100 Q ss_pred cccEEEecCC-ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCccc---chhhhhhh--------hhcce Q lcl|NC_010179. 
227 RVPFIEFPKN-KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGAS---LKQFMNDL--------REYKS 294 (469) Q Consensus 227 ~vPvv~~~n~-~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~---~~~~~~~~--------~~~~~ 294 (469) +.+.+ -.|.|-+..+...+........-....+...+.|-.++.-..... .+.....+ ...+. T Consensus 164 -----~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 238 (403) T protein:vir:80 164 -----PDPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLEASEAGQP 238 (403) T ss_pred -----CCCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCChHHHHHHHHHHHHHHhhhhhcCCe Confidence 00011 135665655555555555444444455555666666654322111 11111111 11122 Q ss_pred eeecccCCCCCCcceEEee-cCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 295 IKINNAGNGDKSGVDKLQI-DIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) ..++.++ .+..-+++ +.....+.+..+.....|+..-++|+.-. +.+..+- + .+. ..+.. T Consensus 239 ~~~~~~~----~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~-~---~~~----------~f~~~ 299 (403) T protein:vir:80 239 WIIPAEL----LDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLL-GVGKYDK-D---EYN----------NFINS 299 (403) T ss_pred eeecccc----cccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCCccH-H---HHH----------HHHHH Confidence 2222211 11111221 22233445556667778888888876322 2221111 1 111 13344 Q ss_pred HHHHHHHHHHHHhcccCCCcccceEEeC--CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHHHH---HHH Q lcl|NC_010179. 
374 AINELVRAIMRYLNFSDADKRHISQHWT--RTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVD--DWQQEL---KDL 444 (469) Q Consensus 374 ~l~~~~~~i~~~~~~~~~~~~~i~i~f~--~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~--d~~~E~---eri 444 (469) +|.-+++.|...++.+=....++.+.|+ .-+..|.++.++.+.++ +|+++.-++.+.++.-+ ..+.-+ ..+ T Consensus 300 ~l~P~~~~ie~~l~~kll~~~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t~NE~R~~~gl~p~~ggd~~~~~~n~~ 379 (403) T protein:vir:80 300 TILPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLMEGNEVRDWLGLSPKEGLSELVILENYI 379 (403) T ss_pred HHHHHHHHHHHHHHHhccCCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEeeccccc Confidence 5555555555544432222223445664 44667889999988876 68999988888875422 111100 000 Q ss_pred HHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 445 AKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 445 ~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) --+.. ......+.++.++.++.-| T Consensus 380 pl~~~-~~~~~~k~ge~~~~~~~~~ 403 (403) T protein:vir:80 380 PLDKI-GDQNKLKGGEKGGADGQTD 403 (403) T ss_pred chhhc-cchhhccCCCCCCCCCCCC Confidence 00100 0011111222222222222 No 191 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=363 Identities=9% Similarity=-0.047 Sum_probs=155.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINN-YKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~-~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |-+=+ .+.....+...+ ......++.+..... ..... 
...-...+.....|+..++-+ T Consensus 1 Mg~f~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~---~~~~~~~~~v~~~i~~ia~~i 59 (406) T protein:vir:95 1 MGLFD------RWRRTKRKSKIRADTGYVGLFMSGEDVS------------FLVPG---YVRLSDNPEVRMAVHKIADLI 59 (406) T ss_pred Ccchh------hhccccccccccccchhhhhhccCcccC------------ccccC---HHHHhhcHHHHHHHHHHHHhh Confidence 32221 110000000000 000001111100000 00000 000012245555677777767 Q ss_pred hcCCeeec--cCc--hhhHHHHHHHH-hc-cH----HHHHHHHHHHHHhCCeEEE--EEEEcCCCce-EEEEEccceeEE Q lcl|NC_010179. 80 ASVFPDID--VGK--DADNKKILDVL-GD-DR----ALTLNSLLVDSSNAGRAWL--HYWIDEDNNF-RYGIIQPDQITP 146 (469) Q Consensus 80 ~g~p~~~~--~~~--~~~~~~l~~~~-~~-n~----~~~~~~~~~~~~~~G~~~~--~v~~d~~~~~-~i~~~~p~~~~~ 146 (469) .+-|+.+- .++ ......+...+ .. |. .+....+..+.+.+|.++. .+-.+.+|++ .+.+++|..+-+ T Consensus 60 a~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i~~~~v~~ 139 (406) T protein:vir:95 60 SSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPLTPSKVNF 139 (406) T ss_pred ccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEEcCceeEE Confidence 66666641 111 11122232333 22 32 2334456667777776544 4445666776 478888888877 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) +.+.+. .++ . ..+ . .+.... T Consensus 140 ~~~~~~-------~~~-~---~~~-~------~~~~~e------------------------------------------ 159 (406) T protein:vir:95 140 LDTPDG-------YQV-L---YGG-Q------TFNYDE------------------------------------------ 159 (406) T ss_pred EEcCCe-------EEE-E---ecc-E------EEchhH------------------------------------------ Confidence 665421 000 0 000 0 011122 Q ss_pred cccEEEecCC------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc---ccchhhhhhhhh------ Q lcl|NC_010179. 
227 RVPFIEFPKN------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG---ASLKQFMNDLRE------ 291 (469) Q Consensus 227 ~vPvv~~~n~------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~---~~~~~~~~~~~~------ 291 (469) |+||+.+ -.|.|-++.+...++....+..-..+.+...+.|-.++.-... +........+.. T Consensus 160 ---vih~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~l~~e~~~~~~~~~~~~~~g~~ 236 (406) T protein:vir:95 160 ---VLHFIYNPDPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDAATAELSSEEGRNAVFKKYLQAT 236 (406) T ss_pred ---EEEeeccCCCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCHHHHHHHHHHHHHHhcccc Confidence 2333211 1367777777777777776666666667777777666543221 111111111111 Q ss_pred --cceeeecccCCCCCCcceEEe-ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 292 --YKSIKINNAGNGDKSGVDKLQ-IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 292 --~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~ 368 (469) .+.+.+..++ ...+-++ .+.....+.+..+.....|+..-++|+.-. |..++.. .. .. T Consensus 237 n~~~~~v~~~~~----~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~l---g~~~~~~--~~----------~~ 297 (406) T protein:vir:95 237 EAGQPWIIPAEL----LEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLL---GIGEFNR--DE----------YN 297 (406) T ss_pred ccCCceeecCCC----ccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc---CCCCchH--HH----------HH Confidence 1122222211 1122122 122234455666777888888888886433 2222211 11 12 Q ss_pred HHHHHHHHHHHHHHHHHhcccCCCcc--cceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCC--CHHH--- Q lcl|NC_010179. 369 TYFEHAINELVRAIMRYLNFSDADKR--HISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVD--DWQQ--- 439 (469) Q Consensus 369 ~~~~~~l~~~~~~i~~~~~~~~~~~~--~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~--d~~~--- 439 (469) ..+..+|.-+++.|...++.+-+... .+.+.++.-+..|.++.++.+.++ +|+++...+.++++.-. +-+. 
T Consensus 298 ~~~~~~l~P~~~~ie~~l~~~l~~~~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~NE~R~~~gl~p~~~gd~~~~ 377 (406) T protein:vir:95 298 NFINSTILPIAKGIEQELTRKLLISPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEGNEVRDWLGLSPKEGLSELVI 377 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeee Confidence 34556666666666655553322222 344445555667899999988886 68999999998876522 1110 Q ss_pred --HHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 440 --ELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 440 --E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) -+..+.. .......+.++..+.++.-| T Consensus 378 ~~n~~~~~~---~~~~~~~k~g~~~~~~~~~~ 406 (406) T protein:vir:95 378 LENYIPLDK---IGDQSKLKGGDNSGADGQTD 406 (406) T ss_pred ccCccchhh---cccccccCCCCCCCCCCCCC Confidence 0111100 00000111111111111111 No 192 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=95.86 E-value=0.0013 Score=36.34 Aligned_cols=385 Identities=11% Similarity=0.037 Sum_probs=160.2 Q ss_pred HHHHHHHhccCCc-----cccccc--chhhhccccccccccc-Ccceeccc--hHHHHHHHHHHhhhcCCeeeccCc--- Q lcl|NC_010179. 24 YKKSVDYYENKTD-----ITTRNN--GKPKVSKEGKKDPLRS-ADNRIPSN--FYQLLVDQEAGYIASVFPDIDVGK--- 90 (469) Q Consensus 24 ~~~~~~Yy~g~~~-----i~~~~~--~~~~~~~~~~~~~~~~-~~~ri~~n--~~k~iv~~~~~~l~g~p~~~~~~~--- 90 (469) +-.+.+.+.-.+. ...+.. ........+.....+. ...+.++. =.-..|+..+.-+-+-|+.+--.. 
T Consensus 1 Mg~~~~l~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~V~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~~ 80 (457) T protein:vir:13 1 MGFWSALFGRGHSPALDGIEARAWEPYDPSIYNLGAVAASGETVTPHDALQVSAVFASVRLLSETIATLPLSTYSKRGGS 80 (457) T ss_pred CchhhhhhcccccccccccccccccccchHHHhhcccccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEecCCc Confidence 1111111111000 000000 0000000000000000 00000111 112234555555555565542111 Q ss_pred --hhhHHHHHHHHhc--cHH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEEEEE Q lcl|NC_010179. 91 --DADNKKILDVLGD--DRA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGVLRS 162 (469) Q Consensus 91 --~~~~~~l~~~~~~--n~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~ 162 (469) +.....+..+++. |.+ +.+..+....+.+|.+|+.+..+ .|++ .+..++|..+.+..+....... ...+. T Consensus 81 ~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~-~g~~~~l~~l~p~~v~v~~~~~~~~~~-~~~~~ 158 (457) T protein:vir:13 81 RKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQ-GPNIVGLDVLDPTKIHVHMVMVDGLRR-KVFEA 158 (457) T ss_pred ccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEEccCceEEEEecCCCccc-eeEEE Confidence 1122334455542 222 23445667888999999888554 4664 5888999988876553322111 11111 Q ss_pred EEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcccccc Q lcl|NC_010179. 163 YKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAE 242 (469) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~ 242 (469) |... ..+.. .....+....+.+++.-. ..+.-.|.|. T Consensus 159 y~~~-~~~~~--~~~~~~~~~diih~~~~~----------------------------------------~~~~~~G~s~ 195 (457) T protein:vir:13 159 YDID-ADGNE--VLLGWFTPRDVLHIPGMM----------------------------------------LPGDFVGCSP 195 (457) T ss_pred EEEe-cCCce--eeEEeeCccceEEecCCC----------------------------------------CCCccccccH Confidence 2111 11111 122223334443332100 0001247777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhh----hhh--------hcceeeecccCCCCCCcceE Q lcl|NC_010179. 
243 LNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMN----DLR--------EYKSIKINNAGNGDKSGVDK 310 (469) Q Consensus 243 ~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~----~~~--------~~~~~~~~~~~~~~~~~~~~ 310 (469) ++.+...|.....+..-..+.+...+.|-.+++-.. ...++... .+. ..+++.++. +.+++- T Consensus 196 i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~g~~nag~~~vl~~-----g~~~~~ 269 (457) T protein:vir:13 196 ISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPG-TMSEEGLARAREAWRAANSGVDNAHRVALLTE-----GAKFSK 269 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCC-CCCHHHHHHHHHHHHHHhcCccccCcceecCC-----CceEEE Confidence 777777777666666666666677777766665422 11122111 111 112233322 123444 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_010179. 311 LQIDIPVEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF 388 (469) Q Consensus 311 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~ 388 (469) ++.+.....+.+..+.....|+..-++|+.-... .++.++..++-+.. ..+..+|.-+++.|...++. T Consensus 270 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~----------~f~~~tl~P~~~~ie~~ln~ 339 (457) T protein:vir:13 270 VAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI----------AFTMFSLRPWLERIEAGFNR 339 (457) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH----------HHHHHHHHHHHHHHHHHHHH Confidence 4433333445566667778898888888753322 12222222222221 12233344444444333332 Q ss_pred -----cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCH--HHH-----HHHHHHH---HH Q lcl|NC_010179. 389 -----SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDW--QQE-----LKDLAKD---RE 449 (469) Q Consensus 389 -----~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~--~~E-----~eri~~E---~~ 449 (469) .+.....+++.++.-+-.|.++.++.+.++ +|+++.-.+.+.++. +++. +.- +..+.+. +. 
T Consensus 340 ~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~ 419 (457) T protein:vir:13 340 LLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEP 419 (457) T ss_pred hhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeeccccccccccccccc Confidence 122223345555566677999999999886 689998888877643 2232 111 1111110 00 Q ss_pred Hh------hhhHhhc-ccCCCCCCCCC Q lcl|NC_010179. 450 EN------DPYANQA-DELNGKGVDDE 469 (469) Q Consensus 450 ~~------~~~~~~~-~~~~~~~~~de 469 (469) .. .+..+.. +....+..||+ T Consensus 420 ~~~~~~~~~~~~~~~~~~~~~g~~d~~ 446 (457) T protein:vir:13 420 APAPPAIEPPAEEPDEEPEPEGKPDDE 446 (457) T ss_pred cCCCCCCCCCccccCCCCCCCCCCccc Confidence 00 0111111 11112222222 No 193 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=95.83 E-value=0.0014 Score=36.25 Aligned_cols=379 Identities=11% Similarity=0.056 Sum_probs=156.9 Q ss_pred HHHHh-ccCCcccccccch-------hhhcc---cccccccccCcceecc--chHHHHHHHHHHhhhcCCeeecc-C-c- Q lcl|NC_010179. 27 SVDYY-ENKTDITTRNNGK-------PKVSK---EGKKDPLRSADNRIPS--NFYQLLVDQEAGYIASVFPDIDV-G-K- 90 (469) Q Consensus 27 ~~~Yy-~g~~~i~~~~~~~-------~~~~~---~~~~~~~~~~~~ri~~--n~~k~iv~~~~~~l~g~p~~~~~-~-~- 90 (469) +.+++ +++.......... ..... .+........+...++ .=....|+..++-+-+-|+.+-- . 
+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~g~~v~~~~al~~~~V~~~v~~Ia~~iA~lp~~~~~~~~~g 80 (454) T protein:vir:93 1 MWNLLRRTRKNQKSGRDVREAGWTSLFQAVAEPFAGAWQQGVKADPEAVLSFHAVFACISLISQDIAKMRLRLMQTDAQG 80 (454) T ss_pred CCCccccCcccccccccccchhhhhhhhhhhhhhcchhhcCcccChHHhhccHHHHHHHHHHHHhhccCceEEEEeccCC Confidence 22222 1111000000000 00000 0000000000000011 11122344444445555665421 1 1 Q ss_pred --h-hhHHHHHHHHhc-cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEEEE Q lcl|NC_010179. 91 --D-ADNKKILDVLGD-DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGVLR 161 (469) Q Consensus 91 --~-~~~~~l~~~~~~-n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~ 161 (469) . .....+..++.+ |. .+....+..+++.+|.+|+++-.+.+|++ .+.+++|..+-++.+++ .++. T Consensus 81 ~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~--g~~~---- 154 (454) T protein:vir:93 81 IRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILDWNRVEPLVADD--GEVF---- 154 (454) T ss_pred ccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCC--CcEE---- Confidence 1 112223444433 32 23344567788999999999988888887 58899999998887643 2222 Q ss_pred EEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccc Q lcl|NC_010179. 162 SYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLA 241 (469) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~ 241 (469) |............. ..+....+.+++.. ...+...|.| T Consensus 155 -y~~~~~~~~~~~~~-~~~~~~eViH~k~~----------------------------------------~~~~~~~G~s 192 (454) T protein:vir:93 155 -YRITPDRNCGITEA-VTVPAREVIHDRFN----------------------------------------CFFHPLIGLP 192 (454) T ss_pred -EEEEecccccccee-EEecCcceEEeccC----------------------------------------CCCCCceecc Confidence 11111111100001 11222233222110 0001224777 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh----h-------hcceeeecccCCCCCCcceE Q lcl|NC_010179. 
242 ELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL----R-------EYKSIKINNAGNGDKSGVDK 310 (469) Q Consensus 242 ~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~----~-------~~~~~~~~~~~~~~~~~~~~ 310 (469) .+......+.....+.....+.+...+.|-.+++-.. ...++....+ . ..+++.++. +.+++- T Consensus 193 p~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~~-----g~~~~~ 266 (454) T protein:vir:93 193 PVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPG-SITEENAKKLKSNWDSGYTGENAGKTAILSN-----GAKYNP 266 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC-CCCHHHHHHHHHHHHHHhcccccCCceeccC-----CceEEE Confidence 7776666666665555555556666666755554322 1112221111 1 111222221 123333 Q ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC Q lcl|NC_010179. 311 LQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSD 390 (469) Q Consensus 311 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~ 390 (469) ++.......+.+..+.....|+..-++|+.-....+..+...++.. ....+..+|.-+++.+...++.+= T Consensus 267 l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~----------~~~f~~~~l~P~~~~ie~~ln~~L 336 (454) T protein:vir:93 267 TTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEAL----------EQQYYSQCLQTLIESIELLLDEAL 336 (454) T ss_pred cccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHH----------HHHHHHHHHHHHHHHHHHHHHHhh Confidence 3333333455566677778898888888753322222222111111 112223333333333333322211 Q ss_pred --CCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHH--------HHHHHHHHHHHHhhh--- Q lcl|NC_010179. 
391 --ADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQ--------QELKDLAKDREENDP--- 453 (469) Q Consensus 391 --~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~--------~E~eri~~E~~~~~~--- 453 (469) .....+++.++.-+..|.++.++.+.++ +|+++.-++.+++++ +++-+ .-++.+.+......+ T Consensus 337 ~~~~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~ 416 (454) T protein:vir:93 337 ETGENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARKRENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFAS 416 (454) T ss_pred cCCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeeeeccCccchHhhhccCcccCCCCC Confidence 1112345555566678999999998887 689998888877643 22111 012222221111111 Q ss_pred hHhhccc----CCCCC----CCCC Q lcl|NC_010179. 454 YANQADE----LNGKG----VDDE 469 (469) Q Consensus 454 ~~~~~~~----~~~~~----~~de 469 (469) ......+ .+.++ .+.| T Consensus 417 ~~~~~~~~~~~~~~d~~~~~~e~~ 440 (454) T protein:vir:93 417 SGKTASVPQAVAASDGNKAITETE 440 (454) T ss_pred CccCCCCCCCCCCCCCCCCccCCc Confidence 1111111 00011 1111 No 194 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=95.80 E-value=0.0014 Score=36.18 Aligned_cols=257 Identities=13% Similarity=0.045 Sum_probs=119.7 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhc--c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCC Q lcl|NC_010179. 79 IASVFPDIDVGKDADNKKILDVLGD--D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATT 151 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~--n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~ 151 (469) +-+-|+.....++.....+..++.. | ..+....+..+.+.+|.+|+.+..+.+|++ .+.+++|..+.+..++. 
T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 3334444433333334445555532 2 223455677889999999999988888875 58889999988876643 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) .. .. +|......|.. ..+....+.+++.. ++. T Consensus 81 ~~--~~----~y~~~~~~g~~-----~~~~~~evih~~~~-------------------------------~~~------ 112 (278) T protein:vir:78 81 SR--EL----YYSIHAATGNK-----LIVHNMDMLHFKHI-------------------------------VAS------ 112 (278) T ss_pred Cc--eE----EEEEEcCCceE-----EEEccccEEEECCC-------------------------------CCC------ Confidence 21 11 12111111111 01222222222110 000 Q ss_pred EecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhc-CceeEEecCCcccchhhhhhh--------h-hcceeeecccC Q lcl|NC_010179. 232 EFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQ-TVILVLTNYGGASLKQFMNDL--------R-EYKSIKINNAG 301 (469) Q Consensus 232 ~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~-~p~l~~~g~~~~~~~~~~~~~--------~-~~~~~~~~~~~ 301 (469) +...|.|.+..+...++........ .+..++ .|-.++... ....++....+ . ..+++.++. T Consensus 113 ---~~~~G~s~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~i~~~~-~~l~~e~~~~~~~~~~~~~~~~g~~~vl~~-- 183 (278) T protein:vir:78 113 ---NMVQGISPIDVLKNTTDFDNAVRTF---NLTEMQKPDSFMLKYG-SNVGKEKRQQVLEDFKQYYEENGGILFQEP-- 183 (278) T ss_pred ---CCeeeccHHHHHHHHHHHHHHHHHH---HHHHhcCCCcEEEEeC-CCCCHHHHHHHHHHHHHHhccCCCceecCC-- Confidence 1124777777766666654443322 233333 343333321 11111111111 1 112332221 Q ss_pred CCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
302 NGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRA 381 (469) Q Consensus 302 ~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~ 381 (469) +.+++.++.+.....+.+..+...+.|+..-++|+.-....++.+...++. .....+..++.-+++. T Consensus 184 ---g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~----------~~~~~~~~~l~P~~~~ 250 (278) T protein:vir:78 184 ---GVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEE----------LNRFYLQHTLLPIVKQ 250 (278) T ss_pred ---CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHH----------HHHHHHHHHHHHHHHH Confidence 123444444334455566677888899998888864332222111111111 1123344455555555 Q ss_pred HHHHhcccCCCc----ccceEEeCCCCC Q lcl|NC_010179. 382 IMRYLNFSDADK----RHISQHWTRTKV 405 (469) Q Consensus 382 i~~~~~~~~~~~----~~i~i~f~~~~p 405 (469) |...++.+=+.. ....+.|+-+.- T Consensus 251 i~~~ln~~L~~~~e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 251 YEEEFNRKLLTKTDREKIGILNLTLNLI 278 (278) T ss_pred HHHHHHhhcCChhHhcCCceEEEecccC Confidence 555554322211 224577764433 No 195 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=95.65 E-value=0.0017 Score=35.80 Aligned_cols=383 Identities=12% Similarity=0.087 Sum_probs=156.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccC---cceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSA---DNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~---~~ri~~n~~k~iv~~~~~ 77 (469) |=+-.=+.+..--+ .+..+.+ ...|.+.... +........ ..-...+.....|+..+. 
T Consensus 1 ~~~~~~~~~~~p~~---~e~~~~~---~~~~~~~~~~-------------~~~~~~~~~~~~~~a~~~~~V~acV~~IA~ 61 (518) T protein:vir:10 1 MLLANGQTLSAPAM---AELSPQM---QDSYYYAPAV-------------GMQLERQFSLYGGIYKNQPWVRTVIAKRAQ 61 (518) T ss_pred CcccCceeecCchh---hhhhhhh---hccccccccc-------------ceecccccchhhHHHhhhHHHHHHHHHHHH Confidence 00000000000000 0000110 1111110000 000000000 000011233444555555 Q ss_pred hhhcCCeee---ccC--chhhHHHHHHHHhc-c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEE Q lcl|NC_010179. 78 YIASVFPDI---DVG--KDADNKKILDVLGD-D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITP 146 (469) Q Consensus 78 ~l~g~p~~~---~~~--~~~~~~~l~~~~~~-n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~ 146 (469) -+-+-|+.+ +.+ .......+..++.+ | .+ .....+..+.+.+|.+|+.+-.+.+|++ .+.+++|..+.+ T Consensus 62 ~iA~lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v 141 (518) T protein:vir:10 62 ALARLPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAI 141 (518) T ss_pred hhccCceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEE Confidence 554555543 111 11223344455543 3 22 2234566788899999999999999986 488999999988 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) ..+... ..+ +|......+... ....+....+.+++... T Consensus 142 ~~~~~~-~~~-----~y~~~~~~~~~~--~~~~~~~~eViHir~~s---------------------------------- 179 (518) T protein:vir:10 142 KRNSRT-GRY-----EYYFQAGAGVGT--QLVSFADDEVVPIRFFN---------------------------------- 179 (518) T ss_pred EEcCCC-CEE-----EEEEEecCCccc--eEEEecCCcEEEecCCC---------------------------------- Confidence 876532 111 111111111111 01112223333322100 Q ss_pred cccEEEecCC-ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh----hhhhh--------hcc Q lcl|NC_010179. 
227 RVPFIEFPKN-KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF----MNDLR--------EYK 293 (469) Q Consensus 227 ~vPvv~~~n~-~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~----~~~~~--------~~~ 293 (469) .+. ..|.|-+..+...|.....+.....+.++..+.|-.++..... ..++. ...+. ..+ T Consensus 180 -------~dg~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~-ls~e~~~~~k~~~~~~~~G~~nag~ 251 (518) T protein:vir:10 180 -------PDGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSEAAQQRLREQFDRAHSGSSNTGK 251 (518) T ss_pred -------CCcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC-CCHHHHHHHHHHHHHHhcCccccCc Confidence 001 1466767666666666555555556666666777666554221 11111 11111 012 Q ss_pred eeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 294 SIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYF 371 (469) Q Consensus 294 ~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~ 371 (469) ++.+. .+++|..... ....+.+..+...+.|+..-++|+.-....++.+...++... ...+ T Consensus 252 v~vL~-------~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~----------~~f~ 314 (518) T protein:vir:10 252 TMVVE-------EGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQM----------RAFY 314 (518) T ss_pred ceEcC-------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHH----------HHHH Confidence 23232 2244444333 334456666777788988888886433211111111111111 1222 Q ss_pred HHHHHHHHHHHHHHhccc---C-CCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-H-- Q lcl|NC_010179. 372 EHAINELVRAIMRYLNFS---D-ADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ-E-- 440 (469) Q Consensus 372 ~~~l~~~~~~i~~~~~~~---~-~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~-E-- 440 (469) ..+|.-+++.|...++.+ . .....+++..+.-+..|.++.++++.++ +|+++.-.+.++++. ++++.. 
+ T Consensus 315 ~~tL~P~l~~ie~~ln~~L~~~~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~~~gD~~~ 394 (518) T protein:vir:10 315 RDTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELY 394 (518) T ss_pred HHHHHHHHHHHHHHHHHhhcccccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeee Confidence 333433333333333211 1 1112344444455678999999998876 678999888887653 333211 1 Q ss_pred ----HHHHHHHHH-----HhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 441 ----LKDLAKDRE-----ENDPYANQADELNGKGVDDE 469 (469) Q Consensus 441 ----~eri~~E~~-----~~~~~~~~~~~~~~~~~~de 469 (469) +..+..-.. ...+.............+++ T Consensus 395 ~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 432 (518) T protein:vir:10 395 ANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQS 432 (518) T ss_pred ecccceecccccccccCCCCCCCCCCCCcccccccccc Confidence 111110000 00011100011011111111 No 196 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=95.62 E-value=0.0017 Score=35.72 Aligned_cols=357 Identities=13% Similarity=0.084 Sum_probs=148.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+ ++++...... ..+.+. +...... ......+..... 
..-+..+-....|+..+.-+- T Consensus 1 Mg~------f~~~~~~~~~-------~~~~~~---~~~~~~~--~~~~~~~~~v~~---~~~l~~~~v~~~i~~ia~~ia 59 (382) T protein:vir:48 1 MPI------FNLATESPPD-------NQGGFF---DVVDSDF--LASLKGNEWVSA---ETALRNSDLFSIINQLSNDLA 59 (382) T ss_pred Ccc------ccccccCCcc-------cccccc---cchhhhc--cccccCCcccch---HhhhccHHHHHHHHHHHHhhc Confidence 222 0110000000 000000 0000000 000000000000 000111222334555555555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc-c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD-D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~-n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~ 154 (469) +-|+...-. .. + .++.. | ..+....+..+++.+|.+|+++-.+.+|++ .+.+++|..+-++.+... . T Consensus 60 ~~~~~~~~~--~~-~---~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~~~-~ 132 (382) T protein:vir:48 60 TVKLITSRK--KL-Q---GIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNK-D 132 (382) T ss_pred cCceeeecc--hh-h---hhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-C Confidence 555554322 11 1 12221 2 123345667788999999999988888886 688899999888765432 1 Q ss_pred ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 155 KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 155 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) .+ +|......... . ....+....+.+++.-. .. T Consensus 133 ~~-----~y~~~~~~~~~-~-~~~~~~~~evih~~~~~----------------------------------------~~ 165 (382) T protein:vir:48 133 GI-----YYNITFDDPRI-P-PKQHVPQNDVLHFRLLS----------------------------------------VD 165 (382) T ss_pred eE-----EEEEEecCccc-c-ceeEEcCccEEEecCCC----------------------------------------CC Confidence 11 12111111000 0 00112222222221100 00 Q ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh---------hcceeeecccCCCCC Q lcl|NC_010179. 
235 KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR---------EYKSIKINNAGNGDK 305 (469) Q Consensus 235 n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~ 305 (469) ....|.|-+..+...++....+..-..+.++..+.|-.+++-......+ ....+. ..+++.++. + T Consensus 166 ~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~~~~~e-~~~~~~~~~~~~~~n~g~~~vl~~-----g 239 (382) T protein:vir:48 166 GGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGILKIKGGGLLD-FKTKLSRSRQAMKQMQGGPLVLDD-----L 239 (382) T ss_pred CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChH-HHHHHHHHHHhhccCCCCeeEcCC-----C Confidence 0134777788778878777766666777777778886666542222211 111111 122222221 1 Q ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 306 SGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNA--SGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIM 383 (469) Q Consensus 306 ~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~--Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~ 383 (469) .+++-+........+.+..+...+.|+..-++|+.-....++. +..+. ...+..+|.-+++.+. T Consensus 240 ~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~--------------~~~~~~~l~p~~~~i~ 305 (382) T protein:vir:48 240 EDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMS--------------SDLYSKAVSRYLRPFL 305 (382) T ss_pred ceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHH--------------HHHHHHHHHHHHHHHH Confidence 2333333333344556677788889999888887544322221 11121 2233344444444444 Q ss_pred HHhcccCCCccc--ceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHhhhhHh Q lcl|NC_010179. 384 RYLNFSDADKRH--ISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVDDWQQELKDLAKDREENDPYAN 456 (469) Q Consensus 384 ~~~~~~~~~~~~--i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~d~~~E~eri~~E~~~~~~~~~ 456 (469) ..++.+=..... +...+. 
.+.......+.++ +|++++-++.+.+ |+.++..-+.+ + T Consensus 306 ~~l~~~l~~~~~~~~~~~~~----~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~~~~~-------------~ 368 (382) T protein:vir:48 306 SELSQKLSCDVDADIFPAVD----PTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKELPNGE-------------N 368 (382) T ss_pred HHHHHHhcChhhhhhhhhhc----cchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcchhhhh-------------c Confidence 333321111111 111111 1223344445454 6788888887754 55443211110 0 Q ss_pred hcccCCCCCCCCC Q lcl|NC_010179. 457 QADELNGKGVDDE 469 (469) Q Consensus 457 ~~~~~~~~~~~de 469 (469) .....++++.+++ T Consensus 369 ~~~~~~GGd~~~~ 381 (382) T protein:vir:48 369 PNSTLKGGEEDGQ 381 (382) T ss_pred CCCCCCCCCCCCC Confidence 1122344444444 No 197 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=95.59 E-value=0.0018 Score=35.65 Aligned_cols=357 Identities=13% Similarity=0.023 Sum_probs=160.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+....-.+++. . ..+ +.|+ .. . ......-.+.+....+. T Consensus 41 ~~~~~~~~iLr~-----~---~~~----~ly~----------------------~m-----~-~D~hi~s~l~~Rk~av~ 80 (448) T protein:vir:77 41 VVDREFDELLQG-----K---DGL----LVYH----------------------KM-----L-SDGTVKNALNYIFGRIR 80 (448) T ss_pred ccccchhHhhcc-----c---cch----HHHH----------------------HH-----h-hChHHHHHHHHHHHHHh Confidence 222211111110 0 000 0010 00 0 12444445555566777 Q ss_pred cCCeeeccCch-----hhHHHHHHHHhc--------cHHHHHHHHHHHHHhCCeEEE-EEEE-cCCCceEEEEE---ccc Q lcl|NC_010179. 
81 SVFPDIDVGKD-----ADNKKILDVLGD--------DRALTLNSLLVDSSNAGRAWL-HYWI-DEDNNFRYGII---QPD 142 (469) Q Consensus 81 g~p~~~~~~~~-----~~~~~l~~~~~~--------n~~~~~~~~~~~~~~~G~~~~-~v~~-d~~~~~~i~~~---~p~ 142 (469) |.+..+.+.++ ...+++.+++.. +|.+.+.+ ..++..+|.++. ++|. ..+|...+..+ ++. T Consensus 81 ~~~w~v~p~~~~~~d~~~ae~v~~~l~~~~~~~~~~~f~~~i~~-~lda~~~G~s~~Eivw~~~~dg~~~~~~l~~r~~~ 159 (448) T protein:vir:77 81 SAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAI-YENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPF 159 (448) T ss_pred cCCceEecCCCCHHHHHHHHHHHHHhhchhhhhccCCHHHHHHH-HHHhhhhcceeEEEEEeecCCCceeeccccccCCC Confidence 88888764322 233455565543 23333333 357888997654 5563 35566532222 221 Q ss_pred eeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccc Q lcl|NC_010179. 143 QITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (469) Q Consensus 143 ~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (469) . +++|. .+.++.... ........ +. ....+..+ T Consensus 160 ~----------------~~~f~-~~~~~~l~~--------------~~~~~~~~---------------~~-~~~~~~~~ 192 (448) T protein:vir:77 160 N----------------IDEVL-YDEEGGPKA--------------LKLSGEVK---------------GG-SQFVNGLE 192 (448) T ss_pred c----------------cceee-eecCCceEE--------------EecCCccc---------------cc-ccCCCccc Confidence 1 11111 111111100 00000000 00 00001112 Q ss_pred ccCCcccEEEec--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc-hhh-------hhhhh-- Q lcl|NC_010179. 223 HNFGRVPFIEFP--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL-KQF-------MNDLR-- 290 (469) Q Consensus 223 ~~~g~vPvv~~~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~-~~~-------~~~~~-- 290 (469) -+++++=+.+.. -++.|.|.+..+-...--=+..+.+++.-++.++.|+++.+...+.+. .+. ...+. 
T Consensus 193 lP~~~~i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~~g 272 (448) T protein:vir:77 193 IPIWKTVVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFVQK 272 (448) T ss_pred cccceEEEEecCCcCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCCCCCHHHHHHHHHHHHHHhcC Confidence 234443222211 145678888877776666677888999999999999998875333221 111 11111 Q ss_pred hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTY 370 (469) Q Consensus 291 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~ 370 (469) ....+.++.+ ..+++++.......+...++...+.|...--+..++.++.|+.++.+......-.......-.+. T Consensus 273 ~~a~~iiP~g-----~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa~~ 347 (448) T protein:vir:77 273 PRHGIILPDD-----WKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAVNIGEFVSLTQQTIISLQRE 347 (448) T ss_pred CceEEEecCC-----ceEEEEecCCCccCHHHHHHHHHHHHHHHHhccccccccccchhhhhhhhHHHHHHHHHHHHHHH Confidence 2233344443 46888887666566777889999999876555444433333322223221111111222333444 Q ss_pred HHHHHH-HHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHH Q lcl|NC_010179. 371 FEHAIN-ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSKEAVAKANPIVDDWQQELKDLAKDRE 449 (469) Q Consensus 371 ~~~~l~-~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~eri~~E~~ 449 (469) +...+. +++.-++. +|.. .+..-..+.|...-+.|.+..++.+.++++.+- +.++ +..+..+. T Consensus 348 i~~tln~~Li~~l~~-lNfg-~~~~~P~~~f~~~e~eDl~~~a~~~~~l~~~~~-----~~~~-ip~~~~~~-------- 411 (448) T protein:vir:77 348 FASAVNLYLIPKLVL-PNWP-GATRFPRLTFEMEERNDFSAAANLMGMLINAVK-----DSED-IPTELKAL-------- 411 (448) T ss_pred HHHHHHHHHHHHHHH-hcCC-CCCCCCEEEecCCChhhHHHHHHHhHHHHHHHH-----HHhc-CCccCCcC-------- Confidence 555564 46665554 3422 222235788988888999999999888875421 1111 11110000 Q ss_pred HhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
450 ENDPYANQADELNGKGVDDE 469 (469) Q Consensus 450 ~~~~~~~~~~~~~~~~~~de 469 (469) +...........+++++ T Consensus 412 ---~~~~~~~~~~~~~~~~~ 428 (448) T protein:vir:77 412 ---IDALPSKMRRALGVVDE 428 (448) T ss_pred ---CCCCchhcccccCCCCC Confidence 00000001111111111 No 198 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=95.46 E-value=0.002 Score=35.34 Aligned_cols=396 Identities=12% Similarity=0.076 Sum_probs=170.1 Q ss_pred CCHHHHH----HHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 MELDALK----KLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~~~~~----~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) |+.|.-. .+.+.... ....+....-|. ..-.++.+........-.-. .... -......-...+.. T Consensus 1 ~~~~~~~~p~~~~~~~~~~----~~~~~~~~~g~~-~~D~~lr~~gg~~~~~~~l~-~~m~-----e~D~~v~s~l~~Rk 69 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIY----AMEHLGLATSYL-SEDGGYKRAGKPTYQQLSAW-DEAA-----QTEPIIAQGLDSIA 69 (446) T ss_pred CcccccCCCchhhhhhhhh----ccccchhhcccC-CcchHhhhcCCChHHHHHHH-HHHH-----hcchHHHHHHHHHH Confidence 4443221 12111111 111111111111 11122222111000000000 0000 01344444555555 Q ss_pred HhhhcCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEE-EEEEcCCCceE-EEEE------ccceeEEEE Q lcl|NC_010179. 77 GYIASVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWL-HYWIDEDNNFR-YGII------QPDQITPVY 148 (469) Q Consensus 77 ~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~-~v~~d~~~~~~-i~~~------~p~~~~~~~ 148 (469) .-+.|-+.++...+++..+++.+++++-..+.......++.-+|.++. ++|.-..|.-. .++. 
.|...--.| T Consensus 70 ~av~~~~w~V~p~~~~~a~~v~~~l~~~~~~~~~~~~ldai~~G~s~~Eivw~~~~g~~~p~~~~d~~~~~~~~~~r~~~ 149 (446) T protein:vir:98 70 LSVLNKVGPYQHGDKRIKKFIDDQLRNRAKTWISHCVKSIMTYGFSLSEQIYAHGARDNMPATVLDDIVNYHPLQVMLIA 149 (446) T ss_pred HHhhcCCceecCccHHHHHHHHHHHhhcCchhHHHHHHHHHhhCceeeeEEEeecccccccchhhccccccccccceeee Confidence 667788889988888888889999876544444444568888997554 56644333211 1111 111110011 Q ss_pred eCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcc Q lcl|NC_010179. 149 ATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRV 228 (469) Q Consensus 149 d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~v 228 (469) +... .+.. +.. .+...+. ...+....+.... ......... ....+-|..++ T Consensus 150 ~~~~--~~~~-----------~~~----------~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~---g~~~~iP~~kf 200 (446) T protein:vir:98 150 NDNG--RIVD-----------GDT----------VTASQYK--SGYWVPLPPYRIG-DPPKKVDVV---GSHVRLPSHKR 200 (446) T ss_pred ccCC--cccc-----------ccc----------cchhhcc--cccccCcccchhh-hhhhhcccC---cccccccccce Confidence 1100 0000 000 0000000 0000000000000 000000000 00011122222 Q ss_pred cEEEec---CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchh----------------hhh-- Q lcl|NC_010179. 229 PFIEFP---KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQ----------------FMN-- 287 (469) Q Consensus 229 Pvv~~~---n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~----------------~~~-- 287 (469) =+..+. .++.|.|.+..+--.---=+..+-.++.-++.++.|+.+.+-..+....+ ... 
T Consensus 201 i~~~~~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av 280 (446) T protein:vir:98 201 LFINYNTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDAL 280 (446) T ss_pred EEEEecCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHH Confidence 122221 24678888776666555567777888889999999998876432211100 111 Q ss_pred -hhhhcceeeecccCCCCCCcceEEeecCC-HHHHHHHHHHHHHHHHHHhCCCCcCc--ccc---CCccHHHHHHHHHHH Q lcl|NC_010179. 288 -DLREYKSIKINNAGNGDKSGVDKLQIDIP-VEARDDALKITRDNIFLFGQGIDPAN--FES---SNASGVAIKMLYSHL 360 (469) Q Consensus 288 -~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~--~~~---g~~Sg~Al~~~~~~l 360 (469) .+...++..++.....++..+++++.... ...++..++.+.++|.+..-+..+.. ... ++.-|..-.-. . T Consensus 281 ~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl~~~~~~~GS~ala~vh~~V---~ 357 (446) T protein:vir:98 281 RRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLVQNRETTFGTGRASEIQLEL---F 357 (446) T ss_pred HhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccccccccccchhhhHHHHHHH---H Confidence 11222333332222223457888876654 34688999999999988654443322 111 11112221111 1 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHHhcccCCC-ccc---ceEEeCCCCCCCHHHHHHHHHHHh--cc-CC--hHHHHHh Q lcl|NC_010179. 361 ELKAAKTQTYFEHAIN-ELVRAIMRYLNFSDAD-KRH---ISQHWTRTKVEDSLTKAQIVSTVA--NY-SS--KEAVAKA 430 (469) Q Consensus 361 ~~k~~~~~~~~~~~l~-~~~~~i~~~~~~~~~~-~~~---i~i~f~~~~p~d~~e~~~~~~kl~--g~-iS--~et~~~~ 430 (469) ...++.-.+.+...+. ++++-++. +|..... ... -.+.|...-+.|.+..++.+.++. |+ ++ .+.+.++ T Consensus 358 ~d~~~aDa~~i~~tln~~Li~~l~~-lNf~~~~~~~~~~~~~~~~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~ 436 (446) T protein:vir:98 358 DGKINSIFDTVIHAFTEQVIGNLIR-LNFDPALYPLASNTGYITRLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSI 436 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-hCCCccccccccccccceeccCChhhHHHHHHHHHHHHhCCccccccHHHHHHH Confidence 1223334455666664 57776666 3433222 111 134566667889999999998874 54 44 3445556 Q ss_pred CCCCCCHHHHH Q lcl|NC_010179. 
431 NPIVDDWQQEL 441 (469) Q Consensus 431 l~~v~d~~~E~ 441 (469) ++ +++++.-- T Consensus 437 ~g-iP~~~~~~ 446 (446) T protein:vir:98 437 TG-LPDAISST 446 (446) T ss_pred hC-cCCCCCCC Confidence 65 33221111 No 199 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=95.25 E-value=0.0024 Score=34.91 Aligned_cols=353 Identities=10% Similarity=0.092 Sum_probs=145.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccc-cc-ccccc-Ccce--eccchHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEG-KK-DPLRS-ADNR--IPSNFYQLLVDQE 75 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~-~~-~~~~~-~~~r--i~~n~~k~iv~~~ 75 (469) |-+= ..-....+................ .. ...+. .+.+ +..+-....|+.. T Consensus 1 M~~f-----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~v~~~i~~i 57 (386) T protein:vir:48 1 MPIF-----------------------NITNLATESPPISQGGFFDITDPDFLSTLNGSEWVSAESALRNSDLFSIINQL 57 (386) T ss_pred Cccc-----------------------ccccccccccccccccccccccchhcccccCCceechhhhhcchHHHHHHHHH Confidence 1100 000000000000000000000000 00 00000 0000 1112222334444 Q ss_pred HHhhhcCCeeeccCchhhHHHHHHHHhc-c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEe Q lcl|NC_010179. 76 AGYIASVFPDIDVGKDADNKKILDVLGD-D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYA 149 (469) Q Consensus 76 ~~~l~g~p~~~~~~~~~~~~~l~~~~~~-n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d 149 (469) +.-+-+-|+.+. +.. ...++.. 
| ..+....+..+.+.+|.+|+.+-.+.+|++ .+.+++|..+-+..+ T Consensus 58 a~~ia~~p~~~~--~~~----~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~ 131 (386) T protein:vir:48 58 SNDLATVKLTAS--RKQ----LQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRL 131 (386) T ss_pred HHhhccCceeec--cch----hHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEc Confidence 444444455433 222 2223322 2 123344667788999999999888888875 588889998887765 Q ss_pred CCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_010179. 150 TTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVP 229 (469) Q Consensus 150 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 229 (469) ... ..+ +|....... .. .....+....+ T Consensus 132 ~~~-~~~-----~y~~~~~~~-~~-~~~~~~~~~ev-------------------------------------------- 159 (386) T protein:vir:48 132 DNK-DGI-----YYNITFDDP-RI-PPKQHVPQGDV-------------------------------------------- 159 (386) T ss_pred CCC-ceE-----EEEEEecCc-cc-cceeEecCccE-------------------------------------------- Confidence 432 111 111111110 00 00011222222 Q ss_pred EEEecCC-----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhh---------ccee Q lcl|NC_010179. 230 FIEFPKN-----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLRE---------YKSI 295 (469) Q Consensus 230 vv~~~n~-----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~---------~~~~ 295 (469) +|+++. -.|.|.+..+...+.....+..-..+.+...+.|-.+++-...... +....+.. 
.+++ T Consensus 160 -ih~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~~~~~-e~~~~~~~~~~~~~~n~g~~~ 237 (386) T protein:vir:48 160 -LHFKLLSVDGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGILKIKGGGLL-DFKTKLSRSRQAMKQMQGGPL 237 (386) T ss_pred -EEecCCCCCCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCH-HHHHHHHHHHHHhhcCCCCce Confidence 333211 2467777777666666665555566666666777666654332222 22111111 1111 Q ss_pred eecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 296 KINNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFE 372 (469) Q Consensus 296 ~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~ 372 (469) .+ .++++|.....+ ...+.+..+...+.|+..-++|+.-....++ .+.... ....+. T Consensus 238 vl-------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~e~~-------------~~~~~~ 297 (386) T protein:vir:48 238 VL-------DDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQSSLEM-------------SLDLYN 297 (386) T ss_pred ec-------CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHH-------------HHHHHH Confidence 11 123455444433 3345666778888999988898754322121 111110 011223 Q ss_pred HHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCC--CCCCHHHHHHHHHHHH Q lcl|NC_010179. 373 HAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANP--IVDDWQQELKDLAKDR 448 (469) Q Consensus 373 ~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~--~v~d~~~E~eri~~E~ 448 (469) .+|.-+++.|...++.+=.. .++..+...+..+....+..+.++ +|+++.-++.+.++ .++. .|+.+.. T Consensus 298 ~~l~P~~~~ie~~l~~~l~~--~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~~~--~~~~~~~--- 370 (386) T protein:vir:48 298 KAVSRYLRPFLSELSQKLSC--DVDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEILP--KELPEGE--- 370 (386) T ss_pred HHHHHHHHHHHHHHHHhhcc--hhhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCCCC--ccchhhc--- Confidence 33333333333333221111 112222222334555566666665 68999988887653 2322 1221111 Q ss_pred HHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
449 EENDPYANQADELNGKGVDDE 469 (469) Q Consensus 449 ~~~~~~~~~~~~~~~~~~~de 469 (469) ... ..+.++++.+++ T Consensus 371 ---~~~---~~~~~gGd~~~~ 385 (386) T protein:vir:48 371 ---NPN---KTTLKGGEINGE 385 (386) T ss_pred ---CCC---CCccCCCCCCCC Confidence 111 112223333333 No 200 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=95.16 E-value=0.0026 Score=34.72 Aligned_cols=363 Identities=12% Similarity=0.059 Sum_probs=153.0 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcce--eccchHHHHHHHHHHhhhcCCe Q lcl|NC_010179. 9 LIRNTSTSRND--LINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNR--IPSNFYQLLVDQEAGYIASVFP 84 (469) Q Consensus 9 ~i~~~~~~~~~--~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~r--i~~n~~k~iv~~~~~~l~g~p~ 84 (469) ++-.+.....+ ..+.......++....+ .................+ +..+-....|+..++-+-+-|+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~ 72 (392) T protein:vir:39 1 MILPILNFINQTNDPPEVGSVQSYFPDGND--------AQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred CcchhhhhhhcccccccccccccccccCch--------hhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCce Confidence 11111110000 00000000001100000 000000000000000001 1112233345555555555555 Q ss_pred eeccCchhhHHHHHHHHh-ccH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 85 DIDVGKDADNKKILDVLG-DDR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 85 ~~~~~~~~~~~~l~~~~~-~n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) .+.-. ... .++. -|. ......+..+.+.+|.+|+++-.+.+|++ .+.+++|..+-+..+... 
..+ T Consensus 73 ~~~~~--~~~----~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~-~~~-- 143 (392) T protein:vir:39 73 NAEKK--KNQ----GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE-NGM-- 143 (392) T ss_pred eeccc--hhh----hHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceE-- Confidence 54322 111 2222 232 22334567788999999999988998986 588899999888776432 111 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) +|......+.. .....+....+.+++... ...... T Consensus 144 ---~y~~~~~~~~~--~~~~~~~~~eiih~~~~~----------------------------------------~~~~~~ 178 (392) T protein:vir:39 144 ---YYNITFDDPKI--EPILQAPQSDLIHMKLLS----------------------------------------IDGGKT 178 (392) T ss_pred ---EEEEEecCccc--ceeEEEccccEEEecCCC----------------------------------------CCCccc Confidence 11111111100 001112223333221100 000124 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC-Ccccchhhhhhhh--------hcceeeecccCCCCCCcce Q lcl|NC_010179. 239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY-GGASLKQFMNDLR--------EYKSIKINNAGNGDKSGVD 309 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~-~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~ 309 (469) |.|-+..+...++....+..-..+.++..+.|-.+++=. +....++....+. ..+++.++ ++++ T Consensus 179 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-------~g~~ 251 (392) T protein:vir:39 179 GISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-------DLEE 251 (392) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecC-------CCce Confidence 777777777777666666555566666677775444321 1112121111111 11222222 2344 Q ss_pred EEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010179. 
310 KLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYL 386 (469) Q Consensus 310 ~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~ 386 (469) |..... ....+.+..+...+.|+..-++|+.-....+. .|.. +.....+..+|.-+++.|...+ T Consensus 252 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-------------~~~~~f~~~~l~P~~~~ie~~l 318 (392) T protein:vir:39 252 FTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-------------QQISGMYASALNRYLRPAISEL 318 (392) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 444333 34455677777888998888888654332221 1211 1112234445555555554444 Q ss_pred cccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHhhhhHhhcccC Q lcl|NC_010179. 387 NFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVDDWQQELKDLAKDREENDPYANQADEL 461 (469) Q Consensus 387 ~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~d~~~E~eri~~E~~~~~~~~~~~~~~ 461 (469) +.+=... +++....-.-.|..+.++.+.++ +|+++...+.+.+ |+..+ |+.+. +..... T Consensus 319 ~~~L~~~--~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~~-----------e~l~~~ 382 (392) T protein:vir:39 319 EYKLSDH--ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPAP-----------ENTNKK 382 (392) T ss_pred HHhcccc--ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccchh-----------cCCCCC Confidence 3221111 22222222334666777777776 6789988877654 66543 33211 111222 Q ss_pred CCCCCCCC Q lcl|NC_010179. 462 NGKGVDDE 469 (469) Q Consensus 462 ~~~~~~de 469 (469) ++++.++. 
T Consensus 383 ~~Gd~~~p 390 (392) T protein:vir:39 383 TTGQSNEP 390 (392) T ss_pred CCCCCCCC Confidence 33322222 No 201 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=95.16 E-value=0.0026 Score=34.72 Aligned_cols=363 Identities=12% Similarity=0.059 Sum_probs=153.0 Q ss_pred HHHHHHHHHHH--HHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcce--eccchHHHHHHHHHHhhhcCCe Q lcl|NC_010179. 9 LIRNTSTSRND--LINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNR--IPSNFYQLLVDQEAGYIASVFP 84 (469) Q Consensus 9 ~i~~~~~~~~~--~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~r--i~~n~~k~iv~~~~~~l~g~p~ 84 (469) ++-.+.....+ ..+.......++....+ .................+ +..+-....|+..++-+-+-|+ T Consensus 1 m~m~~f~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~v~~~~al~~~~v~~~i~~ia~~ia~lp~ 72 (392) T protein:vir:10 1 MILPILNFINQTNDPPEVGSVQSYFPDGND--------AQIMESLLGDNNEWVSARAALRNSDLFSIILQLSSDLAIVKI 72 (392) T ss_pred CcchhhhhhhcccccccccccccccccCch--------hhhhhhhcCCCCceechHHhhccHHHHHHHHHHHHhhccCce Confidence 11111110000 00000000001100000 000000000000000001 1112233345555555555555 Q ss_pred eeccCchhhHHHHHHHHh-ccH----HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEE Q lcl|NC_010179. 85 DIDVGKDADNKKILDVLG-DDR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLG 158 (469) Q Consensus 85 ~~~~~~~~~~~~l~~~~~-~n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~ 158 (469) .+.-. ... .++. -|. ......+..+.+.+|.+|+++-.+.+|++ .+.+++|..+-+..+... 
..+ T Consensus 73 ~~~~~--~~~----~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~-~~~-- 143 (392) T protein:vir:10 73 NAEKK--KNQ----GIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYE-NGM-- 143 (392) T ss_pred eeccc--hhh----hHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-ceE-- Confidence 54322 111 2222 232 22334567788999999999988998986 588899999888776432 111 Q ss_pred EEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCcc Q lcl|NC_010179. 159 VLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKY 238 (469) Q Consensus 159 ~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 238 (469) +|......+.. .....+....+.+++... ...... T Consensus 144 ---~y~~~~~~~~~--~~~~~~~~~eiih~~~~~----------------------------------------~~~~~~ 178 (392) T protein:vir:10 144 ---YYNITFDDPKI--EPILQAPQSDLIHMKLLS----------------------------------------IDGGKT 178 (392) T ss_pred ---EEEEEecCccc--ceeEEEccccEEEecCCC----------------------------------------CCCccc Confidence 11111111100 001112223333221100 000124 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC-Ccccchhhhhhhh--------hcceeeecccCCCCCCcce Q lcl|NC_010179. 239 RLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY-GGASLKQFMNDLR--------EYKSIKINNAGNGDKSGVD 309 (469) Q Consensus 239 g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~-~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~ 309 (469) |.|-+..+...++....+..-..+.++..+.|-.+++=. +....++....+. ..+++.++ ++++ T Consensus 179 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~vl~-------~g~~ 251 (392) T protein:vir:10 179 GISPLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRSRSFMKRSRSGGPVVLD-------DLEE 251 (392) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHHHHHhccccCCCeeecC-------CCce Confidence 777777777777666666555566666677775444321 1112121111111 11222222 2344 Q ss_pred EEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCC-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010179. 
310 KLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSN-ASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYL 386 (469) Q Consensus 310 ~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~-~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~ 386 (469) |..... ....+.+..+...+.|+..-++|+.-....+. .|.. +.....+..+|.-+++.|...+ T Consensus 252 ~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~-------------~~~~~f~~~~l~P~~~~ie~~l 318 (392) T protein:vir:10 252 FTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSI-------------QQISGMYASALNRYLRPAISEL 318 (392) T ss_pred EEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHH-------------HHHHHHHHHHHHHHHHHHHHHH Confidence 444333 34455677777888998888888654332221 1211 1112234445555555554444 Q ss_pred cccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHhhhhHhhcccC Q lcl|NC_010179. 387 NFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVDDWQQELKDLAKDREENDPYANQADEL 461 (469) Q Consensus 387 ~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~d~~~E~eri~~E~~~~~~~~~~~~~~ 461 (469) +.+=... +++....-.-.|..+.++.+.++ +|+++...+.+.+ |+..+ |+.+. +..... T Consensus 319 ~~~L~~~--~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~---e~r~~-----------e~l~~~ 382 (392) T protein:vir:10 319 EYKLSDH--ISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK---DLPAP-----------ENTNKK 382 (392) T ss_pred HHhcccc--ccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc---ccchh-----------cCCCCC Confidence 3221111 22222222334666777777776 6789988877654 66543 33211 111222 Q ss_pred CCCCCCCC Q lcl|NC_010179. 462 NGKGVDDE 469 (469) Q Consensus 462 ~~~~~~de 469 (469) ++++.++. 
T Consensus 383 ~~Gd~~~p 390 (392) T protein:vir:10 383 TTGQSNEP 390 (392) T ss_pred CCCCCCCC Confidence 33322222 No 202 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=94.84 E-value=0.0034 Score=34.13 Aligned_cols=384 Identities=12% Similarity=0.077 Sum_probs=156.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccC---cceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSA---DNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~---~~ri~~n~~k~iv~~~~~ 77 (469) |=+-.=+.+..--+ .+..++ +.+-|-+.-.. +........ ..-...+.....|+.++. T Consensus 1 ~~~~~~~~~~~p~~---~~~~~~---~~~~~~~~~~~-------------g~~~~~~~~~~~~~~~~~~~V~acV~~IA~ 61 (518) T protein:vir:78 1 MLLANGQTLSAPAM---AELSPQ---MQDSYYYAPAV-------------GMQLERQFSLYGGIYKNQPWVRTVIAKRAQ 61 (518) T ss_pred CcccCceeeccchh---hhhhhh---hhhccccccee-------------ceecccccchhhHHhhhhHHHHHHHHHHHH Confidence 00000000000000 000011 11111111000 000000000 000011233445555555 Q ss_pred hhhcCCeeecc--Cc---hhhHHHHHHHHhc-cH-H---HHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEE Q lcl|NC_010179. 78 YIASVFPDIDV--GK---DADNKKILDVLGD-DR-A---LTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITP 146 (469) Q Consensus 78 ~l~g~p~~~~~--~~---~~~~~~l~~~~~~-n~-~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~ 146 (469) -+-+-|+.+-- ++ +.....+..++.+ |. + +....+..+.+.+|.+|+++-.+.+|++. 
+.+++|..+.+ T Consensus 62 ~iA~lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv 141 (518) T protein:vir:78 62 ALARLPVKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAI 141 (518) T ss_pred hhccCceEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEE Confidence 55555655411 11 1122334445543 32 2 22345667888999999999889988864 88999999988 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) ..+... ..+. ++|......+... ..+....+.+++.-+ + T Consensus 142 ~~~~~~-~~~~---y~~~~~~~~~~~~----~~~~~~eIiHir~~~-------------------------------~-- 180 (518) T protein:vir:78 142 KRNSRT-GRYE---YYFQAGAGVGTQL----VSFADDEVVPIRFFN-------------------------------P-- 180 (518) T ss_pred EEcCCC-CEEE---EEEEecCCcccee----EEecCCcEEEecCCC-------------------------------C-- Confidence 877532 1111 1111111111111 112222232221100 0 Q ss_pred cccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh----hh--------hcce Q lcl|NC_010179. 227 RVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND----LR--------EYKS 294 (469) Q Consensus 227 ~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~----~~--------~~~~ 294 (469) .....|.|-+..+...+.....+.....+.+...+.|-.+++.... ..++.... +. ..++ T Consensus 181 -------dg~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~~-ls~e~~~~~k~~~~~~~~G~~nag~~ 252 (518) T protein:vir:78 181 -------DGLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEKR-LSPEAQQRLREQFDRAHAGSSNTGKT 252 (518) T ss_pred -------CcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCCC-CCHHHHHHHHHHHHHHhcCcccCCce Confidence 0001366666666666666555555566666667778666654322 11221111 11 1122 Q ss_pred eeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
295 IKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFE 372 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~ 372 (469) +.+.. +++|..... ....+.+..+.....|+..-++|+.-....++.+...++.. ....+. T Consensus 253 ~vL~~-------G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~----------~~~f~~ 315 (518) T protein:vir:78 253 MVVEE-------GMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQ----------MRAFYR 315 (518) T ss_pred eEcCC-------CceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHH----------HHHHHH Confidence 33321 244443333 33445555666778888888888643321222221111111 112223 Q ss_pred HHHHHHHHHHHHHhccc---CC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH-H--- Q lcl|NC_010179. 373 HAINELVRAIMRYLNFS---DA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ-E--- 440 (469) Q Consensus 373 ~~l~~~~~~i~~~~~~~---~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~-E--- 440 (469) .+|.-++..|...++.+ .. ....+++..+.-+..|.++.++++.++ +|+++.-.+.++++. ++++.. + T Consensus 316 ~tL~P~~~~ie~eln~~L~~~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~~~gD~~~v 395 (518) T protein:vir:78 316 DTMAIPIARIQSAMDKYVGQYWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDDPKADELYA 395 (518) T ss_pred HHHHHHHHHHHHHHHHhhcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeee Confidence 33333333333332221 11 112334444455678999999999886 678999888887653 333211 1 Q ss_pred ---HHHHHHHHH-----HhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 441 ---LKDLAKDRE-----ENDPYANQADELNGKGVDDE 469 (469) Q Consensus 441 ---~eri~~E~~-----~~~~~~~~~~~~~~~~~~de 469 (469) +..+..-.. ...+...........+.++. 
T Consensus 396 ~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 432 (518) T protein:vir:78 396 NSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQS 432 (518) T ss_pred cccceecccccccccCCCCCCCCCCCCcccccccccC Confidence 111110000 00011101111111111111 No 203 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=94.47 E-value=0.0043 Score=33.55 Aligned_cols=349 Identities=14% Similarity=0.062 Sum_probs=148.1 Q ss_pred CCHHHHHHHHHHHH-HHHHHH--H-HHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTS-TSRNDL--I-NNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~~~~~~~i~~~~-~~~~~~--~-~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) |-+= ..+. .+...+ . ..-.....+..|. .....-....-+..+-....|+..+ T Consensus 1 Mg~~------~~~~~~k~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~v~~~~~l~~~~v~~~i~~ia 57 (383) T protein:vir:10 1 MGLL------TPKNFSKRNAKNMVYPSNPAFFTTTVGG-----------------MQLSYVSALSALQNTNVYSVINRIA 57 (383) T ss_pred CCcc------cccccccccccccccccchhhhhhhccC-----------------ccccccchhHhhcchHHHHHHHHHH Confidence 3221 0000 000000 0 0000000000000 0000000000011122233445555 Q ss_pred HhhhcCCeeeccCchhhHHHHHHHHhc-c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCC Q lcl|NC_010179. 77 GYIASVFPDIDVGKDADNKKILDVLGD-D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATT 151 (469) Q Consensus 77 ~~l~g~p~~~~~~~~~~~~~l~~~~~~-n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~ 151 (469) +-+-+-|+.+. +... ..++.. | ..+....+..+++.+|.+|+++..+ ...+...+|..+-+..+.. 
T Consensus 58 ~~ia~~~~~~~--~~~~----~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~~~---~~~~~p~~~~~v~~~~~~~ 128 (383) T protein:vir:10 58 SDVSSAHFKTE--NTAT----LNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQ---NLEHIPNSDVQINYLPGNM 128 (383) T ss_pred HhhccCceeec--ccch----hhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcC---ceeEeecCcceEEEEEcCC Confidence 54445555543 2222 223332 2 2233445677788899999877543 2233334444333332211 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) ... |......+... ..+.... |+ T Consensus 129 ---~~~-----~~~~~~~~~~~----~~~~~~e---------------------------------------------vi 151 (383) T protein:vir:10 129 ---GIV-----YTVLESNDRPK----MVLRQDQ---------------------------------------------ML 151 (383) T ss_pred ---ceE-----EEEEEcCCceE----EEEcccc---------------------------------------------eE Confidence 110 11000011100 0111222 23 Q ss_pred EecC-------CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh----hhhhhhc-------c Q lcl|NC_010179. 232 EFPK-------NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF----MNDLREY-------K 293 (469) Q Consensus 232 ~~~n-------~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~----~~~~~~~-------~ 293 (469) ||++ ...|.|.++.+...++....+..-..+.+...+.|-.++.-......++. ...+... + T Consensus 152 h~r~~~~~~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~ 231 (383) T protein:vir:10 152 HFRLMPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGR 231 (383) T ss_pred EeccCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCC Confidence 3331 12477888888888887777777777777777777555543222111111 1112111 1 Q ss_pred eeeecccCCCCCCcceEEeecCC--HHH-HHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
294 SIKINNAGNGDKSGVDKLQIDIP--VEA-RDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 294 ~~~~~~~~~~~~~~~~~l~~~~~--~~~-~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~ 368 (469) ++.++ ++++|-....+ ... +.+..+...+.|+..-++|+.-... .++.++..++. .. T Consensus 232 ~~vl~-------~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq-----------~~ 293 (383) T protein:vir:10 232 LMVLP-------DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQ-----------IK 293 (383) T ss_pred ccccC-------CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHH-----------HH Confidence 22221 23444333333 223 3456677788999998998743321 12222221111 11 Q ss_pred HHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_010179. 369 TYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDDWQQELKDLAK 446 (469) Q Consensus 369 ~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d~~~E~eri~~ 446 (469) ..|..+|.-+++.|...++.+=. ...+++.++.-+..|.++.++++.++ +|+++...+.+.++.-.-+..++ T Consensus 294 ~~~~~~l~P~~~~ie~~l~~~l~-~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d~----- 367 (383) T protein:vir:10 294 ATYLANLNSYVNPIVDELRLKMN-APDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLPDNL----- 367 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHhhC-CceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccCCcc----- Confidence 12223344444444433332111 12456666777788999999999887 68999988888775321110111 Q ss_pred HHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
447 DREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 447 E~~~~~~~~~~~~~~~~~~~~de 469 (469) + ..+....+.+|+||| T Consensus 368 ------~-~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 368 ------P-EFKPLTNETKGGDDK 383 (383) T ss_pred ------c-ccCCCcccCCCCCCC Confidence 0 011223355667777 No 204 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=94.42 E-value=0.0045 Score=33.47 Aligned_cols=393 Identities=11% Similarity=-0.011 Sum_probs=154.5 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |.|..+.+--.-....+...+ ......+||+ ++..+. ..++.--........|+..+..+. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~--------pp~~~~----------~La~~~~~n~~v~scI~~ia~~ia 66 (540) T protein:vir:41 6 LSIKSLEKYRAIKGDTDSQAL-KEDRFEEYVE--------PKVHPL----------VLLSLLQVNPYHASACSIKANDIL 66 (540) T ss_pred cChhhccchhhhhcccccccc-ccCCCCcccc--------CCCCHH----------HHHHHHHhcHHHHHHHHHHHHHHh Confidence 555443331100000000000 0000001110 000000 000000123455667788888888 Q ss_pred cCCeeeccCchhhHHHHHHHHhc---cHHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCce Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD---DRALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKL 156 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~---n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~ 156 (469) +-|..+...+...... +-+ +..+.+..+..+.+.+|.+|+.+..+.+|++ .+.+++|..+-+..+... 
T Consensus 67 ~~~~~i~~~~~~~~~~----lpN~~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~~~~~---- 138 (540) T protein:vir:41 67 RTGYLIDGDDGGVEEL----LRACRPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHRDGSR---- 138 (540) T ss_pred cCCceEecCccchhhh----ccCCCCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeEcCce---- Confidence 8888877665544332 222 2233445667788999999999988988886 488888888876554321 Q ss_pred EEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecCC Q lcl|NC_010179. 157 LGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKN 236 (469) Q Consensus 157 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~ 236 (469) ++...+ +.... ++..|....... .... .....+..=-|+|+++. T Consensus 139 -----~~~~~d--~~~~~-~~~~~~~~~~~~--~~~g--------------------------~~~~~~~~~eViHir~~ 182 (540) T protein:vir:41 139 -----YMQTWD--GIHVT-YFKDYRYEGEVN--PDNG--------------------------EDQDGVGANEIIFIHLP 182 (540) T ss_pred -----eEeeec--Cceee-eeecccccceee--cccc--------------------------ccceeecccceEEecCC Confidence 111111 11111 111111000000 0000 00001111124555432 Q ss_pred -----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEe--cCCcccc-----------hhhhhhh--------- Q lcl|NC_010179. 237 -----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT--NYGGASL-----------KQFMNDL--------- 289 (469) Q Consensus 237 -----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~--g~~~~~~-----------~~~~~~~--------- 289 (469) ..|.|.+......+.....+..-..+.++..+.|-.++. |.-.+.. ......+ T Consensus 183 ~~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 262 (540) T protein:vir:41 183 SPICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKE 262 (540) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccc Confidence 257777766666555555555455555566666755543 3211100 0011111 Q ss_pred hhcceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcc----ccCC-ccHHHHHHHHHHHHH Q lcl|NC_010179. 
290 REYKSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANF----ESSN-ASGVAIKMLYSHLEL 362 (469) Q Consensus 290 ~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~~g~-~Sg~Al~~~~~~l~~ 362 (469) ...+.+.+... .+...+++|..... ....+.+..+...+.|+..-++|+.-.. +.+| .+.+.....+. .. T Consensus 263 nag~~~vLe~~-~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~f~--~~ 339 (540) T protein:vir:41 263 APHTPLVFSIP-GGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRTYY--ES 339 (540) T ss_pred cccceEEEecC-CCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHHHH--HH Confidence 11222333211 11223455544333 3445666777788889998888874321 1112 12222211111 01 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC---CCH Q lcl|NC_010179. 363 KAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIV---DDW 437 (469) Q Consensus 363 k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v---~d~ 437 (469) ...-....+...+.+.+ + ...+ ..+.+.|+..-.... +.+..+.++ +|+++.-.+.+.++.+ +|+ T Consensus 340 tL~P~~~~ie~~ln~~L---~-----~~~~-~~~~i~f~~~~ll~~-D~~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~ 409 (540) T protein:vir:41 340 VVRPQQEIVSSVLTDFI---Q-----LKLD-PGARFVFNEEILMES-EFVHNYALLVQCGVLTPSEVREKLFGLDGGPDM 409 (540) T ss_pred HHHHHHHHHHHHHHHhh---h-----hccC-CceEEEecchhhcch-HHHHHHHHHHhCCCCCHHHHHHHhCcCcCCCcc Confidence 11111122222222211 1 1111 235667765433221 222223333 6889888887655322 222 Q ss_pred H--------HHHHHHHHHHHHhhh-----hHhhcccCCCCCCCCC Q lcl|NC_010179. 
438 Q--------QELKDLAKDREENDP-----YANQADELNGKGVDDE 469 (469) Q Consensus 438 ~--------~E~eri~~E~~~~~~-----~~~~~~~~~~~~~~de 469 (469) - .++..-.++.+...+ ...+.+....+...++ T Consensus 410 ~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~ 454 (540) T protein:vir:41 410 FMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSE 454 (540) T ss_pred cccccccccccccccccccCCCCccccccccchhcccccCccccc Confidence 1 111110000000000 0000000001111011 No 205 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=94.32 E-value=0.0048 Score=33.32 Aligned_cols=385 Identities=12% Similarity=0.057 Sum_probs=157.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCC-cccccccc------hhhhcccccccccc-cCcce--eccchHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKT-DITTRNNG------KPKVSKEGKKDPLR-SADNR--IPSNFYQL 70 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~-~i~~~~~~------~~~~~~~~~~~~~~-~~~~r--i~~n~~k~ 70 (469) |- .+.+.+...+ ........ .......+.....+ ..... +.+.=... T Consensus 1 Mg-----------------------~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~v~~ 57 (457) T protein:vir:62 1 MG-----------------------FWSALFGRGHSPALDAAEGRAWEPYDPSIYNLGATASSGERVTPHDALQVSAVFA 57 (457) T ss_pred Cc-----------------------hhhhhhccccccccccccccccccchhhhhhccccccCCceechHHhhccHHHHH Confidence 11 1111111000 00000000 00000000000000 00000 00111112 Q ss_pred HHHHHHHhhhcCCeeeccCch-----hhHHHHHHHHhc-c-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEE Q lcl|NC_010179. 71 LVDQEAGYIASVFPDIDVGKD-----ADNKKILDVLGD-D-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGII 139 (469) Q Consensus 71 iv~~~~~~l~g~p~~~~~~~~-----~~~~~l~~~~~~-n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~ 139 (469) .|+..+.-+-+-|+.+-...+ .....+..++.. | . 
.+.+..+..+++.+|.+|+.+..+ .|++ .+.++ T Consensus 58 ~i~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~-~g~~~~l~~l 136 (457) T protein:vir:62 58 SVRLLSETIATLPLSTYSKRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWA-GPNIAGLDVL 136 (457) T ss_pred HHHHHHHhHhhCceEEEEecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEE Confidence 344444444455665421111 112234444432 2 2 233456677889999999888544 4555 57888 Q ss_pred ccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccc Q lcl|NC_010179. 140 QPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSN 219 (469) Q Consensus 140 ~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (469) +|..+.+..+...... ....+.|... ..+... ....++...+.+++.... T Consensus 137 ~p~~v~v~~~~~~~~~-~~~~~~y~~~-~~g~~~--~~~~~~~~eiih~r~~~~-------------------------- 186 (457) T protein:vir:62 137 DPTKIHVHMVMVDGLR-RKVFEAYDID-ADGNEV--LLGWFTPRDVLHIPGMML-------------------------- 186 (457) T ss_pred cCcceEEEEeccCCcc-ceeEEEEEEc-cCCcee--EEEeeCccceEEecCCCC-------------------------- Confidence 8988877654322111 1111122211 122211 122233344433321100 Q ss_pred cccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh--------- Q lcl|NC_010179. 220 TLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR--------- 290 (469) Q Consensus 220 ~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~--------- 290 (469) . ..-.|.|-++.+...|.....+..-..+.++..+.|-.+++-.. 
...++....++ T Consensus 187 -----~---------~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~ls~e~~~~~~~~~~~~~~G 251 (457) T protein:vir:62 187 -----P---------GDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPG-TMSEEGLARAREAWRAANSG 251 (457) T ss_pred -----C---------CceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCC-CCCHHHHHHHHHHHHHHhcC Confidence 0 01246777777777777666666666666777777766655422 21222211111 Q ss_pred ---hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 ---EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAA 365 (469) Q Consensus 291 ---~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~ 365 (469) ..+++.++. +.+++-++.+.....+.+..+.....|+..-++|+.-... .++.++..++-... T Consensus 252 ~~nag~~~vl~~-----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~------- 319 (457) T protein:vir:62 252 VDNAHRVALLTE-----GAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI------- 319 (457) T ss_pred ccccCcceecCC-----CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH------- Confidence 011222321 1233334333333455666677888899988998753322 22222322222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcc-----cCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCC Q lcl|NC_010179. 366 KTQTYFEHAINELVRAIMRYLNF-----SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDD 436 (469) Q Consensus 366 ~~~~~~~~~l~~~~~~i~~~~~~-----~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d 436 (469) ..+..+|.-+++.+...++. 
.+.....+++.++.-+-.|.++.++++.++ +|+++.-.+.+++++ +++ T Consensus 320 ---~f~~~~l~P~~~~ie~~ln~~L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~pi~~ 396 (457) T protein:vir:62 320 ---AFTMFSLRPWLERIEAGFNRLLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRAAEDMTPLPD 396 (457) T ss_pred ---HHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 12222333333333333321 111222344444555667999999999887 688999998887653 333 Q ss_pred H--HHH-----HHHHHHH---HHHh-----hhh----HhhcccCCCCCCCCC Q lcl|NC_010179. 437 W--QQE-----LKDLAKD---REEN-----DPY----ANQADELNGKGVDDE 469 (469) Q Consensus 437 ~--~~E-----~eri~~E---~~~~-----~~~----~~~~~~~~~~~~~de 469 (469) . +.- +..+... +... .+. .+..++.+.++..|| T Consensus 397 g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 448 (457) T protein:vir:62 397 GLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDE 448 (457) T ss_pred CCcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCcc Confidence 2 111 1111110 0000 000 001111111122122 No 206 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=365 Identities=13% Similarity=0.044 Sum_probs=160.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+....-.+++. .. .+. .|+- . . ......-.+.+....+. 
T Consensus 41 ~~~~~~~~iLr~-----~~---~~~----ly~~----------------------m-----~-~D~hi~s~l~~Rk~av~ 80 (448) T protein:vir:79 41 VVDREFDELLQG-----KD---GLL----VYHK----------------------M-----L-SDGTVKNALNYIFGRIR 80 (448) T ss_pred ccccchhHhhcc-----cc---chH----HHHH----------------------H-----h-hChHHHHHHHHHHHHHh Confidence 222211111110 00 000 0100 0 0 12444445566667778 Q ss_pred cCCeeeccCchh-----hHHHHHHHHhcc-------HHHHHHHHHHHHHhCCeEEE-EEEE-cCCCceEEE---EEccce Q lcl|NC_010179. 81 SVFPDIDVGKDA-----DNKKILDVLGDD-------RALTLNSLLVDSSNAGRAWL-HYWI-DEDNNFRYG---IIQPDQ 143 (469) Q Consensus 81 g~p~~~~~~~~~-----~~~~l~~~~~~n-------~~~~~~~~~~~~~~~G~~~~-~v~~-d~~~~~~i~---~~~p~~ 143 (469) |.+..+.+.+++ ..+.+.+++... ....+..-..++.-+|.++. ++|. ..+|...+. ..++.. T Consensus 81 ~~~w~v~p~~~~~~~~~~ae~v~~~l~~~~~~~~~~~f~~~~~~~lda~~~G~s~~Eivw~~~~~g~~~~~~l~~r~~~~ 160 (448) T protein:vir:79 81 SAKWYVEPASTDPEDIAIAAFIHAQLGIDDASVGKYPFGRLFAIYENAYIYGMAAGEIVLTLGADGKLILDKIVPIHPFN 160 (448) T ss_pred cCCceEecCCCCHHHHHHHHHHHHHhhhhhhhhccCCHHHHHHHHHHhhhhcceeEEEEeeecCCCceecccccccCCcc Confidence 888888643222 223444444321 11223344556788897664 5553 345654322 222221 Q ss_pred e-EEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccc Q lcl|NC_010179. 144 I-TPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (469) Q Consensus 144 ~-~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (469) . ...|++ ++...... ..+... ..... .+..+ T Consensus 161 ~~~f~~~~------------------d~~l~~~~--------------~~~~~~--------------~~~~~--~~~~~ 192 (448) T protein:vir:79 161 IDEVLYDE------------------EGGPKALK--------------LSGEVK--------------GGSQF--VSGLE 192 (448) T ss_pred ccceeeec------------------CCceEEee--------------cCCccc--------------ccccC--CCccc Confidence 0 111221 11110000 000000 00000 00111 Q ss_pred ccCCcccEEEecC----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc-hhh-------hhhhh Q lcl|NC_010179. 
223 HNFGRVPFIEFPK----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL-KQF-------MNDLR 290 (469) Q Consensus 223 ~~~g~vPvv~~~n----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~-~~~-------~~~~~ 290 (469) -+++++ +++.. ++.|.|.+..+-...--=+..+.+++.-++.++.|+.+.+...+... .+. ...+. T Consensus 193 lP~~~~--i~~~~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga~~~~~~~~~l~~av~~i~ 270 (448) T protein:vir:79 193 IPIWKT--VVFLHNDDGSFTGQSALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSVRQGTKQWEAAKEIVKNFV 270 (448) T ss_pred cccceE--EEEecCccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCCCcCHHHHHHHHHHHHHHh Confidence 233333 22222 45678888887777777788889999999999999998775433221 111 11111 Q ss_pred --hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 --EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 291 --~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~ 368 (469) ......++.+ ..+++++...+...+...++.+.+.|.+..-+-.++.++.|+.+..+......-....++.-. T Consensus 271 ~g~~a~~iiP~~-----~~ie~~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs~~~~g~~~~~~~~~~~v~~~~~~aDa 345 (448) T protein:vir:79 271 QKPRHGIILPDD-----WKFDTVDLKSAMPDAIPYLTYHDAGIARALGIDFNTVQLNMGVQAINIGEFVSLTQQTIISLQ 345 (448) T ss_pred cCCceEEEecCC-----ceEEEEecCCCcccHHHHHHHHHHHHHHHHhhhhhccccccchhhhhhhhHHHHHHHHHHHHH Confidence 2223334433 468888876665556678888888887755443333333222222222211111122223344 Q ss_pred HHHHHHHH-HHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHHhccCCh-HHHHHhCCCCCCHHHHHHHHHH Q lcl|NC_010179. 369 TYFEHAIN-ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVANYSSK-EAVAKANPIVDDWQQELKDLAK 446 (469) Q Consensus 369 ~~~~~~l~-~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~iS~-et~~~~l~~v~d~~~E~eri~~ 446 (469) +.+...+. +++.-++.+ |.. ....-..+.|...-+.|.+..++.+.+++++... 
+........+.++...- T Consensus 346 ~~i~~tln~~li~~l~~l-Nfg-~~~~~P~~~f~~~e~~Dl~~~a~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~----- 418 (448) T protein:vir:79 346 REFASAVNLYLIPKLVLP-NWP-SATRFPRLTFEMEERNDFSAAANLMGMLINAVKDSEDIPTELKALIDALPSK----- 418 (448) T ss_pred HHHHHHHHHHHHHHHHHh-cCC-CcCCCcEEEecCCChHHHHHHHHHhhhhhccchhhHHHHHHhhcCCCCCCCc----- Confidence 55566664 466655553 322 1222357888888888989999999888765322 22222221222221100 Q ss_pred HHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 447 DREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 447 E~~~~~~~~~~~~~~~~~~~~de 469 (469) +.+ ..+...+..+ ...+-.|. T Consensus 419 ~~~-a~~~~~~~~~-~~~~~~~~ 439 (448) T protein:vir:79 419 MRR-ALGVVDEVRE-AVRQPADS 439 (448) T ss_pred ccc-ccCCCCcccc-cccCCccc Confidence 000 0000000000 11111111 No 207 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=93.88 E-value=0.0061 Score=32.73 Aligned_cols=416 Identities=9% Similarity=0.036 Sum_probs=188.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) ++.+.+++..+.+..++..-..+.+.+.+|.... +..+. ..+....|+-.+-+..-++..++-|. T Consensus 11 ~~~~~l~~r~~~L~~~R~~~e~~w~e~a~~~lP~--~~~~~-------------~~~~~~~~~~dstg~~a~~~LAa~l~ 75 (516) T protein:vir:10 11 GKRSKIPKLWEKFSTKRSSFLDRAKHYSKLTLPY--LMNDK-------------GDNETSQNGWQGVGAQATNHLANKLA 75 (516) T ss_pred hHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc--ccCCC-------------CCcccccccccchHHHHHHHHHHHHH Confidence 5666777777777666666566667777776541 11110 00111223445666666777666554 Q ss_pred cC--Cee-----eccCch---------hhHHHHHH-----------HH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCC Q lcl|NC_010179. 
81 SV--FPD-----IDVGKD---------ADNKKILD-----------VL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDN 132 (469) Q Consensus 81 g~--p~~-----~~~~~~---------~~~~~l~~-----------~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~ 132 (469) |- ||. +...+. .....++. .+ ..||...+.++.++...+|.+. +|.++++ T Consensus 76 ~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~--l~~d~~~ 153 (516) T protein:vir:10 76 QVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM--LYKPSKG 153 (516) T ss_pred hhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe--EEecCCC Confidence 31 221 222221 11112222 22 2366677888889999999985 5678777 Q ss_pred ceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCC----------------ceEEEEEEEEcCCeEEEEEeecCcee Q lcl|NC_010179. 133 NFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEA----------------GKYFTVHEYWTDKEAQFFRTSATDST 196 (469) Q Consensus 133 ~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (469) .++ .++-.++++.-|. .+++...++..+.....- .+-...+++|+. .......+. T Consensus 154 ~~~--~~pl~~y~v~~d~--~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~-----v~~~~~~~~ 224 (516) T protein:vir:10 154 AIS--AIPMHHYVVNRDT--NGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTH-----AKYLGEGFW 224 (516) T ss_pred CeE--EEEcCeEEEeeCC--CCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEE-----EEecCCCce Confidence 654 4444554444443 345655555443211100 000111222211 001111111 Q ss_pred ecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCce Q lcl|NC_010179. 197 VIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVI 271 (469) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~ 271 (469) .+ +....+..... ....+|..+|++.++ .+.+|.|-.++..+-+..+|.+.-...........|. 
T Consensus 225 ~~--------~~~~d~~~~~~--~s~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~L~D~k~L~~l~~~~l~~~~~a~~~~ 294 (516) T protein:vir:10 225 EL--------KQSADDIPVGK--VSKIKSEKLPFIPLTWKRSYGEDWGRPLAEDYSGDLFVIQFLSEAVARGAALMADIK 294 (516) T ss_pred EE--------EEeeCceeecc--ccccccccCCeeeeeeeecCCCCcccchHHHhhHHHHHHHHHHHHHHHHHHHhcCCC Confidence 00 00001111111 112345567777665 3457999889999999999988877888788888776 Q ss_pred eEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCcc Q lcl|NC_010179. 272 LVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNAS 349 (469) Q Consensus 272 l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~S 349 (469) +.+.-.+... ...+...+.-.+.++ ...++..++ ...+.......++.++..|-..-....+..-...+.| T Consensus 295 ~lv~p~g~~~----~~~l~~~~~g~~~~g---~~~~v~~~q~~~~~d~~~~~~~i~~~~~rI~~af~~~~l~~rd~~rvT 367 (516) T protein:vir:10 295 YLIRPGAQTD----VDHFVNSGTGEVVTG---VEEDIHIVQLGKYADLTPISAVLEVYTRRIGVVFMMETMTRRDAERVT 367 (516) T ss_pred cccCcccccc----hhhhccCCCceeecC---CcccceeeecCcccchHHHHHHHHHHHHHHHHHHhhhhhhccCCcccc Confidence 5543111111 111112221111122 123345544 3335677777788777777543222111111222345 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHhcc--cCCCcccceEEeCCCCCCCHHHHH---HHH---- Q lcl|NC_010179. 350 GVAIKMLYSHLELKAAKTQTYFEHAINELVR-----AIMRYLNF--SDADKRHISQHWTRTKVEDSLTKA---QIV---- 415 (469) Q Consensus 350 g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~-----~i~~~~~~--~~~~~~~i~i~f~~~~p~d~~e~~---~~~---- 415 (469) +..+. .++.+++..++..+.++-. +|.+.+.. ......-+.+.... 
+.+....+ +.+ T Consensus 368 AtEV~-------~r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~~~~p~~P~~lv~~~~v~--~i~~L~raq~~~~i~~~~ 438 (516) T protein:vir:10 368 AVEIQ-------RDALEIEQNMGGVYSLFATTMQSPVAMWGLLEAGDSFTSDLVDPVIIT--GIEALGRMAELDKLANFA 438 (516) T ss_pred HHHHH-------HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhCCCCChhhcCcceeh--hHHHHHHHHHHHHHHHHH Confidence 55443 3556777777777766422 11111111 11111111221111 11222211 111 Q ss_pred ---HHHhcc-------CCh----HHHHHhCC----CCCCHHHHHHHHHHHHHHhhh---hHhhcccCCCCCCCCC Q lcl|NC_010179. 416 ---STVANY-------SSK----EAVAKANP----IVDDWQQELKDLAKDREENDP---YANQADELNGKGVDDE 469 (469) Q Consensus 416 ---~kl~g~-------iS~----et~~~~l~----~v~d~~~E~eri~~E~~~~~~---~~~~~~~~~~~~~~de 469 (469) ..++++ +.. +.+...++ .+ -.++|++.+++++.+... ..++.....+++.-+| T Consensus 439 q~i~~~~q~~p~v~d~id~d~~~~~~a~~~gvp~~~i-rs~eev~~~r~~~~~~q~~~~~~~~~~~~~~~~~~~~ 512 (516) T protein:vir:10 439 QYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFL-KSAEEMEQEQEAQMQAQQAQMLEEGVAKAVPGVIQQE 512 (516) T ss_pred HHHHHHhcCChHHHhhcCHHHHHHHHHHHhCCChhcc-CCHHHHHHHHHHHHHHHHHHHHHHHhhhcccchhhhh Confidence 111111 111 22222222 11 124566666555433322 2344555555566666 No 208 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=93.69 E-value=0.0067 Score=32.50 Aligned_cols=396 Identities=11% Similarity=-0.002 Sum_probs=163.7 Q ss_pred CC---HH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhc-c-CCccccccc--chhhhcccccccccccCcceeccchH Q lcl|NC_010179. 1 ME---LD-----ALKKLIRNTSTSRNDLINNYKKSVDYYE-N-KTDITTRNN--GKPKVSKEGKKDPLRSADNRIPSNFY 68 (469) Q Consensus 1 ~~---~~-----~~~~~i~~~~~~~~~~~~~~~~~~~Yy~-g-~~~i~~~~~--~~~~~~~~~~~~~~~~~~~ri~~n~~ 68 (469) |+ -. .+-++...-... .+ +.|+ - ..+.+..+. ..+... . -..... 
T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~------~~----~~~~~~e~~~~lr~~~~~~ly~~m-----~--------e~D~~i 57 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVD------GW----TVWDPFEQTPELQWPQSVAVYSRM-----D--------NEDSRV 57 (469) T ss_pred CCCcccCCCCccchhhhhhccccc------ch----hhccccccccccccccchHHHHHH-----H--------hhChHH Confidence 11 10 111111100000 00 0000 0 000010000 000000 0 002344 Q ss_pred HHHHHHHHHhhhcCCeeeccCc--hhhHHHHHHHHh------------------ccHHHHHHHHHHHHHhCCeEEE-EEE Q lcl|NC_010179. 69 QLLVDQEAGYIASVFPDIDVGK--DADNKKILDVLG------------------DDRALTLNSLLVDSSNAGRAWL-HYW 127 (469) Q Consensus 69 k~iv~~~~~~l~g~p~~~~~~~--~~~~~~l~~~~~------------------~n~~~~~~~~~~~~~~~G~~~~-~v~ 127 (469) .-.+.+....+.|-+.++...+ ++.-+++.+.+. ..+.+.+.++...+.-+|.++. ++| T Consensus 58 ~s~l~~rk~av~~~~w~v~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw 137 (469) T protein:vir:10 58 TSLLEAISLPIRSTPWRIRANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVY 137 (469) T ss_pred HHHHHHHHHHHhcCCceEecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeee Confidence 4445555556666666665422 222222322221 1234455566666777897655 666 Q ss_pred EcC----CCceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccc Q lcl|NC_010179. 128 IDE----DNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNI 203 (469) Q Consensus 128 ~d~----~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (469) ... +|.+.+.-+.++. .+. +.+|... .++.... +.-.....-....... T Consensus 138 ~~~~~~~dG~~~~~~l~~rp----------~~~---i~~~~~~-~~~~l~~--~~~~~~~~~~~~~~~~----------- 190 (469) T protein:vir:10 138 RPRNQSPDGRFWLRKLAPRP----------QWT---ISKFNVA-PDGGLES--IEQIAPPARTRGSLYV----------- 190 (469) T ss_pred ecccccCCCceeeeeeeecC----------ccc---ceeeeec-cCCceee--eeecCccccccccccc----------- Confidence 422 3444332221110 000 0011111 1111100 0000000000000000 Q ss_pred cccccccccccccccccccccCCcccEEEec--CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCccc Q lcl|NC_010179. 
204 ITSYDLSAGYETGQSNTLKHNFGRVPFIEFP--KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGAS 281 (469) Q Consensus 204 ~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~--n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~ 281 (469) ....+....+.+.|-.++-. .++.|.|.+..+-...--=+..+..++.-++.++.|+++.+...+.. T Consensus 191 -----------~~~~~~~lp~~k~i~~~~~~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~~a~ 259 (469) T protein:vir:10 191 -----------ANIAPPEIPVNRLVVYTRNKRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASSATD 259 (469) T ss_pred -----------CCCCccccccCcEEEEEecCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCCCCC Confidence 00000000111222222221 35678888888877777777788899999999999998877543322 Q ss_pred chhh------hhhhh--hcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-CC-ccHH Q lcl|NC_010179. 282 LKQF------MNDLR--EYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES-SN-ASGV 351 (469) Q Consensus 282 ~~~~------~~~~~--~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~-~Sg~ 351 (469) ..+- ...+. ....+.++. +..+++++...+...+...++.+.+.|.+..-+..++.++. |. ..|. T Consensus 260 ~~ek~~l~~a~~~~~~g~~a~~iip~-----~~~ie~~ea~g~~~~~~~li~~~d~~Isk~iLG~tlTs~~~gGS~a~~~ 334 (469) T protein:vir:10 260 EDEVRKMAALARSVRGGINAGVGLAQ-----GQILELLGVSGNLPDIRRAIEGHDRSIALSGLAHFLNLDGKGGSYALAS 334 (469) T ss_pred HHHHHHHHHHHHHHhcCCceEEEccC-----CceEEEeecCCCchHHHHHHHHHHHHHHHHHhcccccccCccchhhHHH Confidence 2211 11111 122333433 34689998888888899999999999988665555444322 22 2232 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hcc-----CC Q lcl|NC_010179. 352 AIKMLYSHLELKAAKTQTYFEHAIN-ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANY-----SS 423 (469) Q Consensus 352 Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~-----iS 423 (469) .-.-. ....++.-.+.+...+. ++++-++.+ |. +.+.....+.|... 
..+.+..++.++++ .|+ ++ T Consensus 335 vh~ev---~~d~~~sDa~~i~~tln~~li~~l~~l-N~-g~~~~~P~~~~~~~-e~~~~~~a~~i~~l~~~G~~~~~~~~ 408 (469) T protein:vir:10 335 VLEDP---FTQAVHAYATSICRIANQHIIEDLVDI-NF-GVDTPAPVLTFDPI-GSRQDLTAAAVKLLYDAGVFDDDPAV 408 (469) T ss_pred HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh-cC-CCCCCccEEEecCC-CCcHHHHHHHHHHHHhcCCccCcccc Confidence 22211 22233444556667774 466666653 32 22222346677543 34556678888776 455 44 Q ss_pred hHHHHHhCCCCCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 424 KEAVAKANPIVDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 424 ~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+.+.+.++. +.++.+-.-+..++....+..............++ T Consensus 409 ~~~~~e~~gi-p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 453 (469) T protein:vir:10 409 KRAIRQRFNL-PSELNDTPSAEPEEPAAVPNQSAAPARTRSSGNAD 453 (469) T ss_pred HHHHHHHhCC-CCCCCCcccccchhcccCCCCCccccccCCCCCcc Confidence 5566677653 33322211111111111111111111111011111 No 209 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=93.23 E-value=0.0083 Score=31.99 Aligned_cols=368 Identities=11% Similarity=0.015 Sum_probs=156.0 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccC--cceeccchHHHHHHHHHHhhhcCCeeeccC Q lcl|NC_010179. 12 NTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSA--DNRIPSNFYQLLVDQEAGYIASVFPDIDVG 89 (469) Q Consensus 12 ~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~--~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~ 89 (469) .|..+...+.. +.. ............+.......... ..-+.+.-....|+..+.-+-+-|+.+-.. 
T Consensus 1 ~~~~r~~~~~~----------~~~-~~~~~~~~~~~~g~~~s~~~~~vt~~~al~~~~v~~~v~~ia~~iA~lp~~~~~~ 69 (419) T protein:vir:14 1 MFFSRQLLSNL----------GQT-QMSAGGWVSALLGSSRSDSGQVVTPASALALTVLQNCVTLLAESIAQLPIELYER 69 (419) T ss_pred Ccccccccccc----------ccc-ccCcchhhHHhhcCCCccCCcccchHHhhccHHHHHHHHHHHHhhccCceEEEEe Confidence 11000000000 000 00000000000000000000000 000112223345555555555556654211 Q ss_pred ch-----hhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCceE Q lcl|NC_010179. 90 KD-----ADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKLL 157 (469) Q Consensus 90 ~~-----~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~ 157 (469) ++ .....+..++.. | .+ +....+......+|.+|+++-.+.+|++. +.+++|..+.+..+... .+. T Consensus 70 ~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~~~~~--~~~ 147 (419) T protein:vir:14 70 SGEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMRGSDL--KPV 147 (419) T ss_pred cCCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEECCCc--eEE Confidence 11 112234555542 3 22 22345677889999999999888888864 88899998887665421 111 Q ss_pred EEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec--- Q lcl|NC_010179. 158 GVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP--- 234 (469) Q Consensus 158 ~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~--- 234 (469) |.....+ . .....+ ++++ T Consensus 148 -----y~~~~~~---~------~~~~~i---------------------------------------------~h~~~~~ 168 (419) T protein:vir:14 148 -----YRVRGSD---P------MPQRLV---------------------------------------------HHVRWMS 168 (419) T ss_pred -----EEEccCc---c------cchhhe---------------------------------------------eEecCcC Confidence 1111000 0 000111 1111 Q ss_pred -CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC---cccchhhhhhhhh------------cceeeec Q lcl|NC_010179. 
235 -KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG---GASLKQFMNDLRE------------YKSIKIN 298 (469) Q Consensus 235 -n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~---~~~~~~~~~~~~~------------~~~~~~~ 298 (469) +.-.|.|.++-+...++....+..-..+.++..+.|-.+++-.. ....++....++. .+++.++ T Consensus 169 ~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~ 248 (419) T protein:vir:14 169 INGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQ 248 (419) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecC Confidence 12247777777777776666665555666666777766654321 1111222221211 1223332 Q ss_pred ccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 299 NAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAIN 376 (469) Q Consensus 299 ~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~ 376 (469) .+.+|.....+ ...+.+..+...+.|+..-++|+.-.......+...++... ...+...|. T Consensus 249 -------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~----------~~f~~~~L~ 311 (419) T protein:vir:14 249 -------EGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQS----------LQFVIYTLL 311 (419) T ss_pred -------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHH----------HHHHHHHHH Confidence 22444443332 23445556667788988888886433222111211121111 223334444 Q ss_pred HHHHHHHHHhccc---CCCcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHH Q lcl|NC_010179. 377 ELVRAIMRYLNFS---DADKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKD 447 (469) Q Consensus 377 ~~~~~i~~~~~~~---~~~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E 447 (469) -.++.|...++.+ ..+.....+.| +.-+..|.++.++++.++ +|+++.-.+.++++. 
+++-+.-+-...-- T Consensus 312 P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~gGD~~~~~~n~~ 391 (419) T protein:vir:14 312 PWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGGDIYLSPMNMV 391 (419) T ss_pred HHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcCeeeeccccc Confidence 4444444433321 11112234455 455567899999999887 689999888888653 22211111000000 Q ss_pred HHHhhhhHhh-cccCCCCCCCCC Q lcl|NC_010179. 448 REENDPYANQ-ADELNGKGVDDE 469 (469) Q Consensus 448 ~~~~~~~~~~-~~~~~~~~~~de 469 (469) ....+...+ ..+.+..+..+| T Consensus 392 -~~~~~~~~~~~~~~~~~~~~~e 413 (419) T protein:vir:14 392 -DASKPQQLPVGKSEPTKAAIDE 413 (419) T ss_pred -cccccccccCCCCCCccccccc Confidence 000110001 112222333333 No 210 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=93.23 E-value=0.0083 Score=31.99 Aligned_cols=368 Identities=14% Similarity=0.091 Sum_probs=150.4 Q ss_pred CCHHHHHHHHH------HHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIR------NTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQ 74 (469) Q Consensus 1 ~~~~~~~~~i~------~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~ 74 (469) ++++++..+.. ..+.-.......+-. +.. T Consensus 70 ~~~~~l~~~~~~~~~~~~~i~t~~~~va~~~~---------------------------------------------i~~ 104 (535) T protein:vir:10 70 LSTKKLLKAYADNDIVQAIIRTRTNQVLTYSN---------------------------------------------PSR 104 (535) T ss_pred cCHHHHHHHhccChhHHHHHHHHHHHHHHHHH---------------------------------------------HHH Confidence 45544433211 111111111111111 111 Q ss_pred HHHhhhcCCeeeccC-------chhhHHHHHHHHhc--c-------HH-HHHHHHHHHHHhCC-eEEEEEEEcCCCceE- Q lcl|NC_010179. 
75 EAGYIASVFPDIDVG-------KDADNKKILDVLGD--D-------RA-LTLNSLLVDSSNAG-RAWLHYWIDEDNNFR- 135 (469) Q Consensus 75 ~~~~l~g~p~~~~~~-------~~~~~~~l~~~~~~--n-------~~-~~~~~~~~~~~~~G-~~~~~v~~d~~~~~~- 135 (469) .+.-+.|-|+.+.-. .......+..++.. | +. ..+..+..+++.+| .+|+.+..+..|++. T Consensus 105 ~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~ 184 (535) T protein:vir:10 105 YNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDTFPRLLTKIINDMYVQDQINIERIFKNDSNELDH 184 (535) T ss_pred HhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHHHHHHHHHHHHHHHhhCCceEEEEEECCCCcEEE Confidence 111111112211100 00011122233321 1 11 12334455556554 689888888889875 Q ss_pred EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccc Q lcl|NC_010179. 136 YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYET 215 (469) Q Consensus 136 i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (469) +.+++|..+.+..++....... .+|...+ +... ..+....+.+++..... T Consensus 185 L~~l~p~~V~v~~d~~~~~~~~---~~~~~~~--~~~~----~~~~~~eiih~~~~~~~--------------------- 234 (535) T protein:vir:10 185 FNAVDASKVVISYSPRSKDQPR---KFEQFVS--ETKS----VKFSERNLTFINYWNLS--------------------- 234 (535) T ss_pred EEEeCCceeEEEEcCccccCce---EEEEEec--Ccee----EEECcccEEEEeccCCC--------------------- Confidence 8999999999887754322111 1121111 1111 11223333332211000 Q ss_pred cccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEe--cCC-cccchhhhhhhh-- Q lcl|NC_010179. 216 GQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT--NYG-GASLKQFMNDLR-- 290 (469) Q Consensus 216 ~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~--g~~-~~~~~~~~~~~~-- 290 (469) .......|.|.++-+...|.....+..-..+.+...+.|-.++. +.. 
....++....++ T Consensus 235 ----------------~~~~~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~ 298 (535) T protein:vir:10 235 ----------------DTDRRGYGYSPVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQ 298 (535) T ss_pred ----------------CcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHH Confidence 00001236777777777777666666666666666677754443 321 112222222221 Q ss_pred --h--------cceeeecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCcc--c---cCCccHHHH Q lcl|NC_010179. 291 --E--------YKSIKINNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANF--E---SSNASGVAI 353 (469) Q Consensus 291 --~--------~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~--~---~g~~Sg~Al 353 (469) . .++..+. +.+++|.....+ ...+.+..+...+.|...-++|+.-.. . .+|.++... T Consensus 299 ~~~~~~G~~nag~~~vl~------~~g~~~~~l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~ 372 (535) T protein:vir:10 299 WTSQGSGLGGAWKIPILA------AKDAKFVNMTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKS 372 (535) T ss_pred HHHHhcCccccccccccc------CCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhh Confidence 1 0111111 123555554443 345556667778888888888874322 1 122222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCCCCCCHHHHHHHHHHH-hccCChHHHHH Q lcl|NC_010179. 354 KMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRTKVEDSLTKAQIVSTV-ANYSSKEAVAK 429 (469) Q Consensus 354 ~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~~p~d~~e~~~~~~kl-~g~iS~et~~~ 429 (469) ..-.+.+. ......+..+|.-+++.+...++.+ ..+ ..+.+.|+.....|.++.+++.... 
+|.++.-.+.+ T Consensus 373 ~~~~s~~E---~~~~~~~~~~L~P~l~~ie~~ln~~Ll~~~~-~~~~f~f~~l~~~d~~~r~~~~~~~~~g~lT~NE~R~ 448 (535) T protein:vir:10 373 VNEGSTAK---AKLESSKDKGLTPLLSFIEQVINDKIMRYVD-TDYRFSFTLGDAQDKLQEEQVWKLKLANGYFINEYRK 448 (535) T ss_pred hhhhhhHH---HHHHHHHHHHHHHHHHHHHHHHhhhcccccC-CeEEEEeccccccCHHHHHHHHHHHHcCCCCHHHHHH Confidence 11111111 1222233445555555555444432 122 2467888887888888777765443 56788888888 Q ss_pred hCCC--CCCHHHHHHHHHHHH--------HHhhhhH----------hhcccCC------CCCCCC-C Q lcl|NC_010179. 430 ANPI--VDDWQQELKDLAKDR--------EENDPYA----------NQADELN------GKGVDD-E 469 (469) Q Consensus 430 ~l~~--v~d~~~E~eri~~E~--------~~~~~~~----------~~~~~~~------~~~~~d-e 469 (469) +++. ++.-+.-+-.+..+. +...+.. +..++.. ..|.+| + T Consensus 449 ~~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~ 515 (535) T protein:vir:10 449 DHGLKTVDGLDVPGFIGSAENFINATGFGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPK 515 (535) T ss_pred HhCCCCCCCccccccccchhhcccccccccccCCCCCCCccccCCccccCcccccccccccCCCCCC Confidence 7643 221111000010000 0000000 0000000 011111 1 No 211 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=93.03 E-value=0.009 Score=31.79 Aligned_cols=384 Identities=9% Similarity=0.023 Sum_probs=154.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccc--cchh----hhcccccccccccC-c--ceeccchHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRN--NGKP----KVSKEGKKDPLRSA-D--NRIPSNFYQLL 71 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~--~~~~----~~~~~~~~~~~~~~-~--~ri~~n~~k~i 71 (469) |.-.-+ ..-..+.+..+.....+.... .... ...........+.. + .-+..+-.... 
T Consensus 1 ~~~~~~--------------mg~f~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:81 1 MPDEKK--------------LGLFGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CCchhh--------------cchhhhhhhhcccccccccccccccccCccchhhhcccccccCcccchHhhhccHHHHHH Confidence 222211 111222222332211100000 0000 00000000000000 0 00011112224 Q ss_pred HHHHHHhhhcCCeee-c-cCc---hhhHHHHHHHHhc--cH-H---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEE Q lcl|NC_010179. 72 VDQEAGYIASVFPDI-D-VGK---DADNKKILDVLGD--DR-A---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGII 139 (469) Q Consensus 72 v~~~~~~l~g~p~~~-~-~~~---~~~~~~l~~~~~~--n~-~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~ 139 (469) |+..++-+-+-|+.+ . ..+ ......+..++.. |. + +....+..+++.+|.+|+++..+ +|++ .+.++ T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~-~g~~~~L~~l 145 (432) T protein:vir:81 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYL 145 (432) T ss_pred HHHHHHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEE Confidence 445555455556553 1 111 1122334555532 32 2 23345667889999999888765 4665 47889 Q ss_pred ccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccc Q lcl|NC_010179. 140 QPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSN 219 (469) Q Consensus 140 ~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (469) +|..+.+..++.. ++. |.....+|... .+....+.+++.- T Consensus 146 ~~~~v~v~~~~~g--~~~-----y~~~~~~g~~~-----~~~~~~iih~r~~---------------------------- 185 (432) T protein:vir:81 146 ANDRLTITTDPKG--NTA-----YRYRRTDGQMI-----DIPKQQIWKIMGY---------------------------- 185 (432) T ss_pred cCCceEEEECCCC--cEE-----EEEEecCceEE-----EEccccEEEecCC---------------------------- Confidence 9999888876532 221 21111122110 1222222222110 Q ss_pred cccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh--------h Q lcl|NC_010179. 
220 TLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR--------E 291 (469) Q Consensus 220 ~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~--------~ 291 (469) | .+.-.|.|-+..+...|+.......-..+.+...+.|-.++.-.. ...++....++ . T Consensus 186 ---------~----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~-~l~~e~~~~~~~~~~~~~na 251 (432) T protein:vir:81 186 ---------S----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR-FLTDDQYDSFAKKVSGSVEA 251 (432) T ss_pred ---------C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC-CCCHHHHHHHHHHHhhhhcC Confidence 0 011235666665555555554444444455555566655544321 11112211111 1 Q ss_pred cceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CC-ccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 292 YKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFES--SN-ASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~-~Sg~Al~~~~~~l~~k~~~~~ 368 (469) .+++.++.+ .+++-++.+.....+.+..+.....|++.-++|+.-.... ++ .+|..++-.. . T Consensus 252 g~~~vl~~g-----~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~----------~ 316 (432) T protein:vir:81 252 GRAPLLEGG-----MDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQ----------L 316 (432) T ss_pred CCceecCCC-----ceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHH----------H Confidence 223333221 2333333333334555666777888999888887433221 11 2223332222 1 Q ss_pred HHHHHHHHHHHHHHHHHhccc---CCCcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH Q lcl|NC_010179. 369 TYFEHAINELVRAIMRYLNFS---DADKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ 439 (469) Q Consensus 369 ~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~ 439 (469) ..+..+|.-.++.+...++.+ ..+.....++| ..-+..|.++.++.+.++ +|+++.-++.++++. +++- . 
T Consensus 317 ~f~~~tl~P~~~~ie~~l~~kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~glpp~~g~-~ 395 (432) T protein:vir:81 317 GFLTMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGN-A 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-c Confidence 222334444444444333321 11122334555 444667899999998886 689999999888753 2211 1 Q ss_pred HHH-------HHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 440 ELK-------DLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 440 E~e-------ri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++- -+..-.+...+.........+++.+.+ T Consensus 396 ~~~~~~~~~~pl~~~~~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 396 AVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred ceEeecCcccchhhhccCCCCCCCCCCCCcccccccC Confidence 110 011000000110000111111111111 No 212 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=92.97 E-value=0.0093 Score=31.73 Aligned_cols=364 Identities=11% Similarity=0.008 Sum_probs=151.9 Q ss_pred CCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRND-LINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~-~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) |=+ ++....... ..+...-+...+-|-.. .. .+.... 
+..-+.++-....|+..++-+ T Consensus 1 m~~-------~~~~~~~~~~~~~~~~~~~~~~~g~~~----------s~-~~~~v~---~~~al~~~~v~~cv~~ia~~i 59 (419) T protein:vir:80 1 MFF-------SRQLLSNLGQTQPGSGGWVSALLGSAR----------SE-AGQVVT---PASALSLTVLQNCVTLLAESI 59 (419) T ss_pred CCc-------ccccccccCcCCCCcchhhHHhhcccc----------cc-cCcccC---hHHhhccHHHHHHHHHHHHhh Confidence 000 000000000 00000000000000000 00 000000 000111222333555555555 Q ss_pred hcCCeeecc--Cch---hhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEE Q lcl|NC_010179. 80 ASVFPDIDV--GKD---ADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPV 147 (469) Q Consensus 80 ~g~p~~~~~--~~~---~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~ 147 (469) -+-|+.+-- ++. .....+..++.. | .+ +....+......+|.+|+.+..+.+|++. +.+++|..+-+. T Consensus 60 a~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~~~~v~i~ 139 (419) T protein:vir:80 60 AQLPVELYERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLDNEAVTVM 139 (419) T ss_pred ccCceEEEEecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEE Confidence 566665421 111 112235555542 3 22 22345667889999999999889889865 888999888776 Q ss_pred EeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCc Q lcl|NC_010179. 148 YATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGR 227 (469) Q Consensus 148 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 227 (469) .+... .+ +|... +.. .+..+. T Consensus 140 ~~~~~--~~-----~y~~~---~~~------~~~~~~------------------------------------------- 160 (419) T protein:vir:80 140 KGPDL--KP-----MYRVA---GAD------PLPQRL------------------------------------------- 160 (419) T ss_pred ECCCc--eE-----EEEEc---Ccc------ccchhh------------------------------------------- Confidence 55421 11 11110 000 001111 Q ss_pred ccEEEecC----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEec--CC-cccchhhhhhhh---------- Q lcl|NC_010179. 
228 VPFIEFPK----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTN--YG-GASLKQFMNDLR---------- 290 (469) Q Consensus 228 vPvv~~~n----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g--~~-~~~~~~~~~~~~---------- 290 (469) |++++. ...|.|-+..+...++.......-..+.+...+.|-.+++- .. .....+....++ T Consensus 161 --i~h~~~~~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 238 (419) T protein:vir:80 161 --VHHVRWMSINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGS 238 (419) T ss_pred --eEEecCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCc Confidence 122221 22467777666666665555544455555666677655542 11 111111111111 Q ss_pred --hcceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 291 --EYKSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAK 366 (469) Q Consensus 291 --~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~ 366 (469) ..+++.++. +.+|..... ....+.+..+...+.|+..-++|+.-....++.+...++.. T Consensus 239 ~n~g~~~vl~~-------g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~---------- 301 (419) T protein:vir:80 239 GNAKKVALLQE-------GMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQ---------- 301 (419) T ss_pred cccCCceecCC-------CceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHH---------- Confidence 112333321 234443333 23445566677788998888888743322211111111111 Q ss_pred HHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCH Q lcl|NC_010179. 367 TQTYFEHAINELVRAIMRYLNFS---DADKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDW 437 (469) Q Consensus 367 ~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~ 437 (469) ....+...|.-+++.|...++.+ ........+.| +.-+..|.++.++.+.++ +|+++.-.+.+.++. 
+++- T Consensus 302 ~~~f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~~gG 381 (419) T protein:vir:80 302 SLQFVIYTLLPWVKRHEQAKTRDLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRLENMPPVKGG 381 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCc Confidence 11233334444444444433321 11112233444 455667899999999887 689999888888753 2221 Q ss_pred HHHHHHHHHHHHHhhhhHhhcccCCCCCCCC-C Q lcl|NC_010179. 438 QQELKDLAKDREENDPYANQADELNGKGVDD-E 469 (469) Q Consensus 438 ~~E~eri~~E~~~~~~~~~~~~~~~~~~~~d-e 469 (469) + ++ .+.. ...+... .++..++..++ + T Consensus 382 D-~~-~~~~---n~~~~~~-~~~~~~~~~~~~~ 408 (419) T protein:vir:80 382 D-IY-LSPM---NMVDASK-PQPIPMGKTEPTK 408 (419) T ss_pred c-ee-eecc---ccccccc-cccccCCCCCchh Confidence 1 11 0000 0001111 11111111111 1 No 213 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=92.92 E-value=0.0095 Score=31.68 Aligned_cols=367 Identities=11% Similarity=0.019 Sum_probs=161.6 Q ss_pred CCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhc--------cccccccc-ccCcce-eccchHH Q lcl|NC_010179. 1 MEL-DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVS--------KEGKKDPL-RSADNR-IPSNFYQ 69 (469) Q Consensus 1 ~~~-~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~--------~~~~~~~~-~~~~~r-i~~n~~k 69 (469) |.. .++++. +...+|+.+.............. ........ .....+ ...+... 
T Consensus 1 ~~~~~~~~~~----------------~~m~~F~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 64 (413) T protein:vir:96 1 MPGVSEIRKD----------------KNLKFFNNKRSPTEESKAKDEIPKAPQVVMTLPNFFKELISDGYTKLSDSPEVR 64 (413) T ss_pred CCccchhhhh----------------hcCCccccCCCcchhhhhhccccccccccccchhhHhhhccchhHHHhhchHHH Confidence 211 111111 11122322211000000000000 00000000 000001 1134455 Q ss_pred HHHHHHHHhhhcCCeeeccCc----hhhHHHHHHHHh--cc-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCc-e-EEE Q lcl|NC_010179. 70 LLVDQEAGYIASVFPDIDVGK----DADNKKILDVLG--DD-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNN-F-RYG 137 (469) Q Consensus 70 ~iv~~~~~~l~g~p~~~~~~~----~~~~~~l~~~~~--~n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~-~-~i~ 137 (469) ..|+..++-+.+-|+.+-..+ ......+..++. -| . .+....+..+.+.+|.+|+++..+.+|. + .+. T Consensus 65 ~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~ 144 (413) T protein:vir:96 65 MAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLT 144 (413) T ss_pred HHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEE Confidence 566666666666676642111 122233444443 23 2 2334567788899999999998888874 3 588 Q ss_pred EEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccc Q lcl|NC_010179. 138 IIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQ 217 (469) Q Consensus 138 ~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (469) +++|..+.+.++++. + +|.... .+ . .+.... T Consensus 145 ~l~~~~v~~~~~~~~---~-----~y~~~~-~~-~------~~~~~e--------------------------------- 175 (413) T protein:vir:96 145 PISPYKVTFNVSDDD---L-----DYSITF-DN-K------EYDPST--------------------------------- 175 (413) T ss_pred EecCceeEEEEcCCe---E-----EEEEee-cC-c------EEchhh--------------------------------- Confidence 899998887766421 1 111110 01 0 011111 Q ss_pred cccccccCCcccEEEecC-----C-ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhh Q lcl|NC_010179. 
218 SNTLKHNFGRVPFIEFPK-----N-KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMND 288 (469) Q Consensus 218 ~~~~~~~~g~vPvv~~~n-----~-~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~ 288 (469) |+||+. + -.|.|-+..+...+...........+.+...+.|-.+++....... ...... T Consensus 176 ------------vih~k~~~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~~l~~e~~~~~~~~ 243 (413) T protein:vir:96 176 ------------LLHFVLNPSIERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDSDSDELSDEEGREN 243 (413) T ss_pred ------------EEEEeccCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCCCCHHHHHHHHHH Confidence 233321 1 1366767666666666665555566667777777666654221111 111111 Q ss_pred hh--------hcceeeecccCCCCCCcceEEe-ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHH Q lcl|NC_010179. 289 LR--------EYKSIKINNAGNGDKSGVDKLQ-IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSH 359 (469) Q Consensus 289 ~~--------~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~ 359 (469) +. ..+.+.+..++. ...-+. .+.....+.+..+...+.|+..-++|+.-.. .+ ++. +... T Consensus 244 ~~~~~~g~~n~g~~~vl~~~~~----~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg-~~--~~~--~~~~-- 312 (413) T protein:vir:96 244 FEEMYLKRKEAGKPWIIPEGMV----NVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLG-VG--TYN--KDEF-- 312 (413) T ss_pred HHHHhcCccccCceeeecCCcc----cccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcC-CC--cch--HHHH-- Confidence 11 112233322221 111111 1222344555666777888888888864332 11 111 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCC-CcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC Q lcl|NC_010179. 360 LELKAAKTQTYFEHAINELVRAIMRYLNFSDA-DKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIVDD 436 (469) Q Consensus 360 l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~-~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v~d 436 (469) ...+..+|.-+++.|...++.+=. +...+++.++.-+..|.++.++++.++ +|+++.-.+.++++.-+. 
T Consensus 313 --------~~~~~~~l~P~~~~ie~~ln~~ll~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~~p~ 384 (413) T protein:vir:96 313 --------NNFINTKIMSIAQVIQQTYNKLIVEEDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNALRRNEFRNWVGMPPD 384 (413) T ss_pred --------HHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCC Confidence 123444555555555555443211 123345555566677899999999887 689999999888865332 Q ss_pred HHHHHHHHHHHHHHhhhhH--hhcccCCCCCC Q lcl|NC_010179. 437 WQQELKDLAKDREENDPYA--NQADELNGKGV 466 (469) Q Consensus 437 ~~~E~eri~~E~~~~~~~~--~~~~~~~~~~~ 466 (469) + .-+.+.--. -..+.. ...++.++++. T Consensus 385 ~--~gd~~~~~~-n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 385 A--EMDDLLVLE-NYLQQKDLVNQKKLIQDET 413 (413) T ss_pred C--Ccceeeecc-cccchhhcccccCCCCCCC Confidence 1 111111000 001111 01111122222 No 214 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=92.91 E-value=0.0095 Score=31.67 Aligned_cols=376 Identities=8% Similarity=0.014 Sum_probs=157.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeee- Q lcl|NC_010179. 8 KLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDI- 86 (469) Q Consensus 8 ~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~- 86 (469) -++..+.....+....-.-+.....|-.--... .+.... 
+..-+..+-....|+..++-+-+-|+.+ T Consensus 1 m~~~~~~~~~~~~~s~~~~w~~~~~~~~~~~~~---------~g~~vt---~~~al~~~~v~~~i~~Ia~~iA~lp~~~~ 68 (421) T protein:vir:10 1 MFIPQMFEGKKRSVSGGGFWEAMLGGVRSSHSK---------AGVMIT---PETALALSAVRACVTLLAESVAQLPVELY 68 (421) T ss_pred CCCcchhcccccccCcchhhHHHhhhhccCccc---------CCceec---hHHhhccHHHHHHHHHHHHhhccCceEEE Confidence 111111111111110000000111100000000 000000 0000112222334555555555556553 Q ss_pred cc-Cch----hhHHHHHHHHhc--c-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCC Q lcl|NC_010179. 87 DV-GKD----ADNKKILDVLGD--D-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDN 154 (469) Q Consensus 87 ~~-~~~----~~~~~l~~~~~~--n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~ 154 (469) .. .+. .....+..++.. | . .+....+..+.+.+|.+|+++-.+.+|++. +.+++|..+.+..+++ . T Consensus 69 ~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~--g 146 (421) T protein:vir:10 69 RRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIPINPKKVIVLKGPD--G 146 (421) T ss_pred EEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCceEEEEECCC--c Confidence 11 111 112234555532 3 2 222345667889999999999888888864 8888898888765532 1 Q ss_pred ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 155 KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 155 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) . .+|... ..|. . +....+.+++.. + . T Consensus 147 ~-----~~y~~~-~~g~-~------~~~~eiih~~~~-------------------------------------~----~ 172 (421) T protein:vir:10 147 M-----PYYEIP-EIGE-T------LPMRMMHHVKVF-------------------------------------S----L 172 (421) T ss_pred e-----EEEEEc-CCCc-E------EchhhEEEecCc-------------------------------------C----C Confidence 1 122211 1111 0 111111111100 0 0 Q ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC---cccchhhhhhh----hh--------cceeeecc Q lcl|NC_010179. 
235 KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG---GASLKQFMNDL----RE--------YKSIKINN 299 (469) Q Consensus 235 n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~---~~~~~~~~~~~----~~--------~~~~~~~~ 299 (469) +.-.|.|-++.+...++.......-..+.+...+.|-.+++-.. ....++....+ .. .+++.++ T Consensus 173 d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~- 251 (421) T protein:vir:10 173 DGYIGSSPIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQ- 251 (421) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecC- Confidence 11236676766666666555555555555666677766655321 11122221111 11 1223232 Q ss_pred cCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 377 (469) .+.+|..... ....+.+..+...+.|+..-++|+.-....+..+...++. .....+..+|.- T Consensus 252 ------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~----------~~~~f~~~tl~P 315 (421) T protein:vir:10 252 ------EGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEH----------QGLQFVMYTLLA 315 (421) T ss_pred ------CCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHH----------HHHHHHHHHHHH Confidence 2244444433 3345556667778889888888874332222111111111 112333345555 Q ss_pred HHHHHHHHhccc---CCCcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHH-H Q lcl|NC_010179. 378 LVRAIMRYLNFS---DADKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAK-D 447 (469) Q Consensus 378 ~~~~i~~~~~~~---~~~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~-E 447 (469) ++..+...++.+ ........+.| ..-+..|.++.++.+.++ +|+++.-.+.+.++. +++-+.=+-...- . 
T Consensus 316 ~~~~ie~~ln~kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~~~~n~~~ 395 (421) T protein:vir:10 316 WLKRHEGALQRDLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSVNDIRRMENLPPIAGGDKYLTPLNMVD 395 (421) T ss_pred HHHHHHHHHhhhccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCcceeeecccccc Confidence 555554444432 11122334555 444567899999999887 689999999888754 2221110100000 0 Q ss_pred HHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 448 REENDPYANQADELNGKGVDDE 469 (469) Q Consensus 448 ~~~~~~~~~~~~~~~~~~~~de 469 (469) .+...+ .+.......++.+|+ T Consensus 396 ~~~~~~-~~~~~~~~~~~e~d~ 416 (421) T protein:vir:10 396 SAQIIP-GDKKPTAQQMAEIDT 416 (421) T ss_pred cccccc-CCCCcccccCccccc Confidence 000011 011111122222222 No 215 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=92.40 E-value=0.012 Score=31.20 Aligned_cols=360 Identities=11% Similarity=0.068 Sum_probs=145.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+= ++.... ... + ....+..-+ +.... .......+..... ..-+..+=....|+..+.-+- T Consensus 1 Mglf------~~~~~~-~~~-~--~~~~~~~~~---~~~~~--~~~~~~~~~~v~~---~~al~~~~V~~~i~~Ia~~ia 62 (384) T protein:vir:49 1 MPIF------NITNLA-TES-P--PSNQDSFFD---ITDPE--FLDALNGSEWVSA---ETALKNSDLFSIISQLSNDLA 62 (384) T ss_pred Cccc------cccccC-ccc-c--cccchhhcc---ccchh--hcccccCCceech---hhhhccHHHHHHHHHHHHHHh Confidence 2210 000000 000 0 000000000 00000 0000000000000 000111222344555555555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc-c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCC Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGD-D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDN 154 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~-n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~ 154 (469) +-|+.+.-. .. ..++.. | ..+....+..+++.+|.+|+.+-.+.+|++ .+.+++|..+-++.++.. . T Consensus 63 ~l~~~~~~~--~~----~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~-~ 135 (384) T protein:vir:49 63 TAKITTSRK--QL----QGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQ-N 135 (384) T ss_pred hCceeeecc--hh----hhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCC-c Confidence 556654321 11 122221 2 223345677888999999999988998886 588899999888765432 1 Q ss_pred ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEec Q lcl|NC_010179. 155 KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP 234 (469) Q Consensus 155 ~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~ 234 (469) .+ +|.....+.... ....+....+.+++.-. .. T Consensus 136 ~~-----~y~~~~~~~~~~--~~~~~~~~eVih~~~~~----------------------------------------~~ 168 (384) T protein:vir:49 136 GL-----YYNITFDDPRIP--PKQHVPQGDILHFRLLS----------------------------------------VD 168 (384) T ss_pred eE-----EEEEEecCcccc--ceeEecCccEEEecCCC----------------------------------------CC Confidence 11 111111111000 00112222222221100 00 Q ss_pred CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh--------hcceeeecccCCCCCC Q lcl|NC_010179. 235 KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR--------EYKSIKINNAGNGDKS 306 (469) Q Consensus 235 n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~ 306 (469) ..-.|.|-+..+...++....+.....+.+...+.|-.+++-.+....++...... 
..+++.++ + T Consensus 169 ~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl~-------~ 241 (384) T protein:vir:49 169 GGLTSVSPLMALGRELNIQKASDKLTLNALKNALNANGILKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVLD-------D 241 (384) T ss_pred CceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCChHHHHHHHHHHHhcccCCccceecC-------C Confidence 01246777777777776666666666666677777766654322222222111111 11222221 2 Q ss_pred cceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 307 GVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAI 382 (469) Q Consensus 307 ~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i 382 (469) +++|..... ....+.+..+.+.+.|+..-++|+.-... .+..++..++..+...+. ..+..+...+ T Consensus 242 g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~----------~~l~pi~~~i 311 (384) T protein:vir:49 242 LEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVS----------RFLRPFVSEL 311 (384) T ss_pred CceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHH----------HHHHHHHHHH Confidence 345544433 33455666778889999988998753322 222344444333322221 1222222222 Q ss_pred HHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhC---CCCCCHHHHHHHHHHHHHHhhhhHhh Q lcl|NC_010179. 383 MRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKAN---PIVDDWQQELKDLAKDREENDPYANQ 457 (469) Q Consensus 383 ~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l---~~v~d~~~E~eri~~E~~~~~~~~~~ 457 (469) ...++.+ +.....+....+.......+..+ +|+.++-++.+.+ |+.+ .|+.++ +. T Consensus 312 ~~~l~~~------l~~~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~---ne~r~~-----------~~ 371 (384) T protein:vir:49 312 SKKLSCE------VDADILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILP---KDLPEG-----------ET 371 (384) T ss_pred HHHhchh------hhhhhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCC---hhHHHH-----------cC Confidence 2222110 00000111111111122222222 5677877776654 5443 233222 12 Q ss_pred cccCCCCCCCCC Q lcl|NC_010179. 
458 ADELNGKGVDDE 469 (469) Q Consensus 458 ~~~~~~~~~~de 469 (469) ..+.++++.+|| T Consensus 372 ~~p~~gGd~~~~ 383 (384) T protein:vir:49 372 DSTLKGGETNEQ 383 (384) T ss_pred CCCCCCCCCCCC Confidence 334466666666 No 216 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=91.96 E-value=0.013 Score=30.84 Aligned_cols=342 Identities=12% Similarity=0.111 Sum_probs=137.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+ .+++..++... ...+.+. .. ... ....-+.......+|+..++-+. T Consensus 1 Mg~------f~~~f~~~~~~-------~~~~~~~--~~-------------~~~---~~~~a~~~~~v~~~i~~ia~~ia 49 (385) T protein:vir:95 1 MGL------FDSVFKRHSEL-------SWMYDLE--FL-------------QDK---SKKAYLKQIALNTVVEMVARTIS 49 (385) T ss_pred Cch------hhhhhccCccc-------ccccchh--hh-------------hcc---chhhhhhhHHHHHHHHHHHHHHc Confidence 332 11111110000 0000000 00 000 00001122334456666666666 Q ss_pred cCCeeeccCchhhHHHHHHHHhc--c-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCceEE--EEEccceeEEEEeCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD--D-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRY--GIIQPDQITPVYATTL 152 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~--n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i--~~~~p~~~~~~~d~~~ 152 (469) +-|+.+--.+......+..++.. | . .+....+..+.+.+|.+|++. +.+++..+ .++.|... 
.++.+ T Consensus 50 ~~p~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~--~~~~~~~~~~~~~~~~~~-~~~~~-- 124 (385) T protein:vir:95 50 QSEFRVMKNNTKEKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVK--NDEGHFFVADDFEKEDEL-GLYSH-- 124 (385) T ss_pred ccceeeeecCccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEE--ecCCCeeecccccccccc-ccccc-- Confidence 66766533333333445555542 3 2 223345677888899998654 44443211 11111110 00000 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) .++...... .. ....+... -|++ T Consensus 125 --------~~~~~~~~~-~~---~~~~~~~~---------------------------------------------eiih 147 (385) T protein:vir:95 125 --------RFTNVLVND-FE---FKRVFTMD---------------------------------------------DVIY 147 (385) T ss_pred --------cceeeeecc-cc---eeeeeccc---------------------------------------------cEEE Confidence 000000000 00 00001111 1233 Q ss_pred ecCC-----ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCc--eeEEecCCcccchhh----hhhhh---------hc Q lcl|NC_010179. 233 FPKN-----KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTV--ILVLTNYGGASLKQF----MNDLR---------EY 292 (469) Q Consensus 233 ~~n~-----~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p--~l~~~g~~~~~~~~~----~~~~~---------~~ 292 (469) ++.. ..|.|.++.+...+ +...+...+...| ++++.+... ..++. ...+. .. T Consensus 148 ~~~~~~~~~~~G~s~~~~~~~~i-------~~~~~~~~~~~~~~g~l~~~~~~~-~~~e~~~~~~~~~~~~~~g~~~~~~ 219 (385) T protein:vir:95 148 LKYNNQKLDAFSLGLFEDYGEIF-------GRMIDLQMLNNQIRGILKVDATKF-YNKEKQKELQAYIDTLFDAFQNNTI 219 (385) T ss_pred ecCCCCCcccccchHHHHHHHHH-------HHHHHHHHhcCCCceEEEeCCccC-CCHHHHHHHHHHHHHHhhhhhhcCC Confidence 3321 12444443333332 2222333333333 333333211 11111 11111 11 Q ss_pred ceeeecccCCCCCCcceEEeecC------CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
293 KSIKINNAGNGDKSGVDKLQIDI------PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAK 366 (469) Q Consensus 293 ~~~~~~~~~~~~~~~~~~l~~~~------~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~ 366 (469) +++.++. +.+.+-++... ....+.+..+...+.|+..-++|+.-.. |+-|.. .+. T Consensus 220 ~i~~l~~-----g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~--~~~sn~------------e~~ 280 (385) T protein:vir:95 220 AVVPLTE-----GLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVL--GEMADL------------EKT 280 (385) T ss_pred ceEEcCC-----CceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhc--CCCcCH------------HHH Confidence 1222222 12233333211 1345666677788889998888864332 111110 112 Q ss_pred HHHHHHHHHHHHHHHHHHHhccc-----CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCH Q lcl|NC_010179. 367 TQTYFEHAINELVRAIMRYLNFS-----DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIV--DDW 437 (469) Q Consensus 367 ~~~~~~~~l~~~~~~i~~~~~~~-----~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v--~d~ 437 (469) ....+..+|.-+++.|...++.+ +.....+++.+..-+..|.++.++++.++ +|+++.-++.+.+++- +++ T Consensus 281 ~~~~~~~~l~P~~~~ie~~l~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~g~~p~~~~ 360 (385) T protein:vir:95 281 IESYLQFCINPLLRKIEAELNSKFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMTGEEPADDP 360 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCC Confidence 23444455665666555555431 11112344555566777899999999887 6889999888887642 221 Q ss_pred HHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 438 QQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 438 ~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .. 
++.-- ..+....+...+++.++| T Consensus 361 ~g--d~~~~-----~~n~~~~~~~kgge~~~e 385 (385) T protein:vir:95 361 EL--DKFII-----TKNLQSADAFKGGESNEE 385 (385) T ss_pred CC--ceeee-----cccceecccccCCCCCCC Confidence 11 11100 011122344566666666 No 217 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=91.52 E-value=0.015 Score=30.51 Aligned_cols=387 Identities=16% Similarity=0.152 Sum_probs=170.5 Q ss_pred CCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 MEL----DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~----~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) .++ ..-.+| +.+|+.+-.+++-.. =...||+..+ T Consensus 43 ~~~e~~~~~~~eL-----------I~~YR~ma~~pEvd~-------------------------------Av~eIVneai 80 (533) T protein:vir:10 43 VDFDGQVRNEYQL-----------ISRYREMVLQPECDS-------------------------------AVDDIVNETI 80 (533) T ss_pred eecccccchHHHH-----------HHHHHHHhhccchhh-------------------------------HHHHhhccee Confidence 111 122222 233444333333221 1122333222 Q ss_pred H-hhhcCCeeeccCchhhHHHHHHHHhc---------cHHHHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEEccc Q lcl|NC_010179. 
77 G-YIASVFPDIDVGKDADNKKILDVLGD---------DRALTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGIIQPD 142 (469) Q Consensus 77 ~-~l~g~p~~~~~~~~~~~~~l~~~~~~---------n~~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~~p~ 142 (469) - =....|+.+..++.+..+.+++...+ +|.....+..+.+.+.|+-|.+.-+|.+ |-..+..+||+ T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr 160 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPR 160 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeecccc Confidence 2 23345677766654444443332222 4455677888999999999998877754 55668889999 Q ss_pred eeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccc Q lcl|NC_010179. 143 QITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (469) Q Consensus 143 ~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (469) .+-++.--.. +....++++. ...+++. +...||.....+. ... ...+ T Consensus 161 ~i~~vr~i~~--~~~~~~~~~~----------~~~~v~~-~~~eyf~Ynp~g~-~~~---------~~~~---------- 207 (533) T protein:vir:10 161 KIRKINETEQ--KRPEQLRGLP----------LNQQLSP-KSAEYFLYDPKGL-KNS---------TTQG---------- 207 (533) T ss_pred ceeeeeeeec--cCCCccceee----------cchhhhc-cceeeeeeccccc-ccc---------CCCc---------- Confidence 8777653110 0001111100 0001111 1111222211110 000 0000 Q ss_pred ccCCcccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhh---- Q lcl|NC_010179. 223 HNFGRVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQF---- 285 (469) Q Consensus 223 ~~~g~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~---- 285 (469) -+||- |.|... ..+.-.+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+. 
T Consensus 208 ---vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~i 284 (533) T protein:vir:10 208 ---LKIAPDSICYVHSGIMDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREV 284 (533) T ss_pred ---eecchhheeeeeccceeCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHH Confidence 00110 111100 0111122223344444443 344555555666666443332221111 111 Q ss_pred hhhhhhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--Cc Q lcl|NC_010179. 286 MNDLREYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--AN 342 (469) Q Consensus 286 ~~~~~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~ 342 (469) +...+ ++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. .. T Consensus 285 M~k~K-NklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~-DV~YF~kKLY~aLnVP~SRl~~ 362 (533) T protein:vir:10 285 MGRYR-NKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEITTLPGGQNLGELE-DVKYFQKKLYKSLNVPGSRLET 362 (533) T ss_pred HHhcc-ceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCCccccCC Confidence 11111 222221111 112233455555444555443 367777888888888842 22 Q ss_pred cc---cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHHH- Q lcl|NC_010179. 
343 FE---SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ- 413 (469) Q Consensus 343 ~~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~~- 413 (469) ++ +|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...++ T Consensus 363 e~~f~~Gr~~--EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Ei 440 (533) T protein:vir:10 363 ETTFNVGRAA--EITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEI 440 (533) T ss_pred CCcccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHH Confidence 22 34433 34444555556677788888888888877544433321 1223 34677776544444444333 Q ss_pred ------HHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHh---hhhH------hhcccCCCCCCCCC Q lcl|NC_010179. 414 ------IVSTV---AN-YSSKEAVAKANPIVDD--WQQELKDLAKDREEN---DPYA------NQADELNGKGVDDE 469 (469) Q Consensus 414 ------~~~kl---~g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~---~~~~------~~~~~~~~~~~~de 469 (469) +++.+ .| .+|.+++.+.+=-.+| .+++-++|++|.++. .|.. ...++..++-..|| T Consensus 441 l~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~ 517 (533) T protein:vir:10 441 RNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEE 517 (533) T ss_pred HHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCccccc Confidence 33444 33 4799999987533343 456677777775542 1111 11122222222222 No 218 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=91.05 E-value=0.018 Score=30.18 Aligned_cols=415 Identities=11% Similarity=0.057 Sum_probs=167.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcC--C Q lcl|NC_010179. 
6 LKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASV--F 83 (469) Q Consensus 6 ~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~--p 83 (469) .+++++....+.+ +-+....++++++ +-++..-. . .+.. ......++-.+-...-++..++.|.+- | T Consensus 1 mk~~~~~~~~~lk-r~~~e~~w~e~a~--~tlP~~~~---~-~~~~----~~~~~~~~~dstg~~a~~~LAa~l~~~ltp 69 (510) T protein:vir:78 1 MKSTAAMLWEKLR-DGSVEQRAIEFAK--TTLPYLMV---D-PMSG----SRGVVEHDFQSAGALLVNNLAAKLARSLFP 69 (510) T ss_pred ChhHHHHHHHHHh-ccchHHHHHHHHH--hhcccccc---C-CCCc----ccccccCcccchHHHHHHHHHHHHHHhhcC Confidence 4555554444332 3344444555543 11111100 0 0000 001112233455556666666655431 2 Q ss_pred ee-----eccCchh---------hHHHHHHH-----------Hh-ccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEE Q lcl|NC_010179. 84 PD-----IDVGKDA---------DNKKILDV-----------LG-DDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYG 137 (469) Q Consensus 84 ~~-----~~~~~~~---------~~~~l~~~-----------~~-~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~ 137 (469) |. +...+.. ....++.| +. .||...+.++.++...+|.+.+ |.++++. +++ T Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~~~~~-~~~ 146 (510) T protein:vir:78 70 TGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNSDEA-TVV 146 (510) T ss_pred CCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEeCCCC-eEE Confidence 22 2222211 11122322 32 3667778888899999999754 5566544 455 Q ss_pred EEccceeEEEEeCCCCCceEEEEEEEEeeec--------------CCceEEEEEEEEcCCeEEEEEeecC--ceeecccc Q lcl|NC_010179. 138 IIQPDQITPVYATTLDNKLLGVLRSYKQLDP--------------EAGKYFTVHEYWTDKEAQFFRTSAT--DSTVIEPY 201 (469) Q Consensus 138 ~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 201 (469) .++-.++++.-|. .+++...+|.++.... ........+++|+. .+..... .+.++. 
T Consensus 147 ~~pl~~y~v~~d~--~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~----V~~~~~~~~~~~sv~-- 218 (510) T protein:vir:78 147 AWSLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSGSVDLYTH----VQRRKGTAMDYAEMY-- 218 (510) T ss_pred EEEcceeEEeeCC--CcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCceEEEEEE----EEeecCCCCcEEEEE-- Confidence 5655554444443 4556666665554210 00111122222221 0111110 111110 Q ss_pred cccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEec Q lcl|NC_010179. 202 NIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTN 276 (469) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g 276 (469) ....+.... .....++..+|++.++ .+.+|.|-.++..+-+..+|.+.-...........|.+.+.- T Consensus 219 ------~e~dg~~i~--~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p 290 (510) T protein:vir:78 219 ------HEIDGVRVG--ETGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE 290 (510) T ss_pred ------EEecCeeec--cccccccccCCeeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhcCCcccCC Confidence 000111111 1112345567777665 345799989999999999998877776666666666543321 Q ss_pred CCcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHH Q lcl|NC_010179. 277 YGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIK 354 (469) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~ 354 (469) ++.. ....+.......+.++. ..++..+. ...+.......++.++..|-..-.. ++..-+....|+..+. 
T Consensus 291 -~g~~---~~~~l~~~~~g~~v~g~---~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF~~-~l~~~~~~rvTAtEV~ 362 (510) T protein:vir:78 291 -AKGA---VVDDYQDAEMGDYVPGG---AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVR 362 (510) T ss_pred -cccc---chhhhccCCCceeecCC---cccccccccCcccchHHHHHHHHHHHHHHHHHHhh-ccccCCCCCcCHHHHH Confidence 1111 11111111111111221 12344443 2245666677777777766553221 2222223334665443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhcccC---CCc---ccceEEeCCCCCCCHH-HHH----HHH Q lcl|NC_010179. 355 MLYSHLELKAAKTQTYFEHAINE--------LVRAIMRYLNFSD---ADK---RHISQHWTRTKVEDSL-TKA----QIV 415 (469) Q Consensus 355 ~~~~~l~~k~~~~~~~~~~~l~~--------~~~~i~~~~~~~~---~~~---~~i~i~f~~~~p~d~~-e~~----~~~ 415 (469) .. +.++...++..+.+ +++.++.++...+ ... ....|++..++-+... +.+ +.+ T Consensus 363 ~r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l 435 (510) T protein:vir:78 363 IT-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVI 435 (510) T ss_pred HH-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeecccHHHHHHHHHHHHHHHHHH Confidence 32 23444444444433 2333333332222 111 1222344333333111 111 111 Q ss_pred HHHhc---c---CChHHHHHh----CCCCC--C---HHHHHHHHHHHHHHhh--hh------Hhhccc--CCCCCC Q lcl|NC_010179. 416 STVAN---Y---SSKEAVAKA----NPIVD--D---WQQELKDLAKDREEND--PY------ANQADE--LNGKGV 466 (469) Q Consensus 416 ~kl~g---~---iS~et~~~~----l~~v~--d---~~~E~eri~~E~~~~~--~~------~~~~~~--~~~~~~ 466 (469) ..+++ + +....++.. +| |+ . .++|++.+.+++.+.. .. .++... ....|. 
T Consensus 436 ~~~~~~~q~~~~id~d~~~~~~a~~~G-v~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~~~~~~~~~~~g~ 510 (510) T protein:vir:78 436 AGLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HHhcChhhhhhcCCHHHHHHHHHHHhC-CChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccCCCC Confidence 11111 1 333444433 33 31 1 2456666655432211 11 111111 111122 No 219 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=90.44 E-value=0.021 Score=29.80 Aligned_cols=409 Identities=13% Similarity=0.095 Sum_probs=172.8 Q ss_pred CCHHHHHHHHHHH------------------------------HHHHHHHHHHHHHHHHHhccCCcccccccchhhhccc Q lcl|NC_010179. 1 MELDALKKLIRNT------------------------------STSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKE 50 (469) Q Consensus 1 ~~~~~~~~~i~~~------------------------------~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~ 50 (469) ..|+.-+++..+. .......+.+|+.+..+++-. T Consensus 7 f~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~pEvd---------------- 70 (558) T protein:vir:10 7 FSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQYVDIEGAYRSEYDLIRRYREMALHPEAD---------------- 70 (558) T ss_pred chhhhhhhhccCCccccCCCccccccceeccceeeeeecccchhhhHHHHHHHHHHHhhccchh---------------- Confidence 1111111111000 001111223344433333221 Q ss_pred ccccccccCcceeccchHHHHHHHHHH-hhhcCCeeeccCchhhH----HHHHHHHhc-----cHHHHHHHHHHHHHhCC Q lcl|NC_010179. 51 GKKDPLRSADNRIPSNFYQLLVDQEAG-YIASVFPDIDVGKDADN----KKILDVLGD-----DRALTLNSLLVDSSNAG 120 (469) Q Consensus 51 ~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l~g~p~~~~~~~~~~~----~~l~~~~~~-----n~~~~~~~~~~~~~~~G 120 (469) +=...||+..+- =-...|+.+..++.+.. +.+.+-++. 
+|.....+..+.+.+.| T Consensus 71 ---------------~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDg 135 (558) T protein:vir:10 71 ---------------GAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDG 135 (558) T ss_pred ---------------hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeee Confidence 112223332222 23446666666554433 333333322 45566778899999999 Q ss_pred eEEEEEEEcCC----CceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCcee Q lcl|NC_010179. 121 RAWLHYWIDED----NNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDST 196 (469) Q Consensus 121 ~~~~~v~~d~~----~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 196 (469) +-|.+..+|.+ |-..+..+||+.+-+|..-.. +..-........+..+.. +-+....||.+...... T Consensus 136 RiyfHKiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~--~~~~~~~~~~~~~~~~~~-------~~~~~~eyy~Y~~~~~~ 206 (558) T protein:vir:10 136 RVFYLKVIDTKNPQEGIQDLRYIDPLKIKFIRQEKR--KPGNQDPAIRVRSEQDVV-------PNPEFEEFYIYTPKVQH 206 (558) T ss_pred EEEEEEEEeCCCccccceeeeeeCcccceeeeeecc--ccccccceeeeeccccee-------eccceeEeeeecCCccc Confidence 99999998755 666789999998876654211 111111111111111110 01111222221111100 Q ss_pred ecccccccccccccccccccccccccccCCcccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhc Q lcl|NC_010179. 197 VIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQ 268 (469) Q Consensus 197 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~ 268 (469) ..... ..... .+++ +||- |.|... ..+.-.+.-+..-|..+|. 
++-+.+-..+..+ T Consensus 207 ~~~~~-----~~~~~----------~~~v-kI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlEDAlVIYRitR 270 (558) T protein:vir:10 207 PTGMV-----GQMGG----------KNSI-KIAKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIEDSLVIYRLSR 270 (558) T ss_pred ccccc-----eeecC----------CCce-eechhheeeecccceecCCCeeeecchHhhHhHHhhHHHHhhHHHHhhhc Confidence 00000 00000 0000 1111 111111 0011112223344444443 3445555556666 Q ss_pred CceeEEecCCcccc-----hhh----hhhhhhcceeeeccc---------------------CCCCCCcceEEeecCCHH Q lcl|NC_010179. 269 TVILVLTNYGGASL-----KQF----MNDLREYKSIKINNA---------------------GNGDKSGVDKLQIDIPVE 318 (469) Q Consensus 269 ~p~l~~~g~~~~~~-----~~~----~~~~~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~ 318 (469) .|-.-+.-.+..+. .+. +...+ ++++.=... +.+.+-.+..|....++. T Consensus 271 APERRvFYIDVGnLPk~KAeqYlr~iM~k~K-NklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnLg 349 (558) T protein:vir:10 271 APERRIFYIDVGNLPKVKAEQYLKEVMSRYR-NKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEITTLPGGQNLG 349 (558) T ss_pred cccceEEEEecCCCCchhHHHHHHHHHHhcc-ceEEEeccCceecccchhhhhHhhhcccccCCCCccceeeccccCCcc Confidence 66443332221111 111 11111 222221111 112233455555444554 Q ss_pred HHHHHHHHHHHHHHHHhCCCC--cCccc---cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCC Q lcl|NC_010179. 319 ARDDALKITRDNIFLFGQGID--PANFE---SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DAD 392 (469) Q Consensus 319 ~~~~~~~~l~~~i~~~s~~p~--~~~~~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~ 392 (469) ... 
.++-+.+-+|+...+|- +..++ +|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+ T Consensus 350 em~-DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~--EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~ee 426 (558) T protein:vir:10 350 ELS-DVDYFQKKLYRALGVPESRIAAEGGFNLGRSS--EILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKNIVTPED 426 (558) T ss_pred hHH-HHHHHHHHHHHHhCCCccccCCCCcccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHH Confidence 433 36777788888888884 22222 34433 34444555556677778888888888877544433321 123 Q ss_pred c----ccceEEeCCCCCCCHHHHH-------HHHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHh---h Q lcl|NC_010179. 393 K----RHISQHWTRTKVEDSLTKA-------QIVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREEN---D 452 (469) Q Consensus 393 ~----~~i~i~f~~~~p~d~~e~~-------~~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~---~ 452 (469) | ..|.+.|...-.-.+...+ ++++.+. | .+|.+++.+.+=-.+| .+++-++|++|..+. . T Consensus 427 W~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~ 506 (558) T protein:vir:10 427 WKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPD 506 (558) T ss_pred HHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCC Confidence 3 3467777654444444333 3444443 3 4799999987533333 456677777776542 1 Q ss_pred hhH---hhcccCCCCCCCCC Q lcl|NC_010179. 453 PYA---NQADELNGKGVDDE 469 (469) Q Consensus 453 ~~~---~~~~~~~~~~~~de 469 (469) |.. ...+..+++++-+. 
T Consensus 507 p~~~~~~~~~~~~~~~~~~~ 526 (558) T protein:vir:10 507 PSQIDPITGEPLPQEGDPAM 526 (558) T ss_pred ccccChhhccccCccCCchh Confidence 111 11122222211111 No 220 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=90.39 E-value=0.021 Score=29.77 Aligned_cols=433 Identities=9% Similarity=-0.031 Sum_probs=157.9 Q ss_pred CCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhh Q lcl|NC_010179. 1 MELD-ALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYI 79 (469) Q Consensus 1 ~~~~-~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l 79 (469) +-.. .++...++ ..+.+.++ -.++|.++. +...+ ..+.... ...-..++....|+..+..+ T Consensus 14 ~~~~~~~~~~~~~--~~~~~~~~----~~~~~~~~~-~~~p~-~~~~~L~----------~~~e~~~~~~~~i~~~~~~i 75 (651) T protein:vir:99 14 HVEGLGGEADLAK--SPNSTQIP----DHRIQSHNV-GVNPP-YNPDRLA----------AFLELNETLATGIRKKSRYE 75 (651) T ss_pred Eeecccccccccc--cccccccc----hhhhcccCC-CCCCC-CCHHHHH----------HHHhcChHHHHHHHHHhhhh Confidence 1000 00000000 00111111 112344332 22221 1111110 11123577788888888888 Q ss_pred hcCCeeecc----C-c---hhhHHHHHHHHhc------------c----HHHHHHHHHHHHHhCCeEEEEEEEcCCCce- Q lcl|NC_010179. 80 ASVFPDIDV----G-K---DADNKKILDVLGD------------D----RALTLNSLLVDSSNAGRAWLHYWIDEDNNF- 134 (469) Q Consensus 80 ~g~p~~~~~----~-~---~~~~~~l~~~~~~------------n----~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~- 134 (469) .|-++.+.. + + ....+..+.+|.. 
| ....+..+..+...+|.+|+-+..+..|.+ T Consensus 76 ag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~~~i~~~~~~Dle~tGna~ieiIrn~~g~pv 155 (651) T protein:vir:99 76 VGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATPERVKELARQDYHGVGWLALEMLTDIEGRPV 155 (651) T ss_pred hccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCHHHHHHHHHHHHHHHhhHhhhhhhcCccchh Confidence 776654321 1 1 1112233344322 1 122334455667778888877766666553 Q ss_pred EEEEEccceeEEEEeCCCCCceEEEE---------------EEEE---------eeecCCceEEEEEEEEcCCeEEEEEe Q lcl|NC_010179. 135 RYGIIQPDQITPVYATTLDNKLLGVL---------------RSYK---------QLDPEAGKYFTVHEYWTDKEAQFFRT 190 (469) Q Consensus 135 ~i~~~~p~~~~~~~d~~~~~~~~~~v---------------~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (469) .+..+++..+-..-+..........+ +++. ....+...............+..... T Consensus 156 ~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~q~~~~~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~ 235 (651) T protein:vir:99 156 GLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYVQIRNGNRRYFGEAGDRYRGQEVVIDESGDEPTIRYR 235 (651) T ss_pred hhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHHHHHhcCcceEEEeeccccceeeeeccCCcceeEEec Confidence 23333333221100000000000000 0000 00000000000000000000000000 Q ss_pred ecCceeecccccccccccccccccccccccccccCCccc---EEEecCC-----ccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 191 SATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVP---FIEFPKN-----KYRLAELNKYKGLIDAYDDIYNGFIN 262 (469) Q Consensus 191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP---vv~~~n~-----~~g~~~~~~v~~liD~~~~~~s~~~~ 262 (469) .... ....... .....+.... ...+....+| |+||++. 
..|.|.+..+...+.....+..-..+ T Consensus 236 ~d~~-~~~~~~~-----~~~~~g~~~~--~~~~~~~~~~~~eViHir~~~~~~g~~G~spl~~a~~~i~~a~~a~~~~~~ 307 (651) T protein:vir:99 236 EDEE-SEREPIF-----VDRETGDVTT--GDANGLENRPANELIFIPNPSILEDDYGVPDWVSAIRTISADEAAKDYNRD 307 (651) T ss_pred cCcc-eeeeeec-----ccceeeeEEE--cCCCceeEecccceEEecCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHH Confidence 0000 0000000 0000000000 0000111122 5666532 24777777777766665555555556 Q ss_pred HHHHhcCceeEEecCCcccchhhhhh----hh-----hcceeeecccCC----CCCCcceEEeecCC---HHHHHHHHHH Q lcl|NC_010179. 263 DLDDVQTVILVLTNYGGASLKQFMND----LR-----EYKSIKINNAGN----GDKSGVDKLQIDIP---VEARDDALKI 326 (469) Q Consensus 263 ~~~~~~~p~l~~~g~~~~~~~~~~~~----~~-----~~~~~~~~~~~~----~~~~~~~~l~~~~~---~~~~~~~~~~ 326 (469) .+...+.|-.++.-.+....++.... +. ..+.+.+...+. ..+.+++|...... ...+.+..+. T Consensus 308 ~f~NG~~p~gil~~~~~~ls~e~~~~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~ 387 (651) T protein:vir:99 308 FFDNDTIPRMVIKVTGGELSEESKRDLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREK 387 (651) T ss_pred HHhccCCCceEEEecCCCCCHHHHHHHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHH Confidence 66666667666542111111111111 11 123344432211 11235666554432 3455666677 Q ss_pred HHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---C---CCcccceEEe Q lcl|NC_010179. 327 TRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---D---ADKRHISQHW 400 (469) Q Consensus 327 l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~---~~~~~i~i~f 400 (469) ....|.+.-++|+.-.......++..++... ...+..+|.-+++.+...++.+ . 
.....+.+.| T Consensus 388 ~~~eIa~afgVPp~~lG~~~~~~~sn~E~~~----------~~f~~~tL~P~~~~ie~eln~kLl~~~e~~~~~~i~~ef 457 (651) T protein:vir:99 388 NEHEIAKVLEVPPVKIGVTDSANRSNSDQQD----------KDFALEVIQPEQHTFAEWLYQIIHQQALGVTDWTIEYEL 457 (651) T ss_pred HHHHHHHHhCCCHHHhccCCCCCcccHHHHH----------HHHHHHHHHHHHHHHHHHHHHhhcCccccccCceEEEEe Confidence 7888988888886433211111111111111 1223334444444444444321 1 1112345566 Q ss_pred CC--CCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHH--HHHHHHHHHHHHh--hhh---HhhcccCCCCCCC Q lcl|NC_010179. 401 TR--TKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQ--QELKDLAKDREEN--DPY---ANQADELNGKGVD 467 (469) Q Consensus 401 ~~--~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~--~E~eri~~E~~~~--~~~---~~~~~~~~~~~~~ 467 (469) +. -+-.|.+..++.+.++ +|+++.-.+.++++. +++.. .-+..++...... .+. .++......+..+ T Consensus 458 ~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~ 537 (651) T protein:vir:99 458 RGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIGE 537 (651) T ss_pred ccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCcccccccc Confidence 53 3446888888888776 689999998888753 33311 1111111100000 000 0000000000001 Q ss_pred CC Q lcl|NC_010179. 468 DE 469 (469) Q Consensus 468 de 469 (469) .| T Consensus 538 ~e 539 (651) T protein:vir:99 538 RE 539 (651) T ss_pred ch Confidence 11 No 221 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=90.26 E-value=0.022 Score=29.70 Aligned_cols=400 Identities=11% Similarity=0.091 Sum_probs=147.8 Q ss_pred CCH--------HHH------HHHHHHHHHHHH-HHHHHHHHH----HHHhccCCcccccccchhhhcccccccccccCcc Q lcl|NC_010179. 1 MEL--------DAL------KKLIRNTSTSRN-DLINNYKKS----VDYYENKTDITTRNNGKPKVSKEGKKDPLRSADN 61 (469) Q Consensus 1 ~~~--------~~~------~~~i~~~~~~~~-~~~~~~~~~----~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 61 (469) ..+ ..| .+.|.+.+.... ....-+... 
..||.-..-+.+..+. .... ....+ T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~l-------~~~~~- 93 (563) T protein:vir:95 23 VPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNL-HDVL-------KKFGN- 93 (563) T ss_pred eeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccH-HHHH-------HHhhc- Confidence 111 111 111211111100 000000000 0111100000000000 0000 00001 Q ss_pred eeccchHHHHHH----HHHHhhh---------cCCeeeccCc----hh---hHHHHHHHHh----c------cHHHHHHH Q lcl|NC_010179. 62 RIPSNFYQLLVD----QEAGYIA---------SVFPDIDVGK----DA---DNKKILDVLG----D------DRALTLNS 111 (469) Q Consensus 62 ri~~n~~k~iv~----~~~~~l~---------g~p~~~~~~~----~~---~~~~l~~~~~----~------n~~~~~~~ 111 (469) .+....+|+ ..+.|-+ |=|+.+...+ +. ....+..++. + .+.+.+.. T Consensus 94 ---n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~ 170 (563) T protein:vir:95 94 ---NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKK 170 (563) T ss_pred ---chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHH Confidence 123333333 3333321 1122221111 11 1122333332 1 12233445 Q ss_pred HHHHHHhCCeEEEEEEE--cCCCce-EEEEEccceeEEEEeCCCCCceE-EEEEEEEeeecCCceEEEEEEEEcCCeEEE Q lcl|NC_010179. 112 LLVDSSNAGRAWLHYWI--DEDNNF-RYGIIQPDQITPVYATTLDNKLL-GVLRSYKQLDPEAGKYFTVHEYWTDKEAQF 187 (469) Q Consensus 112 ~~~~~~~~G~~~~~v~~--d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~-~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (469) +..+.+.+|.+|+.+.+ +..|++ .+.+++|..+.+..+... .+. ...+++... ++... 
..+....+.+ T Consensus 171 lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g--~~~~~~~~y~~~~--~g~~~----~~~~~~evI~ 242 (563) T protein:vir:95 171 IVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKG--KIIKGGKRFVQVV--DKRVV----ASFTSRELAM 242 (563) T ss_pred HHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCC--ceeccceeEEEEe--CCcee----EEecCcceEE Confidence 67788999999887654 445665 488899999998876532 111 111221111 11111 0111122111 Q ss_pred EEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010179. 188 FRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDV 267 (469) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~ 267 (469) +... |-........|.|-++.+...|.....+..-..+.+... T Consensus 243 ~~~~-------------------------------------~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng 285 (563) T protein:vir:95 243 GIRN-------------------------------------PRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHG 285 (563) T ss_pred Eecc-------------------------------------CCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHcc Confidence 1110 000000012467777766666666655555566666766 Q ss_pred cCceeEE--ecCCccc---chhhhhhhhh--------cceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHH Q lcl|NC_010179. 268 QTVILVL--TNYGGAS---LKQFMNDLRE--------YKSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIF 332 (469) Q Consensus 268 ~~p~l~~--~g~~~~~---~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~ 332 (469) +.|-.++ .|..... .......+.. .++..+- ..+++|..... ....+.+..+...+.|+ T Consensus 286 ~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl------~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia 359 (563) T protein:vir:95 286 GTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVM------ADDIKFVNMTPTANDMQFEKWLNYLINIIS 359 (563) T ss_pred CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEc------CCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 7775444 3421111 1111222211 1111111 12345544444 34556777788888999 Q ss_pred HHhCCCCcCcc--ccC----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCC Q lcl|NC_010179. 
333 LFGQGIDPANF--ESS----NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRT 403 (469) Q Consensus 333 ~~s~~p~~~~~--~~g----~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~ 403 (469) ..-++|+.-.. .-+ ...|..+.... + .......+..+|.-+++.+...++.+ .. ...+.+.|.+. T Consensus 360 ~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn--~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-~~~~~~~f~r~ 433 (563) T protein:vir:95 360 ALYGIDPAEIGFPNRGGATGSKGGSTLNEAD--P---GKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-GDKYTFQFVGG 433 (563) T ss_pred HHhCCCHHHccccccccccccccccchhhcc--H---HHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-ccccEEEeccC Confidence 98899874221 111 11111111110 0 01112333444444444444433321 11 12456778776 Q ss_pred CCCCHHHHHHHHHH-HhccCChHHHHHhCCC--CCCHHH--------HHHHH----HHH---HHHhh---------hhHh Q lcl|NC_010179. 404 KVEDSLTKAQIVST-VANYSSKEAVAKANPI--VDDWQQ--------ELKDL----AKD---REEND---------PYAN 456 (469) Q Consensus 404 ~p~d~~e~~~~~~k-l~g~iS~et~~~~l~~--v~d~~~--------E~eri----~~E---~~~~~---------~~~~ 456 (469) -+.+..+..++... .+|+++.-.+.++++. +++-+. -+... ..+ ++... +... T Consensus 434 D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (563) T protein:vir:95 434 DTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDD 513 (563) T ss_pred CHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCC Confidence 55555554443221 2588998888777643 221110 00000 000 00000 0011 Q ss_pred hcccCCCCCCCCC Q lcl|NC_010179. 
457 QADELNGKGVDDE 469 (469) Q Consensus 457 ~~~~~~~~~~~de 469 (469) ...+....+.+++ T Consensus 514 ~~~~~~~~~~~~~ 526 (563) T protein:vir:95 514 SEEGQSTDSSNDD 526 (563) T ss_pred CCCCCCCCCCCCc Confidence 1111111111111 No 222 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=90.26 E-value=0.022 Score=29.70 Aligned_cols=400 Identities=11% Similarity=0.091 Sum_probs=147.8 Q ss_pred CCH--------HHH------HHHHHHHHHHHH-HHHHHHHHH----HHHhccCCcccccccchhhhcccccccccccCcc Q lcl|NC_010179. 1 MEL--------DAL------KKLIRNTSTSRN-DLINNYKKS----VDYYENKTDITTRNNGKPKVSKEGKKDPLRSADN 61 (469) Q Consensus 1 ~~~--------~~~------~~~i~~~~~~~~-~~~~~~~~~----~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 61 (469) ..+ ..| .+.|.+.+.... ....-+... ..||.-..-+.+..+. .... ....+ T Consensus 23 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~l-------~~~~~- 93 (563) T protein:vir:99 23 VPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMKNEHNL-HDVL-------KKFGN- 93 (563) T ss_pred eeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCCCcccH-HHHH-------HHhhc- Confidence 111 111 111211111100 000000000 0111100000000000 0000 00001 Q ss_pred eeccchHHHHHH----HHHHhhh---------cCCeeeccCc----hh---hHHHHHHHHh----c------cHHHHHHH Q lcl|NC_010179. 62 RIPSNFYQLLVD----QEAGYIA---------SVFPDIDVGK----DA---DNKKILDVLG----D------DRALTLNS 111 (469) Q Consensus 62 ri~~n~~k~iv~----~~~~~l~---------g~p~~~~~~~----~~---~~~~l~~~~~----~------n~~~~~~~ 111 (469) .+....+|+ ..+.|-+ |=|+.+...+ +. ....+..++. + .+.+.+.. 
T Consensus 94 ---n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~ 170 (563) T protein:vir:99 94 ---NPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDVDRDSFQTFCKK 170 (563) T ss_pred ---chHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCCCcchHHHHHHH Confidence 123333333 3333321 1122221111 11 1122333332 1 12233445 Q ss_pred HHHHHHhCCeEEEEEEE--cCCCce-EEEEEccceeEEEEeCCCCCceE-EEEEEEEeeecCCceEEEEEEEEcCCeEEE Q lcl|NC_010179. 112 LLVDSSNAGRAWLHYWI--DEDNNF-RYGIIQPDQITPVYATTLDNKLL-GVLRSYKQLDPEAGKYFTVHEYWTDKEAQF 187 (469) Q Consensus 112 ~~~~~~~~G~~~~~v~~--d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~-~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~ 187 (469) +..+.+.+|.+|+.+.+ +..|++ .+.+++|..+.+..+... .+. ...+++... ++... ..+....+.+ T Consensus 171 lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g--~~~~~~~~y~~~~--~g~~~----~~~~~~evI~ 242 (563) T protein:vir:99 171 IVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKG--KIIKGGKRFVQVV--DKRVV----ASFTSRELAM 242 (563) T ss_pred HHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCC--ceeccceeEEEEe--CCcee----EEecCcceEE Confidence 67788999999887654 445665 488899999998876532 111 111221111 11111 0111122111 Q ss_pred EEeecCceeecccccccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_010179. 188 FRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDV 267 (469) Q Consensus 188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~ 267 (469) +... |-........|.|-++.+...|.....+..-..+.+... T Consensus 243 ~~~~-------------------------------------~~~d~~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~ng 285 (563) T protein:vir:99 243 GIRN-------------------------------------PRTELSSSGYGLSEVEIAMKEFIAYNNTESFNDRFFSHG 285 (563) T ss_pred Eecc-------------------------------------CCCCcccCcccchHHHHHHHHHHHHHHHHHHHHHHHHcc Confidence 1110 000000012467777766666666655555566666766 Q ss_pred cCceeEE--ecCCccc---chhhhhhhhh--------cceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
268 QTVILVL--TNYGGAS---LKQFMNDLRE--------YKSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIF 332 (469) Q Consensus 268 ~~p~l~~--~g~~~~~---~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~ 332 (469) +.|-.++ .|..... .......+.. .++..+- ..+++|..... ....+.+..+...+.|+ T Consensus 286 ~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl------~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia 359 (563) T protein:vir:99 286 GTTRGILQIRSDQQQSQHALENFKREWKSSLSGINGSWQIPVVM------ADDIKFVNMTPTANDMQFEKWLNYLINIIS 359 (563) T ss_pred CCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccccccccceEEc------CCCceEEeccCChhHHHHHHHHHHHHHHHH Confidence 7775444 3421111 1111222211 1111111 12345544444 34556777788888999 Q ss_pred HHhCCCCcCcc--ccC----CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCC Q lcl|NC_010179. 333 LFGQGIDPANF--ESS----NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRT 403 (469) Q Consensus 333 ~~s~~p~~~~~--~~g----~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~ 403 (469) ..-++|+.-.. .-+ ...|..+.... + .......+..+|.-+++.+...++.+ .. ...+.+.|.+. T Consensus 360 ~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn--~---e~~~~~f~~~tL~P~l~~ie~~ln~~L~~~~-~~~~~~~f~r~ 433 (563) T protein:vir:99 360 ALYGIDPAEIGFPNRGGATGSKGGSTLNEAD--P---GKKQQQSQNKGLQPLLRFIEDLVNRHIISEY-GDKYTFQFVGG 433 (563) T ss_pred HHhCCCHHHccccccccccccccccchhhcc--H---HHHHHHHHHHHHHHHHHHHHHHHHhhhchhc-ccccEEEeccC Confidence 98899874221 111 11111111110 0 01112333444444444444433321 11 12456778776 Q ss_pred CCCCHHHHHHHHHH-HhccCChHHHHHhCCC--CCCHHH--------HHHHH----HHH---HHHhh---------hhHh Q lcl|NC_010179. 404 KVEDSLTKAQIVST-VANYSSKEAVAKANPI--VDDWQQ--------ELKDL----AKD---REEND---------PYAN 456 (469) Q Consensus 404 ~p~d~~e~~~~~~k-l~g~iS~et~~~~l~~--v~d~~~--------E~eri----~~E---~~~~~---------~~~~ 456 (469) -+.+..+..++... .+|+++.-.+.++++. +++-+. -+... ..+ ++... +... 
T Consensus 434 D~~~~~e~~~~~~~~~~G~lT~NE~R~~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (563) T protein:vir:99 434 DTKSATDKLNILKLETQIFKTVNEAREEQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERLQMMMSLLEGDNDD 513 (563) T ss_pred CHHHHHHHHHHHHHhcCCccCHHHHHHHhCCCCCCCcceeecccccccccccccccCCCccccchhhhhcccccCCCCCC Confidence 55555554443221 2588998888777643 221110 00000 000 00000 0011 Q ss_pred hcccCCCCCCCCC Q lcl|NC_010179. 457 QADELNGKGVDDE 469 (469) Q Consensus 457 ~~~~~~~~~~~de 469 (469) ...+....+.+++ T Consensus 514 ~~~~~~~~~~~~~ 526 (563) T protein:vir:99 514 SEEGQSTDSSNDD 526 (563) T ss_pred CCCCCCCCCCCCc Confidence 1111111111111 No 223 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=89.91 E-value=0.024 Score=29.50 Aligned_cols=367 Identities=12% Similarity=0.026 Sum_probs=138.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |.+=.- ........-.-+..+..|.-. .......-.+. +-.-..|+..+.-+- T Consensus 1 m~~f~~---------~~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~Al~~--~~V~~~i~~Ia~~iA 53 (406) T protein:vir:97 1 MSFFQP---------LGTSKVSYDDYISSVLAGDVS----------------QKYLGVSALKN--SDILTATSIIAGDIA 53 (406) T ss_pred Cccccc---------cCCCCCCcchHHHHHhcCCCC----------------cccccchhhcc--HHHHHHHHHHHHhhh Confidence 111100 000000000001111111000 00000000011 111113444444443 Q ss_pred cCCeeeccCch--hhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcC-CCce-EEEEEccceeEEEEeC Q lcl|NC_010179. 
81 SVFPDIDVGKD--ADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDE-DNNF-RYGIIQPDQITPVYAT 150 (469) Q Consensus 81 g~p~~~~~~~~--~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~-~~~~-~i~~~~p~~~~~~~d~ 150 (469) .=|+.....+. .....+..+|.. | .+ +....+...++.+|.+|+++..+. .|.+ .+.+++|..+.+..++ T Consensus 54 ~lp~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~p~~v~v~~~~ 133 (406) T protein:vir:97 54 RFPLVKKDVNGDIIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYRPSETTVEETD 133 (406) T ss_pred hCeeEEEecCccccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEECCCeeEEEEcC Confidence 34554432222 122345566642 3 22 334456778889999999888775 4554 5888899988876654 Q ss_pred CCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccE Q lcl|NC_010179. 151 TLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPF 230 (469) Q Consensus 151 ~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPv 230 (469) . .++. |......+.... .+....+.+++.- | T Consensus 134 ~--~~~~-----y~~~~~~~~~~~----~~~~~evih~r~~-------------------------------------~- 164 (406) T protein:vir:97 134 N--HEIV-----YTFTDMLTAKQV----KCFAHDVIHWKFF-------------------------------------S- 164 (406) T ss_pred C--ceEE-----EEEEecCCceEE----EEccccEEEecCC-------------------------------------C- Confidence 2 1221 111111111111 1122222222100 0 Q ss_pred EEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhh----hhhhh-------cceeeecc Q lcl|NC_010179. 231 IEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFM----NDLRE-------YKSIKINN 299 (469) Q Consensus 231 v~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~----~~~~~-------~~~~~~~~ 299 (469) .+.-.|.|.++.+...++....+..-..+.++..+.|-.++... ....++.. ..+.. 
.+.+.++ T Consensus 165 ---~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~-~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~- 239 (406) T protein:vir:97 165 ---HDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKG-AQLSGDARQRARQEFEKMREGSVGGSPLVFD- 239 (406) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecC-CCCCHHHHHHHHHHHHHHhcccccCceeecC- Confidence 00113666666666555544444443444455545553333221 11111111 11111 1122221 Q ss_pred cCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 377 (469) .+.+|.....+ ...+.+..+...+.|...-++|+......+.-|..+ +.....+..+|.- T Consensus 240 ------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e------------~~~~~f~~~~l~P 301 (406) T protein:vir:97 240 ------STMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVA------------QLMEDYVTNDLPF 301 (406) T ss_pred ------CCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHH------------HHHHHHHHHHHHH Confidence 23445443333 233444455567788887788875442211112111 1111223344444 Q ss_pred HHHHHHHHhccc---CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCC--CCHHH-H-HHH---HH Q lcl|NC_010179. 378 LVRAIMRYLNFS---DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPIV--DDWQQ-E-LKD---LA 445 (469) Q Consensus 378 ~~~~i~~~~~~~---~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~v--~d~~~-E-~er---i~ 445 (469) .++.|...++.+ ..+.....+.|. +..+....++++.++ +|+++.-.+.+.++.- +++.. + +-. +. T Consensus 302 ~~~~ie~~l~~kll~~~~~~~~~i~fd--~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~g~~p~~~~~gD~~~~~~n~~~ 379 (406) T protein:vir:97 302 YFDAITSELGLKTLNDKDRRLYHIEFD--TRSVTGRNVDEIVKLVNNQILTPNQGLVELGKQKSTDPNMDRYQSSLNYVF 379 (406) T ss_pred HHHHHHHHHhhhhcChhhccceeEEEe--cCccchhhHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeEeeccCccc Confidence 444444433321 111223345553 223444555666665 5789999988887532 22110 0 000 00 Q ss_pred HHHHHhhhhHhhcccCCCCCCC---CC Q lcl|NC_010179. 
446 KDREENDPYANQADELNGKGVD---DE 469 (469) Q Consensus 446 ~E~~~~~~~~~~~~~~~~~~~~---de 469 (469) -+..+ .+.........+++.+ |+ T Consensus 380 ~~~~~-~~~~~~~~~~~gg~~~~~~~~ 405 (406) T protein:vir:97 380 LDKKE-EYQDKVGIKGKGGEVNAEEDK 405 (406) T ss_pred hhccc-ccccccccccCCCCCCCCCCC Confidence 00000 0000001111122222 22 No 224 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=89.10 E-value=0.028 Score=29.07 Aligned_cols=352 Identities=12% Similarity=0.054 Sum_probs=142.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+=+-+.. ............-.....++.|.- .. .... +..-+...-....|+..++-+. T Consensus 1 Mg~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~--------------~~--~~v~-~~~al~~~~v~~~i~~ia~~ia 61 (385) T protein:vir:10 1 MGLLTPRNF--NKRKAKNMVYPSNPAFFTTTVGGM--------------QL--SYVS-ALSALQNTNVYSVINRIASDVA 61 (385) T ss_pred Cccccchhc--ccccccccccccchhhhhhhcccc--------------Cc--cccC-HHHhhccHHHHHHHHHHHHHHh Confidence 222110000 000000000000000000000000 00 0000 0000112233345666666666 Q ss_pred cCCeeeccCchhhHHHHHHHHhc-cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCCCc Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD-DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNK 155 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~-n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~ 155 (469) +-|+++. +.. ...++.+ |. ......+..++..+|.+|+++..+. ..+...++..+.+..+.. . 
T Consensus 62 ~~p~~v~--~~~----~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~---~~~~p~~~~~v~~~~~~~---~ 129 (385) T protein:vir:10 62 SAHFKTE--NTA----TLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN---LEHIPNSDVQINYLPGNM---G 129 (385) T ss_pred hCceeee--ccc----hhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc---eeEeecCCceEEEEEcCC---c Confidence 6666653 222 2233432 32 2223456677888999998875432 233334444443332221 1 Q ss_pred eEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC Q lcl|NC_010179. 156 LLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK 235 (469) Q Consensus 156 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 235 (469) .. |......+... ..+.... |+||+. T Consensus 130 ~~-----~~~~~~~~~~~----~~~~~~e---------------------------------------------iihik~ 155 (385) T protein:vir:10 130 IV-----YTVLESNDRPQ----MVLRQDQ---------------------------------------------MLHFRL 155 (385) T ss_pred eE-----EEEEEcCCceE----EEEcccc---------------------------------------------EEEecc Confidence 00 11111111110 0111222 233321 Q ss_pred -------CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh----hhhhhh-------cceeee Q lcl|NC_010179. 236 -------NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF----MNDLRE-------YKSIKI 297 (469) Q Consensus 236 -------~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~----~~~~~~-------~~~~~~ 297 (469) ...|.|.+..+...++....+..-..+.+...+.|-.+++-......++. ...+.. .+++.+ T Consensus 156 ~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl 235 (385) T protein:vir:10 156 MPDPQYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLESAREEFEKANTGDNSGRLMVL 235 (385) T ss_pred CCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCcccc Confidence 12477878777777777666666666666777777666553222211211 111111 112222 Q ss_pred cccCCCCCCcceEEeecCCH--HH-HHHHHHHHHHHHHHHhCCCCcCccc--cCCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
298 NNAGNGDKSGVDKLQIDIPV--EA-RDDALKITRDNIFLFGQGIDPANFE--SSNASGVAIKMLYSHLELKAAKTQTYFE 372 (469) Q Consensus 298 ~~~~~~~~~~~~~l~~~~~~--~~-~~~~~~~l~~~i~~~s~~p~~~~~~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~ 372 (469) + ++++|.....+. .. +.+..+...+.|+..-++|+.-... .++.++..++... ..|. T Consensus 236 ~-------~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~-----------~~~~ 297 (385) T protein:vir:10 236 P-------DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIK-----------ATYL 297 (385) T ss_pred C-------CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHH-----------HHHH Confidence 1 234454433332 22 2355677788899988988753322 2332222222111 1111 Q ss_pred HHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHHH Q lcl|NC_010179. 373 HAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKDR 448 (469) Q Consensus 373 ~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E~ 448 (469) .+|.-.++.|...++.+=.. ..+++.+..-+..|.++.++++.++ +|+++.-++.+.++. +++ ..+.... T Consensus 298 ~~l~P~~~~ie~~l~~~l~~-~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~p~--~~~~~~~--- 371 (385) T protein:vir:10 298 ANLNSYVNPIVDELRLKMNA-PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGFLP--DNLPEFK--- 371 (385) T ss_pred HHHHHHHHHHHHHHHHhhCC-ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCccCC--CCCcccc--- Confidence 22333333333322221111 1355555666778999999999887 578998887776532 221 1111111 Q ss_pred HHhhhhHhhcccC-CCCCCCC Q lcl|NC_010179. 449 EENDPYANQADEL-NGKGVDD 468 (469) Q Consensus 449 ~~~~~~~~~~~~~-~~~~~~d 468 (469) ...... 
+++++|| T Consensus 372 -------~~~~~~~~g~~~dn 385 (385) T protein:vir:10 372 -------PLTTQVKGGDEGDN 385 (385) T ss_pred -------CcccccCCCCCCCC Confidence 111111 2222222 No 225 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=87.45 E-value=0.039 Score=28.33 Aligned_cols=285 Identities=9% Similarity=-0.020 Sum_probs=124.7 Q ss_pred EEEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccc Q lcl|NC_010179. 122 AWLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPY 201 (469) Q Consensus 122 ~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 201 (469) .++++|--.+|...+.-+.+ -+ .+ . +.+|. .+.++... ........ T Consensus 1 v~Eivw~~~~g~~~~~~l~~-------r~---~~--~-~~~f~-~~~~~~l~-------------~~~~~~~~------- 46 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAW-------RP---PR--T-ISRFD-VAPDGGLV-------------AIEQWGVF------- 46 (355) T ss_pred CeEEEEEeeCCeEEEeeeee-------cC---cc--c-eeeee-eccCCcee-------------EEEecCCC------- Confidence 66666654444333221111 10 00 0 11111 11111110 00000000 Q ss_pred cccccccccccccccccccccccCCcccEEEe--cCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCc Q lcl|NC_010179. 
202 NIITSYDLSAGYETGQSNTLKHNFGRVPFIEF--PKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGG 279 (469) Q Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~--~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~ 279 (469) + .+ ...-.+.+.|-..+- ..++.|.|.+..+-...--=+..+..++.-++.+..|+.+.+|..+ T Consensus 47 ----------g---~~-~~~lp~~kfi~~~~~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~ 112 (355) T protein:vir:78 47 ----------G---KA-TVRIPVDRLVVFVNEREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPL 112 (355) T ss_pred ----------C---CC-cceeccCCEEEEEeCCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCC Confidence 0 00 000011122222221 1356788888877776666677888889999999888888777532 Q ss_pred ccc---h------------hhhhhh------hhcceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_010179. 280 ASL---K------------QFMNDL------REYKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGI 338 (469) Q Consensus 280 ~~~---~------------~~~~~~------~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 338 (469) ... + +....+ .....+.++.+ .++++++.......+...++...+.|.+.--+. T Consensus 113 ~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g-----~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGq 187 (355) T protein:vir:78 113 PEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHG-----ANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAH 187 (355) T ss_pred CCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCC-----ceEEEeecCCCcccHHHHHHHHHHHHHHHHhhh Confidence 111 0 001111 11123333333 468888877666667778999999988765554 Q ss_pred CcCcccc---C-CccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHH Q lcl|NC_010179. 339 DPANFES---S-NASGVAIKMLYSHLELKAAKTQTYFEHAIN-ELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQ 413 (469) Q Consensus 339 ~~~~~~~---g-~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~-~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~ 413 (469) .+..++. | ...|..- ..-....++.-.+.+...+. ++++-++.+ |... ......+.|.. 
.+.+....++ T Consensus 188 tlTs~~~~~gGS~Alg~vh---~~v~~~~~~aD~~~i~~~ln~~li~~l~~l-N~~~-~~~~P~~~~~~-~~~~~~~~a~ 261 (355) T protein:vir:78 188 FLTLGGDKSTGSYALGDTF---ASFFTGSLNAVMKHIADVTQQHVVEDLVDQ-NWGP-EEPAPRLVPAQ-LGKEQPVTAE 261 (355) T ss_pred hhccccCCccchhhHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCC-CCCCCEEEecC-cChhHHHHHH Confidence 4443221 2 1223321 12222233344456666674 477766663 3222 22334667754 4556667788 Q ss_pred HHHHH--hcc-CChH----HHHHhCCCCCCHHH--HHHHHHHHHHHhhhhHhhcccCCCC-----CCC--------CC Q lcl|NC_010179. 414 IVSTV--ANY-SSKE----AVAKANPIVDDWQQ--ELKDLAKDREENDPYANQADELNGK-----GVD--------DE 469 (469) Q Consensus 414 ~~~kl--~g~-iS~e----t~~~~l~~v~d~~~--E~eri~~E~~~~~~~~~~~~~~~~~-----~~~--------de 469 (469) .+.++ .|+ ++.+ .+.+.++ ++.+.. +...-.++ ..+.........+. +.. ++ T Consensus 262 ~~~~l~~~G~~~~~~~~~~~~~e~~g-ip~p~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~ 335 (355) T protein:vir:78 262 AIRALVECGAFTADPELEKDLRARYG-LPAPAERDDGADAAAA---KAAGRRRAKRLPGQRQGAALPSRSPRADPPRR 335 (355) T ss_pred HHHHHHhCCCccccHHHHHHHHHHhC-CCCCCCCCcccCCccc---cccccccccccCCccccccccccCCCCCChhh Confidence 88877 354 5643 2345555 332211 11000111 11111111111110 000 00 No 226 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=85.19 E-value=0.055 Score=27.51 Aligned_cols=383 Identities=10% Similarity=0.038 Sum_probs=156.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccc--ccccchh---hhcc-cccccccccC---cceeccchHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDIT--TRNNGKP---KVSK-EGKKDPLRSA---DNRIPSNFYQLL 71 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~--~~~~~~~---~~~~-~~~~~~~~~~---~~ri~~n~~k~i 71 (469) |--|- -+..+.+++..+.+..+.- ......+ .... .......+.. ..-+..+-.... 
T Consensus 1 ~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:10 1 MPDEK--------------KLGLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CCCCc--------------ccchhhhhHhhcCCccccccccccccccCcchhhhhcccccccCcccchhhhhcchHHHHH Confidence 22111 1122333344443321100 0000000 0000 0000000000 000111222234 Q ss_pred HHHHHHhhhcCCeee--ccCc---hhhHHHHHHHHhc--cH-H---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEE Q lcl|NC_010179. 72 VDQEAGYIASVFPDI--DVGK---DADNKKILDVLGD--DR-A---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGII 139 (469) Q Consensus 72 v~~~~~~l~g~p~~~--~~~~---~~~~~~l~~~~~~--n~-~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~ 139 (469) |+..++-+-+-|+.+ ...+ ......+..+|.. |. + +....+..+++.+|.+|+++..+ +|++ .+.++ T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l 145 (432) T protein:vir:10 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYL 145 (432) T ss_pred HHHHHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEE Confidence 555555555556553 1111 1122334555532 32 2 23345677889999999888765 4664 48889 Q ss_pred ccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccc Q lcl|NC_010179. 140 QPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSN 219 (469) Q Consensus 140 ~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (469) +|..+.++.+.. .++. |.....+|.. ..+..+.+.+++.- T Consensus 146 ~~~~v~v~~~~~--g~~~-----y~~~~~~g~~-----~~~~~~~iih~~~~---------------------------- 185 (432) T protein:vir:10 146 ANDRLTITTDTK--GNTA-----YRYRRTDGQM-----IDIPKQQIWKIMGY---------------------------- 185 (432) T ss_pred cCCceEEEEcCC--CcEE-----EEEEecCceE-----EEEcCccEEEecCC---------------------------- Confidence 999998887653 2221 1111112211 01222222222100 Q ss_pred cccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh--------hh Q lcl|NC_010179. 
220 TLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL--------RE 291 (469) Q Consensus 220 ~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~--------~~ 291 (469) | .+.-.|.|-++.+...++.......-..+.+...+.|-.+++...... ++....+ .. T Consensus 186 ---------~----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~-~e~~~~~~~~~~~~~na 251 (432) T protein:vir:10 186 ---------S----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRFLT-DDQYDSFAKKVSGSVEA 251 (432) T ss_pred ---------C----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCCC-HHHHHHHHHHHhhhhhC Confidence 0 011235666655555555444444334455565666766665432211 1221111 11 Q ss_pred cceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CC-ccHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 292 YKSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFES--SN-ASGVAIKMLYSHLELKAAK 366 (469) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~-~Sg~Al~~~~~~l~~k~~~ 366 (469) .+++.++ ++.+|..... ....+.+..+.....|++.-++|+.-.... |+ ..|..++... T Consensus 252 g~~~vl~-------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~--------- 315 (432) T protein:vir:10 252 GRAPLLE-------GGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQ--------- 315 (432) T ss_pred CCceecC-------CCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHH--------- Confidence 2233332 2234444334 334555667778888999888887533221 11 2233332222 Q ss_pred HHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeC--CCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCH Q lcl|NC_010179. 367 TQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWT--RTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDW 437 (469) Q Consensus 367 ~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~--~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~ 437 (469) ...+...|.-.++.|...++.+ ..+.....+.|+ .-+..|.++.++.+.++ +|+++.-.+.++++. 
+++- T Consensus 316 -~~f~~~tl~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~~glppi~g~ 394 (432) T protein:vir:10 316 -LGFLSMTLSPWLRRIEQSIALNLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGN 394 (432) T ss_pred -HHHHHHHHHHHHHHHHHHHHhhhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC Confidence 1222334444444443333221 111223445554 44567899999998886 678999998888753 2221 Q ss_pred HHHH------HHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 438 QQEL------KDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 438 ~~E~------eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ..-+ .-+..-.+...+.........+++.+.+ T Consensus 395 ~~~~~~~~~~~pl~~~~~~~~~~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 395 AAVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred cceEeecCcccchhhhcccCCCCCCCCCCCcccccccC Confidence 1000 0011000000110000111111111111 No 227 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=84.00 E-value=0.064 Score=27.13 Aligned_cols=362 Identities=10% Similarity=0.052 Sum_probs=145.6 Q ss_pred HHHhccCCcccccccchhhhccccccccc----ccCcceeccchHHHHHHHHHHhhhcCCeeecc-C-chh-hHHHHHHH Q lcl|NC_010179. 28 VDYYENKTDITTRNNGKPKVSKEGKKDPL----RSADNRIPSNFYQLLVDQEAGYIASVFPDIDV-G-KDA-DNKKILDV 100 (469) Q Consensus 28 ~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~----~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~-~-~~~-~~~~l~~~ 100 (469) -.+|++...................+... ...-.+++. .-..|+..++-+-+-|+.+-- . +.. 
....+..+ T Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~Al~~~~--V~~cv~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~l 78 (417) T protein:vir:38 1 MKLFRGLATEVDPHWADHLLDSGVIPSFRGGYLGISALRNSD--VLTAVSIVSGDVSRFPLVITDSSTDEVIDLANIEYL 78 (417) T ss_pred CccccccccCCCccchhhhcccccccccCCceechhhcccHH--HHHHHHHHHHhhccCeeEEEEcCCcceeccchHHHH Confidence 11233322111000000000000000000 000012111 112455555555555665421 1 111 12234444 Q ss_pred Hhc--cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCc-eE-EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCce Q lcl|NC_010179. 101 LGD--DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNN-FR-YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGK 172 (469) Q Consensus 101 ~~~--n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~-~~-i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~ 172 (469) +.. |- .+....+...++.+|.+|+.+..+..|. +. +.+++|..+.+..++. .++. |......+.. T Consensus 79 L~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~--~~~~-----y~~~~~~~~~ 151 (417) T protein:vir:38 79 MNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDP--DNII-----YRFTPYNSSM 151 (417) T ss_pred HhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCC--CeEE-----EEEEEcCCcE Confidence 432 32 2223456778899999999998877643 43 6678888887765432 1221 1111111111 Q ss_pred EEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC----CccccccHHHHHH Q lcl|NC_010179. 173 YFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK----NKYRLAELNKYKG 248 (469) Q Consensus 173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n----~~~g~~~~~~v~~ 248 (469) . ..+....+ +||+. .-.|.|.++-+.. T Consensus 152 ~----~~~~~~dv---------------------------------------------iH~r~~~~d~~~G~s~l~~~~~ 182 (417) T protein:vir:38 152 Q----KVCGFEDV---------------------------------------------IHWKFFSYDTIMGRSPLLSLGD 182 (417) T ss_pred E----EEecCcce---------------------------------------------EEecCCCCCCccccCHHHHHHH Confidence 0 11112222 22221 1136676766666 Q ss_pred HHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhh-------hcceeeecccCCCCCCcceEEeecCCH- Q lcl|NC_010179. 
249 LIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLR-------EYKSIKINNAGNGDKSGVDKLQIDIPV- 317 (469) Q Consensus 249 liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~l~~~~~~- 317 (469) .|........-..+.++..+.|-.++.-...... +.....+. ..+.+.++ ++.+|.....+. T Consensus 183 ~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~l~~e~~~~~~~~~~~~~~g~n~g~~~vl~-------~g~~~~~l~~~~~ 255 (417) T protein:vir:38 183 EIGLQESGVSTLQKFFKSGLKGSIIKAKESRLSAEARQKIREDFERAQAGADAGSPIIVD-------ATMDYQPLEVDTN 255 (417) T ss_pred HHHHHHHHHHHHHHHHhccCCCcEEEEeCCCCCHHHHHHHHHHHHHHhcccccCCceecc-------CCceEEEccCCHH Confidence 6665555555555556666677555543221111 11111111 11122221 234555444433 Q ss_pred -HHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCc Q lcl|NC_010179. 318 -EARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADK 393 (469) Q Consensus 318 -~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~ 393 (469) ..+.+..+.....|+..-++|+.-.... .++..++. ....++...|.-+++.|...++.+ .... T Consensus 256 d~q~le~~~~~~~~Ia~~fgVPp~~lg~~--~~~s~~e~----------~~~~~~~~tl~P~~~~ie~~l~~~Ll~~~~~ 323 (417) T protein:vir:38 256 VLNLINSNNYSTAQIAKALRVPAYRLAQN--SPNQSVKQ----------LADDYIRNDLPFYFEPITSEFELKLLDDAQR 323 (417) T ss_pred HHHHHHHHHhhHHHHHHHhCCCHHHhCCC--CcchhHHH----------HHHHHHHHHHHHHHHHHHHHHHhhhcChhhc Confidence 3444555666778888878886543221 12221111 112234445555555555444321 1122 Q ss_pred ccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH----------HHHHHHHHHHHhhhhHhhcc Q lcl|NC_010179. 394 RHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ----------ELKDLAKDREENDPYANQAD 459 (469) Q Consensus 394 ~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~----------E~eri~~E~~~~~~~~~~~~ 459 (469) ....+.|+... .+....++ +.++ +|+++.-++.+++++ +++... -++...+++..... 
....+ T Consensus 324 ~~~~~~fd~~~-l~~~~~~~-~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~~~~n~~~~d~~~~~~~~~~~-~~kgg 400 (417) T protein:vir:38 324 HQYCIGFDTKS-VNGLPIAD-VNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQSTLNTVFLDQKEAYQAEHAA-ELKGG 400 (417) T ss_pred ccceEEechhh-hhHHHHHH-HHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeeeccccccccccccccccccc-ccCCC Confidence 23457775321 12222222 3333 689999998888754 333211 11111111111111 11111 Q ss_pred cCCCCC-----CCCC Q lcl|NC_010179. 460 ELNGKG-----VDDE 469 (469) Q Consensus 460 ~~~~~~-----~~de 469 (469) +.++++ +.|+ T Consensus 401 ~~~~~~~~~~~~~~~ 415 (417) T protein:vir:38 401 DTNAKGNQNGSGTNA 415 (417) T ss_pred CCCCCCCCcCCCCcC Confidence 111111 1111 No 228 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=83.38 E-value=0.069 Score=26.95 Aligned_cols=382 Identities=12% Similarity=0.116 Sum_probs=165.9 Q ss_pred CCHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDAL---KKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~---~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) .+++.. ++|| .+|+.+-.+++-.. =...||+..+- T Consensus 53 ~~~~~~~~~~eLI-----------~~YR~ma~~pEvd~-------------------------------Av~eIvne~iv 90 (511) T protein:vir:56 53 AQSEGTIPVKELI-----------KSYRALAEYHEVDD-------------------------------AIQEIVDEAIV 90 (511) T ss_pred ccccCccchHHHH-----------HHHHHHhhccchhh-------------------------------HHHHhhcceeE Confidence 222211 2232 33444443333221 11122222221 Q ss_pred -hhhcCCeeeccCchhhHHHH--------HHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCCCc-eEEEEEccceeEE Q lcl|NC_010179. 
78 -YIASVFPDIDVGKDADNKKI--------LDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDEDNN-FRYGIIQPDQITP 146 (469) Q Consensus 78 -~l~g~p~~~~~~~~~~~~~l--------~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~-~~i~~~~p~~~~~ 146 (469) =-...|+.+..++.+..+.+ ..+++- +|.....+..+...+.|+-|.+.-+|++.. ..+..+||+.+-+ T Consensus 91 ~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~lDPr~i~~ 170 (511) T protein:vir:56 91 YENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIELRPLNPMKMEL 170 (511) T ss_pred ecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeehhhcCcccchh Confidence 12345666665554433332 223321 455667788999999999999888877644 4588889988766 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) |..--. + ..++...+ . +...+|.....+.. .+.............. -+-. T Consensus 171 vr~i~~--~-----------~~~~~~v~------~-~~~ey~~Y~~~~~~--~~~~~~~~~~~~~~vk--------I~~d 220 (511) T protein:vir:56 171 VREIQK--E-----------TIDGVEVV------K-GTLEYYVYKQSDYK--MPSWMSATNRAQTSFR--------IPKD 220 (511) T ss_pred hhhhhc--c-----------cccccccc------c-ceeeeeEecCCCcc--cCccccccccccccee--------echh Confidence 543111 1 11111111 0 11112221111100 0000000000000000 0000 Q ss_pred cccEEEe------cCCccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh---h Q lcl|NC_010179. 227 RVPFIEF------PKNKYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL---R 290 (469) Q Consensus 227 ~vPvv~~------~n~~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~---~ 290 (469) .|-.++. .|+....|-+. .-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+-...+ . 
T Consensus 221 aI~y~hSGL~d~~~~~g~i~syLh---kAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~ 297 (511) T protein:vir:56 221 AIVFAHSGLMRGCADDPYIIGYLD---RAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNV 297 (511) T ss_pred heeeecccceeccCCCCeeeccch---hhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhc Confidence 0111110 12222333333 33333443 344555555556655433332221111 1111111 0 Q ss_pred hcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--Ccc---- Q lcl|NC_010179. 291 EYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANF---- 343 (469) Q Consensus 291 ~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~---- 343 (469) .++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. ..+ T Consensus 298 kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kKLy~aLnVP~SRl~~e~q~~ 376 (511) T protein:vir:56 298 KNRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIE-DVLYFNRKLYKAMRIPTSRAASEDQTG 376 (511) T ss_pred CceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCCcccccCCCCcc Confidence 1122211111 112233455555444555443 377778888888888842 212 Q ss_pred cc--CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHHH--- Q lcl|NC_010179. 344 ES--SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ--- 413 (469) Q Consensus 344 ~~--g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~~--- 413 (469) ++ |. |..|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...++ T Consensus 377 ~f~~Gr--~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 454 (511) T protein:vir:56 377 GINFGQ--GAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILN 454 (511) T ss_pred cccccc--chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHH Confidence 12 32 2344445555566677788888888888877544433321 1223 34677776544444444333 Q ss_pred ----HHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCC Q lcl|NC_010179. 
414 ----IVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELN 462 (469) Q Consensus 414 ----~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~ 462 (469) +++.+. | .+|.+++.+.+=-.+| .+++-++|++|..+ +...+..+.. T Consensus 455 ~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~--~~~~~~e~~f 511 (511) T protein:vir:56 455 SRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEEETN--PRFQQDDQGF 511 (511) T ss_pred HHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHhhcC--CCCCCcccCC Confidence 344443 3 4799999987633343 34555666666433 3322211111 No 229 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=82.84 E-value=0.073 Score=26.80 Aligned_cols=414 Identities=10% Similarity=0.035 Sum_probs=168.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcC--C Q lcl|NC_010179. 6 LKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASV--F 83 (469) Q Consensus 6 ~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~--p 83 (469) .++.+.+...+.+ |-+....++++++ +-++..-. . .+. .......|+-.+-...-++..++.|.+- | T Consensus 1 mk~~~~~~~~~lk-R~~~e~~w~e~a~--~tlP~~~~--~--~~~----~~~~~~~~~~dstg~~a~~~LAa~l~~~ltp 69 (510) T protein:vir:63 1 MKTTAAMLWEKLR-DGSVEQRAIEFAK--TTLPYLMV--D--PMS----GSRGVVEHDFQSAGALLVNNLAAKLARSLFP 69 (510) T ss_pred ChhHHHHHHHHHh-ccchHHHHHHHHH--hhccccCC--C--CCC----ccccccCCCccchHHHHHHHHHHHHHhhhcC Confidence 4444444433332 3334444444443 11111100 0 000 0011122344556666666666655431 2 Q ss_pred ee-----eccCch---------hhHHHHHH-----------HH-hccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceEEE Q lcl|NC_010179. 84 PD-----IDVGKD---------ADNKKILD-----------VL-GDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYG 137 (469) Q Consensus 84 ~~-----~~~~~~---------~~~~~l~~-----------~~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~ 137 (469) |. +...++ .....++. .+ ..||...+.++.++...+|.+ .+|.++++. 
+++ T Consensus 70 p~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a--~l~~~~~~~-~~~ 146 (510) T protein:vir:63 70 TGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNA--LLYRDSDAA-TVV 146 (510) T ss_pred CCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeE--EEEEcCCCc-EEE Confidence 22 222221 11112222 22 236677788888999999997 456676654 566 Q ss_pred EEccceeEEEEeCCCCCceEEEEEEEEeeecC--------------CceEEEEEEEEcCCeEEEEEeecCceeecccccc Q lcl|NC_010179. 138 IIQPDQITPVYATTLDNKLLGVLRSYKQLDPE--------------AGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNI 203 (469) Q Consensus 138 ~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (469) .++-.++++.-|. .+++...+|.++..... .......+++|+.- +...+.+. T Consensus 147 ~~pl~~y~v~~d~--~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V----~~~~~~~~-------- 212 (510) T protein:vir:63 147 AWSLRSYAVRRDA--TGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHV----QRKKGTAM-------- 212 (510) T ss_pred EEEcceeEEeeCC--CcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEE----EeecCCCc-------- Confidence 6666665544443 45566666655542110 00111122222210 11111100 Q ss_pred cccccccccccccccc-cccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecC Q lcl|NC_010179. 204 ITSYDLSAGYETGQSN-TLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY 277 (469) Q Consensus 204 ~~~~~~~~~~~~~~~~-~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~ 277 (469) ..+......++.... 
....+|..+|++.++ .+.+|.|-.++..+-+..+|.+.-...........|.+.+.- T Consensus 213 -~~~sv~~e~dg~~~~~~~~~~~~e~P~~~~Rw~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p- 290 (510) T protein:vir:63 213 -EYAELYHEIDGVRVGKEGRWPIHLCPYIVPTWNLAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE- 290 (510) T ss_pred -eEEEEEEEecCceeccccccccccCceeeeeeeecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCc- Confidence 000001101111111 112345667877765 345799989999999999998877777776666666544321 Q ss_pred CcccchhhhhhhhhcceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHH Q lcl|NC_010179. 278 GGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKM 355 (469) Q Consensus 278 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~ 355 (469) ++.. ....+...+...+.++ ...++..+. ...+.......++.++..|-..-.. ++..-+....|+..+.. T Consensus 291 ~g~~---~~~~~~~~~~g~~v~g---~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~-~l~~~~~~rvTAtEV~~ 363 (510) T protein:vir:63 291 AKGA---VVDDYQDAEMGDYVPG---GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY-GANQRDAERVTAEEVRI 363 (510) T ss_pred cccc---chhhhccCCCceeecC---CcccceeeecCcccchHHHHHHHHHHHHHHHHHHHh-hcccCCCCCcCHHHHHH Confidence 1111 1111111221111111 122344444 3345666677777777766553221 22222223346654433 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHhcccC---CCcccce---EEeCCCCCCCH-HHHHH----HHH Q lcl|NC_010179. 356 LYSHLELKAAKTQTYFEHAINE--------LVRAIMRYLNFSD---ADKRHIS---QHWTRTKVEDS-LTKAQ----IVS 416 (469) Q Consensus 356 ~~~~l~~k~~~~~~~~~~~l~~--------~~~~i~~~~~~~~---~~~~~i~---i~f~~~~p~d~-~e~~~----~~~ 416 (469) . +.++...++..+.+ +++.++.++...+ .....+. +++...+-+.. .+.+. .+. 
T Consensus 364 r-------~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~Laraq~~~~l~~~~q~l~ 436 (510) T protein:vir:63 364 T-------AEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPALSRSAAVQSMLNASQVIA 436 (510) T ss_pred H-------HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcccceecchhHHHHHHHHHHHHHHHHHHH Confidence 2 23444444444443 2222333332222 1111222 23333332221 11111 111 Q ss_pred HHhc---c---CChHHHHHh----CCCCC-----CHHHHHHHHHHHHHHhh--hh------Hhh----cccCCCC Q lcl|NC_010179. 417 TVAN---Y---SSKEAVAKA----NPIVD-----DWQQELKDLAKDREEND--PY------ANQ----ADELNGK 464 (469) Q Consensus 417 kl~g---~---iS~et~~~~----l~~v~-----d~~~E~eri~~E~~~~~--~~------~~~----~~~~~~~ 464 (469) .+++ + +....++.. ++ |+ -.++|++++.+++.+.. .. .++ .....+- T Consensus 437 ~~~~~aq~~~~id~d~~~~~~a~~~G-v~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~~~g~ 510 (510) T protein:vir:63 437 GLAPIAQLDPRISLPKMMDTIWAAFS-VDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNALAGV 510 (510) T ss_pred HhcCchhhhccCCHHHHHHHHHHHhC-CChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC Confidence 1111 1 223334333 23 31 12455655554422211 11 111 1111111 No 230 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=82.65 E-value=0.075 Score=26.75 Aligned_cols=387 Identities=9% Similarity=0.041 Sum_probs=156.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhccccccccccc-Ccce--eccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRS-ADNR--IPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~-~~~r--i~~n~~k~iv~~~~~ 77 (469) |- +-+.+++......- ...+.. .|...+ ...+........+.....+. ...+ +.+.=.-..|+..+. 
T Consensus 1 ~~-~~l~~~~~~~~~~~---~~~~~~-----~~~~~~-~~~~~~~~~~~~g~~~~~g~~v~~~~al~~~~V~~~i~~ia~ 70 (434) T protein:vir:43 1 MS-KSLGKVLSSATSAP---RSSLFG-----WGGKTI-RLTDGAFWSQFLGRESSSGKKVTVDKAMKLSAVWACVRLIST 70 (434) T ss_pred Cc-cchhhhhhhccccc---chhhhc-----cccccc-ccCchHHHHHHhcCCccCCceechhhhhccHHHHHHHHHHHH Confidence 21 11122222211110 000000 001000 00000000000000000000 0000 111112234555555 Q ss_pred hhhcCCeee-ccC-c----hhhHHHHHHHHhc--cH-H---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEcccee Q lcl|NC_010179. 78 YIASVFPDI-DVG-K----DADNKKILDVLGD--DR-A---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQI 144 (469) Q Consensus 78 ~l~g~p~~~-~~~-~----~~~~~~l~~~~~~--n~-~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~ 144 (469) -+-+-|+.+ ... + ......+..++.. |. + +....+..+++.+|.+|+++..+ .|++ .+.+++|..+ T Consensus 71 ~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~-~G~~~~L~~l~p~~v 149 (434) T protein:vir:43 71 SVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRA-AGRPAALDFLLPSRV 149 (434) T ss_pred hhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeC-CCcEEEEEEEcCcce Confidence 555556653 211 1 1123345555532 32 2 23345677889999999887655 5765 4788999998 Q ss_pred EEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccccc Q lcl|NC_010179. 145 TPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (469) Q Consensus 145 ~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (469) -+..+.+ ..+. |.....+|.. ..+....+.+++.- T Consensus 150 ~~~~~~~--g~~~-----y~~~~~~g~~-----~~~~~~eVih~~~~--------------------------------- 184 (434) T protein:vir:43 150 DLECDEN--GRLK-----YFYTTKKGAR-----REIERTNMLHIPAF--------------------------------- 184 (434) T ss_pred EEEEcCC--CeEE-----EEEEecCceE-----EEEccccEEEecCc--------------------------------- Confidence 8877643 2211 1111112211 11222333322110 Q ss_pred CCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhhh-------cce Q lcl|NC_010179. 
225 FGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLRE-------YKS 294 (469) Q Consensus 225 ~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~~-------~~~ 294 (469) | .+...|.|-++.+...+........-..+.+...+.|-.+++....... +.....+.. .+. T Consensus 185 ----~----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l~~e~~~~~r~~~~~~~g~~nag~~ 256 (434) T protein:vir:43 185 ----T----LDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDRILQPAQREEFREYVKSVSGAMNSGRS 256 (434) T ss_pred ----C----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCCCCCHHHHHHHHHHHHHhcCccccCCc Confidence 0 0112356656555555554444444444445555667555544221111 111111111 112 Q ss_pred eeecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 295 IKINNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLELKAAKTQTY 370 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~~k~~~~~~~ 370 (469) +.++ ++++|.....+ ...+.+..+.....|+..-++|+.-.... ++.++..++... ... T Consensus 257 ~vl~-------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~----------~~f 319 (434) T protein:vir:43 257 PVLE-------QGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQM----------LAF 319 (434) T ss_pred cccC-------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHH----------HHH Confidence 2121 23455444443 34556667777889999889986432211 222233322222 123 Q ss_pred HHHHHHHHHHHHHHHhccc-----CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHH--- Q lcl|NC_010179. 371 FEHAINELVRAIMRYLNFS-----DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQ--- 438 (469) Q Consensus 371 ~~~~l~~~~~~i~~~~~~~-----~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~--- 438 (469) +..+|.-++..|...++.+ +.....+++.++.-+..|.++.++.+.++ +|+++.-++.+.++. 
+++-+ T Consensus 320 ~~~~L~P~~~~ie~~ln~kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~gl~p~~ggD~~~ 399 (434) T protein:vir:43 320 LTFSISSITNQIQQCVNKRLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRKENLPELPGGDILT 399 (434) T ss_pred HHHHHHHHHHHHHHHHHhhcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCeEe Confidence 3444555555554444322 11122344444455667899999998886 689999888877643 22211 Q ss_pred -----HHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 439 -----QELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 439 -----~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .-++.+.+.++.... ........+...=+| T Consensus 400 ~~~n~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 400 VQSNLVPIDQLGQSNKSQAV-RAALMNWFSQPEPQE 434 (434) T ss_pred eccCccchhhhhccCCCcch-hhhhhccCCCCCCCC Confidence 112222211111100 000000000111111 No 231 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=81.72 E-value=0.083 Score=26.50 Aligned_cols=428 Identities=12% Similarity=0.083 Sum_probs=178.4 Q ss_pred CCH----------HHH----HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccc Q lcl|NC_010179. 1 MEL----------DAL----KKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSN 66 (469) Q Consensus 1 ~~~----------~~~----~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n 66 (469) |+- +-+ .+.|+....+...--.+.+.+.+-|.+... ..+.... | .| T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~~~~~h~r~~~~~k~y~~~~~---------------~~~~~~~---r--~n 60 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREPLEKWHTQGKEIVKRYRDERD---------------SAHDAET---R--WN 60 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhccchHHHHHHHHHHHhhcccc---------------CCCcccc---c--cc Confidence 322 111 111221111112222334444455554321 1111111 1 23 Q ss_pred hHHHHHHHHHHhhhcCCeeecc------CchhhH----HHHHHHHh------cc-HHHHHHHHHHHHHhCCeEEEEEEEc Q lcl|NC_010179. 
67 FYQLLVDQEAGYIASVFPDIDV------GKDADN----KKILDVLG------DD-RALTLNSLLVDSSNAGRAWLHYWID 129 (469) Q Consensus 67 ~~k~iv~~~~~~l~g~p~~~~~------~~~~~~----~~l~~~~~------~n-~~~~~~~~~~~~~~~G~~~~~v~~d 129 (469) +.=--|..+.-=+.+.+|..++ -++... +.+...+. ++ +...+....++++.+|++.+.|.+- T Consensus 61 l~~sni~~i~P~iYar~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~v~Ye 140 (663) T protein:vir:34 61 LFSTNIQTQMASLYGQTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCRIRYE 140 (663) T ss_pred hhhhhHHHHhhhhhcCCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEEEEee Confidence 3222234444445566554432 232233 33333331 12 2233455677889999988877652 Q ss_pred C--------------CC-----------------ceEEEEEccceeEEEEeCCCCCc-e--EEEEEEEEe---------- Q lcl|NC_010179. 130 E--------------DN-----------------NFRYGIIQPDQITPVYATTLDNK-L--LGVLRSYKQ---------- 165 (469) Q Consensus 130 ~--------------~~-----------------~~~i~~~~p~~~~~~~d~~~~~~-~--~~~v~~~~~---------- 165 (469) . .+ .++|..+.-..+ ++++...++ . ++...+.+. T Consensus 141 ~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~df--l~~pAr~W~ev~wva~r~~mtk~e~~~rf~~~ 218 (663) T protein:vir:34 141 VEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDV--LWSPARVWHEVRWLAFRNLLDMREFNARFDAD 218 (663) T ss_pred cccchhccccccCCCccccchhcccccchhhcccceeeeeechhhc--ccchhhccccccceeeeccCCHHHHHHhhcCC Confidence 1 10 122222222221 112211111 0 111100000 Q ss_pred ---------e----ec---CC-----ceEEEEEEEEcCCeEEEEEe-ecCceeecccccccccccccccccccccccccc Q lcl|NC_010179. 166 ---------L----DP---EA-----GKYFTVHEYWTDKEAQFFRT-SATDSTVIEPYNIITSYDLSAGYETGQSNTLKH 223 (469) Q Consensus 166 ---------~----~~---~~-----~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (469) . .. +| ....-..|+|++..-..|.. ++.. ..+. ..++..... 
T Consensus 219 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEIWdK~~~~V~w~~eg~~-~~L~---------------~~~p~lgl~ 282 (663) T protein:vir:34 219 GSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEIWDKGGRKVDWYVEGYS-AVLD---------------TQPDPLGLE 282 (663) T ss_pred hhhhhhhhccCcCCccccCCCCCcchhcCcceeEEEecCCcEEEEEEcCcc-eecc---------------cCCCCCCCC Confidence 0 00 00 01223445555544433222 2111 1110 011111111 Q ss_pred cCCcccEEEecC----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhhhcceeeecc Q lcl|NC_010179. 224 NFGRVPFIEFPK----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLREYKSIKINN 299 (469) Q Consensus 224 ~~g~vPvv~~~n----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 299 (469) +|--||...+++ +-...|+|.-.+.+++++|.+-..+ |.+.+.-.|-.+..+-.+++.+.....-..+.++.++. T Consensus 283 ~ffPcPrpl~~~~~~ds~ipvpd~~~y~~~~~E~n~~t~Ri-n~l~d~ikv~gvy~~~~g~~i~~~l~~a~~n~lvpV~~ 361 (663) T protein:vir:34 283 SFFPCPKPLLANWTTDKVVPRPDFVLAQDLYKEIDLVSTRI-TLLERAIRVVGVYDKSSGLTIGRLLSEAAQNDLIPVEN 361 (663) T ss_pred CCCCCcccccceecCCCeecCCcHHHHHHHHHHHHHHHHHH-HHHHhhhhhceeeccccchhHHHHHHHhhCCCceecch Confidence 222355555443 2346789999999999999864443 44443333333322112223333333334455555544 Q ss_pred cCC-CCCCc----ceEEeecCCH---HHHHHHHHHHHHHHHHHhCCCCcCcccc-CCccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGN-GDKSG----VDKLQIDIPV---EARDDALKITRDNIFLFGQGIDPANFES-SNASGVAIKMLYSHLELKAAKTQTY 370 (469) Q Consensus 300 ~~~-~~~~~----~~~l~~~~~~---~~~~~~~~~l~~~i~~~s~~p~~~~~~~-g~~Sg~Al~~~~~~l~~k~~~~~~~ 370 (469) .+. ++.++ +.++-.+.-. .++-..-..++.++|.+|++-+..=... .+-++.|-..+-+.+-.++.+++.. 
T Consensus 362 ~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rga~~a~ETatAQ~IKsq~gS~RIqe~qde 441 (663) T protein:vir:34 362 WLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRGASDPRETAMAQGVKAKFGSIRLQRLQDE 441 (663) T ss_pred hhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhcccCcchhhHHHHHHHHHHhHHHHHHHHH Confidence 322 11222 3443222222 2333445667889999998765422221 2335556666677788899999999 Q ss_pred HHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHHhcc-CCh-HHHHH-hCCCCCCHHHHHHHHHHH Q lcl|NC_010179. 371 FEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTVANY-SSK-EAVAK-ANPIVDDWQQELKDLAKD 447 (469) Q Consensus 371 ~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~g~-iS~-et~~~-~l~~v~d~~~E~eri~~E 447 (469) .......+.++...++.- .+....+.-.-.-.+|. ..+.......|..- ++. ...++ -.....|..+|.+.+.+- T Consensus 442 vqR~arDi~ql~AEIl~~-~~~~etl~~m~~~elp~-~~ei~~~~~~L~n~~~r~~~ldIe~dsT~~~D~~~eK~~~~E~ 519 (663) T protein:vir:34 442 VARFASDIQRLKAEVIAE-HYDVASILAQANAEFTF-DKELAPKAAELIKSRFSMYRVEVKPEAVSLQDFAALRNEKMEV 519 (663) T ss_pred HHHHHHHHHHHHHHHHHH-hcCHHHHHHHhcCCCCc-ccchhHHHHHHhcCCCcceeeeeccCCCCcCChHHHHHHHHHH Confidence 999999999998887652 22211111111122332 22233333333211 100 00000 012334555566666655 Q ss_pred HHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
448 REENDPYANQADELNGKGVDDE 469 (469) Q Consensus 448 ~~~~~~~~~~~~~~~~~~~~de 469 (469) ...-.+.+++...+-+.+-..= T Consensus 520 l~~i~~~~qq~~pl~~q~p~~~ 541 (663) T protein:vir:34 520 LSGIASFMQGVAPLAQQVPGSA 541 (663) T ss_pred HHHHHHHHHHHHHHHHhhhhhH Confidence 5555566655544322222111 No 232 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=81.68 E-value=0.083 Score=26.50 Aligned_cols=381 Identities=9% Similarity=0.027 Sum_probs=154.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcc--cccccchh---hhcc-cccccccccCcc---eeccchHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDI--TTRNNGKP---KVSK-EGKKDPLRSADN---RIPSNFYQLL 71 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i--~~~~~~~~---~~~~-~~~~~~~~~~~~---ri~~n~~k~i 71 (469) |--|-+- ....+++..+.+..+. .......+ .... .......+...+ -+...=.-.. T Consensus 1 ~~~~~~~--------------g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~v~~~~a~~~~aV~~~ 66 (432) T protein:vir:97 1 MPDEKKL--------------GLLGQLKAMFVPPDPVDIGGGQTFTPVNATARDLGIIISDTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CCCcccC--------------chhhhhHhhcCCccccccccccccccCchhhhhhcccccccCcccchHhhhcchHHHHH Confidence 2222111 1122233333321110 00000000 0000 000000000000 0011111223 Q ss_pred HHHHHHhhhcCCeee-c-cCc---hhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEE Q lcl|NC_010179. 72 VDQEAGYIASVFPDI-D-VGK---DADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGII 139 (469) Q Consensus 72 v~~~~~~l~g~p~~~-~-~~~---~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~ 139 (469) |+..++-+-+-|+.+ . ..+ ......+..+|.. 
| .+ +....+...++.+|.+|+++..+ +|++ .+.++ T Consensus 67 v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~-~g~~~~L~~l 145 (432) T protein:vir:97 67 VKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVT-DGRIESLQYL 145 (432) T ss_pred HHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEec-CCcEEEEEEE Confidence 444444444455543 1 111 1122334555532 3 22 23345677889999999888776 4665 47889 Q ss_pred ccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccc Q lcl|NC_010179. 140 QPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSN 219 (469) Q Consensus 140 ~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 219 (469) +|..+.++.+... ++. |.....+|.. ..+..+.+.+++.- T Consensus 146 ~p~~v~v~~~~~g--~~~-----y~~~~~~g~~-----~~~~~~~iih~r~~---------------------------- 185 (432) T protein:vir:97 146 ANDRLTITTDTKG--NTA-----YRYRRTDGQM-----IDIPRQQIWKIMGY---------------------------- 185 (432) T ss_pred cCcceEEEEcCCC--cEE-----EEEEecCceE-----EEEccccEEEecCc---------------------------- Confidence 9999988876532 221 2222222211 11222223222110 Q ss_pred cccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhhh--------h Q lcl|NC_010179. 220 TLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDLR--------E 291 (469) Q Consensus 220 ~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~~--------~ 291 (469) ++ +.-.|.|-++.+...++..........+.+...+.|-.+++-... ..++....++ . 
T Consensus 186 ---------~~----dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~-l~~e~~~~~~~~~~~~~na 251 (432) T protein:vir:97 186 ---------SL----DGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF-LTDDQYDSFSKKVSGSVEA 251 (432) T ss_pred ---------CC----CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCC-CCHHHHHHHHHHHhhhhcC Confidence 00 012366666655555555444444445555666677555553222 1122222221 1 Q ss_pred cceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc--cCC-ccHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 292 YKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANFE--SSN-ASGVAIKMLYSHLELKAAKTQ 368 (469) Q Consensus 292 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~--~g~-~Sg~Al~~~~~~l~~k~~~~~ 368 (469) .+++.++. +.+.+.++.+.....+.+..+.....|+..-++|+.-... .++ ..|..++... . T Consensus 252 g~~~vl~~-----g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~----------~ 316 (432) T protein:vir:97 252 GRAPLLEG-----GMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQ----------L 316 (432) T ss_pred CCceecCC-----CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHH----------H Confidence 22333321 1233333333334455566777788899888888743321 121 1122222221 1 Q ss_pred HHHHHHHHHHHHHHHHHhccc---CCCcccceEEe--CCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHH Q lcl|NC_010179. 369 TYFEHAINELVRAIMRYLNFS---DADKRHISQHW--TRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQ 439 (469) Q Consensus 369 ~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f--~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~ 439 (469) ..+..+|.-.++.|...++.+ ..+.....++| ..-+-.|.++.++.+.++ +|+++.-.+.++++. +++- . T Consensus 317 ~f~~~tl~P~~~~ie~~ln~kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~~glpp~~g~-~ 395 (432) T protein:vir:97 317 GFLTMTLSPWLRRIEQSIALNLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIEGLPKLGGN-A 395 (432) T ss_pred HHHHHHHHHHHHHHHHHHhhhccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCC-c Confidence 223334444444444433321 11122234455 445667899999999887 678999888887653 2211 1 Q ss_pred HHHHH-----HHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
440 ELKDL-----AKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 440 E~eri-----~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) .+-.+ --+........ ++..+.+.+++ T Consensus 396 ~~~~~~~~~~pl~~~~~~~~~---~~~~~~~~~~~ 427 (432) T protein:vir:97 396 AVLTVQSAMVPLDSIGLQASP---EPASGLGNQQQ 427 (432) T ss_pred ceEeecccccchhhhcccCCC---CCCCCCCCccc Confidence 10000 00000000000 01111111111 No 233 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=80.75 E-value=0.092 Score=26.26 Aligned_cols=331 Identities=12% Similarity=0.028 Sum_probs=132.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccch--HHHHHHHHHHh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNF--YQLLVDQEAGY 78 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~--~k~iv~~~~~~ 78 (469) |-+=. .|. ++ ..+. ..+........+--......+.+.++.. .-..|+..++- T Consensus 1 M~~~~------~f~----~r----------~~~~-----~~~~~~~~~~~~~~~~~~~v~~~~al~~~av~~cv~~ia~~ 55 (359) T protein:vir:10 1 MSILN------PFE----RR----------SSIT-----PNNYYPFMVQNGSIVPNSLVDATEALKNSDLYAVTSLISSD 55 (359) T ss_pred Ccccc------hhh----cc----------ccCC-----CCcchhhhhccccccCCcccCHHHhhcchHHHHHHHHHHHh Confidence 22211 000 00 0000 0000000000000000000000111111 11234444444 Q ss_pred hhcCCeeeccCchhhHHHHHHHHhc-c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCC Q lcl|NC_010179. 79 IASVFPDIDVGKDADNKKILDVLGD-D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTL 152 (469) Q Consensus 79 l~g~p~~~~~~~~~~~~~l~~~~~~-n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~ 152 (469) +-+-|.. 
+ ...+..++.+ | .+ +-...+....+.+|.+|+++-.+.+|.+ .+.+++|..+.+..+++ T Consensus 56 ia~~p~~---~----~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~~~~~- 127 (359) T protein:vir:10 56 IAGTRFI---G----NQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITIDLTDD- 127 (359) T ss_pred hhcCccc---c----chHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEEEcCC- Confidence 4444442 1 1223334433 3 22 2233456677889999999888888885 47888888887765532 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) .+. |......+... ..+....+.+++.-.... +++ T Consensus 128 --~~~-----y~~~~~~~~~~----~~~~~~evih~~~~~~~~---------------------------~~~------- 162 (359) T protein:vir:10 128 --TLT-----YEVNQFDDYPS----AKYNASEMIHVKIMAYGV---------------------------DTL------- 162 (359) T ss_pred --eEE-----EEEEecCCceE----EEEcccceEEeccCCCCC---------------------------Ccc------- Confidence 111 11111111111 112233333332110000 000 Q ss_pred ecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchh----hhhhhhh-------cceeeecccC Q lcl|NC_010179. 233 FPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQ----FMNDLRE-------YKSIKINNAG 301 (469) Q Consensus 233 ~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~----~~~~~~~-------~~~~~~~~~~ 301 (469) +.-.|.|-++.+...+.....+..-..+.++..+.|-.+++-..+...++ ....+.. 
.+++.++ T Consensus 163 --dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~~~~~~~~~~~~n~g~~~vl~--- 237 (359) T protein:vir:10 163 --HNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSIRKEFEKANGGNNSGRVMVLD--- 237 (359) T ss_pred --CccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHHHHHHHHHhCccccCCceecC--- Confidence 01236676777777776666666666666666777766655322111121 1112111 1122222 Q ss_pred CCCCCcceEEeecCCH--HHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 302 NGDKSGVDKLQIDIPV--EARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLELKAAKTQTYFEHAINE 377 (469) Q Consensus 302 ~~~~~~~~~l~~~~~~--~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~ 377 (469) ++.+|.....+. ..+.+..+...+.|+..-++|+.-..+. .+.+...++..+......+ ...+...|++ T Consensus 238 ----~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~---l~p~~~~l~~ 310 (359) T protein:vir:10 238 ----QSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRF---IEPLISELRI 310 (359) T ss_pred ----CCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHH---HHHHHHHHHH Confidence 234444333432 3445666777888988888987544322 2234433433332221111 0111111111 Q ss_pred HHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCC---CC Q lcl|NC_010179. 378 LVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANP---IV 434 (469) Q Consensus 378 ~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~---~v 434 (469) .+ ..-+ .+......-.|.......+.++ +|+++.-++.+.++ .. 
T Consensus 311 ~l---~~~~----------~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 311 KC---DSSI----------GVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred Hh---hhhh----------cccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCC Confidence 11 0000 0111111111223333344444 67899888887763 33 No 234 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=79.32 E-value=0.11 Score=25.94 Aligned_cols=383 Identities=12% Similarity=0.119 Sum_probs=165.6 Q ss_pred CCHHHH-HHHHHHH------HHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHH Q lcl|NC_010179. 1 MELDAL-KKLIRNT------STSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~~~~~-~~~i~~~------~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~ 73 (469) |+-... ...+..+ +......+.+|+.+..+++-. +=...||+ T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~n~~eLI~~YR~ma~~pEvd-------------------------------~Av~eIvn 95 (521) T protein:vir:10 47 IDTTAPKTAIVQSVLGYAPKIQNTKDLINQYRSLSKYHEVD-------------------------------NAIDEIIN 95 (521) T ss_pred CCccccccchhhhhhccccccchHHHHHHHHHHHhhccchh-------------------------------hHHHhhhc Confidence 111100 0000010 011112233444443333322 11222333 Q ss_pred HHHHh-hhcCCeeeccCchhhHHHHHHHHhc---------cHHHHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEE Q lcl|NC_010179. 
74 QEAGY-IASVFPDIDVGKDADNKKILDVLGD---------DRALTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGII 139 (469) Q Consensus 74 ~~~~~-l~g~p~~~~~~~~~~~~~l~~~~~~---------n~~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~ 139 (469) ..+-+ -...|+.+..++.+..+.+++-..+ +|.....+..+.+.+.|+-|.+.-+|.+ |-..+..+ T Consensus 96 eaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~l 175 (521) T protein:vir:10 96 DAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRLL 175 (521) T ss_pred ceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCCCccccceeeeee Confidence 22222 2456677666554444433322221 4455677888999999999998877643 55668888 Q ss_pred ccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEE---EEEEEcCCeEEEEEeecCceeeccccccccccccccccccc Q lcl|NC_010179. 140 QPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFT---VHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETG 216 (469) Q Consensus 140 ~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 216 (469) ||+.+-.+.- ....+..+..... .+-+|.+....+|-. ... T Consensus 176 DPr~i~~vr~-------------i~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~-------------------~g~---- 219 (521) T protein:vir:10 176 DPRNVEYYRV-------------NLKSNENGNDVYKGVKEFFTYGATEDNRYNI-------------------SGN---- 219 (521) T ss_pred CCcceeeeee-------------ecCCCCCcchhhccceeeeeeccCCCceecC-------------------CCC---- Confidence 8887654432 1111111111110 001121111111100 000 Q ss_pred ccccccccCCcccE--EEec-------CCccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc--- Q lcl|NC_010179. 217 QSNTLKHNFGRVPF--IEFP-------KNKYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL--- 282 (469) Q Consensus 217 ~~~~~~~~~g~vPv--v~~~-------n~~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~--- 282 (469) +...-+||. |.|. |.....|-+ ..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. 
T Consensus 220 -----~~~~vkI~~daI~y~hSGL~d~~~~~i~syL---hkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~ 291 (521) T protein:vir:10 220 -----SNNLVQIPIDAIVYSHSGKVDIDGKTIVGYL---HNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNK 291 (521) T ss_pred -----CCcceeechhheeeecccceeCCCCceeccc---hhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCch Confidence 000001211 1111 112223333 333333443 344555555555655433332221111 Q ss_pred --hhhhhhh---hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhC Q lcl|NC_010179. 283 --KQFMNDL---REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQ 336 (469) Q Consensus 283 --~~~~~~~---~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~ 336 (469) .+-...+ ..++++.=... +.+.+-.+..|....++.... -++-+.+-+|+... T Consensus 292 KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~-DV~YF~kkLy~aLn 370 (521) T protein:vir:10 292 KATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMD-DVRWFNRKLYESMK 370 (521) T ss_pred hHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhC Confidence 1111111 01122111111 122233455555444555443 36777888888888 Q ss_pred CCC--cCccc----cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCC Q lcl|NC_010179. 337 GID--PANFE----SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKV 405 (469) Q Consensus 337 ~p~--~~~~~----~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p 405 (469) +|- +..++ +|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-. 
T Consensus 371 VP~sRl~~e~~~f~~Gr~~--EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~ 448 (521) T protein:vir:10 371 IPLSRLPQEGAGVTFGAGN--DITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSY 448 (521) T ss_pred CCccccCCCCCceeccccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecch Confidence 884 22222 34322 24445555566677788888888888877544433321 1223 346777765444 Q ss_pred CCHHHHH-------HHHHHHhc------cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCC-CC Q lcl|NC_010179. 406 EDSLTKA-------QIVSTVAN------YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVD-DE 469 (469) Q Consensus 406 ~d~~e~~-------~~~~kl~g------~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~-de 469 (469) -.+...+ ++++.+.+ .+|.+++.+.+=-.+| .+++-++|++|..+. .. ++...+ |+ T Consensus 449 f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~--~~------~~p~~e~~d 520 (521) T protein:vir:10 449 YEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDGELKDS--VY------KNPEDPMEE 520 (521) T ss_pred HHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCC--CC------CCCcchhhc Confidence 3344333 34444433 5999999988643343 455666666665432 11 011111 11 No 235 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=78.98 E-value=0.11 Score=25.86 Aligned_cols=379 Identities=12% Similarity=0.110 Sum_probs=164.6 Q ss_pred CC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 ME----LDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~----~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) .+ +....+| +.+|+.+-.+++-.. 
=...||+..+ T Consensus 58 ~d~~~~~~~~~~L-----------I~~YR~ma~~pEvd~-------------------------------Av~eIvneai 95 (516) T protein:vir:10 58 FGIDNNISGTKDL-----------INTYRQLTNNPEVER-------------------------------AVANIVNEAV 95 (516) T ss_pred ecccCccccHHHH-----------HHHHHHhhhccchhH-------------------------------HHHHhhccee Confidence 11 1122222 334444444444322 1122232222 Q ss_pred H-hhhcCCeeeccCchhhHHHHHHHHhc---------cHHHHHHHHHHHHHhCCeEEEEEEEcC--CCceEEEEEcccee Q lcl|NC_010179. 77 G-YIASVFPDIDVGKDADNKKILDVLGD---------DRALTLNSLLVDSSNAGRAWLHYWIDE--DNNFRYGIIQPDQI 144 (469) Q Consensus 77 ~-~l~g~p~~~~~~~~~~~~~l~~~~~~---------n~~~~~~~~~~~~~~~G~~~~~v~~d~--~~~~~i~~~~p~~~ 144 (469) - =-...|+.+..++.+..+.+++-..+ +|.....+..+...+.|+-|.+..+|. +|-..+..+||+.+ T Consensus 96 v~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~elr~lDPr~i 175 (516) T protein:vir:10 96 VYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFHKIMPNPKEGIVELRRLDPRHV 175 (516) T ss_pred EecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEEEEecCcccceeeeeeeCCcce Confidence 1 23346666666555444443332222 445567788899999999999877752 35566888888887 Q ss_pred EEEEeCCCCCceEEEEEEEEeeecCCceEEE---EEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccc Q lcl|NC_010179. 145 TPVYATTLDNKLLGVLRSYKQLDPEAGKYFT---VHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTL 221 (469) Q Consensus 145 ~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (469) ..+.- -...+.++..... .+-+|+.....+ ...+..+ ..+ T Consensus 176 ~~vR~-------------i~~~~~~~~~v~~~~~e~~~Y~~~~~~~-~~~g~~~------------~~~----------- 218 (516) T protein:vir:10 176 EYYRE-------------IVTSDVGGTSVVKGYREFFVYTTGNEGY-AYNGRLF------------EPN----------- 218 (516) T ss_pred eeEEe-------------eecccCcchhhhhceeeeeeeecCccce-ecccccc------------CCC----------- Confidence 66432 2222222221111 011122111111 0000000 000 Q ss_pred cccCCcccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhh Q lcl|NC_010179. 
222 KHNFGRVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMND 288 (469) Q Consensus 222 ~~~~g~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~ 288 (469) .--+||- |.|... ..+...+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+-... T Consensus 219 --~~ikI~~daI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~ 296 (516) T protein:vir:10 219 --TRIKIPRSAIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERRVFYIDVGNMPNRKATEYVNG 296 (516) T ss_pred --CceecchhheeeeecCcccCCCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHH Confidence 0001111 111100 0011112223344444444 344555555666666433332221111 111111 Q ss_pred h---hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--Cc Q lcl|NC_010179. 289 L---REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--AN 342 (469) Q Consensus 289 ~---~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~ 342 (469) + -.++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. .. T Consensus 297 iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~SRl~~ 375 (516) T protein:vir:10 297 IMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMD-DVRWFNKKLYEALRIPLSRMPR 375 (516) T ss_pred HHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHH-HHHHHHHHHHHHhCCCcccccC Confidence 1 01111111111 122233455555444555443 377778888888888842 22 Q ss_pred cccCCc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-cCCCc----ccceEEeCCCCCCCHHHHH-- Q lcl|NC_010179. 
343 FESSNA---SGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNF-SDADK----RHISQHWTRTKVEDSLTKA-- 412 (469) Q Consensus 343 ~~~g~~---Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~-~~~~~----~~i~i~f~~~~p~d~~e~~-- 412 (469) ++.+++ -|..|..-.......+.+.+..|..-+.++|+.=+-+-++ +..+| ..|.+.|...-.-.+...+ T Consensus 376 e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Ei 455 (516) T protein:vir:10 376 DDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIET 455 (516) T ss_pred CCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHH Confidence 221111 2233333444445556677777777777776643333222 11233 2467777654444344333 Q ss_pred -----HHHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 413 -----QIVSTV---AN-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 413 -----~~~~kl---~g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++++.+ .| .+|.+++.+.+=-.+| .++|-++|++|..+. .. .+.+.+++ T Consensus 456 l~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~~~~--~~------~~p~~e~~ 515 (516) T protein:vir:10 456 LRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQIEKEANVK--RF------QNPENEDD 515 (516) T ss_pred HHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCC--CC------CCCCcccc Confidence 334444 23 6999999988643343 445566666665431 11 11111222 No 236 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=78.49 E-value=0.11 Score=25.76 Aligned_cols=272 Identities=7% Similarity=-0.011 Sum_probs=98.0 Q ss_pred ccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeeeccCchh-----------------hH Q lcl|NC_010179. 32 ENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDIDVGKDA-----------------DN 94 (469) Q Consensus 32 ~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~~-----------------~~ 94 (469) ..++.-........ .+ ......|.||.|..+....+. .. 
T Consensus 1 m~~~~~~~~~~~~~------------~~------------~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~pp~~~ 56 (340) T protein:vir:98 1 MSKRKPRKAVAMTA------------SA------------PQKMEAFTFGEPVPVLDKRDILDYVECISNGKWYEPPVSF 56 (340) T ss_pred CCCCCCCccccccc------------cC------------ccceeEEEcCCceeecCcchhhhhhhhhhcCceecCCCCH Confidence 11111000000000 00 000112333433222110000 00 Q ss_pred HHHHH---------------------HHhccH-H--HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEe Q lcl|NC_010179. 95 KKILD---------------------VLGDDR-A--LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYA 149 (469) Q Consensus 95 ~~l~~---------------------~~~~n~-~--~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d 149 (469) ..|.+ .+.-|- + ..+..++.+.+.+|.+|+.+-.+..|++ .+..++|..+-+..+ T Consensus 57 ~~la~l~~a~~~h~s~i~~k~n~l~~~~~Pn~~lt~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~~ 136 (340) T protein:vir:98 57 SGLAKSLRSAVHHSSPIYVKRNVLASTYIPHPLLSRQDFSRFALDYLVFGNAFLEQRHSVTGQLIKLLTSPAKYTRRGVD 136 (340) T ss_pred HHHHHHHHhccccchhhhhhhhHHhhccCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceEEEccc Confidence 00111 111121 1 2244566778889999999988888875 366666655433211 Q ss_pred CCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCccc Q lcl|NC_010179. 150 TTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVP 229 (469) Q Consensus 150 ~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vP 229 (469) . . ++|.... .+.. ..+..+.+ T Consensus 137 ~----~-----~~~~~~~-~~~~-----~~~~~~eV-------------------------------------------- 157 (340) T protein:vir:98 137 D----S-----VFWFVEN-FTQP-----HEFAPDTV-------------------------------------------- 157 (340) T ss_pred C----c-----EEEEEec-CCeE-----EEEccccE-------------------------------------------- Confidence 1 1 1111110 1100 01122222 Q ss_pred EEEecC-----CccccccHHHHHHHHHHHHHHHHHH-HHHHHHhcCceeE--EecCCcc--cchhhhhhhhh-------c Q lcl|NC_010179. 
230 FIEFPK-----NKYRLAELNKYKGLIDAYDDIYNGF-INDLDDVQTVILV--LTNYGGA--SLKQFMNDLRE-------Y 292 (469) Q Consensus 230 vv~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~-~~~~~~~~~p~l~--~~g~~~~--~~~~~~~~~~~-------~ 292 (469) +++++ .-.|.|.+.....-++. +.....+ ...++..+.|-.+ ++|.... ..+.....++. . T Consensus 158 -iHir~~~~~~~~~Gls~~~~a~~si~l-~~aa~~~~~~~f~NGa~pg~il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~ 235 (340) T protein:vir:98 158 -FHLLEPDINQEIYGLPEYLSALNSAWL-NESATLFRRKYYQNGAHAGYIMYVTDPAQSATDVESLRDAMRNSKGLGNFK 235 (340) T ss_pred -EEEcCCCCCCCcccccHHHHHHHHHHH-HHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccC Confidence 33321 12356666554443332 2222222 2333444455443 4442211 11222222221 1 Q ss_pred ceeeecccCCCCCCcceEEe--ecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc----c---cCCccHHHHHHHHHHHHHH Q lcl|NC_010179. 293 KSIKINNAGNGDKSGVDKLQ--IDIPVEARDDALKITRDNIFLFGQGIDPANF----E---SSNASGVAIKMLYSHLELK 363 (469) Q Consensus 293 ~~~~~~~~~~~~~~~~~~l~--~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Al~~~~~~l~~k 363 (469) +++.+.+++.. .++++.- .......+.+..+....+|+..-++|+.-.. + +|++...++.+ T Consensus 236 ~~~vl~~~g~~--~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f-------- 305 (340) T protein:vir:98 236 NLFFYSPNGKP--DGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQLMGGKPENIGSLGDVEKVAKVF-------- 305 (340) T ss_pred ceeEecCCCCc--cceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHH-------- Confidence 23333433332 3445543 3333456777778888899999899874221 1 12222222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHH Q lcl|NC_010179. 364 AAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSL 409 (469) Q Consensus 364 ~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~ 409 (469) +...|.-+++.+.++.+.-+.+ -+.|.+....+.. 
T Consensus 306 -------~~~~l~Pl~~~iee~n~~L~~e----~~rF~~~~l~~~d 340 (340) T protein:vir:98 306 -------VRNELSPLQDRFREVNDWLGME----VIRFKEYTLDNPE 340 (340) T ss_pred -------HHHHHHHHHHHHHHHHhccccc----ccccCccccccCC Confidence 1122222222222211100101 1345433221111 No 237 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=76.93 E-value=0.13 Score=25.44 Aligned_cols=388 Identities=13% Similarity=0.119 Sum_probs=167.9 Q ss_pred CCHH-----------HH-HHHHHHH---HHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceecc Q lcl|NC_010179. 1 MELD-----------AL-KKLIRNT---STSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPS 65 (469) Q Consensus 1 ~~~~-----------~~-~~~i~~~---~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~ 65 (469) -+++ -+ .++.-.. +......+.+|+.+..+++-. T Consensus 41 ~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd------------------------------- 89 (523) T protein:vir:68 41 KEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYRNLMTNYEVD------------------------------- 89 (523) T ss_pred eeeeccccccccccchhhhhhhhccccccchHHHHHHHHHHHhhccchh------------------------------- Confidence 0000 00 0000000 011122233344433333322 Q ss_pred chHHHHHHHHHHh-hhcCCeeeccCchhhHHHHHHHHhc---------cHHHHHHHHHHHHHhCCeEEEEEEEcCC---- Q lcl|NC_010179. 
66 NFYQLLVDQEAGY-IASVFPDIDVGKDADNKKILDVLGD---------DRALTLNSLLVDSSNAGRAWLHYWIDED---- 131 (469) Q Consensus 66 n~~k~iv~~~~~~-l~g~p~~~~~~~~~~~~~l~~~~~~---------n~~~~~~~~~~~~~~~G~~~~~v~~d~~---- 131 (469) +=...||+..+-+ -...|+.+..++.+..+.+++...+ +|.....+..+...+.|+-|.+..+|.+ T Consensus 90 ~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~ 169 (523) T protein:vir:68 90 NAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKE 169 (523) T ss_pred hHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhheeeeEEEEEEEeeCCCccc Confidence 2122233332222 2456677766655444443332221 4555677888999999999999988754 Q ss_pred CceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEE---EEEEEcCCeEEEEEeecCceeeccccccccccc Q lcl|NC_010179. 132 NNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFT---VHEYWTDKEAQFFRTSATDSTVIEPYNIITSYD 208 (469) Q Consensus 132 ~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (469) |-..+..+||+.+-.+.. .......|...+. .+-+|.+....+ .. +.. ... T Consensus 170 GI~Elr~lDPr~i~~vr~-------------i~~~~~~g~~vi~~~~e~f~Y~~~~~~~-~~---~g~---------~~~ 223 (523) T protein:vir:68 170 GIKELRRLDPRQVQYVRE-------------VITTTEAGVKIVKGYKEYFIYDTSHESY-AC---DGR---------IYE 223 (523) T ss_pred cceeeeeeCCcceeEEEe-------------ecCCCCcchhhhhhhhhheeeccccccc-cc---ccc---------ccC Confidence 556788889987644432 1111111111110 001122211110 00 000 000 Q ss_pred ccccccccccccccccCCcccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcc Q lcl|NC_010179. 209 LSAGYETGQSNTLKHNFGRVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGA 280 (469) Q Consensus 209 ~~~~~~~~~~~~~~~~~g~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~ 280 (469) ... + -+||- |.|... ..+.-.+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+.. 
T Consensus 224 ~~~------------~-ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvG 290 (523) T protein:vir:68 224 AGT------------K-IKIPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTG 290 (523) T ss_pred CCc------------c-eecchhheeeeeccceeCCCCceeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecC Confidence 000 0 01111 111111 1111122233444444554 3455555566666664433322221 Q ss_pred cc-----hhhhhhh---hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHH Q lcl|NC_010179. 281 SL-----KQFMNDL---REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNI 331 (469) Q Consensus 281 ~~-----~~~~~~~---~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i 331 (469) +. .+-...+ -.++++.-... +.+.+-.+..|....++.... -++-+.+-+ T Consensus 291 nlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkL 369 (523) T protein:vir:68 291 NMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNME-DVRWFRNAL 369 (523) T ss_pred CCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHH-HHHHHHHHH Confidence 11 1111111 11222221111 112233455555444555443 367778888 Q ss_pred HHHhCCCCc--Ccc----ccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEe Q lcl|NC_010179. 332 FLFGQGIDP--ANF----ESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHW 400 (469) Q Consensus 332 ~~~s~~p~~--~~~----~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f 400 (469) |+...+|-. 
..+ .+|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.| T Consensus 370 y~aLnVP~sRl~~~~~~f~~Gr~~--EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f 447 (523) T protein:vir:68 370 YMALRIPITRIPSDQGGIQFDAGT--SITRDELSFGKFIRELQHKFEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKF 447 (523) T ss_pred HHHhCCcceeecCCCcceeccccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEee Confidence 888888842 122 234433 44445555566677788888888888877544433321 1223 3467777 Q ss_pred CCCCCCCHHHHH-------HHHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCC Q lcl|NC_010179. 401 TRTKVEDSLTKA-------QIVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVD 467 (469) Q Consensus 401 ~~~~p~d~~e~~-------~~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~ 467 (469) ...-.-.+...+ ++++.+. | .+|.+++.+.+=-.+| .++|-++|++|..+. .. .+...+ T Consensus 448 ~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~k~~--~~------~~p~~e 519 (523) T protein:vir:68 448 HRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEESKEA--RF------QDPDQE 519 (523) T ss_pred eecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcC--CC------CCCchh Confidence 654444444433 3444443 3 4799999987633343 345555566554321 11 011111 Q ss_pred CC Q lcl|NC_010179. 468 DE 469 (469) Q Consensus 468 de 469 (469) .| T Consensus 520 ~~ 521 (523) T protein:vir:68 520 QE 521 (523) T ss_pred hh Confidence 11 No 238 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=75.84 E-value=0.14 Score=25.23 Aligned_cols=380 Identities=12% Similarity=0.107 Sum_probs=164.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH-hh Q lcl|NC_010179. 
1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG-YI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l 79 (469) -++..-.+|| .+|+.+..+++-.. =...||+..+- =- T Consensus 67 ~~~~~~~eLI-----------~~YR~ma~~pEvd~-------------------------------Av~eIVneaiv~d~ 104 (524) T protein:vir:10 67 PGMKTTRELI-----------DTYRNLMNNYEVDN-------------------------------AVSEIVSDAIVYED 104 (524) T ss_pred cccchHHHHH-----------HHHHHHhhccchhh-------------------------------HHHHhhcceeEecC Confidence 1112222222 33444433333221 11222322221 23 Q ss_pred hcCCeeeccCchhhHHHH--------HHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEEccceeEE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKI--------LDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGIIQPDQITP 146 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l--------~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~~p~~~~~ 146 (469) ...|+.+..++.+..+.+ ..+++- +|.....+..+...+.|+-|.+..+|.+ |-..+..+||+.+-. T Consensus 105 ~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~ 184 (524) T protein:vir:10 105 DTEVVALNLDKSKFSPKIKNMMLDEFNDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQY 184 (524) T ss_pred CCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEeeCCCccccceeeeeeCCcccee Confidence 345666665554433333 233321 4556677889999999999999988754 556788888887654 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) +.. .......+... ++ +...+|.+..+........ ..+.... 
+ - T Consensus 185 vr~-------------i~~~~~~~~~v------i~-~~~e~f~Y~~~~~~y~~~g---~~~~~~~------------~-i 228 (524) T protein:vir:10 185 VRE-------------IITETEAGTKI------VK-GYKEYFIYDTAHESYACDG---RMYEAGT------------K-I 228 (524) T ss_pred eee-------------eccCCCccchh------hc-chhhheeeccCccccccCc---cccCCCc------------c-e Confidence 432 11111111111 11 1111111111000000000 0000000 0 0 Q ss_pred cccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh---h Q lcl|NC_010179. 227 RVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL---R 290 (469) Q Consensus 227 ~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~---~ 290 (469) +||- |.|... ..+.-.+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+-...+ - T Consensus 229 kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~ 308 (524) T protein:vir:10 229 KIPKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTM 308 (524) T ss_pred ecchhheeeeeccceeCCCCceeccchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhc Confidence 1111 111111 1111122223444444454 344555555666666433332221111 1111111 0 Q ss_pred hcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--Ccc---- Q lcl|NC_010179. 291 EYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANF---- 343 (469) Q Consensus 291 ~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~---- 343 (469) .++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. ..+ T Consensus 309 KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~sRl~~d~~~~ 387 (524) T protein:vir:10 309 KNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNME-DVRWFRQALYMALRVPLSRIPQDQQGG 387 (524) T ss_pred CceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHH-HHHHHHHHHHHHhCCchhhcCCCCCcc Confidence 1111111111 122233455555444555443 367778888888888842 222 Q ss_pred -ccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHHH---- Q lcl|NC_010179. 
344 -ESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ---- 413 (469) Q Consensus 344 -~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~~---- 413 (469) ++|. |..|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...++ T Consensus 388 f~~gr--~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~ 465 (524) T protein:vir:10 388 VMFDS--GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILER 465 (524) T ss_pred ccccc--cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 1233 2334444555556677788888888888877544433321 1223 35677776554444444433 Q ss_pred ---HHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 414 ---IVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 414 ---~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.+. | .+|.+++.+.+=-.+| .++|-++|++|..+. .. .+...++| T Consensus 466 R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~--~~------~~~~~~~~ 522 (524) T protein:vir:10 466 RINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEA--RF------QDPDQEQE 522 (524) T ss_pred HHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcC--CC------CCCchhhh Confidence 344443 3 4799999987643343 345555566554321 11 01111111 No 239 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=74.55 E-value=0.16 Score=24.99 Aligned_cols=286 Identities=8% Similarity=0.028 Sum_probs=104.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+--.+.+.+..+ ...-+||+- ++.. ..+++.+ ...+ +. 
T Consensus 33 ~~~~~~~~~~~~~-----------~~~~~~~~p--p~~~-------------------------~~la~l~-~~~~-~h- 71 (346) T protein:vir:10 33 LDRADILNYLECS-----------AMYEKWYNP--PMSF-------------------------DGLAKSL-RSST-HH- 71 (346) T ss_pred cCchhHHHHHHHh-----------hcCCceEec--CCCH-------------------------HHHHHHH-Hhhh-hc- Confidence 4333232222111 112234441 1100 0001000 0000 00 Q ss_pred cCCeeeccCchhhHHHHHHHHhc-c-H--HHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCc Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD-D-R--ALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNK 155 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~-n-~--~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~ 155 (469) +-++... ...+..+++. | . ...+.+++.+.+.+|.+|+.+..+..|++ .+..++|..+.+.-+.+ . T Consensus 72 ~~~i~~k------~n~l~~l~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~r~~~G~~~~L~pl~~~~v~~~~~~~---~ 142 (346) T protein:vir:10 72 ESAIITK------ANILLSTCEVDSRYLSRRDLSSFVKDYLVFGNAYFEVVRNRLGQVQRIESPLAKYVRKGLEAG---Q 142 (346) T ss_pred chhhhhh------hhhHHHHHhCCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCCceEEEEcCC---e Confidence 0000000 0011112211 1 1 11244566677889999999988888875 47777777765533221 1 Q ss_pred eEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC Q lcl|NC_010179. 156 LLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK 235 (469) Q Consensus 156 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n 235 (469) . +|.....+|.. . .+..+. |+++++ T Consensus 143 ~-----~~~~~~~~g~~-~----~~~~~d---------------------------------------------Iih~r~ 167 (346) T protein:vir:10 143 F-----YYVPQRFDHQE-H----EFAKGS---------------------------------------------IYHLLE 167 (346) T ss_pred E-----EEEEEccCCeE-E----EEeccc---------------------------------------------EEEecC Confidence 1 11111111110 0 011122 233322 Q ss_pred -----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCCcc--cchhhhhhhhh-------cceeeecc Q lcl|NC_010179. 
236 -----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYGGA--SLKQFMNDLRE-------YKSIKINN 299 (469) Q Consensus 236 -----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~~~--~~~~~~~~~~~-------~~~~~~~~ 299 (469) .-.|.|.+......+..-+.+..-..+.+...+.|-.++ ++...+ ..+.....++. .+++.+.+ T Consensus 168 ~~~~~~~~G~~~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d~~l~~e~~~~i~~~~~~~~g~~n~~~~~vl~~ 247 (346) T protein:vir:10 168 PDINQDIYGLPQYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSDASQKQEDVENIRQQLKQSKGVGNFKNLFVHAP 247 (346) T ss_pred CCCCCCeeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCHHHHHHHHHHHHHhcCccccCceeEecC Confidence 124667665544444433322222233344455565444 342111 11111212211 12333333 Q ss_pred cCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc----c---cCCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 300 AGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF----E---SSNASGVAIKMLYSHLELKAAKTQTYFE 372 (469) Q Consensus 300 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~~ 372 (469) ++..++-++.-++.......+.+..+...++|...-++|+.-.. + ++++...++.+.. T Consensus 248 ~~~~~gi~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~llG~~~~~~~~~s~~e~~~~~f~~--------------- 312 (346) T protein:vir:10 248 NGKKDGIQIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQLMGIIPNNTGGFGNVADAAEVFFI--------------- 312 (346) T ss_pred CCCccceeEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHH--------------- Confidence 33322223333332223445666677778899999899875221 1 1222222222221 Q ss_pred HHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHH Q lcl|NC_010179. 
373 HAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLT 410 (469) Q Consensus 373 ~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e 410 (469) ..|.-+++.+.++.+.-+.+ -|.|++...-..+| T Consensus 313 ~~l~P~~~~iee~n~~L~~e----~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 313 TEIEPLQERLKEFNQWLGQE----VIKFKPSKLLQRTQ 346 (346) T ss_pred HHHHHHHHHHHHHHhhcccc----eeeechhhhcccCC Confidence 22222222222211111111 24565443332222 No 240 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=74.46 E-value=0.16 Score=24.98 Aligned_cols=380 Identities=12% Similarity=0.108 Sum_probs=164.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH-hh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG-YI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l 79 (469) -++..-.+|| .+|+.+..+++-.. =...||+..+- =- T Consensus 67 ~~~~~~~eLI-----------~~YR~ma~~pEvd~-------------------------------Av~eIVneaiv~d~ 104 (524) T protein:vir:72 67 PGMKTTRELI-----------DTYRNLMNNYEVDN-------------------------------AVSEIVSDAIVYED 104 (524) T ss_pred cccchHHHHH-----------HHHHHHhhccchhh-------------------------------HHHHhhcceeEecC Confidence 1112222222 33444433333221 11222322221 23 Q ss_pred hcCCeeeccCchhhHHHH--------HHHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEEccceeEE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKI--------LDVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGIIQPDQITP 146 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l--------~~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~~p~~~~~ 146 (469) ...|+.+..++.+..+.+ ..+++- +|.....+..+...+.|+-|.+..+|.+ |-..+..+||+.+-. 
T Consensus 105 ~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~ 184 (524) T protein:vir:72 105 DTEVVALNLDKSKFSPKIKNMMLDEFSDVLNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQY 184 (524) T ss_pred CCceEEEEecCcCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCcccee Confidence 345666665554433333 333321 4556677889999999999999988755 556788888887654 Q ss_pred EEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCC Q lcl|NC_010179. 147 VYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFG 226 (469) Q Consensus 147 ~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 226 (469) +.. .......+... ++ +...+|.+..+........ ..+.... + - T Consensus 185 vr~-------------i~~~~~~~~~v------i~-~~~e~f~Y~~~~~~y~~~g---~~~~~~~------------~-i 228 (524) T protein:vir:72 185 VRE-------------IITETEAGTKI------VK-GYKEYFIYDTAHESYACDG---RMYEAGT------------K-I 228 (524) T ss_pred eee-------------eccCCCccchh------hc-chhhheeeccCccccccCc---cccCCCc------------c-e Confidence 432 11111111111 11 1111111111000000000 0000000 0 0 Q ss_pred cccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh---h Q lcl|NC_010179. 227 RVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL---R 290 (469) Q Consensus 227 ~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~---~ 290 (469) +||- |.|... ..+.-.+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. 
.+-...+ - T Consensus 229 kI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~ 308 (524) T protein:vir:72 229 KIPKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTM 308 (524) T ss_pred ecchhheeeeeccceeCCCCceeccchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhc Confidence 1111 112111 1111122223444444454 344555555666666433332221111 1111111 0 Q ss_pred hcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--Ccc---- Q lcl|NC_010179. 291 EYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANF---- 343 (469) Q Consensus 291 ~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~---- 343 (469) .++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. ..+ T Consensus 309 KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~sRl~~d~~~~ 387 (524) T protein:vir:72 309 KNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNME-DIRWFRQALYMALRVPLSRIPQDQQGG 387 (524) T ss_pred CceeEEeCCCCeeccchhhhhhHhhhcccccCCCcccceeeccccCCcChHH-HHHHHHHHHHHHhCCchhhcCCCCCcc Confidence 1111111111 122233455555444555443 367778888888888842 222 Q ss_pred -ccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHHH---- Q lcl|NC_010179. 344 -ESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ---- 413 (469) Q Consensus 344 -~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~~---- 413 (469) ++|. 
|..|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...++ T Consensus 388 f~~gr--~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~ 465 (524) T protein:vir:72 388 VMFDS--GTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILER 465 (524) T ss_pred ccccc--cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHH Confidence 1233 2334444555556677788888888888877544433321 1223 35677776554444444333 Q ss_pred ---HHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 414 ---IVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 414 ---~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.+. | .+|.+++.+.+=-.+| .++|-++|++|..+. .. .+...+.| T Consensus 466 R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~--~~------~~~~~~~~ 522 (524) T protein:vir:72 466 RINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEESKEA--RF------QDPDQEQE 522 (524) T ss_pred HHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcC--CC------CCCchhhh Confidence 344443 3 4799999987643343 345555566654321 11 00011111 No 241 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=73.62 E-value=0.17 Score=24.83 Aligned_cols=403 Identities=11% Similarity=0.075 Sum_probs=171.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH-hh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG-YI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l 79 (469) .+...-.+|| .+|+.+..+++-. 
+=...||+..+- -- T Consensus 47 ~~~~n~~eLI-----------~~YR~ma~~pEVd-------------------------------~Av~eIVneaIv~d~ 84 (564) T protein:vir:10 47 QNSRNEYELI-----------RRYRDMSLHPEVD-------------------------------SAIDEIVNEFVVNDG 84 (564) T ss_pred cchhhHHHHH-----------HHHHHHhhccchh-------------------------------hHHHHhhcceeEecC Confidence 1121122222 3333333233221 112223332221 22 Q ss_pred hcCCeeeccCchhhHHHHH--------HHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC----CceEEEEEccceeEE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKIL--------DVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED----NNFRYGIIQPDQITP 146 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~--------~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~i~~~~p~~~~~ 146 (469) ..+|+.+..++.+..+.++ .+++- +|.....+..+.+.+.|+-|.+.-+|.+ |-..+..+||+.+-. T Consensus 85 ~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~eLr~lDPr~i~~ 164 (564) T protein:vir:10 85 DDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDNPKKGILELRYIDSLKIRK 164 (564) T ss_pred CCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCChhhhhhhhhhhcccceee Confidence 3456666665544444432 23321 4556677889999999999998877643 445688999998887 Q ss_pred EEeCCCCC--ceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeec-Cceeecccccccccccccccccccccccccc Q lcl|NC_010179. 147 VYATTLDN--KLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSA-TDSTVIEPYNIITSYDLSAGYETGQSNTLKH 223 (469) Q Consensus 147 ~~d~~~~~--~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (469) ++..-... .-...++-+. ..+.+.+-...+.|.... .+....... ...++... 
...- T Consensus 165 vr~i~~~~~~~~~~v~k~~~----------~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~--~~~~~~~~--------~ikI 224 (564) T protein:vir:10 165 VRQKLKDVDPNRKEIEKGTA----------LQYDYGDFIEYYIYNPKGFAGNIPMVTG--SMDWSNQE--------GIKI 224 (564) T ss_pred eeeeccccccccceeeeeee----------eeccccccccceeeccccccCccccccc--cccccccc--------ceee Confidence 77432211 1111111111 011111101111111100 000000000 00000000 0000 Q ss_pred cCCcccEEEecC-CccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhh----hhhhhh Q lcl|NC_010179. 224 NFGRVPFIEFPK-NKYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQF----MNDLRE 291 (469) Q Consensus 224 ~~g~vPvv~~~n-~~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~----~~~~~~ 291 (469) +-..|+.++.=- ...+.-.+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+. +...+ T Consensus 225 ~~daI~y~hSGL~d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~K- 303 (564) T protein:vir:10 225 ASDAIAQSTSGLMDLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYR- 303 (564) T ss_pred chhhcceecccceeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcC- Confidence 111122222100 00111122233444444554 345555556666666443332222111 111 11111 Q ss_pred cceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCC--cCccc---- Q lcl|NC_010179. 292 YKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGID--PANFE---- 344 (469) Q Consensus 292 ~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~---- 344 (469) ++++.=... +.+.+-.+..|....++.... 
.++-+.+-+|+...+|- +..++ T Consensus 304 NklVYDa~TGevrddrk~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~-DV~YF~kKLY~aLnVP~SRl~~e~~~f~ 382 (564) T protein:vir:10 304 NKLVYDGQTGEIRDDKKHMSMLEDFWLPRREGGRGTEITTLPGGQNLGELK-DVEYFKKKLYNSLNLPPSRLTDDNKAFN 382 (564) T ss_pred ceEEEeccCceecccchhhhhHhhhcccccCCCcccceeeccccCCcchHH-HHHHHHHHHHHHhCCCcccccCCCceee Confidence 222221111 112233455555444554433 36777888888888884 22222 Q ss_pred cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHHH------ Q lcl|NC_010179. 345 SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ------ 413 (469) Q Consensus 345 ~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~~------ 413 (469) +|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...++ T Consensus 383 ~Gr~~--EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl 460 (564) T protein:vir:10 383 LGKST--EILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRV 460 (564) T ss_pred ccccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHH Confidence 34433 33344445555667778888888888877544433321 1223 34677776544444444333 Q ss_pred -HHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhh---hhH-hhcccCCCCCC----------CCC Q lcl|NC_010179. 414 -IVSTV---AN-YSSKEAVAKANPIVDD--WQQELKDLAKDREEND---PYA-NQADELNGKGV----------DDE 469 (469) Q Consensus 414 -~~~kl---~g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~---~~~-~~~~~~~~~~~----------~de 469 (469) +++.+ .| .+|.+++.+.+=-.+| .+++-++|++|..+.. |.. +..+.+..++. ++. 
T Consensus 461 ~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~ 537 (564) T protein:vir:10 461 NLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDL 537 (564) T ss_pred HHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhcccc Confidence 33444 33 4799999987533333 4566777777765432 211 01111111111 111 No 242 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=72.21 E-value=0.19 Score=24.59 Aligned_cols=386 Identities=12% Similarity=0.100 Sum_probs=166.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH-hh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG-YI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l 79 (469) .+++.. .......+.+|+.+..+++-.. =...||+..+- =- T Consensus 61 ~~~e~~-------~~~~~eLI~~YR~ma~~pEvd~-------------------------------Av~eIVneaiv~d~ 102 (521) T protein:vir:65 61 YSTDQK-------ISTTKQLVNTYRGLMNNHEVEN-------------------------------AVQNIVNDAIVFEE 102 (521) T ss_pred ccccch-------hhhHHHHHHHHHHHhhccchhh-------------------------------HHHHhhcceeEecC Confidence 222210 0111222334444433333221 11223332221 23 Q ss_pred hcCCeeeccCchhhHHHHHH--------HHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccceeEEE Q lcl|NC_010179. 
80 ASVFPDIDVGKDADNKKILD--------VLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPDQITPV 147 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~--------~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~~~~~~ 147 (469) ...|+.+..++.+..+.+++ +++- +|.....+..+...+.|+-|.+.-+|++ |-..+..+||+.+..+ T Consensus 103 ~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDPr~i~~v 182 (521) T protein:vir:65 103 GHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYV 182 (521) T ss_pred CCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCccccceeeeeeCCcceeee Confidence 34666666655444443332 2221 4555677889999999999999887654 4456888999987766 Q ss_pred EeCCCCCceEEEEEEEEeeecCCceE---EEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccc-cccc Q lcl|NC_010179. 148 YATTLDNKLLGVLRSYKQLDPEAGKY---FTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSN-TLKH 223 (469) Q Consensus 148 ~d~~~~~~~~~~v~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 223 (469) .-... .+..+... ...+-+|..+...+... +..+. .+......... ...| T Consensus 183 r~i~k-------------~~~~~~~v~~~~~e~f~Y~~~~~~~~~~-g~~~~------------~~~~vkI~~dAI~y~h 236 (521) T protein:vir:65 183 REIIT-------------EDTPEGKIYKATKEYFIYTVGNSSYCAG-GQVFS------------PNSRVKIPRSAITYAH 236 (521) T ss_pred eeecc-------------cccCCcceecceeeeeeeecCCcceecc-ceeec------------CCcceeechhheeeee Confidence 53211 11111111 11111222222211100 00000 00000000000 0000 Q ss_pred cCCcccEEEecCCccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh---hhcc Q lcl|NC_010179. 224 NFGRVPFIEFPKNKYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL---REYK 293 (469) Q Consensus 224 ~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~---~~~~ 293 (469) -|.+| .++..-.|- +..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. 
.+-...+ -.++ T Consensus 237 -SGl~d----~~~~~i~sy---LhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNk 308 (521) T protein:vir:65 237 -SGLMD----CDDKYIIGY---LHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNR 308 (521) T ss_pred -cccee----CCCCeeeec---chhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCce Confidence 01111 111111222 3334444443 345555556666666443332221111 1111111 0111 Q ss_pred eeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcC--ccc---c-- Q lcl|NC_010179. 294 SIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPA--NFE---S-- 345 (469) Q Consensus 294 ~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~--~~~---~-- 345 (469) ++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-.- .++ + T Consensus 309 lvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~ 387 (521) T protein:vir:65 309 VVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDID-DIRYFNRKLYEALRVPLSRSNLSDANMVIG 387 (521) T ss_pred eEeecccccccccccccchhhhhcccccCCCCccceeecccCCCcChHH-HHHHHHHHHHHHhCCCceeccCCCCcceec Confidence 1111111 122233455555444555443 3677788888888888432 222 2 Q ss_pred CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHH-------H Q lcl|NC_010179. 346 SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKA-------Q 413 (469) Q Consensus 346 g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~-------~ 413 (469) |.. 
..|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...+ + T Consensus 388 gr~--~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~ 465 (521) T protein:vir:65 388 GDG--SEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIG 465 (521) T ss_pred ccc--chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHH Confidence 332 334344555556677778888888888877544333321 1223 3467777654444444333 3 Q ss_pred HHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 414 IVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 414 ~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.+. | .+|.+++.+.+=-.+| .+++-++|++|..+ +... +...+.| T Consensus 466 ~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~k~I~~E~~~--~~~~------~p~~~~~ 519 (521) T protein:vir:65 466 LIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEEAND--PRFK------QTPDEIE 519 (521) T ss_pred HHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHhhhC--CCCC------CCccccc Confidence 444443 3 4799999987633343 34455556655432 2111 1111111 No 243 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=70.71 E-value=0.21 Score=24.35 Aligned_cols=360 Identities=9% Similarity=0.011 Sum_probs=147.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-|- +.+.....+....-.-....+-+.... .+.. .. 
..+-+...-....|+..++-+- T Consensus 1 MGl~------~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~--vt-~~~al~~~~v~~~i~~Ia~~iA 59 (394) T protein:vir:62 1 MGLR------DRFSNYLFKKAEKRGYLDNVLGKSIRY------------SGVY--VT-DSNILQSSDVYELLQDISNQMV 59 (394) T ss_pred Cchh------hhhhhhccCCCCchhhhhhhhhccccc------------Cccc--cC-hhhhhccHHHHHHHHHHHHhhc Confidence 4332 111111000000000001111111000 0000 00 0001223445556666666666 Q ss_pred cCCeeeccCch--hhHHHHHHHHhc-cH----HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEEEeCCCC Q lcl|NC_010179. 81 SVFPDIDVGKD--ADNKKILDVLGD-DR----ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPVYATTLD 153 (469) Q Consensus 81 g~p~~~~~~~~--~~~~~l~~~~~~-n~----~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~ 153 (469) +-|+.+...+. .....+..++.. |. .+....+..+++.+|.+|+++-.+..+ . +..+.|..+... T Consensus 60 ~lp~~v~~~~g~~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~~~-----~--~~~~~~~~~~~~- 131 (394) T protein:vir:62 60 LADIVVEDEFGNEIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPILNGAQIH-----L--ASNVFTELDDNL- 131 (394) T ss_pred ccceEEEcCCCcccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEEecceee-----c--cccceEEECCce- Confidence 66766543221 122334445543 32 222335677888999998865321111 1 122333332110 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) ..+|.. .+ . .+.... |+|+ T Consensus 132 ------~~~~~~---~~-~------~~~~~e---------------------------------------------iih~ 150 (394) T protein:vir:62 132 ------VEHFNI---GG-H------EIPPCM---------------------------------------------IRHV 150 (394) T ss_pred ------EEEEee---CC-E------Eechhh---------------------------------------------eEEe Confidence 001100 00 0 001111 2333 Q ss_pred cC----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEe--cCCcccc---hhhhhhhh--------hcceee Q lcl|NC_010179. 
234 PK----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLT--NYGGASL---KQFMNDLR--------EYKSIK 296 (469) Q Consensus 234 ~n----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~--g~~~~~~---~~~~~~~~--------~~~~~~ 296 (469) +. .-.|.|.+..+...++.......-..+.+...+.|-.+++ +...... +.....+. ..+++. T Consensus 151 r~~~~d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~v 230 (394) T protein:vir:62 151 KNIGADHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAHINPQNGAQSKLINAILDQLESIDEARSVKM 230 (394) T ss_pred cCcCCCCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCCCCcCHHHHHHHHHHHHHHhccccccCceeE Confidence 21 1236677776666666666655555566666677755554 3211111 11111111 112222 Q ss_pred ecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 297 INNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 297 ~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) ++. +.+.++..... ....+.+..+...+.|+..-++|+.-.....+.+.+ +.....+..+ T Consensus 231 l~~-----g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e-------------~~~~~~~~~~ 292 (394) T protein:vir:62 231 IPL-----GKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIKEDIE-------------KAMMYIHNKA 292 (394) T ss_pred eeC-----CCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHH-------------HHHHHHHHHH Confidence 322 12345544333 334455566777888988888887543322221111 1112233344 Q ss_pred HHHHHHHHHHHhccc---CCCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHHHH Q lcl|NC_010179. 
375 INELVRAIMRYLNFS---DADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLAKD 447 (469) Q Consensus 375 l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~~E 447 (469) |.-+++.+...++.+ ......+.+.|+.....+....++++.++ +|+++.-.+.+++++ ++++....-.+..- T Consensus 293 l~P~~~~ie~~l~~kll~~~~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl~p~~~~~gd~~~~~~n 372 (394) T protein:vir:62 293 VRPIMKNFEDHLSLLFYAQNSGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGFPKQNTKESQAIYISND 372 (394) T ss_pred HHHHHHHHHHHHhhhhcCccccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeeecccc Confidence 444444444433321 11223577888877767777788888776 578999898888754 32222110001100 Q ss_pred HHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 448 REENDPYANQADELNGKGVDDE 469 (469) Q Consensus 448 ~~~~~~~~~~~~~~~~~~~~de 469 (469) -..... .+..++...+|.++| T Consensus 373 ~~~~~~-~~~~~~~~kgge~~e 393 (394) T protein:vir:62 373 VTEIGK-KEATDGSLGGGEENE 393 (394) T ss_pred cccccc-cccccccCCCCCCCC Confidence 000000 001111222333334 No 244 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=69.08 E-value=0.23 Score=24.10 Aligned_cols=349 Identities=12% Similarity=0.098 Sum_probs=124.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+ +.+.+..+.+........ .+.... 
...-+........|+..++-+- T Consensus 1 Mg~-----------------------f~~lf~~~~~~~~~~~~~-----~~~~v~---~~~~~~~~~v~~~i~~Ia~~iA 49 (395) T protein:vir:95 1 MSI-----------------------LEKIFKTRKDITYMLDLD-----MIEDLS---QQAYVKRLAIDSCIEFVARAVA 49 (395) T ss_pred Cch-----------------------hhhhhccCccccccccch-----hccccc---hhhhhhhHHHHHHHHHHHHhhc Confidence 222 111222221110000000 000000 0001122334445555555555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEE--EEeCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITP--VYATTL 152 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~--~~d~~~ 152 (469) +-|+.+-.........+..++.. | .+ +....+..+.+..|.+|+++.. ++.+ ...++..+.+ +++. T Consensus 50 ~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~-- 123 (395) T protein:vir:95 50 QSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSD--SKEL--LIADSFYREEYALYDD-- 123 (395) T ss_pred cceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEec--CCCe--EecCCccceeEeecCc-- Confidence 55665443333344445555532 3 22 2233455566666776654432 2222 1122221111 1110 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) ....+.. . +..+ ...+.... |+| T Consensus 124 ---~~~~~~~---~---~~~~---~~~~~~~e---------------------------------------------vih 146 (395) T protein:vir:95 124 ---IFKDVTV---K---DYTY---QRTFTMQE---------------------------------------------VIY 146 (395) T ss_pred ---ceeEEEE---c---Ccee---eeeecccc---------------------------------------------EEE Confidence 0000000 0 0000 00111111 233 Q ss_pred ecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh----hh-cceeeecccCC Q lcl|NC_010179. 
233 FPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL----RE-YKSIKINNAGN 302 (469) Q Consensus 233 ~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~----~~-~~~~~~~~~~~ 302 (469) ++. ...|.|-++.+...++. ..+.+...+.+--++.-......++....+ .. .+...-...+- T Consensus 147 ~~~~~~~~~~~G~spi~~~~~~~~~-------~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v 219 (395) T protein:vir:95 147 LKYNNNKVTHFVESLFEDYGKIFGR-------MIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAI 219 (395) T ss_pred EccCCCCcccccchHHHHHHHHHHH-------HHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcce Confidence 321 12345545444444433 223333444443333222211112221111 11 00000000000 Q ss_pred C-CCCcceEEeecCC-------HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 303 G-DKSGVDKLQIDIP-------VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 303 ~-~~~~~~~l~~~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) - -+++.+|-....+ ...+.+..+...+.|+..-++|+.-.. |+-|+.+ +.....+..+ T Consensus 220 ~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e------------~~~~~~~~~~ 285 (395) T protein:vir:95 220 APLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY--GETADLE------------KNTLVFEKFC 285 (395) T ss_pred EEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHH------------HHHHHHHHHH Confidence 0 0122333322222 124556666777889888888875332 1112111 1112233334 Q ss_pred HHHHHHHHHHHhcccCC---C-cccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHH-HH----- Q lcl|NC_010179. 375 INELVRAIMRYLNFSDA---D-KRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQ-QE----- 440 (469) Q Consensus 375 l~~~~~~i~~~~~~~~~---~-~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~-~E----- 440 (469) |.-++..|...++.+=. . ...+++.++.-+-.|..+.++++.++ +|+++.-++.+.+++ +++.. 
.+ T Consensus 286 l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:95 286 LTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred HHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 44444444443332111 1 11234555666677889999999876 678999888887654 33321 01 Q ss_pred -HHHHH---HHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 441 -LKDLA---KDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 441 -~eri~---~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +..++ .++....+ ....+++.++. T Consensus 366 n~~~~~~~~~~~~~~~~-----~~~kgg~~~~~ 393 (395) T protein:vir:95 366 NYEKANSGENDEKEKDE-----NTLKGGDEDES 393 (395) T ss_pred ccccccccccccCcccc-----cccCCCCCCCC Confidence 11111 11111111 11111111111 No 245 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=69.08 E-value=0.23 Score=24.10 Aligned_cols=349 Identities=12% Similarity=0.098 Sum_probs=124.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+ +.+.+..+.+........ .+.... ...-+........|+..++-+- T Consensus 1 Mg~-----------------------f~~lf~~~~~~~~~~~~~-----~~~~v~---~~~~~~~~~v~~~i~~Ia~~iA 49 (395) T protein:vir:10 1 MSI-----------------------LEKIFKTRKDITYMLDLD-----MIEDLS---QQAYVKRLAIDSCIEFVARAVA 49 (395) T ss_pred Cch-----------------------hhhhhccCccccccccch-----hccccc---hhhhhhhHHHHHHHHHHHHhhc Confidence 222 111222221110000000 000000 0001122334445555555555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEE--EEeCCC Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITP--VYATTL 152 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~--~~d~~~ 152 (469) +-|+.+-.........+..++.. | .+ +....+..+.+..|.+|+++.. ++.+ ...++..+.+ +++. T Consensus 50 ~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~-- 123 (395) T protein:vir:10 50 QSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSD--SKEL--LIADSFYREEYALYDD-- 123 (395) T ss_pred cceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEec--CCCe--EecCCccceeEeecCc-- Confidence 55665443333344445555532 3 22 2233455566666776654432 2222 1122221111 1110 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) ....+.. . +..+ ...+.... |+| T Consensus 124 ---~~~~~~~---~---~~~~---~~~~~~~e---------------------------------------------vih 146 (395) T protein:vir:10 124 ---IFKDVTV---K---DYTY---QRTFTMQE---------------------------------------------VIY 146 (395) T ss_pred ---ceeEEEE---c---Ccee---eeeecccc---------------------------------------------EEE Confidence 0000000 0 0000 00111111 233 Q ss_pred ecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh----hh-cceeeecccCC Q lcl|NC_010179. 233 FPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL----RE-YKSIKINNAGN 302 (469) Q Consensus 233 ~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~----~~-~~~~~~~~~~~ 302 (469) ++. ...|.|-++.+...++. ..+.+...+.+--++.-......++....+ .. 
.+...-...+- T Consensus 147 ~~~~~~~~~~~G~spi~~~~~~~~~-------~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v 219 (395) T protein:vir:10 147 LKYNNNKVTHFVESLFEDYGKIFGR-------MIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAI 219 (395) T ss_pred EccCCCCcccccchHHHHHHHHHHH-------HHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcce Confidence 321 12345545444444433 223333444443333222211112221111 11 00000000000 Q ss_pred C-CCCcceEEeecCC-------HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 303 G-DKSGVDKLQIDIP-------VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 303 ~-~~~~~~~l~~~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) - -+++.+|-....+ ...+.+..+...+.|+..-++|+.-.. |+-|+.+ +.....+..+ T Consensus 220 ~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e------------~~~~~~~~~~ 285 (395) T protein:vir:10 220 APLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY--GETADLE------------KNTLVFEKFC 285 (395) T ss_pred EEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHH------------HHHHHHHHHH Confidence 0 0122333322222 124556666777889888888875332 1112111 1112233334 Q ss_pred HHHHHHHHHHHhcccCC---C-cccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHH-HH----- Q lcl|NC_010179. 375 INELVRAIMRYLNFSDA---D-KRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQ-QE----- 440 (469) Q Consensus 375 l~~~~~~i~~~~~~~~~---~-~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~-~E----- 440 (469) |.-++..|...++.+=. . ...+++.++.-+-.|..+.++++.++ +|+++.-++.+.+++ +++.. .+ T Consensus 286 l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:10 286 LTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred HHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 44444444443332111 1 11234555666677889999999876 678999888887654 33321 01 Q ss_pred -HHHHH---HHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
441 -LKDLA---KDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 441 -~eri~---~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +..++ .++....+ ....+++.++. T Consensus 366 n~~~~~~~~~~~~~~~~-----~~~kgg~~~~~ 393 (395) T protein:vir:10 366 NYEKANSGENDEKEKDE-----NTLKGGDEDES 393 (395) T ss_pred ccccccccccccCcccc-----cccCCCCCCCC Confidence 11111 11111111 11111111111 No 246 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=69.08 E-value=0.23 Score=24.10 Aligned_cols=349 Identities=12% Similarity=0.098 Sum_probs=124.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+ +.+.+..+.+........ .+.... ...-+........|+..++-+- T Consensus 1 Mg~-----------------------f~~lf~~~~~~~~~~~~~-----~~~~v~---~~~~~~~~~v~~~i~~Ia~~iA 49 (395) T protein:vir:10 1 MSI-----------------------LEKIFKTRKDITYMLDLD-----MIEDLS---QQAYVKRLAIDSCIEFVARAVA 49 (395) T ss_pred Cch-----------------------hhhhhccCccccccccch-----hccccc---hhhhhhhHHHHHHHHHHHHhhc Confidence 222 111222221110000000 000000 0001122334445555555555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEE--EEeCCC Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITP--VYATTL 152 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~--~~d~~~ 152 (469) +-|+.+-.........+..++.. | .+ +....+..+.+..|.+|+++.. ++.+ ...++..+.+ +++. 
T Consensus 50 ~~p~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~--~~~~--~~~~~~~~~~~~~~~~-- 123 (395) T protein:vir:10 50 QSHFKVLEGNRIQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSD--SKEL--LIADSFYREEYALYDD-- 123 (395) T ss_pred cceeEeccCCccccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEec--CCCe--EecCCccceeEeecCc-- Confidence 55665443333344445555532 3 22 2233455566666776654432 2222 1122221111 1110 Q ss_pred CCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEE Q lcl|NC_010179. 153 DNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIE 232 (469) Q Consensus 153 ~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~ 232 (469) ....+.. . +..+ ...+.... |+| T Consensus 124 ---~~~~~~~---~---~~~~---~~~~~~~e---------------------------------------------vih 146 (395) T protein:vir:10 124 ---IFKDVTV---K---DYTY---QRTFTMQE---------------------------------------------VIY 146 (395) T ss_pred ---ceeEEEE---c---Ccee---eeeecccc---------------------------------------------EEE Confidence 0000000 0 0000 00111111 233 Q ss_pred ecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhhh----hh-cceeeecccCC Q lcl|NC_010179. 233 FPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMNDL----RE-YKSIKINNAGN 302 (469) Q Consensus 233 ~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~~----~~-~~~~~~~~~~~ 302 (469) ++. ...|.|-++.+...++. ..+.+...+.+--++.-......++....+ .. .+...-...+- T Consensus 147 ~~~~~~~~~~~G~spi~~~~~~~~~-------~~~~~~~~~~~~gii~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v 219 (395) T protein:vir:10 147 LKYNNNKVTHFVESLFEDYGKIFGR-------MIGAQLKNYQIRGILKSASSAYDEKNIEKLQAFTNKLFNTFNKNQLAI 219 (395) T ss_pred EccCCCCcccccchHHHHHHHHHHH-------HHHHHHhcCCCceEEEeCCCCCCHHHHHHHHHHHHHHhccccccCcce Confidence 321 12345545444444433 223333444443333222211112221111 11 00000000000 Q ss_pred C-CCCcceEEeecCC-------HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
303 G-DKSGVDKLQIDIP-------VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 303 ~-~~~~~~~l~~~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) - -+++.+|-....+ ...+.+..+...+.|+..-++|+.-.. |+-|+.+ +.....+..+ T Consensus 220 ~~l~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~--~~~sn~e------------~~~~~~~~~~ 285 (395) T protein:vir:10 220 APLIEGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIY--GETADLE------------KNTLVFEKFC 285 (395) T ss_pred EEcCCCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhc--CcccCHH------------HHHHHHHHHH Confidence 0 0122333322222 124556666777889888888875332 1112111 1112233334 Q ss_pred HHHHHHHHHHHhcccCC---C-cccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHH-HH----- Q lcl|NC_010179. 375 INELVRAIMRYLNFSDA---D-KRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQ-QE----- 440 (469) Q Consensus 375 l~~~~~~i~~~~~~~~~---~-~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~-~E----- 440 (469) |.-++..|...++.+=. . ...+++.++.-+-.|..+.++++.++ +|+++.-++.+.+++ +++.. .+ T Consensus 286 l~P~~~~ie~~l~~kL~~~~~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:10 286 LTPLLKKIQNELNAKLITQSMYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred HHHHHHHHHHHHHHhhcChhhhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 44444444443332111 1 11234555666677889999999876 678999888887654 33321 01 Q ss_pred -HHHHH---HHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 441 -LKDLA---KDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 441 -~eri~---~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +..++ .++....+ ....+++.++. 
T Consensus 366 n~~~~~~~~~~~~~~~~-----~~~kgg~~~~~ 393 (395) T protein:vir:10 366 NYEKANSGENDEKEKDE-----NTLKGGDEDES 393 (395) T ss_pred ccccccccccccCcccc-----cccCCCCCCCC Confidence 11111 11111111 11111111111 No 247 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=66.07 E-value=0.27 Score=23.67 Aligned_cols=379 Identities=11% Similarity=0.103 Sum_probs=165.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH-hh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG-YI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l 79 (469) -++....+| +.+|+.+..+++-.. =...||+..+- =- T Consensus 62 ~~~~~~~eL-----------I~~YR~ma~~pEvd~-------------------------------Av~eIVneaiv~d~ 99 (516) T protein:vir:10 62 NNISGTKDL-----------INTYRQLINNPEVER-------------------------------AVANIVNEAIVYER 99 (516) T ss_pred cccchHHHH-----------HHHHHHHhhccchhh-------------------------------HHHHhhcceeEecC Confidence 122222223 333444433333221 11222322221 23 Q ss_pred hcCCeeeccCchhhHHHHHH--------HHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcC--CCceEEEEEccceeEEEE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILD--------VLGD-DRALTLNSLLVDSSNAGRAWLHYWIDE--DNNFRYGIIQPDQITPVY 148 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~--------~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~--~~~~~i~~~~p~~~~~~~ 148 (469) ...|+.+..++.+..+.+++ +++- +|.....+..+.+.+.|+-|.+..+|. +|-..+..+||+.+..+. 
T Consensus 100 ~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR 179 (516) T protein:vir:10 100 GHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYR 179 (516) T ss_pred CCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEe Confidence 34666666655443333322 2221 455667788999999999999876752 355668888988876643 Q ss_pred eCCCCCceEEEEEEEEeeecCCceEEE---EEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccC Q lcl|NC_010179. 149 ATTLDNKLLGVLRSYKQLDPEAGKYFT---VHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNF 225 (469) Q Consensus 149 d~~~~~~~~~~v~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (469) - -...+.+|..... .+-+|...... +...+.. +..+... T Consensus 180 ~-------------i~~~~~~~~~v~~~~~e~~~Y~~~~~~-~~~~g~~------------~~~~~~i------------ 221 (516) T protein:vir:10 180 E-------------IVTSDIGGTTIVKGYREFFIYTTGNEG-YSYNGRI------------FEPNTRI------------ 221 (516) T ss_pred e-------------ecccccccchhhhhhhheeeeccCccc-cccccce------------eCCCcce------------ Confidence 2 2222222221110 11122221111 1100000 0000000 Q ss_pred CcccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh--- Q lcl|NC_010179. 226 GRVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL--- 289 (469) Q Consensus 226 g~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~--- 289 (469) +||- |.|... ..+...+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+-...+ T Consensus 222 -kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k 300 (516) T protein:vir:10 222 -KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQS 300 (516) T ss_pred -eechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHh Confidence 1110 111110 0111112223334444443 344555555666665433332221111 1111111 Q ss_pred hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--CccccC Q lcl|NC_010179. 
290 REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFESS 346 (469) Q Consensus 290 ~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g 346 (469) -.++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. ..++.+ T Consensus 301 ~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~sRl~~e~~~ 379 (516) T protein:vir:10 301 LKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMD-DVRWFNKKLYEALRIPLSRIPRDDGG 379 (516) T ss_pred cCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCCcccccCCCCc Confidence 01111111111 122233455555444555443 377778888888888842 222211 Q ss_pred Cc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHHH----- Q lcl|NC_010179. 347 NA---SGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ----- 413 (469) Q Consensus 347 ~~---Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~~----- 413 (469) ++ -|..|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...++ T Consensus 380 ~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R 459 (516) T protein:vir:10 380 MVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLR 459 (516) T ss_pred eeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHH Confidence 11 22233334444555667777888888888777544333321 1223 34677776544444444333 Q ss_pred --HHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 414 --IVSTV---AN-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 414 --~~~kl---~g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.+ .| .+|.+++.+.+=-.+| .++|-++|++|..+. .. . 
....+++ T Consensus 460 ~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~--~~---~---~p~~~~~ 515 (516) T protein:vir:10 460 VDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIK--RF---Q---NPENEDD 515 (516) T ss_pred HHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCC--CC---C---CCCcccc Confidence 34443 23 6999999988633333 445566666664332 11 1 0111111 No 248 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=66.07 E-value=0.27 Score=23.67 Aligned_cols=379 Identities=11% Similarity=0.103 Sum_probs=165.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH-hh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG-YI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l 79 (469) -++....+| +.+|+.+..+++-.. =...||+..+- =- T Consensus 62 ~~~~~~~eL-----------I~~YR~ma~~pEvd~-------------------------------Av~eIVneaiv~d~ 99 (516) T protein:vir:10 62 NNISGTKDL-----------INTYRQLINNPEVER-------------------------------AVANIVNEAIVYER 99 (516) T ss_pred cccchHHHH-----------HHHHHHHhhccchhh-------------------------------HHHHhhcceeEecC Confidence 122222223 333444433333221 11222322221 23 Q ss_pred hcCCeeeccCchhhHHHHHH--------HHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcC--CCceEEEEEccceeEEEE Q lcl|NC_010179. 80 ASVFPDIDVGKDADNKKILD--------VLGD-DRALTLNSLLVDSSNAGRAWLHYWIDE--DNNFRYGIIQPDQITPVY 148 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~--------~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~--~~~~~i~~~~p~~~~~~~ 148 (469) ...|+.+..++.+..+.+++ +++- +|.....+..+.+.+.|+-|.+..+|. +|-..+..+||+.+..+. 
T Consensus 100 ~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR 179 (516) T protein:vir:10 100 GHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLFRRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYR 179 (516) T ss_pred CCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCccccceeeeeeCCcceeeEe Confidence 34666666655443333322 2221 455667788999999999999876752 355668888988876643 Q ss_pred eCCCCCceEEEEEEEEeeecCCceEEE---EEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccC Q lcl|NC_010179. 149 ATTLDNKLLGVLRSYKQLDPEAGKYFT---VHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNF 225 (469) Q Consensus 149 d~~~~~~~~~~v~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 225 (469) - -...+.+|..... .+-+|...... +...+.. +..+... T Consensus 180 ~-------------i~~~~~~~~~v~~~~~e~~~Y~~~~~~-~~~~g~~------------~~~~~~i------------ 221 (516) T protein:vir:10 180 E-------------IVTSDIGGTTIVKGYREFFIYTTGNEG-YSYNGRI------------FEPNTRI------------ 221 (516) T ss_pred e-------------ecccccccchhhhhhhheeeeccCccc-cccccce------------eCCCcce------------ Confidence 2 2222222221110 11122221111 1100000 0000000 Q ss_pred CcccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh--- Q lcl|NC_010179. 226 GRVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL--- 289 (469) Q Consensus 226 g~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~--- 289 (469) +||- |.|... ..+...+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+-...+ T Consensus 222 -kI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k 300 (516) T protein:vir:10 222 -KIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLEDAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQS 300 (516) T ss_pred -eechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHh Confidence 1110 111110 0111112223334444443 344555555666665433332221111 1111111 Q ss_pred hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--CccccC Q lcl|NC_010179. 
290 REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFESS 346 (469) Q Consensus 290 ~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~~g 346 (469) -.++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. ..++.+ T Consensus 301 ~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~sRl~~e~~~ 379 (516) T protein:vir:10 301 LKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMD-DVRWFNKKLYEALRIPLSRIPRDDGG 379 (516) T ss_pred cCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCCcccccCCCCc Confidence 01111111111 122233455555444555443 377778888888888842 222211 Q ss_pred Cc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHHH----- Q lcl|NC_010179. 347 NA---SGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKAQ----- 413 (469) Q Consensus 347 ~~---Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~~----- 413 (469) ++ -|..|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...++ T Consensus 380 ~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R 459 (516) T protein:vir:10 380 MVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLR 459 (516) T ss_pred eeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHH Confidence 11 22233334444555667777888888888777544333321 1223 34677776544444444333 Q ss_pred --HHHHH---hc-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 414 --IVSTV---AN-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 414 --~~~kl---~g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.+ .| .+|.+++.+.+=-.+| .++|-++|++|..+. .. . 
....+++ T Consensus 460 ~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E~~~~--~~---~---~p~~~~~ 515 (516) T protein:vir:10 460 VDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQEAGIK--RF---Q---NPENEDD 515 (516) T ss_pred HHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHhhhCC--CC---C---CCCcccc Confidence 34443 23 6999999988633333 445566666664332 11 1 0111111 No 249 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=64.38 E-value=0.3 Score=23.44 Aligned_cols=386 Identities=12% Similarity=0.094 Sum_probs=142.7 Q ss_pred CCHHHHHHHHHHHHHH--HHH---HHHHHHHHHHHhcc--CCcccccccchhhhcccccccccccCcceeccchHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTS--RND---LINNYKKSVDYYEN--KTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVD 73 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~--~~~---~~~~~~~~~~Yy~g--~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~ 73 (469) ++--.+..++...... +.. .+..+..+.++.++ ++++. -.-...|.+ T Consensus 52 ~~~a~~~p~~~~~~~~~~~~~~p~~~~~~~~~~~~l~~~~~npiv--------------------------~~~I~~ia~ 105 (576) T protein:vir:96 52 KQQAYAEPFLEVMDTNPEFRTKRSYMKNSDNLHDVLKQFGNNPIL--------------------------NAIILTRSN 105 (576) T ss_pred ccchhhcceeeeeecCCCccccCcchhhhhhhHHHHHHhhcCHHH--------------------------HHHHHHHHH Confidence 0000000000000000 000 00000000111100 01110 011112222 Q ss_pred HHHHhh---------hcCCeeeccCc-----hhh--HHHHHHHH----hc------cHHHHHHHHHHHHHhCCeEEEEEE Q lcl|NC_010179. 74 QEAGYI---------ASVFPDIDVGK-----DAD--NKKILDVL----GD------DRALTLNSLLVDSSNAGRAWLHYW 127 (469) Q Consensus 74 ~~~~~l---------~g~p~~~~~~~-----~~~--~~~l~~~~----~~------n~~~~~~~~~~~~~~~G~~~~~v~ 127 (469) ..+.|. .|=++.....+ ... ...+..++ .. ++.+.+..+..+.+.+|.+|+.+. 
T Consensus 106 ~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p~~~t~~~f~~~lv~dlll~Gna~~~i~ 185 (576) T protein:vir:96 106 QVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKDIDRDSFQSFCRKIVRDTYTYDQVNFEKV 185 (576) T ss_pred HHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCCCccccHHHHHHHHHHHHHhcCCeEEEEE Confidence 222221 11111111110 000 01111111 11 223334556778899999999887 Q ss_pred EcCC--Cce-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccc Q lcl|NC_010179. 128 IDED--NNF-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNII 204 (469) Q Consensus 128 ~d~~--~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 204 (469) ++.+ |++ .+.+++|..+.++.+... ........|.... ++... ..+....+.++.... T Consensus 186 ~~rd~~g~~~~L~pl~p~~V~v~~~~dg--~~~~~~~~~~~~~-~~~~~----~~~~~~dii~~~~~~------------ 246 (576) T protein:vir:96 186 FNKKNATTMDKFIAVDPSTIFYATDKNG--KIIKGGKRFVQVI-NKKVV----ASFTSREMAMGIRNP------------ 246 (576) T ss_pred EecCCCCceEEEEEeCCceeEEEECCCC--ceeeeeeEEEEec-CCceE----EEecccceEEEeecC------------ Confidence 6665 444 588899999988876532 2221111111111 11111 011122222111100 Q ss_pred ccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCCcccc Q lcl|NC_010179. 205 TSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYGGASL 282 (469) Q Consensus 205 ~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~~~~~ 282 (469) .+-+ .....|.|-++.+...+.....+..-..+.+...+.|-.++ .|....+. T Consensus 247 -----------------------~~d~--~~~~~G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~ 301 (576) T protein:vir:96 247 -----------------------RTEL--SSSGYGLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQ 301 (576) T ss_pred -----------------------CCCc--ccCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCH Confidence 0000 00124677777777777666666555666666666775444 34211111 Q ss_pred ---hhhhhhhhh--------cce-eeecccCCCCCCcceEEeec--CCHHHHHHHHHHHHHHHHHHhCCCCcCcc--ccC Q lcl|NC_010179. 
283 ---KQFMNDLRE--------YKS-IKINNAGNGDKSGVDKLQID--IPVEARDDALKITRDNIFLFGQGIDPANF--ESS 346 (469) Q Consensus 283 ---~~~~~~~~~--------~~~-~~~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~g 346 (469) ......+.. .++ +.+. .+++|.... .....+.+..+...+.|+..-++|+.-.. ..+ T Consensus 302 e~~~~lr~~~~~~~~G~~nag~~p~vl~-------~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~ 374 (576) T protein:vir:96 302 RALENFKREWKSSFSGINGSWQVPVVMA-------DDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRG 374 (576) T ss_pred HHHHHHHHHHHHHhccccccccceeecC-------CCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccc Confidence 111222211 111 1221 124454443 34556677778888999998888874221 111 Q ss_pred Ccc----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCCCCCCHHHHHHHHHHH- Q lcl|NC_010179. 347 NAS----GVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRTKVEDSLTKAQIVSTV- 418 (469) Q Consensus 347 ~~S----g~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~~p~d~~e~~~~~~kl- 418 (469) +.+ |.++.+. ... +.....+..+|.-+++.|...++.+ .. ...+.+.|.+.-+.+.++..+..... T Consensus 375 ~~~g~~~~~s~t~s--n~e---~~~~~f~~~tL~P~~~~ie~~ln~~Ll~~~-~~~~~~~f~r~d~~~~~e~~~~~~~~~ 448 (576) T protein:vir:96 375 GATGGKGGNTLNEA--DPG---KKQQQSQNKGLQPLLRFIEDLINTHIISEY-SDKYVFQFVGGDTKSELDKIKILQEEV 448 (576) T ss_pred cccccccccccccc--cHH---HHHHHHHHHHHHHHHHHHHHHHHhhhchhc-cCceEEEeccCCHHHHHHHHHHHHHHh Confidence 111 1111110 000 1112333344444444444433321 11 13456778777666666655544332 Q ss_pred hccCChHHHHHhCCC--CCCHHHH-----HHHHH-----------HHHHHhhhhHh---h-cccCCCCCCCCC Q lcl|NC_010179. 419 ANYSSKEAVAKANPI--VDDWQQE-----LKDLA-----------KDREENDPYAN---Q-ADELNGKGVDDE 469 (469) Q Consensus 419 ~g~iS~et~~~~l~~--v~d~~~E-----~eri~-----------~E~~~~~~~~~---~-~~~~~~~~~~de 469 (469) +|+++.-.+.+.++. +++-+.= +..+. .+++...+..+ . 
....+.....++ T Consensus 449 ~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 521 (576) T protein:vir:96 449 KTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTED 521 (576) T ss_pred cCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCCCCccccccccccccccCCCCCCCCCCCCCCC Confidence 589998888877643 2211100 00000 00000000000 0 000000000000 No 250 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=60.26 E-value=0.38 Score=22.91 Aligned_cols=361 Identities=9% Similarity=0.045 Sum_probs=153.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccc--------------------------cchh-hhcccccc Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRN--------------------------NGKP-KVSKEGKK 53 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~--------------------------~~~~-~~~~~~~~ 53 (469) |- +.++++|.-...... +... .....+.. T Consensus 1 Mg------------------------l~d~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 56 (431) T protein:vir:10 1 MG------------------------LFDFIRREKQPEAQARPHVEPSFQASTPTTSIPGETFEGLDDPRLKEYIRRGEL 56 (431) T ss_pred Cc------------------------chhhhhcCcccccccccccccccccccccccccccccccccchHHHHhhccCcc Confidence 22 111122210000000 0000 00000000 Q ss_pred ccccc-CcceeccchHHHHHHHHHHhhhcCCeee-ccCc---hhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeE Q lcl|NC_010179. 54 DPLRS-ADNRIPSNFYQLLVDQEAGYIASVFPDI-DVGK---DADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRA 122 (469) Q Consensus 54 ~~~~~-~~~ri~~n~~k~iv~~~~~~l~g~p~~~-~~~~---~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~ 122 (469) ..... ...-+.+.-....|+..++-+-+-|+.+ ..++ ......+..++.. 
| .+ +....+..+++.+|.+ T Consensus 57 ~g~~v~~~~al~~~~V~~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna 136 (431) T protein:vir:10 57 NGGTGRETRALRNMAVLRCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGES 136 (431) T ss_pred CcceechhhhhccHHHHHHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCe Confidence 00000 0000112223334455555555556654 2111 1122345555542 3 22 2234567788999999 Q ss_pred EEEEEEcCCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccc Q lcl|NC_010179. 123 WLHYWIDEDNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYN 202 (469) Q Consensus 123 ~~~v~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 202 (469) |+++-.+....+.+..++|..+.+..+.+ ..+. |......|... .+....+.+++.-. T Consensus 137 ~~~i~r~~g~~~~L~pl~~~~v~~~~~~~--~~~~-----y~~~~~~g~~~-----~~~~~dViHir~~~---------- 194 (431) T protein:vir:10 137 MARIVWSGNRPIRLIPMDRGSAKGRLTST--WQIV-----YDYTTPTGDKI-----ELPAREVFHLRDLS---------- 194 (431) T ss_pred EEEEEEcCCceEEEEEEcCceeEEEEcCC--CeEE-----EEEEeCCceEE-----EEchhhEEEecCcC---------- Confidence 99988875333567888898888776542 2221 21112122111 11222222221100 Q ss_pred ccccccccccccccccccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccc Q lcl|NC_010179. 203 IITSYDLSAGYETGQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASL 282 (469) Q Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~ 282 (469) .+...|.|-++-+...+........-..+.++..+.|-.++.-... .. 
T Consensus 195 -------------------------------~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-ls 242 (431) T protein:vir:10 195 -------------------------------IDGVSGVSRVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPKE-LS 242 (431) T ss_pred -------------------------------CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCCC-CC Confidence 0122466666666666655555555555556666677666554221 11 Q ss_pred hhhh----hhhh--------hcceeeecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccccCCc Q lcl|NC_010179. 283 KQFM----NDLR--------EYKSIKINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFESSNA 348 (469) Q Consensus 283 ~~~~----~~~~--------~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~ 348 (469) ++.. ..+. ..+++.++ .+.+|..... ....+.+..+.....|+..-++|+.-.....+. T Consensus 243 ~e~~~~~~~~~~~~~~g~~n~g~~~vl~-------~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~ 315 (431) T protein:vir:10 243 DNAYGRMKASVQENHTGSENAGSWMLLE-------EGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTS 315 (431) T ss_pred HHHHHHHHHHHHHHhcCccccCCceecC-------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCC Confidence 1211 1111 11222232 1234444333 334455556667788988888887533322222 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----CCCcccceEEeCCCCCCCHHHHHHHHHHHh--c- Q lcl|NC_010179. 349 SGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-----DADKRHISQHWTRTKVEDSLTKAQIVSTVA--N- 420 (469) Q Consensus 349 Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-----~~~~~~i~i~f~~~~p~d~~e~~~~~~kl~--g- 420 (469) ++..++.... ..+..+|.-.++.|...++.+ ......+++.++.-+-.|.++.++.+.++. | T Consensus 316 t~sn~eq~~~----------~f~~~tL~P~~~~ie~~ln~~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~ 385 (431) T protein:vir:10 316 WGSGIEQLAI----------FFIQYGLSHWFVSWEQAAARAFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGAGG 385 (431) T ss_pred ccccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhccc Confidence 2222222221 222333444444443333321 111223444444556678999999888762 3 Q ss_pred ---cCChHHHHHhCCC--CCCHHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 
421 ---YSSKEAVAKANPI--VDDWQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 421 ---~iS~et~~~~l~~--v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.-.+.++++. ++++... ++ ..|. .. ...+..+| T Consensus 386 ~~g~lT~NE~R~~~gl~p~~~~~gD--~~------~~p~----n~-~~~~~~~~ 426 (431) T protein:vir:10 386 QSPWMKQNEVREMLDLPRADDPVAD--QL------RNPM----TQ-KQKGSGDE 426 (431) T ss_pred ccCccCHHHHHHHhCCCCCCCcccc--ce------eccc----cc-ccCCCCCC Confidence 4888888877653 4443221 11 1111 11 12223333 No 251 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=59.87 E-value=0.38 Score=22.86 Aligned_cols=381 Identities=11% Similarity=0.090 Sum_probs=166.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH-hh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG-YI 79 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~-~l 79 (469) .+++..-+ .....+.+|+.+..+++-.. =...||+..+- =- T Consensus 61 ~~~e~~~~-------~~~eLI~~YR~ma~~pEvd~-------------------------------Av~eIVneaiv~d~ 102 (521) T protein:vir:81 61 YSTDQKIS-------TTKQLVNTYRGLMNNHEVEN-------------------------------AVQNIVNDAIVFEE 102 (521) T ss_pred cccccchh-------hHHHHHHHHHHHhhccchhh-------------------------------HHHHhhcceeEecC Confidence 22221100 11222334444433333221 11222322221 23 Q ss_pred hcCCeeeccCchhhHHHHHH--------HHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccceeEEE Q lcl|NC_010179. 
80 ASVFPDIDVGKDADNKKILD--------VLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPDQITPV 147 (469) Q Consensus 80 ~g~p~~~~~~~~~~~~~l~~--------~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~~~~~~ 147 (469) ...|+.+..++.+..+.+++ +++- +|.....+..+.+.+.|+-|.+.-+|++ |-..+..+||+.+..+ T Consensus 103 ~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~v 182 (521) T protein:vir:81 103 GHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQDMFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYV 182 (521) T ss_pred CCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeee Confidence 34566666655444443332 2221 4555677889999999999999887654 4456888999988766 Q ss_pred EeCCCCCceEEEEEEEEeeecCCceE---EEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccccc Q lcl|NC_010179. 148 YATTLDNKLLGVLRSYKQLDPEAGKY---FTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHN 224 (469) Q Consensus 148 ~d~~~~~~~~~~v~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (469) .-... .+..+... ...+-+|.+....+... +..+ ..+.. T Consensus 183 r~i~k-------------~~~~~~~v~~~~~e~f~Y~~~~~~~~~~-g~~~------------~~~~~------------ 224 (521) T protein:vir:81 183 REIIT-------------EDTPEGKIYKATKEYFIYTVGNSSYCAG-GQVF------------SPNSR------------ 224 (521) T ss_pred eeecc-------------cccCccceecceeeeeeeecCCcccccc-ceee------------cCCcc------------ Confidence 43211 11111110 11111222222211100 0000 00000 Q ss_pred CCcccE--EEecCC----ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh-- Q lcl|NC_010179. 225 FGRVPF--IEFPKN----KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL-- 289 (469) Q Consensus 225 ~g~vPv--v~~~n~----~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~-- 289 (469) -+||- |.|... ..+.-.+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. 
.+-...+ T Consensus 225 -vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~ 303 (521) T protein:vir:81 225 -VKIPRSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLEDAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQ 303 (521) T ss_pred -eeechhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHH Confidence 00110 111100 0111112223344444443 345555556666666443332222111 1111111 Q ss_pred -hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc--Cccc- Q lcl|NC_010179. 290 -REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP--ANFE- 344 (469) Q Consensus 290 -~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~- 344 (469) -.++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|-. ..++ T Consensus 304 k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP~sRl~~e~~ 382 (521) T protein:vir:81 304 SFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVTTLPGASGMSDID-DIRYFNRKLYEALRVPLSRSNLSDA 382 (521) T ss_pred hcCceeEeecccccccccccccchhhhhcccccCCCcccceeecccCCCCChHH-HHHHHHHHHHHHhCCccccccCCCC Confidence 01111111111 122233455555444555443 367778888888888842 2222 Q ss_pred --c--CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHH--- Q lcl|NC_010179. 
345 --S--SNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKA--- 412 (469) Q Consensus 345 --~--g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~--- 412 (469) + |..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...+ T Consensus 383 ~~~~~Gr~~--EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil 460 (521) T protein:vir:81 383 NMVIGGDGS--EITRDELEFSKFIRTRQSQFSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEIL 460 (521) T ss_pred cceeccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHH Confidence 2 3323 33344455556677778888888888877544333321 1223 3467777654444444433 Q ss_pred ----HHHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 413 ----QIVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 413 ----~~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++++.+. | .+|.+++.+.+=-.+| .++|-++|++|..+ +... +...+.| T Consensus 461 ~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~~~--~~~~------~p~~~~~ 519 (521) T protein:vir:81 461 ERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEEAND--PRFK------QTPDEIE 519 (521) T ss_pred HHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhC--CCCC------CCccccc Confidence 3444443 3 4799999987633343 34555556655432 2111 1111111 No 252 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=56.86 E-value=0.45 Score=22.50 Aligned_cols=392 Identities=8% Similarity=0.009 Sum_probs=140.3 Q ss_pred CCHHHHHHHHHHHHHHHH------HH--HHHHHHHHHHhccCC----cccccccchhhhcccccccccccCcceeccchH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRN------DL--INNYKKSVDYYENKT----DITTRNNGKPKVSKEGKKDPLRSADNRIPSNFY 68 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~------~~--~~~~~~~~~Yy~g~~----~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~ 68 (469) +.-......++.+-.... .. 
...+...-..|..-. .|-.-+...+.....+..... ...+-. T Consensus 15 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~a~~~~~v~~~v~~ia~~iA~lp~~v~~~~~~g~~~~~------~~~~~~ 88 (460) T protein:vir:10 15 LDNKFNDAFIKYIGQTFTKYDNNGKTYLEQGYNINPDVYSCISQMAAKTVAVPYTIKVVKDTKAYQQL------NNLNIS 88 (460) T ss_pred cCCCchHHHHHhhccccCCCccchhhhhHHHHhcchHHHHHHHHHHHhhhhCceEEEeccCCccchhh------hhhhhh Confidence 111111111110000000 00 000000000010000 000000000000000000000 001111 Q ss_pred HHHHHHHHHhhhcCCeeeccCchhhHHHHHHHHhc-cH----HHHHHHHHHHHHhCCeEEEEEEEcCC----CceE-EEE Q lcl|NC_010179. 69 QLLVDQEAGYIASVFPDIDVGKDADNKKILDVLGD-DR----ALTLNSLLVDSSNAGRAWLHYWIDED----NNFR-YGI 138 (469) Q Consensus 69 k~iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~-n~----~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~-i~~ 138 (469) ..........+...|+......+.. ...++.. |. .+....+..+.+.+|.+|+++..+.. |.+. +.+ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~~~~~G~~~~L~~ 165 (460) T protein:vir:10 89 TKGLYSFTQSLQKNRLDTKAFSETE---KAFPLESPNPTQTWADIYSLYKTYMRLNGNCYFYLMSPDDGINAGVPSQMYV 165 (460) T ss_pred hhhhHHHHHHhhcchhhhcccchhH---HHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCccCceeEEEEE Confidence 1111222223333333222222222 2223322 21 22234566788999999998877544 4554 788 Q ss_pred EccceeEEEEeCCCCCce-EEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccc Q lcl|NC_010179. 139 IQPDQITPVYATTLDNKL-LGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQ 217 (469) Q Consensus 139 ~~p~~~~~~~d~~~~~~~-~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 217 (469) ++|..+-+..+.+..... ...++.|... .+... ..+....+.+++....... 
T Consensus 166 l~~~~v~v~~~~~~~~~~~~~~~~~~~~~--~~g~~----~~~~~~evih~r~~~~~~~--------------------- 218 (460) T protein:vir:10 166 LPAHLIKIVLKDDINLLSTDSPIKSYMLI--QGDQF----IEFNEDEVIHTKYANPNFD--------------------- 218 (460) T ss_pred EcCceEEEEEcCCCceeeeeeeeeEEEEe--cCcee----EEecccceEEEecCCCCcc--------------------- Confidence 899998887654331110 0111111111 01110 1122223332221110000 Q ss_pred cccccccCCcccEEEecCCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhh----hhhhh--- Q lcl|NC_010179. 218 SNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQF----MNDLR--- 290 (469) Q Consensus 218 ~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~----~~~~~--- 290 (469) .....-.|.|.+..+...+........-..+.+...+.|-.++..... ..++. ...+. T Consensus 219 --------------~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~~-l~~e~~~~~~~~~~~~~ 283 (460) T protein:vir:10 219 --------------LQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGSTG-LTQPQADSLKQRLTEMD 283 (460) T ss_pred --------------cccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCCC-CCHHHHHHHHHHHHHHh Confidence 000012466777777777766666655555556666666555443221 11121 11111 Q ss_pred -----hcceeeecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccHHHHHHHHHHHH Q lcl|NC_010179. 291 -----EYKSIKINNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFES--SNASGVAIKMLYSHLE 361 (469) Q Consensus 291 -----~~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg~Al~~~~~~l~ 361 (469) ..+++.++ ++.+|.....+ ...+.+..+...+.|+..-++|+.-.... ++.++..++... T Consensus 284 ~g~~n~g~~~vl~-------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~---- 352 (460) T protein:vir:10 284 KSPDRLSQIAGAS-------GEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEER---- 352 (460) T ss_pred cCccccCCceecC-------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHH---- Confidence 11122222 23455444443 34556667777888988888886533211 111222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccc---C-CCcccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC-- Q lcl|NC_010179. 
362 LKAAKTQTYFEHAINELVRAIMRYLNFS---D-ADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI-- 433 (469) Q Consensus 362 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~-~~~~~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~-- 433 (469) ...+..+|.-++..|...++.+ . .......+.|+-.......+...+...+ +|+++.-.+.+.++. T Consensus 353 ------~~f~~~~l~P~~~~ie~~ln~kl~~~~~~~~~~~i~~d~~~l~~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~p 426 (460) T protein:vir:10 353 ------KRVVTDNIQPDLVILKQAFDKKFIKRFKGYENAVIEWDISELPEMQTDMVAMASWLNTIPVTPNEIRIAMKYET 426 (460) T ss_pred ------HHHHHHHHHHHHHHHHHHHHHhhcCcccccCCceEEeecchhhhHHHHHHHHHHHHhCCCCCHHHHHHHhCCCC Confidence 2233334444444444433321 1 1122334555322221122222222222 689999888888753 Q ss_pred CCCHHH-H------HHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 434 VDDWQQ-E------LKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 434 v~d~~~-E------~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) +++.-. + +..+.+ ..+ +...++..-++ T Consensus 427 i~~~~gD~~~~~~n~~~~~~-------~~~--~~~~~~~nq~~ 460 (460) T protein:vir:10 427 LNQDGMDIVFMPSNKVRIDD-------VSN--NLIDSAFNQNQ 460 (460) T ss_pred CCCCCCCeeeecccccchhh-------ccc--ccCCCcccCCC Confidence 222111 1 111110 000 00011111111 No 253 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=52.53 E-value=0.55 Score=22.00 Aligned_cols=382 Identities=12% Similarity=0.113 Sum_probs=166.1 Q ss_pred CCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHH Q lcl|NC_010179. 1 MEL----DALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~----~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~ 76 (469) -++ ..-.+|| .+|+.+..+++-.. 
=...||+..+ T Consensus 65 ~~~e~~~~~~~eLI-----------~~YR~ma~~pEvd~-------------------------------Av~eIVneaI 102 (524) T protein:vir:98 65 SGQDPAIQNKEQLI-----------NTYRGIMSYPEVEN-------------------------------AVSEIIDDAI 102 (524) T ss_pred cccccccchHHHHH-----------HHHHHHhhccchhh-------------------------------HHHhhhccee Confidence 111 2223333 33444433333221 1122333222 Q ss_pred -HhhhcCCeeeccCchhhHHHHH--------HHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC---CceEEEEEccce Q lcl|NC_010179. 77 -GYIASVFPDIDVGKDADNKKIL--------DVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED---NNFRYGIIQPDQ 143 (469) Q Consensus 77 -~~l~g~p~~~~~~~~~~~~~l~--------~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~---~~~~i~~~~p~~ 143 (469) .--...|+.+..++.+..+.++ .+++- +|.....+..+...+.|+-|.+.-+|++ |-..+..+||+. T Consensus 103 v~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~ 182 (524) T protein:vir:98 103 VNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDNMGARLFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRC 182 (524) T ss_pred EecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCcc Confidence 2223466776665554433333 23221 4556677889999999999999887654 445688889988 Q ss_pred eEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccc Q lcl|NC_010179. 144 ITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKH 223 (469) Q Consensus 144 ~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 223 (469) +-+|..--.. .+..+++.+. + ..-+-+|......+ ...+ .. . ..+.. T Consensus 183 i~~vr~~~~~-~~~~~~~v~~-----~---~~e~f~Y~~~~~~~-~~~g-~~--~---------~~~~~----------- 229 (524) T protein:vir:98 183 MELIRESITE-TLDGGVKVFR-----G---YREFFVYSAPKAGY-TYNG-QI--Y---------QANQK----------- 229 (524) T ss_pred ceeeeecccc-ccccchhhcc-----c---eeeeeeeccCCCcc-cccc-ce--e---------cCCCc----------- Confidence 7665432110 0111111100 0 01111222111111 0000 00 0 00000 Q ss_pred cCCccc---EEEecCC--ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc-----hhhhhhh-- Q lcl|NC_010179. 
224 NFGRVP---FIEFPKN--KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL-----KQFMNDL-- 289 (469) Q Consensus 224 ~~g~vP---vv~~~n~--~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~-----~~~~~~~-- 289 (469) -+|| |++.--. +.+...+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. .+-...+ T Consensus 230 --ikI~~dAIvy~hSGL~d~~~~iisyLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~ 307 (524) T protein:vir:98 230 --IKIPRSAIVYAHSGLEDCSNNIIGYLHRAVKPANQLRLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQ 307 (524) T ss_pred --eeechhheeeeccCcccCCCCeeeehhHhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHH Confidence 0111 1211100 0111122333444455554 345556666666666443333222111 1111111 Q ss_pred -hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCC--cC-cc- Q lcl|NC_010179. 290 -REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGID--PA-NF- 343 (469) Q Consensus 290 -~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~-~~- 343 (469) -.++++.=... +.+.+-.+..|....++.... .++-+.+-+|+...+|- +. .. T Consensus 308 k~kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEItTLpggqnlgem~-DV~YF~kkLy~aLnVP~sRl~~~~~ 386 (524) T protein:vir:98 308 GLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMD-DIKWFNRKLYEALRVPLSRMPRDDG 386 (524) T ss_pred hcCceeEeeccCceeeccccccchhhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCCceeccCCCC Confidence 01111111111 122233455555444555443 36777888888888884 22 11 Q ss_pred --ccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCCCHHHHH---- Q lcl|NC_010179. 
344 --ESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKA---- 412 (469) Q Consensus 344 --~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~d~~e~~---- 412 (469) .+|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.-.+...+ T Consensus 387 ~f~~Gr~~--EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~ 464 (524) T protein:vir:98 387 GMQIGGGG--EITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILE 464 (524) T ss_pred cccccccc--chhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHH Confidence 234433 34444555556677778888888888877544333321 1223 3467777654444444333 Q ss_pred ---HHHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 413 ---QIVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVDDE 469 (469) Q Consensus 413 ---~~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~de 469 (469) ++++.+. | .+|.+++.+.+=-.+| .+++-++|++|.++ +.. . ......|+ T Consensus 465 ~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E~k~--~~~---~--~p~~e~~~ 523 (524) T protein:vir:98 465 RRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEESKE--ERF---K--NPEAEEEN 523 (524) T ss_pred HHHHHHHHhccccccccchHHHHHHHhccCHHHHHHHHHHHHHHHhC--CCC---c--CCcccccc Confidence 3444443 4 5999999988644444 23334444444321 111 1 01111111 No 254 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=51.45 E-value=0.58 Score=21.88 Aligned_cols=369 Identities=9% Similarity=-0.011 Sum_probs=145.6 Q ss_pred CCcccccccchhhhcccccccccccC-cceeccchHHHHHHHHHHhhhcCCeeeccCch--hhHHHHHHHHhc--c-HH- Q lcl|NC_010179. 
34 KTDITTRNNGKPKVSKEGKKDPLRSA-DNRIPSNFYQLLVDQEAGYIASVFPDIDVGKD--ADNKKILDVLGD--D-RA- 106 (469) Q Consensus 34 ~~~i~~~~~~~~~~~~~~~~~~~~~~-~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~--~~~~~l~~~~~~--n-~~- 106 (469) ...++....... .+.-....... ..-...+....-|+..++=+-+-|+.+--.+. .....+..++.. | .+ T Consensus 1 ~~~~~~~~g~~~---~~~~~~~~~~~~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~~~~~l~~lL~~~PN~~~t 77 (723) T protein:vir:94 1 MTTFPSGAGGWN---AWSADSVFGNGAKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELDELHPLSQLWNVMPNRAMP 77 (723) T ss_pred CcccccCCCccc---cccccccccccHHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccchhhHHHHHHhhCCCCCCC Confidence 111111111000 00000000000 00011122223344444444445655432221 122335555542 3 22 Q ss_pred --HHHHHHHHHHHhCCeEEEEEEEcCC---Cce-EEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEE Q lcl|NC_010179. 107 --LTLNSLLVDSSNAGRAWLHYWIDED---NNF-RYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYW 180 (469) Q Consensus 107 --~~~~~~~~~~~~~G~~~~~v~~d~~---~~~-~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 180 (469) .....+..++..+|.+|+.+-.+.. |.+ .+..++|+.+.++...+...........|.....+|... .+ T Consensus 78 ~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~~~~~y~~~~~~G~~~-----~~ 152 (723) T protein:vir:94 78 AQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQAQIIGYVIERTDGVRV-----PV 152 (723) T ss_pred HHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceeeeeeEEEEEecCceeE-----Ee Confidence 2233456678889999988765432 443 356666665555443322111111111111111111000 01 Q ss_pred cCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC-----CccccccHHHHHHHHHHHHH Q lcl|NC_010179. 181 TDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK-----NKYRLAELNKYKGLIDAYDD 255 (469) Q Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~~~g~~~~~~v~~liD~~~~ 255 (469) .... |+||+. .-.|.|.+......|..... 
T Consensus 153 ~~~d---------------------------------------------IiHir~~~~~dg~~G~Spi~~a~~~i~~~~a 187 (723) T protein:vir:94 153 LADE---------------------------------------------MLWLRFSDPYDPLAVMAPWKAARAAVDADFY 187 (723) T ss_pred cccc---------------------------------------------eEEecCCCCCCCcccccHHHHHHHHHHHHHH Confidence 1111 233321 22467777666666655554 Q ss_pred HHHHHHHHHHHhcCceeEEecCCcccc---hhhhhhhh--------hcceeeecccCCC---CCCcceEEeecCC--HHH Q lcl|NC_010179. 256 IYNGFINDLDDVQTVILVLTNYGGASL---KQFMNDLR--------EYKSIKINNAGNG---DKSGVDKLQIDIP--VEA 319 (469) Q Consensus 256 ~~s~~~~~~~~~~~p~l~~~g~~~~~~---~~~~~~~~--------~~~~~~~~~~~~~---~~~~~~~l~~~~~--~~~ 319 (469) +..-..+.+...+.|-.++.. +..+. ......+. ..+.+.+...+.+ .+.+.+|.....+ ... T Consensus 188 a~~~~~~~f~NG~~p~giL~~-~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~vl~~G~~~~~l~~s~~D~q 266 (723) T protein:vir:94 188 AATWQRQSFKNGARPGGVVNL-GDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGAAGKGATFTSLSMSPAEMD 266 (723) T ss_pred HHHHHHHHHhcCCCcceEEEc-CCCCHHHHHHHHHHHHHHhhchhhcCcceeecccccccccccCCceEEEccCCHHHHH Confidence 444445555566667666653 21111 11111111 1223333221111 0123455444443 344 Q ss_pred HHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccc Q lcl|NC_010179. 320 RDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHI 396 (469) Q Consensus 320 ~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i 396 (469) +.+......+.|...-++|+....+.++-|.. .... ...+...|.-.++.|...++.+ .. ...+ T Consensus 267 ~le~r~~~~~eIa~afgVPp~~i~~~st~sN~--e~~~----------~~f~~~tL~P~~~~ie~~ln~~Ll~~~-g~~~ 333 (723) T protein:vir:94 267 YINSRMHSAEEVMLAFGIRKDALLGGSTYENQ--AEAK----------AAVWTETLIPQMEVMASITDLQLLPDI-GWTV 333 (723) T ss_pred HHHHHHHhHHHHHHHhCCChhHcCCCCCcccH--HHHH----------HHHHHHHHHHHHHHHHHHHhHhhcccc-cCce Confidence 55556667778888888886433221111111 1111 1122334444444444333321 11 1246 Q ss_pred eEEeCCC--CCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHH-----------------------HHHHHH- Q lcl|NC_010179. 
397 SQHWTRT--KVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQE-----------------------LKDLAK- 446 (469) Q Consensus 397 ~i~f~~~--~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E-----------------------~eri~~- 446 (469) .+.|+.. +..|.++.++.+.++ +|+++.-.+.+.++. +++-+.. -.|+.. T Consensus 334 ~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~ 413 (723) T protein:vir:94 334 EWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQMTLTPYRAQFAPAPAPAPAVEEGAARMLAL 413 (723) T ss_pred EEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCcccceeccccccccCCCCCCccchhhhHhhhhh Confidence 6777653 457888899988876 689998888877643 2211110 001110 Q ss_pred -H-HHHhhhhHhhcc---c--CCCCCCCCC Q lcl|NC_010179. 447 -D-REENDPYANQAD---E--LNGKGVDDE 469 (469) Q Consensus 447 -E-~~~~~~~~~~~~---~--~~~~~~~de 469 (469) | ..+..|..+.-. . ..+.+-+.+ T Consensus 414 ~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 443 (723) T protein:vir:94 414 LERVAADRPLPELPVRATTVLHHDPGPDPQ 443 (723) T ss_pred ccccccccCcCCCCCCCCCCCCCCcccCCc Confidence 0 000011111000 0 011111111 No 255 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=45.44 E-value=0.77 Score=21.21 Aligned_cols=366 Identities=10% Similarity=0.062 Sum_probs=139.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+.. ++..+.. --++..+.+. .+..+.+.... ...+.-.| ..-...-|+..++-+. 
T Consensus 1 mg~~~-------~~~~~~~---~~~~~~~~~~---~~~~~~~~~~~--------~t~~~~~~--~~~v~~cv~~Ia~~ia 57 (403) T protein:vir:10 1 MGFKS-------WITEKLN---PGQRIIRDME---PVSHRTNRKPF--------TTGQAYSK--IEILNRTANMVIDSAA 57 (403) T ss_pred Ccchh-------hhhhccc---hhhhhhhccc---ccccccCCccc--------ccHHHHHH--HHHHHHHHHHHHHHHh Confidence 43321 1111100 0001111111 11000000000 00000001 1111122333444444 Q ss_pred cCCeeecc-------CchhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccceeEEE Q lcl|NC_010179. 81 SVFPDIDV-------GKDADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQITPV 147 (469) Q Consensus 81 g~p~~~~~-------~~~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~~ 147 (469) .-|..+.. .+......+..++.. | .+ +....+...++.+|.+|++. +.. .+..++|..+.+. T Consensus 58 ~~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~~~---~l~~l~~~~~~v~ 132 (403) T protein:vir:10 58 ECSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--DGT---SLYHVPAALMQVE 132 (403) T ss_pred hCceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--eCc---eeEeecCcceEEE Confidence 44544321 111122334555543 3 22 22335567788899998644 221 2445555554443 Q ss_pred EeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCc Q lcl|NC_010179. 148 YATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGR 227 (469) Q Consensus 148 ~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~ 227 (469) .+.. .. +++|. . ++.. . +....+.++ .. T Consensus 133 ~~~~---~~---~~~~~-~---~~~~----~-~~~~eiih~-------------------------------------~~ 160 (403) T protein:vir:10 133 ADAN---KF---IKKFI-F---NNQI----N-YRVDEIIFI-------------------------------------KD 160 (403) T ss_pred EcCC---ce---EEEEE-e---cCce----e-ecccceEEe-------------------------------------cc Confidence 3221 11 11110 0 0000 0 000111111 00 Q ss_pred ccEEEec-CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcccchhhhhh----hhh--------cce Q lcl|NC_010179. 
228 VPFIEFP-KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGASLKQFMND----LRE--------YKS 294 (469) Q Consensus 228 vPvv~~~-n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~~~~~~~~~----~~~--------~~~ 294 (469) ..+++.. +...|.|.+..+...++..+.+..-..+.+...+.|-.+++..... .++.... +.. .++ T Consensus 161 ~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~l-~~e~~~~~~~~~~~~~~g~~n~g~~ 239 (403) T protein:vir:10 161 NSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDEIL-NKKLRERKQEELQLDYNPSTGQSSV 239 (403) T ss_pred cccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCC-CHHHHHHHHHHHHHHhCCcccCcce Confidence 1111111 2335777777777777766666655566666666776666643221 1222211 111 112 Q ss_pred eeecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 295 IKINNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFE 372 (469) Q Consensus 295 ~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~ 372 (469) +.++. +.+++.++...+ ...+.+..+...+.|+..-++|+.-.....+.+-+. .....+. T Consensus 240 ~vl~~-----g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~sn~e~-------------~~~~f~~ 301 (403) T protein:vir:10 240 LILDG-----GMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNNANIRP-------------NIELFYY 301 (403) T ss_pred eecCC-----CceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcCHHH-------------HHHHHHH Confidence 22221 123344432222 234455566778889888888875332211111111 1122233 Q ss_pred HHHHHHHHHHHHHhcccCCCcccceEEeCCC--CCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHHHH- Q lcl|NC_010179. 373 HAINELVRAIMRYLNFSDADKRHISQHWTRT--KVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKDLA- 445 (469) Q Consensus 373 ~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~--~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~eri~- 445 (469) ..|.-.++.|...++.+= ...+.+.++.- +-.|....++++.++ +|+++.-++.+.++. ++++ ...+.. 
T Consensus 302 ~tl~P~~~~ie~~l~~~L--~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~lT~NE~R~~~gl~pi~~~--~~d~~~~ 377 (403) T protein:vir:10 302 MTIIPMLNKLTSSLTFFF--GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGIITGNEARSELNLEPLDDE--QMNKIRI 377 (403) T ss_pred HHHHHHHHHHHHHHHHhc--CceeeeccchhhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcc--ccccccc Confidence 334444444443333211 11233333322 445778888888876 689999998888754 3321 221111 Q ss_pred -HHHHH-hhhhHhhcccCCCCCCCCC Q lcl|NC_010179. 446 -KDREE-NDPYANQADELNGKGVDDE 469 (469) Q Consensus 446 -~E~~~-~~~~~~~~~~~~~~~~~de 469 (469) ..-+. ..+........++.+.+.| T Consensus 378 p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 378 PANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred ccccccccccCCCCcCCCCCCCcCCC Confidence 11000 0111111111122222223 No 256 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=43.87 E-value=0.83 Score=21.03 Aligned_cols=339 Identities=12% Similarity=0.024 Sum_probs=126.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |-+ ++++..+ +..+..-... ....... ...-+...-....|+..++-+. T Consensus 1 Mg~------f~~l~~~-----------------~~~~~~~~~~-----~~~~~~~---~~~~l~~~~v~~~i~~Ia~~ia 49 (376) T protein:vir:78 1 MGF------FSELFKR-----------------NKEIEWMWDL-----DFLEDKT---TKVYLKKMALNTCVKHIARTIA 49 (376) T ss_pred Cch------hhhhhcc-----------------CCccccccch-----hhccccc---hhhhhhhHHHHHHHHHHHHhhc Confidence 443 1111100 0000000000 0000000 0000112333445555565555 Q ss_pred cCCeeeccCchhhHHHHHHHHhc--c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCC Q lcl|NC_010179. 
81 SVFPDIDVGKDADNKKILDVLGD--D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLD 153 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~--n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~ 153 (469) +-|+.+.-........+..++.. | .+ +....+....+.+|.+|+++..+..+.+. ...+.+..+.+ T Consensus 50 ~~p~~~~~~~~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~~~~~~~~~~~~~------- 122 (376) T protein:vir:78 50 KSDFRLKNGETSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIADSYVRKEFAFFP------- 122 (376) T ss_pred ccceeeccccccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeeccceeecccceee------- Confidence 55665543333334445555542 3 22 22345667788899999887766655431 11112211111 Q ss_pred CceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEe Q lcl|NC_010179. 154 NKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEF 233 (469) Q Consensus 154 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~ 233 (469) ..++.....+. . ....+....+. || T Consensus 123 ------~~~~~~~~~~~-~---~~~~~~~~evi---------------------------------------------h~ 147 (376) T protein:vir:78 123 ------DVFEGVTVKDY-R---YNRNFSMDDVI---------------------------------------------FL 147 (376) T ss_pred ------eeeeeeeeecc-e---eeeeeccccEE---------------------------------------------Ee Confidence 11111000000 0 00011122222 22 Q ss_pred cC-CccccccHHHHHHHHHHHHHHHHHHHHHHHH--hcCceeEEecCCcccchhhhh----hhh-h-------cc-eeee Q lcl|NC_010179. 234 PK-NKYRLAELNKYKGLIDAYDDIYNGFINDLDD--VQTVILVLTNYGGASLKQFMN----DLR-E-------YK-SIKI 297 (469) Q Consensus 234 ~n-~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~--~~~p~l~~~g~~~~~~~~~~~----~~~-~-------~~-~~~~ 297 (469) +. ...+.+ ...+++..+..+.......... ...+.+++.. .....++... .+. . .. 
++.+ T Consensus 148 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~g~~~~~~~v~~l 223 (376) T protein:vir:78 148 EYGNERLSA---FTDGMFEDYGELFGKMIRAQMRNFQIRGAVNFKM-AGVADKDKQTKLQEYIDKVYASFNNNEIAIVPQ 223 (376) T ss_pred ccCCCCchh---hhhHHHHHHHHHHHHHHHHHHhcCCCceeEEEcc-CCCCCHHHHHHHHHHHHHHhccccccCcceEEc Confidence 11 111111 1123333344333333333322 2234444432 1111111111 111 0 00 1112 Q ss_pred cccCCCCCCcceEEeecCC-------HHHHHHHHHHHHHHHHHHhCCCCcCccc-cCCccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 298 NNAGNGDKSGVDKLQIDIP-------VEARDDALKITRDNIFLFGQGIDPANFE-SSNASGVAIKMLYSHLELKAAKTQT 369 (469) Q Consensus 298 ~~~~~~~~~~~~~l~~~~~-------~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~g~~Sg~Al~~~~~~l~~k~~~~~~ 369 (469) + .+++|-....+ ...+.+..+...+.|+..-++|+.-..+ .+|.+... .. T Consensus 224 ~-------~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s~~e~~~---------------~~ 281 (376) T protein:vir:78 224 L-------EGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMADLSNNM---------------KA 281 (376) T ss_pred C-------CCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCHHHHH---------------HH Confidence 1 22344332221 1245666677778888888888753321 12222211 22 Q ss_pred HHHHHHHHHHHHHHHHhcccCCCcc--cceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCC--CCCHHHHHHH Q lcl|NC_010179. 370 YFEHAINELVRAIMRYLNFSDADKR--HISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAKANPI--VDDWQQELKD 443 (469) Q Consensus 370 ~~~~~l~~~~~~i~~~~~~~~~~~~--~i~i~f~~~~p~d~~e~~~~~~kl--~g~iS~et~~~~l~~--v~d~~~E~er 443 (469) .+..+|.-.++.|...++.+=.... .+...|..-+-.|.++.++++.++ .|+++.-.+.+.++. +++.. .++ T Consensus 282 f~~~~l~P~~~~ie~~l~~kll~~~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~~p~~~g~--~d~ 359 (376) T protein:vir:78 282 YMEYCIDPLTKKLEDELNAKLFTFSEFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGAERVDNPE--LDK 359 (376) T ss_pred HHHHHHHHHHHHHHHHHHhhhCCcccceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCC--Cce Confidence 3334444444444444432211111 222333334556888999999886 588999888888754 22221 111 Q ss_pred HHHHHHHhhhhHhhcccCCCCC Q lcl|NC_010179. 
444 LAKDREENDPYANQADELNGKG 465 (469) Q Consensus 444 i~~E~~~~~~~~~~~~~~~~~~ 465 (469) .--- .+....++.+.+| T Consensus 360 ~~~~-----~n~~~~~~~~e~g 376 (376) T protein:vir:78 360 YLIT-----KNYQSADEGGEDG 376 (376) T ss_pred eeec-----cCceehhccccCC Confidence 1000 0000111111111 No 257 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=39.55 E-value=1 Score=20.55 Aligned_cols=391 Identities=11% Similarity=0.101 Sum_probs=165.9 Q ss_pred CCH--HHHH------HHH---HHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHH Q lcl|NC_010179. 1 MEL--DALK------KLI---RNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQ 69 (469) Q Consensus 1 ~~~--~~~~------~~i---~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k 69 (469) ++. ..+. .+. .--.......+.+|+.+..+++-. +=.. T Consensus 45 I~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR~ma~~pEvd-------------------------------~Av~ 93 (524) T protein:vir:10 45 IETQEQNIPYNALMQQMFGSNEPEVKNTRELIDTYRNLMNNYEVD-------------------------------NAVQ 93 (524) T ss_pred eccCcccccchhhhhhhhhcccchhhhHHHHHHHHHHHhhccchh-------------------------------hHHH Confidence 000 0000 000 000011122233444443333322 1112 Q ss_pred HHHHHHHH-hhhcCCeeeccCchhhHHHHH--------HHHhc-cHHHHHHHHHHHHHhCCeEEEEEEEcCC----CceE Q lcl|NC_010179. 70 LLVDQEAG-YIASVFPDIDVGKDADNKKIL--------DVLGD-DRALTLNSLLVDSSNAGRAWLHYWIDED----NNFR 135 (469) Q Consensus 70 ~iv~~~~~-~l~g~p~~~~~~~~~~~~~l~--------~~~~~-n~~~~~~~~~~~~~~~G~~~~~v~~d~~----~~~~ 135 (469) .||+..+- =-...|+.+..++.+..+.++ .+++- +|.....+..+.+.+.|+-|.+.-+|.+ |-.. 
T Consensus 94 eIVneaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~E 173 (524) T protein:vir:10 94 EIVSDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKDGVQE 173 (524) T ss_pred HhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCcccccee Confidence 23332222 233466676665544333332 23321 4556677889999999999998877643 4556 Q ss_pred EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccc Q lcl|NC_010179. 136 YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYET 215 (469) Q Consensus 136 i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (469) +..+||+.+-.+.- .......+... ++.. ..+|.+..+....... ...+..... T Consensus 174 lr~lDPr~i~~vr~-------------i~~~~~~~~~v------i~~~-~e~f~Y~~~~~~~~~~---~~~~~~~~~--- 227 (524) T protein:vir:10 174 LRRLDPRQVQYIRE-------------IVTRMEDGVKI------VDGY-REFFVYDTGHESYCAD---GRIYSAGTK--- 227 (524) T ss_pred eeeeCCccceeeee-------------ecccCcccchh------hcch-hhheeecCCCcccccC---cceecCCcc--- Confidence 88888887654432 11111111111 1110 0111111100000000 000000000 Q ss_pred cccccccccCCccc---EEEecCC---ccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCcccc----- Q lcl|NC_010179. 216 GQSNTLKHNFGRVP---FIEFPKN---KYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGASL----- 282 (469) Q Consensus 216 ~~~~~~~~~~g~vP---vv~~~n~---~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~~----- 282 (469) -+|| |++.--. ..+.-.+.-+..-|..+|. ++-+.+-..+..+.|-.-+.-.+..+. T Consensus 228 ----------ikI~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnlPk~KA 297 (524) T protein:vir:10 228 ----------VKIPRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTGNMPSRKA 297 (524) T ss_pred ----------eecchhheeeeccCcccCCCCceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhH Confidence 0121 2221110 1111122233444444554 345555566666666443332222111 Q ss_pred hhhhhhh---hhcceeeeccc---------------------CCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_010179. 
283 KQFMNDL---REYKSIKINNA---------------------GNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGI 338 (469) Q Consensus 283 ~~~~~~~---~~~~~~~~~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p 338 (469) .+-...+ -.++++.-... +.+.+-.+..|....++.... .++-+.+-+|+...+| T Consensus 298 eqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~-DV~YF~kkLy~aLnVP 376 (524) T protein:vir:10 298 AAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMD-DVLYFRTALYRALRIP 376 (524) T ss_pred HHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCccceeeccccCCcChHH-HHHHHHHHHHHHhCCC Confidence 1111111 11222221111 112233455555444555443 3677788888888888 Q ss_pred Cc--Ccc---c--cCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-CCCc----ccceEEeCCCCCC Q lcl|NC_010179. 339 DP--ANF---E--SSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVE 406 (469) Q Consensus 339 ~~--~~~---~--~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-~~~~----~~i~i~f~~~~p~ 406 (469) -. ..+ + +|..| .|..-.......+.+.+..|..-+.++++.=+-+-++- ..+| ..|.+.|...-.- T Consensus 377 ~sRl~~e~~~~f~~gr~~--EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f 454 (524) T protein:vir:10 377 ESRIPSESNSGVMFDAGT--AITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYF 454 (524) T ss_pred chhccCCCCccccccccc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchH Confidence 42 212 1 23333 33344455556677778888888888877544433321 1223 3467777654444 Q ss_pred CHHHHHH-------HHHHHh---c-cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHhhhhHhhcccCCCCCCC-CC Q lcl|NC_010179. 407 DSLTKAQ-------IVSTVA---N-YSSKEAVAKANPIVDD--WQQELKDLAKDREENDPYANQADELNGKGVD-DE 469 (469) Q Consensus 407 d~~e~~~-------~~~kl~---g-~iS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~-de 469 (469) .+...++ +++.+. | .+|.+++.+.+=-.+| .+++-++|++|..+. .. 
.+...+ |+ T Consensus 455 ~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~--~~------~~~~~~~~~ 523 (524) T protein:vir:10 455 SEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEA--RF------QNPDEEEED 523 (524) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcC--CC------CCCChhhhc Confidence 4444333 344443 3 4799999987633343 345555566554321 11 001111 11 No 258 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=34.29 E-value=1.3 Score=19.96 Aligned_cols=281 Identities=8% Similarity=0.006 Sum_probs=98.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-.++-..+. . -..-+||+- ++... .+++.+ ...+ + - T Consensus 35 ~~~~~~~~~~~----~--------~~~~~~~~p--p~~~~-------------------------~la~~~-~a~~-~-h 72 (344) T protein:vir:56 35 LDRRDILDYVE----C--------ISNGRWYEP--PVSFT-------------------------GLAKSL-RAAV-H-H 72 (344) T ss_pred cCcchhhhHHH----h--------hhcCccccC--CCCHH-------------------------HHHHHH-hhhh-h-h Confidence 44333322221 1 111234442 11000 011110 0000 0 0 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HH--HHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCce Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDD-RA--LTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKL 156 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~--~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~ 156 (469) +-++.+.. + .+...+.-| .+ ..+..++.+.+.+|.+|+.+-.+..|++. +..++|..+-+.-+.. 
T Consensus 73 ~s~i~~k~-----n-~l~~~~~Pnp~~t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~----- 141 (344) T protein:vir:56 73 SSPIYVKR-----N-ILASTFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED----- 141 (344) T ss_pred Cccceehh-----h-hHHhhcCCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCceeEEeecCC----- Confidence 00111000 0 000011111 00 12344566778899999998888888753 6666666554322110 Q ss_pred EEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC- Q lcl|NC_010179. 157 LGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK- 235 (469) Q Consensus 157 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n- 235 (469) ++|... ..+..+ .+..+. |+++++ T Consensus 142 ----~~~~~~-~~g~~~-----~~~~~d---------------------------------------------IiHir~~ 166 (344) T protein:vir:56 142 ----VYWWVP-SFNEPT-----AFAPGS---------------------------------------------VFHLLEP 166 (344) T ss_pred ----EEEEEe-cCCeEE-----EEcCcc---------------------------------------------EEEECCC Confidence 011110 001000 011111 233321 Q ss_pred ----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCCcc--cchhhhhhhhhc------ceeee-ccc Q lcl|NC_010179. 236 ----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYGGA--SLKQFMNDLREY------KSIKI-NNA 300 (469) Q Consensus 236 ----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~~~--~~~~~~~~~~~~------~~~~~-~~~ 300 (469) .-.|.|.+.....-++.-+.+..-..+.+...+.|-.++ +|.... ..+.....+... +.+.+ .++ T Consensus 167 ~~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d~~ls~e~~~~lk~~~~~~~g~~~~r~l~l~~p~ 246 (344) T protein:vir:56 167 DINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQ 246 (344) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCCCCccceEEecCC Confidence 124666665444333322222112223334445564444 442111 111222222211 22222 222 Q ss_pred CCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCccc-------cCCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
301 GNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFE-------SSNASGVAIKMLYSHLELKAAKTQTYF 371 (469) Q Consensus 301 ~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-------~g~~Sg~Al~~~~~~l~~k~~~~~~~~ 371 (469) + ++.++++..... ....+.+..+.....|...-++|+.-..- +|++...++.+....+.=.+ T Consensus 247 g--~~~G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~llGi~~~~t~~~~n~eq~~~~f~~~tL~Pl~------- 317 (344) T protein:vir:56 247 G--KADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQ------- 317 (344) T ss_pred C--CccceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHHHhccCCCCCCccccHHHHHHHHHHHHHHHHH------- Confidence 2 233455554332 34456677777888899999998753321 12222222222211111111 Q ss_pred HHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHH Q lcl|NC_010179. 372 EHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLT 410 (469) Q Consensus 372 ~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e 410 (469) ..++++. ..++.. .+.|.+..-..... T Consensus 318 -~~ie~~n----~~l~~~-------~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 318 -DRIREIN----GWIGQE-------VIRFKNYSLDTDNG 344 (344) T ss_pred -HHHHHHH----hhhccc-------cccCCCccccccCC Confidence 1111111 122211 13343332221111 No 259 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=34.18 E-value=1.3 Score=19.94 Aligned_cols=369 Identities=11% Similarity=0.021 Sum_probs=142.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccc-chhhhccccccccc-ccCccee--ccchHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNN-GKPKVSKEGKKDPL-RSADNRI--PSNFYQLLVDQEA 76 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~-~~~~~~~~~~~~~~-~~~~~ri--~~n~~k~iv~~~~ 76 (469) |- -+.++.. . ....-.... .............. 
......+ .++.....|+..+ T Consensus 1 Mg-----------------~~~~~~~-----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~i~~ia 57 (423) T protein:vir:81 1 MG-----------------FLQKLGL-----A-PSVVATPEPIELVGPIFESLKLSTKNMTVEQIWEDQPHLRTVTTFIA 57 (423) T ss_pred Cc-----------------hhHhhcc-----c-cccccCccccccccccccccccccchhhHHHHHHhhhHHHHHHHHHH Confidence 11 1111100 0 000000000 00000000000000 0000000 1233445666666 Q ss_pred HhhhcCCeee---ccCc--h-hhHHHHHHHHhc-c-H---HHHHHHHHHHHHhCCeEEEEEEEcCCCceEEEEEccce-- Q lcl|NC_010179. 77 GYIASVFPDI---DVGK--D-ADNKKILDVLGD-D-R---ALTLNSLLVDSSNAGRAWLHYWIDEDNNFRYGIIQPDQ-- 143 (469) Q Consensus 77 ~~l~g~p~~~---~~~~--~-~~~~~l~~~~~~-n-~---~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~-- 143 (469) +-+-+-|+.+ ..+. + .....+..++.. | . .+....+..+...+|.+|.++..+..+...+..+.|.. T Consensus 58 ~~ia~lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~~~~~~l~p~~~~ 137 (423) T protein:vir:81 58 RNVASLQLQAFERVEDGGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVDTPTLDIRPIPVS 137 (423) T ss_pred HhHhhCceEEEEEecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcCcceEEEeecccc Confidence 6666666653 1111 1 122334555543 3 2 22334566788899999988876654333332232222 Q ss_pred -eEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccccccccccccccccccc Q lcl|NC_010179. 144 -ITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLK 222 (469) Q Consensus 144 -~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 222 (469) +.+.........+.+-+ ......+| .. ++ +.... T Consensus 138 ~v~~~~~~~~~~~~~Y~~--~~~~~~~g-~~---~~-~~~~e-------------------------------------- 172 (423) T protein:vir:81 138 WVQRRAYKDGWGSLDYII--IESGDNDG-RS---VK-VPGER-------------------------------------- 172 (423) T ss_pred eeeeeeccCCCcceEEEE--EEecCCCc-eE---EE-Ecccc-------------------------------------- Confidence 21111100001110000 00000000 00 00 11111 Q ss_pred ccCCcccEEEecC----C-ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCC----cccchhh----hhhh Q lcl|NC_010179. 
223 HNFGRVPFIEFPK----N-KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG----GASLKQF----MNDL 289 (469) Q Consensus 223 ~~~g~vPvv~~~n----~-~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~----~~~~~~~----~~~~ 289 (469) |+|+++ . ..|.|.+..+...++.......-....++..+.|-.+++-.. +...++. ...+ T Consensus 173 -------vih~r~~~~~~~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~ 245 (423) T protein:vir:81 173 -------VIHRHGYNPKTMKRGKSPVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANL 245 (423) T ss_pred -------eEEecCCCCCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHH Confidence 233331 1 147777766666666555555555555666666766654211 1111221 1111 Q ss_pred hh---------cceeeecccCCCCCCcceEEeecCC--HHHHHHHHHHHHHHHHHHhCCCCcCccccCCccHHHHHHHHH Q lcl|NC_010179. 290 RE---------YKSIKINNAGNGDKSGVDKLQIDIP--VEARDDALKITRDNIFLFGQGIDPANFESSNASGVAIKMLYS 358 (469) Q Consensus 290 ~~---------~~~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g~~Sg~Al~~~~~ 358 (469) .. .+++.++ .+++|-....+ ...+.+..+.....|+..-++|+.-....++.+...++.. T Consensus 246 ~~~~~~~~~n~g~~~vl~-------~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~-- 316 (423) T protein:vir:81 246 RASFSPKSSDVGGTLLLE-------DGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREF-- 316 (423) T ss_pred HHHhccccccCCcceecC-------CCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHH-- Confidence 11 1222222 12344433332 2344445566677888888888643322222221111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----CCCcccceEEeC--CCCCCCHHHHHHHHHHH---hccCChHHHH Q lcl|NC_010179. 359 HLELKAAKTQTYFEHAINELVRAIMRYLNFS-----DADKRHISQHWT--RTKVEDSLTKAQIVSTV---ANYSSKEAVA 428 (469) Q Consensus 359 ~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~-----~~~~~~i~i~f~--~~~p~d~~e~~~~~~kl---~g~iS~et~~ 428 (469) ....+..+|.-.+..+...++.+ +.+.....+.|+ .-+..|.++.++++.++ +|+++.-++. 
T Consensus 317 --------~~~f~~~~L~P~~~~ie~~l~~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~~T~NE~R 388 (423) T protein:vir:81 317 --------RKALYGDNLGSWIRIIQDVMNLFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAWMTINEVR 388 (423) T ss_pred --------HHHHHHHHHHHHHHHHHHHHhhhhcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCCcCHHHHH Confidence 12233334444444444433321 222223345554 44566888888887764 4789988888 Q ss_pred HhCCCCCCHHHHHHHHHHHHHHhhhhH-hhcccCCCCCCCCC Q lcl|NC_010179. 429 KANPIVDDWQQELKDLAKDREENDPYA-NQADELNGKGVDDE 469 (469) Q Consensus 429 ~~l~~v~d~~~E~eri~~E~~~~~~~~-~~~~~~~~~~~~de 469 (469) +.++.-+.+ .-+.+. .|.. ...+.....+.+.| T Consensus 389 ~~~gl~p~~--gGD~~~------~p~n~~~~~~~~~~~~~~~ 422 (423) T protein:vir:81 389 AMDNLPSID--GGDDLA------RPLNTEFGDSEDAPGEEVE 422 (423) T ss_pred HHhCCCCCC--Ccceee------cccccccCccCCCCCCCCC Confidence 877542211 111111 1111 11222222222222 No 260 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=33.10 E-value=1.4 Score=19.82 Aligned_cols=280 Identities=10% Similarity=0.017 Sum_probs=100.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+--++...+. +-..-+||+- ++.... . . .-. +++ .+..-++......+. 
T Consensus 35 ~~~~~~~~~~~------------~~~~~~~~~p--p~~~~~--l---a-----~~~-~a~-----~~h~~~i~~k~n~l~ 84 (344) T protein:vir:60 35 LDRRDILDYVE------------CISNGRWYEP--PISFTG--L---A-----KSL-RAA-----VHHSSPIYVKRNILA 84 (344) T ss_pred cCCcchhHHHH------------hhhcCccccC--CCCHHH--H---H-----HHH-Hhh-----hhhccchhhhhhHHH Confidence 33332222221 1111233431 110000 0 0 000 000 000000111111111 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HH--HHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCce Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDD-RA--LTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKL 156 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~--~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~ 156 (469) + .+.-| .+ ..+..++.+.+.+|.+|+.+-.+..|++. +..++|..+-+..+.+ T Consensus 85 ~------------------~~~Pn~~~t~~~f~~~~~d~ll~Gnay~~i~rn~~G~~~~L~~l~~~~vr~~~~~~----- 141 (344) T protein:vir:60 85 S------------------TFIPHPWLSQQDFSRFVLDFLVFGNAFLEKRYSTTGKVIRLETSPAKYTRRGVEED----- 141 (344) T ss_pred h------------------hccCCCCCCHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEcCcceEEEeecCC----- Confidence 1 11111 11 12345566778899999998888888764 6666666554432211 Q ss_pred EEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC- Q lcl|NC_010179. 157 LGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK- 235 (469) Q Consensus 157 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n- 235 (469) ++|... ..+.. ..+..+ .|+++++ T Consensus 142 ----~~~~v~-~~~~~-----~~~~~~---------------------------------------------eIiHir~~ 166 (344) T protein:vir:60 142 ----VYWWVP-SFNEP-----TAFAPG---------------------------------------------SVFHLLEP 166 (344) T ss_pred ----eEEEEc-cCCeE-----EEEcCc---------------------------------------------cEEEEcCC Confidence 111110 00000 001111 1233332 Q ss_pred ----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCCcc--cchhhhhhhhhc------ceeee-ccc Q lcl|NC_010179. 
236 ----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYGGA--SLKQFMNDLREY------KSIKI-NNA 300 (469) Q Consensus 236 ----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~~~--~~~~~~~~~~~~------~~~~~-~~~ 300 (469) .-.|.|.+.....-++.-+.+..-..+.+...+.|-.++ +|.... ..+.....++.. +.+.+ .++ T Consensus 167 ~~~~~~yGlsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~~~ls~e~~~~ik~~~~~~~g~~~~r~~~l~~p~ 246 (344) T protein:vir:60 167 DINQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTDAVQDRNDIEMLRENMVKSKGRNNFKNLFLYAPQ 246 (344) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcCCCHHHHHHHHHHHHHhcCCCCCcceEEecCC Confidence 124666665444333322222212223334445554444 442111 112222222211 22222 222 Q ss_pred CCCCCCcceEEeec--CCHHHHHHHHHHHHHHHHHHhCCCCcCcc----c---cCCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 301 GNGDKSGVDKLQID--IPVEARDDALKITRDNIFLFGQGIDPANF----E---SSNASGVAIKMLYSHLELKAAKTQTYF 371 (469) Q Consensus 301 ~~~~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~ 371 (469) + +..++++.... .....+.+..+...+.|...-++|+.-.. + +||+...++.+....+.=.+. T Consensus 247 g--~~~g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~~n~e~~~~~f~~~~L~Pl~~------ 318 (344) T protein:vir:60 247 G--KADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQLMGGKPENVGSLGDIEKVAKVFVRNELIPLQD------ 318 (344) T ss_pred C--CccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHHhcccCCCCCccccHHHHHHHHHHHHHHHHHH------ Confidence 2 22345554432 23455677778888899999999874321 1 222222222222221111111 Q ss_pred HHHHHHHHHHHHHHhcccCCCcccceEEeCCC-CCCCHH Q lcl|NC_010179. 372 EHAINELVRAIMRYLNFSDADKRHISQHWTRT-KVEDSL 409 (469) Q Consensus 372 ~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~-~p~d~~ 409 (469) .|++ +...++.. .+.|.+. 
+..+.+ T Consensus 319 --~~e~----ln~~lg~~-------~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 319 --RIRE----INGWLGQE-------VIRFKNYSLDTDNG 344 (344) T ss_pred --HHHH----HHHhcCCc-------ccccCccccCCCCC Confidence 1111 22233321 1344433 222222 No 261 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=31.73 E-value=1.5 Score=19.66 Aligned_cols=275 Identities=11% Similarity=0.024 Sum_probs=98.4 Q ss_pred ccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhhcCCeeeccCch--h----------------- Q lcl|NC_010179. 32 ENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIASVFPDIDVGKD--A----------------- 92 (469) Q Consensus 32 ~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~g~p~~~~~~~~--~----------------- 92 (469) .+++ ..+..... +.. ..-.|.||.|..+-..-+ + T Consensus 1 m~~~------~~~~~~~~---------~~~------------~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~pP~ 53 (337) T protein:vir:78 1 MTKR------QQQPAQAA---------ASS------------PRPSVVFSMPEAIDPTAWMTDYTGVFYNPYGEYYQPPI 53 (337) T ss_pred CCCc------ccCccccc---------ccC------------ceeEEEecCcccccCcchhHhhhhhhhccCcceecCCC Confidence 2211 00000000 000 001233443322211000 0 Q ss_pred hHHHHHHH----------H--hccH-----H---HHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCC Q lcl|NC_010179. 93 DNKKILDV----------L--GDDR-----A---LTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATT 151 (469) Q Consensus 93 ~~~~l~~~----------~--~~n~-----~---~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~ 151 (469) ....|.+. + +-|. . ..+..++.+.+.+|.+|+.+-.+..|++. 
+..++|..+-..-| T Consensus 54 ~~~~La~l~~~~~~h~~~L~~k~N~~~~~f~~~~~~~~~~~~d~ll~GNay~~~~rn~~G~~~~L~pl~~~~v~~~~d-- 131 (337) T protein:vir:78 54 DRKGLAKVARANAHHGAILMARRNMVAGRFTNQRATITAFVHNYLQFGDGGLLKLRNSFGQVVGLHPLSSVYLRRRED-- 131 (337) T ss_pred CHHHHHHHhhcchhhhhHHHhhhccccccCcCcHHHHHHHHHHHHhhCCeEEEEEECCCCcEEEEEEeCCceeEeeeC-- Confidence 00011111 1 1121 1 12345666778899999999888888754 66666655432211 Q ss_pred CCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEE Q lcl|NC_010179. 152 LDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFI 231 (469) Q Consensus 152 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv 231 (469) .. .+|... .+.. ..+..+. |+ T Consensus 132 --~~-----~~~~~~--~~~~-----~~~~~~e---------------------------------------------Ii 152 (337) T protein:vir:78 132 --GC-----FVYLQQ--GKPN-----LIYRPDD---------------------------------------------VI 152 (337) T ss_pred --Ce-----EEEEEc--CCce-----EEECCcc---------------------------------------------EE Confidence 00 011000 0000 0011111 23 Q ss_pred EecC-----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCC--cccchhhhhhhhh-------ccee Q lcl|NC_010179. 232 EFPK-----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYG--GASLKQFMNDLRE-------YKSI 295 (469) Q Consensus 232 ~~~n-----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~--~~~~~~~~~~~~~-------~~~~ 295 (469) ++++ .-.|.|.+.....-+..-..+..-..+.+...+.|-.++ +|.. .+..+.....++. .+++ T Consensus 153 Hik~~~~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~~~l~~e~~~~lk~~~~~~~G~~n~~~~~ 232 (337) T protein:vir:78 153 WLAQYDPEQQVYGMPDYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATDPNMDDDTEEEMKEMIANSKGVGNFRSMF 232 (337) T ss_pred EECCCCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCCCCCCHHHHHHHHHHHHHhcCcccccceE Confidence 3332 113666555444433322222222233344445665554 3321 1111122222221 1122 Q ss_pred eecccCCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcccc--CCccH--HHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
296 KINNAGNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANFES--SNASG--VAIKMLYSHLELKAAKTQT 369 (469) Q Consensus 296 ~~~~~~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--g~~Sg--~Al~~~~~~l~~k~~~~~~ 369 (469) .+.+++. ..++++..... ....+.+..+...++|...-++|+.-.+-. ++.+| .+-+.. .. T Consensus 233 v~~~~g~--~~Gi~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~llGi~~~~~~~~~~n~e~~~-----------~~ 299 (337) T protein:vir:78 233 VNIPDGK--PDGIKLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPALAGIIPTNGGGGLGDPEKYD-----------AT 299 (337) T ss_pred EEcCCCC--ccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcccccCCCcCccccHHHHH-----------HH Confidence 2333222 33456654332 344556666677788999888887432111 11111 111111 12 Q ss_pred HHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCH Q lcl|NC_010179. 370 YFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDS 408 (469) Q Consensus 370 ~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~ 408 (469) .+...|.-+++.+...++....+. ...+.|..+.-.-. T Consensus 300 f~~~~L~P~~~~ie~~~n~~ll~~-~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 300 YARNEVLPLCELVQDAINSAGLPR-ALWVTFRETIGAAV 337 (337) T ss_pred HHHHHHHHHHHHHHHHHhhhcCCh-hhceeccccccccC Confidence 222333333333333333222111 11123332222211 No 262 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=30.64 E-value=1.6 Score=19.52 Aligned_cols=291 Identities=12% Similarity=0.040 Sum_probs=106.3 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+--++-+.+.-+ +.-..+||+- ++.... . .+-. + +.++...++......+. 
T Consensus 30 ~~~~~~~~~~~~~----------~~~~~~~~ep--p~~~~~------L----a~l~-~-----~n~~h~~~i~~k~N~l~ 81 (348) T protein:vir:26 30 DTNSWMTRYCELF----------YNDFDDYWEP--PISLKG------L----AEIA-N-----ANGYHGSLLKARANYVA 81 (348) T ss_pred cCcchHHHHHHHH----------hcCCCccccC--CCCHHH------H----HHHH-h-----hhhhhhhhHhhhhhHHh Confidence 4433333332221 1112245542 110000 0 0000 0 01111112222222222 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCceE-EEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNFR-YGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~-i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) +. +....... ...+.+++.+.+.+|.+|+.+-.+..|++. +..++|..+-+.-| .. T Consensus 82 ~~---~~Pn~~~t------------~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~v~~~~d----~~---- 138 (348) T protein:vir:26 82 GR---FMNGGGLP------------MYKMNSACWDYFGLGMSAFVKIRSYLKNVIALEPLPMVHMRKRKN----GD---- 138 (348) T ss_pred hc---ccCCCCCC------------HHHHHHHHHHHHhcCCeEEEEEEcCCCcEEEEEEecCceeEeeec----Cc---- Confidence 20 11000000 112345666778889999999888888753 66677665433211 00 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC---- Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK---- 235 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n---- 235 (469) +|... ..+... .+..+.+ +++++ T Consensus 139 --~~~~~-~~g~~~-----~f~~~dI---------------------------------------------iHir~~~~~ 165 (348) T protein:vir:26 139 --FVQLL-RNNEQK-----VFKAKDV---------------------------------------------IFIPQYDPQ 165 (348) T ss_pred --EEEEE-ecCeEE-----EEcCccE---------------------------------------------EEEcCCCCC Confidence 11100 011100 1122222 22221 Q ss_pred -CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCCc--ccchhhhhhhhh-------cceeeecccCCC Q lcl|NC_010179. 
236 -NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYGG--ASLKQFMNDLRE-------YKSIKINNAGNG 303 (469) Q Consensus 236 -~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~~--~~~~~~~~~~~~-------~~~~~~~~~~~~ 303 (469) .-.|.|.+...+.-+..-+.+..-....++..+.|-.++ ++... +..+.....+.. .+++.+.+++.. T Consensus 166 ~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~~~ls~e~~~~lk~~~~~~~G~~n~~~~~vl~~~g~~ 245 (348) T protein:vir:26 166 QQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATDPNLSEADEKALKEKIASSKGIGNFRSMFVNIPNGKE 245 (348) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeEEcCCCCc Confidence 124666665544444322222222233345555665554 33211 111222222221 123334333332 Q ss_pred CCCcceEEee--cCCHHHHHHHHHHHHHHHHHHhCCCCcCcc----c---cCCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 304 DKSGVDKLQI--DIPVEARDDALKITRDNIFLFGQGIDPANF----E---SSNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 304 ~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) .++++... ......+.+..+.....|+..-++|+.-.. + +|++...+.. .+... T Consensus 246 --~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~llGi~~~~~~~~sn~e~~~~~---------------f~~~~ 308 (348) T protein:vir:26 246 --KGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAGMGGMLPQQGANVPDPLKVSQV---------------YDFYE 308 (348) T ss_pred --cceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHHHccccCCCCCccccHHHHHHH---------------HHHHH Confidence 34455432 223345666667777889999899874321 1 1222222222 22222 Q ss_pred HHHHHHHHHHHhcccCCCcccceEEeC--CCCCCCHHHHH Q lcl|NC_010179. 375 INELVRAIMRYLNFSDADKRHISQHWT--RTKVEDSLTKA 412 (469) Q Consensus 375 l~~~~~~i~~~~~~~~~~~~~i~i~f~--~~~p~d~~e~~ 412 (469) |.-+++.+...++..-.-...+.+.|. 
+..-+..+..+ T Consensus 309 l~P~~~~ie~~ln~~l~~~~~~~~~fdl~~~~e~~~~~a~ 348 (348) T protein:vir:26 309 VIPVCKRFMDAVNNDPEIPDNLKLKFNLNPGVESANGSAV 348 (348) T ss_pred HHHHHHHHHHHHhhhhCCCCccEEEEecCcccccchhhcC Confidence 333333333322211000112334443 32222222222 No 263 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=29.54 E-value=1.7 Score=19.39 Aligned_cols=393 Identities=13% Similarity=0.015 Sum_probs=146.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc---------ccchhhhccccccc--c-ccc---Ccceecc Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTR---------NNGKPKVSKEGKKD--P-LRS---ADNRIPS 65 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~---------~~~~~~~~~~~~~~--~-~~~---~~~ri~~ 65 (469) |- +++.+....... ++ .....+-...+..... ..........+... . ... ...-+.. T Consensus 1 M~------~~~~l~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~g~~v~~~~a~~~ 72 (466) T protein:vir:81 1 MR------LIDRLLSTRGAA-PR-MSIDDYAQMLNEFAFNGIGYGFGGGVPRIQQTLAGPSTELAPDTFVGLATQAYQAN 72 (466) T ss_pred Cc------hhHHHhhccCcc-cc-cchhhhhhhhhhhhccccccccccccHHHHHhhccccccccCccccccchhhhhcc Confidence 22 122222111000 00 0001111110000000 00000000000000 0 000 0001223 Q ss_pred chHHHHHHHHHHhhhcCCeeeccCch-----hhHHHHHHHHhc-c-HH---HHHHHHHHHHHhCCeEEEEEEEcCCCc-- Q lcl|NC_010179. 66 NFYQLLVDQEAGYIASVFPDIDVGKD-----ADNKKILDVLGD-D-RA---LTLNSLLVDSSNAGRAWLHYWIDEDNN-- 133 (469) Q Consensus 66 n~~k~iv~~~~~~l~g~p~~~~~~~~-----~~~~~l~~~~~~-n-~~---~~~~~~~~~~~~~G~~~~~v~~d~~~~-- 133 (469) +.....|+..+.-+.+-|+.+.-.++ .....+..++.+ | .+ +....+..+++.+|.+|+.+..++.+. 
T Consensus 73 ~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l~ 152 (466) T protein:vir:81 73 GPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRMR 152 (466) T ss_pred HHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCccccc Confidence 44455666666666666766532211 112234445542 3 22 223456778899999999998877654 Q ss_pred -------eEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeeccccccccc Q lcl|NC_010179. 134 -------FRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITS 206 (469) Q Consensus 134 -------~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 206 (469) ..+..++|..+.+..+......+. ..|... +.........+... T Consensus 153 ~~~~g~~~~l~~l~~~~v~~~~~~~~~~~~~---y~~~~~---~~~~~~~~~~~~~~----------------------- 203 (466) T protein:vir:81 153 PDWVDVVVEERMVRGGRGELGGGQLGWRKVG---YLYTEG---GRQSGNESVGFLAE----------------------- 203 (466) T ss_pred cccCcceeEEEEecCcceEEEEcCCCceEEE---EEEEec---Ccccccceeeeccc----------------------- Confidence 346667777766665432211111 111100 00000000011111 Q ss_pred ccccccccccccccccccCCcccEEEecCC------ccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEEecCCcc Q lcl|NC_010179. 207 YDLSAGYETGQSNTLKHNFGRVPFIEFPKN------KYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGA 280 (469) Q Consensus 207 ~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~------~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~~g~~~~ 280 (469) -|+||+.. -.|.|-+......|+....+..-....++..+.|-.+++-.... T Consensus 204 ----------------------dviHir~~~~~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~~l 261 (466) T protein:vir:81 204 ----------------------DVVHFAPIPDPLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNPMA 261 (466) T ss_pred ----------------------cEEEEcCCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCCCC Confidence 23344321 14667677666666665555555566667777776666532111 Q ss_pred c---chhhhhhhhh--------cceeeecccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcc---ccC Q lcl|NC_010179. 
281 S---LKQFMNDLRE--------YKSIKINNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDPANF---ESS 346 (469) Q Consensus 281 ~---~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~---~~g 346 (469) . .......+.. .+++.+.. +.+++-++.+.....+.+..+...+.|+..-++|+.-.. +.+ T Consensus 262 ~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-----g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~~~~~~ 336 (466) T protein:vir:81 262 DPAAVKKWADEVNSKHAGVDNAWKNLNLYP-----GADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGLSEGLA 336 (466) T ss_pred CHHHHHHHHHHHHHHhcCccccccceEcCC-----CceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccccCCC Confidence 1 1111111111 11222221 123444443333445566677788899998889875332 122 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc---CCCcccceEEeCCC--CCCCHHHHHHHH------ Q lcl|NC_010179. 347 NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFS---DADKRHISQHWTRT--KVEDSLTKAQIV------ 415 (469) Q Consensus 347 ~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~---~~~~~~i~i~f~~~--~p~d~~e~~~~~------ 415 (469) ..++..++-... ..+..+|.-+++.+...++.+ ..+...+.+.|+.. +-.|.++.+++. T Consensus 337 ~st~sn~eq~~~----------~f~~~tl~P~~~~ie~~l~~~L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~ 406 (466) T protein:vir:81 337 AATYSNYGQARR----------RLADGTAHPLWQNLSGCIGHVMPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRAET 406 (466) T ss_pred ccccccHHHHHH----------HHHHHHHHHHHHHHHHHHHhhcCCcccCcceEEEecchhhhccCHHHHHHHHHHHHHH Confidence 233222222221 222333333333333333211 11222345666533 444666655542 Q ss_pred -HHH--hccCChHHHHHhCCCCCCH--H-HHHHHHHHHHHHhhhhHhhcccCCCCCC-CCC Q lcl|NC_010179. 416 -STV--ANYSSKEAVAKANPIVDDW--Q-QELKDLAKDREENDPYANQADELNGKGV-DDE 469 (469) Q Consensus 416 -~kl--~g~iS~et~~~~l~~v~d~--~-~E~eri~~E~~~~~~~~~~~~~~~~~~~-~de 469 (469) ..+ +|+ ....+....+.-++. . .-+.-.+.-. ..........+...+|+ ++. 
T Consensus 407 ~~~~~~~g~-t~nE~r~~~~~gd~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~Gg~~ng 465 (466) T protein:vir:81 407 INTLITAGY-EPESVVAAVNSGDLRLLKHTGLTSVQLLP-PGVSASASSDTPTSGGADDNG 465 (466) T ss_pred HHHHHHcCC-ChhhccccccCCccccccCCCcchhhhcc-cccccccCCCCcccCCCCcCC Confidence 222 343 444444433221110 0 0000000000 00111111111111222 222 No 264 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=28.21 E-value=1.8 Score=19.22 Aligned_cols=399 Identities=10% Similarity=0.065 Sum_probs=159.7 Q ss_pred CCHHHH-HHHHHHHHHH---H----HHH--------HHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceec Q lcl|NC_010179. 1 MELDAL-KKLIRNTSTS---R----NDL--------INNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIP 64 (469) Q Consensus 1 ~~~~~~-~~~i~~~~~~---~----~~~--------~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~ 64 (469) .+-+.. .++.+++..- + ... ....-....||-|.- . +.+..+..... . -+. T Consensus 10 ~de~~~~~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~~--~---n~~eLI~~YR~---m-----a~~ 76 (533) T protein:vir:58 10 LNEAVNFTNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGIE--F---NRFFLYDMYDR---M-----DYT 76 (533) T ss_pred hhHHHHHHHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhcccc--c---cHHHHHHHHHH---h-----hcc Confidence 000000 1111111000 0 000 000111222333310 0 00000000000 0 001 Q ss_pred cchHHH----HHHHH-HHhhhcCCeeeccCchhhHHHHHHHHhc--cHHHHHHHHHHHHHhCCeEEEEEEEc-CCCce-E Q lcl|NC_010179. 65 SNFYQL----LVDQE-AGYIASVFPDIDVGKDADNKKILDVLGD--DRALTLNSLLVDSSNAGRAWLHYWID-EDNNF-R 135 (469) Q Consensus 65 ~n~~k~----iv~~~-~~~l~g~p~~~~~~~~~~~~~l~~~~~~--n~~~~~~~~~~~~~~~G~~~~~v~~d-~~~~~-~ 135 (469) ++-+-. ||+.. +..-...|+.+..++.+..+.+++.+.+ ||.....+..+.+.+.|+.|.+.-++ +++.| . 
T Consensus 77 ~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~lldf~~~~~~~fR~WYVDGriy~Hkiik~~k~GI~e 156 (533) T protein:vir:58 77 DPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVINIEKNAYPIIRNMIKYGDMFLHILEKGSDGTIEK 156 (533) T ss_pred CcchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHhcchhhhhHHHHhhhhcceeEEEeccCCcccchhh Confidence 122222 33322 2234577888887777777777666543 46677889999999999999888542 33334 6 Q ss_pred EEEEccceeEEEEeCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccc Q lcl|NC_010179. 136 YGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYET 215 (469) Q Consensus 136 i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 215 (469) +..+||+.+-++++.... . .+| +|++...... +.... ..... T Consensus 157 lr~lDPr~i~~vr~~~t~--~----eyy---------------vy~~~~~~~~---s~~~~--------------~kI~~ 198 (533) T protein:vir:58 157 FQVVSPYIFSKRYNPETD--T----WYY---------------VITDVYRNVV---SGYFN--------------EDIPE 198 (533) T ss_pred heecCCeeeEEEEeeccc--e----EEE---------------eecccccccc---cCccc--------------cccch Confidence 899999998888765332 1 111 2222211100 00000 00000 Q ss_pred cccccccccCCcccEEEecCCccccccHHHHHHHHHHHHH--HHHHHHHHHHHhcCceeEEecCCccc-----chhhhhh Q lcl|NC_010179. 216 GQSNTLKHNFGRVPFIEFPKNKYRLAELNKYKGLIDAYDD--IYNGFINDLDDVQTVILVLTNYGGAS-----LKQFMND 288 (469) Q Consensus 216 ~~~~~~~~~~g~vPvv~~~n~~~g~~~~~~v~~liD~~~~--~~s~~~~~~~~~~~p~l~~~g~~~~~-----~~~~~~~ 288 (469) ..+.-..+++- . .+.+.+.|-+. .-|..+|. ++-+.+-..+..+.|-.-+.-.+..+ ..+-... 
T Consensus 199 daI~y~~SGl~--d----~~~~~iisyLh---kAiKp~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~ 269 (533) T protein:vir:58 199 EDVIHFSHKID--T----NFFPYGRSYLE---SARAIWNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTN 269 (533) T ss_pred hheeeeeeccc--c----CCCCceehhhh---HHHHHHHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHH Confidence 00000011110 0 01223334443 33333443 23444444555555432222211111 1111111 Q ss_pred h---hhcceeee------------------------cccCCCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCc- Q lcl|NC_010179. 289 L---REYKSIKI------------------------NNAGNGDKSGVDKLQIDIPVEARDDALKITRDNIFLFGQGIDP- 340 (469) Q Consensus 289 ~---~~~~~~~~------------------------~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~- 340 (469) + -.++++.- +--+.+.+-.++.|.. .++ ....-+.-+.+.+|+...+|-. T Consensus 270 im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLpRReGgrgTEI~TLpG-g~l-gemeDV~YF~kkLy~ALnVP~sR 347 (533) T protein:vir:58 270 IAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIPRRGDRRAVEIDILQG-SKV-DLAEDVEYMLNRLISALKVPKAF 347 (533) T ss_pred HHHhcccceEEeccCCeEeeccchhhhhhhHhhhcccccCCCccceeeecCC-CCC-CcHHHHHHHHHHHHHHhCCCeee Confidence 1 01111110 0001122234555542 344 3345678888999998898853 Q ss_pred --CccccCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHH------- Q lcl|NC_010179. 341 --ANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTK------- 411 (469) Q Consensus 341 --~~~~~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~------- 411 (469) .+.++|..|..+ .-.......+.+.+..|..-|++.| ++ ++-....+..+.|...-.-.+... T Consensus 348 l~~e~~fgr~~eIt--RDEiKF~KFI~rLR~rF~~ll~~qL--il----k~iit~eew~~~f~~Dn~f~ElKe~Eil~~R 419 (533) T protein:vir:58 348 IGYEGDVNAKNTLA--TQDIKFNNTIKRIQGFFVEELERMV--RM----NKEFADQDFRLVMNRSNSIVEGERFAVIEQR 419 (533) T ss_pred cCCCCCCccchhhh--HHHHHHHHHHHHHHHHHHHHHhccc--cc----ccCcchhheeeeeeccchHHHHHHHHHHHHH Confidence 223455433332 2222233334444555555544422 11 111222334677764433333333 Q ss_pred HHHHHHHhccCChHHHHHhC-CCCCCHHHHHHHHHHHHHHhh-hhHhhcccCCCCCCCCC Q lcl|NC_010179. 
412 AQIVSTVANYSSKEAVAKAN-PIVDDWQQELKDLAKDREEND-PYANQADELNGKGVDDE 469 (469) Q Consensus 412 ~~~~~kl~g~iS~et~~~~l-~~v~d~~~E~eri~~E~~~~~-~~~~~~~~~~~~~~~de 469 (469) +++++.+.+.++++++.+.+ -..+|...+.+.|++|..+.. +......+......+.| T Consensus 420 i~~l~~~dpyvgk~yi~k~ILr~tdei~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~ 479 (533) T protein:vir:58 420 IGIAERLKGWVREDWIYSNILQIPYDLKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGE 479 (533) T ss_pred HHHHHHhcchhhHHHHHHHHhcCChhhhHHHHHHHHhhcCCCCCCCCcccccCCcccCcc Confidence 34445566789999988864 445555555566666643311 00000000011111111 No 265 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=25.84 E-value=2 Score=18.92 Aligned_cols=412 Identities=10% Similarity=0.013 Sum_probs=160.3 Q ss_pred CCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHH Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDL---INNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAG 77 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~---~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~ 77 (469) |.- -...+..+ .+| ..+.+.+.+|... .+-.. ... ... ......+.-.+-...-++..++ T Consensus 1 m~~-----~~~~l~~k-~~R~~~e~~w~e~a~~~lP-----~~~~~-~~~-~~~----~~~~~~~~~dstg~~a~~~LAa 63 (514) T protein:vir:80 1 MRQ-----QASAMWAE-YRDSTAIRKAEDFAKFTIA-----SLMVD-PLD-KTH----QAEVVEYDFQSAGAFLVNNLTA 63 (514) T ss_pred Ccc-----chHHHHHH-hhcchHHHHHHHHHHHhcc-----cccCC-CCC-Ccc----cccccccccchhHHHHHHHHHH Confidence 332 22333222 123 3333444444332 11000 000 000 0000112223444445555555 Q ss_pred hhhc--CCe-----eeccCch---------hhHHHHHHH-----------H-hccHHHHHHHHHHHHHhCCeEEEEEEEc Q lcl|NC_010179. 
78 YIAS--VFP-----DIDVGKD---------ADNKKILDV-----------L-GDDRALTLNSLLVDSSNAGRAWLHYWID 129 (469) Q Consensus 78 ~l~g--~p~-----~~~~~~~---------~~~~~l~~~-----------~-~~n~~~~~~~~~~~~~~~G~~~~~v~~d 129 (469) -|.+ -|| ++..+++ .....++.| + ..||...+.++.++...+|.+.++ .+ T Consensus 64 ~l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~--~~ 141 (514) T protein:vir:80 64 KLALTLFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALFY--RE 141 (514) T ss_pred HHHhhhcCCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEEE--Ee Confidence 4433 122 2222221 111122332 2 246677788889999999998654 44 Q ss_pred CCCceEEEEEccceeEEEEeCCCCCceEEEEEEEEeeecC--------------CceEEEEEEEEcCCeEEEEEeec--C Q lcl|NC_010179. 130 EDNNFRYGIIQPDQITPVYATTLDNKLLGVLRSYKQLDPE--------------AGKYFTVHEYWTDKEAQFFRTSA--T 193 (469) Q Consensus 130 ~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~v~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~--~ 193 (469) ++. -.++.++-.++++.-|. .+++...+|..+..... .......+++|+. .+...+ . T Consensus 142 ~~~-~~~~~~pl~~y~v~~d~--~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~----v~~~~~~~~ 214 (514) T protein:vir:80 142 PGT-GKMLVWTMQSYTVRRTS--HGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTV----IEWQPTPNG 214 (514) T ss_pred cCC-CcEEEEEcCeEEEeeCC--CcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEE----EEeecCCCC Confidence 432 23555655555544443 34555555554432110 0001112222221 011111 1 Q ss_pred ceeecccccccccccccccccccccccccccCCcccEEEec-----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_010179. 194 DSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFP-----KNKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQ 268 (469) Q Consensus 194 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-----n~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~ 268 (469) .+.++ +....+....... ..++..+|++.++ .+.+|.|-.++..+-+..+|.+.-.......... 
T Consensus 215 ~~~sv--------~~e~~g~~i~~es--~y~~~e~P~i~~Rw~~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~~~a~ 284 (514) T protein:vir:80 215 KRCAV--------WHELEGKRVGPES--SYPAHLCPYVPVAWNVPDGEHYGRGYVEEYSGDFARLSILSERLGLYEFEAL 284 (514) T ss_pred eEEEE--------EEeccceeecccC--ccccccCCeeeeeeEecCCCCcccchHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 11111 1111111111111 1223446776654 3457999999999999999988777777777777 Q ss_pred CceeEEecCCcccchhhhhhhhhcceeeecccCCCCCCcceEEee--cCCHHHHHHHHHHHHHHHHHHhCCCCcCccccC Q lcl|NC_010179. 269 TVILVLTNYGGASLKQFMNDLREYKSIKINNAGNGDKSGVDKLQI--DIPVEARDDALKITRDNIFLFGQGIDPANFESS 346 (469) Q Consensus 269 ~p~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~g 346 (469) .|.+.+.-.+... ...+.....-.+.++ ...++..+.. ..+.......++.++..|...-.. ........ T Consensus 285 ~~~~~v~~~g~~~----~~~l~~~~~g~~v~g---~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aFml-~~~~rd~~ 356 (514) T protein:vir:80 285 SLLNLVDEAKGGA----VDDYRDAETGDFVPG---QVGSVASYERGDYNKIAQASASVESIVMRLNRAFMY-TGQVRDAE 356 (514) T ss_pred CCCceeCcccccc----hhhhcccCCceeecC---CCccceeeecCcccchHHHHHHHHHHHHHHHHHHhh-hccCCCCC Confidence 7665442111111 111111111111111 1234555543 336777778888888777542111 11111223 Q ss_pred CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHhcc------cCCCcccceEEeCCCCCC-CHHHH Q lcl|NC_010179. 347 NASGVAIKMLYSHLELKAAKTQTYFEHAINEL--------VRAIMRYLNF------SDADKRHISQHWTRTKVE-DSLTK 411 (469) Q Consensus 347 ~~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~--------~~~i~~~~~~------~~~~~~~i~i~f~~~~p~-d~~e~ 411 (469) +.|+..+.. +..+++..++..+.++ +...+.++.. ......-+.+.+..++.. ...+. 
T Consensus 357 rvTAtEV~~-------r~~E~~~~LGpv~~rl~~Ell~Pli~r~~~il~r~~~g~lP~~p~~l~~~~~vs~la~l~r~~~ 429 (514) T protein:vir:80 357 RVTVEEIRT-------VAEEAENLLGGVYSLLAETLQAPLAYLTMYEASRGNGGMLLGIAQGVYRPSIITGIPALTRNIE 429 (514) T ss_pred CCCHHHHHH-------HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCchhhcceeeecHHHHHHHHH Confidence 346655442 3345555555555443 2222222211 122222344555433211 11111 Q ss_pred H-------HHHHHHhcc-------CChHHHHHhC----C-----CCCCHHH---HHHHHHHHHHHhhhh-------Hhhc Q lcl|NC_010179. 412 A-------QIVSTVANY-------SSKEAVAKAN----P-----IVDDWQQ---ELKDLAKDREENDPY-------ANQA 458 (469) Q Consensus 412 ~-------~~~~kl~g~-------iS~et~~~~l----~-----~v~d~~~---E~eri~~E~~~~~~~-------~~~~ 458 (469) + +.+..++++ +....++..+ | .+.+.+. +.+|.++++++.... ..+. T Consensus 430 ~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 509 (514) T protein:vir:80 430 TANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLDVASGALAAETSA 509 (514) T ss_pred HHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 2 222222222 3344444442 2 1222211 112221111111111 0111 Q ss_pred ccCCC Q lcl|NC_010179. 459 DELNG 463 (469) Q Consensus 459 ~~~~~ 463 (469) +-+.. T Consensus 510 ~~~~~ 514 (514) T protein:vir:80 510 GVLTS 514 (514) T ss_pred cccCC Confidence 22222 No 266 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=22.96 E-value=2.4 Score=18.53 Aligned_cols=286 Identities=9% Similarity=0.045 Sum_probs=106.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-.++-..+.-+ ..-+||+- ++.... .. +-. 
| +..+...++...+..+. T Consensus 40 ~~~~~~~~~~~~~------------~~~~~~~p--p~~~~~------la----~~~-----~-~~~~h~~~l~~k~n~l~ 89 (351) T protein:vir:78 40 MNRAEILDYVECW------------SNGEWFEP--PVSFAG------LA----KSF-----R-ASTHHSSALFFKANVLA 89 (351) T ss_pred cCcchhhhhhhhh------------ccCceecC--CCCHHH------HH----HHH-----h-hhHhhhhhhhhhhhHHh Confidence 4433322222111 11123331 111000 00 000 0 01111222222233332 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) +. +...... -...+.+++.+.+.+|.+|+.+-.+..|++ .+..++|..+.+.-+.+ T Consensus 90 ~~---~~Pn~~~------------t~~~f~~~~~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~v~~~~~~~-------- 146 (351) T protein:vir:78 90 ST---FRPHRWL------------SRHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFS-------- 146 (351) T ss_pred hc---ccCCCCC------------CHHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEecCcceEEeeeCC-------- Confidence 21 1100000 011244566678889999999988888875 47777777665543321 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC---- Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK---- 235 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n---- 235 (469) ++|... ..+.. ..|..+. |+++++ T Consensus 147 -~~~~~~-~~~~~-----~~~~~~e---------------------------------------------Vihir~~~~~ 174 (351) T protein:vir:78 147 -GFVYVN-GWQER-----HEFAPDS---------------------------------------------VFQLVRPDIN 174 (351) T ss_pred -eEEEEe-cCCeE-----EEEcccc---------------------------------------------EEEEcCCCCC Confidence 111110 01100 0111222 223321 Q ss_pred -CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCC--cccchhhhhhhhh-------cceeeecccCCC Q lcl|NC_010179. 
236 -NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYG--GASLKQFMNDLRE-------YKSIKINNAGNG 303 (469) Q Consensus 236 -~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~--~~~~~~~~~~~~~-------~~~~~~~~~~~~ 303 (469) .-.|.|.+......+..-+.+..-..+.++..+.|-.++ +|.. .+..+.....++. .+++.+.+++. T Consensus 175 ~~~yGl~~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~~~ls~e~~~~lr~~~~~~~G~~N~~~~~v~~~~g~- 253 (351) T protein:vir:78 175 QEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGK- 253 (351) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCcccccceeeecCCCC- Confidence 124667666555544433333222334445555564444 3421 1111122222221 11233333332 Q ss_pred CCCcceEEeec--CCHHHHHHHHHHHHHHHHHHhCCCCcCcc----c---cCCccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 304 DKSGVDKLQID--IPVEARDDALKITRDNIFLFGQGIDPANF----E---SSNASGVAIKMLYSHLELKAAKTQTYFEHA 374 (469) Q Consensus 304 ~~~~~~~l~~~--~~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 374 (469) ..++++.... .....+.+..+....+|+..-++|+.-.. + +|++...++. .+... T Consensus 254 -~~g~k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~---------------f~~~~ 317 (351) T protein:vir:78 254 -KDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARV---------------FGRNE 317 (351) T ss_pred -ccceeEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHH---------------HHHHH Confidence 2344554322 23445666677778889999999874321 1 1222222221 12222 Q ss_pred HHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHH Q lcl|NC_010179. 
375 INELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKA 412 (469) Q Consensus 375 l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~ 412 (469) |.-+++.+..+.+.-+.+ -+.|++..-..-.+.+ T Consensus 318 l~P~~~~iee~n~~l~~~----~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 318 IRPLQARFAELNDWLGDE----VVRFDDYEIPPAPVAA 351 (351) T ss_pred HHHHHHHHHHHHhhcCcc----ceecChhhhccccccC Confidence 222222222211111111 1455533221111111 No 267 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=22.95 E-value=2.4 Score=18.52 Aligned_cols=279 Identities=9% Similarity=0.030 Sum_probs=101.1 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-.++.+.+.-+ ..-+||+- +|.... .. +-. ++ ..+...++......+. T Consensus 43 ~~~~~~~~y~~~~------------~~~~~~~p--p~~~~~-----la-----~~~-~~-----~~~h~~~l~~k~n~l~ 92 (350) T protein:vir:11 43 LDGRGILDYLECW------------PNGRWYEP--PLSMEG-----LA-----KSV-GS-----SVYLQSGLKFKRNMLA 92 (350) T ss_pred cCcchhhHHHHHh------------hcCccccC--CCCHHH-----HH-----HHH-hh-----hhhhccchhhhhhhhh Confidence 4443333332111 01123331 110000 00 000 00 0000000111111111 Q ss_pred cCCeeeccCchhhHHHHHHHHhcc-HH--HHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCce Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDD-RA--LTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKL 156 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n-~~--~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~ 156 (469) + .+.-| .+ ..+.+++.+.+.+|.+|+.+..+..|++ .+..++|..+-+.-+.. 
T Consensus 93 ~------------------~~~Pn~~~t~~~f~~~v~d~ll~Gnay~~~~rn~~G~~~~L~~l~~~~vr~~~~~~----- 149 (350) T protein:vir:11 93 K------------------TFIPHRLLSRATFEQFSLDWLTFGSAYLEQPRSRLGTRMPLQAPLAKYMRRGTDLE----- 149 (350) T ss_pred h------------------cccCCCCCCHHHHHHHHHHHHhcCCeEEEEEEcCCCCEEEEEEeCCceeEeeecCC----- Confidence 0 00111 11 1234566677889999999988888875 47777776654322210 Q ss_pred EEEEEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC- Q lcl|NC_010179. 157 LGVLRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK- 235 (469) Q Consensus 157 ~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n- 235 (469) ++|.... .+..+ .+..+. |+++++ T Consensus 150 ----~~~~~~~-~~~~~-----~~~~~e---------------------------------------------Vihir~~ 174 (350) T protein:vir:11 150 ----TFYQVRS-WKDEH-----EFEKGS---------------------------------------------VIQLREA 174 (350) T ss_pred ----eEEEEee-CCeEE-----EECccc---------------------------------------------EEEeCCC Confidence 1111110 11100 111222 223321 Q ss_pred ----CccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCceeEE--ecCC--cccchhhhhhhhh-------cceeeeccc Q lcl|NC_010179. 236 ----NKYRLAELNKYKGLIDAYDDIYNGFINDLDDVQTVILVL--TNYG--GASLKQFMNDLRE-------YKSIKINNA 300 (469) Q Consensus 236 ----~~~g~~~~~~v~~liD~~~~~~s~~~~~~~~~~~p~l~~--~g~~--~~~~~~~~~~~~~-------~~~~~~~~~ 300 (469) .-.|.|.+......+..-+.+..-..+.+...+.|-.++ +|.. .++.+.....++. .+++.+.++ T Consensus 175 ~~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~~~ls~e~~~~l~~~~~~~~G~~N~~~~~v~~~~ 254 (350) T protein:vir:11 175 DINQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTDAAQNEEDIDALRTALKTAKGPGNFRNLFVYAPN 254 (350) T ss_pred CCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeeecCC Confidence 124667665554444432222222233344455554443 4421 1111222222221 122333333 Q ss_pred CCCCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcc----c---cCCccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 
301 GNGDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANF----E---SSNASGVAIKMLYSHLELKAAKTQTYF 371 (469) Q Consensus 301 ~~~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~ 371 (469) +. ..++++..... ....+.+..+....+|+..-++|+.-.+ + +|++...+..+... T Consensus 255 g~--~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~llGi~~~~t~~~sn~e~~~~~f~~~------------- 319 (350) T protein:vir:11 255 GK--KEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQLMGVVPQNAGGFGSISDAAAVWASL------------- 319 (350) T ss_pred CC--ccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcCCHHHHHHHHHHH------------- Confidence 32 33455543332 3445677777888889999899864221 1 22222222222221 Q ss_pred HHHHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCH Q lcl|NC_010179. 372 EHAINELVRAIMRYLNFSDADKRHISQHWTRTKVEDS 408 (469) Q Consensus 372 ~~~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~ 408 (469) .|.-+++.+.++...-+.+ .+.|.+...... T Consensus 320 --~L~P~~~~ie~ln~~l~~~----~~~F~~~~~~~l 350 (350) T protein:vir:11 320 --ELAPMQTRLQQVNEMIGEE----VVRFAQFDAPGL 350 (350) T ss_pred --HHHHHHHHHHHHHhhcCcc----ccccCcccccCC Confidence 2222222222111100000 123332222222 No 268 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=22.38 E-value=2.5 Score=18.44 Aligned_cols=285 Identities=9% Similarity=0.038 Sum_probs=106.0 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccchhhhcccccccccccCcceeccchHHHHHHHHHHhhh Q lcl|NC_010179. 1 MELDALKKLIRNTSTSRNDLINNYKKSVDYYENKTDITTRNNGKPKVSKEGKKDPLRSADNRIPSNFYQLLVDQEAGYIA 80 (469) Q Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~~~ri~~n~~k~iv~~~~~~l~ 80 (469) |+-.++-..+.-+ .+-+||+- +|... ..... . | +..+..-++...+..+. 
T Consensus 65 ~~~~~~~~~~~~~------------~~~~~~~p--p~~~~------~La~~----~-----~-~~~~h~s~l~~k~n~l~ 114 (376) T protein:vir:10 65 MNRAEILDYVECW------------SNGEWFEP--PVSFA------GLAKS----F-----R-ASTHHSSALFFKANVLA 114 (376) T ss_pred cCcchhhhhhhhh------------hcCceecC--CCCHH------HHHHH----H-----h-hhHHhhhhHHHHhHHHH Confidence 4433322222111 11234442 11100 00000 0 0 01111112222223332 Q ss_pred cCCeeeccCchhhHHHHHHHHhccHHHHHHHHHHHHHhCCeEEEEEEEcCCCce-EEEEEccceeEEEEeCCCCCceEEE Q lcl|NC_010179. 81 SVFPDIDVGKDADNKKILDVLGDDRALTLNSLLVDSSNAGRAWLHYWIDEDNNF-RYGIIQPDQITPVYATTLDNKLLGV 159 (469) Q Consensus 81 g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~-~i~~~~p~~~~~~~d~~~~~~~~~~ 159 (469) +. +....-. . ...+.+++.+.+.+|.+|+.+-.+..|++ .+..++|..+-+..+.+ T Consensus 115 ~~---~~Pnp~l---------T---~~~f~~~v~d~ll~Gnay~~~~rn~~G~~~~L~pl~~~~vr~~~d~~-------- 171 (376) T protein:vir:10 115 ST---FRPHRWL---------S---RHAFERWALDFLTFGNGYLERRRNMVGGTLRLEPALAKYVRRKADFN-------- 171 (376) T ss_pred hc---cCCCCCC---------C---HHHHHHHHHHHHhcCCeEEEEEECCCCCEEEEEEeCCcceEEEeeCC-------- Confidence 21 1100000 0 11244566677889999999988888875 47888888766544432 Q ss_pred EEEEEeeecCCceEEEEEEEEcCCeEEEEEeecCceeecccccccccccccccccccccccccccCCcccEEEecC---- Q lcl|NC_010179. 160 LRSYKQLDPEAGKYFTVHEYWTDKEAQFFRTSATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFPK---- 235 (469) Q Consensus 160 v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n---- 235 (469) ++|.... .+ .. ..|..+.+ +++++ T Consensus 172 -~~~~~~~-~~-~~----~~~~~~eV---------------------------------------------iHir~~~~~ 199 (376) T protein:vir:10 172 -GFVYVNG-WQ-ER----HEFEPDSV---------------------------------------------FQLVRPDIN 199 (376) T ss_pred -eEEEEEc-CC-eE----EEEccccE---------------------------------------------EEecCCCCC Confidence 1111110 01 00 01122222 23321 Q ss_pred -CccccccHHHHHHHHHHHHHHHHHH-HHHHHHhcCceeE--EecCC--cccchhhhhhhhh-------cceeeecccCC Q lcl|NC_010179. 
236 -NKYRLAELNKYKGLIDAYDDIYNGF-INDLDDVQTVILV--LTNYG--GASLKQFMNDLRE-------YKSIKINNAGN 302 (469) Q Consensus 236 -~~~g~~~~~~v~~liD~~~~~~s~~-~~~~~~~~~p~l~--~~g~~--~~~~~~~~~~~~~-------~~~~~~~~~~~ 302 (469) .-.|.|.+...+.-++.-+ ....+ ...++..+.|=.+ ++|.. .+..+.....+.. .+++.+.+++. T Consensus 200 ~~~yGls~~~~a~~si~l~~-aa~~f~~~~f~NGa~pggIl~~~d~~l~~e~~~~lr~~~~~~~G~~N~~~~~vl~~~g~ 278 (376) T protein:vir:10 200 QEVYGLPEYLSSLHSAWLNE-SSTLFRRKYYENGSHAGFILYMTDAAQKQDDVDNMRDALKNAKGPGNFRNVFMYAPGGK 278 (376) T ss_pred CCcccccHHHHHHHHHHHHH-HHHHHHHHHHhccCCCceEEEecCCCCCHHHHHHHHHHHHHhcCccccCceeEecCCCC Confidence 1246666655544444322 22222 2333444455444 34421 1111222222221 12333333332 Q ss_pred CCCCcceEEeecC--CHHHHHHHHHHHHHHHHHHhCCCCcCcc----c---cCCccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_010179. 303 GDKSGVDKLQIDI--PVEARDDALKITRDNIFLFGQGIDPANF----E---SSNASGVAIKMLYSHLELKAAKTQTYFEH 373 (469) Q Consensus 303 ~~~~~~~~l~~~~--~~~~~~~~~~~l~~~i~~~s~~p~~~~~----~---~g~~Sg~Al~~~~~~l~~k~~~~~~~~~~ 373 (469) +.++++..... ....+.+..+...++|+..-++|+.-.+ + +|++....+.+... T Consensus 279 --~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp~llGi~~~~t~~~sn~eq~~~~f~~~--------------- 341 (376) T protein:vir:10 279 --KDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQLLGIVPSNSGGFGTPDTAARVFGRN--------------- 341 (376) T ss_pred --ccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHhcccCCCCCCcccHHHHHHHHHHH--------------- Confidence 23455554333 3455667777778889999899874221 1 12222222222111 Q ss_pred HHHHHHHHHHHHhcccCCCcccceEEeCCCCCCCHHHHHHHHHHH Q lcl|NC_010179. 374 AINELVRAIMRYLNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV 418 (469) Q Consensus 374 ~l~~~~~~i~~~~~~~~~~~~~i~i~f~~~~p~d~~e~~~~~~kl 418 (469) .|.-+++.+.++.+.-+.+ -+.|++.. ....-.|+ T Consensus 342 ~L~Pl~~~ieeln~~L~~~----~~~F~~~~------Llr~d~ka 376 (376) T protein:vir:10 342 EIRPLQARFAELNDWLGEE----VVRFDDYE------IPPAPVAA 376 (376) T ss_pred HHHHHHHHHHHHHhhcccc----ccccChhH------hhcccccC Confidence 1221122111111100111 14454332 11111111 Done!