Query lcl|NC_012753.1_cdsid_YP_002925099.1 [gene=st5093phage_16] [protein=minor capsid protein] [protein_id=YP_002925099.1] [location=11352..12860] Match_columns 502 No_of_seqs 125 out of 235 Neff 8.7 Searched_HMMs 1612 Date Thu Nov 7 12:51:16 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_16 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_16_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:3028 Length: 500 # 100.0 1E-140 8E-144 787.5 57.7 499 1-502 1-500 (500) 2 protein:vir:9815 Length: 500 # 100.0 1E-140 8E-144 787.5 57.7 499 1-502 1-500 (500) 3 protein:vir:4782 Length: 522 # 100.0 3E-139 2E-142 780.0 56.0 501 1-502 1-520 (522) 4 protein:vir:79703 Length: 505 100.0 9E-139 5E-142 777.4 57.1 495 1-502 1-505 (505) 5 protein:vir:98883 Length: 517 100.0 2E-136 1E-139 764.9 56.7 498 1-502 1-515 (517) 6 protein:vir:1587 Length: 508 # 100.0 2E-134 1E-137 754.0 57.5 494 1-502 1-508 (508) 7 protein:vir:80959 Length: 499 100.0 5E-129 3E-132 723.8 55.9 489 1-502 3-497 (499) 8 protein:vir:38 Length: 496 # N 100.0 1E-124 6E-128 700.4 56.1 489 1-502 3-494 (496) 9 protein:vir:78907 Length: 518 100.0 6E-108 4E-111 608.3 51.3 480 1-502 1-516 (518) 10 protein:vir:105461 Length: 470 100.0 6.1E-67 3.8E-70 383.6 45.2 448 2-502 1-469 (470) 11 protein:vir:5961 Length: 503 # 100.0 1.6E-66 1E-69 381.2 44.6 461 1-502 9-485 (503) 12 protein:vir:96240 Length: 511 100.0 1.2E-65 7.5E-69 376.5 46.4 452 1-502 39-498 (511) 13 protein:vir:99781 Length: 511 100.0 7E-65 4.3E-68 372.3 46.2 452 1-502 39-494 (511) 14 protein:vir:103951 Length: 511 100.0 4E-64 2.5E-67 368.1 46.5 452 1-502 39-498 (511) 15 protein:vir:78805 Length: 511 100.0 3.9E-64 2.4E-67 368.2 46.3 452 1-502 39-498 (511) 16 protein:vir:96366 Length: 511 100.0 3.9E-64 2.4E-67 368.2 46.3 452 1-502 39-498 (511) 17 protein:vir:9922 Length: 489 # 100.0 2.2E-63 1.3E-66 364.1 47.9 451 1-502 13-481 (489) 18 protein:vir:97171 Length: 512 100.0 1E-63 6.4E-67 365.9 45.9 451 1-502 31-495 (512) 19 protein:vir:9306 Length: 511 # 100.0 1E-63 6.4E-67 365.9 45.8 468 1-502 4-498 (511) 20 protein:vir:79043 Length: 479 100.0 2.2E-63 1.4E-66 364.1 45.8 454 1-502 7-478 (479) 21 protein:vir:2732 Length: 501 # 100.0 3.6E-63 2.3E-66 362.9 46.6 440 1-502 38-487 (501) 22 protein:vir:94498 Length: 474 100.0 1.5E-63 9.2E-67 365.0 44.2 446 1-502 7-462 (474) 23 protein:vir:97447 Length: 474 100.0 1.5E-63 9.2E-67 365.0 44.2 446 1-502 7-462 (474) 24 protein:vir:102950 Length: 471 100.0 1.7E-63 1E-66 364.8 43.6 443 1-502 1-469 (471) 25 protein:vir:4898 Length: 502 # 100.0 7.1E-63 4.4E-66 361.3 47.0 440 1-502 39-484 (502) 26 protein:vir:94546 Length: 506 100.0 3.4E-63 2.1E-66 363.1 44.7 465 1-502 3-493 (506) 27 protein:vir:96494 Length: 501 100.0 9.4E-63 5.8E-66 360.6 46.6 440 1-502 38-483 (501) 28 protein:vir:96839 Length: 474 100.0 5.1E-63 3.1E-66 362.1 44.4 449 1-502 1-465 (474) 29 protein:vir:96179 Length: 468 100.0 4.2E-63 2.6E-66 362.6 43.7 446 1-502 1-462 (468) 30 protein:vir:1236 Length: 483 # 100.0 3.5E-63 2.2E-66 363.0 43.3 430 1-502 34-471 (483) 31 protein:vir:95113 Length: 474 100.0 5.7E-63 3.6E-66 361.8 44.0 446 1-502 7-462 (474) 32 protein:vir:94805 Length: 492 100.0 7E-63 4.4E-66 361.3 43.6 448 1-502 4-480 (492) 33 protein:vir:107112 Length: 478 100.0 1.9E-62 1.2E-65 358.9 44.2 451 1-502 2-468 (478) 34 protein:vir:105292 Length: 478 100.0 1.1E-62 6.7E-66 360.3 42.7 450 1-502 1-467 (478) 35 protein:vir:106571 Length: 499 100.0 4.4E-62 2.7E-65 357.0 45.0 449 1-502 1-472 (499) 36 protein:vir:93747 Length: 472 100.0 2.6E-62 1.6E-65 358.2 43.6 447 1-502 5-460 (472) 37 protein:vir:96266 Length: 474 100.0 3.4E-62 2.1E-65 357.6 43.2 445 1-502 7-462 (474) 38 protein:vir:95899 Length: 474 100.0 3.4E-62 2.1E-65 357.6 43.2 445 1-502 7-462 (474) 39 protein:vir:97336 Length: 492 100.0 3.6E-62 2.3E-65 357.4 42.9 448 1-502 4-480 (492) 40 protein:vir:106639 Length: 481 100.0 4.9E-61 3E-64 351.2 48.3 456 1-502 6-474 (481) 41 protein:vir:3609 Length: 452 # 100.0 2.2E-61 1.4E-64 353.1 46.4 424 1-502 17-448 (452) 42 protein:vir:3964 Length: 453 # 100.0 4.3E-61 2.7E-64 351.5 45.7 427 1-502 11-452 (453) 43 protein:vir:78083 Length: 537 100.0 5.8E-61 3.6E-64 350.8 45.0 457 1-502 8-505 (537) 44 protein:vir:99522 Length: 470 100.0 8E-60 4.9E-63 344.6 48.6 437 1-502 19-468 (470) 45 protein:vir:733 Length: 453 # 100.0 4E-60 2.5E-63 346.2 46.7 443 1-499 1-453 (453) 46 protein:vir:9871 Length: 429 # 100.0 2.5E-60 1.6E-63 347.3 43.9 420 18-502 1-422 (429) 47 protein:vir:102330 Length: 451 100.0 4.3E-60 2.7E-63 346.0 42.9 425 18-496 1-451 (451) 48 protein:vir:94101 Length: 474 100.0 1.9E-59 1.2E-62 342.5 44.0 445 1-502 1-467 (474) 49 protein:vir:105889 Length: 474 100.0 1.9E-59 1.2E-62 342.5 44.0 445 1-502 1-467 (474) 50 protein:vir:95806 Length: 440 100.0 4.6E-59 2.8E-62 340.4 42.9 429 22-502 1-437 (440) 51 protein:vir:78537 Length: 480 100.0 1.1E-53 6.7E-57 311.0 43.4 443 1-502 1-466 (480) 52 protein:vir:78227 Length: 480 100.0 7.3E-53 4.5E-56 306.4 43.2 442 1-502 1-466 (480) 53 protein:vir:80680 Length: 441 100.0 1E-51 6.5E-55 300.1 46.4 429 1-497 3-441 (441) 54 protein:vir:2427 Length: 485 # 100.0 6.3E-52 3.9E-55 301.3 43.6 438 1-502 6-471 (485) 55 protein:vir:4223 Length: 486 # 100.0 5E-51 3.1E-54 296.4 44.8 437 1-502 1-471 (486) 56 protein:vir:104082 Length: 485 100.0 7.3E-51 4.5E-54 295.4 43.7 442 1-502 5-472 (485) 57 protein:vir:99072 Length: 479 100.0 9.2E-52 5.7E-55 300.4 38.0 436 14-502 1-458 (479) 58 protein:vir:102602 Length: 456 100.0 7.5E-51 4.6E-54 295.4 42.5 444 1-502 1-455 (456) 59 protein:vir:105819 Length: 456 100.0 7.5E-51 4.6E-54 295.4 42.5 444 1-502 1-455 (456) 60 protein:vir:2500 Length: 501 # 100.0 3.8E-51 2.4E-54 297.0 40.7 454 1-502 22-498 (501) 61 protein:vir:7987 Length: 456 # 100.0 2.6E-50 1.6E-53 292.4 44.1 440 1-498 4-456 (456) 62 protein:vir:7768 Length: 484 # 100.0 3.4E-50 2.1E-53 291.8 41.9 442 1-502 1-468 (484) 63 protein:vir:2341 Length: 488 # 100.0 7.1E-50 4.4E-53 290.0 42.7 440 1-502 7-480 (488) 64 protein:vir:98444 Length: 434 100.0 1.7E-46 1E-49 271.6 41.6 414 55-501 1-434 (434) 65 protein:vir:99916 Length: 504 100.0 2.2E-45 1.4E-48 265.4 42.8 444 1-502 1-487 (504) 66 protein:vir:9751 Length: 422 # 100.0 1.4E-41 8.7E-45 244.5 38.9 412 1-487 1-422 (422) 67 protein:vir:9568 Length: 410 # 100.0 1.1E-40 6.6E-44 239.7 40.9 399 32-490 1-410 (410) 68 protein:vir:8184 Length: 474 # 100.0 7.3E-39 4.5E-42 229.7 39.6 444 1-502 12-474 (474) 69 protein:vir:94742 Length: 409 100.0 2.4E-39 1.5E-42 232.4 36.4 398 1-472 1-409 (409) 70 protein:vir:1634 Length: 409 # 100.0 5.1E-39 3.1E-42 230.5 34.1 398 1-472 1-409 (409) 71 protein:vir:101494 Length: 527 100.0 2.1E-37 1.3E-40 221.7 36.1 463 14-502 1-513 (527) 72 protein:vir:102239 Length: 527 100.0 2.4E-37 1.5E-40 221.3 36.1 463 14-502 1-513 (527) 73 protein:vir:7430 Length: 563 # 100.0 3.6E-37 2.2E-40 220.4 36.0 464 14-502 1-528 (563) 74 protein:vir:94956 Length: 452 99.8 8.1E-18 5E-21 114.3 40.5 423 1-498 1-452 (452) 75 protein:vir:93630 Length: 776 99.7 4.8E-17 3E-20 110.1 32.7 457 1-502 44-655 (776) 76 protein:vir:97265 Length: 513 99.7 5.6E-16 3.5E-19 104.2 37.7 437 18-502 1-491 (513) 77 protein:vir:80453 Length: 535 99.7 1.3E-14 8.3E-18 96.7 40.6 448 1-502 32-535 (535) 78 protein:vir:95149 Length: 501 99.7 2.2E-14 1.3E-17 95.5 41.6 438 1-502 1-495 (501) 79 protein:vir:96783 Length: 488 99.7 6.7E-15 4.2E-18 98.3 36.7 435 1-487 14-488 (488) 80 protein:vir:8846 Length: 705 # 99.7 1.9E-15 1.2E-18 101.3 32.3 449 1-502 10-588 (705) 81 protein:vir:95014 Length: 491 99.6 4.2E-13 2.6E-16 88.5 37.7 435 17-502 1-481 (491) 82 protein:vir:78393 Length: 489 99.6 6.4E-13 4E-16 87.4 39.2 435 17-502 1-479 (489) 83 protein:vir:80165 Length: 651 99.5 4.5E-12 2.8E-15 82.8 36.7 457 1-502 1-618 (651) 84 protein:vir:80040 Length: 461 99.5 2.2E-13 1.4E-16 90.0 27.9 427 1-496 1-461 (461) 85 protein:vir:79538 Length: 502 99.5 7.4E-12 4.6E-15 81.6 40.6 430 1-502 1-499 (502) 86 protein:vir:108295 Length: 711 99.5 1.1E-11 6.8E-15 80.7 36.1 480 1-502 1-644 (711) 87 protein:vir:104437 Length: 714 99.4 1.2E-11 7.3E-15 80.5 34.1 461 1-502 1-611 (714) 88 protein:vir:95542 Length: 548 99.4 1.4E-11 8.7E-15 80.1 40.9 436 1-502 1-504 (548) 89 protein:vir:5249 Length: 437 # 99.4 5.1E-12 3.1E-15 82.5 30.6 401 1-502 1-433 (437) 90 protein:vir:2764 Length: 714 # 99.4 2.7E-11 1.7E-14 78.5 35.9 458 1-502 6-611 (714) 91 protein:vir:9950 Length: 714 # 99.4 2.7E-11 1.7E-14 78.5 35.9 458 1-502 6-611 (714) 92 protein:vir:817 Length: 714 # 99.4 2.7E-11 1.7E-14 78.5 35.9 458 1-502 6-611 (714) 93 protein:vir:3296 Length: 714 # 99.4 2.7E-11 1.7E-14 78.5 35.9 458 1-502 6-611 (714) 94 protein:vir:10117 Length: 714 99.4 2.7E-11 1.7E-14 78.5 35.9 458 1-502 6-611 (714) 95 protein:vir:105619 Length: 772 99.4 2.8E-11 1.7E-14 78.5 32.9 473 1-502 1-635 (772) 96 protein:vir:95449 Length: 584 99.3 5.5E-12 3.4E-15 82.3 26.1 441 1-502 1-560 (584) 97 protein:vir:95821 Length: 763 99.3 1.5E-10 9.5E-14 74.4 33.7 457 1-502 26-654 (763) 98 protein:vir:107662 Length: 427 99.2 1.5E-11 9.2E-15 79.9 21.9 401 1-502 1-426 (427) 99 protein:vir:77597 Length: 725 99.2 8.7E-10 5.4E-13 70.2 31.5 460 1-502 1-590 (725) 100 protein:vir:104338 Length: 422 99.2 5.7E-11 3.5E-14 76.7 22.9 390 1-496 1-422 (422) 101 protein:vir:107742 Length: 537 99.2 1.3E-10 7.8E-14 74.9 24.4 437 1-502 25-523 (537) 102 protein:vir:9263 Length: 725 # 99.1 1.1E-09 6.7E-13 69.7 28.8 459 1-502 1-588 (725) 103 protein:vir:172 Length: 708 # 99.1 1.1E-09 6.8E-13 69.7 28.8 480 1-502 1-633 (708) 104 protein:vir:105429 Length: 708 99.1 1.7E-09 1.1E-12 68.7 31.0 458 1-502 6-600 (708) 105 protein:vir:96068 Length: 765 99.1 1.2E-10 7.2E-14 75.1 21.6 429 1-502 37-519 (765) 106 protein:vir:96738 Length: 505 99.1 2.2E-09 1.4E-12 68.0 39.5 434 1-502 8-497 (505) 107 protein:vir:79647 Length: 435 99.1 2.2E-10 1.4E-13 73.5 21.6 401 1-502 1-433 (435) 108 protein:vir:99563 Length: 862 99.0 2.8E-09 1.7E-12 67.5 26.0 435 1-502 66-575 (862) 109 protein:vir:389 Length: 530 # 99.0 6.3E-09 3.9E-12 65.5 39.6 435 1-502 1-520 (530) 110 protein:vir:94049 Length: 532 99.0 4.5E-09 2.8E-12 66.3 24.8 442 1-502 17-510 (532) 111 protein:vir:3420 Length: 533 # 98.9 1.1E-08 6.8E-12 64.2 40.1 432 1-502 3-523 (533) 112 protein:vir:100920 Length: 725 98.9 1.2E-08 7.6E-12 64.0 31.4 464 1-502 1-619 (725) 113 protein:vir:10321 Length: 495 98.9 1.4E-08 8.5E-12 63.7 36.8 430 1-502 1-495 (495) 114 protein:vir:6382 Length: 553 # 98.9 1.5E-08 9E-12 63.6 37.6 452 1-502 2-553 (553) 115 protein:vir:3139 Length: 599 # 98.9 8.6E-09 5.3E-12 64.8 25.2 472 1-501 1-599 (599) 116 protein:vir:102668 Length: 547 98.9 1.7E-08 1.1E-11 63.2 31.1 444 1-502 1-546 (547) 117 protein:vir:3520 Length: 720 # 98.9 2.3E-08 1.4E-11 62.5 29.1 469 1-502 1-624 (720) 118 protein:vir:102855 Length: 432 98.9 2.4E-08 1.5E-11 62.3 28.1 404 1-502 1-430 (432) 119 protein:vir:105002 Length: 432 98.9 2.4E-08 1.5E-11 62.3 28.1 404 1-502 1-430 (432) 120 protein:vir:107605 Length: 432 98.9 2.4E-08 1.5E-11 62.3 28.1 404 1-502 1-430 (432) 121 protein:vir:94599 Length: 641 98.8 2.8E-08 1.7E-11 62.0 29.9 454 1-502 20-600 (641) 122 protein:vir:105520 Length: 706 98.8 3.7E-08 2.3E-11 61.3 32.0 471 1-502 1-599 (706) 123 protein:vir:1380 Length: 422 # 98.8 6.3E-08 3.9E-11 60.1 27.3 405 1-502 1-422 (422) 124 protein:vir:80644 Length: 551 98.7 1.2E-07 7.2E-11 58.6 25.8 431 1-502 5-521 (551) 125 protein:vir:95315 Length: 559 98.7 1.2E-07 7.3E-11 58.6 30.0 459 1-502 1-541 (559) 126 protein:vir:81152 Length: 411 98.7 1.4E-07 8.7E-11 58.2 30.3 387 1-502 1-410 (411) 127 protein:vir:1538 Length: 535 # 98.6 2E-07 1.2E-10 57.4 35.8 446 1-502 1-521 (535) 128 protein:vir:103765 Length: 549 98.6 2.4E-07 1.5E-10 56.9 25.5 460 1-502 1-539 (549) 129 protein:vir:3361 Length: 535 # 98.6 2.6E-07 1.6E-10 56.7 34.9 446 4-502 1-515 (535) 130 protein:vir:102080 Length: 429 98.6 2.7E-07 1.7E-10 56.6 25.8 400 1-502 1-427 (429) 131 protein:vir:1785 Length: 555 # 98.6 2.8E-07 1.7E-10 56.5 32.0 445 7-502 1-534 (555) 132 protein:vir:7407 Length: 392 # 98.4 6.6E-07 4.1E-10 54.5 27.7 378 1-502 3-389 (392) 133 protein:vir:79772 Length: 648 98.4 7.4E-07 4.6E-10 54.2 33.5 438 1-502 8-490 (648) 134 protein:vir:4454 Length: 414 # 98.4 7.4E-07 4.6E-10 54.2 29.3 392 1-502 1-410 (414) 135 protein:vir:107404 Length: 555 98.4 9.6E-07 6E-10 53.6 38.3 463 1-502 1-541 (555) 136 protein:vir:98506 Length: 555 98.4 9.6E-07 6E-10 53.6 38.3 463 1-502 1-541 (555) 137 protein:vir:107822 Length: 555 98.4 9.6E-07 6E-10 53.6 38.3 463 1-502 1-541 (555) 138 protein:vir:63755 Length: 547 98.3 1.2E-06 7.2E-10 53.1 29.5 430 1-502 1-517 (547) 139 protein:vir:6240 Length: 457 # 98.3 1.2E-06 7.3E-10 53.1 27.1 408 1-502 1-445 (457) 140 protein:vir:4952 Length: 386 # 98.3 1.4E-06 8.8E-10 52.6 28.0 378 1-502 1-385 (386) 141 protein:vir:94572 Length: 535 98.3 1.5E-06 9.3E-10 52.5 35.5 445 1-502 1-521 (535) 142 protein:vir:1266 Length: 416 # 98.3 1.5E-06 9.4E-10 52.5 28.5 392 21-502 1-415 (416) 143 protein:vir:99672 Length: 532 98.2 2.5E-06 1.5E-09 51.3 30.1 443 1-502 1-532 (532) 144 protein:vir:10447 Length: 536 98.2 2.8E-06 1.7E-09 51.0 35.8 441 1-502 1-521 (536) 145 protein:vir:3843 Length: 397 # 98.2 2.9E-06 1.8E-09 50.9 27.5 384 1-502 1-395 (397) 146 protein:vir:8883 Length: 543 # 98.2 3E-06 1.8E-09 50.9 31.7 445 1-502 1-523 (543) 147 protein:vir:81072 Length: 432 98.1 4.4E-06 2.7E-09 50.0 29.2 401 1-502 7-432 (432) 148 protein:vir:1326 Length: 457 # 98.1 4.9E-06 3E-09 49.7 29.7 408 1-502 1-450 (457) 149 protein:vir:78696 Length: 542 98.1 5.1E-06 3.2E-09 49.6 36.6 450 7-502 1-540 (542) 150 protein:vir:94709 Length: 522 98.0 6E-06 3.7E-09 49.2 39.0 442 1-502 1-514 (522) 151 protein:vir:102118 Length: 409 98.0 6E-06 3.7E-09 49.2 27.7 390 21-502 1-408 (409) 152 protein:vir:1023 Length: 392 # 98.0 6.1E-06 3.8E-09 49.2 27.1 378 1-502 3-389 (392) 153 protein:vir:3989 Length: 392 # 98.0 6.1E-06 3.8E-09 49.2 27.1 378 1-502 3-389 (392) 154 protein:vir:4194 Length: 540 # 98.0 6.4E-06 3.9E-09 49.1 29.0 411 1-502 6-474 (540) 155 protein:vir:3153 Length: 467 # 98.0 7E-06 4.3E-09 48.8 34.7 384 66-502 1-439 (467) 156 protein:vir:104500 Length: 537 98.0 7.3E-06 4.5E-09 48.8 25.2 447 14-502 1-508 (537) 157 protein:vir:78942 Length: 510 98.0 8.9E-06 5.5E-09 48.3 36.0 431 7-499 1-510 (510) 158 protein:vir:100249 Length: 431 98.0 9.1E-06 5.6E-09 48.2 26.8 398 1-497 1-431 (431) 159 protein:vir:2198 Length: 536 # 97.9 1.1E-05 6.5E-09 47.9 34.1 442 1-502 1-521 (536) 160 protein:vir:8418 Length: 409 # 97.9 1.1E-05 6.8E-09 47.8 27.2 382 1-499 1-409 (409) 161 protein:vir:100187 Length: 385 97.9 1.1E-05 7.1E-09 47.7 24.1 372 1-502 1-384 (385) 162 protein:vir:95378 Length: 406 97.9 1.2E-05 7.6E-09 47.5 25.4 378 1-502 1-401 (406) 163 protein:vir:10362 Length: 432 97.9 1.3E-05 8E-09 47.4 28.9 397 1-502 7-425 (432) 164 protein:vir:7321 Length: 556 # 97.8 1.6E-05 1E-08 46.8 34.3 458 1-502 1-535 (556) 165 protein:vir:6322 Length: 510 # 97.8 1.7E-05 1E-08 46.8 39.1 439 7-499 1-510 (510) 166 protein:vir:104259 Length: 403 97.7 2.3E-05 1.4E-08 46.0 27.6 375 1-502 1-403 (403) 167 protein:vir:100039 Length: 522 97.7 2.9E-05 1.8E-08 45.5 31.7 431 1-502 1-513 (522) 168 protein:vir:4854 Length: 386 # 97.7 3E-05 1.9E-08 45.3 28.8 373 1-502 1-385 (386) 169 protein:vir:103177 Length: 533 97.6 3.3E-05 2E-08 45.2 23.0 444 15-502 1-516 (533) 170 protein:vir:6896 Length: 523 # 97.6 3.3E-05 2.1E-08 45.1 24.0 430 1-499 1-523 (523) 171 protein:vir:106282 Length: 521 97.6 3.9E-05 2.4E-08 44.7 28.9 423 1-499 47-521 (521) 172 protein:vir:101189 Length: 516 97.6 4.4E-05 2.7E-08 44.5 26.4 435 1-499 1-516 (516) 173 protein:vir:101806 Length: 516 97.6 4.4E-05 2.7E-08 44.5 26.4 435 1-499 1-516 (516) 174 protein:vir:100882 Length: 383 97.6 4.4E-05 2.7E-08 44.5 23.9 371 1-502 1-381 (383) 175 protein:vir:2683 Length: 412 # 97.5 5.1E-05 3.2E-08 44.1 29.7 390 1-502 1-410 (412) 176 protein:vir:97060 Length: 432 97.5 5.2E-05 3.2E-08 44.1 29.5 397 1-502 7-426 (432) 177 protein:vir:6210 Length: 394 # 97.5 5.4E-05 3.3E-08 44.0 24.7 373 1-502 1-393 (394) 178 protein:vir:80796 Length: 574 97.4 6.8E-05 4.2E-08 43.4 28.2 434 1-502 1-511 (574) 179 protein:vir:96980 Length: 409 97.4 7.3E-05 4.5E-08 43.3 27.8 385 1-502 4-407 (409) 180 protein:vir:4156 Length: 542 # 97.4 8.2E-05 5.1E-08 43.0 27.0 412 3-502 1-460 (542) 181 protein:vir:105782 Length: 449 97.3 9.1E-05 5.6E-08 42.7 22.2 411 1-502 1-443 (449) 182 protein:vir:4828 Length: 382 # 97.3 0.00011 6.6E-08 42.4 27.5 375 1-502 1-381 (382) 183 protein:vir:6596 Length: 521 # 97.2 0.00012 7.5E-08 42.0 30.8 440 1-499 2-521 (521) 184 protein:vir:93943 Length: 409 97.2 0.00013 8E-08 41.9 28.9 389 1-502 4-407 (409) 185 protein:vir:9359 Length: 348 # 97.1 0.00017 1E-07 41.3 24.3 327 87-502 1-346 (348) 186 protein:vir:483 Length: 413 # 97.1 0.00017 1.1E-07 41.2 28.5 389 21-502 1-408 (413) 187 protein:vir:103458 Length: 524 97.1 0.00018 1.1E-07 41.2 27.1 426 1-499 1-524 (524) 188 protein:vir:7208 Length: 524 # 97.1 0.00018 1.1E-07 41.1 27.0 426 1-499 1-524 (524) 189 protein:vir:1082 Length: 359 # 96.9 0.00028 1.7E-07 40.1 28.3 350 1-474 1-359 (359) 190 protein:vir:106999 Length: 564 96.9 0.00028 1.7E-07 40.0 26.8 439 1-502 13-536 (564) 191 protein:vir:7853 Length: 518 # 96.8 0.00032 2E-07 39.8 28.8 400 10-502 1-431 (518) 192 protein:vir:94426 Length: 409 96.8 0.00036 2.2E-07 39.5 29.9 389 1-502 4-407 (409) 193 protein:vir:960 Length: 413 # 96.7 0.00037 2.3E-07 39.4 26.2 388 1-499 1-413 (413) 194 protein:vir:100598 Length: 516 96.7 0.00043 2.7E-07 39.0 25.3 449 1-499 1-516 (516) 195 protein:vir:9408 Length: 441 # 96.6 0.00048 3E-07 38.8 28.7 396 13-502 1-441 (441) 196 protein:vir:79984 Length: 441 96.6 0.00048 3E-07 38.8 28.7 396 13-502 1-441 (441) 197 protein:vir:101648 Length: 518 96.6 0.0005 3.1E-07 38.7 28.6 400 10-502 1-440 (518) 198 protein:vir:81218 Length: 423 96.6 0.00051 3.2E-07 38.6 28.0 393 1-502 1-419 (423) 199 protein:vir:81017 Length: 521 96.5 0.00052 3.2E-07 38.6 31.0 439 1-499 2-521 (521) 200 protein:vir:4995 Length: 384 # 96.5 0.00055 3.4E-07 38.5 29.9 372 21-500 1-384 (384) 201 protein:vir:103330 Length: 517 96.4 0.00064 4E-07 38.1 37.1 424 1-498 1-517 (517) 202 protein:vir:100150 Length: 437 96.4 0.00064 4E-07 38.1 30.2 397 1-502 1-428 (437) 203 protein:vir:5665 Length: 511 # 96.4 0.0007 4.3E-07 37.9 25.2 430 1-499 32-511 (511) 204 protein:vir:78589 Length: 695 96.3 0.00076 4.7E-07 37.7 20.8 396 1-502 92-546 (695) 205 protein:vir:189 Length: 424 # 96.3 0.00079 4.9E-07 37.6 28.5 382 1-497 14-424 (424) 206 protein:vir:101541 Length: 694 96.2 0.00088 5.5E-07 37.3 20.8 396 1-502 91-545 (694) 207 protein:vir:3648 Length: 695 # 96.2 0.00091 5.6E-07 37.3 20.5 396 1-502 92-546 (695) 208 protein:vir:108049 Length: 524 96.2 0.00091 5.6E-07 37.3 28.2 416 1-499 49-524 (524) 209 protein:vir:80211 Length: 514 96.1 0.00095 5.9E-07 37.1 33.8 429 11-502 1-514 (514) 210 protein:vir:1884 Length: 424 # 96.1 0.00096 5.9E-07 37.1 28.8 383 1-497 14-424 (424) 211 protein:vir:102727 Length: 945 96.1 0.00098 6.1E-07 37.1 30.5 427 1-502 25-535 (945) 212 protein:vir:101647 Length: 460 96.1 0.0011 6.6E-07 36.9 30.4 400 3-496 1-460 (460) 213 protein:vir:99853 Length: 488 96.0 0.0012 7.2E-07 36.7 26.7 381 29-502 1-407 (488) 214 protein:vir:7017 Length: 515 # 95.9 0.0013 7.9E-07 36.4 32.4 430 7-502 1-510 (515) 215 protein:vir:99312 Length: 563 95.9 0.0013 8.4E-07 36.3 26.2 426 7-502 1-523 (563) 216 protein:vir:95599 Length: 563 95.9 0.0013 8.4E-07 36.3 26.2 426 7-502 1-523 (563) 217 protein:vir:4598 Length: 416 # 95.8 0.0014 8.6E-07 36.2 28.8 387 1-502 1-413 (416) 218 protein:vir:81095 Length: 416 95.8 0.0014 8.6E-07 36.2 28.8 387 1-502 1-413 (416) 219 protein:vir:4337 Length: 434 # 95.7 0.0016 1E-06 35.9 26.6 400 3-496 1-434 (434) 220 protein:vir:98396 Length: 441 95.5 0.0019 1.2E-06 35.5 29.3 397 13-502 1-441 (441) 221 protein:vir:105641 Length: 516 95.5 0.0019 1.2E-06 35.5 32.5 430 1-502 1-512 (516) 222 protein:vir:105064 Length: 421 95.3 0.0023 1.4E-06 35.1 30.3 390 1-502 1-420 (421) 223 protein:vir:8317 Length: 409 # 95.1 0.0028 1.7E-06 34.6 24.5 384 1-499 1-409 (409) 224 protein:vir:104892 Length: 558 95.0 0.0029 1.8E-06 34.5 30.1 436 1-502 7-523 (558) 225 protein:vir:3868 Length: 417 # 94.9 0.0032 2E-06 34.2 27.3 377 21-502 1-413 (417) 226 protein:vir:103219 Length: 201 94.8 0.00061 3.8E-07 38.2 8.2 181 253-497 1-201 (201) 227 protein:vir:103860 Length: 528 94.6 0.004 2.5E-06 33.7 29.8 410 1-502 1-439 (528) 228 protein:vir:93610 Length: 454 94.5 0.0043 2.7E-06 33.6 28.5 397 3-502 1-434 (454) 229 protein:vir:100691 Length: 535 94.4 0.0046 2.8E-06 33.4 32.5 424 1-502 1-511 (535) 230 protein:vir:5839 Length: 533 # 94.0 0.0057 3.5E-06 32.9 26.3 407 1-502 17-479 (533) 231 protein:vir:5737 Length: 419 # 93.7 0.0067 4.2E-06 32.5 27.8 387 21-502 1-418 (419) 232 protein:vir:98265 Length: 524 93.6 0.007 4.3E-06 32.4 29.4 436 1-499 4-524 (524) 233 protein:vir:106716 Length: 698 93.6 0.0071 4.4E-06 32.4 21.9 409 1-502 92-553 (698) 234 protein:vir:79063 Length: 491 93.5 0.0073 4.5E-06 32.3 27.9 392 1-502 3-417 (491) 235 protein:vir:99232 Length: 526 93.2 0.0083 5.1E-06 32.0 30.9 407 1-502 1-437 (526) 236 protein:vir:96988 Length: 516 92.9 0.0096 5.9E-06 31.7 28.8 430 1-502 1-511 (516) 237 protein:vir:4509 Length: 424 # 92.8 0.01 6.2E-06 31.5 26.3 387 14-496 1-424 (424) 238 protein:vir:8100 Length: 466 # 92.7 0.01 6.4E-06 31.5 29.7 418 1-502 1-466 (466) 239 protein:vir:4089 Length: 395 # 92.6 0.011 6.6E-06 31.4 23.3 376 1-502 1-391 (395) 240 protein:vir:345 Length: 663 # 92.4 0.011 7E-06 31.2 30.2 443 1-502 1-596 (663) 241 protein:vir:79233 Length: 526 92.4 0.012 7.2E-06 31.2 31.6 408 1-502 1-437 (526) 242 protein:vir:101289 Length: 395 92.3 0.012 7.3E-06 31.2 22.4 367 21-502 1-393 (395) 243 protein:vir:9507 Length: 395 # 92.3 0.012 7.3E-06 31.2 22.4 367 21-502 1-393 (395) 244 protein:vir:100650 Length: 395 92.3 0.012 7.3E-06 31.2 22.4 367 21-502 1-393 (395) 245 protein:vir:107880 Length: 491 91.8 0.014 8.7E-06 30.7 31.4 383 1-502 15-411 (491) 246 protein:vir:1431 Length: 419 # 91.0 0.018 1.1E-05 30.2 26.4 387 21-502 1-416 (419) 247 protein:vir:78641 Length: 278 90.2 0.022 1.4E-05 29.7 23.3 269 87-437 1-278 (278) 248 protein:vir:1986 Length: 512 # 90.2 0.022 1.4E-05 29.6 27.4 409 1-502 1-445 (512) 249 protein:vir:80134 Length: 403 89.2 0.028 1.7E-05 29.1 25.0 374 1-502 1-401 (403) 250 protein:vir:78161 Length: 355 88.9 0.03 1.8E-05 28.9 20.4 292 152-502 1-322 (355) 251 protein:vir:80333 Length: 419 86.2 0.047 2.9E-05 27.9 29.5 388 1-502 1-413 (419) 252 protein:vir:96579 Length: 576 85.2 0.054 3.4E-05 27.5 30.3 430 3-502 1-526 (576) 253 protein:vir:95965 Length: 385 83.8 0.066 4.1E-05 27.1 22.1 365 14-502 1-385 (385) 254 protein:vir:9702 Length: 406 # 78.3 0.12 7.2E-05 25.7 28.5 375 1-502 1-402 (406) 255 protein:vir:9641 Length: 395 # 72.3 0.18 0.00011 24.6 22.9 366 1-502 1-392 (395) 256 protein:vir:108215 Length: 469 70.2 0.21 0.00013 24.3 27.8 398 26-502 1-451 (469) 257 protein:vir:78310 Length: 376 62.9 0.33 0.0002 23.3 21.9 359 1-502 1-374 (376) 258 protein:vir:98643 Length: 395 56.9 0.45 0.00028 22.5 22.0 368 21-502 1-392 (395) 259 protein:vir:99452 Length: 651 54.8 0.49 0.00031 22.3 26.3 434 1-502 46-538 (651) 260 protein:vir:94002 Length: 378 40.1 0.99 0.00061 20.6 17.9 350 1-502 1-378 (378) 261 protein:vir:98816 Length: 446 23.8 2.3 0.0014 18.6 25.6 395 1-475 1-446 (446) 262 protein:vir:94666 Length: 723 20.8 2.7 0.0017 18.2 31.6 383 23-502 1-436 (723) 263 protein:vir:94869 Length: 378 20.5 2.8 0.0017 18.2 22.5 355 1-502 1-378 (378) No 1 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=100.00 E-value=1.3e-140 Score=787.51 Aligned_cols=499 Identities=66% Similarity=1.038 Sum_probs=476.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |++|++||+|||++++.+..++|++|++|.+++++++|+++|+.|++||+|+++++.+++..+....++++|+|+|+.|| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:30 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999999999998889999999999999999999999999999999999999888888888889999999999999 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQ 160 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~ 160 (502) +++|+|||++|++|++++++.+++|++++++|+|..++.++++.|+++|++|++||||+++++|++++|++++|+++|++ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~ 160 (500) T protein:vir:30 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQ 160 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC-eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 161 DVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE-TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~-~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) ++.+++|+++.++..++..++||++|+|+|+++ +|+|+|++|++.+...+|++|||+++|++++++.+++|+++|+|+| T Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~ 240 (500) T protein:vir:30 161 DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTY 240 (500) T ss_pred CeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEE Confidence 999999999888877777789999999999876 6999999999999999999999999999999999999999999999 Q ss_pred ecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhc Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYE 319 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~ 319 (502) |++|.+|+++.+||+|+|+|++++++||+||++||+++|+|++++++|+||++|++.++++.++...+.+.|+.+.++|. T Consensus 241 ~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~ 320 (500) T protein:vir:30 241 LKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYI 320 (500) T ss_pred ecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCCCcceEE Confidence 99999999999999999999999999999999999999999999999999999999988887777777778898999998 Q ss_pred cccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 320 QFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEK 399 (502) Q Consensus 320 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~ 399 (502) .+...++ ++.+|++++|+||+++|.++++.++++++++||||+++||++++|.+|||||++++++|+++++++++.|+. T Consensus 321 ~~~~~~~-~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~ 399 (500) T protein:vir:30 321 RMGGRDL-DSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQ 399 (500) T ss_pred EcCCCCC-cCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8876543 456899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHH Q lcl|NC_012753. 400 SLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQK 479 (502) Q Consensus 400 ~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~r 479 (502) +|++|+++|+++++.++++++......+++|+|+|++++|++++++++++++++|+||+++||+++|||+||||++|++| T Consensus 400 al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eeea~~~l~~ 479 (500) T protein:vir:30 400 SLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEKAQEIAAE 479 (500) T ss_pred HHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHHHHHHHHH Confidence 99999999999999998888888888899999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 480 INDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 480 i~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |++|+++..+.++ +..|+||| T Consensus 480 i~~E~~~~~~~~~--~~~~~~g~ 500 (500) T protein:vir:30 480 INTGIVDEINQQR--TDTHLYGE 500 (500) T ss_pred HHHhccccCCCCC--ccccccCC Confidence 9999988777764 58899999 No 2 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=100.00 E-value=1.3e-140 Score=787.51 Aligned_cols=499 Identities=66% Similarity=1.038 Sum_probs=476.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |++|++||+|||++++.+..++|++|++|.+++++++|+++|+.|++||+|+++++.+++..+....++++|+|+|+.|| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (500) T protein:vir:98 1 MGVIQKIKNLVTRSKYVMTTQSLTNITDHPKIAISKLEYDRITTNLKYYKSDWDSVLYLNTDGETKKRDLNHLPIARTAA 80 (500) T ss_pred CchHHHHHHHHHHHHHHhhcchhhhhhccccccCCHHHHHHHHHHHHHhcCCCCCcccccCCCCcccCceeecchHHHHH Confidence 99999999999999998889999999999999999999999999999999999999888888888889999999999999 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQ 160 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~ 160 (502) +++|+|||++|++|++++++.+++|++++++|+|..++.++++.|+++|++|++||||+++++|++++|++++|+++|++ T Consensus 81 ~~~A~lv~~e~~~i~~~d~~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~I~~v~ad~~~P~~~d~~ 160 (500) T protein:vir:98 81 KKIASLVFNEQAEIKVDDDAANEFISETLKNDRFNKNFERYLESCLALGGLAMRPYVDGDKVRVAFVQAPVFLPLQSNTQ 160 (500) T ss_pred HHHhhhhcCCcceEecCChHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeeEEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC-eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 161 DVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE-TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~-~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) ++.+++|+++.++..++..++||++|+|+|+++ +|+|+|++|++.+...+|++|||+++|++++++.+++|+++|+|+| T Consensus 161 ~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~ 240 (500) T protein:vir:98 161 DVSSAAVVIKSVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTY 240 (500) T ss_pred CeEEEEEEEEEeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEE Confidence 999999999888877777789999999999876 6999999999999999999999999999999999999999999999 Q ss_pred ecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhc Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYE 319 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~ 319 (502) |++|.+|+++.+||+|+|+|++++++||+||++||+++|+|++++++|+||++|++.++++.++...+.+.|+.+.++|. T Consensus 241 ~~~~~~N~~~~~sp~G~S~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~~~~~~d~~~~~~~ 320 (500) T protein:vir:98 241 LKTPGMNNKDINSPLGLSIFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPESLTALTVRTTDGDVVPRPRFESDQNVYI 320 (500) T ss_pred ecCCccccccCCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhCcceeeechHHhcccCCCCCccccCCcccCCCcceEE Confidence 99999999999999999999999999999999999999999999999999999999988887777777778898999998 Q ss_pred cccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 320 QFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEK 399 (502) Q Consensus 320 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~ 399 (502) .+...++ ++.+|++++|+||+++|.++++.++++++++||||+++||++++|.+|||||++++++|+++++++++.|+. T Consensus 321 ~~~~~~~-~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~g~~TAtei~s~~~~~~~t~~~~~~~~~~ 399 (500) T protein:vir:98 321 RMGGRDL-DSSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGKSMKTATEIVSENSDTYQMRNSIVALVEQ 399 (500) T ss_pred EcCCCCC-cCcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcCccccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8876543 456899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHH Q lcl|NC_012753. 400 SLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQK 479 (502) Q Consensus 400 ~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~r 479 (502) +|++|+++|+++++.++++++......+++|+|+|++++|++++++++++++++|+||+++||+++|||+||||++|++| T Consensus 400 al~~lv~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~v~aGi~s~~~~i~~~~g~~eeea~~~l~~ 479 (500) T protein:vir:98 400 SLKELVISIFEIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAELDYWIKVVNAGFGTREMAIQKVLNVTEEKAQEIAAE 479 (500) T ss_pred HHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCHHHHHHHHHH Confidence 99999999999999998888888888899999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 480 INDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 480 i~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |++|+++..+.++ +..|+||| T Consensus 480 i~~E~~~~~~~~~--~~~~~~g~ 500 (500) T protein:vir:98 480 INTGIVDEINQQR--TDTHLYGE 500 (500) T ss_pred HHHhccccCCCCC--ccccccCC Confidence 9999988777764 58899999 No 3 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=100.00 E-value=3.1e-139 Score=779.97 Aligned_cols=501 Identities=58% Similarity=0.962 Sum_probs=470.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |++|++||+|||+|++++..+++++|++|++|+++++|+.+|+.|++||+|+++.+.+++..+++.+++++|+|+|+.|| T Consensus 1 m~~~~~~k~~~~k~~~~~~~~~~~~i~~~~~i~~~~~~~~~i~~~~~~y~g~~~~~~~~~~~~~~~~~~~~slnl~~~i~ 80 (522) T protein:vir:47 1 MSLFQKVKDFFSRGRYYMQTSNLNSILEHPKIAVTQEEYDRIKRNLVYYQSKWDDVQYKNTDGDIKSRPMNHLPIARTAS 80 (522) T ss_pred CchHHHHHHHHHHHHHHhhcccchhccccCCCCCCHHHHHHHHHHHHHhcCCcccccccccCcchhcccceecchHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQ 160 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~ 160 (502) +++|+|+|++|++|+++|+..+++|++++++|+|..++.++++.|+++|++||+||||+++++|.+|+|++++|++++++ T Consensus 81 ~~~A~lv~~e~~~i~v~d~~~~~~l~~~l~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~i~~v~ad~~~P~~~~~~ 160 (522) T protein:vir:47 81 KKIASLVYNEQATITTKNEILQKFLDDMLTNDRFNKNFERYLESCLALGGLAMRPYIDGDKVRVAFIQAPVFFPLESNTQ 160 (522) T ss_pred HHHhhhhcCCcceeecCChHHHHHHHHHHhhcchHHHHHHHHHHhhccCCEEEEEEEcCCceEEEEEcCCceEEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC------------eEEEEEEEEecCCccccCceeecccc--ccCCCcc Q lcl|NC_012753. 161 DVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE------------TYTISNELYESESKTIIGQRVPLSTL--YEDLEET 226 (502) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~------------~~~I~~~l~~~~~~~~lG~~v~l~~~--~~~l~~~ 226 (502) ++.+++|++++++.+++.++|||++|+|+|.++ +|+|+|++|++.+..++|.+|||+++ |++|++. T Consensus 161 ~~~e~a~~~~~~~~~~~~~~~yt~lE~he~~~~~~~~~~~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~ 240 (522) T protein:vir:47 161 DVSSAAILTKTIKSEGRKNVYYTLVEFHEWVTADGQETGSTNDKKYYRITNELYRSDVNDVLGQRVNLSELDKYKNLEPV 240 (522) T ss_pred ceEEEEEEEEEEeecccceeEEEEEEEeeecccccccccccccCCceEEEEEEeecCCCcccCccccccccccccCCCCc Confidence 999999999999999999999999999998654 69999999999999999999999998 8899999 Q ss_pred eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC Q lcl|NC_012753. 227 VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT 306 (502) Q Consensus 227 ~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~ 306 (502) ++++|+++|+|+||++|.+|+++.+||+|+|+|++++++||+||++||+++|+|++|+.+|+||++||+...++.++... T Consensus 241 ~~~~~~~~Plf~y~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~g~~~ 320 (522) T protein:vir:47 241 TVFENLSRPLFTYLKTPGMNNKDINSPLGLSIFDNAKTTIDFINRSYDEFMWEVRMGQRRVIVPEHLTQRQYQRPDGTID 320 (522) T ss_pred eEeCCCCcceEEEecCCcccccccCCCcCCchhhhhHHHHHHHHHHHHHHHHHHHhccceeecchHHhccCCCCCCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999998888888777 Q ss_pred ccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHH Q lcl|NC_012753. 307 VKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDT 386 (502) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l 386 (502) ..+.|+.+.++|.++..+++ ++.+|++++|+||+++|.++++.+++.|+++||||+++||+++++.+|||||++++++| T Consensus 321 ~~~~fd~~~~~f~~~~~~~~-~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~~~kTAtEi~s~~~~~ 399 (522) T protein:vir:47 321 FRPRFDVEQNVYMQIGGSSM-DAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQGMKTATEIVSENSDT 399 (522) T ss_pred cccccCcccceEeecCCCCC-CCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCccccccccHHHHHHHHHHH Confidence 77789999999999876554 35579999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Q lcl|NC_012753. 387 YQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTL 466 (502) Q Consensus 387 ~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~ 466 (502) +++++++++.|+.+|++|+++|+++++.++++++......+++|+|+|++++|++++++++++++++|+||+++||+++| T Consensus 400 ~~t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~~~~~~v~aG~~s~e~~i~~~~ 479 (522) T protein:vir:47 400 YQMRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELDYWAKMVAAGFSTKKRAIGKTL 479 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Confidence 99999999999999999999999999999888888888889999999999999999999999999999999999999999 Q ss_pred CCCHHHHHHHHHHHHHhhhcccCCCCCccc-----cCCCCC Q lcl|NC_012753. 467 NVTKEQAQEIYQKINDETMVSTDSFRTSEE-----VDIYGE 502 (502) Q Consensus 467 ~~~deea~~el~ri~~E~~~~~~~~~~~~~-----~~~~g~ 502 (502) ||||+||++|++||++|++++.|...+..+ ..-.|| T Consensus 480 g~~eeea~~el~ri~~E~~~~~~~~~~~~~~~~~~~~~~d~ 520 (522) T protein:vir:47 480 NISGVEAEKELNAINSELLPMNDAELAIYGMHDQNEEKADD 520 (522) T ss_pred CCChHHHHHHHHHHHHhhccCCCCCCCCCCCCCcccccCCC Confidence 999999999999999998865433221110 111122 No 4 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=100.00 E-value=8.8e-139 Score=777.45 Aligned_cols=495 Identities=36% Similarity=0.614 Sum_probs=464.4 Q ss_pred CChhHHHHHHHHHHhhcc-cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVI-TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~-~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) |++|++||+|||+|.++. ..++|++|++|.+|+++++|+++|+.|++||.|+++++++++..+....++++|+|+|+.| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~i~d~~~i~~~~~~~~~i~~~~~~Y~g~~~~l~~~~~~~~~~~~~~~slnl~~~i 80 (505) T protein:vir:79 1 MAFWDTLKNLFRKGSAAVGMTKSLGQIIDDPRINLPADEVERIARDKRYYMDDFKQVTHKNSYGDTQKHELQSVNVTKLA 80 (505) T ss_pred CchHHHHHHHHHHhhhhhcchhhhhhhhcccCCCCCHHHHHHHHHHHHHhcCCCccccccccCCCccccceeecchHHHH Confidence 999999999999988755 5688999999999999999999999999999999999999888888899999999999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcC Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANT 159 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~ 159 (502) |+++|+|||++|++|++++++.+++|++++++|+|..+++++++.|+++|++||+||||+++++|.+++|++++|+++|+ T Consensus 81 ~~~~A~ll~~e~~~i~~~d~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~~~k~~~D~~~~~i~~v~ad~~~P~~~d~ 160 (505) T protein:vir:79 81 SAKLASLIFNEQCQVTVSDETANDFLDDVFQQNDFYTTFEEKLEEWIALGSGCVRPYVDSGKIKLAWATADQVYPLQADT 160 (505) T ss_pred HHHHHhhhcCCCceeecCChHHHHHHHHHHHhccHHHHHHHHHHHHhhcCCeEEEEEEeCCceEEEEEcCCeeEEEEEcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc--ccCCCcceeecCCCcceE Q lcl|NC_012753. 160 QDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL--YEDLEETVTLNGLTRPLF 237 (502) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~--~~~l~~~~~~~~~~~~~f 237 (502) +++.+++|+.++++.+++.+.|||++|+|+|++++|+|+|++|++.+.+.+|.+||++++ |++++++++++|+++|+| T Consensus 161 ~~~~~~a~~~~~~~~~~~~~~~yt~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f 240 (505) T protein:vir:79 161 NQVNELAIASRTTEVENHRTIYYTLLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLF 240 (505) T ss_pred CCeEEEEEEEEEEEecCCcceEEEEEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceE Confidence 999999999999998888889999999999999999999999999999999999999988 789999999999999999 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCc-cccccccch Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTV-KREFETGHN 316 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~-~~~~~~~~~ 316 (502) +||++|.+|+++.+||+|+|+|++++++||+||++||+++|+|++|+++|+||++||+.+++++|..... .+.|+.+.+ T Consensus 241 ~~~~~~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~~~~~~fd~~~~ 320 (505) T protein:vir:79 241 AFYRNKGANNKNFTSPMGMSLIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQASETHPPMFDPDET 320 (505) T ss_pred EEecCCcccccccCCccCCchhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccccccccCCCccce Confidence 9999999999999999999999999999999999999999999999999999999999998888765543 345888889 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) +|..+..+++ +.++++++|+||+++|.++++.++++|+++||+|+++||++++|.+|||||++++++|+++++++++. T Consensus 321 ~y~~~~~~~~--~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~TAtei~s~~~~l~~t~~~~~~~ 398 (505) T protein:vir:79 321 VYQAMYGDAS--EVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPSGIQTATEVVTNNSQTYQTRSSYITQ 398 (505) T ss_pred eeeeccCCCC--CCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCccccchHHHHHHHHhHHHHHHHHHHHH Confidence 9988876544 45799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCC------CcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTG------EIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~------~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) |+.+|++|+++|++++..+.+.+. ...+..+++|+|+|++++|+++++++.++++++|+||++++|+++++||| T Consensus 399 ~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~v~~Gi~s~e~~l~~~~~~~e 478 (505) T protein:vir:79 399 VEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRAADLQAVQAQVMPKKQFLMRNYGLDE 478 (505) T ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHHhcCCCCh Confidence 999999999999999998765442 33445689999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 471 EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +||++|++||++|++..+|+ -.+++|| T Consensus 479 eea~~el~ri~~E~~~~~p~-----~~~~gg~ 505 (505) T protein:vir:79 479 EEADEWLAQIDAENSTAEPE-----FNQFGGD 505 (505) T ss_pred HHHHHHHHHHHHhccccCCC-----chhccCC Confidence 99999999999999875544 3667788 No 5 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=100.00 E-value=1.7e-136 Score=764.95 Aligned_cols=498 Identities=43% Similarity=0.756 Sum_probs=469.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |++|++||+|||++++++..+++++++++.+|+++++|+.+|..|++||+|+++++++++..+....++++|+|+|+.|| T Consensus 1 m~~~~~ik~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~I~~w~~~Y~g~~~~~~~~~~~~~~~~~~~~sl~~~~~i~ 80 (517) T protein:vir:98 1 MKVIQRIKNFFKRGGYALSGQTLKSINDHEKINIDPNELARIERNLRQYEGDYPQVEYINSQGKIQERDYMTLNLRKLSA 80 (517) T ss_pred CchHHHHHHHHHHHHHHhcccchhHhhcCCceecCHHHHHHHHHHHHHhcCCCcccccccccccccccceeecCcHHHHH Confidence 99999999999999999999999999999999999999999999999999999999998888888899999999999999 Q ss_pred HHHhhhhhcCcceEeeCCH-----------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNE-----------VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQA 149 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~-----------~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~ 149 (502) +++|++||+||++|++++. ..+++|++++++|+|..++.++++.|+++|++||+||||+++++|++|+| T Consensus 81 ~~~A~Ll~~e~~~i~v~d~~~~~~~~~~~~~~~e~l~~i~~~n~f~~~~~~~~e~a~a~G~~a~k~~~d~~~~~I~~v~a 160 (517) T protein:vir:98 81 DVLSGLVFNEQCEVYVSDAKDEEKKDNSFKTAHEFIQHVFQHNKFIKNLSDYLEPTFALGGLTVRPYVDNGEIEFSWALA 160 (517) T ss_pred HHhhhhhcCCcceEEecccccccccccchhHHHHHHHHHHHhccHHHHHHHHHHHHhhhCCEEEEEEEeCCeeEEEEEcC Confidence 9999999999999999863 37899999999999999999999999999999999999999999999999 Q ss_pred CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeC-----CeEEEEEEEEecCCccccCceeeccccccCCC Q lcl|NC_012753. 150 TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNK-----ETYTISNELYESESKTIIGQRVPLSTLYEDLE 224 (502) Q Consensus 150 ~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~-----~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~ 224 (502) ++++|+.++++++.+++|++..++..+++..+||++|+|+|++ ++|+|+|++|++.+...+|.+|||+++|++|+ T Consensus 161 d~~~Pl~~~~~~v~~~ai~~~~~~~~~~~~~~Yt~lE~H~~~~~~~~~~~y~I~n~ly~s~~~~~lG~~v~L~~~~e~l~ 240 (517) T protein:vir:98 161 NAFYPLRSNSNGISEGVMKSVTTKVIGNKTVYYTLLEFHEWEKTEEGESLYVITNELYKSDNEGEIGKRIPLEELYEGMQ 240 (517) T ss_pred CeeEEEEecCCCeEEEEEEEEEEEeecCCceEEEEEEEEecCceeccCCcEEEEEEEEecCCCccccccccccccccCCC Confidence 9999999999999999999888888888888999999999975 67999999999999999999999999999999 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~ 304 (502) +.++++|+++|+|+||++|.+|+++.+||+|+|+|++++++||+||++||+++|+|++|+++|+||++||+.+++++|.. T Consensus 241 ~~~~~~g~~~Plf~y~~~p~~N~~~~~splG~S~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~vp~~~l~~~~~~~g~~ 320 (517) T protein:vir:98 241 EKTYIQGLSRPLFNYLKPSGFNNINPHSPLGLGITDNSVSTLKKINDTYDQFWWEIKMGQRTVFVSDVMLRTVPDESGMP 320 (517) T ss_pred cceeECCCCcceEEEecCCcccccccCCCCCCchhhhhHHHHHHHHHHHHHHHHHHHhCCcceecChhhhccccCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999888876654 Q ss_pred cCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 305 VTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) +...|+.+.++|..+..++ ++.++++++|+||+++|.++++.+|++|+++||+|+++||+++.+.+|||||+++++ T Consensus 321 --~~~~~d~~~~~y~~~~~~~--~~~~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~~~kTATEi~s~~~ 396 (517) T protein:vir:98 321 --PPQVFDPDVNVYKSIRMGT--DEEFVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGRSMKTATEIVSEND 396 (517) T ss_pred --cCCCCCcccceeeeccCCC--CCCceeeeccccchHHHHHHHHHHHHHHHHHhCCCcccccccccccccHHHHHHHHH Confidence 3456888899999887654 346799999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) +++++++++++.|+.+|++|+++|++++++++++++......+++|+|+|++++|++++++++++++++|+||+++||++ T Consensus 397 ~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~~~~v~aG~ms~~~~i~~ 476 (517) T protein:vir:98 397 LTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFYGQAKTFGFIPTVEAIQR 476 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHH Confidence 99999999999999999999999999999999999988888899999999999999999999999999999999999999 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcccCCC-CCccccCCCCC Q lcl|NC_012753. 465 TLNVTKEQAQEIYQKINDETMVSTDSF-RTSEEVDIYGE 502 (502) Q Consensus 465 ~~~~~deea~~el~ri~~E~~~~~~~~-~~~~~~~~~g~ 502 (502) +|||+|+||++|++||++|++++.|.. ..+.+.++.|+ T Consensus 477 ~~g~~eeeA~~e~~~i~~E~~~~~~~~~~~~~~~~~~gd 515 (517) T protein:vir:98 477 IFKVPKKTAEQWLEEIRKDQIELDPVTISQRAQKRMFGD 515 (517) T ss_pred hCCCChHHHHHHHHHHHHhccccCCCCccccccCCCCCC Confidence 999999999999999999998765432 23446668877 No 6 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=100.00 E-value=1.6e-134 Score=754.04 Aligned_cols=494 Identities=43% Similarity=0.724 Sum_probs=453.3 Q ss_pred CChhHHHHHHHHHHhhcc-cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVI-TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~-~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) |++|++||+|||++.++. ..++|++|++|.+|+++++|+.+|+.|++||.|+++++.++...+.+..++++|+|+|+.| T Consensus 1 m~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~ri~~~~~~y~g~~~~~~~~~~~~~~~~~~~~sln~~~~i 80 (508) T protein:vir:15 1 MGLIQRIKDLFWKGAAATGVTGSLSKITDDPRISIDPDEYVRIQTDLDYYSDKLQYIHYQASDGIKKKRLKNTINMAKTA 80 (508) T ss_pred CChHHHHHHHHHHHHHHhccccchHHhhcccccccCHHHHHHHHHHHHHhcCCCcccccccCCCCccccceeecchHHHH Confidence 999999999999977654 6699999999999999999999999999999999999999888888888889999999999 Q ss_pred HHHHhhhhhcCcceEeeC-CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVD-NEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQAN 158 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~-d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d 158 (502) |+++|+|||++|++|+++ ++..+++|++++++|+|..+++++++.|+++|++|++||||+++++|++|+|++++|++++ T Consensus 81 ~~~~A~lv~~e~~~i~v~~~~~~~e~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~v~ad~~~P~~~d 160 (508) T protein:vir:15 81 ARRIASVVFNEKAEIHVKDNNEADKFLNDVLEDNDFKNKFEEALEKGVALGGFAMRPYIDGNHIKIAWVRADQFYPLQSN 160 (508) T ss_pred HHHHHhhhhCCCceEEeCCchHHHHHHHHHHHhccHHHHHHHHHHHHhhcCceEEEEEEeCCeeEEEEEcCCeeEEEEEc Confidence 999999999999999994 5667789999999999999999999999999999999999999999999999999999999 Q ss_pred CCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeC-CeEEEEEEEEecCCccccCceeecccc--ccCCCcceeecCCCcc Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNK-ETYTISNELYESESKTIIGQRVPLSTL--YEDLEETVTLNGLTRP 235 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~-~~~~I~~~l~~~~~~~~lG~~v~l~~~--~~~l~~~~~~~~~~~~ 235 (502) ++++++++|+.++++.++.++++||++|+|+|.+ ++|+|+|++|++.+..++|.+|||+++ |++++++++++|+++| T Consensus 161 ~~~~~~~af~~~~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p 240 (508) T protein:vir:15 161 TNDISEAAIASRTQRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSPDIVGNQVPLSTLPVYKELAPQVTISGLQRP 240 (508) T ss_pred CCCeEEEEEEEEEEeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCchhcCcccchhhcccccCCCcceEecCCCcc Confidence 9999999999999998888889999999999865 689999999999999999999999988 7799999999999999 Q ss_pred eEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccc Q lcl|NC_012753. 236 LFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGH 315 (502) Q Consensus 236 ~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~ 315 (502) +|+||++|.+|+++.+||+|+|+|++++++||+||++||+++|+|++++.+|+||+++++.++++. +.|+.+. T Consensus 241 ~f~y~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~d~~~~-------~~~~~~~ 313 (508) T protein:vir:15 241 LFAYFKTPGANNINIESPLGLGVVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPGMLRFDDEHK-------PTFDTEQ 313 (508) T ss_pred eeEEecCCccccccCCCCcCCchHhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcCCCCCc-------cccCCCC Confidence 999999999999999999999999999999999999999999999999999999999998765532 3467778 Q ss_pred hhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 316 NVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIAT 395 (502) Q Consensus 316 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~ 395 (502) ++|..+..+++ ++.+|+++||+||+++|.++++.++++|++.||+|+++||+++++.+|||||++++++|+++++++++ T Consensus 314 ~~~~~~~~~~~-~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~~~~TAtei~s~~~~~~~t~~~~~~ 392 (508) T protein:vir:15 314 NVYVGVLSDDN-NGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSNDGVKTATEVVSNNSMTYQTRSSYLT 392 (508) T ss_pred eeEEeccCCCC-CCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccCccccHHHHHHHHHHHHHHHHHHHH Confidence 88888776543 45679999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccCCCc--------ccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Q lcl|NC_012753. 396 LVEKSLKELVISILELAKVYNLYTGEI--------PTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLN 467 (502) Q Consensus 396 ~~~~~l~~l~~~il~~~~~~~~~~~~~--------~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~ 467 (502) .|+.+|++|+++|+++++.+.+.+++. ..+.+++|+|+|++++|++++++++++++++|+||++++|++++| T Consensus 393 ~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~~~~~~~~v~aGi~s~e~~i~~~~g 472 (508) T protein:vir:15 393 MVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQLEEDAKVLAIGALSKQTFLQRNYG 472 (508) T ss_pred HHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Confidence 999999999999999999877665542 345679999999999999999999999999999999999999999 Q ss_pred CCHHHHHHHHHHHHHhhhcccCCCC-CccccCCCCC Q lcl|NC_012753. 468 VTKEQAQEIYQKINDETMVSTDSFR-TSEEVDIYGE 502 (502) Q Consensus 468 ~~deea~~el~ri~~E~~~~~~~~~-~~~~~~~~g~ 502 (502) ||||||++|++||++|+++..+... .....|--|| T Consensus 473 ~~deea~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 473 MTDEQAAEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred CChHHHHHHHHHHHHhccccCccccccccCCCCCCC Confidence 9999999999999999876543322 1222333466 No 7 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=100.00 E-value=5.5e-129 Score=723.77 Aligned_cols=489 Identities=34% Similarity=0.648 Sum_probs=450.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC--CCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS--NGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~--~~~~~~~~~~~~n~~k~ 78 (502) =+|+++||+++++|++ .+++++++++.+|+++++++.+|..|++||.|++++|..+.. .+++..++++|+|+|++ T Consensus 3 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~s~n~~~~ 79 (499) T protein:vir:80 3 NQIIAGVKGVMRRMGL---LKSLKDVTDHKKVNANDEDYKYIDMWKRLYQGNYAEWHNLNYEHNGNPVNRRQLSMNLPKV 79 (499) T ss_pred hHHHHHHHHHHHHhcc---ccchhhhhcCCCCcCCHHHHHHHHHHHHHhcCCcchhhccccccCCCccccceeecchHHH Confidence 3566677777777644 578999999999999999999999999999999998876543 35567788999999999 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEE Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQA 157 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~ 157 (502) ||+++|+||||+|++|++++++.+++|++++++|+|+.++.++++.|+++|++|++||||. ++++|.+++|+++||+++ T Consensus 80 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~a~~~~Pi~~ 159 (499) T protein:vir:80 80 TAKYMSKLLFNEKVKINIDDETAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSN 159 (499) T ss_pred HHHHHHHhhhCCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCcEEEEEEECCCCcEEEEEEcCCceEEEEe Confidence 9999999999999999999999999999999999999999999999999999999999985 689999999999999999 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC---eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE---TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~---~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) +++++++++|+..+.. .+++||++|+|+|.+. .|+|+|.+|++.+...+|.+||++++|+++++..+++|+++ T Consensus 160 d~~~~~~~~f~~~~~~----~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~~~lG~~v~l~~~~~~~~~~~~~~~~~~ 235 (499) T protein:vir:80 160 DSENVDECLIANSFHK----NNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDPNELGGKVSLKLLFNDIEPVVPLPSLTR 235 (499) T ss_pred cCCCeEEEEEEEEEee----cCeEEEEEEEEEecccceeeEEEEEEEEeccCccccCcccchhhhccCcCCceeecCCCc Confidence 9999999999876543 3468999999999874 68999999999999999999999999999999999999999 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) |||+||++|.+|+++.+||+|+|+|+++++|||+||+++|+++|+|++++.+|+||++||+..++.+|... +.|+.+ T Consensus 236 p~f~~~~~~~~N~~~~~splG~S~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~---~~~~~~ 312 (499) T protein:vir:80 236 PTFIYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT---QYFDST 312 (499) T ss_pred cceEeecCCccccccCCCccCCchHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcc---cCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999998887777643 467888 Q ss_pred chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~ 394 (502) +++|..+...+++++.+|++++|+||+|+|.++++.++++|+++||+|+++||++++|.+|||||++++++|++++++++ T Consensus 313 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g~~TAtei~s~~~~l~~~~~~~~ 392 (499) T protein:vir:80 313 DEAFFLYQGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHS 392 (499) T ss_pred cceeeEeeccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCcccchhHHHHHHHHHHHHHHHHHHH Confidence 99999888877777789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQ 474 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~ 474 (502) +.|+++|++|+++|+++++++...++......+++|+|+|++++|++++++++++++++|+||++|+|+++++++|+||+ T Consensus 393 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d~ea~ 472 (499) T protein:vir:80 393 QLIEQGIKEMIVSILEVGKLIKAYDGDTVELDTITVDFDDSIAQDEDTTINRYTTAKNQGMIPLKIALQRAWNITEAEAD 472 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHhccccCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHHcCCCCHHHHHhhcCCCChHHHH Confidence 99999999999999999999888777777778999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 475 EIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 475 ~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|++||++|++.+.|. ++.+|++|| T Consensus 473 ~el~~i~~E~~~~~~~---~d~~g~~ge 497 (499) T protein:vir:80 473 EWAEMLAKEKQAEIPN---NDMTGIFGE 497 (499) T ss_pred HHHHHHHHHhhcCCCC---CCccccCCC Confidence 9999999999876544 345689999 No 8 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=100.00 E-value=1e-124 Score=700.39 Aligned_cols=489 Identities=33% Similarity=0.648 Sum_probs=450.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc--CCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD--SNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~--~~~~~~~~~~~~~n~~k~ 78 (502) =+|+++||+||++|+ ..+++++++++.+++++++++.+|..|++||.|+|+++.++. ..+++..++++|+|+|++ T Consensus 3 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~~~~~~~~n~~k~ 79 (496) T protein:vir:38 3 NQIIAGVKGVMRRMG---LLKALKDVKDHKKVNANDEDYKYIDMWKRLYQGHYAEWHNLNYEHNGNPVNRRQLSMNLPKV 79 (496) T ss_pred hHHHHHHHHHHHHhc---cchhhHHHHhcCCCcCCHHHHHHHHHHHHHhcCCCchhhcchhccCCCccccceeecchHHH Confidence 356666777776654 458899999999999999999999999999999999887654 345667788999999999 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEE Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQA 157 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~ 157 (502) ||+++|+||||+||+|++++++.+++|++++++++|.+++.++++.|+++|++|++||+|. +++++++++|+++||+++ T Consensus 80 i~~~~a~~l~~~p~~i~~~d~~~~e~l~~~~~~n~f~~~~~~~~~~a~~~G~~~~~~~~D~~~~~~i~~v~~~~~~P~~~ 159 (496) T protein:vir:38 80 TAKYMSKLLFNEKVKINIDDKAAEEFVLNVLKTNGFTKNMERYIEYGEAMGGFVIKVYHDGNKNVKVSFATADCMYPLSN 159 (496) T ss_pred HHHHHhhhhhCCcceEeeCChHHHHHHHHHHhccCHHHHHHHHHHHHhhhCcEEEEEEEcCCCcEEEEEEcccceEEEEe Confidence 9999999999999999999999999999999999999999999999999999999999995 689999999999999999 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) +++++.+++|+..+.. .+++|+++|+|+|++++|+|+|.+|++.+...+|++||++++|+++++...++|+++||| T Consensus 160 ~~~~~~~~~f~~~~~~----~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~~~~~~~~~~~~~~~~~f 235 (496) T protein:vir:38 160 DSENVDECVIANSFHK----NNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLFDDIEPVVPLPDFTRPTF 235 (496) T ss_pred cCCcEEEEEEEEEEEe----CCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccccccccceeecCCCcceE Confidence 9999999999876543 346899999999999999999999999999999999999999999999999999999999 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchh Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNV 317 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~ 317 (502) +||++|.+|+.+.++|+|+|+|++++++||+||+++|+++|+|++++++|+||+++++..++..|... +.|+.++++ T Consensus 236 ~~~~~~~~N~~~~~~p~G~Sd~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~---~~~~~~~~~ 312 (496) T protein:vir:38 236 IYIKPNIANNKNLTSPLGISVYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT---QYFDSTDEA 312 (496) T ss_pred EEecCCcccccccCCcCCCchHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccc---cCCCCccce Confidence 99999999999999999999999999999999999999999999999999999999998887777643 457788888 Q ss_pred hccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV 397 (502) Q Consensus 318 ~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~ 397 (502) |..+..++++++.+++.++++||+++|+++++.+++++++.+|+|+++||++++|.+||||+++++++|+++++++++.| T Consensus 313 ~~~~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g~~tAtei~~~~~~l~~~~~~~~~~~ 392 (496) T protein:vir:38 313 FFLYQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENGLKTATEVVSEKSETYQTKNSHSQLI 392 (496) T ss_pred EEEeecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccccchHHHHHHHHHHHHHHHHHHHHHH Confidence 88888877777789999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHH Q lcl|NC_012753. 398 EKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIY 477 (502) Q Consensus 398 ~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el 477 (502) +++|++|+++|+++++.+..+++......+++|+|+|++|.|++++++++++++++|+||++|+|+++++++|+||++|+ T Consensus 393 ~~~l~~l~~~il~~~~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~GiiS~et~l~~~~~~~d~ea~~el 472 (496) T protein:vir:38 393 EQGIKEMIVSILEVGKFIEAYSGEVVELDTITVDFDDSIAQDEDTTINRYTNAKNQGMIPLKIALQRAWNITEAEADEWA 472 (496) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCCccceEEEeCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCChHHHHHHH Confidence 99999999999999998888887777788899999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 478 QKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 478 ~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +||++|+.++.+ .++.++++|| T Consensus 473 ~ri~~E~~~~~~---~~d~~~~~~~ 494 (496) T protein:vir:38 473 EMLAKEKQAEMP---NNDMNGIFGE 494 (496) T ss_pred HHHHHhhhccCc---cccccCCCCC Confidence 999999987654 3455678888 No 9 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=100.00 E-value=6.2e-108 Score=608.34 Aligned_cols=480 Identities=14% Similarity=0.107 Sum_probs=383.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccc--------cCCCcccccccee Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYR--------DSNGSQVKRDFNH 72 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~--------~~~~~~~~~~~~~ 72 (502) |+|++.||++||. ++++.+ ++.++.+|..++++|.+....+.++ ....++..++++| T Consensus 1 ~~~~~~~~~~i~~-w~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~ 65 (518) T protein:vir:78 1 MGVWSVMTRFIKG-WLNGKP--------------NGSEPELIPKYLPLVPDNQKEWSKDSYLTSLWAQGYVPTVHDKLMN 65 (518) T ss_pred CcchhhHHHHHHH-hhcCCC--------------CccchhccHHHhhhcccchhhhhhhhhhhhhcccCCCCcccccccc Confidence 9999999999975 333222 2234445555555554442221111 1224566778999 Q ss_pred cchHHHHHHHHhhhhhcCcceEee------CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEE Q lcl|NC_012753. 73 LPIGRTASKKVASLVFNEQATIRV------DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSF 146 (502) Q Consensus 73 ~n~~k~iv~~~a~~l~~ep~~i~~------~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~ 146 (502) +|+|+.||+++|+|||++|++|++ +++.++++|++++++|+|..++.++++.|+++|++|++||||+++++|.+ T Consensus 66 ~~l~~~i~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~~l~~il~~n~f~~~~~~~~e~a~a~G~~~~k~~~d~~~~~i~~ 145 (518) T protein:vir:78 66 SGTGNEIVVVAAEYISGKPLSIDVTGVNGSKDENLTKQLKEALRIDNFDSKSVKIVELAGGSGVSAVKINILNGRPSISV 145 (518) T ss_pred CChHHHHHHHHHHhhcCCCceEEecCccccCcHHHHHHHHHHHHhccHHHHHHHHHHHhhccCceEEEEEEECCeeEEEE Confidence 999999999999999999999988 46778999999999999999999999999999999999999999999999 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEE--------eCCeEEEEEEEEecCCcccc-Cceeecc Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEW--------NKETYTISNELYESESKTII-GQRVPLS 217 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~--------~~~~~~I~~~l~~~~~~~~l-G~~v~l~ 217 (502) |+|++++|++++ ++.++++|+..... +++..+||++|+|++ .+++|+|+|++|++...... +..+|+. T Consensus 146 v~ad~~~P~~~~-g~~~~~~f~~~~~~--~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~ 222 (518) T protein:vir:78 146 HSSSQFWIDFKN-NEPFRFNFFEEIPT--SNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLP 222 (518) T ss_pred EcCCeeEEEeec-CcEEEEEEEEEeec--CCcceeEEEEEeeccccccceeecccceeEEEEEeeecCcccccccccccc Confidence 999999999765 67888888865444 334568999999985 45679999999997533322 2333332 Q ss_pred ----c--cccCCCcceee-cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeec Q lcl|NC_012753. 218 ----T--LYEDLEETVTL-NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVP 290 (502) Q Consensus 218 ----~--~~~~l~~~~~~-~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~ 290 (502) . .|++++++... +|...|+|.+++|+.+|+++.+||||+|+|++++++||+||++||+++|+|++++.+|+|| T Consensus 223 ~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~splG~S~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~ 302 (518) T protein:vir:78 223 EQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLNLGESDLSQCTNYLFAVDYFFTVYMREGEKTKTKIAAS 302 (518) T ss_pred cccccccccccCccceeeccCCccceEEeeccccccccccCCCcCcchHhhhhHHHHHHHHHHHHHHHHHHhCCceeeec Confidence 2 24566666554 4544444444566678999999999999999999999999999999999999999999999 Q ss_pred hHHhccCCCCCCcccCccccccccchhhccccCCCC---ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcc Q lcl|NC_012753. 291 TQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDM---DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFS 367 (502) Q Consensus 291 ~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~ 367 (502) ++||+.++++.|.. +.+.|+.+.++|.++....+ +.+..|++++|+||+++|.++++.+++.++++||+|+++|| T Consensus 303 ~~~l~~~~~~~~~~--~~~~fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg 380 (518) T protein:vir:78 303 ERMFRKKVNKSTDK--EEWSMNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFN 380 (518) T ss_pred hhHhccCCCCCCCc--cccccCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcC Confidence 99999877765543 34568888899998876543 33457999999999999999999999999999999999999 Q ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc--CCCcccccceEEEeCCCccCCHHHHHH Q lcl|NC_012753. 368 FDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLY--TGEIPTMDEVSVDLDDGVFTDRNAEFD 445 (502) Q Consensus 368 ~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~--~~~~~~~~~i~v~f~d~i~~d~~~~~~ 445 (502) .+ ++.+|||||++++++|+++++++++.++.+|++|+++|++++..+... ........+++|+|+|++++|++++++ T Consensus 381 ~~-~~~~TATei~s~~~~~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~ 459 (518) T protein:vir:78 381 LG-NREVKATEIWSLQDATVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSS 459 (518) T ss_pred cc-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHH Confidence 86 467899999999999999999999999999999999999998876332 233445678999999999999999999 Q ss_pred HHHHHHhcCCCCHHHHHHhc-CCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 446 YWSKMVAAGFAPKTMAIEKT-LNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 446 ~~~~~~~~Gi~S~et~l~~~-~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++++++++|+||++++++++ ++|+|+||++|++||++|++......+ .+-+++.=+ T Consensus 460 ~~~~~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p-~~~~g~~~~ 516 (518) T protein:vir:78 460 TLNNMNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDP-EAIGGMETK 516 (518) T ss_pred HHHHHHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCC-ccccCCCCC Confidence 99999999999999999875 589999999999999999886432111 111111111 No 10 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=100.00 E-value=6.1e-67 Score=383.58 Aligned_cols=448 Identities=14% Similarity=0.135 Sum_probs=328.5 Q ss_pred ChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCC----------Cccccccce Q lcl|NC_012753. 2 GIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSN----------GSQVKRDFN 71 (502) Q Consensus 2 ~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~----------~~~~~~~~~ 71 (502) =.++.|+.+|.+. +....+.+.+|...++||.|+|+++...... ..+..++|+ T Consensus 1 ~~~~~~~~~i~~~-----------------~~~~~~~~~~~~~~~~Yy~g~~~I~~~~~~~~~~~~~~~~~~~~~~~~ki 63 (470) T protein:vir:10 1 MELDALKKLIQNT-----------------STSRNDLINNYKQAVNYYENKTDITTRNNGKAKLNKEGKKDPLRSADNRI 63 (470) T ss_pred CchHHHHHHHHHH-----------------HHHHHHHHHHHHHHHHHhccccchhccccchhcccccccccccccCCccc Confidence 1223455554432 1122466789999999999999988654321 233556799 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++||++.||++.|+||||+|++++++++..++.|+++++ ++|...+.++++.++++|.+|.++|+|. +.+++..++|. T Consensus 64 ~~n~~k~Iv~~~~~yl~G~p~~~~~~d~~~~~~l~~~~~-~~~~~~~~~l~~~~~~~G~a~~~~y~d~~~~~~~~~~~p~ 142 (470) T protein:vir:10 64 PSNFYQLLVDQEAGYVASVFPDIDVGKDADNKKIIDVLG-DDRALTLNGLLVDSSNAGRAWLHYWIDEDGNFRYGIIQPD 142 (470) T ss_pred ccchHHHHHHhhhhheeccceeeecCchHHHHHHHHHHh-hhHHHHHHHHHHHHhhcCeeEEEEEecCCCceEEEEEccc Confidence 999999999999999999999999999999999999997 4789999999999999999999999986 57999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCcccc--Cceeecccc---ccCCCc Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTII--GQRVPLSTL---YEDLEE 225 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~l--G~~v~l~~~---~~~l~~ 225 (502) ++||+|.++......++++.+...+.+...+++++|.|+-+...++. ..+...... ....+.... .++... T Consensus 143 ~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 218 (470) T protein:vir:10 143 QITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFR----TNATDSTVIEPYNIITSYDLSAGYETGQS 218 (470) T ss_pred ceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEE----eecCcceeccccccccccccccccccccc Confidence 99999877655445555555555555566677788887633333322 221111100 011111111 112223 Q ss_pred ceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCccc Q lcl|NC_012753. 226 TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKV 305 (502) Q Consensus 226 ~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~ 305 (502) ....++++++|+++|++| +.|+|+|+++++|||+||.++|+++++++.....++| +......... T Consensus 219 ~~~~~~~g~vPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv----l~g~~~~~~~-- 283 (470) T protein:vir:10 219 NTLKHNFGRVPFIEFSKN---------KYRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILV----LTNYGGADLH-- 283 (470) T ss_pred cccccCCCeeeEEEeecC---------CCCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCccee----eecCCccccc-- Confidence 345688999999999864 3589999999999999999999999999988888888 4332211111 Q ss_pred CccccccccchhhccccCCC--CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHH Q lcl|NC_012753. 306 TVKREFETGHNVYEQFDSGD--MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQ 383 (502) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~--~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~ 383 (502) .+......+..+...+ .+.+.+++.++.+++.+++...++.+.+.|...++.+. +++.+.|+.||.|+++++ T Consensus 284 ----~~~~~~~~~~~i~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~~~~gn~Sg~Alk~~~ 357 (470) T protein:vir:10 284 ----QFMNDLRKYKSIKINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGID--PANFESSNASGVAIKMLY 357 (470) T ss_pred ----hhhhhhhhcCeEeccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCC--CCccccccchHHHHHHHH Confidence 1222223333333332 23456688899999999999999988888877776654 344455778999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) +.+.++|+.+++.|+++|++++++|+.+++. ...+...++|+|++++|.|..+.+++++++ +|+||.||+++ T Consensus 358 ~~l~~k~~~~~~~~~~~l~~~~~~i~~~l~~------~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~--~g~iS~et~l~ 429 (470) T protein:vir:10 358 SHLELKAAKTQTYFEHAINELVRAIMRYLNF------SDADKRHISQHWTRTKVEDSLTKAQIVSTV--ANYSSKEAVAK 429 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------cCcccceeeEEeccCCCCCHHHHHHHHHHH--hccCcHHHHHH Confidence 9999999999999999999999999987543 223456899999999999999999999887 59999999987 Q ss_pred hcCCCCHHHHHHHHHHHHHhhhccc---CCCCCccccCCCCC Q lcl|NC_012753. 464 KTLNVTKEQAQEIYQKINDETMVST---DSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~~~~~deea~~el~ri~~E~~~~~---~~~~~~~~~~~~g~ 502 (502) .++.++| +++|++||++|+.+.. +.++...+.++.+| T Consensus 430 ~~p~v~D--~~~E~eri~~E~~e~~~~~~~~~~~~~~~~dde 469 (470) T protein:vir:10 430 ANPIVDD--WQQELKDLAKDKEENDPYSNQADELNGKGVNDE 469 (470) T ss_pred hCCCCCC--HHHHHHHHHHHHHHHHHhhccccccCCCCCCCC Confidence 7655554 7889999999876654 34445556666666 No 11 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=100.00 E-value=1.6e-66 Score=381.24 Aligned_cols=461 Identities=13% Similarity=0.083 Sum_probs=314.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC---------CCccccccce Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS---------NGSQVKRDFN 71 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~---------~~~~~~~~~~ 71 (502) =+-+..++.+++.....+.......|.+. ++..+..++....+||.|+|+++..... ......++|+ T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~----i~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~~~ri 84 (503) T protein:vir:59 9 KTHTEELNEIIVESAKEIAEPDTTMIQKL----IDEHNPEPLLKGVRYYMCENDIEKKRRTYYDAAGQQLVDDTKTNNRT 84 (503) T ss_pred hhhHHhHHHhhhhhhhhccchhHHHHHHH----HHhhcHHHHHHHHHHhccccchhhccchhccccccccccccccccee Confidence 11222333333332222222222222211 1223457899999999999987654321 1223446789 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++||+++||++.|+|+||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|+ |++++.+++|. T Consensus 85 ~~n~~~~ivd~~~~yl~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~ 163 (503) T protein:vir:59 85 SHAWHKLFVDQKTQYLVGEPVTFTSDNKTLLEYVNELAD-DDFDDILNETVKNMSNKGIEYWHPFVDEEGEFDYVIFPAE 163 (503) T ss_pred ecchHHHHHHHHHhhhhcCCeeeccCcHHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEeecCCCceEEEEEccc Confidence 999999999999999999999999999999999999886 7899999999999999999999999985 68999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeec Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLN 230 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 230 (502) +++|+|.+.......+++ +++......+..++++|.|+.+...++.. .. +...++...+.......+......+ T Consensus 164 ~~~~i~d~~~~~~~~~~i-r~~~~~~~~~~~~~~~evy~~~~i~~~~~----~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (503) T protein:vir:59 164 EMIVVYKDNTRRDILFAL-RYYSYKGIMGEETQKAELYTDTHVYYYEK----ID-GVYQMDYSYGENNPRPHMTKGGQAI 237 (503) T ss_pred eeEEEEeCCCCCceEEEE-EEEEEecCCCceEEEEEEEeCCcEEEEEE----cC-Ccccccccccccccccceeecceec Confidence 999998766444334444 44444444555677889887544433221 11 1122222221111111222334567 Q ss_pred CCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccc Q lcl|NC_012753. 231 GLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKRE 310 (502) Q Consensus 231 ~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 310 (502) +++++|+++|++| ++|+|+|+++++|||+||.++|+++++++....+++| +.... +... .. T Consensus 238 ~~~~vPiv~~~nn---------~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v----~~g~~---~~~~---~~ 298 (503) T protein:vir:59 238 GWGRVPIIPFKNN---------EEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYV----LKNYD---GENP---KE 298 (503) T ss_pred cCCccceEEecCC---------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeE----eecCC---cccc---ch Confidence 8999999999764 4689999999999999999999999999999999888 33211 1110 11 Q ss_pred ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhc---CCChhhccccccccccHHHHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQL---GVSTGMFSFDGKSMKTATEVVSEQSDTY 387 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~---g~s~~~~~~~~~~~~tAtei~~~~~~l~ 387 (502) +......+..+...++ ..++.++.+++.+++...++.+.+.|...+ +++++.+ +|+.||+|++++++.+. T Consensus 299 ~~~~~~~~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~----~~~~Sg~Ai~~~~~~l~ 371 (503) T protein:vir:59 299 FTANLRYHSVIKVSGD---GGVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETI----GGGATGPALENLYALLD 371 (503) T ss_pred hhhhhhcccceeccCC---CcceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccc----cccccHHHHHHHHHHHH Confidence 1112222222332222 235566677777777766666666554444 4444433 35678999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCC Q lcl|NC_012753. 388 QMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLN 467 (502) Q Consensus 388 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~ 467 (502) ++|+.+++.|+.+|++++++|+.++... ..+.......++|.|++++|.|..++++++++++++|+||.||++..+++ T Consensus 372 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~--~~~~~~~~~~i~i~f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~l~~ 449 (503) T protein:vir:59 372 LKANMAERKIRAGLRLFFWFFAEYLRNT--GKGDFNPDKELTMTFTRTRIQNDSEIVQSLVQGVTGGIMSKETAVARNPF 449 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc--cCcccccccceeEEeCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHhCCC Confidence 9999999999999999999999887642 22333344569999999999999999999999999999999999988766 Q ss_pred CCHHHHHHHHHHHHHhhhcccC---CCCCccccCCCCC Q lcl|NC_012753. 468 VTKEQAQEIYQKINDETMVSTD---SFRTSEEVDIYGE 502 (502) Q Consensus 468 ~~deea~~el~ri~~E~~~~~~---~~~~~~~~~~~g~ 502 (502) +++ +++|++||++|+..... ...+...+.--++ T Consensus 450 v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 485 (503) T protein:vir:59 450 VQD--PEEELARIEEEMNQYAEMQGNLLDDEGGDDDLE 485 (503) T ss_pred CCC--HHHHHHHHHHHHHHHHhhhccccCccCCCCCCC Confidence 654 78899999988754322 1112222222222 No 12 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=100.00 E-value=1.2e-65 Score=376.50 Aligned_cols=452 Identities=9% Similarity=-0.006 Sum_probs=312.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC-CCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS-NGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~~~~~n~~k~i 79 (502) +.-++.|+.+|.+. ...+..+|.++++||.|+|+++..... ......++|+++|||++| T Consensus 39 ~~~~~~i~~~i~~~--------------------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:96 39 LQNVNEVSKYIEHH--------------------MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hccHHHHHHHHHHH--------------------HHhhHHHHHHHHHHhcccCccccccCcCcccccCcceeecchHHHH Confidence 11223333333321 123456899999999999998765432 334456789999999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQAN 158 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d 158 (502) |++.++||||+||+++++++..++.|+++++.|+|...+.++++.++++|.+|.++|+|+ |.+++.+++|.+++|+|.+ T Consensus 99 v~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vydd 178 (511) T protein:vir:96 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDN 178 (511) T ss_pred HHHHHhhhccCCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999985 6899999999999999877 Q ss_pred CCCeEEEEEEEEEEEee--CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcce Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTE--GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPL 236 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~--~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~ 236 (502) +......++++.+.... +.......++|.|+.+. |.+ |...+.. ..++. ........++++.+| T Consensus 179 ~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~----i~~--~~~~~~~----~~~~~----~~~~~~~~~~~~~vP 244 (511) T protein:vir:96 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG----VYR--YLTSRTN----GLKLT----PRENGFESHSFERMP 244 (511) T ss_pred CCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc----EEE--EEecCCC----ccccc----ccccccccccCCcee Confidence 66555555554433322 22222333455554222 111 2221111 11110 111223457788999 Q ss_pred EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccch Q lcl|NC_012753. 237 FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 237 f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~ 316 (502) +++|+++ +.|+|+|+++++|||+||.++|++++.++....+++|..++...... .+......+.+..... T Consensus 245 vv~~~nn---------~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~-~~~~~~~~~~~~~~~~ 314 (511) T protein:vir:96 245 ITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPV-EVRKQKEANVLFLEPT 314 (511) T ss_pred eEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccCCch-hhcccccccceecccc Confidence 9999764 35899999999999999999999999999988998885543322211 1111111111211111 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) ..........+.+..++.++.+++++.+...++.+.+.|...++.+.-+++. .+|+.||.|++++++.+.++|+.+++. T Consensus 315 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~-~~~n~Sg~Al~~~~~~l~~k~~~k~~~ 393 (511) T protein:vir:96 315 VYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLFGLEQRTKTKEGL 393 (511) T ss_pred cccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHH Confidence 1111222222334457777888888888877777777777766655433322 235679999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEI 476 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~e 476 (502) |+.+|++++++|+.+....... .....+..+++.|++++|.|..+++++++++ +|+||.+|++..+++++| +++| T Consensus 394 ~~~~l~~~~~li~~~~~~~~~~-~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~G~iS~et~l~~l~~v~D--~~~E 468 (511) T protein:vir:96 394 FTKGLRRRAKLLETILKNTWSI-DANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD--PELE 468 (511) T ss_pred HHHHHHHHHHHHHHHHHhhcCc-ccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHH Confidence 9999999999999876543211 1123455799999999999999999999887 699999999987765654 7889 Q ss_pred HHHHHHhhhcccCCCCC----ccccCCCCC Q lcl|NC_012753. 477 YQKINDETMVSTDSFRT----SEEVDIYGE 502 (502) Q Consensus 477 l~ri~~E~~~~~~~~~~----~~~~~~~g~ 502 (502) ++||++|+......... ..+..--|| T Consensus 469 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:96 469 VKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred HHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 99999997654322211 111112222 No 13 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=100.00 E-value=7e-65 Score=372.31 Aligned_cols=452 Identities=9% Similarity=-0.002 Sum_probs=309.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC-CCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS-NGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~~~~~n~~k~i 79 (502) +.-.+.|+.+|.+. ...+..+|+++++||.|+|+++..... ......++|+++|||++| T Consensus 39 ~~~~~~i~~~i~~~--------------------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:99 39 LQNVNEVSKYIEHH--------------------MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hccHHHHHHHHHHH--------------------HHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHH Confidence 11222333333321 123456889999999999998765432 344556789999999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQAN 158 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d 158 (502) |++.++||+|+|++++++++.+++.|++++++|+|...+.++++.|+++|.+|+++|+|+ |.+++.+++|.++||++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~a~~~vy~ded~~~~i~~~~p~~~~~vyd~ 178 (511) T protein:vir:99 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFVIYDN 178 (511) T ss_pred HHHHHhhhcccCceeecCchHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999985 6899999999999999877 Q ss_pred CCCeEEEEEEEEEEEe--eCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcce Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKT--EGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPL 236 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~--~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~ 236 (502) +......++++.+... ++.......++|.|+.+. |.+ |...+.. ...+ .........++++.+| T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~----i~~--~~~~~~~----~~~~----~~~~~~~~~~~~g~vP 244 (511) T protein:vir:99 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG----VYR--YLTSRTN----GLKL----TPRENGFESHSFERMP 244 (511) T ss_pred CCCCceEEEEEEEEeeecccCccceEEEEEEEeCCc----EEE--EEecCCc----cccc----cccccccccCCCCccc Confidence 6544444554433322 222222334456654322 111 2211111 0000 0111223457889999 Q ss_pred EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccch Q lcl|NC_012753. 237 FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 237 f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~ 316 (502) +++|+++ +.|+|+|+++++|||+||.++|++++.++....++++-.++...... ........+.+..... T Consensus 245 vv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~-~~~~~~~~~~~~~~~~ 314 (511) T protein:vir:99 245 ITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPV-EVRKQKEANVLFLEPT 314 (511) T ss_pred eEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCch-hhcccccccceecccc Confidence 9999764 46899999999999999999999999999888887773332211111 1111111112222222 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) .+......+.+.+..++.++.+++++.+.+.++.+.+.|...++.+.-+++. .+|+.||.|++++++.+.++|+.+++. T Consensus 315 ~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~-~~gn~Sg~Alk~~~~~l~~ka~~k~~~ 393 (511) T protein:vir:99 315 VYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLFGLEQRTKTKEGL 393 (511) T ss_pred cccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHH Confidence 2222222222334456777778888877777777777776666555433221 235679999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEI 476 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~e 476 (502) |+.+|++++++|+.++...+-.. ....+..++|.|++++|.|..+++++++++ +|++|.||++..+++++| +++| T Consensus 394 ~~~~l~~~~~li~~~~~~~~~~~-~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl--~GiiS~et~l~~l~~v~D--~~~E 468 (511) T protein:vir:99 394 FTKGLRRRAKLLETILKNTRSID-VSKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFSFFQD--PELE 468 (511) T ss_pred HHHHHHHHHHHHHHHHHhcCCcc-cccccccceEEeCCCCCcCHHHHHHHHHHH--hccCCHHHHHHhCCCCCC--HHHH Confidence 99999999999998876532111 223445789999999999999999999987 499999999988766665 7889 Q ss_pred HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++||++|+.............+..+. T Consensus 469 ~~ri~~E~~~~~~~~~~~~~~~~~~~ 494 (511) T protein:vir:99 469 VKKIEEDEKESIKKAQKNMYQDPRNI 494 (511) T ss_pred HHHHHHHHHHHHHHHhhcccccCCCC Confidence 99999987654322222211111111 No 14 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=100.00 E-value=4e-64 Score=368.13 Aligned_cols=452 Identities=10% Similarity=-0.010 Sum_probs=311.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC-CCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS-NGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~~~~~n~~k~i 79 (502) |.-++.|+++|.+. ......+|+++++||.|+|+++..... ..+...++++++|||++| T Consensus 39 ~~~~~~i~~~i~~~--------------------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:10 39 LQNVNEVSKCIEHH--------------------MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred ccCHHHHHHHHHHH--------------------HHhhHHHHHHHHHHhcccCccccccCcccccccCcceeecchHHHH Confidence 22223333333221 123456889999999999998765432 334456789999999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQAN 158 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d 158 (502) |+..++||||+|++++++++..++.|++++++|+|...+.++++.++++|.+|.++|+|+ |.+++.+++|.+++|++.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~dedg~~~i~~~~p~~~~~vydd 178 (511) T protein:vir:10 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYEIMIRNQDDETRLYKSDAMSTFVIYDN 178 (511) T ss_pred HHHHhhhhcccCceeecCchHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEccceeEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999986 6899999999999999877 Q ss_pred CCCeEEEEEEEEEEEe--eCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcce Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKT--EGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPL 236 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~--~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~ 236 (502) +......++++.+... ++.......++|.|+.+.... |...+. + ..... ........++++.+| T Consensus 179 ~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~------~~~~~~---~-~~~~~----~~~~~~~~~~~~~vP 244 (511) T protein:vir:10 179 TIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR------YLTSRT---N-GLKLT----PRENGFESHSFERMP 244 (511) T ss_pred CCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEE------EEecCC---C-ccccc----ccccccccccCccee Confidence 6654445555433322 222223344456654222111 121111 1 01000 111223347788999 Q ss_pred EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccch Q lcl|NC_012753. 237 FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 237 f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~ 316 (502) +++|+++ ..|+|+|+++++|||+||.++|++++.++....+++|..++....... .........+..... T Consensus 245 vv~f~nn---------~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~-~~~~~~~~~~~~~~~ 314 (511) T protein:vir:10 245 ITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVE-VRKQKEANVLFLEPT 314 (511) T ss_pred EEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchh-hccchhccceecccc Confidence 9999764 358999999999999999999999999999888888844432221111 111111111222212 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) .+......+.+.+..++.++.+++++.+...++.+.+.|...++.+.-+++. .+|+.||.|++++++.+.+++..+++. T Consensus 315 ~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~-~~~n~Sg~Al~~~~~~l~~k~~~k~~~ 393 (511) T protein:vir:10 315 VYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLFGLEQRTKTKEGL 393 (511) T ss_pred cccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHHHHHHHHHHHHHHHHHHHH Confidence 2222222222333446677778888888877777777777666655433322 235679999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEI 476 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~e 476 (502) |+.+|++++++|+.+....+.. .....+..++|.|++++|.|..++++++++++ |++|.||+++.++++++ +++| T Consensus 394 f~~~l~~~~~li~~~~~~~~~~-~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~--G~iS~et~~~~l~~v~d--~~~E 468 (511) T protein:vir:10 394 FTKGLRRRAKLLETILKNTRSI-DANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELE 468 (511) T ss_pred HHHHHHHHHHHHHHHHHhhCCc-ccccccceeeEEeCCCCCcCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCC--HHHH Confidence 9999999999998876543211 12234567999999999999999999999985 99999999988766665 6789 Q ss_pred HHHHHHhhhcccCCCCCccccCCC----CC Q lcl|NC_012753. 477 YQKINDETMVSTDSFRTSEEVDIY----GE 502 (502) Q Consensus 477 l~ri~~E~~~~~~~~~~~~~~~~~----g~ 502 (502) ++||++|+.............+-. |+ T Consensus 469 ~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:10 469 VKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred HHHHHHHHHHHHHHHhhhcccCCCCCCCCC Confidence 999999876543222211111111 22 No 15 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=100.00 E-value=3.9e-64 Score=368.21 Aligned_cols=452 Identities=10% Similarity=0.002 Sum_probs=311.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc-CCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD-SNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~-~~~~~~~~~~~~~n~~k~i 79 (502) |..++.|+.+|.+. ...+..+|.++++||.|+|+++.... ...+...++|+++|||+.| T Consensus 39 ~~~~~~i~~~i~~~--------------------~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:78 39 LQNVNEVSKYIEHH--------------------MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hcCHHHHHHHHHHH--------------------HHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHH Confidence 22333333333321 12345688899999999999876543 2344556789999999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQAN 158 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d 158 (502) |+..++||||+|++++++++.+++.|+++++.|+|...+.++++.++++|.+|.++|+|+ |.+++.+++|.+++|+|.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd 178 (511) T protein:vir:78 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDN 178 (511) T ss_pred HHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999985 6799999999999999877 Q ss_pred CCCeEEEEEEEEEEEe--eCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcce Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKT--EGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPL 236 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~--~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~ 236 (502) +......++++.+... ++.......++|.|+.+. +.+ |...+.. ..++. ........++++.+| T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~----i~~--~~~~~~~----~~~~~----~~~~~~~~~~~g~vP 244 (511) T protein:vir:78 179 TVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG----VYR--YLTNRTN----GLKLT----PRENSFESHSFERMP 244 (511) T ss_pred CCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc----EEE--EEecCCC----ccccc----ccccccccCcCcccc Confidence 6554444444333221 222223344566655322 111 2211111 11111 111223446788999 Q ss_pred EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccch Q lcl|NC_012753. 237 FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 237 f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~ 316 (502) +++|+++ +.|+|+|+++++|||+||.++|++++.++....+++|-.++...... ........+.+..... T Consensus 245 vv~~~n~---------~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~-~~~~~~~~~~~~~~~~ 314 (511) T protein:vir:78 245 ITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPV-EVRKQKEANVLFLEPT 314 (511) T ss_pred eEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCch-hhcccccccceecccc Confidence 9999764 46899999999999999999999999999888887773322111111 0111111111222222 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) .+......+.+.+..++.++.+++++++...++.+.+.|...++.+.-+++.. +|+.||.|++++++.+.++|..+++. T Consensus 315 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~-~~n~Sg~Al~~~~~~l~~ka~~~~~~ 393 (511) T protein:vir:78 315 VYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-SGTQSGEAMKYKLFGLEQRTKTKEGL 393 (511) T ss_pred ceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHH Confidence 22222222333344567778888888888888887777777666554333222 35679999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEI 476 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~e 476 (502) |+.+|++++++|+.+....+- ......+..++|.|++++|.|..++++++++++ |++|.+|++..+++++| +++| T Consensus 394 f~~~l~~~~~li~~~~~~~~~-~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d--~~~E 468 (511) T protein:vir:78 394 FTKGLRRRAKLLETILKNTRS-IDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELE 468 (511) T ss_pred HHHHHHHHHHHHHHHHHhcCC-CccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHH Confidence 999999999999887654321 112334567899999999999999999999984 99999999988765554 7899 Q ss_pred HHHHHHhhhcccCCCCCccccCCCC----C Q lcl|NC_012753. 477 YQKINDETMVSTDSFRTSEEVDIYG----E 502 (502) Q Consensus 477 l~ri~~E~~~~~~~~~~~~~~~~~g----~ 502 (502) ++||++|+...........+.+..| | T Consensus 469 l~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:78 469 VKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred HHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 9999999765433222222222222 2 No 16 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=100.00 E-value=3.9e-64 Score=368.21 Aligned_cols=452 Identities=10% Similarity=0.002 Sum_probs=311.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc-CCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD-SNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~-~~~~~~~~~~~~~n~~k~i 79 (502) |..++.|+.+|.+. ...+..+|.++++||.|+|+++.... ...+...++|+++|||+.| T Consensus 39 ~~~~~~i~~~i~~~--------------------~~~~~~r~~~l~~Yy~g~~~il~~~~~~~~~~~~~~ki~~n~~k~I 98 (511) T protein:vir:96 39 LQNVNEVSKYIEHH--------------------MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRVAHDYASYI 98 (511) T ss_pred hcCHHHHHHHHHHH--------------------HHhhhHHHHHHHHHhhccCccccccCcccccccCcceeecchHHHH Confidence 22333333333321 12345688899999999999876543 2344556789999999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQAN 158 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d 158 (502) |+..++||||+|++++++++.+++.|+++++.|+|...+.++++.++++|.+|.++|+|+ |.+++.+++|.+++|+|.+ T Consensus 99 v~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~d~dg~~~i~~~~p~~~~~v~dd 178 (511) T protein:vir:96 99 SDFINGYFLGNPIQYQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAMSTFIIYDN 178 (511) T ss_pred HHHHhhhhcccCceeecCchHHHHHHHHHHhhcChhHHHHHHHHHHHhcCeeEEEEEeCCCCceEEEEEcccceEEEEcC Confidence 999999999999999999999999999999999999999999999999999999999985 6799999999999999877 Q ss_pred CCCeEEEEEEEEEEEe--eCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcce Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKT--EGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPL 236 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~--~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~ 236 (502) +......++++.+... ++.......++|.|+.+. +.+ |...+.. ..++. ........++++.+| T Consensus 179 ~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~----i~~--~~~~~~~----~~~~~----~~~~~~~~~~~g~vP 244 (511) T protein:vir:96 179 TVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG----VYR--YLTNRTN----GLKLT----PRENSFESHSFERMP 244 (511) T ss_pred CCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc----EEE--EEecCCC----ccccc----ccccccccCcCcccc Confidence 6554444444333221 222223344566655322 111 2211111 11111 111223446788999 Q ss_pred EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccch Q lcl|NC_012753. 237 FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 237 f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~ 316 (502) +++|+++ +.|+|+|+++++|||+||.++|++++.++....+++|-.++...... ........+.+..... T Consensus 245 vv~~~n~---------~~g~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~~-~~~~~~~~~~~~~~~~ 314 (511) T protein:vir:96 245 ITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPV-EVRKQKEANVLFLEPT 314 (511) T ss_pred eEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCCch-hhcccccccceecccc Confidence 9999764 46899999999999999999999999999888887773322111111 0111111111222222 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) .+......+.+.+..++.++.+++++++...++.+.+.|...++.+.-+++.. +|+.||.|++++++.+.++|..+++. T Consensus 315 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~-~~n~Sg~Al~~~~~~l~~ka~~~~~~ 393 (511) T protein:vir:96 315 VYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNF-SGTQSGEAMKYKLFGLEQRTKTKEGL 393 (511) T ss_pred ceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-ccccHHHHHHHHHHHHHHHHHHHHHH Confidence 22222222333344567778888888888888887777777666554333222 35679999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEI 476 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~e 476 (502) |+.+|++++++|+.+....+- ......+..++|.|++++|.|..++++++++++ |++|.+|++..+++++| +++| T Consensus 394 f~~~l~~~~~li~~~~~~~~~-~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~--G~iS~et~l~~l~~v~d--~~~E 468 (511) T protein:vir:96 394 FTKGLRRRAKLLETILKNTRS-IDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG--GKISQTTLMSLFSFFQD--PELE 468 (511) T ss_pred HHHHHHHHHHHHHHHHHhcCC-CccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCChHHHHHhCCCCCC--HHHH Confidence 999999999999887654321 112334567899999999999999999999984 99999999988765554 7899 Q ss_pred HHHHHHhhhcccCCCCCccccCCCC----C Q lcl|NC_012753. 477 YQKINDETMVSTDSFRTSEEVDIYG----E 502 (502) Q Consensus 477 l~ri~~E~~~~~~~~~~~~~~~~~g----~ 502 (502) ++||++|+...........+.+..| | T Consensus 469 l~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:96 469 VKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred HHHHHHHHHHHHHHHhhccccCCCCCCCCC Confidence 9999999765433222222222222 2 No 17 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=100.00 E-value=2.2e-63 Score=364.12 Aligned_cols=451 Identities=11% Similarity=0.033 Sum_probs=324.7 Q ss_pred CCh-hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGI-IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~-~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) |++ .+.|+++|.+. ..++..+|+.+++||.|+|+++...........++|+++|||++| T Consensus 13 ~~~~~~~~~~~i~~~--------------------~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~i 72 (489) T protein:vir:99 13 SKLWIDQLKNYISRF--------------------KAEQLERLKELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYI 72 (489) T ss_pred CCCCHHHHHHHHHHH--------------------HHHHHHHHHHHHHHhcccCccccccccccccCCcceeecchHHHH Confidence 444 34566665542 124567899999999999988776554455556778999999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe-----CCceEEEEEcCCeEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYID-----GDQIRVSFVQATVFFP 154 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d-----~~~~~i~~v~~~~~~P 154 (502) |++.|+||||+|++++++++..+++|++++++|+|...+.++++.++++|.+|..+|+. ++.+++.+++|.+++| T Consensus 73 v~~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~~d~~~~~~i~~~~p~~~~~ 152 (489) T protein:vir:99 73 TVFEQGYMLGVPVEYKNENKDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKIDDKKTEVKLYQLPAEQTFV 152 (489) T ss_pred HHHHhhhhccCCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccCcCCCcceEEEEEcccceEE Confidence 99999999999999999999999999999999999999999999999999999999973 3579999999999999 Q ss_pred EEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 155 LQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 155 i~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) +|.+.......+++ +++..+...+..+.+++.|+.+. .|+ |+..+....+.. ......+++++ T Consensus 153 v~dd~~~~~~~~~i-~~~~~~~~~~~~~~~~~~y~~~~-i~~-----~~~~~~~~~~~~----------~~~~~~~~~g~ 215 (489) T protein:vir:99 153 IYDDTYQRNSLMAV-HFYDIDYGSGKRKQIIKAYTSDT-IYT-----YEDYNLETKGMR----------LKDYEGHFFKG 215 (489) T ss_pred EEcCCCCCceEEEE-EEEEEecCCCceEEEEEEEeCCc-EEE-----EEecCCCcccce----------ecccccccCCc Confidence 98765544344444 34444444444555666664221 111 222221111111 01123467889 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccc-- Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFE-- 312 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~-- 312 (502) +|+++|+++ +.|.|+|+++++|+|+||.++|+++++++....++++-.++..... ...........+ T Consensus 216 vPvv~~~n~---------~~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~--~~~~~~~~~~~~~~ 284 (489) T protein:vir:99 216 VPVNEYANN---------EERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGA--DENDYLDDGRLNPN 284 (489) T ss_pred eeEEEeecC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccc--cchhhhhhcccccc Confidence 999999764 3589999999999999999999999999887777766322211110 000000000000 Q ss_pred ------ccchhhccccCCC----CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHH Q lcl|NC_012753. 313 ------TGHNVYEQFDSGD----MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSE 382 (502) Q Consensus 313 ------~~~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~ 382 (502) ........+...+ .+.+..++.++.+++.+.+.+.++.+.+.|...++.+.-.+ ...+|+.||.+++++ T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~~~n~Sg~Al~~~ 363 (489) T protein:vir:99 285 GRLAISIGFKKAQVLILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQD-MKFSGVQSGESMKYK 363 (489) T ss_pred cccccccccccceeeeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCccccc-ccccccchHHHHHHH Confidence 0000011111111 11223456677788888888888888888877776653222 122466799999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 383 QSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAI 462 (502) Q Consensus 383 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 462 (502) ++.+.++|..+++.|+.+|++++++|+.+....+........+.+++|+|++++|.|..+.++++++++ |+||+||++ T Consensus 364 ~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~--giis~et~~ 441 (489) T protein:vir:99 364 LMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY--GIVSDQTIF 441 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHH Confidence 999999999999999999999999999887643322222334567999999999999999999999874 999999999 Q ss_pred HhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 463 EKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 463 ~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +.+++++++++++|++||++|+.......+....++..|+ T Consensus 442 ~~l~~v~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 481 (489) T protein:vir:99 442 EILNTVTGVDAEAELKRLKEEADKKQSLPEPRLVGDASGQ 481 (489) T ss_pred HhcCCCCchhHHHHHHHHHHHHHHHhccccccccCCCCCC Confidence 9998998888999999999998887777777778888888 No 18 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=100.00 E-value=1e-63 Score=365.91 Aligned_cols=451 Identities=9% Similarity=-0.022 Sum_probs=306.8 Q ss_pred CCh--------hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC-CCccccccce Q lcl|NC_012753. 1 MGI--------IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS-NGSQVKRDFN 71 (502) Q Consensus 1 m~~--------~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~~~ 71 (502) |+- ++.|+.+|.+. ...+..+|.++.+||.|+|+++..... ..+...++|+ T Consensus 31 ~~~~e~~~~~~~~~i~~~i~~~--------------------~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~ki 90 (512) T protein:vir:97 31 YDGTESDLLQNINEVSKYIEHH--------------------MDYQRPRLKVLSDYYEGKTKNLVELTRRKEEYMADNRV 90 (512) T ss_pred cCchhhhhhhhHHHHHHHHHHH--------------------HHhhHHHHHHHHHHhcccCccccccCcccccccCccee Confidence 321 22333333321 113456889999999999998765432 3345567899 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++|||++||++.++||+|+|++++++++.+++.|++++++|+|...+.++++.++++|.+|+++|+|+ |.+++.+++|. T Consensus 91 ~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~i~G~ay~~vy~ded~~~~i~~~~p~ 170 (512) T protein:vir:97 91 AHDYASYISDFINGYFLGNPIQCQDDDKDVLEAIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQDDETRLYKSDAM 170 (512) T ss_pred ecchHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHhhcCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccc Confidence 99999999999999999999999999999999999999999999999999999999999999999985 68999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEE--eeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTK--TEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~--~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) +++|+|.++......++++.+.. .++.......++|.|+.+. |.+ |...+... ..+. ....... T Consensus 171 ~~~~iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~----i~~--~~~~~~~~----~~~~----~~~~~~~ 236 (512) T protein:vir:97 171 STFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHG----VYR--YLTSRTNG----LKLT----PRENGFE 236 (512) T ss_pred ceEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCc----EEE--EEecCCCc----cccc----ccccccc Confidence 99999876654444444433322 1222223344556654322 111 22211110 0000 1112234 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) .++++.+|+++|+++ +.|.|+|+++++|||+||.++|++++.++....+++|-.++...... .+...... T Consensus 237 ~~~~g~vPvv~~~nn---------~~~~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~-~~~~~~~~ 306 (512) T protein:vir:97 237 SHSFERMPITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPV-EVRKQKEA 306 (512) T ss_pred cccCcccceEeecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCch-hhhhhhhc Confidence 578899999999764 46899999999999999999999999999988888883332211111 11111111 Q ss_pred cccccc--chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHH Q lcl|NC_012753. 309 REFETG--HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDT 386 (502) Q Consensus 309 ~~~~~~--~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l 386 (502) ..+... ......... +.+.+..++.++.++..+.+...++.+.+.|...++.+.-+++. .+|+.||.|++++++.+ T Consensus 307 ~~~~~~~~~~~~~~~~~-~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~-~~gn~Sg~Al~~~~~~l 384 (512) T protein:vir:97 307 NVLFLEPTVYENRDTGI-ETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGEAMKYKLFGL 384 (512) T ss_pred ccccccccchhhccccc-CCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccc-ccccchHHHHHHHHHHH Confidence 111111 111111111 11223335667777777777777777777666666555433322 12567999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Q lcl|NC_012753. 387 YQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTL 466 (502) Q Consensus 387 ~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~ 466 (502) .++++.+++.|+.+|++++++|+.++...+... ....+..++++|++++|.|..+++++++++ +|++|.||++..++ T Consensus 385 ~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~-~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl--~giiS~et~~~~l~ 461 (512) T protein:vir:97 385 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSID-ANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKISQTTLMSLFS 461 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcc-cccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCC Confidence 999999999999999999999998865432111 223455799999999999999999999987 49999999998876 Q ss_pred CCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 467 NVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 467 ~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++++ +++|++||++|+.............+-.|. T Consensus 462 ~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~ 495 (512) T protein:vir:97 462 FFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDI 495 (512) T ss_pred CCCC--HHHHHHHHHHHHHHHHHHHhhcccCCCCCC Confidence 6665 678999999997654433222222222222 No 19 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=100.00 E-value=1e-63 Score=365.90 Aligned_cols=468 Identities=10% Similarity=-0.002 Sum_probs=308.1 Q ss_pred CC-------hhHHHHHHHHHHhhccccc------------chhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC Q lcl|NC_012753. 1 MG-------IIQTIKNFIKRSNYVITNQ------------SLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS 61 (502) Q Consensus 1 m~-------~~~~ik~~i~~~~~~~~~~------------~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~ 61 (502) .+ +-..++..+++-....+.+ .+.+++.+ ....+..+|.++++||.|+|+++..... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~e~~~~~~~~~i~~~i~~----~~~~~~~r~~~l~~Yy~g~~~il~~~~~ 79 (511) T protein:vir:93 4 VNEFETDTDLRGNINYLFNDEANVVYTYDGTESDLLQNVNEVSKYIEH----HMDYQRPRLKVLSDYYEGKTKNLVELTR 79 (511) T ss_pred ccchhhhhhhhhhhhhhhhhhhCCcccccchhhhhhccHHHHHHHHHH----HHHhhHHHHHHHHHHhcccCccccccCc Confidence 00 0011222222111110000 01111100 0124457899999999999998765432 Q ss_pred -CCccccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC- Q lcl|NC_012753. 62 -NGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG- 139 (502) Q Consensus 62 -~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~- 139 (502) ......++|+++|||++||+..++||+|+|++++++++..++.|+++++.|+|...+.++++.|+++|.+|+++|+|+ T Consensus 80 ~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~ay~~vy~de~ 159 (511) T protein:vir:93 80 RKEEYMADNRVAHDYASYISDFINGYFLGNPIQYQDDDKDVLEVIEAFNDLNDVESHNRSLGLDLSIYGKAYELMIRNQD 159 (511) T ss_pred CcccccCcceeecchHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHhhcCHhHHHHHHHHHHHhcCeeEEEEEeCCC Confidence 334456789999999999999999999999999999999999999999999999999999999999999999999985 Q ss_pred CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe--eCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecc Q lcl|NC_012753. 140 DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKT--EGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLS 217 (502) Q Consensus 140 ~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~--~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~ 217 (502) +.+++.+++|.+++|+|.++......++++.+... ++.......++|.|+.+.... |...+.. ...+. T Consensus 160 ~~~~i~~~~p~~~~~vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~------~~~~~~~----~~~~~ 229 (511) T protein:vir:93 160 DETRLYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYR------YLTSRTN----GLKLT 229 (511) T ss_pred CceEEEEEccceeEEEEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEE------EEecCCC----ccccc Confidence 67999999999999998776544444444333222 222223344566654322111 2221111 00000 Q ss_pred ccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccC Q lcl|NC_012753. 218 TLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTE 297 (502) Q Consensus 218 ~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~ 297 (502) ........++++.+|+++|+++ +.|.|+|+++++|||+||.++|++++.++....+++|...+.... T Consensus 230 ----~~~~~~~~~~~g~vPvv~~~nn---------~~g~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~ 296 (511) T protein:vir:93 230 ----PRENGFESHSFERMPITEFSNN---------ERRKGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD 296 (511) T ss_pred ----cccccccccCCCccceEEecCC---------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccC Confidence 1112234578899999999763 468999999999999999999999999998888888743332211 Q ss_pred CCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHH Q lcl|NC_012753. 298 YDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT 377 (502) Q Consensus 298 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt 377 (502) .. ..........+......+......+.+.+..++.++.++..+.+...++.+.+.|...++.+.-+++. .+|+.||. T Consensus 297 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~-~~~n~Sg~ 374 (511) T protein:vir:93 297 PV-EVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN-FSGTQSGE 374 (511) T ss_pred ch-hhcccccccceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHH Confidence 11 11111111111111111111222222334456667777777777777777777776666655433322 23567999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 378 EVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 378 ei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) |++++++.+.++|+.+++.|+.+|++++++|+.+.....- ......+..+++.|++++|.|..++++++.++ +|+|| T Consensus 375 Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~~~-~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl--~g~iS 451 (511) T protein:vir:93 375 AMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTWS-IDANKDFNTVRYVYNRNLPKSLIEELKAYIDS--GGKIS 451 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-cccccccccceEEeCCCCCCCHHHHHHHHHHH--hccCc Confidence 9999999999999999999999999999999987654221 11123445789999999999999999999987 59999 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCC----C Q lcl|NC_012753. 458 KTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYG----E 502 (502) Q Consensus 458 ~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g----~ 502 (502) .||++..++++++ +++|++||++|+.............+-.| | T Consensus 452 ~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 498 (511) T protein:vir:93 452 QTTLMSLFSFFQD--PELEVKKIEEDEKESIKKAQKGIYKDPRDINDDE 498 (511) T ss_pred hHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHHHhhhcccCCCCCCCCC Confidence 9999988766655 67899999998764432221111111111 1 No 20 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=100.00 E-value=2.2e-63 Score=364.09 Aligned_cols=454 Identities=12% Similarity=0.102 Sum_probs=315.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCC--------Ccccccccee Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSN--------GSQVKRDFNH 72 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~--------~~~~~~~~~~ 72 (502) |+.-.-...+ ....+..+.+.+.+ .++..++.+|+.+++||.|+|+++..+... .....++|++ T Consensus 7 ~~~~~~~~~~-----~~~~~~~~~~~i~~---~~~~~~~~~~~~~~~yy~g~~~i~~~~~~~~~~~~~~~~~~~~~~ki~ 78 (479) T protein:vir:79 7 SETDLIKVQL-----KKESTINLVKVIEH---YILKHRPEKYKQGEEYYYGNTDVNNKRRYYLLDGAKVDDFTKVNNKAI 78 (479) T ss_pred cccceEeecc-----ccCChhHHHHHHHH---HHhhhhHHHHHHHHHHhccCCcccccccccccccccccccccCcceee Confidence 2211110000 00111111111111 123346788999999999999887654321 1224556899 Q ss_pred cchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCe Q lcl|NC_012753. 73 LPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATV 151 (502) Q Consensus 73 ~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~ 151 (502) +||+++||++.|+||||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|.++|+|. |.+++.+++|.+ T Consensus 79 ~~~~~~Ivd~~~~~l~g~p~~~~~~~~~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~ 157 (479) T protein:vir:79 79 NNYHKLLVDQKVGYSVGNPIVFNADDDNLTKLLNDLLG-EEFDDTITELYLNASNKGVEWLHPYINRKGEFKYVIIPAEE 157 (479) T ss_pred cchHHHHHHHHHhhhhcCCceeccCCHHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEccce Confidence 99999999999999999999999999999999988875 7999999999999999999999999985 579999999999 Q ss_pred EEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecc------cccc-CCC Q lcl|NC_012753. 152 FFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLS------TLYE-DLE 224 (502) Q Consensus 152 ~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~------~~~~-~l~ 224 (502) ++|+|.+.......++++. +......+....++|.|+.+...+++. .+ ......+... .... ... T Consensus 158 ~~~v~d~~~~~~~~~~ir~-y~~~~~~~~~~~~~e~y~~~~i~~~~~----~~---~~~~~~~~~~~~~~~~~~~~~~~~ 229 (479) T protein:vir:79 158 AIPIWDSKRQRELVAFIRF-YYIEDIDGNKIKRVEYYTENDITYFIE----RG---NSFIQEFLYDEYGKMTDIQEGHFR 229 (479) T ss_pred eEEEEeCCCCCceEEEEEE-EEEeecCCceEEEEEEEeCCcEEEEEe----cC---Cccccccccccccccccccccccc Confidence 9999766554444444433 333333334456688887544443321 11 1111111110 0000 111 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~ 304 (502) .....++++++|+++|+++ ++|+|+|+++++|||+||.++|++++.++....+++| +........ T Consensus 230 ~~~~~~~~~~vPvv~~~nn---------~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v----~~g~~~~~~-- 294 (479) T protein:vir:79 230 INNKEQGWGKVPFIPFKNN---------EKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYV----LKEYPGTSL-- 294 (479) T ss_pred ccccccCCCcccEEEecCC---------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceee----eecCCcccc-- Confidence 2234578899999999764 4689999999999999999999999999998888887 332211111 Q ss_pred cCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 305 VTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) ..+.........+...++ ..++.++.++..+.+.+.++.+.+.|...++.+. +++...|+.||+|++++++ T Consensus 295 ----~~~~~~~~~~~~i~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~gn~Sg~Ai~~~~~ 365 (479) T protein:vir:79 295 ----QEFIDNIRYYKSIKVDGG---GGVDKLEINIPVEAKKELLDRLEKNIIIFGQGVN--PESQNTGDKSGVALKFLYS 365 (479) T ss_pred ----ccchhhhhhccceecCCC---CcceEEeccCCHHHHHHHHHHHHHHHHHHhCccc--cccccccchhHHHHHHHHH Confidence 112222333334443332 2366777888888888888888888877776553 4444557789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) .+.++|+.+++.|+.+|++++++|+.+++. .++......+++|.|++++|.|+.+.+++++++ +|+||.||+++. T Consensus 366 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl--~g~iS~et~l~~ 440 (479) T protein:vir:79 366 LLDLKCSKTEKKFKKAIRELLWFVCEYLKI---SGNKSYDYKTVQITFNHSMIINEAEKIDMAAKS--TGIVSDETIVSN 440 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---cCCCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHh Confidence 999999999999999999999999987654 344455667899999999999999999999887 599999999988 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcccCCCC--CccccCCCCC Q lcl|NC_012753. 465 TLNVTKEQAQEIYQKINDETMVSTDSFR--TSEEVDIYGE 502 (502) Q Consensus 465 ~~~~~deea~~el~ri~~E~~~~~~~~~--~~~~~~~~g~ 502 (502) +++++| +++|++||++|+.+...... ...+.++.-| T Consensus 441 l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~e 478 (479) T protein:vir:79 441 HPWVED--VNDELERLKKQEDTQKEYDDLIPNNQDGVIDE 478 (479) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHHhccCcccCCCcCc Confidence 766665 77899999999765432221 1222222223 No 21 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=100.00 E-value=3.6e-63 Score=362.89 Aligned_cols=440 Identities=12% Similarity=0.060 Sum_probs=300.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcc-ccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSV-TYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~-~~~~~~~~~~~~~~~~~n~~k~i 79 (502) ++.++.|+.+|.+. ......+|+++.+||.|+++++ .+.....+...++|+++|||++| T Consensus 38 ~~~~~~l~~~i~~~--------------------~~~~~~r~~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~I 97 (501) T protein:vir:27 38 VNNWELLKNFINHH--------------------KLRQAPRIQELLDYARGENHDVLQFGRRKDREMADKRAVHNYGRMI 97 (501) T ss_pred cccHHHHHHHHHHH--------------------HHHHHHHHHHHHHHhcCCCccccccCccCccccccceeccchHHHH Confidence 34444455444321 2345678999999999987654 44334445556789999999999 Q ss_pred HHHHhhhhhcCcceEeeCC----HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDN----EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFP 154 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d----~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~P 154 (502) |++.++||||+|+++++++ +..+++|+++++.|+|+..+.++++.|+++|.+|+++|+|+ |.+++.+++|.+++| T Consensus 98 vd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~vy~ded~~~~i~~~~p~~~~~ 177 (501) T protein:vir:27 98 SKFKTGYLAGNPIRVEYDDNDNNSQNDDTIKRIGRINDIDSHNRTLIRDLSQTGRAYEVIYRNEYDETRIKRLNPLETFV 177 (501) T ss_pred HHHHhhhhcccCeeEecCCccchHHHHHHHHHHHHhcChhHHHHHHHHHHhhCCeEEEEEEeCCCCceEEEEEccceeEE Confidence 9999999999999999976 44667889999999999999999999999999999999985 579999999999999 Q ss_pred EEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 155 LQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 155 i~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) +|.++......++++.+.....+.. .+++|.|+.+.. |+ |...+. +.. .....+++++ T Consensus 178 v~d~~~~~~~~~~ir~~~~~~~~~~--~~~~~vyt~~~v-~~-----~~~~~~---~~~-----------~~~~~~~~g~ 235 (501) T protein:vir:27 178 IYDNSLEDNSIAAVRYYNRGTLQNA--KDVVEIYTNEHI-YT-----LDASDD---FNE-----------ISVTTHAFGT 235 (501) T ss_pred EecCCCCCceEEEEEEEEeeecCCc--EEEEEEEeCCeE-EE-----EEeCCc---eee-----------ccccccCCCc Confidence 9876654444455544443333333 334566543221 21 221110 011 1123467889 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) +|+++|+++ +.|+|+|+++++|||+||.++|++++.++....++++..++.....+..+......+.+ T Consensus 236 vPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~~~~~~~~~~~~~~~--- 303 (501) T protein:vir:27 236 VPITEFLNN---------VDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLM--- 303 (501) T ss_pred ccEEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCCcccchhhhhhcCce--- Confidence 999999764 46999999999999999999999999999988888884333211111111111110000 Q ss_pred chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~ 394 (502) .+.........+.+..++.++.++..+.+...++.+.+.|...++.+..+++. .+++.||.+++++++.+.+++..++ T Consensus 304 -~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~Al~~~~~~l~~ka~~~~ 381 (501) T protein:vir:27 304 -QLKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTN-FSGNTSGEALKYKLFGLDQDRVDTQ 381 (501) T ss_pred -eecccccccCCCCCcceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCccc-cccCchHHHHHHHHHHHHHHHHHHH Confidence 00000001111222345666777776666666666666665555544332221 1356789999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQ 474 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~ 474 (502) +.|+.+|++++++|+.+++..+ .+.......++|.|++++|.|..+.+++++++ +|++|++|+++.+++++| ++ T Consensus 382 ~~~~~~l~~~~~li~~~~~~~~--~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl--~g~iS~et~l~~l~~v~D--~~ 455 (501) T protein:vir:27 382 SQFTQGLKRRYRLAARIGSLVN--EFKDFDESLLKITFTPNLPKSLNEQVSILTGL--GGQVSQETALSLSGLVES--PN 455 (501) T ss_pred HHHHHHHHHHHHHHHHHHhhcc--cccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCC--HH Confidence 9999999999999998865432 22233456799999999999999999999987 599999999988876665 77 Q ss_pred HHHHHHHHhhhcccCCCCCccc----cCCCCC Q lcl|NC_012753. 475 EIYQKINDETMVSTDSFRTSEE----VDIYGE 502 (502) Q Consensus 475 ~el~ri~~E~~~~~~~~~~~~~----~~~~g~ 502 (502) +|++||++|+...........- ++...+ T Consensus 456 ~E~eri~~E~~e~~~~~~~~~~~~~~~~~~d~ 487 (501) T protein:vir:27 456 EELDKINKEVSEIDFKGYSNDFNEHVGKYTDE 487 (501) T ss_pred HHHHHHHHHHHhhhHhhhcCccccccccccCC Confidence 8999999987643322211111 111111 No 22 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=100.00 E-value=1.5e-63 Score=365.04 Aligned_cols=446 Identities=12% Similarity=0.107 Sum_probs=309.0 Q ss_pred CChhHHHH-HHHHHHhhcc--cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccccccce Q lcl|NC_012753. 1 MGIIQTIK-NFIKRSNYVI--TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVKRDFN 71 (502) Q Consensus 1 m~~~~~ik-~~i~~~~~~~--~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~ 71 (502) ||+=..+. ..++.+.... ..+.|.+ .+....++..+|..+++||.|+|+++..... ......++++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~-----~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:94 7 MPWDKPYGEEVVEQLKPQFETQEEMIVR-----LIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRI 81 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHH-----HHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCccee Confidence 33222111 1122111110 0011111 1122345678999999999999988754321 1233456789 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++|||++||++.|+||||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|+ +.+++.+++|. T Consensus 82 ~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~ 160 (474) T protein:vir:94 82 TTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAE 160 (474) T ss_pred ecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEccc Confidence 999999999999999999999999999999999999986 7899999999999999999999999986 57999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeec Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLN 230 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 230 (502) +++|+|.++......++++.+. .++. . .+|.|+.+...+ |...+. ......... -.........+ T Consensus 161 ~~~~v~d~~~~~~~~~~ir~~~-~~~~--~---~~~~yt~~~~~~------y~~~~~-~~~~~~~~~--~~~~~~~~~~~ 225 (474) T protein:vir:94 161 QAIPIWVDKEREELKSFIRYYK-FNNE--E---KVEFWTDTTVTY------YVLENG-GLIPDYYYG--ANHVQSHFSNG 225 (474) T ss_pred ceEEEEcCCCCCceEEEEEEEE-ecCe--E---EEEEEeCCeEEE------EEEcCC-ccccccccC--cCccccccccc Confidence 9999987655555555555443 2221 1 234433211111 111111 111111000 01223344567 Q ss_pred CCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccc Q lcl|NC_012753. 231 GLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKRE 310 (502) Q Consensus 231 ~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 310 (502) +++++|+++|+++ +.|+|+|+++++|||+||.++|+++++++.....++| +.... +.. ... T Consensus 226 ~~g~vPvv~~~nn---------~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv----~~g~~---~~~---~~~ 286 (474) T protein:vir:94 226 NWGRVPFIAFKNN---------PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYI----LKGYE---GED---LEE 286 (474) T ss_pred CCCccceEEecCC---------cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceee----eecCC---ccc---chh Confidence 8999999999764 5699999999999999999999999999988888877 32211 111 011 Q ss_pred ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMR 390 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~ 390 (502) +..+...+..+...++ ..++.++.+++.+++.+.++.+.+.|...++.+.-+++ ..+|+.||.|++++++.+.++| T Consensus 287 ~~~~~~~~~~i~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Al~~~~~~l~~k~ 362 (474) T protein:vir:94 287 FMRGLKYYKAINVDGD---GGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-KFGSAPSGIALKFLYGNLDLKA 362 (474) T ss_pred hhhhhhccceeeccCC---CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-ccccccHHHHHHHHHHHHHHHH Confidence 2223334444444333 24667778888898888888888887777765532221 1235679999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) +.+++.|+.+|++++++|+.+.. ....+..++|+|++++|.|..+.++.+.+ +|+||++|+++.+++++| T Consensus 363 ~~k~~~~~~~l~~~~~li~~~~~-------~~~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D 432 (474) T protein:vir:94 363 NKLKNKATVAIQELISFIIDFNN-------LKTDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDD 432 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCcccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCC Confidence 99999999999999999987642 23456679999999999999888887655 599999999988766665 Q ss_pred HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 471 EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++|++||++|+.+.....+..++.+..++ T Consensus 433 --~~~E~eri~~E~~~~~~~~~~~~~~~~~~~ 462 (474) T protein:vir:94 433 --YKAELERIEQEQMEYNKQLPNLDDGGADGA 462 (474) T ss_pred --HHHHHHHHHHHHHHHHhhccccCCCCCCCc Confidence 678999999998776655555555555544 No 23 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=100.00 E-value=1.5e-63 Score=365.04 Aligned_cols=446 Identities=12% Similarity=0.107 Sum_probs=309.0 Q ss_pred CChhHHHH-HHHHHHhhcc--cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccccccce Q lcl|NC_012753. 1 MGIIQTIK-NFIKRSNYVI--TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVKRDFN 71 (502) Q Consensus 1 m~~~~~ik-~~i~~~~~~~--~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~ 71 (502) ||+=..+. ..++.+.... ..+.|.+ .+....++..+|..+++||.|+|+++..... ......++++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~-----~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:97 7 MPWDKPYGEEVVEQLKPQFETQEEMIVR-----LIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVHGNIDYDKPDWRI 81 (474) T ss_pred ccCCCchhhHHHHhhhhcccCHHHHHHH-----HHHHHHHHHHHHHHHHHHhccccchhcccchhccccccccccCccee Confidence 33222111 1122111110 0011111 1122345678999999999999988754321 1233456789 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++|||++||++.|+||||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|+ +.+++.+++|. T Consensus 82 ~~n~~k~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~ 160 (474) T protein:vir:97 82 TTNFHQNLVDQKVSYVASKPVTYSCEDENVLKVIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAE 160 (474) T ss_pred ecchHHHHHHHHHhhhhcCCceeccCcHHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCceEEEEEecCCCeeEEEEEccc Confidence 999999999999999999999999999999999999986 7899999999999999999999999986 57999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeec Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLN 230 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 230 (502) +++|+|.++......++++.+. .++. . .+|.|+.+...+ |...+. ......... -.........+ T Consensus 161 ~~~~v~d~~~~~~~~~~ir~~~-~~~~--~---~~~~yt~~~~~~------y~~~~~-~~~~~~~~~--~~~~~~~~~~~ 225 (474) T protein:vir:97 161 QAIPIWVDKEREELKSFIRYYK-FNNE--E---KVEFWTDTTVTY------YVLENG-GLIPDYYYG--ANHVQSHFSNG 225 (474) T ss_pred ceEEEEcCCCCCceEEEEEEEE-ecCe--E---EEEEEeCCeEEE------EEEcCC-ccccccccC--cCccccccccc Confidence 9999987655555555555443 2221 1 234433211111 111111 111111000 01223344567 Q ss_pred CCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccc Q lcl|NC_012753. 231 GLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKRE 310 (502) Q Consensus 231 ~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 310 (502) +++++|+++|+++ +.|+|+|+++++|||+||.++|+++++++.....++| +.... +.. ... T Consensus 226 ~~g~vPvv~~~nn---------~~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv----~~g~~---~~~---~~~ 286 (474) T protein:vir:97 226 NWGRVPFIAFKNN---------PEEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYI----LKGYE---GED---LEE 286 (474) T ss_pred CCCccceEEecCC---------cCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceee----eecCC---ccc---chh Confidence 8999999999764 5699999999999999999999999999988888877 32211 111 011 Q ss_pred ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMR 390 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~ 390 (502) +..+...+..+...++ ..++.++.+++.+++.+.++.+.+.|...++.+.-+++ ..+|+.||.|++++++.+.++| T Consensus 287 ~~~~~~~~~~i~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Al~~~~~~l~~k~ 362 (474) T protein:vir:97 287 FMRGLKYYKAINVDGD---GGVETIQVEVPVSSTKEYIDLMRVYIMEFGQGVDFQTD-KFGSAPSGIALKFLYGNLDLKA 362 (474) T ss_pred hhhhhhccceeeccCC---CceeEEeecCCHHHHHHHHHHHHHHHHHHhCccccCcc-ccccccHHHHHHHHHHHHHHHH Confidence 2223334444444333 24667778888898888888888887777765532221 1235679999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) +.+++.|+.+|++++++|+.+.. ....+..++|+|++++|.|..+.++.+.+ +|+||++|+++.+++++| T Consensus 363 ~~k~~~~~~~l~~~~~li~~~~~-------~~~d~~~i~v~f~~~~p~~~~e~a~~~~~---~g~iS~et~l~~l~~v~D 432 (474) T protein:vir:97 363 NKLKNKATVAIQELISFIIDFNN-------LKTDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDD 432 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCcccCHHHHHHHHHH---cCCCCHHHHHHhCCCCCC Confidence 99999999999999999987642 23456679999999999999888887655 599999999988766665 Q ss_pred HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 471 EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++|++||++|+.+.....+..++.+..++ T Consensus 433 --~~~E~eri~~E~~~~~~~~~~~~~~~~~~~ 462 (474) T protein:vir:97 433 --YKAELERIEQEQMEYNKQLPNLDDGGADGA 462 (474) T ss_pred --HHHHHHHHHHHHHHHHhhccccCCCCCCCc Confidence 678999999998776655555555555544 No 24 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=100.00 E-value=1.7e-63 Score=364.77 Aligned_cols=443 Identities=13% Similarity=0.122 Sum_probs=316.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC--------------CCccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS--------------NGSQV 66 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~--------------~~~~~ 66 (502) |.+ +.|+..|.+. +....++..+|...++||.|+|+++..+.. ..... T Consensus 1 ~~~-e~~~~~i~~~-----------------~~~~~~~~~~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~ 62 (471) T protein:vir:10 1 MEI-EVIKKIISSQ-----------------MVKHGKFVSQAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRN 62 (471) T ss_pred CCH-HHHHHHHHHH-----------------HHHHHHHHHHHHHHHHHhccccccccccchhhhhccccccccccccccc Confidence 432 1223333221 111235678999999999999988754311 12234 Q ss_pred cccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC--CceEE Q lcl|NC_012753. 67 KRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG--DQIRV 144 (502) Q Consensus 67 ~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~~i 144 (502) .++|+++||++.||++.++|+||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|+ |.+++ T Consensus 63 ~~~ki~~n~~~~Ivd~~~~yl~G~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~g~~~~ 141 (471) T protein:vir:10 63 ADNRISHNWHQLLLDQKKAYALTYPPTFDVDDKKVNDMIVDVLG-DDYERISKQLCVNAGNAGIAWLHVWKDASDNSFRY 141 (471) T ss_pred ccceeccchhHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhCCeEEEEEEeeCCCCeeEE Confidence 56789999999999999999999999999999999999999986 7899999999999999999999999983 68999 Q ss_pred EEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCcee------eccc Q lcl|NC_012753. 145 SFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRV------PLST 218 (502) Q Consensus 145 ~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v------~l~~ 218 (502) ..++|.+++|+|.++......++++.+...+...+...+++|.|+.+...+++ ..+.. ....+ +... T Consensus 142 ~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~------~~~~~-~~~~~~~~~~~~~~~ 214 (471) T protein:vir:10 142 ACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYR------HEKEK-PLEELETFQAISLID 214 (471) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEE------ecCCc-ccccccccccccccc Confidence 99999999999877655444555544444333344445556776532222222 11111 01111 1000 Q ss_pred c--ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc Q lcl|NC_012753. 219 L--YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT 296 (502) Q Consensus 219 ~--~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~ 296 (502) . ..........++++++|+++|+++ ..|.|+|+.+++|||+||.++|++++.++.....++| +.. T Consensus 215 ~~~~~~~~~~~~~~~~g~iPvv~~~n~---------~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv----~~g 281 (471) T protein:vir:10 215 TMNGDRSSDNSFKHDFGLVPFIPFKNN---------EIETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFV----LTN 281 (471) T ss_pred cccccccccccccCCCCceeEEEeccC---------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceee----eec Confidence 0 011223345678999999999764 3489999999999999999999999999988888887 332 Q ss_pred CCCCCCcccCccccccccchhhccccCC--CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccc Q lcl|NC_012753. 297 EYDTNGEKVTVKREFETGHNVYEQFDSG--DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK 374 (502) Q Consensus 297 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~ 374 (502) ....... .+......+..+... +.+.+..++.++.+++.+++...++.+.+.|...++.+.. ++...|+. T Consensus 282 ~~~~~~~------~~~~~~~~~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~--~~~~~gn~ 353 (471) T protein:vir:10 282 YGGQDKQ------EFLEDLKRYKMIKMDNDGMGDQSGVTTIAIDIPTEARNLILERTKKQIFISGQGVNP--ETDKLGNS 353 (471) T ss_pred CCccccc------hhHHHhhcCCeEEecCCCCccCccceEEeecCChHHHHHHHHHHHHHHHHHhCCcCC--CcccccCc Confidence 2111111 111122222333332 2234456788888999999999999988888887766543 33444677 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_012753. 375 TATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG 454 (502) Q Consensus 375 tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G 454 (502) ||+|++++++.+.++|+.+++.|+++|++++++|+.+.+. .++..++|+|++++|.|+.+.+++++++ +| T Consensus 354 Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~--------~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g 423 (471) T protein:vir:10 354 SGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGL--------SDKLKIKQTWTRNSINNDTEMAQVVSTL--AT 423 (471) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------CCCceeEEEeCCCCCCCHHHHHHHHHHH--hc Confidence 9999999999999999999999999999999999877532 2345789999999999999999999987 59 Q ss_pred CCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 455 FAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 455 i~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +||.||+++.+++++| +++|++||++|+.......++.++++---| T Consensus 424 ~iS~et~~~~~p~v~D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~e 469 (471) T protein:vir:10 424 ITSRENVAKSNPIVED--WQDELRLQKAEQEGRSEKLYDMEEVEHESE 469 (471) T ss_pred cCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhcccccCCCCCccc Confidence 9999999988877765 788999999998766554444333332223 No 25 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=100.00 E-value=7.1e-63 Score=361.29 Aligned_cols=440 Identities=12% Similarity=0.055 Sum_probs=306.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCC-ccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFD-SVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~-~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) ...++.|+.+|.+ .......+|+++.+||.|+++ ++...........++++++|||++| T Consensus 39 ~~~~~~i~~~i~~--------------------h~~~~~~rl~~l~~yY~g~~~~i~~~~~~~~~~~~~~ki~~n~~k~I 98 (502) T protein:vir:48 39 VNNWELLKNFINH--------------------HKLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMI 98 (502) T ss_pred cccHHHHHHHHHH--------------------HHHHHHHHHHHHHHHhcCCCccccccccccccccccceeecchHHHH Confidence 1112222222221 123456789999999999864 5554444455566789999999999 Q ss_pred HHHHhhhhhcCcceEeeCCH----HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNE----VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFP 154 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~----~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~P 154 (502) |++.++||||+|++++++++ ..++.|++++++|+|...+.++++.++++|.+|+++|+|+ |++++.+++|.+++| T Consensus 99 vd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~ 178 (502) T protein:vir:48 99 SKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFV 178 (502) T ss_pred HHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCCCCceEEEEEcccceEE Confidence 99999999999999999753 4667899999999999999999999999999999999985 689999999999999 Q ss_pred EEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 155 LQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 155 i~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) +|.++......++++.+.....+.. ++++|.|+-+ ..|+ |...+. .. ......+++++ T Consensus 179 vydd~~~~~~~~~ir~~~~~~~~~~--~~~~~iyt~~-~i~~-----~~~~~~---~~-----------~~~~~~~~~g~ 236 (502) T protein:vir:48 179 IYDNSLEDNSIAAVRYYNRGTLQNA--KDVVEIYTNQ-HIYT-----LDASDS---FN-----------EISVTPHAFGT 236 (502) T ss_pred EEcCCCCCceEEEEEEEEEeecCCc--EEEEEEEeCC-eEEE-----EEeCCc---ee-----------eccceecCCCc Confidence 9876544434444443333333333 3345665421 1111 111110 00 01123467889 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) +|+++|+++ +.|+|+|+++++|||+||.++|++++.++....+++|..++.....+..+......+.+... T Consensus 237 vPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~~~~~~~~~~ 307 (502) T protein:vir:48 237 VPITEFLNN---------ADGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQASDMKRTRLMQLK 307 (502) T ss_pred cceEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchhhhhhcceeecc Confidence 999999753 46899999999999999999999999999999998883332211111111111111111000 Q ss_pred chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~ 394 (502) ........+.+..++.++.+++.+.+...++.+.+.|...++.+..+++.. +|+.||.|++++++.+.++++.++ T Consensus 308 ----~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~-~~n~Sg~Alk~~~~~l~~k~~~~~ 382 (502) T protein:vir:48 308 ----PPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNHF-SGNASGEALKYKLFGLDQDRVDTQ 382 (502) T ss_pred ----ccccccccccCcceeEeeecCCHHHHHHHHHHHHHHHHHHhCCCCcCcccc-ccCchHHHHHHHHHHHHHHHHHHH Confidence 001111122334577788888888888888888888887777665444322 356799999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQ 474 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~ 474 (502) +.|+.+|++++++|+.+.... ..+.......++|+|++++|.|..++++.+.++ +|++|.+|+++.+++++| ++ T Consensus 383 ~~~~~~l~~~~~li~~~~~~~--~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~l~~l~~v~D--~~ 456 (502) T protein:vir:48 383 SQFTQGLKRRYRLAARIGSLV--NEFKDFDESRLKITFTPNLPKSLYEQVSILNDL--GGQVSQETALSLSGLVEN--PT 456 (502) T ss_pred HHHHHHHHHHHHHHHHHHhhc--ccccccccccceEEeCCCCCcCHHHHHHHHHHH--hccCcHHHHHHhCCCCCC--HH Confidence 999999999999999886542 122334456799999999999999999999987 599999999888765555 67 Q ss_pred HHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 475 EIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 475 ~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|++||++|+.+.......+...+..|. T Consensus 457 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 484 (502) T protein:vir:48 457 EELDKINEESSKIDFKGYPSYFYDNVGK 484 (502) T ss_pred HHHHHHHHHHHhhhhhcccccccccccc Confidence 8999999988654322223333333333 No 26 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=100.00 E-value=3.4e-63 Score=363.06 Aligned_cols=465 Identities=11% Similarity=0.055 Sum_probs=310.1 Q ss_pred CChhHHHHHHHHHHh--hcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC--CCccccccceecchH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSN--YVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS--NGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~~~~~ik~~i~~~~--~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~--~~~~~~~~~~~~n~~ 76 (502) -.+....+..+...- -.+....+.+++.+ ...++..+|+++++||.|+|+++..+.. ..+...++|+++||| T Consensus 3 ~~~~~~~~~~~~~~~~~~~l~~~~i~~li~~----~~~~~~~r~~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~ 78 (506) T protein:vir:94 3 YDLTEHKQANLIYQESLENLTPNKIMKFITH----HFNYQRPRLEMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFA 78 (506) T ss_pred cchhhhhcceeecccchhcCCHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCccccccccccccccCCcceeecchH Confidence 111111111110000 00011111121111 0124567899999999999987644322 233456688999999 Q ss_pred HHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEE Q lcl|NC_012753. 77 RTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPL 155 (502) Q Consensus 77 k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi 155 (502) ++||++.|+||||+|++++++++..++.|++++++|+|+..+.++++.++++|.+|+++|+|+ |.+++.+++|.+++|+ T Consensus 79 ~~Iv~~~~~~l~G~p~~~~~~d~~~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~ded~~~~i~~~~p~~~~~v 158 (506) T protein:vir:94 79 KYIADFQTSYSVGNPINVKLPDDGSNSGFDTFNKANDVDAENYDLFLDMSRYGRAYEYVYRGEDNEEHLAKLDPLDTFVI 158 (506) T ss_pred HHHHHHhhhhhcccCceeecCcchHHHHHHHHHhccCHhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEE Confidence 999999999999999999999999999999999999999999999999999999999999985 6899999999999999 Q ss_pred EEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEE-EeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 156 QANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHE-WNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 156 ~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~-~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) |.+.......++++.+.....+....++..++++ |....+++ |.. ...+..+ .....+++++ T Consensus 159 ~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~----~~~---~~~~~~~----------~~~~~~~~g~ 221 (506) T protein:vir:94 159 YSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTL----YNP---TPIMGKM----------QVDTTKPITT 221 (506) T ss_pred ecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEE----ecc---ccCccce----------eccccccCCc Confidence 8876554445555443322222222222222221 22222211 221 1222211 1123478889 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc--------------CCCC Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT--------------EYDT 300 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~--------------~~~~ 300 (502) +|+++|+++ +.|+|+|+++++|||+||.++|++++.++.....+++-.++... ..++ T Consensus 222 vPvv~~~n~---------~~~~sd~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 292 (506) T protein:vir:94 222 FPVVEFKNS---------NFRLGDFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDA 292 (506) T ss_pred cceEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhcccccccccccc Confidence 999999764 35899999999999999999999999998777777662221100 0011 Q ss_pred CCcccCccccccccchhhccccCCCC------ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccc Q lcl|NC_012753. 301 NGEKVTVKREFETGHNVYEQFDSGDM------DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK 374 (502) Q Consensus 301 ~g~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~ 374 (502) .+........+......++.+...++ +.+..++.++.++..+++...++.+.+.|...++.+...++ ..+++. T Consensus 293 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~ 371 (506) T protein:vir:94 293 MAKLAKDKLELIKEMKDANMLLLKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTDE-NFASNS 371 (506) T ss_pred ccccccchhHHHhhhhhcCeeeecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCccccccc-cccccc Confidence 11111111112222222333322221 12234666777888888888888888888887776653322 223667 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_012753. 375 TATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG 454 (502) Q Consensus 375 tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G 454 (502) ||.|++++++.+.++|+.+++.|+.+|++++++|+.++...+ ++......+++|.|++++|.|..+.+++++++ +| T Consensus 372 Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~--~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl--~g 447 (506) T protein:vir:94 372 SGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIH--GDWTFDPQELTFTFRDNLPADNISQIKALVQA--GA 447 (506) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CccccccccceEEeCCCCCcCHHHHHHHHHHH--hc Confidence 999999999999999999999999999999999998876421 22334556799999999999999999999987 59 Q ss_pred CCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 455 FAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 455 i~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +||++|++..+++++| +++|++||++|+......++.....+--++ T Consensus 448 ~iS~et~~~~lp~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 493 (506) T protein:vir:94 448 TLPQKYLYQQLPGVTN--PQDIVDMMKEQSANGDYSFDQNGVISNDGQ 493 (506) T ss_pred cCChHHHHHhCCCCCC--HHHHHHHHHHHHHHHhhcchhhcCCCcccC Confidence 9999999998766665 678999999998765544433322222222 No 27 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=100.00 E-value=9.4e-63 Score=360.63 Aligned_cols=440 Identities=12% Similarity=0.047 Sum_probs=305.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCC-ccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFD-SVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~-~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) ++..+.|+.+|.+. ......+|+++.+||.|+++ ++...........++++++|||++| T Consensus 38 ~~~~~~i~~~i~~~--------------------~~~~~~r~~~~~~yY~g~~~~i~~~~~~~~~~~~~~ri~~n~~k~I 97 (501) T protein:vir:96 38 VNNWELLKNFINHH--------------------KLRQAPRIQELLDYARGENHDVLKSGRRKDNEMADKRAVHNYGRMI 97 (501) T ss_pred CChHHHHHHHHHHH--------------------HHHHHHHHHHHHHHhcCCCCcccCccccCccccccceeecchHHHH Confidence 45555555555431 12445789999999999864 5555444455566789999999999 Q ss_pred HHHHhhhhhcCcceEeeCC----HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDN----EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFP 154 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d----~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~P 154 (502) |++.++||||+|+++++++ +..++.|++++++|+|...+.++++.++++|.+|+++|+|+ |.+++.+++|.+++| T Consensus 98 vd~~~~yl~g~p~~~~~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~v~~dedg~~~i~~~~p~~~~~ 177 (501) T protein:vir:96 98 SKFKTGYLAGNPIRVEYDDNDDNSQNDDAIKRIGRINDLDSLNRTLIRDLSQTGRAYEVIYRSEYDETRIKRLSPLETFV 177 (501) T ss_pred HHHHhhhhcccCeeEeeCCccchhHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEEEEccceeEE Confidence 9999999999999999975 45677899999999999999999999999999999999985 689999999999999 Q ss_pred EEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 155 LQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 155 i~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) +|.++......++++.++....... .++++.|+.+ ..|+ |..... +.. .....+++++ T Consensus 178 v~d~~~~~~~~~~v~~~~~~~~~~~--~~~~~vyt~~-~i~~-----~~~~~~---~~~-----------~~~~~~~~g~ 235 (501) T protein:vir:96 178 IYDNSLEDNSIAAVRYYNRGTLQSA--KDVVEIYTDE-HIYT-----LDASDD---FNE-----------ISVTTHAFGT 235 (501) T ss_pred EEcCCCCCceEEEEEEEEeecCCCc--EEEEEEEcCC-cEEE-----EeeCCC---cee-----------ccccccCCCc Confidence 9876644434444444433333222 2345554321 1111 121110 011 1123467889 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) +|+++|+++ +.|+|+|+++++|||+||.++|++++.++....++++..++.....+..+......+.+..+ T Consensus 236 vPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~~~~~~~~~~~~~ 306 (501) T protein:vir:96 236 VPITEYLNN---------IDGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQASDMKRTRLMQLK 306 (501) T ss_pred cceEEecCC---------ccCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccchhhhhhcCeeeec Confidence 999999753 56999999999999999999999999999888888873333211111111111111111110 Q ss_pred chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~ 394 (502) . .........+..++.++.++..+.+...++.+.+.|...++.+...++.. +++.||.+++++++.+.++|+.++ T Consensus 307 ~----~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~ka~~~~ 381 (501) T protein:vir:96 307 P----PKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNF-SGNTSGEALKYKLFGLDQDRVDTQ 381 (501) T ss_pred c----cccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccc-cccchHHHHHHHHHHHHHHHHHHH Confidence 0 00011112223456666777777777777777666666666554333322 356699999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQ 474 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~ 474 (502) +.|+.+|++++++|+.++...+ .+.......++|.|++++|.|..+.++++++++ |++|.+|+++.+++++| ++ T Consensus 382 ~~~~~~l~~~~~li~~~~~~~~--~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~--g~iS~et~~~~l~~v~D--~~ 455 (501) T protein:vir:96 382 SQFTKGLKRRYRLAARIGSLVN--EFKDFDESLLKITFTPNLPKSLNEQVSILTGLG--GQVSQETALSLSGLVES--PN 455 (501) T ss_pred HHHHHHHHHHHHHHHHHHHhcc--cccccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HH Confidence 9999999999999998865422 233344567999999999999999999999984 99999999998876665 67 Q ss_pred HHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 475 EIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 475 ~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|++||++|+.............+..|+ T Consensus 456 ~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 483 (501) T protein:vir:96 456 EELDKINKEMSEIDFKGYSNDFNEHVGK 483 (501) T ss_pred HHHHHHHHHHHHhhccccccchhhcccc Confidence 8999999988765433333333333443 No 28 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=100.00 E-value=5.1e-63 Score=362.11 Aligned_cols=449 Identities=13% Similarity=0.111 Sum_probs=309.7 Q ss_pred CChhH------HHHHHHHHHhhcc--cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccc Q lcl|NC_012753. 1 MGIIQ------TIKNFIKRSNYVI--TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQV 66 (502) Q Consensus 1 m~~~~------~ik~~i~~~~~~~--~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~ 66 (502) |-.|+ ++..+++.+--.. ....|.+.++ -...++.++..+++||.|+|+++..... ..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~-----~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~ 75 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVVEQIKPKYETQEEMIIRLIN-----DHKPKIDDITVGERYYNHDPDVLRLAPKLDNKGEIDPLK 75 (474) T ss_pred CeeeccCCCchhhhhHHHHhhhccCChHHHHHHHHH-----HHHHHHHHHHHHHHHhccCCcchhccchhcccccccccc Confidence 44332 4444444321111 1122222222 1235678999999999999987654321 12335 Q ss_pred cccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEE Q lcl|NC_012753. 67 KRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVS 145 (502) Q Consensus 67 ~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~ 145 (502) .++|+++|||+.||++.|+||||+|++++++++..++.|+++++ +++...+.++++.|+++|.+|+++|+|. |.+++. T Consensus 76 ~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~y~d~~~~~~i~ 154 (474) T protein:vir:96 76 PDWRMFTNYHQNLVDQKVAYAVANPVTFSSDDDKSLKTIQEVLN-HKWDDKLVDILTAASNKGIEWLQPYIDENGEFKTF 154 (474) T ss_pred cchhcccchHHHHHHhhhhhhcccCceeecCchHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeeEEEEEecCCCceEEE Confidence 66789999999999999999999999999999999999999986 6799999999999999999999999975 679999 Q ss_pred EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccccc-CCC Q lcl|NC_012753. 146 FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYE-DLE 224 (502) Q Consensus 146 ~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~-~l~ 224 (502) +++|+++||+|.++......+++ +++..++.. .+|.|+.. .|.+..+. +...+...+....... ... T Consensus 155 ~~~p~~~~~v~d~~~~~~~~~~v-r~~~~~~~~-----~~~~yt~~----~v~~~~~~--~~~~~~~~~~~~~~~~~~~~ 222 (474) T protein:vir:96 155 RVPAEQAIPIWTNKERDTLKAFI-RYYRLDGAE-----RVEYWTDS----DVTYYEYQ--DGILIPDYYHGEEHIQSHYY 222 (474) T ss_pred EEcccceEEEEcCCCCCceEEEE-EEEeecCce-----EEEEEeCC----eEEEEEec--CCceeecccccccccccccc Confidence 99999999998765444444444 344443321 13333211 11111111 1111110000000000 001 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~ 304 (502) .....++++++|+++|+++ +.|+|+|+.+++|||+||.++|+++++++.....++| +........ T Consensus 223 ~~~~~~~~g~iPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv----~~g~~~~~~-- 287 (474) T protein:vir:96 223 VGNKRVSWGRVPFIPFKNN---------PQEMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYI----LKGYEGQDL-- 287 (474) T ss_pred ccccccCCCceeEEEeccC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceee----eecCCcccc-- Confidence 1123478899999999864 4689999999999999999999999999998898888 332211111 Q ss_pred cCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 305 VTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) ..+..+...++.+...+ .++.++.++.++..+++...++.+.+.|...++.+..+++ ..+++.||.|++++++ T Consensus 288 ----~~~~~~~~~~~~i~~~~--~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Al~~~~~ 360 (474) T protein:vir:96 288 ----DEFMRNLKYYKAINVDG--DGSGVDTIQIEVPVQSSKEYLDMLRDYVIEFGQGVDFQQD-KFGNSPSGIALKFMYS 360 (474) T ss_pred ----cchhhhhhcCceEEecC--CCCceeEEeecCChHHHHHHHHHHHHHHHHHhCCcccccc-ccccccHHHHHHHHHH Confidence 11223333344444432 2334777888888999998899888888888876654332 2235679999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) .+.++|+.+++.|+++|++++++|+.+.. .......++|+|++++|.|+.+.++.+ +.+|+||++|+++. T Consensus 361 ~l~~k~~~k~~~~~~~l~~~~~~i~~~~~-------~~~~~~~i~i~f~~~~p~~~~e~~~~~---~~ag~iS~et~~~~ 430 (474) T protein:vir:96 361 NLDLKANKLKNKTLTALQELLQYIIDFYK-------LNIKVQDVEITFNFNVMVNELEQSQIG---VQSQYLSKETVVTN 430 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCCCcCHHHHHHHH---HhcCCCchHHHHHh Confidence 99999999999999999999999887642 234456789999999999988877754 45799999999988 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 465 TLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 465 ~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++++| +++|++||++|+.+.....++..+ |..|+ T Consensus 431 ~~~v~d--~~~E~~ri~~E~~e~~~~~~~~~~-~~~~~ 465 (474) T protein:vir:96 431 HPWVDD--PVAELERIEQDNIDFNKQLPPLEG-DANGR 465 (474) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHhccccccc-ccccc Confidence 766765 778999999998765544433333 33333 No 29 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=100.00 E-value=4.2e-63 Score=362.56 Aligned_cols=446 Identities=14% Similarity=0.099 Sum_probs=300.1 Q ss_pred CChhH--HHHHHHHHHhh------cccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCC------Cccc Q lcl|NC_012753. 1 MGIIQ--TIKNFIKRSNY------VITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSN------GSQV 66 (502) Q Consensus 1 m~~~~--~ik~~i~~~~~------~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~------~~~~ 66 (502) |.=+. .=|.|+.+..- .+....|.+ -+....++..+|..+++||.|+|+++..+... .+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~-----~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~ 75 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILR-----LITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFK 75 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHH-----HHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccc Confidence 11000 00001110000 000011111 11223356788999999999999887654321 2334 Q ss_pred cccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEE Q lcl|NC_012753. 67 KRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVS 145 (502) Q Consensus 67 ~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~ 145 (502) .++++++|||+.||++.++||||+|++++++++..++.|+++++ |+|...+.++++.|+++|.+|+++|+|+ |.+++. T Consensus 76 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~ 154 (468) T protein:vir:96 76 PDWRMYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLN-HKWDDKLVDILTAASNKGVEWIQPYVDEQGEFKTF 154 (468) T ss_pred cccccccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHh-cCHHHHHHHHHHHHhhcCeEEEEEEEcCCCceEEE Confidence 56789999999999999999999999999999999999999996 6899999999999999999999999985 679999 Q ss_pred EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCc-eeeccccccCCC Q lcl|NC_012753. 146 FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQ-RVPLSTLYEDLE 224 (502) Q Consensus 146 ~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~-~v~l~~~~~~l~ 224 (502) +++|++++|+|.+.......+|+ +++..++.. .+|.|+.+ ++.+ |...+...+-. ............ T Consensus 155 ~~~p~~~~~v~~~~~~~~~~~~i-r~~~~~~~~-----~~~~~~~~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 222 (468) T protein:vir:96 155 RVPAEQAIPIWTNKERDELKAFI-RLYELDGGE-----RVEYWTAN----DVTF--YELKDGQLIPDYYQGEEHVQAHYY 222 (468) T ss_pred EEcccceEEEEcCCCCCceEEEE-EEEEecCce-----EEEEEeCC----eEEE--EEEcCCceeeccccccccccccee Confidence 99999999997654433344444 444443321 23444321 1211 11111110000 000000000111 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~ 304 (502) .....++++++|+++|+++ +.|+|+|+++++|||+||.++|+++++++.....+++ +........ T Consensus 223 ~~~~~~~~~~iPvv~~~n~---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv----~~g~~~~~~-- 287 (468) T protein:vir:96 223 VGNKSMSWNRVPFIPFKNN---------PQEVSDLFMYKTIIDAMDKRLSDTQNTFDEATELIYV----LKGYEGEDL-- 287 (468) T ss_pred eccccccCCcccEEEecCC---------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceee----eecCCcccc-- Confidence 1223477899999999763 5689999999999999999999999999887777777 322111111 Q ss_pred cCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 305 VTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) ..+..+...++.+...+ +.+..++.++.++..+++...++.+.+.|...++.+.-.+. ..+|+.||.|++++++ T Consensus 288 ----~~~~~~~~~~~~i~~~~-d~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~ 361 (468) T protein:vir:96 288 ----EEFMYNLKYYKAINVDG-DGSGGVDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVDFQQD-KFGNSPSGIALKFMYS 361 (468) T ss_pred ----chhhhhhhcCceEEecC-CCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccccc-ccccchHHHHHHHHHH Confidence 11222222233333322 22334677888888888888888888888777766532221 2246779999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) .+.++++.+++.|+.+|++++++|+.+. +.......++|+|++++|.|..+.++++++ +|+||.+|+++. T Consensus 362 ~l~~k~~~k~~~~~~~l~~~~~li~~~~-------g~~~d~~~i~i~f~~~~p~d~~e~a~~~~~---~g~iS~et~i~~ 431 (468) T protein:vir:96 362 NLDLKANKLKNKTLTALQELLQYIIDFY-------KLSIKVQDVEITFNFNVMVNELEQSQIGVN---SQYLSKETVVTN 431 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCCcccceeeEEecCCCCcCHHHHHHHHHh---cCCCchHHHHHh Confidence 9999999999999999999999998763 223455679999999999999888876654 699999999988 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 465 TLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 465 ~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++++| +++|++||++|+.+.....+ +++|. T Consensus 432 l~~v~D--~~~E~~ri~~E~~~~~~~~~-----~~~~~ 462 (468) T protein:vir:96 432 HPWVDD--PVAEMERIDQEELALPSIEE-----GLNGK 462 (468) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHHhh-----ccCCC Confidence 766665 78999999999876544332 23444 No 30 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=100.00 E-value=3.5e-63 Score=362.99 Aligned_cols=430 Identities=14% Similarity=0.146 Sum_probs=306.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccccccceecc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVKRDFNHLP 74 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~~~n 74 (502) +.+.+.|+.+|.+ ...++.+|..+.+||.|+|+++..... ..+...++|+++| T Consensus 34 e~~~~~i~~~i~~---------------------~~~~~~r~~~l~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n 92 (483) T protein:vir:12 34 ETLEEMIVRYIKQ---------------------HLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMITN 92 (483) T ss_pred hhHHHHHHHHHHH---------------------HHHHHHHHHHHHHHhccccccccccccccccccccccccccccccc Confidence 2223333333322 124567899999999999988765432 1233456789999 Q ss_pred hHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEE Q lcl|NC_012753. 75 IGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFF 153 (502) Q Consensus 75 ~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~ 153 (502) ||++||++.|+||||+|++++++++..++.|+++++ |+|...+.++++.++++|.+|+++|+|+ |.+++.+++|.+++ T Consensus 93 ~~k~Ivd~~~~~l~G~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~y~~v~~d~d~~~~i~~~~p~~~~ 171 (483) T protein:vir:12 93 FHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQGI 171 (483) T ss_pred hHHHHHHHHhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEEcCCCceEEEEEcccceE Confidence 999999999999999999999999999999999986 6899999999999999999999999985 67999999999999 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-ccCCCcceeecCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-YEDLEETVTLNGL 232 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~~~l~~~~~~~~~ 232 (502) |+|.++......++++ ++..++.. .+|.|+. ..+.+..+.+ |..++.... ..........+++ T Consensus 172 ~v~d~~~~~~~~~~ir-~~~~~~~~-----~~~~y~~----~~v~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~ 235 (483) T protein:vir:12 172 PIWTDKEHEELEAFIR-MYKLENET-----KVEYWDK----VTVNYYVYEN------GSLIPDYSNNLENSKTHFSTGSW 235 (483) T ss_pred EEEcCCCCCceEEEEE-EEEeecce-----EEEEEec----CeEEEEEEeC------CeeeecccccccccccccccCCC Confidence 9987654443444443 33333321 2344432 1222211221 111110000 1122233445788 Q ss_pred CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccc Q lcl|NC_012753. 233 TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFE 312 (502) Q Consensus 233 ~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~ 312 (502) +.+|+++|+++ +.|+|+|+.+++|||+||.++|++++.++....++++ +.......... +. T Consensus 236 g~vPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv----~~g~~~~~~~~------~~ 296 (483) T protein:vir:12 236 GKIPFIPFKNN---------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV----LTNYDDQELPE------FK 296 (483) T ss_pred CccceEEecCC---------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceee----eecCCcccchh------HH Confidence 99999999763 4689999999999999999999999999988887777 43322221111 11 Q ss_pred ccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 313 TGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS 392 (502) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~ 392 (502) .....+..+..+++ ..++.++.++..+.+...++.+.+.|...++.+.-+++ ..+++.||.|++++++.+..+|+. T Consensus 297 ~~~~~~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Al~~~~~~l~~k~~~ 372 (483) T protein:vir:12 297 RLLRYYGAIKVSDN---GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSD-KFGSAPSGVALEFLYTNLNLKADK 372 (483) T ss_pred HhhhhccccccCCC---CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCcc-ccccCcHHHHHHHHHHHHHHHHHH Confidence 22233333333332 23566677777888777777777777666655433322 123566899999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHH Q lcl|NC_012753. 393 IATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQ 472 (502) Q Consensus 393 ~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~dee 472 (502) +++.|+.+|++++++|+.+.. ....+.+++|.|++++|.|..+++++++++ +|+||.||++..+++++| T Consensus 373 ~~~~f~~~l~~~~~li~~~~~-------~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~GiiS~et~~~~~~~v~d-- 441 (483) T protein:vir:12 373 LARKAKVAIQELLWFVFEHFD-------IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVED-- 441 (483) T ss_pred HHHHHHHHHHHHHHHHHHHhc-------CCCccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCC-- Confidence 999999999999999887643 223456899999999999999999999987 599999999988766655 Q ss_pred HHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 473 AQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 473 a~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++|++||++|+.......++..+++..++ T Consensus 442 ~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~ 471 (483) T protein:vir:12 442 LQAELERIEQEQMEYNKQLPNLDDGGADGA 471 (483) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccCCc Confidence 788999999998776666655555555544 No 31 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=100.00 E-value=5.7e-63 Score=361.81 Aligned_cols=446 Identities=12% Similarity=0.097 Sum_probs=307.4 Q ss_pred CChh-HHHHHHHHHHhhcc--cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCC------Cccccccce Q lcl|NC_012753. 1 MGII-QTIKNFIKRSNYVI--TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSN------GSQVKRDFN 71 (502) Q Consensus 1 m~~~-~~ik~~i~~~~~~~--~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~------~~~~~~~~~ 71 (502) ||.= ....++|..+.-.. ..+.|.+.+. ....+..+|..+++||.|+|+++...... .....++++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~-----~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:95 7 MPWDKPYGEEVVEQLKPQFETQEEMIIRLID-----DHRKQLDKITVGQRYYDKDNDIVKQMKKVDVYGNIDYDKPDWRI 81 (474) T ss_pred cCCCCchhhHHHHhhhhccCChHHHHHHHHH-----HHHHHHHHHHHHHHHhcccCchhcccccccccccccccccccee Confidence 3332 12333333322111 1122222222 22356788999999999999887543221 223446789 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++|||+.||++.|+||||+|++++++++...+.|+.+++ |+|...+.++++.++++|.+|+++|+|+ |.+++.+++|. T Consensus 82 ~~n~~~~Ivd~~~~~l~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~e~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~ 160 (474) T protein:vir:95 82 TTNFHQNLVDQKVSYVASKPVTYSCEDESVLKIIHDVLD-TRWDNKLIDILTATSNKGIDWLQVYINENGEMKLFRVPAE 160 (474) T ss_pred ccchHHHHHHHHHhhhccCCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCcEEEEEEecCCCceEEEEEccc Confidence 999999999999999999999999999999999999986 6899999999999999999999999975 67999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeec Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLN 230 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 230 (502) +++|+|.+.......++++.+ ..++.. .++.|+.+...+ |...+. .... .++.. ..........+ T Consensus 161 ~~~~v~d~~~~~~~~~~i~~~-~~~~~~-----~~~~y~~~~~~~------~~~~~~-~~~~-~~~~~-~~~~~~~~~~~ 225 (474) T protein:vir:95 161 QAIPIWVDKEREELKSFIRYY-KFNNEE-----KVEFWTDTTVTY------YVLENG-GLIP-DYYYG-ANHIQSHFSNG 225 (474) T ss_pred ceEEEEcCCCCCceEEEEEEE-EEcCee-----EEEEEeCCeEEE------EEEcCC-cccc-ccccC-ccccccccccc Confidence 999998765444455555443 332211 233333211111 111110 0000 00000 11223344557 Q ss_pred CCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccc Q lcl|NC_012753. 231 GLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKRE 310 (502) Q Consensus 231 ~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 310 (502) +++++|+++|+++ +.|+|+|+++++|||+||.++|+++++++.....+++ +........ .. T Consensus 226 ~~g~iPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv----~~g~~~~~~------~~ 286 (474) T protein:vir:95 226 NWGRVPFIAFKNN---------PEEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYI----LKGYEGQDL------EE 286 (474) T ss_pred CCCccceEeecCC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceee----eecCCcccc------hh Confidence 8899999999763 5689999999999999999999999999988888887 322211111 11 Q ss_pred ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMR 390 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~ 390 (502) +......+..+..+++ ..++.++.+++.+++...++.+.+.|...++.+.-+++ ..+++.||.|++++++.+.++| T Consensus 287 ~~~~~~~~~~i~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~~l~~k~ 362 (474) T protein:vir:95 287 FMRGLKYYKAINVDGD---GGVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVDFQTD-KFGSAPSGIALKFLYGNLDLKA 362 (474) T ss_pred hhhhhhccceeeccCC---CceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc-cccccchHHHHHHHHHHHHHHH Confidence 2233333444444332 24666777888999998888888888777766542222 2235679999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) +.+++.|+.+|++++++|+.+.. .......++|+|++++|.|+.+.++++++ +|+||++|+++.+++++| T Consensus 363 ~~k~~~~~~~l~~~~~li~~~~g-------~~~d~~~i~v~f~~~~p~d~~e~a~~~~~---~g~iS~et~i~~l~~v~d 432 (474) T protein:vir:95 363 NKLKNKATVAIQELIGFIIDFNN-------LKMDVKDIEISFNFNRMMNDAEQSQIIAQ---SQYLSRETLVKSSPLVDD 432 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeccCCCcCHHHHHHHHHh---cCCCchHHHHHhCCCCCC Confidence 99999999999999999987642 23456789999999999999888886654 699999999988766665 Q ss_pred HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 471 EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++|++||++|+...........+++.-++ T Consensus 433 --~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~ 462 (474) T protein:vir:95 433 --YKAELERIEQEQMEYNKQLPNLDDGGADGA 462 (474) T ss_pred --HHHHHHHHHHHHHHHHhcccccccccCCCC Confidence 678999999998655444433333333222 No 32 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=100.00 E-value=7e-63 Score=361.33 Aligned_cols=448 Identities=14% Similarity=0.176 Sum_probs=303.9 Q ss_pred CChhHHHHHHHHHHhhcccc------cchhhhhc------------cccccCCHHHHHHHHHHHHHhcCCCCccccccC- Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITN------QSLNSITD------------HPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS- 61 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~------~~l~~i~~------------~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~- 61 (502) .-+++.+..-+-++|.-+++ ..++++.- ..-+....+++.++..+++||.|+|+++..... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~~~~ 83 (492) T protein:vir:94 4 IQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV 83 (492) T ss_pred HHHHHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccc Confidence 56666544433333322211 11111100 001111234667899999999999988765432 Q ss_pred -----CCccccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEE Q lcl|NC_012753. 62 -----NGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPY 136 (502) Q Consensus 62 -----~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~ 136 (502) ......++|+++|||++||++.|+||||+|++++++++...+.|+++++ |+|...+.++++.++++|.+|+.+| T Consensus 84 ~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~a~~~G~a~~~v~ 162 (492) T protein:vir:94 84 DATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred cccccccccccccccccchHHHHHHHHHhhhcccCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEE Confidence 1233456789999999999999999999999999999999999999986 7899999999999999999999999 Q ss_pred EeC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceee Q lcl|NC_012753. 137 IDG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVP 215 (502) Q Consensus 137 ~d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~ 215 (502) +|+ |.+++..++|.+++|+|.++......++++ ++..+... .+|.|+.....+++ +.. .. .++ T Consensus 163 ~d~dg~~~~~~~~p~~~~~v~d~~~~~~~~a~ir-~~~~~~~~-----~~~~y~~~~v~~~~----~~~--~~----~~~ 226 (492) T protein:vir:94 163 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIR-MYKLENET-----KVEYWDKVTVNYYV----YEN--GS----LIP 226 (492) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEEE-EEeeccce-----eEEEEecCeEEEEE----Eec--Ce----eee Confidence 985 689999999999999986654444445444 34333221 23444322111211 111 00 000 Q ss_pred c-cccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHh Q lcl|NC_012753. 216 L-STLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMI 294 (502) Q Consensus 216 l-~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l 294 (502) . .............++++.+|+++|+++ ++|+|+|+++++|||+||.++|++++.++....++++ + T Consensus 227 ~~~~~~~~~~~~~~~~~~g~vPvv~~~nn---------~~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv----~ 293 (492) T protein:vir:94 227 DYSNNLENSKTHFSTGSWGKIPFIPFKNN---------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV----L 293 (492) T ss_pred ccccccccccccccccCCCccceEEecCC---------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceee----e Confidence 0 000112223334578899999999764 4689999999999999999999999999988888887 3 Q ss_pred ccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHH---HHhcCCChhhcccccc Q lcl|NC_012753. 295 KTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLF---EMQLGVSTGMFSFDGK 371 (502) Q Consensus 295 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i---~~~~g~s~~~~~~~~~ 371 (502) .......... +......+..+..++++ .++.++.++..+++...++.+.+.| +..+.++.+.|+ T Consensus 294 ~g~~~~~~~~------~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~---- 360 (492) T protein:vir:94 294 KNYDDQELPE------FKRLLRYYGAIKVSDNG---GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG---- 360 (492) T ss_pred ecCCcccchh------hHHHHhhccceecCCCC---cceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCccccc---- Confidence 3222211111 11222333333333322 3455556666666555555555544 544455555443 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_012753. 372 SMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMV 451 (502) Q Consensus 372 ~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~ 451 (502) ++.||.|++++++.+..+|+.+++.|+.+|++++++|+.+... .....++.|+|++++|.|..+.++++++++ T Consensus 361 ~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~-------~~~~~~i~v~f~~~~p~~~~e~~~~~~kl~ 433 (492) T protein:vir:94 361 SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-------KGEHKDVDISFNYNKVANTELQVQTAQQSM 433 (492) T ss_pred cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CcccceeeEEecCCCCCCHHHHHHHHHHHh Confidence 4568999999999999999999999999999999998876432 224567899999999999999999999884 Q ss_pred hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 452 AAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 452 ~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |++|.+|+++.+++++| +++|++||++|+...+...++..+.+..++ T Consensus 434 --giiS~et~~~~l~~v~d--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~ 480 (492) T protein:vir:94 434 --GIVSHETVLENHPFVED--LQAELERIEQEQMEYNKQLPNLDDGGADSA 480 (492) T ss_pred --ccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhccccccccCCCC Confidence 99999999888766655 788999999997766555544444444444 No 33 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=100.00 E-value=1.9e-62 Score=358.93 Aligned_cols=451 Identities=13% Similarity=0.090 Sum_probs=308.4 Q ss_pred CCh-----hHHHHHHHHHHhhc--ccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCcccc Q lcl|NC_012753. 1 MGI-----IQTIKNFIKRSNYV--ITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVK 67 (502) Q Consensus 1 m~~-----~~~ik~~i~~~~~~--~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~ 67 (502) |.| +..++++++.+--. .....|.+.+.+ ..++..+|..+.+||.|+|+++..... ...... T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~-----~~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~ 76 (478) T protein:vir:10 2 ISINWPWDKPYHEQVVEQIKPKYETQEEMILRLVRE-----HKENIDNITMGERYYNHHPDILDAPFKRDVNGDYDETKP 76 (478) T ss_pred ccccccCCchhhhHHHHHhhhccCChHHHHHHHHHH-----HHHHHHHHHHHHHHhcccccccccchhhhcccccccccc Confidence 222 12344444332111 112223333221 235678999999999999988764322 123455 Q ss_pred ccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEE Q lcl|NC_012753. 68 RDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSF 146 (502) Q Consensus 68 ~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~ 146 (502) ++|+++||++.||++.|+||||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|+++|+|+ |.+++.+ T Consensus 77 ~~ki~~n~~k~ivd~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~~~~ 155 (478) T protein:vir:10 77 DWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTFR 155 (478) T ss_pred cceeccchHHHHHHHHhhhhcccCceeecCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEEEecCCCceEEEE Confidence 6789999999999999999999999999999999999999986 7999999999999999999999999985 6899999 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeec--cccccCCC Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPL--STLYEDLE 224 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l--~~~~~~l~ 224 (502) ++|.+++|+|.+.......++++ ++..++. +.+|.|+.+...++ ...+. .+...... ........ T Consensus 156 ~~p~~~~~v~d~~~~~~~~~~ir-~~~~~~~-----~~~~~y~~~~i~~~------~~~~~-~~~~~~~~~~~~~~~~~~ 222 (478) T protein:vir:10 156 VPAEQAVPIWTNKERDELQAFIR-VYELDGA-----ERVEYWTKDDVTFY------ELKEG-QLIPDFYRSEDHIQPHYY 222 (478) T ss_pred EcccceEEEEcCCCCCceEEEEE-EEeeeCc-----eEEEEEeCCcEEEE------EecCC-eeecccccccccccccee Confidence 99999999987654433444444 3443332 12344432211111 11110 00000000 00000011 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~ 304 (502) .....++++++|+++|+++ +.|+|+|+++++|||+||.++|+++++++....++++..+ ...... T Consensus 223 ~~~~~~~~g~vPvv~~~n~---------~~g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g----~~~~~~-- 287 (478) T protein:vir:10 223 QGNKLMSWGRVPFIPFKNN---------PQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKG----YEGEDM-- 287 (478) T ss_pred cccccccCCcceEEEeccC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeec----CCcccc-- Confidence 1123478899999999763 4689999999999999999999999999988888877322 111111 Q ss_pred cCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 305 VTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) ..+..+...++.+... ++.+..++.++.+++.+++...++.+.+.|...++.+.-+++ ..+|+.||.+++++++ T Consensus 288 ----~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Ai~~~~~ 361 (478) T protein:vir:10 288 ----KDFMHNLKYYKAISVA-GESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQD-KFGNSPSGIALKFMYS 361 (478) T ss_pred ----cchhhhhhhCceeEec-CCCCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcCcCcc-ccccchHHHHHHHHHH Confidence 1122222223333332 223345677778888888888888888877777665432222 1235679999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) .+.++|+.+++.|+.+|++++++|+.+.. ......+++|+|++++|.|..+.+++++++ +|++|.+|+++. T Consensus 362 ~l~~k~~~~~~~~~~~l~~~~~li~~~~~-------~~~d~~~i~i~f~~~~p~~~~e~~~~~~~~--~g~iS~et~i~~ 432 (478) T protein:vir:10 362 NLDLKANKLKNKTLTALQELLQYIIDFYR-------LDVRVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILGN 432 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhC-------CCcccccceEEeCCCCCCCHHHHHHHHHHH--hCCCChHHHHHh Confidence 99999999999999999999999987642 234556799999999999999999998876 699999999987 Q ss_pred cCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 465 TLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 465 ~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++++| +++|++||++|+.......++..+++-..+ T Consensus 433 ~~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~ 468 (478) T protein:vir:10 433 HSWVQD--PVAEMERIEQENIELNQQLPDIEEGLNDEQ 468 (478) T ss_pred CCCCCC--HHHHHHHHHHHHHHHHHhccccCCCCcccc Confidence 766654 789999999998776555444333333222 No 34 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=100.00 E-value=1.1e-62 Score=360.30 Aligned_cols=450 Identities=14% Similarity=0.107 Sum_probs=306.5 Q ss_pred CChh------HHHHHHHHHHhh--cccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccc Q lcl|NC_012753. 1 MGII------QTIKNFIKRSNY--VITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQV 66 (502) Q Consensus 1 m~~~------~~ik~~i~~~~~--~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~ 66 (502) |-=| ..++++++..-. ......|.+.+ .....+..++..+++||.|+|+++..+.. ..+.. T Consensus 1 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~i~~~i-----~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~ 75 (478) T protein:vir:10 1 MISINWPWDKPYHEQVVEQIKPKYETQEEMILRLV-----REHKENIDNITMGERYYNHHPDILDAPPKRDVNGDYDETK 75 (478) T ss_pred CccccCCCCchhHHHHHHHHhhccCCcHHHHHHHH-----HHHHHHHHHHHHHHHHhcCCCchhcccccccccccccccc Confidence 3322 234555443211 11122222222 22235678999999999999987754321 12334 Q ss_pred cccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEE Q lcl|NC_012753. 67 KRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVS 145 (502) Q Consensus 67 ~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~ 145 (502) .++|+++|||++||++.|+||||+|+++++++++.++.|+++++ |+|...+.++++.|+++|.+|+++|+|. |.+++. T Consensus 76 ~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~d~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~~~~d~~g~~~~~ 154 (478) T protein:vir:10 76 PDWRMYTNYHQNLVDQKVAYAVANPVTFGVDNDKALKQIQHTLN-HKWDDKLVDILTAASNKGIEWVQPYVDEEGEFKTF 154 (478) T ss_pred ccceeccchHHHHHHHHHhhhccCCeeeecCChHHHHHHHHHHh-cCHHHHHHHHHHHHHhcCeEEEEEEecCCCeeEEE Confidence 56789999999999999999999999999999999999999996 7899999999999999999999999985 689999 Q ss_pred EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCcc-ccCce-eeccccccCC Q lcl|NC_012753. 146 FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKT-IIGQR-VPLSTLYEDL 223 (502) Q Consensus 146 ~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~-~lG~~-v~l~~~~~~l 223 (502) +++|.+++|+|.++......++++. +..++. . ++|.|+.+...++.. ...... ..... .....++ T Consensus 155 ~~~p~~~~~i~d~~~~~~~~~~v~~-~~~~~~--~---~~~~y~~~~i~~~~~----~~~~~~~~~~~~~~~~~~~~--- 221 (478) T protein:vir:10 155 RVPAEQAVPIWTNKERDELQAFIRV-YELDGA--E---RVEYWTKDDVTYYEL----KEGQLIPDFYRSDDHIQPHY--- 221 (478) T ss_pred EEcccceEEEEcCCCCCceEEEEEE-EEecCc--e---EEEEEeCCeEEEEEE----cCCeeeccccccccccccce--- Confidence 9999999999876544444555543 343332 1 234433211112111 100000 00000 0000000 Q ss_pred CcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCc Q lcl|NC_012753. 224 EETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 224 ~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~ 303 (502) ......++++++|+++|+++ ++|+|+|+++++|||+||.++|+++++++.....+++..++ ...... T Consensus 222 ~~~~~~~~~~~vPvv~~~n~---------~~g~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~----~~~~~~ 288 (478) T protein:vir:10 222 YQGNKLMSWGRVPFIPFKNN---------PQEVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGY----EGEDMK 288 (478) T ss_pred ecccccccCCccceEEeccC---------CCCCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecC----Cccccc Confidence 11123477889999999753 67999999999999999999999999999888888873222 111111 Q ss_pred ccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHH Q lcl|NC_012753. 304 KVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQ 383 (502) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~ 383 (502) .+..+...+..+... ++.+..++.++.+++.+++...++.+.+.|...++.+..+++ ..+|+.||.|+++++ T Consensus 289 ------~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Al~~~~ 360 (478) T protein:vir:10 289 ------DFMHNLKYYKAISVA-GESGSGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQD-KFGNSPSGIALKFMY 360 (478) T ss_pred ------hhhhhhhhcceEEec-CCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcc-ccccccHHHHHHHHH Confidence 111111222222222 222334666777888888877777777777666655432222 123567999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) +.+.++++.+++.|+.+|++++++|+.+. +.......++|+|++++|.|..+.+++++++ +|+||.||++. T Consensus 361 ~~l~~k~~~~~~~~~~~l~~~~~li~~~~-------g~~~~~~~i~i~f~~~~p~d~~e~a~~~~kl--~g~iS~et~~~ 431 (478) T protein:vir:10 361 SNLDLKANKLKNKTLTALQELLQYIIDFY-------RLDVKVQDIEITFNFNVMVNELENSQIAMNS--TGLLSKETILS 431 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------CCCcccccceEEecCCCCCCHHHHHHHHHHH--hCCCChHHHHH Confidence 99999999999999999999999988763 2234556799999999999999999999887 79999999988 Q ss_pred hcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 464 KTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .++.++| +++|++||++|+.+......+... ++.|+ T Consensus 432 ~l~~v~D--~~~E~~ri~~E~~~~~~~~~~~~~-~~~~~ 467 (478) T protein:vir:10 432 NHAWVED--PVAEMERIEQENIELNQQLPDIEE-GLNGE 467 (478) T ss_pred hCCCCCC--HHHHHHHHHHHHHHHHhhcccccc-ccCCC Confidence 7755554 778999999998776655544433 44444 No 35 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=100.00 E-value=4.4e-62 Score=356.98 Aligned_cols=449 Identities=8% Similarity=-0.002 Sum_probs=308.7 Q ss_pred CChhH--HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQ--TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~--~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~ 78 (502) |.|.- .+-+-++. +...-+.+++++ ...+..+|..+++||.|+|+++.. ........++|+++|||++ T Consensus 1 ~~~~~~~~~~~~~~~----~~~~~i~~~i~~-----~~~~~~~~~~l~~Yy~g~~~i~~~-~~~~~~~~~~ki~~n~~~~ 70 (499) T protein:vir:10 1 MAVVIDKDLLDDVNE----PNIEAINYAIRE-----LQNRKKRLDKLSDYYNGKQEIEKH-EFDNATVEAANVMVNHAKY 70 (499) T ss_pred CccchhhhHHhhhhc----CCHHHHHHHHHH-----HHHHHHHHHHHHHHhccccchhcC-CcCcCCCCcceeecchHHH Confidence 43321 00000000 000112222221 134567899999999999987653 3345566788999999999 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC------------------ Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD------------------ 140 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~------------------ 140 (502) ||++.|+||||+|++++++++..++.|++++++|+|+..+.++++.++++|.+|.++|++++ T Consensus 71 Iv~~~~~~l~g~p~~~~~~~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~g~~~~~~~~~~~~~~~~~ 150 (499) T protein:vir:10 71 ITDMNVGFMTGNPVKYVAEKGKNIDDILEVFNQIDIHKHDIELEKDLSVFGYGYELLYLKKTDPISVRDELGNEKLTPNT 150 (499) T ss_pred HHHHHhhhhcccCceeecCChhHHHHHHHHHhhcCHhHHHHHHHHHHHhcCceEEEEEeccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999754 Q ss_pred ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccc Q lcl|NC_012753. 141 QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLY 220 (502) Q Consensus 141 ~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~ 220 (502) .+++..++|.++||++.+..+....++++.++..+.+....++.+|.|+.+ +|.+ |...+.......-+ T Consensus 151 ~~~~~~v~p~~~~~v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~----~i~~--~~~~~~~~~~~~~~----- 219 (499) T protein:vir:10 151 ELKIEVIDPRATVVVCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQ----RIVE--YRTKTTMEVSANDP----- 219 (499) T ss_pred ceEEEEEcccceEEEecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCC----eEEE--EEecCCccccCcce----- Confidence 367899999999999888766555555554444444444455667776532 2222 22212111110000 Q ss_pred cCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCC Q lcl|NC_012753. 221 EDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDT 300 (502) Q Consensus 221 ~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~ 300 (502) ......++++.+|+++|+++ +.|+|+|+++++|||+||.++|++++.++.....+++ +...... T Consensus 220 ---~~~~~~~~~g~vPvv~~~n~---------~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv----~~G~~~~ 283 (499) T protein:vir:10 220 ---IVYDGENLFGAVPIIEFRNN---------EERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLV----TFGFGLG 283 (499) T ss_pred ---ecccccCCCCccceEEecCC---------CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceee----eecCccc Confidence 11123467899999999763 4689999999999999999999999999988888877 3321110 Q ss_pred CCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHH Q lcl|NC_012753. 301 NGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVV 380 (502) Q Consensus 301 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~ 380 (502) . . ..+......+. +...+++.+..++.+++++..+.+...++.+.+.|...++.+.-+++. .+|+.||.|++ T Consensus 284 ---~-~--~~~~~~~~~~~-~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~gn~Sg~Al~ 355 (499) T protein:vir:10 284 ---D-D--KDDIQRLKRGA-IEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNMNDEK-FMGNVSGEAMK 355 (499) T ss_pred ---c-c--cchhhhhhhcc-eeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccCCchh-hcccchHHHHH Confidence 0 0 00001111111 111122223346677778888888888888888777766655433221 23566999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH Q lcl|NC_012753. 381 SEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM 460 (502) Q Consensus 381 ~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et 460 (502) ++++.+.+++..+++.|+.+|++++++|+.+++.. +....+..++|.|++++|.|..+++++++++ +|++|.|| T Consensus 356 ~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~----~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl--~g~iS~et 429 (499) T protein:vir:10 356 FKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIK----GANDDASGCKISLVANIPSNLSDVVNNVKNA--DGIIPRKY 429 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCChHH Confidence 99999999999999999999999999999886542 2344566899999999999999999999998 69999999 Q ss_pred HHHhcCCCCHHHHHHHHHHHHHhhhcccC---CCCCccccCCCCC Q lcl|NC_012753. 461 AIEKTLNVTKEQAQEIYQKINDETMVSTD---SFRTSEEVDIYGE 502 (502) Q Consensus 461 ~l~~~~~~~deea~~el~ri~~E~~~~~~---~~~~~~~~~~~g~ 502 (502) ++..++++++ +++|++||++|+..... ......+.+-+++ T Consensus 430 ~~~~l~~v~d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 472 (499) T protein:vir:10 430 TYSWLPDVDN--PQDVIDEMNQQDAETIKKNQEALRGQDPDRLEL 472 (499) T ss_pred HHHhCCCCCC--HHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCC Confidence 9988766665 67889999988654221 1111111111111 No 36 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=100.00 E-value=2.6e-62 Score=358.19 Aligned_cols=447 Identities=14% Similarity=0.124 Sum_probs=307.4 Q ss_pred CChhHHHHHHHHHHhhccccc-chhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccccccceec Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQ-SLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVKRDFNHL 73 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~-~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~~~ 73 (502) |+.-..+++=+ -+..... .+.+.+. ..+....+++.+|.++.+||.|+|+++..... ..+...++++++ T Consensus 5 ~~~~~~~~~~~---~~~~~~~~~~~~~i~-~~i~~~~~~~~~~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ri~~ 80 (472) T protein:vir:93 5 QPTQTEIFDAI---VRTNNKPETLEEMIV-RYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPVDATGAVDPLKPDDRMIT 80 (472) T ss_pred CCcchhhhhce---eeecCchhhHHHHHH-HHHHHHHHHHHHHHHHHHHhccccccccccchhhcccccccccccccccc Confidence 44433222221 1111111 1111111 11222346778999999999999988765422 122345678999 Q ss_pred chHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeE Q lcl|NC_012753. 74 PIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVF 152 (502) Q Consensus 74 n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~ 152 (502) |||++||++.|+||||+|++++++++...+.|+++++ |+|...+.++++.++++|.+|+.+|+|+ |.+++.+++|.++ T Consensus 81 n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~~~~~v~~d~d~~~~i~~~~p~~~ 159 (472) T protein:vir:93 81 NFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPYLDEEGEFKLFRVPAEQG 159 (472) T ss_pred chHHHHHHHHhhhhcccCeeeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEEECCCCceEEEEEcccce Confidence 9999999999999999999999999999999999986 6899999999999999999999999985 5799999999999 Q ss_pred EEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc-cccCCCcceeecC Q lcl|NC_012753. 153 FPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST-LYEDLEETVTLNG 231 (502) Q Consensus 153 ~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~-~~~~l~~~~~~~~ 231 (502) +|+|.++......+++ +++..++.. .+|.|.. ..+.+..+... ..++... ...........++ T Consensus 160 ~~i~d~~~~~~~~~~i-r~~~~~~~~-----~~~~~~~----~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~ 223 (472) T protein:vir:93 160 IPIWTDKEHEELEAFI-RMYKLENET-----KVEYWDK----VTVNYYVYENG------SLIPDYSNNLENSKTHFSTGS 223 (472) T ss_pred EEEEcCCCCCceEEEE-EEEEeecce-----eEEEEec----CeEEEEEEecC------eeeecccccccccccccccCC Confidence 9998665444344444 344443322 1233321 11211111110 0000000 0112223345678 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccc Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREF 311 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~ 311 (502) ++.+|+++|+++ ++|+|+|+++++|+|+||.++|+++++++....++++ +......... .+ T Consensus 224 ~~~vPvv~~~nn---------~~g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~----~~g~~~~~~~------~~ 284 (472) T protein:vir:93 224 WGKIPFIPFKNN---------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV----LTNYDDQELP------EF 284 (472) T ss_pred CCCcceEEecCC---------CCCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeE----eecCCcccch------hh Confidence 999999999763 4699999999999999999999999999988888777 3322211111 11 Q ss_pred cccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHH Q lcl|NC_012753. 312 ETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRN 391 (502) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~ 391 (502) ......+..+..+++ ..++.++.++.++++...++.+.+.|...++.+..+++. .+++.||.+++++++.|.++|+ T Consensus 285 ~~~~~~~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Al~~~~~~l~~ka~ 360 (472) T protein:vir:93 285 KRLLRYYGAIKVSDN---GGVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDK-FGSAPSGVALEFLYTNLNLKAD 360 (472) T ss_pred HHHHhhccccccCCC---CcceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCccc-cccCchHHHHHHHHHHHHHHHH Confidence 122223333333332 235666677778888888887777777666655433322 2356689999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH Q lcl|NC_012753. 392 SIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKE 471 (502) Q Consensus 392 ~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~de 471 (502) .+++.|+.+|++++++|+.+.. ....+..++|+|++++|.|..+++++++++ +|++|.+|++..+++++| T Consensus 361 ~~~~~~~~~l~~~~~li~~~~~-------~~~~~~~i~v~f~~~~p~~~~~~~~~~~k~--~giis~et~l~~l~~~~d- 430 (472) T protein:vir:93 361 KLARKAKVAIQELLWFVFEHFD-------IKGEHKDVDISFNYNKVANTELQVQTAQQS--MGIVSHETVLENHPFVED- 430 (472) T ss_pred HHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEeCCCCCCCHHHHHHHHHHH--hccCchHHHHHhCCCCCC- Confidence 9999999999999999887642 223456799999999999999999999987 599999999988876665 Q ss_pred HHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 472 QAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 472 ea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++|++||++|+...+....+..+++.-++ T Consensus 431 -~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~ 460 (472) T protein:vir:93 431 -LQAELERIEQEQMEYNKQLPNLDDGGADGA 460 (472) T ss_pred -HHHHHHHHHHHHHHHHHhccCcCcccCCCC Confidence 788999999998665555444444444443 No 37 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=100.00 E-value=3.4e-62 Score=357.57 Aligned_cols=445 Identities=12% Similarity=0.109 Sum_probs=307.2 Q ss_pred CChhH-HHHHHHHHHhhcc--cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccccccce Q lcl|NC_012753. 1 MGIIQ-TIKNFIKRSNYVI--TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVKRDFN 71 (502) Q Consensus 1 m~~~~-~ik~~i~~~~~~~--~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~ 71 (502) ||+=. ...++++.+-... ....|.+.+ ....+++.++...++||.|+|+++..... ..+...++++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i-----~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:96 7 MPWDKPYGEEVVEQMKPKVETQEEMIIRLI-----NNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRI 81 (474) T ss_pred CCCCCCCCcchhhhccccccchHHHHHHHH-----HHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccccccc Confidence 11100 0111111100000 000011111 11235678899999999999987654321 1233456789 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++|||++||++.|+||||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|.++|+|+ |.+++.+++|+ T Consensus 82 ~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~ 160 (474) T protein:vir:96 82 TTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAE 160 (474) T ss_pred ccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEccc Confidence 999999999999999999999999999999999999986 7899999999999999999999999985 68999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-ccCCCcceee Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-YEDLEETVTL 229 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~~~l~~~~~~ 229 (502) ++||+|.++......+|++. +..++. +++|.|+.+...++ .+.+. ..+..... .......... T Consensus 161 ~~~~v~d~~~~~~~~a~ir~-~~~~~~-----~~~~vy~~~~i~~~----~~~~~------~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:96 161 QAIPIWTDKEREQLNAFIRI-FTFNGE-----TKVEYWTAETVTYY----VYENG------GLIPDFYYGDEHIQTHFST 224 (474) T ss_pred ceEEEEcCCCCCceEEEEEE-EeecCe-----eEEEEEeCCeEEEE----EEcCC------ceeeccccccccccCcccc Confidence 99999876555545555544 333321 23455542222221 11111 00000000 0111223345 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR 309 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 309 (502) ++++++|+++|+++ +.|.|+|+++++|||+||.++|++++.++.....++| +........ . T Consensus 225 ~~~~~vPvv~~~nn---------~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv----~~g~~~~~~------~ 285 (474) T protein:vir:96 225 GSWERVPFIAFKNN---------PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYI----LRGYEGEDL------S 285 (474) T ss_pred cCCCccceEEecCC---------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh----hcCCCcccc------c Confidence 78899999999764 4589999999999999999999999999988888887 332211111 1 Q ss_pred cccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHH Q lcl|NC_012753. 310 EFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQM 389 (502) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~ 389 (502) .+..+...+..+...++ ..++.++.++..+++.+.++.+.+.|...++.+.-++. ..+++.||.|++++++.+.++ T Consensus 286 ~~~~~~~~~~~i~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~~l~~k 361 (474) T protein:vir:96 286 EFMEGLKYYKAINVSSD---GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-KFGSATSGIALKFLYTNLNLK 361 (474) T ss_pred chhhhhhccceeeccCC---CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-ccccccHHHHHHHHHHHHHHH Confidence 12223333333433332 34666777888888888888888888777766533222 223567999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC Q lcl|NC_012753. 390 RNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVT 469 (502) Q Consensus 390 ~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~ 469 (502) |+.+++.|+.+|++++++|+.+.. .......++++|++++|.|..+.++++++ +|+||.||++..++.++ T Consensus 362 ~~~~~~~~~~~l~~~~~~i~~~~g-------~~~d~~~i~i~f~~~~p~~~~e~a~~~~~---~giiS~et~~~~lp~v~ 431 (474) T protein:vir:96 362 ANKLKNKANVALQELMQFILDFNK-------IKLDAKEIEITFNFNVMVNDLEQSQIGAQ---SQYLSKETLVRHHPWVD 431 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEecCCCccCHHHHHHHHHH---cCCCChHHHHHhCCCCC Confidence 999999999999999999987642 23456689999999999999998887654 69999999998875555 Q ss_pred HHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 470 KEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 470 deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) | +++|++||++|+.+.........+++..++ T Consensus 432 D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~ 462 (474) T protein:vir:96 432 D--PKAELERLDEEQLELNKQLPNLDDGGADGA 462 (474) T ss_pred C--HHHHHHHHHHHHHHHHhhccccccccCCCC Confidence 4 789999999998777666666666666655 No 38 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=100.00 E-value=3.4e-62 Score=357.57 Aligned_cols=445 Identities=12% Similarity=0.109 Sum_probs=307.2 Q ss_pred CChhH-HHHHHHHHHhhcc--cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccccccce Q lcl|NC_012753. 1 MGIIQ-TIKNFIKRSNYVI--TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVKRDFN 71 (502) Q Consensus 1 m~~~~-~ik~~i~~~~~~~--~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~ 71 (502) ||+=. ...++++.+-... ....|.+.+ ....+++.++...++||.|+|+++..... ..+...++++ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i-----~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki 81 (474) T protein:vir:95 7 MPWDKPYGEEVVEQMKPKVETQEEMIIRLI-----NNHKQKLKDINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRI 81 (474) T ss_pred CCCCCCCCcchhhhccccccchHHHHHHHH-----HHHHHHHHHHHHHHHHhcccCccccccchhhhccccccccccccc Confidence 11100 0111111100000 000011111 11235678899999999999987654321 1233456789 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCC Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQAT 150 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~ 150 (502) ++|||++||++.|+||||+|++++++++..++.|+.+++ |+|...+.++++.++++|.+|.++|+|+ |.+++.+++|+ T Consensus 82 ~~n~~k~Iv~~~~~yl~g~p~~~~~~~~~~~~~l~~~~~-n~~~~~~~~l~~~~~~~G~~~~~~~~d~~~~~~i~~~~p~ 160 (474) T protein:vir:95 82 TTNFHQNLVDQKVSYVAGKPVTYAHDDDKVLDVIHQVLD-TRWDNKLIDILTAASNKGIDWLQVYINEDGELKLFRVPAE 160 (474) T ss_pred ccchHHHHHHhhhhhhcccCceeccCChHHHHHHHHHHh-ccHHHHHHHHHHHHhhCCeEEEEeeeCCCCceEEEEEccc Confidence 999999999999999999999999999999999999986 7899999999999999999999999985 68999999999 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-ccCCCcceee Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-YEDLEETVTL 229 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~~~l~~~~~~ 229 (502) ++||+|.++......+|++. +..++. +++|.|+.+...++ .+.+. ..+..... .......... T Consensus 161 ~~~~v~d~~~~~~~~a~ir~-~~~~~~-----~~~~vy~~~~i~~~----~~~~~------~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:95 161 QAIPIWTDKEREQLNAFIRI-FTFNGE-----TKVEYWTAETVTYY----VYENG------GLIPDFYYGDEHIQTHFST 224 (474) T ss_pred ceEEEEcCCCCCceEEEEEE-EeecCe-----eEEEEEeCCeEEEE----EEcCC------ceeeccccccccccCcccc Confidence 99999876555545555544 333321 23455542222221 11111 00000000 0111223345 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR 309 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 309 (502) ++++++|+++|+++ +.|.|+|+++++|||+||.++|++++.++.....++| +........ . T Consensus 225 ~~~~~vPvv~~~nn---------~~~~~d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv----~~g~~~~~~------~ 285 (474) T protein:vir:95 225 GSWERVPFIAFKNN---------PEEVSDIWMYKSFVDAIDKRLSDVQNMFDESVELIYI----LRGYEGEDL------S 285 (474) T ss_pred cCCCccceEEecCC---------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh----hcCCCcccc------c Confidence 78899999999764 4589999999999999999999999999988888887 332211111 1 Q ss_pred cccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHH Q lcl|NC_012753. 310 EFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQM 389 (502) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~ 389 (502) .+..+...+..+...++ ..++.++.++..+++.+.++.+.+.|...++.+.-++. ..+++.||.|++++++.+.++ T Consensus 286 ~~~~~~~~~~~i~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~~l~~k 361 (474) T protein:vir:95 286 EFMEGLKYYKAINVSSD---GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTD-KFGSATSGIALKFLYTNLNLK 361 (474) T ss_pred chhhhhhccceeeccCC---CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCcccc-ccccccHHHHHHHHHHHHHHH Confidence 12223333333433332 34666777888888888888888888777766533222 223567999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC Q lcl|NC_012753. 390 RNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVT 469 (502) Q Consensus 390 ~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~ 469 (502) |+.+++.|+.+|++++++|+.+.. .......++++|++++|.|..+.++++++ +|+||.||++..++.++ T Consensus 362 ~~~~~~~~~~~l~~~~~~i~~~~g-------~~~d~~~i~i~f~~~~p~~~~e~a~~~~~---~giiS~et~~~~lp~v~ 431 (474) T protein:vir:95 362 ANKLKNKANVALQELMQFILDFNK-------IKLDAKEIEITFNFNVMVNDLEQSQIGAQ---SQYLSKETLVRHHPWVD 431 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhC-------CCcccceeeEEecCCCccCHHHHHHHHHH---cCCCChHHHHHhCCCCC Confidence 999999999999999999987642 23456689999999999999998887654 69999999998875555 Q ss_pred HHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 470 KEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 470 deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) | +++|++||++|+.+.........+++..++ T Consensus 432 D--~~~E~eri~~E~~~~~~~~~~~~~~~~~~~ 462 (474) T protein:vir:95 432 D--PKAELERLDEEQLELNKQLPNLDDGGADGA 462 (474) T ss_pred C--HHHHHHHHHHHHHHHHhhccccccccCCCC Confidence 4 789999999998777666666666666655 No 39 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=100.00 E-value=3.6e-62 Score=357.42 Aligned_cols=448 Identities=15% Similarity=0.167 Sum_probs=305.2 Q ss_pred CChhHHHHHHHHHHhhcccc--cchhhhhc-----ccc-----------ccCCHHHHHHHHHHHHHhcCCCCccccccC- Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITN--QSLNSITD-----HPK-----------IAISPEEYNRIMDNLRYFAGDFDSVTYRDS- 61 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~--~~l~~i~~-----~~~-----------~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~- 61 (502) .-+++.+..-+-++|.-+.+ ++...+.. ..+ +.-..+++.++....+||.|+|+++..... T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~~~~~~ 83 (492) T protein:vir:97 4 IQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEPKPV 83 (492) T ss_pred HHHHHHHHHHHhcCCceeeccchhhhhHhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccc Confidence 66666544433333332211 11111100 000 111234567899999999999988755422 Q ss_pred -----CCccccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEE Q lcl|NC_012753. 62 -----NGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPY 136 (502) Q Consensus 62 -----~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~ 136 (502) ..+...++|+++|||++||++.++||+|+|++++++++...+.|+++++ |+|...+.++++.++++|.+|+.+| T Consensus 84 ~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~l~~~~~-n~~~~~~~~~~~~~~~~G~a~~~v~ 162 (492) T protein:vir:97 84 DATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFKHTDDEVVKRIDEVLG-NRFDDKLHSVLTGASNKGIEWLHPY 162 (492) T ss_pred cccccccccccccccccchHHHHHHHHhhhhcccCceeccCchHHHHHHHHHHh-ccHHHHHHHHHHHHhhcCeEEEEEE Confidence 1234456789999999999999999999999999999999999999986 7899999999999999999999999 Q ss_pred EeC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceee Q lcl|NC_012753. 137 IDG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVP 215 (502) Q Consensus 137 ~d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~ 215 (502) +|+ |.+++.+++|.+++|+|.++......++++ ++..++.. .+|.|+. ..+.+..+.+. . .++ T Consensus 163 ~d~dg~~~~~~~~p~~~~~i~d~~~~~~~~~~vr-~~~~~~~~-----~~~~y~~----~~v~~~~~~~~--~----~~~ 226 (492) T protein:vir:97 163 LDEEGEFKLFRVPAEQGIPIWTDKEHEELEAFIR-MYKLENET-----KVEYWDK----VTVNYYVYENG--S----LIP 226 (492) T ss_pred ecCCCceEEEEEcccceEEEEcCCCCCceEEEEE-EEeeccce-----eEEEEec----CeEEEEEEecC--e----eee Confidence 985 689999999999999987654444455544 44433321 2344432 22222122211 1 010 Q ss_pred cc-ccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHh Q lcl|NC_012753. 216 LS-TLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMI 294 (502) Q Consensus 216 l~-~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l 294 (502) .. ............++++.+|+++|+++ +.|+|+|+.+++|||+||.++|++++.++....++++ + T Consensus 227 ~~~~~~~~~~~~~~~~~~g~vPvv~~~nn---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~----~ 293 (492) T protein:vir:97 227 DYSNNLENSKTHFSTGSWGKIPFIPFKNN---------DLEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYV----L 293 (492) T ss_pred cccccccccccccccCCCCCcceEEecCC---------CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceee----e Confidence 00 00112223345578899999999764 4689999999999999999999999999988888887 3 Q ss_pred ccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCC---Chhhcccccc Q lcl|NC_012753. 295 KTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGV---STGMFSFDGK 371 (502) Q Consensus 295 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~---s~~~~~~~~~ 371 (502) ......... .+......+..+..++++ .++.++.++.++.+...++.+.+.|...++. +.+.|+ T Consensus 294 ~g~~~~~~~------~~~~~~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~---- 360 (492) T protein:vir:97 294 KNYDDQELP------EFKRLLRYYGAIKVSDNG---GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFG---- 360 (492) T ss_pred ecCCcccch------hHHHHHhhccceecCCCC---cceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCccccc---- Confidence 322221111 111122233333333322 3555666777777777777666666555544 444343 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_012753. 372 SMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMV 451 (502) Q Consensus 372 ~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~ 451 (502) ++.||.+++++++.+..+|+.+++.|+.+|++++++|+.+... ...+.+++|+|++++|.|..+++++++++ T Consensus 361 ~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~-------~~~~~~i~v~f~~~~p~~~~e~a~~~~kl- 432 (492) T protein:vir:97 361 SAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDI-------KGEHKDVDISFNYNKVANTELQVQTAQQS- 432 (492) T ss_pred cCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-------CcccceeeEEecCCCCCCHHHHHHHHHHH- Confidence 5668999999999999999999999999999999998876532 23456799999999999999999999997 Q ss_pred hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 452 AAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 452 ~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|++|.||+++.+++++| +++|++||++|+.......++..+++.-+. T Consensus 433 -~G~iS~et~l~~l~~v~d--~~~Eleri~~E~~~~~~~~~~~~~~~~~~~ 480 (492) T protein:vir:97 433 -MGIVSHETVLENHPFVED--LQAELERIEQEQTEYNKQLPNLDDGGADSA 480 (492) T ss_pred -hccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHhhhccccCCCCCC Confidence 599999999988766665 788999999987655443333333332222 No 40 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=100.00 E-value=4.9e-61 Score=351.23 Aligned_cols=456 Identities=9% Similarity=0.006 Sum_probs=316.0 Q ss_pred CChhHHHHHHHHHHhhccc-------ccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc---CCCccccccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVIT-------NQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD---SNGSQVKRDF 70 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~-------~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~---~~~~~~~~~~ 70 (502) |+.+..+-..+...-+... ...+.+.+.+ ...+++.+++.+.+||.|+++.+..+. .......++| T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~----~~~~~~~~~~~~~~yY~g~~~~i~~~~~~~~~~~~~~~~k 81 (481) T protein:vir:10 6 INNINTKFSPLANDDFVVSDLAELLKEENLRNFISR----HQTEQVPRLEMLESYYLNRNTDILAGERRLQKYGDKADHR 81 (481) T ss_pred eehhchhcccccCceeeeecchhhcCHHHHHHHHHH----HHHHHHHHHHHHHHHhcCCCcccccCccccccccccccce Confidence 6666666555443211111 0111111111 124677899999999999987654322 1223345678 Q ss_pred eecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcC Q lcl|NC_012753. 71 NHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQA 149 (502) Q Consensus 71 ~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~ 149 (502) +++|||+.||++.|+|+||+|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+|+ |.+++.+++| T Consensus 82 i~~n~~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~~~~d~dg~~~i~~~~p 161 (481) T protein:vir:10 82 AVHNYAKYVSRFIVGYLTGNPITITHQDNQTNDKIIELNDLNDADEVNSDLALNLSIYGRAYEIVYRDFEDRDTFKVLDP 161 (481) T ss_pred eecchHHHHHHHHHhhhccCCceEecCChhHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCeEEEEEEcc Confidence 999999999999999999999999999999999999999999999999999999999999999999975 6799999999 Q ss_pred CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceee Q lcl|NC_012753. 150 TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTL 229 (502) Q Consensus 150 ~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~ 229 (502) .+++|+|.+.......++++ ++......+..++++|.|+. ..|.+ |+..+. + |.. ..... T Consensus 162 ~~~~~v~d~~~~~~~~~~i~-~~~~~~~~~~~~~~~~~y~~----~~i~~--~~~~~~---~--------~~~--~~~~~ 221 (481) T protein:vir:10 162 KSTFVVYDQTLDKKVVAGVR-YFEKQDKDKVPVQHVEVYTT----DKIYY--IEIKGG---T--------YHR--VEEVE 221 (481) T ss_pred cceEEEEcCCCCCceEEEEE-EEEEeeCCCceEEEEEEEec----CeEEE--EEecCC---c--------eee--ccccc Confidence 99999977655444444443 33433333444556777652 12221 221110 0 100 01134 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR 309 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 309 (502) ++++++|+++|+++ ++|+|+|+.+++|||+||.++|++++.++....++++-..+...+ +..+..+.... T Consensus 222 ~~~g~vPvv~~~n~---------~~g~~~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~~~-~~~~~~~~~~~ 291 (481) T protein:vir:10 222 HYYNDVPIIEYLND---------QFKQGDFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVDLD-SEDAKAFRDAN 291 (481) T ss_pred ccCCceeEEEeecC---------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcCCC-ccchhhhhhcc Confidence 67888999998753 468999999999999999999999999998788777633322221 11222222111 Q ss_pred cccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHH Q lcl|NC_012753. 310 EFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQM 389 (502) Q Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~ 389 (502) .+ ...........+.++.++.++.+++.+++...++.+.+.|...++.+..+++. .+++.||.|++++++.|.++ T Consensus 292 ~~----~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Al~~~~~~l~~k 366 (481) T protein:vir:10 292 MI----HLEPGTNANGSEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQ-FSGVQSGESMKYKLFGLEQV 366 (481) T ss_pred ce----eccccccccCCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc-cccccHHHHHHHHHHHHHHH Confidence 11 11111111222334457777888888888888888888887777776555542 34667999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC Q lcl|NC_012753. 390 RNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVT 469 (502) Q Consensus 390 ~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~ 469 (502) ++.+++.|+.+|++++++++.+++. .++......++++.|++++|.|..+.++.++++ +|++|.+|+++.+++++ T Consensus 367 ~~~~~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl--~g~is~et~~~~l~~i~ 441 (481) T protein:vir:10 367 RAIKERLFKKGLMKRYKLLLNNVNL---TGLKQHNYAELTITFTPNLPKSMMESINAFNAL--SGGVSESTRLSLLDFID 441 (481) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhc---cCCCccccceeeEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCC Confidence 9999999999999999999987654 334444556899999999999999999999988 49999999998876565 Q ss_pred HHHHHHHHHHHHHhhhcccCCCCCccccCC--CCC Q lcl|NC_012753. 470 KEQAQEIYQKINDETMVSTDSFRTSEEVDI--YGE 502 (502) Q Consensus 470 deea~~el~ri~~E~~~~~~~~~~~~~~~~--~g~ 502 (502) | +++|++||++|+.+...........+. -|+ T Consensus 442 d--~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~ 474 (481) T protein:vir:10 442 N--PKEELEKMQEEEAQREKQADKRGYGEAFENHL 474 (481) T ss_pred C--HHHHHHHHHHHHHHHHhhhhhccCCccCCCCC Confidence 4 788999999887644322211111111 111 No 41 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=100.00 E-value=2.2e-61 Score=353.14 Aligned_cols=424 Identities=11% Similarity=0.027 Sum_probs=303.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |. .+.|+++|.+ ...+..|+...++||.|+|+++.... ..+...++|+++|||++|| T Consensus 17 ~~-~~~i~~~i~~---------------------~~~~~~r~~~~~~Yy~g~~~i~~~~~-~~~~~~~~ki~~n~~~~iv 73 (452) T protein:vir:36 17 IT-VEVVTKFMEK---------------------HKLEVARYEYLKNMYLGIMAIDDEPA-KDSWKPDNRLAVNFTKYIV 73 (452) T ss_pred CC-HHHHHHHHHH---------------------HHHHHHHHHHHHHHhccccccccCcc-ccccCccceeecchHHHHH Confidence 21 2333333332 12445788999999999998876543 3444567789999999999 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEcC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQANT 159 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d~ 159 (502) ++.|+||||+|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+|+ |.+++.+++|.+++|+|.+. T Consensus 74 d~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~v~d~~ 153 (452) T protein:vir:36 74 DTFTGYFNGIPVKKSHSDKEILTKLQEFDNLNDMEDEESELAKMACIYGRAFEFLYQDEDTQTNVVYNSPENMFMVYDDT 153 (452) T ss_pred HHHhhhhcccCceeecCChhHHHHHHHHHhhcChhHHHHHHHHHHHhcCeEEEEEEecCCCeeEEEEEcccceEEEEcCC Confidence 99999999999999999999999999999999999999999999999999999999985 68999999999999998776 Q ss_pred CCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 160 QDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) .+....++++ ++.... .. .++|.|+. +..++ |...+. |..+ .....++++++|+++ T Consensus 154 ~~~~~~~~i~-~~~~~~-~~---~~~~vyt~-~~i~~-----~~~~~~---~~~~----------~~~~~~~~g~iPvv~ 209 (452) T protein:vir:36 154 VKQEPLFAVR-YGVDED-KK---LQGEVYTL-LETIK-----ISGEND---EISF----------GEGTYNPYPDLPVVE 209 (452) T ss_pred CCCceEEEEE-EEEecC-ce---EEEEEEec-CeEEE-----EEEcCC---ceEE----------ecceeccCCcccEEE Confidence 5444444443 333222 21 22455431 11111 111110 1110 112346788999999 Q ss_pred ecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhc Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYE 319 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~ 319 (502) |+++ +.|+|+|+++++|+|+||.++|++++.++.....+++ +..... .+. +..+...+. T Consensus 210 ~~n~---------~~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~----~~g~~~-~~~-------~~~~~~~~~ 268 (452) T protein:vir:36 210 FYFN---------EERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLT----FLGAAV-EEE-------DLKNIRSNR 268 (452) T ss_pred ecCC---------CCCCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeE----eecCCc-Cch-------hhhhhhhcc Confidence 8764 3589999999999999999999999999888888777 322111 111 111111122 Q ss_pred ccc--CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 320 QFD--SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV 397 (502) Q Consensus 320 ~~~--~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~ 397 (502) .+. .++.+.+..++.++.++..+.+...++.+.+.|...++.+. +++...|+.||++++++++.|.++|+.+++.| T Consensus 269 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~ 346 (452) T protein:vir:36 269 VINYYADGEGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESFGSSSGVSLAYKLQAMSNLALSFQRKF 346 (452) T ss_pred eEEecCCCCccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCcccccCCcHHHHHHHHHHHHHHHHHHHHHH Confidence 222 22223344567777788888888888888888877776653 45555577899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHH Q lcl|NC_012753. 398 EKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIY 477 (502) Q Consensus 398 ~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el 477 (502) +.+|++++++|+.++... +.......++|.|++++|.|..+.+++++++ +|+||.||++..+++++| +++|+ T Consensus 347 ~~~l~~~~~li~~~~~~~----~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~--~g~iS~et~~~~~~~~~d--~~~E~ 418 (452) T protein:vir:36 347 QSSLNSRYKLFCELSTNV----SNKDSWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPD--VQAEM 418 (452) T ss_pred HHHHHHHHHHHHHHHhcc----CCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--HHHHH Confidence 999999999999887642 2334556799999999999999999999887 599999999887755554 78899 Q ss_pred HHHHHhhhcccC-----CCCCccccCCCCC Q lcl|NC_012753. 478 QKINDETMVSTD-----SFRTSEEVDIYGE 502 (502) Q Consensus 478 ~ri~~E~~~~~~-----~~~~~~~~~~~g~ 502 (502) +||++|+..... ..+..+.-+.-++ T Consensus 419 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~ 448 (452) T protein:vir:36 419 EKIKKEEASTAIFDKDKQPSEKGTDTVVSE 448 (452) T ss_pred HHHHHHHHHHHHHHhhccCCCCcccccCcc Confidence 999998765421 1111111112222 No 42 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=100.00 E-value=4.3e-61 Score=351.53 Aligned_cols=427 Identities=10% Similarity=0.031 Sum_probs=303.6 Q ss_pred CCh-----hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecch Q lcl|NC_012753. 1 MGI-----IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~-----~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~ 75 (502) ||- .+.|+++|++. .....|++++++||.|+|+++.... ..+...++|+++|| T Consensus 11 ~p~d~~~~~~~l~~~i~~~---------------------~~~~~r~~~~~~yy~g~~~i~~~~~-~~~~~~~~ki~~n~ 68 (453) T protein:vir:39 11 FPKDEPITNEVVTKFMEKH---------------------RLEVARYEYLKNMYRGIMAIDAEPT-KDLWKPDNRLTVNF 68 (453) T ss_pred cCCCCCCCHHHHHHHHHHH---------------------HHHHHHHHHHHHHhhccCchhcCCC-ccccCccceeecch Confidence 222 22334443321 2345789999999999998766543 34445668899999 Q ss_pred HHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEE Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFP 154 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~P 154 (502) |++||+++|+||||+|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+|+ |.+++.+++|.+++| T Consensus 69 ~~~ivd~~~~~l~g~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~i~~~~p~~~~~ 148 (453) T protein:vir:39 69 TKYIVDTFTGYFNGIPVKKSHSDKETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQNEETQTNVIYNTPENMFM 148 (453) T ss_pred HHHHHHHHhhhhcccCceeccCChHHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEE Confidence 9999999999999999999999999999999999999999999999999999999999999986 679999999999999 Q ss_pred EEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 155 LQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 155 i~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) +|.+..+....+++ +++..++ ..+++|.|+. .+|.+ |...+. + |. ..+...+++++ T Consensus 149 v~d~~~~~~~~~~i-r~~~~~~----~~~~~~~yt~----~~i~~--~~~~~~---~--------~~--~~~~~~~~~g~ 204 (453) T protein:vir:39 149 VYDDTIKQEPLFAV-RYGYDDD----YKLYGEVYTK----ETTYA--LNGTMG---F--------YN--MTEQAPNPFDD 204 (453) T ss_pred EecCCCCCeEEEEE-EEEEeCC----eEEEEEEEeC----CeEEE--EEecCC---c--------ee--eecccccCCCc Confidence 98765544433433 4443222 2344666642 22221 222111 1 00 01123478899 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) +|+++|+++ +.|+|+|+.+++|||+||+++|++++.++.....+++ +.... ..+... ..+.. T Consensus 205 vPvv~~~n~---------~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~----~~g~~-~~~~~~---~~~~~- 266 (453) T protein:vir:39 205 LPVVEFYFN---------EERMSIFESVISLVNAFNKAISEKANDVDYFSDQYLT----FLGAA-VEEEDL---KNIRS- 266 (453) T ss_pred eeEEEecCC---------CCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceee----eecCC-CCchhh---hhhhh- Confidence 999998753 4689999999999999999999999999877777766 22111 011100 01111 Q ss_pred chhhccccCC-CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSG-DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 315 ~~~~~~~~~~-~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~ 393 (502) ..+. .+... ..+.+..++.++.++..+.+.+.++.+.+.|...++.+. +++...|+.||.+++++++.|.++|+.+ T Consensus 267 ~~~~-~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~~ka~~~ 343 (453) T protein:vir:39 267 NRVI-NYYGESSEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDESFGSSSGVSLAYKLQAMSNLALSF 343 (453) T ss_pred ccee-eecCCCCCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccccccCChHHHHHHHHHHHHHHHHHH Confidence 1111 11111 122334577788888888888888888888877776553 3444446679999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQA 473 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea 473 (502) ++.|+.+|++++++|+.+.... +.......++|.|++++|.|..+.+++++++ +|+||.+|++..+++++| + T Consensus 344 ~~~~~~~l~~~~~li~~~~~~~----~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~l~~l~~v~D--~ 415 (453) T protein:vir:39 344 QRKFQSSLNSRYKLYCELSTNV----SNKEAWKDIEYTFTRNEPKDIKEQAETANIL--MGITSQETALSVISVIPD--V 415 (453) T ss_pred HHHHHHHHHHHHHHHHHHHhcc----CCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCChHHHHHhCCCCCC--H Confidence 9999999999999999876542 2334556799999999999999999999987 699999999987755654 7 Q ss_pred HHHHHHHHHhhhcccCCCCCc--------cccCCCCC Q lcl|NC_012753. 474 QEIYQKINDETMVSTDSFRTS--------EEVDIYGE 502 (502) Q Consensus 474 ~~el~ri~~E~~~~~~~~~~~--------~~~~~~g~ 502 (502) ++|++||++|+.+........ ++.+-.+| T Consensus 416 ~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (453) T protein:vir:39 416 QAEMEKIKKEEASTAIFDKDKQPSEKGTDTVVPETNE 452 (453) T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCCCCCCCCCCcCC Confidence 889999999976543211111 11111222 No 43 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=100.00 E-value=5.8e-61 Score=350.82 Aligned_cols=457 Identities=9% Similarity=0.031 Sum_probs=305.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC---------CCccccccce Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS---------NGSQVKRDFN 71 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~---------~~~~~~~~~~ 71 (502) |.+ ..++.+|++. |..+ ..+++..++...++||.|+|+++.++.. ......++|+ T Consensus 8 ~~~-~~~~~~~~~~-----------i~~~----~~~~~~~~~~~~~~YY~g~h~Il~r~~~~~~~~~~~~~d~~~~nnki 71 (537) T protein:vir:78 8 KPI-DQLGGLLNTE-----------ITTY----MASNHIKWAHIGENYYNQENDIEKSRIFYMNDKGQLREDNYASNVKI 71 (537) T ss_pred ccH-HHHHHHHHHH-----------HHHH----HHHHHHHHHHHHHHHhcccchhhhccccccccccccccccccccccc Confidence 444 4455555432 1111 2346678999999999999998865432 1223456789 Q ss_pred ecchHHHHHHHHhhhhhcCcceEeeCCH---HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEE Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRVDNE---VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFV 147 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~~d~---~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v 147 (502) ++|||+.||++.++||||+|++++++++ ...+.|+++++ ++|+..+.++++.|+++|.+|.++|+|. |.+++..+ T Consensus 72 ~~nf~k~Ivd~~~~yl~G~Pv~~~~~d~~~~e~~~~l~~~~~-~~~~~~~~el~~~~s~~G~ay~~~y~de~~~~~~~~i 150 (537) T protein:vir:78 72 SHGFFTELVDQLAQYLLSNGVEVKVKDEDNTQLDEILQEYFD-EDFQATIDTLVTNASKKGFEGIFARTTSEGKLKFQTV 150 (537) T ss_pred ccchHHHHHHHHhhhhcccCceeecCcchhHHHHHHHHHHhh-ccHHHHHHHHHHHHhhcCeeEEEeeecCCCceEEEEE Confidence 9999999999999999999999999754 45667888775 7999999999999999999999999986 57999999 Q ss_pred cCCeEEEEEEcCCCeEEEEEEEEEEEee--CCCceEEEEEEEEEEeCCeEEEEEE-----EEecCCccccCceeecccc- Q lcl|NC_012753. 148 QATVFFPLQANTQDVSSAAIVTKSTKTE--GQKVKYYSLIEFHEWNKETYTISNE-----LYESESKTIIGQRVPLSTL- 219 (502) Q Consensus 148 ~~~~~~Pi~~d~~~~~~~~~~~~~~~~~--~~~~~~yt~~E~h~~~~~~~~I~~~-----l~~~~~~~~lG~~v~l~~~- 219 (502) +|.++||+|.+++.....+.++..+... ........++|+|+-+...+++... .+.. .....+.+++.... T Consensus 151 ~p~~~~pv~d~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~evyt~~~i~~y~~~~~~~~~~~~~-~~~~~~~~i~~~~~~ 229 (537) T protein:vir:78 151 DGLTLIPVFDDYGVLKMIIRWYSEIRYSTKQQSTETIWHADVWNEEAVCYYIQDDEGVSTTYKL-DEAYNPNPAPHVLAI 229 (537) T ss_pred ccceeEEEEcCCCCceeEEEEEeeeeccccccCcceEEEEEEEcCCcEEEEEecCCcccccccc-cccccccccceeeec Confidence 9999999987776655443333322222 2233445667777654444433210 0000 00000111111000 Q ss_pred --------ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeech Q lcl|NC_012753. 220 --------YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPT 291 (502) Q Consensus 220 --------~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~ 291 (502) ..........++++++||+.|++| ..|+|+|+++++|||+||.++|+++|+++.....|+| T Consensus 230 ~~~~~~~~~~~~~~~~~~~~~g~iPvv~f~nn---------~~~~sd~e~v~~LiDayd~~~S~~an~~~~~~~~ilv-- 298 (537) T protein:vir:78 230 EESTDADFEDTDGYQVLGRSYSKFPFQLLYNN---------KDGMSDVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYV-- 298 (537) T ss_pred cccccccccccccccccccCCcceeEEEeccC---------ccCCCchhhhHHHHHHHHHHHHhhhhHHHHhcCceee-- Confidence 011112234578999999999875 3589999999999999999999999999998899988 Q ss_pred HHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccc Q lcl|NC_012753. 292 QMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGK 371 (502) Q Consensus 292 ~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~ 371 (502) +........ ..+..+...++.+...+ ++..++.++.++..++....++.+.+.|...+..+. +++... T Consensus 299 --i~g~~~~~~------~~~~~~l~~~~~i~v~~--d~~~v~~l~~~~~~~~~e~~ld~L~~~I~~~s~~~~--~~~~~~ 366 (537) T protein:vir:78 299 --VKGFSGDST------DKLRQNIKAKKMIGVNG--DNAGMEIQTVSIPYEARKAKMDIDVENIYRSGMGFN--STAVGD 366 (537) T ss_pred --eecCCCccc------hhHHHHHhhcCceeecC--CCCceeEEEecCCHHHHHHHHHHHHHHHHHhcCCCC--Cccccc Confidence 433211111 11222333344443332 233467788888888888888887777765442221 234445 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_012753. 372 SMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMV 451 (502) Q Consensus 372 ~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~ 451 (502) |+.||.|++++++.+.++|+.+++.|+++|++++++|+.+++. .++....+..+.|+|++++|.|+.+.++++++++ T Consensus 367 gn~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~---~~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~ 443 (537) T protein:vir:78 367 GNVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIAL---RGLGEYDSNDICFEIEPHVLANELDIATTRKTEA 443 (537) T ss_pred cCCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---cCCcccccceeeEEeccCCCCCHHHHHHHHHHHH Confidence 6779999999999999999999999999999999999988764 3334456678999999999999999999999999 Q ss_pred hcCCCCHHHHHHhcCCCCHHHHHHHHHHHH------------HhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 452 AAGFAPKTMAIEKTLNVTKEQAQEIYQKIN------------DETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 452 ~~Gi~S~et~l~~~~~~~deea~~el~ri~------------~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++|++|.+|+|+.++.++|.|.++..++.. +++.+..+..++ .+..+.|. T Consensus 444 ~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 505 (537) T protein:vir:78 444 ETEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQSLDVSPD-VQAMLDGL 505 (537) T ss_pred hcCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhcccccCcCcc-hhhhcCCC Confidence 999999999988865555543222111110 001111111111 11222332 No 44 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=100.00 E-value=8e-60 Score=344.58 Aligned_cols=437 Identities=12% Similarity=0.052 Sum_probs=306.9 Q ss_pred CC-----hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecch Q lcl|NC_012753. 1 MG-----IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~-----~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~ 75 (502) || ..+.|+.+|++. ......+|++.++||.|+|+++..... +...++++++|| T Consensus 19 ~~~~~~~~~~~i~~~i~~~--------------------~~~~~~~~~~l~~Yy~g~~~i~~~~~~--~~~~~~ki~~n~ 76 (470) T protein:vir:99 19 FPKGEKLTSNELLGFIAYN--------------------ETVLKPRYRENMKLYLGKHKILTAPEK--ETGADNRIVVNS 76 (470) T ss_pred eCCCCCcCHHHHHHHHHHH--------------------HHhhHHHHHHHHHHhccccccccCccc--ccCCcceeecch Confidence 22 122344443321 123456889999999999988765433 344577899999 Q ss_pred HHHHHHHHhhhhhcCcceEeeCC-HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEE Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVDN-EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFF 153 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~d-~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~ 153 (502) |++||++.++||||+|+++++++ ....+.|++++++|+|...+.++++.++++|.+|+++|+|+ |.+++.+++|.+++ T Consensus 77 ~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~dg~~~i~~~~p~~~~ 156 (470) T protein:vir:99 77 AKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWNRQENFFDTINEISKQCDIFGRSIASIYQGEDARPHLMYSSPNHAF 156 (470) T ss_pred HHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHHHhcCHhHHHHHHHHHHHhcCeeEEEEEeCCCCeEEEEEEccceeE Confidence 99999999999999999999965 45678999999999999999999999999999999999975 67999999999999 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) |+|.+..+....++++.+....+....+| ++.|. .+..|+ |...+ .+...+ ......++++ T Consensus 157 ~i~d~~~~~~~~~~vr~~~~~~~~~~~~~--~~~~~-~~~~~~-----~~~~~---~~~~~~--------~~~~~~~~~g 217 (470) T protein:vir:99 157 IIYDDTVQRQPLAFVHYQIDNSNNWTDAY--GVIQY-ADKFYK-----FKGYD---IEEDTN--------AAGYAINPYG 217 (470) T ss_pred EEEcCCCCcceEEEEEEEEEecCCeeEEE--EEEEe-cCeEEE-----EEecc---cccccc--------cccccccCCC Confidence 99877666555555544333222222222 22221 111121 11111 111110 1122347788 Q ss_pred cceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFET 313 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~ 313 (502) ++|+++|+++ ++|+|+|+++++|||+||.++|++++.++....++++-.++... .+..|.. .... T Consensus 218 ~vPvv~~~n~---------~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~~~-~~~~g~~-----~~~~ 282 (470) T protein:vir:99 218 LVPAVEFFEN---------EERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFKLP-EDDEGNP-----KFDF 282 (470) T ss_pred ccceEeecCC---------CCCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCcc-cccccch-----hhhh Confidence 9999998753 46899999999999999999999999999888888773222111 1111111 1111 Q ss_pred cchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 314 GHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 314 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~ 393 (502) .......+...+.+.+..++.++.++..+.+...++.+.+.|...++.+..+++. .+|+.||.+++++++.+.+++..+ T Consensus 283 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~-~~~n~Sg~Ai~~~~~~l~~k~~~~ 361 (470) T protein:vir:99 283 KNNRVLYVSQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKN-FAGNSSGVALQYKLFAMKNKADSK 361 (470) T ss_pred hhcceeeecCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCccccccc-cccCchHHHHHHHHHHHHHHHHHH Confidence 2222222323333444557778888889999998999989888888877544332 246679999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQA 473 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea 473 (502) ++.|+.+|++++++|+.+... .+.......+++++|++++|.|..+.++++++++ |++|.||++..++++ + + T Consensus 362 ~~~~~~~l~~~~~li~~~~~~---~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~--giis~et~l~~l~~v-d--~ 433 (470) T protein:vir:99 362 ERKFDKSLMQLYRIVLATLFN---NKQDQELWSELDFKFTRNLPEDMASAIDNAKNAE--GIVSKKTQLGMIPDI-E--P 433 (470) T ss_pred HHHHHHHHHHHHHHHHHHHhc---cCCcccccccceEEeCCCCCcCHHHHHHHHHHHh--ccCCHHHHHHhCCCC-C--H Confidence 999999999999999877543 2333445668999999999999999999999885 999999998887555 3 5 Q ss_pred HHHHHHHHHhhhccc------CCCCCccccCCCCC Q lcl|NC_012753. 474 QEIYQKINDETMVST------DSFRTSEEVDIYGE 502 (502) Q Consensus 474 ~~el~ri~~E~~~~~------~~~~~~~~~~~~g~ 502 (502) ++|++||++|+.... -...+..+.|-.|| T Consensus 434 ~~E~eri~~E~~~~~~~~~~~~~~~d~~~~d~~~e 468 (470) T protein:vir:99 434 DAEMKQIAKEKADAIKQTQQLSMPIDILKRDNNAE 468 (470) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCCcCCCCCCcc Confidence 678999998865321 11123334444444 No 45 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=100.00 E-value=4e-60 Score=346.22 Aligned_cols=443 Identities=9% Similarity=-0.002 Sum_probs=302.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |++.- +|-++......+....|.+.+. ....+..+|.++.+||.|+|+++... ...+...++|+++|||++|| T Consensus 1 ~~~~~-~~~~~~~~~~~~~~~~i~~~i~-----~~~~~~~r~~~~~~yy~g~~~i~~~~-~~~~~~~~~ki~~n~~~~iv 73 (453) T protein:vir:73 1 MNLKP-IKLMTYSRDEEITDKVVNDFMK-----KHQEEVERYEYLGNMYKGIMEISSQK-AKDSWKPDNRLTNNFAKYIV 73 (453) T ss_pred Ccccc-ceeeeccccccCCHHHHHHHHH-----HHHHHHHHHHHHHHHhccccchhcCC-CCCccCccceeecchHHHHH Confidence 32211 1111100000111112222211 12356689999999999999876543 33445567899999999999 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEcC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQANT 159 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d~ 159 (502) ++.|+||||+|++++++++..++.|++++++|+|...+.++++.++++|.+|+++|+|+ |.+++.+++|.+++|+|.+. T Consensus 74 d~~~~~l~g~~~~~~~~d~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~~~~~i~~~~p~~~~~v~dd~ 153 (453) T protein:vir:73 74 DTFVGYFNGIPIKKTHDDKSVLEAMQLFDNLNDMEDEESELAKIACVYGRAYELMYQNESTESEVIYCSPLNVFMVYDDS 153 (453) T ss_pred HHhhhhhcccCceeecCChHHHHHHHHHHHhcChhHHHHHHHHHHHhcCeEEEEEEeCCCCceEEEEEcccceEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999985 57999999999999998776 Q ss_pred CCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 160 QDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) .+....+++ +++...+ +.+ ..+.|+. . .|.+ |...+.. |. ......++++++|+++ T Consensus 154 ~~~~~~~~i-~~~~~~~--~~~--~~~vyt~--~--~i~~--~~~~~~~-----------~~--~~~~~~~~~g~vPvv~ 209 (453) T protein:vir:73 154 IKQKPLFAV-YYGFDEE--GNL--SGTVYTL--L--ETIS--ITGKAGE-----------VK--FGESTYNVYSDLPIVE 209 (453) T ss_pred CCceeEEEE-EEEEecC--ceE--EEEEEeC--C--eEEE--EEecCCc-----------eE--EccceeccCCceeEEE Confidence 555444433 3333222 222 2344431 1 1111 2221110 00 0112346788999999 Q ss_pred ecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCC--CCCCcccCccccccccchh Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEY--DTNGEKVTVKREFETGHNV 317 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~--~~~g~~~~~~~~~~~~~~~ 317 (502) |+++ +.|.|+|+++++|+|+||.++|++++.++.....+++ +.... +.........+........ T Consensus 210 ~~n~---------~~g~s~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~----~~g~~~~~~~~~~~~~~~~~~~~~~~ 276 (453) T protein:vir:73 210 YNFN---------EERQSIFEPVHSLINSYNKVTSEKANDVEYFSDQYLV----FLGAEVDEEDAKNIKDNRLINFFDKN 276 (453) T ss_pred ecCC---------CCCCcchhhHHHHHHHHHHHHHHHHHHHHHhccceee----eecCCCCchhhhcccccccccccccc Confidence 8753 4689999999999999999999999999887777766 32111 1111111111110000000 Q ss_pred hccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV 397 (502) Q Consensus 318 ~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~ 397 (502) .........+.-++.++.++..+.+...++.+.+.|...++.+. +++...|+.||.+++++++.|.++|+.+++.| T Consensus 277 --~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~ 352 (453) T protein:vir:73 277 --SNGQGTNAAKVDVKFLDKPDSDVQTENLLNRLERSIFQFTMAAN--ISDENFGNSSGVALAYKLQAMSNLALSFQRKF 352 (453) T ss_pred --cccccccccCceeEEeeecCCHHHHHHHHHHHHHHHHHHhCCcc--cCcccccCccHHHHHHHHHHHHHHHHHHHHHH Confidence 00000011112256677777788888888888777777665543 45555567899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHH Q lcl|NC_012753. 398 EKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIY 477 (502) Q Consensus 398 ~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el 477 (502) +.+|++++++|+.+.... +.......++|.|++++|.|..+.++++++++ |++|.||++..+++++| +++|+ T Consensus 353 ~~~l~~~~~li~~~~~~~----~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~--giis~et~~~~~~~~~d--~~~E~ 424 (453) T protein:vir:73 353 QSALNRRYSLWSSLSTNA----SNKDAWKDIEYTFTRNEPKDIKEQAETANILK--GITSEETALSVISVIPD--VQAEM 424 (453) T ss_pred HHHHHHHHHHHHHHHhcc----CCccccccceEEeCCCCCCCHHHHHHHHHHHh--ccCcHHHHHHhCCCCCC--HHHHH Confidence 999999999998876432 23345567999999999999999999999885 99999999987766655 77889 Q ss_pred HHHHHhhhcc-------cCCCCCccccCC Q lcl|NC_012753. 478 QKINDETMVS-------TDSFRTSEEVDI 499 (502) Q Consensus 478 ~ri~~E~~~~-------~~~~~~~~~~~~ 499 (502) +||++|+.+. ....+....++| T Consensus 425 ~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 453 (453) T protein:vir:73 425 EKIKKKKLLQLSLTRTSNLVRMKQMRGNL 453 (453) T ss_pred HHHHHHHHHHHHHHHhccCCcchhhhcCC Confidence 9999886643 234444556666 No 46 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=100.00 E-value=2.5e-60 Score=347.34 Aligned_cols=420 Identities=8% Similarity=0.016 Sum_probs=306.4 Q ss_pred ccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEeeC Q lcl|NC_012753. 18 ITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVD 97 (502) Q Consensus 18 ~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~ 97 (502) +....|.+.+.+. ..+..+|...++||.|+|+++..... .+...++++++|||++||++.++||||+|++++++ T Consensus 1 l~~~~l~~~i~~~-----~~~~~r~~~l~~yy~g~~~il~~~~~-~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~ 74 (429) T protein:vir:98 1 MTKDLLSELIQKH-----RSFNLSYSAYKQLYEGDHAILQQKQK-EQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQTSHE 74 (429) T ss_pred CCHHHHHHHHHHH-----HHHHHHHHHHHHHhcccccccccccc-ccCCCcceeecchHHHHHHHHhhhhcccCceeecC Confidence 2223333333221 24567999999999999998865543 33455679999999999999999999999999999 Q ss_pred CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeC Q lcl|NC_012753. 98 NEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEG 176 (502) Q Consensus 98 d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~ 176 (502) ++..++.|+++++.|+|...+.++++.++++|.+|+.+|+|+ |.+++.+++|.+++|+|.+..+....++++ ++...+ T Consensus 75 ~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~d~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~-~~~~~~ 153 (429) T protein:vir:98 75 NKQVSNYLELLDGYNDQDDNNAELSKICSIYGHGYELVFNDENAEAGITYLTPLEAFIVYDDSIRQKPLFAVR-YFYNKG 153 (429) T ss_pred ChHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCcEEEEEEcccceEEEEeCCCCCceEEEEE-EEEecC Confidence 999999999999999999999999999999999999999975 679999999999999988766554455443 333222 Q ss_pred CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCc Q lcl|NC_012753. 177 QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGL 256 (502) Q Consensus 177 ~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~ 256 (502) +. ++.++++ ....+ .|...+ -|..+ .+...++++++|+++|+++ ++|+ T Consensus 154 --~~--~~~~~~~--~~~~~----~~~~~~---~~~~~----------~~~~~~~~g~vPvv~~~n~---------~~g~ 201 (429) T protein:vir:98 154 --GV--LEGSYSD--ASNIT----YFKDGE---KGIEI----------GESEPHPFDGVPMIEYVEN---------EERQ 201 (429) T ss_pred --ce--EEEEEEe--CceEE----EEEecC---CceEe----------cccccccCCccceEEecCC---------CCCC Confidence 11 2233322 22211 121111 01111 1123477889999998753 4699 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCc-cccceeee Q lcl|NC_012753. 257 SIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMD-KGIGITDL 335 (502) Q Consensus 257 S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~i~~~ 335 (502) |+|+++++|+|+||.++|++++.++.....+++ +..... .+ .+..+...++.+...+++ .+..++.+ T Consensus 202 sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~----i~g~~~-~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~l 269 (429) T protein:vir:98 202 SLLASVVTLINAFNKAISEKANDVEYFADAYLK----ILGAEL-DD-------ETLKSLRDTRIINLKDTDAQQLTVEFL 269 (429) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHhcCceee----eecCCC-Cc-------chhhhHhhCceeeccCCCCCCcceeEE Confidence 999999999999999999999999988888877 332111 11 122233333444444333 23346677 Q ss_pred ccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_012753. 336 TTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVY 415 (502) Q Consensus 336 ~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~ 415 (502) +.++..+.+.+.++.+.+.|...++.+. +++.+.|+.||.+++++++.+.++++.+++.|+.+|++++++|+.+.+.. T Consensus 270 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 347 (429) T protein:vir:98 270 QKPDADATQEHLLDRLENLIFRTAMVAN--ISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSK 347 (429) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCccc--cCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 7788888888888888888877776653 44455567899999999999999999999999999999999999876432 Q ss_pred cccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCcc Q lcl|NC_012753. 416 NLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSE 495 (502) Q Consensus 416 ~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~ 495 (502) +.......++|.|++++|.|..+.+++++++ +|+||.||++..+++++| +++|++||++|+...... . T Consensus 348 ----~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl--~g~is~et~~~~l~~v~d--~~~E~~ri~~E~~~~~~~----~ 415 (429) T protein:vir:98 348 ----IGPKDWIGIKYKFTRNLPANLLEESQIAGNL--AGIVSEETQVGVLSIVEN--PQKEIERKNSDKSTLISR----Q 415 (429) T ss_pred ----CCccccccceEEeCCCCCcCHHHHHHHHHHH--hccCchHHHHHhCCCCCC--HHHHHHHHHHHHHHHHHH----H Confidence 2233456799999999999999999999987 699999999888766665 678999999997654221 2 Q ss_pred ccCCCCC Q lcl|NC_012753. 496 EVDIYGE 502 (502) Q Consensus 496 ~~~~~g~ 502 (502) ..++.|+ T Consensus 416 ~~~~~~~ 422 (429) T protein:vir:98 416 AGGLNGQ 422 (429) T ss_pred HhhhcCC Confidence 2233333 No 47 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=100.00 E-value=4.3e-60 Score=346.04 Aligned_cols=425 Identities=13% Similarity=0.150 Sum_probs=301.3 Q ss_pred ccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC------CCccccccceecchHHHHHHHHhhhhhcCc Q lcl|NC_012753. 18 ITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS------NGSQVKRDFNHLPIGRTASKKVASLVFNEQ 91 (502) Q Consensus 18 ~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~------~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep 91 (502) +....|.+.+++ ..++..+|..+++||.|+|+++..... ......++|+++||++.||++.++||||+| T Consensus 1 l~~~~i~~~i~~-----~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p 75 (451) T protein:vir:10 1 MELEKIRAIISA-----DAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYP 75 (451) T ss_pred CCHHHHHHHHHH-----HHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheeccc Confidence 222333332221 235678999999999999988764321 223345678999999999999999999999 Q ss_pred ceEeeCC-HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC---------CceEEEEEcCCeEEEEEEcCCC Q lcl|NC_012753. 92 ATIRVDN-EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG---------DQIRVSFVQATVFFPLQANTQD 161 (502) Q Consensus 92 ~~i~~~d-~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~---------~~~~i~~v~~~~~~Pi~~d~~~ 161 (502) +++++++ +...+.|+.+++ |+|...+.++++.++++|.+|.++|+|. +.+++..++|.++||+|.++.+ T Consensus 76 ~~~~~~~~~~~~~~~~~~~~-n~~~~~~~~~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~ 154 (451) T protein:vir:10 76 VLFDIDNNKELNEKVTDVLG-NEFTRKAKNLAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIE 154 (451) T ss_pred ceeecCCcHHHHHHHHHHhc-cCHHHHHHHHHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCC Confidence 9999866 456678877775 7999999999999999999999999974 5788999999999999887655 Q ss_pred eEEEEEEEEEEEeeCCCc----eEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 162 VSSAAIVTKSTKTEGQKV----KYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 162 ~~~~~~~~~~~~~~~~~~----~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) ....++++.++..++..+ ..++++|.|+.+ . +.+ |+..+....|..+ ......++++++|+ T Consensus 155 ~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~--~--~~~--~~~~~~~~~~~~~---------~~~~~~~~~g~vPv 219 (451) T protein:vir:10 155 RELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDK--I--LDK--YKFFGVSCCGSQI---------EHITVQHRFNSVPF 219 (451) T ss_pred CceEEEEEEEEeeecccccccceEEEEEEEEeCC--e--EEE--EEecccCcccccc---------ccccccCCCCeeeE Confidence 555566555444433222 234556665422 1 111 2222222222211 12234578999999 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchh Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNV 317 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~ 317 (502) ++|++| ..|.|+|+.+++|||+||.++|++++.++.....+++ +......... .+...... T Consensus 220 v~~~nn---------~~~~~d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~----~~g~~~~~~~------~~~~~~~~ 280 (451) T protein:vir:10 220 VEFSNN---------IKKQSDLSKYKKILDLYDRVMSGFANDLEDIQQIIYI----LENFGGEDTS------EFLKELKR 280 (451) T ss_pred EEeccC---------CCCCCchhhHHHHHHHHHHHHHHHHHHHHHhccceee----eecCCcccch------hhHHHHhh Confidence 999864 3478999999999999999999999999988888887 3322111111 11111122 Q ss_pred hccc--cCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQF--DSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIAT 395 (502) Q Consensus 318 ~~~~--~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~ 395 (502) +..+ ...+.+.+..++.++.+++.+++.+.++.+.+.|...++.+. +++...|+.||.|++++++.+.++|+.+++ T Consensus 281 ~~~i~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~ 358 (451) T protein:vir:10 281 YKTIKTETDSEGDSGGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTENFGNASGVALKFFYRKLELKSGLLET 358 (451) T ss_pred CCeEEecCcCCccCCcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--ccccccccccHHHHHHHHHHHHHHHHHHHH Confidence 2222 222223344577788888888888888888888877776653 344445678999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 396 LVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQE 475 (502) Q Consensus 396 ~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~ 475 (502) .|+++|++++++|+.+.+. ..+..+.+.|++++|.|..+.++++++++ |++|.||+++.++++++ +++ T Consensus 359 ~f~~~l~~~~~li~~~~~~--------~d~~~i~i~f~~~~p~n~~e~~~~~~kl~--g~iS~et~~~~~p~v~d--~~~ 426 (451) T protein:vir:10 359 EFRTSFDKLIKAILYFLGV--------TDYKKIQQTYTRNMMSNDLEDADIATKSV--GIIPTKIILRHHPWVDD--VEE 426 (451) T ss_pred HHHHHHHHHHHHHHHHhCC--------CCccceeEEecCCCCCCHHHHHHHHHHHh--ccCchHHHHHhCCCCCC--HHH Confidence 9999999999999977532 24567899999999999999999999985 99999999888766655 667 Q ss_pred HHHHHHHhhhccc----CCCCCccc Q lcl|NC_012753. 476 IYQKINDETMVST----DSFRTSEE 496 (502) Q Consensus 476 el~ri~~E~~~~~----~~~~~~~~ 496 (502) |++++++|+.... +.+.+.++ T Consensus 427 e~~~~~ee~~~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 427 AEKLYLEEKKIQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCCC Confidence 8888876654332 22222222 No 48 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=100.00 E-value=1.9e-59 Score=342.49 Aligned_cols=445 Identities=11% Similarity=0.043 Sum_probs=315.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcc---cccc-------------CCCc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSV---TYRD-------------SNGS 64 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~---~~~~-------------~~~~ 64 (502) |++.+.|...--+ -+..+.|.+.++. ......++....+||.|.++.. ..+. .... T Consensus 1 ~~~~~~~~~~~~~---~~~~e~i~~~i~~-----~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (474) T protein:vir:94 1 MTLYKLIDDIEAQ---GILPKHIEALIES-----HKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLD 72 (474) T ss_pred CchHHHHhhcccc---CCCHHHHHHHHHH-----hhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccc Confidence 8888777555211 1111223333321 1234567788888998865422 1110 1122 Q ss_pred cccccceecchHHHHHHHHhhhhhcCcceEeeC-----CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Q lcl|NC_012753. 65 QVKRDFNHLPIGRTASKKVASLVFNEQATIRVD-----NEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG 139 (502) Q Consensus 65 ~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~-----d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~ 139 (502) ...++|+++|||+.||++.++||||+|++++++ ++...++|++++++|+|...+.+++..++++|.+|.++|.|+ T Consensus 73 ~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~ 152 (474) T protein:vir:94 73 VSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT 152 (474) T ss_pred cCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC Confidence 345678999999999999999999999999985 456778999999999999999999999999999999999985 Q ss_pred -CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc Q lcl|NC_012753. 140 -DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST 218 (502) Q Consensus 140 -~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~ 218 (502) |.+++.+++|.+++|++.+++... ++++.++..+...+..+..+++|+ ....+ .|.+.+. |... T Consensus 153 ~~~~~~~~i~p~~~~~v~d~~~~~~--~~i~~~~~~~~~~~~~~~~~~~y~--~~~~~----~~~~~~~---~~~~---- 217 (474) T protein:vir:94 153 NGDIRIKNIDPYNVIFVGDNILEPT--YSLRYFYEKDDDNGTDYVYAEFYD--NAYYY----VFRGEGI---DALQ---- 217 (474) T ss_pred CCeeEEEEEcccceEEEEcCCCceE--EEEEEEEEeeCCCceEEEEEEEEc--CceEE----EEeecCC---Cccc---- Confidence 579999999999999987666554 333334444444455555666653 22221 2333221 1100 Q ss_pred cccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCC Q lcl|NC_012753. 219 LYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEY 298 (502) Q Consensus 219 ~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~ 298 (502) ......++++.+|++.|+++ +.|+|+|+.+++|||+||.++|++++.++.....+++ +.... T Consensus 218 -----~~~~~~~~~g~vPvv~~~n~---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~----i~g~~ 279 (474) T protein:vir:94 218 -----EVGRYEHLFDYNPLFGVPNN---------KEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLV----LRGMG 279 (474) T ss_pred -----ccccccCCCCccceEEecCC---------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh----hccCC Confidence 01123477889999998753 4689999999999999999999999999987777776 33211 Q ss_pred CCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH Q lcl|NC_012753. 299 DTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE 378 (502) Q Consensus 299 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte 378 (502) . ... +..+......+...++ +..++.++.++..+++...++.+.+.|...++.+..+++. .+|+.||.| T Consensus 280 ~-~~~-------~~~~~~~~~~i~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~A 348 (474) T protein:vir:94 280 M-SEE-------MIQETQKSGAFELFDK--DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDE-FNGNVPIIG 348 (474) T ss_pred C-Cch-------hhhhhhhcceeEecCC--CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHH Confidence 1 110 1111122223333222 2346777788888888888888888887777765433322 235679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPK 458 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~ 458 (502) ++++++.+.++|+.+++.|+.+|++++++|+.++...+. +........+++.|++++|.|..+.+++++++ +|++|. T Consensus 349 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~-~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~ 425 (474) T protein:vir:94 349 MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGY-NLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSE 425 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-CCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCch Confidence 999999999999999999999999999999987654211 11223445789999999999999999999988 499999 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 459 TMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 459 et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|+++.++++++ +++|++||++|+.......++..+++.-++ T Consensus 426 et~~~~l~~v~d--~~~E~eri~~E~~e~~~~~~~~~~~~~~~~ 467 (474) T protein:vir:94 426 RTRLGQSQLVDD--VDYELDEMEKESLEFNDKLPDIDEGDANDK 467 (474) T ss_pred HHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccCCCcCCC Confidence 999888766654 889999999999877777777776776666 No 49 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=100.00 E-value=1.9e-59 Score=342.49 Aligned_cols=445 Identities=11% Similarity=0.043 Sum_probs=315.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcc---cccc-------------CCCc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSV---TYRD-------------SNGS 64 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~---~~~~-------------~~~~ 64 (502) |++.+.|...--+ -+..+.|.+.++. ......++....+||.|.++.. ..+. .... T Consensus 1 ~~~~~~~~~~~~~---~~~~e~i~~~i~~-----~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (474) T protein:vir:10 1 MTLYKLIDDIEAQ---GILPKHIEALIES-----HKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLD 72 (474) T ss_pred CchHHHHhhcccc---CCCHHHHHHHHHH-----hhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccc Confidence 8888777555211 1111223333321 1234567788888998865422 1110 1122 Q ss_pred cccccceecchHHHHHHHHhhhhhcCcceEeeC-----CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Q lcl|NC_012753. 65 QVKRDFNHLPIGRTASKKVASLVFNEQATIRVD-----NEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG 139 (502) Q Consensus 65 ~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~-----d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~ 139 (502) ...++|+++|||+.||++.++||||+|++++++ ++...++|++++++|+|...+.+++..++++|.+|.++|.|+ T Consensus 73 ~~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~ 152 (474) T protein:vir:10 73 VSVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYIDT 152 (474) T ss_pred cCcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEeCC Confidence 345678999999999999999999999999985 456778999999999999999999999999999999999985 Q ss_pred -CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc Q lcl|NC_012753. 140 -DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST 218 (502) Q Consensus 140 -~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~ 218 (502) |.+++.+++|.+++|++.+++... ++++.++..+...+..+..+++|+ ....+ .|.+.+. |... T Consensus 153 ~~~~~~~~i~p~~~~~v~d~~~~~~--~~i~~~~~~~~~~~~~~~~~~~y~--~~~~~----~~~~~~~---~~~~---- 217 (474) T protein:vir:10 153 NGDIRIKNIDPYNVIFVGDNILEPT--YSLRYFYEKDDDNGTDYVYAEFYD--NAYYY----VFRGEGI---DALQ---- 217 (474) T ss_pred CCeeEEEEEcccceEEEEcCCCceE--EEEEEEEEeeCCCceEEEEEEEEc--CceEE----EEeecCC---Cccc---- Confidence 579999999999999987666554 333334444444455555666653 22221 2333221 1100 Q ss_pred cccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCC Q lcl|NC_012753. 219 LYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEY 298 (502) Q Consensus 219 ~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~ 298 (502) ......++++.+|++.|+++ +.|+|+|+.+++|||+||.++|++++.++.....+++ +.... T Consensus 218 -----~~~~~~~~~g~vPvv~~~n~---------~~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~----i~g~~ 279 (474) T protein:vir:10 218 -----EVGRYEHLFDYNPLFGVPNN---------KEMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLV----LRGMG 279 (474) T ss_pred -----ccccccCCCCccceEEecCC---------CCCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhh----hccCC Confidence 01123477889999998753 4689999999999999999999999999987777776 33211 Q ss_pred CCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH Q lcl|NC_012753. 299 DTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE 378 (502) Q Consensus 299 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte 378 (502) . ... +..+......+...++ +..++.++.++..+++...++.+.+.|...++.+..+++. .+|+.||.| T Consensus 280 ~-~~~-------~~~~~~~~~~i~~~~~--~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~-~~~n~Sg~A 348 (474) T protein:vir:10 280 M-SEE-------MIQETQKSGAFELFDK--DMDVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFNSDE-FNGNVPIIG 348 (474) T ss_pred C-Cch-------hhhhhhhcceeEecCC--CCceeEEeccCCHHHHHHHHHHHHHHHHHHhCCccccccc-ccccchHHH Confidence 1 110 1111122223333222 2346777788888888888888888887777765433322 235679999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPK 458 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~ 458 (502) ++++++.+.++|+.+++.|+.+|++++++|+.++...+. +........+++.|++++|.|..+.+++++++ +|++|. T Consensus 349 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~-~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl--~g~iS~ 425 (474) T protein:vir:10 349 MKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGY-NLDDDSYLNLIFKFTRNIPVNKLEESQVLINL--KGQVSE 425 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccC-CCCccccccceEEeCCCCCCCHHHHHHHHHHH--hccCch Confidence 999999999999999999999999999999987654211 11223445789999999999999999999988 499999 Q ss_pred HHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 459 TMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 459 et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|+++.++++++ +++|++||++|+.......++..+++.-++ T Consensus 426 et~~~~l~~v~d--~~~E~eri~~E~~e~~~~~~~~~~~~~~~~ 467 (474) T protein:vir:10 426 RTRLGQSQLVDD--VDYELDEMEKESLEFNDKLPDIDEGDANDK 467 (474) T ss_pred HHHHHhCCCCCC--HHHHHHHHHHHHHHHHhhcccccCCCcCCC Confidence 999888766654 889999999999877777777776776666 No 50 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=100.00 E-value=4.6e-59 Score=340.42 Aligned_cols=429 Identities=10% Similarity=0.018 Sum_probs=302.7 Q ss_pred chhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc-CCCccccccceecchHHHHHHHHhhhhhcCcceEeeCCH- Q lcl|NC_012753. 22 SLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD-SNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNE- 99 (502) Q Consensus 22 ~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~-~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~- 99 (502) -|.+ ...++..||+...+||.|+|+++..+. ...+...++|+++|||+.||++.++||||+|++++++++ T Consensus 1 ~~~~--------~~~~~~~r~~~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~ 72 (440) T protein:vir:95 1 MLAA--------FLGSQKQRLAILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGG 72 (440) T ss_pred Chhh--------HHHHHHHHHHHHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCc Confidence 1111 233567899999999999999765443 234445677899999999999999999999999988654 Q ss_pred --HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeC Q lcl|NC_012753. 100 --VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEG 176 (502) Q Consensus 100 --~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~ 176 (502) +..+.|++++++|+|...+.++++.|+++|.+|+++|+|+ |.+++.+++|.+++|++.+.......++++ ++..++ T Consensus 73 ~~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~-~~~~~~ 151 (440) T protein:vir:95 73 SADQLSTIKDIEWQNDINALNSDLAFDASVYGRAYEYHFRDKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVH-LPIYAD 151 (440) T ss_pred cHHHHHHHHHHHHhcCHhHHHHHHHHHHhhcCeEEEEEEecCCCceEEEEEcccceEEEEcCCCCCceEEEEE-EEEecC Confidence 4566889999999999999999999999999999999986 579999999999999986655444444443 333222 Q ss_pred CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCc Q lcl|NC_012753. 177 QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGL 256 (502) Q Consensus 177 ~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~ 256 (502) ..+ ++.|+ +. ..+.+..... .. +... ......++++++|+++|+++ ..|+ T Consensus 152 --~~~---~~vyt--~~-~~~~~~~~~~---~~-~~~~---------~~~~~~~~~g~vPvv~~~n~---------~~g~ 201 (440) T protein:vir:95 152 --KVN---MTVYT--KD-KVITYKPYSN---NS-VRLV---------VDDVKKHSYNDVPVVEWWNN---------RFRM 201 (440) T ss_pred --ceE---EEEEe--CC-eEEEEEEecC---Cc-ccee---------ecceeeccCceeeEEEeeCC---------CCCC Confidence 111 23332 11 1222221111 11 1100 11234578899999999764 3589 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhccc--cCCCCccccceee Q lcl|NC_012753. 257 SIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQF--DSGDMDKGIGITD 334 (502) Q Consensus 257 S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~i~~ 334 (502) |+|+.+++|||+||.++|+++++++.....++|...+..... ..+... ..+......+... .....+.+..++. T Consensus 202 sd~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~-~~~e~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (440) T protein:vir:95 202 GDYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIK-LSPEDA---AKMKDANMLFLKTGISTTGQQTTADASY 277 (440) T ss_pred CchhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCC-CCccch---hhhhhccceecccccccccCCCCcceeE Confidence 999999999999999999999999988888877332211000 011110 0111111111111 1111123334777 Q ss_pred eccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_012753. 335 LTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKV 414 (502) Q Consensus 335 ~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~ 414 (502) ++.++..+.+...++.+.+.|...++.+...++.- +++.||.+++++++.+.++++.+++.|+.+|++++++|+.++.. T Consensus 278 lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~-~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~ 356 (440) T protein:vir:95 278 IYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRF-NSTSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKA 356 (440) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCCcccccccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 78888889999999999998888888765444322 35679999999999999999999999999999999999887654 Q ss_pred hcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCC- Q lcl|NC_012753. 415 YNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRT- 493 (502) Q Consensus 415 ~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~- 493 (502) ..+.......++|.|++++|.|..+.+++++++ +|++|.||++.+++++++ ++|++||++|+......... T Consensus 357 ---~~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl--~g~iS~et~~~~l~~~d~---~~E~~ri~~E~~~~~~~~~~~ 428 (440) T protein:vir:95 357 ---INGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA--GGEISQETLMENASFTDY---KTEHSRILKQGGSSDLEIGQI 428 (440) T ss_pred ---cCCcccccccceEEeCCCCCCCHHHHHHHHHHH--hccCcHHHHHHhCCCCCc---HHHHHHHHHHHHHhhhhHHhh Confidence 234445567899999999999999999999987 599999999988766543 35789999988766544332 Q ss_pred ccccCCCCC Q lcl|NC_012753. 494 SEEVDIYGE 502 (502) Q Consensus 494 ~~~~~~~g~ 502 (502) .+..+-.|+ T Consensus 429 ~~~~~~~~~ 437 (440) T protein:vir:95 429 VGDADVGQA 437 (440) T ss_pred ccCCCCCCc Confidence 234444444 No 51 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=100.00 E-value=1.1e-53 Score=310.95 Aligned_cols=443 Identities=11% Similarity=0.045 Sum_probs=293.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+ +-.++|+++.... .....++....+||.|+|++........+..+++++++|||++|| T Consensus 1 ~~---t~~d~i~~L~~~~-----------------~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~iv 60 (480) T protein:vir:78 1 MT---TYHEHVERLQGLL-----------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATYL 60 (480) T ss_pred CC---CHHHHHHHHHHHH-----------------HHHHHHHHHHHHHHhccccchhcccccchhhhhhhhhcchHHHHH Confidence 44 3334443321110 134578889999999998753322222334446778899999999 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE------e-CCceEEEEEcCCeEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYI------D-GDQIRVSFVQATVFF 153 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~------d-~~~~~i~~v~~~~~~ 153 (502) +++|++|+.+.+++ .+++..++.|+++++.|+|...+.+++..++.+|.+|+.+|- | ++.++|.+++|.+++ T Consensus 61 d~~~~~l~~~g~~~-~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~~~~~i~~~~p~~~~ 139 (480) T protein:vir:78 61 RTLSDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRAYITVSHPDVESGDPAGIPLIRVESPLYMY 139 (480) T ss_pred HHHHhhhccCceec-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEeecCccccCCCCCeeEEEEEcccceE Confidence 99999998877643 356677899999999999999999999999999999999984 3 467999999999999 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) |+|.++......++++.++..++. + .+++++.|+.+ . |.+ |...+....+..+ ..+...|+++ T Consensus 140 ~i~D~~~~~~~~~~i~~~~~~d~~-~-~~~~~~~y~~~-~---~~~--~~~~~~~~~~~~~---------~~~~~~~~~g 202 (480) T protein:vir:78 140 AELDPRNTRRVTRAVRLYTTRDDV-A-VPDRATLYLPD-E---TVP--LRRNGGLNDQWVV---------DGDVIKHGLG 202 (480) T ss_pred EEEcCCCccceEEEEEEEEeecCC-c-ceEEEEEEeCC-e---EEE--EEecCCCcccccc---------cccccccCCC Confidence 997655443333333333333322 2 34556666532 1 211 2211211111111 1123457889 Q ss_pred cceEEEecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccceeeechHHhc-cCCCCCCcccCccccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIK-TEYDTNGEKVTVKREF 311 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~-~~~~~~g~~~~~~~~~ 311 (502) ++|++.|+ |+.+.+.++|+|+|+. +++|+|+||+++|++++.++....++.+ +. ...+..... .....+ T Consensus 203 ~vPvv~f~----n~~~~~~~~G~sdi~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~----i~G~~~~~~~~~-~~~~~~ 273 (480) T protein:vir:78 203 VVPVVPLT----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRV----ISGVTTDELTND-GENTTL 273 (480) T ss_pred CcceEEee----cccccCCccCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchhhh----hhCCCccccccc-cccchh Confidence 99999885 5567788999999985 8999999999999999988754443322 21 110000000 000001 Q ss_pred cccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHH Q lcl|NC_012753. 312 ETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRN 391 (502) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~ 391 (502) . .+........+....+.+++ ....+.|++.++.++++++..+++++..||..+.+..||.+++++++.|.++|+ T Consensus 274 ~----~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~l~~k~~ 348 (480) T protein:vir:78 274 D----IYYGRILTLASEAAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMAE 348 (480) T ss_pred h----hhhhhhccCCCCCceEEecC-ccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHH Confidence 0 00110111112222343333 234688999999999999999999999999877777799999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCCHHHHHHhcCCCC Q lcl|NC_012753. 392 SIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG--FAPKTMAIEKTLNVT 469 (502) Q Consensus 392 ~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et~l~~~~~~~ 469 (502) .+++.|+.+|++++++++.+.. ......+..+.|.|.++.+.|..+.++++.+++++| ++|.+|++. ++||+ T Consensus 349 ~~~~~f~~~l~~~~rl~~~~~~-----~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~-~lg~~ 422 (480) T protein:vir:78 349 RKGRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARI-DLGYT 422 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHcC-----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHh-cCCCC Confidence 9999999999999998886642 122334567999999999999999999999999876 789999655 46999 Q ss_pred HHHHHHHHHHHHHhhhc-cc-----------CCCCCccccCCCCC Q lcl|NC_012753. 470 KEQAQEIYQKINDETMV-ST-----------DSFRTSEEVDIYGE 502 (502) Q Consensus 470 deea~~el~ri~~E~~~-~~-----------~~~~~~~~~~~~g~ 502 (502) +++++++ +++++++.. .. +..+.+..++.=.| T Consensus 423 ~d~~~e~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (480) T protein:vir:78 423 ATQREQM-RDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) T ss_pred HhHHHHH-HHHHHHHHHHHHHHhhccccCCCccccCCCCCCCCCc Confidence 8866554 444433321 11 11112222222222 No 52 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=100.00 E-value=7.3e-53 Score=306.43 Aligned_cols=442 Identities=10% Similarity=0.026 Sum_probs=292.2 Q ss_pred CChhH-HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQ-TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~-~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) |+--. .|+.++++. .....++.+..+||.|+|++-.......+..+++++++|||++| T Consensus 1 ~~t~~~~i~~L~~~~---------------------~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~~i 59 (480) T protein:vir:78 1 MTTYHEHVERLQGLL---------------------ARDLPNLLEAEAYRNGTRRLKTIGIGAPPELAYLDVQPGWVATY 59 (480) T ss_pred CCCHHHHHHHHHHHH---------------------HHHHHHHHHHHHHHhccccccccccccchhHhhhhhhcchHHHH Confidence 44333 344444331 13457889999999999874222122233344667889999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE------e-CCceEEEEEcCCeE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYI------D-GDQIRVSFVQATVF 152 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~------d-~~~~~i~~v~~~~~ 152 (502) |+.++++++.+.+++ .+++..++.|+++++.|+|...+.+++..|+++|.+|+.+|. | ++.+++.+++|.++ T Consensus 60 vd~~~~~l~~~g~~~-~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~~~~~~d~~g~~~i~~~~p~~~ 138 (480) T protein:vir:78 60 LRTLSDRLDIEGFRI-SEDSEGLEELWNWWQANDLDEESVLGHDDSLTFGRSYITVSHPDVESGDPAGIPLIRVESPLYM 138 (480) T ss_pred HHHHHhhhccCceec-CCCchhHHHHHHHHHhcCHHHHHHHHHHHHhhcCceEEEEecCccccCCCCCeeEEEEEcccce Confidence 999999998777642 245667899999999999999999999999999999999985 2 35799999999999 Q ss_pred EEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCC Q lcl|NC_012753. 153 FPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGL 232 (502) Q Consensus 153 ~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~ 232 (502) +|+|.++......+++..++..++.. .+++++.|+.+. .++ |...+....+... ..+...|++ T Consensus 139 ~~~~D~~~~~~~~~~i~~~~~~~~~~--~~~~~~~y~~~~-~~~-----~~~~~~~~~~~~~---------~~~~~~~~~ 201 (480) T protein:vir:78 139 YAELDPRNTRRVTRAVRLYTTRDDVA--VPDRATLYLPDE-TVP-----LRRNGGLNDQWVV---------DGDVIKHGL 201 (480) T ss_pred EEEEcCCCccceEEEEEEEEeecCCC--ceEEEEEEeCCe-EEE-----EEecCCCcccccc---------ccccccCCC Confidence 99976543333333333333333222 234556654311 111 1111111111111 112345789 Q ss_pred CcceEEEecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccc-eeeechHHhccCCCCCCcccCcccc Q lcl|NC_012753. 233 TRPLFTYLKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQMIKTEYDTNGEKVTVKRE 310 (502) Q Consensus 233 ~~~~f~~~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~~l~~~~~~~g~~~~~~~~ 310 (502) +++|++.|+ |+.+.+.++|+|+|+. +++|+|+||+++|++++.++.... .+++ +....+..... ..+.. T Consensus 202 g~vPvv~f~----n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i----~G~~~~~~~~~-~~~~~ 272 (480) T protein:vir:78 202 GVVPVVPLT----NDPRLGNRYGRSEISPELRKVTDAASRTLMNLQSASQILGTPLRVI----SGVTTDELTND-GENTT 272 (480) T ss_pred CCcceEEee----cccccCCccCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh----hcCCccccccc-cccch Confidence 999999886 5567788999999985 899999999999999998874333 2333 11111000000 00000 Q ss_pred ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMR 390 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~ 390 (502) + ..+........+....+.+++ ....++|++.++.++++|+..+++++..||..+.+..||.++++++..|..+| T Consensus 273 ~----~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Alk~~~~~l~~ka 347 (480) T protein:vir:78 273 L----DIYYGRILTLASEAAKISEFK-AAELRNFAEEMEVFRKEAASITGLPPQYLSSSSENPASAEAIIATDSRIVKMA 347 (480) T ss_pred h----hhhhhhhccCCCCCceEEecC-ccCHHHHHHHHHHHHHHHhcccCCChHHhccccCcchHHHHHHHHHHHHHHHH Confidence 1 111110011111222344443 23578999999999999999999999999987777789999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCCHHHHHHhcCCC Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG--FAPKTMAIEKTLNV 468 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et~l~~~~~~ 468 (502) +.+++.|..+|++++++|+.+.. ......+..+.|.|.++.+.|..+.++++.+++++| ++|.+|++.. +|+ T Consensus 348 ~~~~~~f~~~l~~~~~l~~~~~g-----~~~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~-lg~ 421 (480) T protein:vir:78 348 ERKGRIFGGAWERAMRIAMQIMG-----REVTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARID-LGY 421 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHcC-----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhc-CCC Confidence 99999999999999999887642 122344567899999999999999999999999876 7999997665 689 Q ss_pred CHHHHHHHHHHHHHhhhccc------------CCCCCccccCCCCC Q lcl|NC_012753. 469 TKEQAQEIYQKINDETMVST------------DSFRTSEEVDIYGE 502 (502) Q Consensus 469 ~deea~~el~ri~~E~~~~~------------~~~~~~~~~~~~g~ 502 (502) +++++++ ++++++|+.... +..+.+..++.-+| T Consensus 422 ~~d~~~~-~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 466 (480) T protein:vir:78 422 TATQREQ-MRDWDKQETEDMIDTLYSTTKAQADATPKPTVTETKTE 466 (480) T ss_pred CHhHHHH-HHHHHHHHHHHHHHHhhccccccCCCCCCCCCCCCCCc Confidence 8876544 344444433210 11112222222233 No 53 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=100.00 E-value=1e-51 Score=300.08 Aligned_cols=429 Identities=12% Similarity=0.018 Sum_probs=285.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) =.-.+.|+++++.. .....+++...+||.|+|++........+..+++++++|||++|| T Consensus 3 ~~~~~~i~~l~~~~---------------------~~~~~r~~~l~~Yy~G~~~i~~~~~~~~~~~~~~k~~~n~~~~iv 61 (441) T protein:vir:80 3 SDELALIEGMYDRI---------------------QRLSSWHCCIEGYYEGSNRVRDLGVAIPPELQRVQTVVSWPGIAV 61 (441) T ss_pred ccHHHHHHHHHHHH---------------------HHHHHHHHHHHHHHhcCCcchhcCcccchhhhhhhhhcchHHHHH Confidence 11111233333321 123468899999999998753332223344457788999999999 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEcC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQANT 159 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d~ 159 (502) +.+|+++. +..|+++++ +.|+++++.|+|.....+++..++.+|.+|+++|.|+ |.+++.+++|.+++|+|.+. T Consensus 62 d~~~~~l~--~~g~~~~d~---~~l~~i~~~n~~~~~~~~~~~~~~~~G~a~~~v~~d~~g~~~i~~~~p~~~~~i~d~~ 136 (441) T protein:vir:80 62 DALEERLD--WLGWTNGDG---YGLDGVYAANRLATASCDVHLDALIFGLSFVAIIPHGDGTVSVRPQSPKNCTGKFSAD 136 (441) T ss_pred HHHHhhhc--cccccCCCh---HHHHHHHHhcCHHHHHHHHHHHHhhcCeeEEEEEeCCCCceEEEEEccceEEEEEeCC Confidence 99999995 455666653 4688899999999999999999999999999999985 67999999999999998665 Q ss_pred CCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 160 QDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) .+...++++. ++..+++ .+..+.|. .+..++ |...+. |..+ ......++++++|++. T Consensus 137 ~~~~~~~~~~-~~~~~~~----~~~~~vy~-~~~~~~-----~~~~~~---~~~~---------~~~~~~~~~g~vPvv~ 193 (441) T protein:vir:80 137 GSRLDAGLVV-QQTCDPE----VVEAELLL-PDVIVQ-----VERRGS---REWV---------EVDRIPNVLGAVPLVP 193 (441) T ss_pred CCceeEEEEE-EEEecCc----eEEEEEEe-cCeEEE-----EEEcCC---ccee---------eccccccCCCceeEEE Confidence 5544444433 3332221 22234442 222222 111111 1100 0112346788999988 Q ss_pred ecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc-CCCCCCcccCccccccccchh Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT-EYDTNGEKVTVKREFETGHNV 317 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~-~~~~~g~~~~~~~~~~~~~~~ 317 (502) |+ |+.....++|.|+|.. +++|||+||.++|++++..+....++.+ +.. ..+... .. .+...... T Consensus 194 ~~----n~~~~~~~~G~s~l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~----i~G~~~~~~~---~~--~~~~~~~~ 260 (441) T protein:vir:80 194 IV----NRRRTSRIDGRSEITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRW----VTGVSADEFS---QP--GWVLSMAS 260 (441) T ss_pred ee----ccccCCccCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcCceee----eecCCccccc---cc--hhhhcccc Confidence 86 4567788999999975 9999999999999999988765555444 211 111100 00 01111111 Q ss_pred hccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV 397 (502) Q Consensus 318 ~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~ 397 (502) +..+..+..+....+..++ .-..+.|++.++.+++.++..+++|+..||..+.+.+||.+++++++.|..++..+++.| T Consensus 261 i~~~~~~~~~~~~~~~~~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~~~~Sg~Al~~~~~~l~~k~~~~~~~f 339 (441) T protein:vir:80 261 VWAVDKDDDGDTPNVGSFP-VNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITSNPPSGEALAAEESRLVKRAERRQTSF 339 (441) T ss_pred cccCCCCCCCCcceeEecC-ccchHHHHHHHHHHHHHHhcccCCCHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222211111112232333 345788999999999999999999999999887777899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCC--CHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 398 EKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFA--PKTMAIEKTLNVTKEQAQE 475 (502) Q Consensus 398 ~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~--S~et~l~~~~~~~deea~~ 475 (502) +.+|++++++++.+... ..+.......+++.|++++|.|..++++.+.+++++|++ |.++++ ...|++++|+++ T Consensus 340 ~~~l~~~~~l~~~~~~~---~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~-~~l~~~~~e~~~ 415 (441) T protein:vir:80 340 GQGWLSVGFLAAKALDS---RVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVL-EMLGLDDVQVEA 415 (441) T ss_pred HHHHHHHHHHHHHHhcC---CCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHH-HhCCCCHHHHHH Confidence 99999999988876432 223333456789999999999999999999999999964 677765 556888877654 Q ss_pred HHHHHHHhhhccc-----CCCCCcccc Q lcl|NC_012753. 476 IYQKINDETMVST-----DSFRTSEEV 497 (502) Q Consensus 476 el~ri~~E~~~~~-----~~~~~~~~~ 497 (502) . ++.++|+.... .....+..+ T Consensus 416 ~-~~e~~e~~~~~~~~~~~~~~~~~~~ 441 (441) T protein:vir:80 416 V-MRHRAESSDPLAVLAGAISRQTNEV 441 (441) T ss_pred H-HHHHHHHHHHHHHHhhhhhcccccC Confidence 4 33343332211 111122222 No 54 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=100.00 E-value=6.3e-52 Score=301.29 Aligned_cols=438 Identities=12% Similarity=0.060 Sum_probs=288.8 Q ss_pred CChh------HHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecc Q lcl|NC_012753. 1 MGII------QTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLP 74 (502) Q Consensus 1 m~~~------~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n 74 (502) +++- ..+..++++ ...+..|+.+..+||.|+|++........+..+++++++| T Consensus 6 ~~~~~~~~~~~~~~~L~~~---------------------~~~~~~r~~~~~~YY~G~~~i~~~~~~~~~~~~~~~~~~n 64 (485) T protein:vir:24 6 PGQEEIADPAIARDEMVSA---------------------FEDQNQNLRSNTSYYEAERRPEAIGVTVPVQMQSLLAHVG 64 (485) T ss_pred CCCCcccchHHHHHHHHHH---------------------HHHHHHHHHHHHHHHhccCchhhcCcccchhhhhhhhccc Confidence 1110 011111211 1345678899999999998753322222233456778889 Q ss_pred hHHHHHHHHhhhhhcCcceEee-CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC---------CceEE Q lcl|NC_012753. 75 IGRTASKKVASLVFNEQATIRV-DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG---------DQIRV 144 (502) Q Consensus 75 ~~k~iv~~~a~~l~~ep~~i~~-~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~---------~~~~i 144 (502) ||++||+.+++||++.+++ + +++..++.+++++++|+|.....+++..|+++|.+|+.+|.++ +.++| T Consensus 65 ~~~~ivd~~~~~l~~~g~~--~~~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~~~~~~~~~~~~~~~i 142 (485) T protein:vir:24 65 YPRLYVDSIAERQAVEGFR--LGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNVPLI 142 (485) T ss_pred hHHHHHHHHhhhhccCcee--cCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCCCcceE Confidence 9999999999999887754 4 4466778899999999999999999999999999999999864 45789 Q ss_pred EEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC Q lcl|NC_012753. 145 SFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE 224 (502) Q Consensus 145 ~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~ 224 (502) ..++|.+++|+|.++.+... +++..++..++ ...++++.|+- +..++. +.. + |..+. T Consensus 143 ~~~~p~~~~~i~D~~~~~~~-~~~~~~~~~~~---~~~~~~~~y~~-~~~~~~----~~~-~----~~~~~--------- 199 (485) T protein:vir:24 143 RVEPPTRMYAEIDPRIGRPA-KAIRVAYDAEG---NEIQAATLYTP-NETFGW----FRA-E----GEWVE--------- 199 (485) T ss_pred EEeccceeEEEeeCCcCcee-EEEEEEEeecC---CeEEEEEEEcC-CcEEEE----Eec-C----CceEe--------- Confidence 99999999999765544333 33333333222 12334444432 222221 111 1 11111 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~ 303 (502) .....++++++|+++|+ |+.....++|+|+|+. +++|+|+||++.|++++..+....++.+- +....+..+. T Consensus 200 ~~~~~h~~g~vPvv~f~----n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i---~G~~~~~~~~ 272 (485) T protein:vir:24 200 WFSDPHGLGAVPVVPLP----NRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRLI---FGIKPEEIGV 272 (485) T ss_pred ecccccCCCcccEEEec----cCcccCCcCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchhhhh---ccCCcccccc Confidence 11123678899999886 4556788999999985 89999999999999998776433333220 1111110000 Q ss_pred cc-CccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHH Q lcl|NC_012753. 304 KV-TVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSE 382 (502) Q Consensus 304 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~ 382 (502) .. .....+.... ..+....+ .+..+.+++ ....++|++.++.++++++..+++++..||..+.+..||.+++++ T Consensus 273 ~~~~~~~~~~~~~---~~i~~~~~-~~~~~~q~~-~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~ 347 (485) T protein:vir:24 273 DPETGQTLFDAYL---ARILAFED-AEGKIQQFS-AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAA 347 (485) T ss_pred ccccccchhhhcc---cceeccCC-CCceEEeec-ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHH Confidence 00 0000111100 01111111 122233333 345789999999999999999999999999877777899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCCHHH Q lcl|NC_012753. 383 QSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG--FAPKTM 460 (502) Q Consensus 383 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et 460 (502) +..|.++|+.+++.|+.+|++++++++.+... .+.......++|.|.++.+.|..+.++.+.+++++| ++|++| T Consensus 348 ~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~----~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et 423 (485) T protein:vir:24 348 ESRLIKKVERKNAIFGGAWEEAMRLAYRLMKG----GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRER 423 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHH Confidence 99999999999999999999999998876432 223345678999999999999999999999999875 899999 Q ss_pred HHHhcCCCCHHHHHHHHHHHHHhhhccc--------CCCCCccccCCCCC Q lcl|NC_012753. 461 AIEKTLNVTKEQAQEIYQKINDETMVST--------DSFRTSEEVDIYGE 502 (502) Q Consensus 461 ~l~~~~~~~deea~~el~ri~~E~~~~~--------~~~~~~~~~~~~g~ 502 (502) ++ +++|++++++ ++++++++|+.... +......+.+=.+| T Consensus 424 ~~-~~l~~~~d~~-~e~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~e 471 (485) T protein:vir:24 424 AR-KDMGYSIAER-EEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPNPTP 471 (485) T ss_pred HH-hhCCCCHhHH-HHHHHHHHHHhhhhhhHHHhhcccCCCCCCCCCCCC Confidence 75 5679988765 46787776654211 11111111112222 No 55 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=100.00 E-value=5e-51 Score=296.37 Aligned_cols=437 Identities=12% Similarity=0.041 Sum_probs=288.8 Q ss_pred CCh-----------hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccccc Q lcl|NC_012753. 1 MGI-----------IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRD 69 (502) Q Consensus 1 m~~-----------~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~ 69 (502) |+. ...|+.+++.. .....|+....+||.|+|++........+..++. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~l~~~~---------------------~~~~~r~~~l~~YY~G~~~i~~~~~~~~~~~~~~ 59 (486) T protein:vir:42 1 MTAPLPGMEEIEDPAVVREEMISAF---------------------EDASKDLASNTSYYDAERRPEAIGVTVPREMQQL 59 (486) T ss_pred CCCCCCCCCCcccHHHHHHHHHHHH---------------------HHHHHHHHHHHHHhcccCcchhcccccchhHhhh Confidence 111 11222222221 1345688888999999987643221122223345 Q ss_pred ceecchHHHHHHHHhhhhhcCcceEeeC-CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC--------- Q lcl|NC_012753. 70 FNHLPIGRTASKKVASLVFNEQATIRVD-NEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG--------- 139 (502) Q Consensus 70 ~~~~n~~k~iv~~~a~~l~~ep~~i~~~-d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~--------- 139 (502) +.++|||++||+.+++++.... |+++ ++..++.+++++++|+|.....+++..|+++|.+|+.+|.++ T Consensus 60 ~~v~n~~~~iVd~~~~~l~~~g--~~~~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~ay~~v~~~e~~~~~~~~~ 137 (486) T protein:vir:42 60 LAHVGYPRLYVDSVAERQAVEG--FRLGDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSFITISKPDPQLDLGWDQ 137 (486) T ss_pred hhccchHHHHHHHHHhhhcccc--eecCCCchhHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccccCC Confidence 6678999999999999996554 4554 455667899999999999999999999999999999999753 Q ss_pred CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc Q lcl|NC_012753. 140 DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL 219 (502) Q Consensus 140 ~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~ 219 (502) +.++|..++|.+++++|.+..+. ..++++.++..+ +...+.+++|+. +..++. ...+ |.... T Consensus 138 ~~~~i~~~~p~~~~~i~d~~~~~-~~~~~~~~~~~~---~~~~~~~~~y~~-~~~~~~-----~~~~----~~~~~---- 199 (486) T protein:vir:42 138 NVPIIRVEPPTRMHAEIDPRINR-VSKAIRVAYDKE---GNEIQAATLYTP-METIGW-----FRAD----GEWAE---- 199 (486) T ss_pred CeeEEEEecccceEEEEeCCCCC-eEEEEEEEEecC---CCeEEEEEEEcC-CcEEEE-----EecC----CcEEe---- Confidence 35789999999999998655443 333443333222 223444555542 122221 1111 11110 Q ss_pred ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc-C Q lcl|NC_012753. 220 YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT-E 297 (502) Q Consensus 220 ~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~-~ 297 (502) .....|+++++|++.|+ |+.+...++|+|+|+. +++|+|+||+++|++.+..+....++.+ +.. . T Consensus 200 -----~~~~~h~~g~vPvv~~~----n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~----i~G~~ 266 (486) T protein:vir:42 200 -----WFNVPHGLGVVPVVPLP----NRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRL----IFGIK 266 (486) T ss_pred -----ecceecCCCCceEEEec----cccccCCCCCcccchhhHHHHHHHHHHHHHHHHHHHHhhcchHHH----hhcCC Confidence 11234788999999885 5567888999999985 8999999999999999876644333222 211 1 Q ss_pred CCCCCcc-cCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccH Q lcl|NC_012753. 298 YDTNGEK-VTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTA 376 (502) Q Consensus 298 ~~~~g~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tA 376 (502) ....+.. -.....+.... ..-+...+++ ..+.+++ ....++|++.++.++++++..+++++..||..+.+..|| T Consensus 267 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~~~~q~~-~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg 341 (486) T protein:vir:42 267 PEEIGVDSETGQTLFDAYL--ARILAFEDAE--GKIQQFS-AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASA 341 (486) T ss_pred ccccccccccccchhhhhh--chhcccCCCC--ceEEeec-ccCHHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHH Confidence 1000000 00001111100 0001111222 2233332 445789999999999999999999999999887777899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhc--C Q lcl|NC_012753. 377 TEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAA--G 454 (502) Q Consensus 377 tei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~--G 454 (502) .+++++++.|.++++.+++.|+.+|++++++++.+... .........+.|.|.++.+.|..+.++.+.+++++ | T Consensus 342 ~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~~----~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g 417 (486) T protein:vir:42 342 EAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMKG----GDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQG 417 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccC Confidence 99999999999999999999999999999998876532 11223446789999999999999999999999976 7 Q ss_pred CCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhccc--------CCCCCccccCCCCC Q lcl|NC_012753. 455 FAPKTMAIEKTLNVTKEQAQEIYQKINDETMVST--------DSFRTSEEVDIYGE 502 (502) Q Consensus 455 i~S~et~l~~~~~~~deea~~el~ri~~E~~~~~--------~~~~~~~~~~~~g~ 502 (502) ++|.+|++ .++|+++++ .+|++|+++|+.... +......+....+| T Consensus 418 ~~s~et~~-~~lg~~~d~-~~e~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 471 (486) T protein:vir:42 418 VIPRERAR-IDMGYSVKE-REEMRRWDEEEAAMGLGLLGTMVDADPTVPGSPSPTA 471 (486) T ss_pred CCCHHHHH-hcCCCChhH-HHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCCCCCC Confidence 89999975 567998875 457888887764211 11111222222222 No 56 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=100.00 E-value=7.3e-51 Score=295.45 Aligned_cols=442 Identities=11% Similarity=0.030 Sum_probs=288.5 Q ss_pred CChh---HHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGII---QTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~---~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k 77 (502) ++.+ +....+++.+ ..+ ...+..|+++..+||.|+|++........+..++++.++|||+ T Consensus 5 i~~~~~~~~~~~~~~~l------------~~~-----~~~~~~r~~~~~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n~~~ 67 (485) T protein:vir:10 5 LPGQEEIEDPAIARDEM------------VSA-----FEDSTQNLKTNTSYYEAERRPEAIGVTVPIQMQSLLAHVGYPR 67 (485) T ss_pred CCCCCCCCCHHHHHHHH------------HHH-----HHHHHHHHHHHHHHHhcCCcchhcCCCCChhhhhhhhhcCcHH Confidence 1222 1111111111 111 1234578999999999998864322222334446677789999 Q ss_pred HHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC---------CceEEEEEc Q lcl|NC_012753. 78 TASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG---------DQIRVSFVQ 148 (502) Q Consensus 78 ~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~---------~~~~i~~v~ 148 (502) +||+.+|++|+....++ -+++..++.++++++.|+|+....+++..|+++|.+|+.+|.++ +.++|.+++ T Consensus 68 ~ivd~~~~~l~~~g~~~-~~~~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~~e~~~~~~~~~~~~~i~~~~ 146 (485) T protein:vir:10 68 LYVDSIAERQAVEGFRF-GDADEADEELWQWWQANNLDIEAPLGYTDAYVHGRSYITISRPDPQIDLGWDPNTPIIRVEP 146 (485) T ss_pred HHHHHHHhhhcccceec-CCCchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEeeCCcccccccCCCeeEEEEEc Confidence 99999999997665442 24556788999999999999999999999999999999999863 467899999 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |.+++++|.+..+...+++. .++..++ ..++.+++|+.+ ..++ |...+. +... .... T Consensus 147 p~~~~~~~D~~~~~~~~~~~-~~~~~~~---~~~~~~~~y~~~-~~~~-----~~~~~~---~~~~----------~~~~ 203 (485) T protein:vir:10 147 PTRMYAEIDPRIGRVSKAIR-VAYDAEG---NEIQAATLYTPN-DIFG-----WYRVEN---EWQE----------WFNN 203 (485) T ss_pred cceeEEEEcCCCCceeEEEE-EEEeeCC---CeEEEEEEEeCC-eEEE-----EEEcCC---ceEE----------eccc Confidence 99999998655555444443 3333322 224445555421 1111 111111 1100 0112 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccceeeechHHhccC-CCCCCcc-c Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTE-YDTNGEK-V 305 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~-~~~~g~~-~ 305 (502) .|+++++|++.|. |+.+.+.++|+|+|+. +++|+|+||+++|++.+..+....++.+ +... .+..+.. - T Consensus 204 ~~~~g~vPvv~~~----n~~~~~~~~G~s~i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~----i~G~~~~~~~~~~~ 275 (485) T protein:vir:10 204 PHGLGVVPVVPIP----NRTRLSDLYGTSEITPELRSMTDAAARILMLMQATAELMGVPQRL----IFGIKPEEIGVDPE 275 (485) T ss_pred cCCCCcccEEEec----cccccCCCCCccchhHHHHHHHHHHHHHHHHHHHHHHhhcchHHH----HhcCCccccccccc Confidence 4788899999885 5567888999999985 8999999999999999877643333222 2111 1100000 0 Q ss_pred CccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHH Q lcl|NC_012753. 306 TVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSD 385 (502) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~ 385 (502) .....+..... .-+...++ +..+.+++ ....+.|++.++.++++++..+++++..||..+.+..||.++++++.. T Consensus 276 ~~~~~~~~~~~--~i~~~~~~--d~k~~q~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~n~~Sg~Al~~~~~~ 350 (485) T protein:vir:10 276 TGQTLFDAYLA--RILAFEDA--EGKIQQFS-AAELANFTNALDQIAKQVAAYTGLPPQYLSTAADNPASAEAIRAAESR 350 (485) T ss_pred ccchhhhhccc--ceeccCCC--CceEEeec-ccchHHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHH Confidence 00001111000 00111111 22233332 334688999999999999999999999999877777899999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCCHHHHHH Q lcl|NC_012753. 386 TYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG--FAPKTMAIE 463 (502) Q Consensus 386 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et~l~ 463 (502) |.++++.+++.|+.+|++++++++.+... .+.......+.|.|.++.|.|..+.++.+.+++++| ++|.+|++ T Consensus 351 l~~k~~~k~~~f~~~l~~~~~l~~~~~~~----~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~- 425 (485) T protein:vir:10 351 LIKKVERKNSIFGGAWEEAMRLAYRMMKG----GDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERAR- 425 (485) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHH- Confidence 99999999999999999999988876432 222334567899999999999999999999999876 89999976 Q ss_pred hcCCCCHHHHHHHHHHHHHhhhcc---------cCCCCCccccCCCCC Q lcl|NC_012753. 464 KTLNVTKEQAQEIYQKINDETMVS---------TDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~~~~~deea~~el~ri~~E~~~~---------~~~~~~~~~~~~~g~ 502 (502) .++|++++++ ++++++++|+... .+....+++.+-.++ T Consensus 426 ~~lg~~~~~~-~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 472 (485) T protein:vir:10 426 KDMGYSIAER-EEMRRWDEEEAAMGLGLIGTMVDPNPTVPGSPSPAPA 472 (485) T ss_pred HhCCCCHhHH-HHHHHHHHHHHHHHHHHHHHhhccCCCCCCCCCcccc Confidence 4579998865 5667776655321 011111111111111 No 57 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=100.00 E-value=9.2e-52 Score=300.38 Aligned_cols=436 Identities=11% Similarity=0.053 Sum_probs=279.2 Q ss_pred Hhhcccccchh-----hhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCc---cccccceecchHHHHHHHHhh Q lcl|NC_012753. 14 SNYVITNQSLN-----SITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGS---QVKRDFNHLPIGRTASKKVAS 85 (502) Q Consensus 14 ~~~~~~~~~l~-----~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~---~~~~~~~~~n~~k~iv~~~a~ 85 (502) |-+ ++...|. +.+...-+..-..+..|+....+||.|++++........+ ..-.++.++|||++||+.+++ T Consensus 1 ~~~-~p~~~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~~ 79 (479) T protein:vir:99 1 MID-LPDEDLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFAQ 79 (479) T ss_pred Ccc-CCcccCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHHh Confidence 111 1111111 1000000011124567899999999999886543222111 112234578999999999999 Q ss_pred hhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE-----e-CCceEEEEEcCCeEEEEEEcC Q lcl|NC_012753. 86 LVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYI-----D-GDQIRVSFVQATVFFPLQANT 159 (502) Q Consensus 86 ~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~-----d-~~~~~i~~v~~~~~~Pi~~d~ 159 (502) +++. ..|++.+...++.++++++.|+|.....+++..++++|.+|+.+|. | .+.++|.+++|.+++|+|.+. T Consensus 80 ~l~~--~gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~~~G~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~iydd~ 157 (479) T protein:vir:99 80 QLIV--DGYRKTGTNENAKGWDTWRLNQMDKQQFWLNRAVLTFGYAFIKVTSGISPLDGTTVARIKCIDPRDAFAIWEDP 157 (479) T ss_pred hccc--ccccCCCchhhHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCcCCCCceEEEEechhheEEEecCC Confidence 9964 4577888888889999999999999999999999999999999985 3 357899999999999998665 Q ss_pred CCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 160 QDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) ......++..++. ......+|+ ...+++ |...+ |... ......|+++++|+++ T Consensus 158 ~~~~~~~~~~~~~--~~~~~~~~~--------~~~~~~----~~~~~----~~~~---------~~~~~~h~~g~vPvv~ 210 (479) T protein:vir:99 158 YWDEWPKYLLERQ--PNGQYWWWT--------EEDYSI----FEFKQ----GKFI---------YRETVSHDYGHIPFVR 210 (479) T ss_pred cccceeeEEEeec--CceeEEEEe--------cceEEE----EEecC----Ccee---------eccccccCCCCcceEE Confidence 5443333332211 111111221 111111 11111 1100 0122347789999999 Q ss_pred ecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhc Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYE 319 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~ 319 (502) |+++. +. .++|+|+|+.+++|||+||++.|++.+.++.....+.+ +.......+......+. . ....+ T Consensus 211 f~n~~----~~-~~~g~sd~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~----i~G~~~~~~~~~~~~~~-~--~~~~~ 278 (479) T protein:vir:99 211 YVNVM----DL-RGVCYGDVEPLVTVAKAIDKTGLDILLVQHHQSFQIRW----ATGLMLPEGANADQEKM-R--FAQES 278 (479) T ss_pred eecCC----Cc-CcCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhh----hcCCCcccccccchhcc-c--ccccc Confidence 87542 33 46899999999999999999999999988765554433 21111111110011000 0 01111 Q ss_pred cccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 320 QFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEK 399 (502) Q Consensus 320 ~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~ 399 (502) .+...+++ ..+.+++ +...++|.+.++.++++|+..+++++..||.. ++.||.++++++..|..+++.+++.|+. T Consensus 279 i~~~~~~~--~~~~q~~-~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~~--~n~Sg~Al~~~~~~l~~ka~~~~~~f~~ 353 (479) T protein:vir:99 279 MLISQNEK--ASFGAIP-AAPLDGLLNAYKESLLEFLALAQLPPHIAGQI--VNVAADALAAGTRQTMQKLFEKQATWKA 353 (479) T ss_pred ceeecCCC--ceEEEec-ccchHHHHHHHHHHHHHHhccCCCCHHHcccc--cchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11122222 2244444 34579999999999999999999999999863 4478999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHH Q lcl|NC_012753. 400 SLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQK 479 (502) Q Consensus 400 ~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~r 479 (502) +|++++++++.+... .....+..+++.|.+..+.|..+.++.+.+++++|++|.+|+++.++++++++++++.+ T Consensus 354 al~~~~~l~~~~~~~-----~~~~~~~~i~~~w~~~~~~s~~~~ad~~~kl~~ag~is~et~l~~l~gv~~~~~e~~~~- 427 (479) T protein:vir:99 354 SHNQTMRLVNKIEGR-----TEEATDLDFTITWQDVTIQSLAQFADAWAKMVESLKIPAEGVWDMIPNLDQSTVNGWKE- 427 (479) T ss_pred HHHHHHHHHHHHcCC-----CccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHH- Confidence 999999998766421 11233457899999999999999999999999999999999999998999877654322 Q ss_pred HHHhhhc-------ccCCC-CCccccCCCCC Q lcl|NC_012753. 480 INDETMV-------STDSF-RTSEEVDIYGE 502 (502) Q Consensus 480 i~~E~~~-------~~~~~-~~~~~~~~~g~ 502 (502) .++++.+ ..+.. +....+...|. T Consensus 428 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 458 (479) T protein:vir:99 428 IYDREGDFGKYMRKLQNGPDPAEQRGGPNGA 458 (479) T ss_pred HHHHHHHHHHHHHHHhcccCcccccCCCCCC Confidence 2222111 00110 01111122222 No 58 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=100.00 E-value=7.5e-51 Score=295.39 Aligned_cols=444 Identities=11% Similarity=0.075 Sum_probs=285.0 Q ss_pred CChhH---HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccc-ccC-CCccccccceecch Q lcl|NC_012753. 1 MGIIQ---TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTY-RDS-NGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~---~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~-~~~-~~~~~~~~~~~~n~ 75 (502) |.--+ .++.++++ -.....|+....+||.|+|++... +.. ...+..++++++|| T Consensus 1 ~~~~t~~~~~~~l~~~---------------------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~ 59 (456) T protein:vir:10 1 MTASTPAEWLPVLTKR---------------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNW 59 (456) T ss_pred CCCCCHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcch Confidence 22111 22222221 123457889999999999875321 111 11222356789999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC-CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEE Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD-NEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFF 153 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~-d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~ 153 (502) |++||+.+++|++++|+++..+ |.+..+.++++++.|+|.....+++..++++|.+|..+|.|+ |.++|..++|.+++ T Consensus 60 ~~~ivd~~~~~l~~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~ 139 (456) T protein:vir:10 60 GLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMV 139 (456) T ss_pred HHHHHHHHHhhhccCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeE Confidence 9999999999999999998765 345677899999999999999999999999999999999975 67999999999999 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC-cceeecCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE-ETVTLNGL 232 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~-~~~~~~~~ 232 (502) |++.+..+....++++. +...+....+.+ ++..-.-.+++.....+...+. ...+.. ...+. .....+++ T Consensus 140 ~i~d~~~~~~~~~~i~~-~~~~d~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----~~~~~~--~~~~~~~~~~~~~~ 210 (456) T protein:vir:10 140 VSVDPLQPWRIRAAMRW-WRDLDAESDFAI--VWSGDGWQKFARPCFVQSSSRR----RLVTRI--SDSWVPVGDAVVTG 210 (456) T ss_pred EEEcCCCCcceEEEEEE-EEecCCceeEEE--EEeccceeEEEEEEEEeecccc----eeeeec--CCceeeccccCCCC Confidence 99876554433333333 332222222221 2111001112221111111110 000000 00011 01123567 Q ss_pred CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc--CCCCCCcccCcccc Q lcl|NC_012753. 233 TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT--EYDTNGEKVTVKRE 310 (502) Q Consensus 233 ~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~--~~~~~g~~~~~~~~ 310 (502) ++||+++|.| +.|+|+|+.+++|+|++|.++|+.+++.+....++.+-..+-.. ..+..|........ T Consensus 211 ~~~pvv~~~N----------~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~ 280 (456) T protein:vir:10 211 SPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASI 280 (456) T ss_pred CceeEEEecC----------CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhh Confidence 8899888742 46899999999999999999999998776433322220010000 01222222221111 Q ss_pred ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMR 390 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~ 390 (502) |..... .+-..+++ ..+..++ ....++|.+.++.++++|+..+++++..||...+ +.||.++++++..|.+++ T Consensus 281 ~~~~~~---~~~~~~~~--~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~Ai~~~~~~l~~k~ 353 (456) T protein:vir:10 281 FEAAPG---ALWELPPG--VDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKC 353 (456) T ss_pred hhhhcc---ccccCCCC--cceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHHHHHHHHHHHHHH Confidence 111110 11111122 2233333 3456889999999999999999999999987554 458999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) +.+++.|+++|++++++++.+. + ......+++.|.++.|.|..++++.+++++++|++|.++++ .++|+++ T Consensus 354 ~~~~~~f~~~l~~~~rl~~~~~-------g-~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~-~~lg~~~ 424 (456) T protein:vir:10 354 EDRLSIAKIGLEAILVKALQIE-------G-ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRR-NILNYNA 424 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHhc-------C-CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHH-hhCCCCH Confidence 9999999999999999887553 1 22345789999999999999999999999999999999964 5679998 Q ss_pred HHH-HHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 471 EQA-QEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea-~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++ ++|++|+++|+..........+ +-.|- T Consensus 425 ~~i~~~e~er~~~e~~~~~~~~~~~~--~~~~~ 455 (456) T protein:vir:10 425 DQIKQDDLDRAREQITLFAGNPVQRP--QEDGS 455 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhcC--CCCCC Confidence 775 4578899888654322111110 01111 No 59 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=100.00 E-value=7.5e-51 Score=295.39 Aligned_cols=444 Identities=11% Similarity=0.075 Sum_probs=285.0 Q ss_pred CChhH---HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccc-ccC-CCccccccceecch Q lcl|NC_012753. 1 MGIIQ---TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTY-RDS-NGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~---~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~-~~~-~~~~~~~~~~~~n~ 75 (502) |.--+ .++.++++ -.....|+....+||.|+|++... +.. ...+..++++++|| T Consensus 1 ~~~~t~~~~~~~l~~~---------------------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~k~~~n~ 59 (456) T protein:vir:10 1 MTASTPAEWLPVLTKR---------------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNW 59 (456) T ss_pred CCCCCHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHhcCCCchhcCcccChhhhhhhhhhhcch Confidence 22111 22222221 123457889999999999875321 111 11222356789999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC-CHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEE Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD-NEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFF 153 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~-d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~ 153 (502) |++||+.+++|++++|+++..+ |.+..+.++++++.|+|.....+++..++++|.+|..+|.|+ |.++|..++|.+++ T Consensus 60 ~~~ivd~~~~~l~~~~~~~~~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~d~~g~~~i~~~~p~~~~ 139 (456) T protein:vir:10 60 GLMVRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMV 139 (456) T ss_pred HHHHHHHHHhhhccCCeecCCCCCcchHHHHHHHHHhcChhhHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEEccceeE Confidence 9999999999999999998765 345677899999999999999999999999999999999975 67999999999999 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC-cceeecCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE-ETVTLNGL 232 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~-~~~~~~~~ 232 (502) |++.+..+....++++. +...+....+.+ ++..-.-.+++.....+...+. ...+.. ...+. .....+++ T Consensus 140 ~i~d~~~~~~~~~~i~~-~~~~d~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----~~~~~~--~~~~~~~~~~~~~~ 210 (456) T protein:vir:10 140 VSVDPLQPWRIRAAMRW-WRDLDAESDFAI--VWSGDGWQKFARPCFVQSSSRR----RLVTRI--SDSWVPVGDAVVTG 210 (456) T ss_pred EEEcCCCCcceEEEEEE-EEecCCceeEEE--EEeccceeEEEEEEEEeecccc----eeeeec--CCceeeccccCCCC Confidence 99876554433333333 332222222221 2111001112221111111110 000000 00011 01123567 Q ss_pred CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc--CCCCCCcccCcccc Q lcl|NC_012753. 233 TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT--EYDTNGEKVTVKRE 310 (502) Q Consensus 233 ~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~--~~~~~g~~~~~~~~ 310 (502) ++||+++|.| +.|+|+|+.+++|+|++|.++|+.+++.+....++.+-..+-.. ..+..|........ T Consensus 211 ~~~pvv~~~N----------~~g~gd~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~~~~~~~~ 280 (456) T protein:vir:10 211 SPPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGNAIDYASI 280 (456) T ss_pred CceeEEEecC----------CCCCchhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCcccccccccccccchhhh Confidence 8899888742 46899999999999999999999998776433322220010000 01222222221111 Q ss_pred ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMR 390 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~ 390 (502) |..... .+-..+++ ..+..++ ....++|.+.++.++++|+..+++++..||...+ +.||.++++++..|.+++ T Consensus 281 ~~~~~~---~~~~~~~~--~~~~q~~-~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~Sg~Ai~~~~~~l~~k~ 353 (456) T protein:vir:10 281 FEAAPG---ALWELPPG--VDIWESQ-ANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKC 353 (456) T ss_pred hhhhcc---ccccCCCC--cceEEec-ccChhHHHHHHHHHHHHHHhccCCChHHhccccc-ChHHHHHHHHHHHHHHHH Confidence 111110 11111122 2233333 3456889999999999999999999999987554 458999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) +.+++.|+++|++++++++.+. + ......+++.|.++.|.|..++++.+++++++|++|.++++ .++|+++ T Consensus 354 ~~~~~~f~~~l~~~~rl~~~~~-------g-~~~~~~~~v~w~~~~~~~~~~~ada~~kl~~~gi~~~~~~~-~~lg~~~ 424 (456) T protein:vir:10 354 EDRLSIAKIGLEAILVKALQIE-------G-ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRR-NILNYNA 424 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHhc-------C-CCcccceeEEecCCCCcCHHHHHHHHHHHHHcCCChHHHHH-hhCCCCH Confidence 9999999999999999887553 1 22345789999999999999999999999999999999964 5679998 Q ss_pred HHH-HHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 471 EQA-QEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea-~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++ ++|++|+++|+..........+ +-.|- T Consensus 425 ~~i~~~e~er~~~e~~~~~~~~~~~~--~~~~~ 455 (456) T protein:vir:10 425 DQIKQDDLDRAREQITLFAGNPVQRP--QEDGS 455 (456) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhhcC--CCCCC Confidence 775 4578899888654322111110 01111 No 60 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=100.00 E-value=3.8e-51 Score=297.00 Aligned_cols=454 Identities=10% Similarity=0.029 Sum_probs=285.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccc-ccCCCcccc-ccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTY-RDSNGSQVK-RDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~-~~~~~~~~~-~~~~~~n~~k~ 78 (502) +-..+.+.++++++.. ....+..|++...+||.|+|+.... +........ .++.++|||++ T Consensus 22 ~~~~~~~~~l~~~l~~-----------------~~~~~~~rl~~l~~YY~G~~~~~~~~~~~~~~~~~~~~~~v~n~~~~ 84 (501) T protein:vir:25 22 SMSREQLGALVADMWR-----------------LHISERQWLDRIYEYTKGLRGRPEVPEGASDEVKELAKLSVKNVLSL 84 (501) T ss_pred cCChHHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHHhcCCCchhccccCChhhhhhHhhhhcChHHH Confidence 1112222222222110 1123456888899999999874221 122222222 23456799999 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEc Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQAN 158 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d 158 (502) ||+.+|++++.+ .|++.++..++.++++++.|+|.....+++..|+++|.+|+.+|.++++++|.+++|.+++++|.| T Consensus 85 ivd~~a~~l~~~--gf~~~d~~~~~~l~~i~~~N~~d~~~~~~~~~a~i~G~ay~~v~~de~~~~i~~~sp~~~~~iy~D 162 (501) T protein:vir:25 85 VRDSFAQNLSVV--GYRNALAKENDPAWEMWQRNRMDARQAEVHRPALTYGASYVTVTPTDEGPVFRTRSPRQILAVYAD 162 (501) T ss_pred HHHHHHhhhccc--ceecCCccchHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCCCCeEEEeccccEEEEEec Confidence 999999999654 577877778888999999999999999999999999999999999987789999999999999877 Q ss_pred CCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEE-EEEEEecCCccccCceeecc--ccccCCCcceeecCCCcc Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTI-SNELYESESKTIIGQRVPLS--TLYEDLEETVTLNGLTRP 235 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I-~~~l~~~~~~~~lG~~v~l~--~~~~~l~~~~~~~~~~~~ 235 (502) ........++.+++......+. .++.+.|... ..|+. .+.++............+.. .+....+.....++++.+ T Consensus 163 ~~~~~~~~~ai~~~~~~~~~~~-~~~~~~y~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 240 (501) T protein:vir:25 163 PSVDAWPQYALETWVAQKDAKP-HRRGVLYDDT-YMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPVC 240 (501) T ss_pred CCCCcceeEEEEEEeeccccCc-ceeEEEecCe-eEEEEecCceeeeeccccccccccccccccccccccccccCCccce Confidence 5433222222233222111110 1112222110 01111 01111000000000000000 001112222345778899 Q ss_pred eEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccc Q lcl|NC_012753. 236 LFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGH 315 (502) Q Consensus 236 ~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~ 315 (502) |++.|+|+ .+ ..++|+|+|+.+++|+|+||++.|+..+..+....+..+ +.... +... ..+.... T Consensus 241 Piv~f~N~----~~-~~~~g~sdie~v~~l~Da~~~~~s~~~~~~e~~a~p~~~----i~G~~---~~~~---~~~~~~~ 305 (501) T protein:vir:25 241 PVVRFVNG----RD-ADDMIVGEVAPLILLQQAINSVNFDRLIVSRFGANPQRV----ISGWT---GSKA---EVLKASA 305 (501) T ss_pred eeEeccCc----cc-cCccccchhhhhHHHHHHHHHHHHHHHHHHHhhccHHHH----HhCCC---CCcc---chhhhcc Confidence 99988654 23 357899999999999999999999999987754443222 32111 1111 1111111 Q ss_pred hhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 316 NVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIAT 395 (502) Q Consensus 316 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~ 395 (502) . .-+...+++ ..+.+++ ....+.|.+.++.+++.|+..+++|+..|+...+ +.||.++++++..|.+++..+++ T Consensus 306 ~--~i~~~~~~~--~~~~q~~-~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~-N~Sg~Al~~~~~~l~~ka~~k~~ 379 (501) T protein:vir:25 306 L--RVWTFEDPE--VKAQAFP-PASVEPYNLILEEMLQHVAMVAQISPAQVTGKMI-NVSAEALAAAEANQQRKLAAKRE 379 (501) T ss_pred c--ceeccCCCC--ceEEEec-ccChHHHHHHHHHHHHHHHhhcCCChhhhccccC-ChHHHHHHHHHHHHHHHHHHHHH Confidence 1 111222222 2333333 3446789999999999999999999999985544 45899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 396 LVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQE 475 (502) Q Consensus 396 ~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~ 475 (502) .|+.+|++++++++.+.. +.....+..+++.|.++.|.|..++++.+.+++++|+ |.+|.+..++|++++++++ T Consensus 380 ~f~~~l~~~~rl~~~~~~-----~~~~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gi-s~et~~~~~~g~~~~~ie~ 453 (501) T protein:vir:25 380 SFGESWEQLLRLAAEMDD-----DPDTAADSGAEVLWRDTEARSFGAVVDGITKLASAGI-PIEHLLSMVPGMTQQTIQA 453 (501) T ss_pred HHHHHHHHHHHHHHHHhC-----CCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcCC-CHHHHHHHcCCCCHHHHHH Confidence 999999999998886643 1222345679999999999999999999999998885 9999999999999988766 Q ss_pred HHHHHHHhhhcc--------cCCCCCccc----------cCCCCC Q lcl|NC_012753. 476 IYQKINDETMVS--------TDSFRTSEE----------VDIYGE 502 (502) Q Consensus 476 el~ri~~E~~~~--------~~~~~~~~~----------~~~~g~ 502 (502) +.++.+++.+.. .+....+.. ++.-|+ T Consensus 454 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 498 (501) T protein:vir:25 454 IKDSLRGGEVKSLVDKLLSNEPAPVPPPPPQAAAQALNEGGVNGN 498 (501) T ss_pred HHHHHHHHhHHHHHHHhhccCcCCCCCCCCCCCccccccccCCCC Confidence 655544443311 011111111 111111 No 61 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=100.00 E-value=2.6e-50 Score=292.45 Aligned_cols=440 Identities=10% Similarity=0.062 Sum_probs=285.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccc-cCCCcc-ccccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYR-DSNGSQ-VKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~-~~~~~~-~~~~~~~~n~~k~ 78 (502) ..--+.++.+++. -.....++++..+||.|+|++.... ...... ..+++.++|||++ T Consensus 4 ~t~~~~~~~l~~~---------------------~~~~~~r~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~n~~~~ 62 (456) T protein:vir:79 4 STPAEWLPVLTKR---------------------IDDGMSRVRLLARYSNGDAPLPELTRNTSAAWRSFQREARTNWGLM 62 (456) T ss_pred CCHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHhccCChhhcCcccChhhchhhhhhhcchHHH Confidence 1222233333322 1234568899999999998864221 112222 2234577899999 Q ss_pred HHHHHhhhhhcCcceEeeCC-HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEE Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDN-EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQ 156 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d-~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~ 156 (502) ||+++|+|++++|+++..++ ...++.+++++++|+|.....+++..++++|.+|+++|.++ |.+++..++|.+++|+| T Consensus 63 ivd~~~~~l~~~g~~~~~~~d~~~~~~~~~~~~~n~~d~~~~~~~~~a~~~G~a~~~~~~~edg~~~i~~~~p~~~~~i~ 142 (456) T protein:vir:79 63 VRDSVADRIIPNGITVGGSADSDLALRARRIWRDNRMDSVCKQWVKYGLDFGESYLTCWRRDDGTATITADSPETMVVSV 142 (456) T ss_pred HHHHHHhhhccCCeecCCCCCccHHHHHHHHHHhcChhHHHHHHHHHHhhcCeeEEEEeeCCCCceEEEEeccceeEEEE Confidence 99999999999999987754 45678899999999999999999999999999999999974 67999999999999998 Q ss_pred EcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEE--EEEEEEecCCccccCceeeccccccCCC-cceeecCCC Q lcl|NC_012753. 157 ANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYT--ISNELYESESKTIIGQRVPLSTLYEDLE-ETVTLNGLT 233 (502) Q Consensus 157 ~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~--I~~~l~~~~~~~~lG~~v~l~~~~~~l~-~~~~~~~~~ 233 (502) .+..+....++++. +...+....+.+ .+ ..++.++ .....+...+ ...+......+. .....++++ T Consensus 143 d~~~~~~~~~~~~~-~~~~d~~~~~~~--~~--~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~ 211 (456) T protein:vir:79 143 DPLQPWRIRSAMRW-WRDLDAESDFAI--VW--SGDGWQKFARPCFVQSSSR------RRLVTRISDSWVPVGDAVVTGS 211 (456) T ss_pred cCCCCCceEEEEEE-EEecCCceeEEE--EE--cCCceEEEEEEEEeecccc------ceeeeccCCceeecccccCCCC Confidence 76555444444333 332222222211 11 1112111 1111111100 000000001111 112346778 Q ss_pred cceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeee--chHHhccCCCCCCcccCccccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV--PTQMIKTEYDTNGEKVTVKREF 311 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v--~~~~l~~~~~~~g~~~~~~~~~ 311 (502) +||+++|.| +.|+|+|+++++|||+||+++|+.+++.+....++.+ ....-....+..|........+ T Consensus 212 ~~pvv~~~N----------~~~~gd~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i~~~~~~ 281 (456) T protein:vir:79 212 PPPVVVYQN----------PDGMGEVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAIDYASIF 281 (456) T ss_pred ceeEEEecC----------CCCCchhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCcccccccccccccchhhhh Confidence 999988842 4689999999999999999999999877643322222 1100000112222222111111 Q ss_pred cccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHH Q lcl|NC_012753. 312 ETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRN 391 (502) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~ 391 (502) .... ..+-..+++ ..+..++ +...+.|.+.++.++++|+..+++++..|+...+ +.||.++++++..|.++++ T Consensus 282 ~~~~---~~~~~~~~~--~~~~q~~-~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~Sg~Al~~~~~~l~~k~~ 354 (456) T protein:vir:79 282 EAAP---GALWELPPG--VDIWESQ-TNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQSAEGAHNIEKGFLFKCE 354 (456) T ss_pred hhhc---cccccCCCC--cceeeec-ccChHHHHHHHHHHHHHHHhhcCCChhHhccccc-CcHHHHHHHHHHHHHHHHH Confidence 1111 111111222 2233332 3456889999999999999999999999986554 4589999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH Q lcl|NC_012753. 392 SIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKE 471 (502) Q Consensus 392 ~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~de 471 (502) .+++.|+++|++++++++.+.. ......+.|.|.++.+.|..++++.+++++++|++|.++++ ..+|++++ T Consensus 355 ~~~~~f~~~l~~~~~l~~~~~g--------~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~G~~~~~~~~-~~lg~~~~ 425 (456) T protein:vir:79 355 DRLSIAKIGLEAILVKALQIEG--------ESVEDTVDVSFESPDRVTLGEKYSAASLAKAAGESWASIRR-NILNYNAD 425 (456) T ss_pred HHHHHHHHHHHHHHHHHHHhcC--------CCccccceEEeCCCCCcCHHHHHHHHHHHHhcCCChHHHHH-hcCCCCHH Confidence 9999999999999998876531 22345789999999999999999999999999999999975 46799887 Q ss_pred HH-HHHHHHHHHhhhcccCCCC---CccccC Q lcl|NC_012753. 472 QA-QEIYQKINDETMVSTDSFR---TSEEVD 498 (502) Q Consensus 472 ea-~~el~ri~~E~~~~~~~~~---~~~~~~ 498 (502) ++ ++|++|+++|......... +++..- T Consensus 426 ~i~~~e~~r~~~e~~~~~~~~~~~~~~~~~~ 456 (456) T protein:vir:79 426 QIKQDDLDRAREQITLFAGNPVQRPQEDGSR 456 (456) T ss_pred HHHHHHHHHHHHHHHHHhhhHhhcCCCCCCC Confidence 64 5678888888654432211 111111 No 62 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=100.00 E-value=3.4e-50 Score=291.75 Aligned_cols=442 Identities=10% Similarity=0.044 Sum_probs=282.4 Q ss_pred CCh----hHHH--HHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecc Q lcl|NC_012753. 1 MGI----IQTI--KNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLP 74 (502) Q Consensus 1 m~~----~~~i--k~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n 74 (502) |+. -+-+ ..++++ ++. .-.+...++..+.+||.|++++-.......+..++.+.++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~------------l~~-----~~~~~~~rl~~l~~Yy~G~~~i~~~~~~~~~~~~~~~~~~n 63 (484) T protein:vir:77 1 MTSPLQKQENVDPEKAREE------------MLN-----LFTERTQDLGDNTAYYESERRPDAVGVTVPQQMQKLLAHVG 63 (484) T ss_pred CCCcccccCCCCHHHHHHH------------HHH-----HHHHHHHHHHHHHHHHhccccchhcccccchhHHhhhhhcC Confidence 111 1100 000100 000 00123357778899999998752221112233334456789 Q ss_pred hHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc---------eEEE Q lcl|NC_012753. 75 IGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQ---------IRVS 145 (502) Q Consensus 75 ~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~---------~~i~ 145 (502) ||++||+.++++++...+++. +++..++.+++++++|+|.....+++..|+++|.+|+++|.++++ ++|. T Consensus 64 ~~~~ivd~~~~~l~~~g~~~~-~~~~~~~~l~~i~~~N~~d~~~~~~~~~a~~~G~a~~~v~~~~~~~~~~~~~~~~~i~ 142 (484) T protein:vir:77 64 YPRLYIDAIAARQELEGFRLG-GADKADEQLWDWWQANDLDIESTLGHTDSLVHGRSYITISKPDPNIDPGVDPEVPIIR 142 (484) T ss_pred cHHHHHHHHHhhhccCceecC-CcchhHHHHHHHHHhcCHhHHHHHHHHHHhhcCceEEEEecCCCCcccccccccceEE Confidence 999999999999988776532 445677889999999999999999999999999999999997542 5799 Q ss_pred EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCc Q lcl|NC_012753. 146 FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEE 225 (502) Q Consensus 146 ~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~ 225 (502) +++|.+++++|.+..+. ..+++ +++..+.... .+.++.|. .+..++. +.. + |. |.- . T Consensus 143 ~~~p~~~~~~~D~~~~~-~~~a~-~~~~~~~~~~--~~~~~~y~-~~~~~~~----~~~-~----~~-------~~~--~ 199 (484) T protein:vir:77 143 VEPPTNLYAQIDPRTRQ-VMRAI-RAIEDEEGNE--VIGATLYL-PNNTVIW----NRE-D----GQ-------WVQ--V 199 (484) T ss_pred EeccceeEEEecCCCCc-eEEEE-EEEEeecCCc--EEEEEEEe-cCeEEEE----Eec-C----Cc-------eEe--e Confidence 99999999997654333 23333 3333332222 23344443 1222221 221 1 11 110 1 Q ss_pred ceeecCCCcceEEEecCCccccccccCcCCcchhh-hHHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc-CCCCCC- Q lcl|NC_012753. 226 TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFD-NAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT-EYDTNG- 302 (502) Q Consensus 226 ~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~-~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~-~~~~~g- 302 (502) +...|+++++|++.|. |+...+.++|+|+|+ .+++|+|+||+++|++++..+....+..+ +.. ..+.-. T Consensus 200 ~~~~~~~g~vPvv~f~----N~~~~~~~~G~s~i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~----i~G~~~~~~~~ 271 (484) T protein:vir:77 200 ANVAHNLEMVPVIPIP----NRTRLSDLYGTTEITPELRSVTDAAARTLMLMQATAELMGVPQRL----LFGVKGEELGV 271 (484) T ss_pred ccccCCCCCcceEEec----cccccCccCCcccchHHHHHHHHHHHHHHHHHHHHHHhhhhhHHH----HhCCCcchhcc Confidence 1234788999999886 455788899999998 59999999999999999987643322221 211 000000 Q ss_pred cccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHH Q lcl|NC_012753. 303 EKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSE 382 (502) Q Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~ 382 (502) ........+...... +-...+ .+..+.+++ ....+.|++.++.++++++..+++++..||..+.+..||.+++++ T Consensus 272 ~~~~~~~~~~~~~~~---~~~~~~-~~~~~~q~~-~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~n~~Sg~Al~~~ 346 (484) T protein:vir:77 272 DPETGQTLFDAYLAR---ILAFED-HESKAQQFS-AAELRNFVDALDALDRKAAAYTGLPPYYLSFSSENPASAEAIRSS 346 (484) T ss_pred cccccchhhhhhhhh---hcccCC-CCceeEeec-CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHHHHH Confidence 000000011111111 111111 122233343 334688999999999999999999999999877777899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCCHHH Q lcl|NC_012753. 383 QSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG--FAPKTM 460 (502) Q Consensus 383 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et 460 (502) ++.|.++++.+++.|+.+|++++++++.+... .........+.|.|.++.+.|..+.++.+.+++++| ++|.+| T Consensus 347 ~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~~----~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et 422 (484) T protein:vir:77 347 ESRLVKTVERKNKIFGGAWEQAMRVAYKVMNG----GDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKER 422 (484) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC----CCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHH Confidence 99999999999999999999999998876432 122234467899999999999999999999999876 999999 Q ss_pred HHHhcCCCCHHHHHHHHHHHHHhhhccc----CCCC--CccccCCCCC Q lcl|NC_012753. 461 AIEKTLNVTKEQAQEIYQKINDETMVST----DSFR--TSEEVDIYGE 502 (502) Q Consensus 461 ~l~~~~~~~deea~~el~ri~~E~~~~~----~~~~--~~~~~~~~g~ 502 (502) ++.. +|++++++ +|++++++|+.... +... .+.+++-.+. T Consensus 423 ~~~~-l~~~~~~~-~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~ 468 (484) T protein:vir:77 423 ARID-MGYSITER-EEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPDN 468 (484) T ss_pred HHhc-CCCChhHH-HHHHHHHHHHHHHHHHHHhhhccccccCCCCCCC Confidence 7655 69988765 45777776653211 0000 0001111110 No 63 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=100.00 E-value=7.1e-50 Score=290.03 Aligned_cols=440 Identities=10% Similarity=0.024 Sum_probs=286.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) +.=-+.|+.++++. .....|++...+||.|++++........+..+++++++|||++|| T Consensus 7 ~d~~~~i~~L~~~~---------------------~~~~~r~~~~~~Yy~g~~~i~~~~~~~~~~~~~~~~~~n~~~~iv 65 (488) T protein:vir:23 7 IDPEKLRDQLLDAF---------------------ENKQNELKSSKAYYDAERRPDAIGLAVPLDMRKYLAHVGYPRTYV 65 (488) T ss_pred CCHHHHHHHHHHHH---------------------HHHHHHHHHHHHHHhcccchhhcCcccchhhhhhhhhcchHHHHH Confidence 22222223332221 122468888899999998753322222344456788899999999 Q ss_pred HHHhhhhhcCcc------eE---eeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe---------CCce Q lcl|NC_012753. 81 KKVASLVFNEQA------TI---RVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYID---------GDQI 142 (502) Q Consensus 81 ~~~a~~l~~ep~------~i---~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d---------~~~~ 142 (502) +.+|++|+-+.. .. ..+++...+.|+++++.|+|.....+++..++++|.+|+.+|.+ ++.+ T Consensus 66 d~~a~~l~~~Gf~~~~~~~~~~~~~~d~~~~~~l~~i~~~N~~~~~~~~~~~~a~i~G~a~~~v~~~~~~~~~~~~~~~~ 145 (488) T protein:vir:23 66 DAIAERQELEGFRIPSANGEEPESGGENDPASELWDWWQANNLDIEATLGHTDALIYGTAYITISMPDPEVDFDVDPEVP 145 (488) T ss_pred HHHHHhhhccceeccCCcccccccccchhHHHHHHHHHHhcChhHHHHHHHHHHhhcCceEEEEecCCcccccCCCCCcc Confidence 999976653333 22 23466778899999999999999999999999999999998863 2457 Q ss_pred EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccC Q lcl|NC_012753. 143 RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYED 222 (502) Q Consensus 143 ~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~ 222 (502) +|..++|.+++|+|.+..+....+ ++.++..+++. +++.+.|+. +..++. ... +. +..+ T Consensus 146 ~i~~~~p~~~~~~~d~~~~~~~~~-~~~~~~~~~~~---~~~~~~y~~-~~~~~~----~~~-~~---~~~~-------- 204 (488) T protein:vir:23 146 LIRVEPPTALYAEVDPRTRKVLYA-IRAIYGADGNE---IVSATLYLP-DTTMTW----LRA-EG---EWEA-------- 204 (488) T ss_pred eEEEeccceeEEEEecCCCceEEE-EEEEEecCCCc---EEEEEEEec-CcEEEE----Eec-CC---ceEe-------- Confidence 899999999999987655543333 33333333222 233444432 222221 111 11 1110 Q ss_pred CCcceeecCCCcceEEEecCCccccccccCcCCcchhh-hHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCC Q lcl|NC_012753. 223 LEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFD-NAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTN 301 (502) Q Consensus 223 l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~-~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~ 301 (502) .....|+++++|+++|+ |+.....++|+|+++ .+++|+|+||+++|++++.++....++.+ +....... T Consensus 205 --~~~~~h~~g~vPvv~f~----n~~~~~~~~G~s~i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~----i~G~~~~~ 274 (488) T protein:vir:23 205 --PTSTPHGLEMVPVIPIS----NRTRLSDLYGTSEISPELRSVTDAAAQILMNMQGTANLMAIPQRL----IFGAKPEE 274 (488) T ss_pred --ccccccCCCCcceEEec----cccccCCcCCccchhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHH----HhCCCccc Confidence 11234788999999886 445678899999997 58999999999999999987754333222 21110000 Q ss_pred Cc--ccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHH Q lcl|NC_012753. 302 GE--KVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEV 379 (502) Q Consensus 302 g~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei 379 (502) .. .......+..... .+....++.+..+.+++ ....++|++.++.++++++..+++++..||....+..||.++ T Consensus 275 ~~~~~~~~~~~~~~~~~---~v~~~~~g~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~n~~Sg~Al 350 (488) T protein:vir:23 275 LGINAETGQRMFDAYMA---RILAFEGGEGAHAEQFS-AAELRNFVDALDALDRKAASYSGLPPQYLSSSSDNPASAEAI 350 (488) T ss_pred ccccccccchhhhhhhh---hhccCCCCCCceeEecC-CCChHHHHHHHHHHHHHHhcccCCCHHHhccccCcchHHHHH Confidence 00 0000001111100 11111122233454444 446799999999999999999999999999877777899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCC Q lcl|NC_012753. 380 VSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG--FAP 457 (502) Q Consensus 380 ~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S 457 (502) +++++.|.++++.+++.|+.+|++++++++.+.... ........+.+.|.++.+.|..+.++.+.+++++| ++| T Consensus 351 ~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~----~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s 426 (488) T protein:vir:23 351 KAAESRLVKKVERKNKIFGGAWEQAMRLAYKMVKGG----DIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIP 426 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----CcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCC Confidence 999999999999999999999999999998764321 12234567999999999999999999999999876 899 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhhhcc-----------cCCCCCccccCCCCC Q lcl|NC_012753. 458 KTMAIEKTLNVTKEQAQEIYQKINDETMVS-----------TDSFRTSEEVDIYGE 502 (502) Q Consensus 458 ~et~l~~~~~~~deea~~el~ri~~E~~~~-----------~~~~~~~~~~~~~g~ 502 (502) +||++.. +|+++++. ++++++++++.+. ......+++..-.++ T Consensus 427 ~et~~~~-l~~~~d~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 480 (488) T protein:vir:23 427 RERGWVD-MGYTIVER-EQMRQWLEQDQKQGLGLIGSLYGASTPEGKPGEAPVGEP 480 (488) T ss_pred HHHHHHh-CCCCchHH-HHHHHHHHHHHHHHHHHHHHHhccCCCcccCCCCCCCCC Confidence 9997655 47776643 4555554432110 011111111121222 No 64 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=100.00 E-value=1.7e-46 Score=271.56 Aligned_cols=414 Identities=8% Similarity=0.023 Sum_probs=262.6 Q ss_pred ccccccCCCccccc-cceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEE Q lcl|NC_012753. 55 SVTYRDSNGSQVKR-DFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAM 133 (502) Q Consensus 55 ~~~~~~~~~~~~~~-~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~ 133 (502) .+.+ +........ ++.++|||++||+.+++++..+ .|++.|...++.+++++++|+|.....+++..++++|.+|+ T Consensus 1 ~l~~-~~~~~~~~~~~~~v~n~~~~ivd~~~~~l~~~--gf~~~d~~~~~~~~~i~~~N~~d~~~~~~~~~a~i~G~ay~ 77 (434) T protein:vir:98 1 MLPK-NAEQAFLDFQRKARTNFCGLIANASVHRLLAL--GVTGPDGEPDTRASRWWQANRLDSRQKLVWRMAMAQSAGYM 77 (434) T ss_pred CCCC-CccHHHHHhhhhhhccchHHHHHHHHhhhccC--ceecCCCchHHHHHHHHHhcChhHHHHHHHHHHhhcCceEE Confidence 2221 111112222 3357899999999999999754 47788888899999999999999999999999999999999 Q ss_pred EEEEeCC--------ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecC Q lcl|NC_012753. 134 RPYIDGD--------QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESE 205 (502) Q Consensus 134 ~~~~d~~--------~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~ 205 (502) .+|.+++ .+.|++++|.+++++|.+..+...+++ .. +..+. .+..+..+.+ .+..++.. ++.. T Consensus 78 ~v~~~~~~~~~~~~~~~~I~~~~p~~~~~i~D~~~~~~~~ai-~~-~~~~~-~~~~~~~~~~---~~~~~~~~---~~~~ 148 (434) T protein:vir:98 78 LVGAHPTRTEDNGRPSPLITMEHPSECIVEYDPETGEPLVGL-KV-WHNDI-DGFGYARVFF---DDTSFPYR---TRER 148 (434) T ss_pred EEecCCCcccccCCceeEEEEeccceeEEEEeCCCCceEEEE-EE-EEecc-CCceEEEEEE---eCcEEEEE---Eeec Confidence 9998642 467999999999999876555433333 22 22222 2222222222 11112111 1111 Q ss_pred CccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012753. 206 SKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR 285 (502) Q Consensus 206 ~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~ 285 (502) +. .-+...+...++....+....|+++++|++.|+|+ .+.+. .|+|+|+.+++|||+||+++|+.++..+.... T Consensus 149 ~~-~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~----~~~~~-~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~ 222 (434) T protein:vir:98 149 TG-ARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARM----PDLGE-DPEPEFAGVLDIQDRVNLGILNRMAASRFSGF 222 (434) T ss_pred cc-cccccccccceecccccccccCCCCccceEEeccC----CCcCc-CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcc Confidence 10 00000111111223334445688999999998654 34333 69999999999999999999999998775433 Q ss_pred eee-echHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChh Q lcl|NC_012753. 286 RVA-VPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTG 364 (502) Q Consensus 286 ~i~-v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 364 (502) ++. +...-+....+..+........+..... ++.... +.+..+.+++ +...++|++.|+.++++++..+++++. T Consensus 223 p~~~i~G~~~~~~~~~~~~~~~~~~~~~~~~~---~i~~~~-~~~~~~~q~~-~~~~~~~~~~l~~~i~~~~~~~~~p~~ 297 (434) T protein:vir:98 223 RQKWIKGHKFAKRTDPATGMTVVDQPFVPSPS---AVWASE-GENTQFGQLD-ATDLSGFLKEHASDVRDMLTISQTPTY 297 (434) T ss_pred hhhhhcCCCcccccccccccchhhhhhhcccc---ccccCC-CCCceEEEec-CcchHHHHHHHHHHHHHHhcccCCCHH Confidence 322 2110011111111111111101111100 011111 1122233333 345789999999999999999999999 Q ss_pred hccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHH Q lcl|NC_012753. 365 MFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEF 444 (502) Q Consensus 365 ~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~ 444 (502) .||. ..++.||.++++++..|.+++..+++.|+.+|++++++++.+. +.......+.+.|.++.+.|..+++ T Consensus 298 ~~~~-~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~rl~~~~~-------g~~~~~~~~~v~w~~~~~~s~~~~a 369 (434) T protein:vir:98 298 LYAT-DLVNISADTIGALDILHVAKVREHIASFSEGLESVLALAAAQA-------GVPEDYTEAEVRWANPAHVTMAVKA 369 (434) T ss_pred Hhcc-ccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CCChhheeeeEEecCCCCCCHHHHH Confidence 9985 3456799999999999999999999999999999999887552 2334556799999999999999999 Q ss_pred HHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhc---ccCCCCCcc-------ccCCCC Q lcl|NC_012753. 445 DYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMV---STDSFRTSE-------EVDIYG 501 (502) Q Consensus 445 ~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~---~~~~~~~~~-------~~~~~g 501 (502) +.+++++++|+ |.++. ..+.|++++|+++..++.+++... ..+....+. +...-| T Consensus 370 da~~kl~~~g~-~~e~~-~~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~dg 434 (434) T protein:vir:98 370 DAATKLKSIGY-PLDVI-AEELDESPARVRRIVAGAASQALLAASLLPAPGAPSAGNVPDSGGAVDG 434 (434) T ss_pred HHHHHHHhcCC-cHHHH-HHhCCCCHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCCCcccCCCCC Confidence 99999998875 88774 566788888776665554443111 111111111 111222 No 65 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=100.00 E-value=2.2e-45 Score=265.37 Aligned_cols=444 Identities=12% Similarity=0.087 Sum_probs=278.6 Q ss_pred CC---------------hhH----HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC Q lcl|NC_012753. 1 MG---------------IIQ----TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS 61 (502) Q Consensus 1 m~---------------~~~----~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~ 61 (502) |. +-+ .|++++++. .....++....+||.|+|++ .+... T Consensus 1 ~~~~~~~~~~~~~~~~~l~~~e~~~i~~L~~~~---------------------~~~~~r~~~l~~YY~G~~~i-~~~~~ 58 (504) T protein:vir:99 1 MTEETTSASKFTFRIPELNDDVVDKVNGLYQQL---------------------VDRTPRNLLRASFYDGKYAI-RQIGN 58 (504) T ss_pred CCccCCcccccccccCCCCHHHHHHHHHHHHHH---------------------HHHhHHHHHHHHHHhccccc-hhccc Confidence 11 111 222332221 12346888889999999874 33222 Q ss_pred -CCccccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC Q lcl|NC_012753. 62 -NGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD 140 (502) Q Consensus 62 -~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~ 140 (502) ..+..++.+.++|||++||+.+|+++..+..++. +++..++.|+++++.|+|+....+++..|+++|.+|+.+|.+++ T Consensus 59 ~~p~~~~~~~~v~n~~~~iVd~~a~rl~~~Gf~~~-d~~~~~~~l~~i~~~N~ld~~~~~~~~~a~iyG~af~~v~~~~d 137 (504) T protein:vir:99 59 LIPPEYLRTATVLGWSAKAVDTLARRCNLESFVWP-DGDYGSIGGPDVWDENFFATKANNAMVSSLIHGPAFLINTEGGA 137 (504) T ss_pred cccHHHHHHhhccCcHHHHHHHHHhhhccceeeCC-CCChhhHHHHHHHHhcChhhHHHHHHHHHHhhCceeEEEecCCC Confidence 2233345667899999999999999977665422 45566778999999999999999999999999999999998753 Q ss_pred ---ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecc Q lcl|NC_012753. 141 ---QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLS 217 (502) Q Consensus 141 ---~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~ 217 (502) .+.|++++|.+++.+|.+..+...+++.+.. . +. .+ .++.++.|.. +..+++ ...+. |. T Consensus 138 ~~~~~~I~~~sP~~~~~iyD~~~~~~~~a~~~~~-~-d~-~g-~~~~~~~y~~-~~~~~~-----~~~~~---~~----- 199 (504) T protein:vir:99 138 GEPDSLIHVKSAMQATGEWNSRRNAMDSLLSITS-R-DA-EG-HPTGIALYED-GVTVTA-----DMDDD---GD----- 199 (504) T ss_pred CCceeEEEEeccceeEEEEeCCCCceeEEEEEEE-e-cC-CC-eEEEEEEEcC-CcEEEE-----EEcCC---ce----- Confidence 3679999999999998765555445443222 2 21 22 2444555431 111211 11111 11 Q ss_pred ccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhh-hHHHHHHHHHHHHHHHHHHHhhccce-eeechHHhc Q lcl|NC_012753. 218 TLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFD-NAKTTMDFINTTYDEFMWEVKMGQRR-VAVPTQMIK 295 (502) Q Consensus 218 ~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~-~~~~lid~ld~~~S~~~~~~~~~~~~-i~v~~~~l~ 295 (502) |. .+...++++ +|++.|. |+.+...|+|+|++. .+++|+|++|+++++.++..+....+ .++-.-.+. T Consensus 200 --~~---~~~~~~~~g-vPvV~~~----n~~~~~~~~G~sei~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~ 269 (504) T protein:vir:99 200 --WH---ADVRTHKLG-VPVEVLP----YKPREDRPLGSSRITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAK 269 (504) T ss_pred --ee---eccccCCCC-cceEEec----ccccCccccCcccchhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCcc Confidence 11 011123444 4566654 666778899999986 89999999999999999876532221 122000000 Q ss_pred cCCCCCCcccCccccccccchhhccccCC-CC----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccc Q lcl|NC_012753. 296 TEYDTNGEKVTVKREFETGHNVYEQFDSG-DM----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDG 370 (502) Q Consensus 296 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~-~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~ 370 (502) ...+.+|... ..+.........+..+ .+ +.+.-+.+++ ....+.|.+.|+.++++|++.+++|+..||+.+ T Consensus 270 ~~~~~d~~~~---~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q~~-~~~l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~ 345 (504) T protein:vir:99 270 NFRNKDGSMK---PAWQIALARVFALPDDEDEPDAARARADVKQFP-ASSPQPHIEMLEQIAMMFSGETSIPVESLGFSN 345 (504) T ss_pred cccccccccc---chhhhhhhhhhcCCCccccccccCccceeeecC-CCChHHHHHHHHHHHHHHHhhhCCCHHHhcccc Confidence 0001111110 0111111111111100 00 0112233332 234578999999999999999999999999765 Q ss_pred -cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHH Q lcl|NC_012753. 371 -KSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSK 449 (502) Q Consensus 371 -~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~ 449 (502) .+..||.+|+++...|..++..+++.|..+|++++++++.+... .+.....+..+.|.|.+..+.+..+.++.+.+ T Consensus 346 ~~n~sSa~Ai~~~~~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~---~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~K 422 (504) T protein:vir:99 346 RANPTSADAYIASREDLIAEAEGATDDWSPAFRRSMIRALAIKNG---LDRIPPEWKTIDSKFRSPLYLSKAAQADAGAK 422 (504) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCccccccccceeEecCCCccCHHHHHHHHHH Confidence 46789999999999999999999999999999999988876542 22334455789999999999999999999999 Q ss_pred HHhcCC--CCHHHHHHhcCCCCHHHHHHHHHHHHHhhhccc-----CCCCCccccCCC-----CC Q lcl|NC_012753. 450 MVAAGF--APKTMAIEKTLNVTKEQAQEIYQKINDETMVST-----DSFRTSEEVDIY-----GE 502 (502) Q Consensus 450 ~~~~Gi--~S~et~l~~~~~~~deea~~el~ri~~E~~~~~-----~~~~~~~~~~~~-----g~ 502 (502) ++++|. ++..+++..+.|++++|++++.++.+++++... +......+.+-. || T Consensus 423 l~~ag~~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~e 487 (504) T protein:vir:99 423 MLGAGPEWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNRRQQEAATAGEDQDQGAGE 487 (504) T ss_pred HHhhccccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhcccCCCCCCCCCCCcCCCC Confidence 999885 445555666669999888766555544432110 111111111111 11 No 66 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=100.00 E-value=1.4e-41 Score=244.54 Aligned_cols=412 Identities=11% Similarity=0.071 Sum_probs=273.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccc-cCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYR-DSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~-~~~~~~~~~~~~~~n~~k~i 79 (502) |... .|..++++. .....++....+||.|+++.-... .........++..+|||+++ T Consensus 1 m~~~-~i~~L~~~~---------------------~~~~~r~~~~~~yy~g~~~~~~~~~~~p~~~~~~~~~v~nw~~~~ 58 (422) T protein:vir:97 1 MNYM-GMGYLRRKL---------------------ALFKTGVDKRYRYYAMDDRDDTRSIVMPNNVREMYRSVLEWTAKG 58 (422) T ss_pred CChH-HHHHHHHHH---------------------HHHHHHHHHHHHHHhcCCChhhcCccccHHHHHHHHhhcchhHHH Confidence 3221 122222211 123468888999999998753211 11222233445667999999 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC--CceEEEEEcCCeEEEEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG--DQIRVSFVQATVFFPLQA 157 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~~i~~v~~~~~~Pi~~ 157 (502) |+.+|+.+.-+. |+++|.. ++++++.|+|.....+++..|+.+|.+|+.+|.++ +.++|.+++|.+++.+|. T Consensus 59 Vd~~a~rl~~~G--f~~~d~~----l~~~w~~N~ld~~~~~~~~~al~~G~sf~~v~~~~~~~~p~i~~~sp~~~~~i~D 132 (422) T protein:vir:97 59 VDSLADRIIFRE--FTNDDFN----AWEIFKANNPDIFFDTAIQSALIASCCFVYIMPGAEDGLPKMQVIEASKATGILD 132 (422) T ss_pred HHHHHhccccce--eeCCchh----HHHHHHhcChHHHHHHHHHHHHHhcceeEEEeeCCCCCeeEEEEechhhEEEEEe Confidence 999999664443 5676653 67888889999999999999999999999999864 578999999999999986 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) +..+...+++.+ + ..+. .+. .+...+ +.+... . .++. + |. .....+++++||+ T Consensus 133 ~~~~~~~~a~~~-~-~~~~-~~~-~~~~~~--~~~~~~--~--~~~~-~----~~------------~~~~~~~~g~vPv 185 (422) T protein:vir:97 133 PTTFLLTEGYAI-L-ESDS-NGN-PTLEAY--FTDKDI--W--YYPK-K----GK------------PYNIKNPTGHPLL 185 (422) T ss_pred CCCCcceeeEEE-E-EecC-CCc-EEEEEE--EcCceE--E--EEcC-C----Cc------------cccccCCCCCcce Confidence 555555455432 2 2221 221 111111 222211 1 1221 1 11 1112367788998 Q ss_pred EEecCCccccccccCcCCcchh-hhHHHHHHHHHHHHHHHHHHHhhccce-eeechHHhccCCCCCCcccCccccccccc Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIF-DNAKTTMDFINTTYDEFMWEVKMGQRR-VAVPTQMIKTEYDTNGEKVTVKREFETGH 315 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~-~~~~~lid~ld~~~S~~~~~~~~~~~~-i~v~~~~l~~~~~~~g~~~~~~~~~~~~~ 315 (502) ++|. |+.+...|+|.|.+ +.+++|+|++|+++++.....+....+ .++ +..+.++.+. . .+.... T Consensus 186 v~~~----n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i----~G~d~d~~~~--~---~~~~~~ 252 (422) T protein:vir:97 186 VPII----HRPDAVRPFGRSRITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYV----LGMDPDAKPM--E---KWRATV 252 (422) T ss_pred EEec----ccCCCccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhhhh----cccCcccccC--c---hhhhhh Confidence 8875 45577889999998 679999999999999999876532221 122 2222222211 1 111111 Q ss_pred hhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 316 NVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIAT 395 (502) Q Consensus 316 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~ 395 (502) .....+..+..+....+.+++ ....+.|.+.++.++++++..+++|++.||..+.+..||.+|++....|..++..+++ T Consensus 253 ~~i~~~~~de~~~~~~v~q~~-~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~~ 331 (422) T protein:vir:97 253 STLLEISKDEDGDKPTVGQFT-TASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSDNPSSVESIKAAHENLRAAGRKAQR 331 (422) T ss_pred hhhhccCCCCCCCcceeeecC-CCChhHHHHHHHHHHHHHhcccCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHH Confidence 111112111112222343333 2345789999999999999999999999999888778999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCC---HHHHHHHHHHHHhc--CCCCHHHHHHhcCCCCH Q lcl|NC_012753. 396 LVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTD---RNAEFDYWSKMVAA--GFAPKTMAIEKTLNVTK 470 (502) Q Consensus 396 ~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d---~~~~~~~~~~~~~~--Gi~S~et~l~~~~~~~d 470 (502) .|..+|+++++.++.+... .......+.++.+.|....+.+ ..+.++.+.+++++ |+++.++++..+ |+++ T Consensus 332 ~fg~~l~~~~rla~~~~~~---~~~~~~~~~~~~~~w~p~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~l-g~~~ 407 (422) T protein:vir:97 332 SFSSGFLNVAYIAVCLRDE---FPYLRNQFMDTVIKWEPLFEADANMLTLVGDGAIKLNQAIPGFMDADVIRDLT-GVKG 407 (422) T ss_pred HHHHHHHHHHHHHHHHhcC---CcccchhhccceEEEccCCCCChHHHHHHHHHHHHHHhhccccccHHHHHHHc-CCCc Confidence 9999999999998876432 1223345567999999776777 55667778899988 789999876554 8866 Q ss_pred HHHHHHHHHHHHhhhcc Q lcl|NC_012753. 471 EQAQEIYQKINDETMVS 487 (502) Q Consensus 471 eea~~el~ri~~E~~~~ 487 (502) ++.+.+|+.++++.. T Consensus 408 --~~~~~~~~~~~~~d~ 422 (422) T protein:vir:97 408 --ADKPIPAITEVTTDG 422 (422) T ss_pred --hhHHHHHHHhhhccC Confidence 567788888876654 No 67 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=100.00 E-value=1.1e-40 Score=239.72 Aligned_cols=399 Identities=10% Similarity=0.060 Sum_probs=268.7 Q ss_pred ccCCHHHHHHHHHHHHHhcCCCCcccccc--CCCccccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHH Q lcl|NC_012753. 32 IAISPEEYNRIMDNLRYFAGDFDSVTYRD--SNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETL 109 (502) Q Consensus 32 ~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~--~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~ 109 (502) +++. ..|+....+||.|+++. .+.. .......+.+.++|||+++|+.+|+.+.-+. |+.+|.. +++++ T Consensus 1 l~~~---~~r~~~~~~yY~g~~~~-~~~~~~~p~~~~~~~~~v~nw~~~~Vds~a~rl~~~G--f~~~d~~----l~~i~ 70 (410) T protein:vir:95 1 MNLY---QSRVNLRYKHYAMQHYE-APTGITIPAHIRAKYQAVLGWAAKGVDSLADRLIFRA--FANDDFN----VTEIF 70 (410) T ss_pred CCcc---hhhHHHHHHHhcCCCCc-cccchhccHHHHhHHHhhcchhHHHHHHhHhhhcccc--ccCCCch----HHHHH Confidence 2333 35777788999999864 2222 1222333456788999999999999886554 5566643 77888 Q ss_pred hhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEE Q lcl|NC_012753. 110 KNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFH 188 (502) Q Consensus 110 ~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h 188 (502) +.|+|.....+++..|+.+|.+|+.+|-++ ++++|.+++|.+++.+|.+..+...+++ +++..+. .+ ..+....| T Consensus 71 ~~N~ld~~~~~~~~~al~~G~sf~~v~~~~d~~~~i~~~sP~~~~~i~Dp~~~~~~~al--~~~~~~~-~~-~~~~~~~~ 146 (410) T protein:vir:95 71 DRNNPDIFFDSAILSALIGSCSFVYISKGEDDEVRLQVIESSNATGVIDPITGLLVEGY--AVLARDD-YN-RPTLEAYF 146 (410) T ss_pred hhcChHHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEEeCCCCceEEEE--EEEEecC-CC-eEEEEEEE Confidence 899999999999999999999999999865 5799999999999999865444433333 2222221 11 23333333 Q ss_pred EEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchh-hhHHHHHH Q lcl|NC_012753. 189 EWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIF-DNAKTTMD 267 (502) Q Consensus 189 ~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~-~~~~~lid 267 (502) . .+..+++ .... + .....+++++||++.|. |+.++..|+|+|.+ +.+++|+| T Consensus 147 ~-~~~~~~~-----~~~~----~-------------~~~~~~~~g~vPvV~f~----n~~~l~~~~G~s~I~~~v~~l~d 199 (410) T protein:vir:95 147 E-PNATHFI-----PKDG----E-------------PYSVTNETGIPLLVPVI----HRPDAVRPFGRSRITRAGMYYQK 199 (410) T ss_pred e-CCcEEEE-----eeCC----c-------------cccccCCCCCcceEEec----ccccCCccCCccccchhHHHHHH Confidence 2 1222221 1111 0 11234678899999885 55677889999987 67999999 Q ss_pred HHHHHHHHHHHHHhh--ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHH Q lcl|NC_012753. 268 FINTTYDEFMWEVKM--GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYI 345 (502) Q Consensus 268 ~ld~~~S~~~~~~~~--~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~ 345 (502) ++|+++++.....+. .+.+.+ +..+.++.+.. .+.........+..+.++....|.+++ ....+.|. T Consensus 200 a~~r~~~~~~~~~e~~a~pqr~i-----~G~d~d~~~~~-----~~~~~~~~i~~~~~~~~~~~~~v~q~~-~~~l~~~~ 268 (410) T protein:vir:95 200 YAKRTLERADITAEFYSWPQKYI-----LGLDPDAEPME-----KWKATVSSLLTISSSDKGVKPSVGQFT-TASMSPFT 268 (410) T ss_pred HHHHHHHHHHHHHHHhcchhhee-----eccCCCCCcCc-----hhhhhhhhheeccCCCCCCcceEEecC-CCChHHHH Confidence 999999999886553 333322 22222222211 122211111222122222223344443 33456899 Q ss_pred HHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc Q lcl|NC_012753. 346 KAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTM 425 (502) Q Consensus 346 ~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~ 425 (502) +.++.+++++++.+++++..||..+.+..||.+|++....|..++.++++.|..+|++++++++.+... .......+ T Consensus 269 ~~l~~l~~~~a~~s~lP~~~lg~~~~NpsSa~Al~a~~~~L~~ka~~k~~~fg~~l~~~~rla~~i~~~---~~~~~~~~ 345 (410) T protein:vir:95 269 EQLRTAAAGFAGEMGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQRSLGAGLLNVAYVAACLRDE---FRYTRSQF 345 (410) T ss_pred HHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC---CCCccccc Confidence 999999999999999999999988877789999999999999999999999999999999988876532 12223455 Q ss_pred cceEEEeC---CCccCCHHHHHHHHHHHHhc--CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCC Q lcl|NC_012753. 426 DEVSVDLD---DGVFTDRNAEFDYWSKMVAA--GFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDS 490 (502) Q Consensus 426 ~~i~v~f~---d~i~~d~~~~~~~~~~~~~~--Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~ 490 (502) .++.|.|. +.-..+..+.++.+.+++++ |+++.++++..+ |+++++. .|++.|...+.++ T Consensus 346 ~~~~v~W~p~~d~~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~~~l-g~~~~~~----~~~~~~e~~~~g~ 410 (410) T protein:vir:95 346 VRTAVKWEPLFEADANTMTMIGDGVVKLNQALPGYINAETIRDLT-GIAGDMS----AKPVVSEGGSNGE 410 (410) T ss_pred ceeeEEeeecCCcchhhHHHHHHHHHHHHHhccCCccHHHHHHhc-CCChHHH----HHHHHHHHHhCCC Confidence 67889997 65566778899999999988 799999966554 8887642 2333333333233 No 68 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=100.00 E-value=7.3e-39 Score=229.66 Aligned_cols=444 Identities=10% Similarity=0.022 Sum_probs=273.3 Q ss_pred CChhH--HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC-CCccccccceecchHH Q lcl|NC_012753. 1 MGIIQ--TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS-NGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~--~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~~~~~n~~k 77 (502) |+-=+ .|.+++++. .....++....+||.|+++. .+... -.+..++.+.++|||+ T Consensus 12 l~~~~~~~~~~L~~~~---------------------~~~~~~~~~~~~Yy~G~~~~-~~~~~~~p~~~r~~~~v~nw~~ 69 (474) T protein:vir:81 12 LSNDENALINGLLAQI---------------------ENLRWKNLLRTSYYENKRTI-QYVGTLIPPQYFNLGLVLGWTG 69 (474) T ss_pred CChhHHHHHHHHHHHH---------------------HHHhhHHHHHHHHhccCCCh-hhccccccHHHHHHHhhcChHH Confidence 11111 122222221 12345778888999999874 32222 1223334467899999 Q ss_pred HHHHHHhhhhhcCcceEeeCC-HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-C--ceEEEEEcCCeEE Q lcl|NC_012753. 78 TASKKVASLVFNEQATIRVDN-EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-D--QIRVSFVQATVFF 153 (502) Q Consensus 78 ~iv~~~a~~l~~ep~~i~~~d-~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~--~~~i~~v~~~~~~ 153 (502) ++|+.+|+.+.-+..+ +++ +..+..++++++.|+|......++..|+.+|.+|+.++.++ + .+.|+.++|.+++ T Consensus 70 ~~Vd~~a~rl~~~Gf~--~~d~~~~~~~l~~iw~~N~ld~~~~~~~~~al~~G~sf~~V~~~~d~~~~~~i~~~sp~~~~ 147 (474) T protein:vir:81 70 KAVDALARRCNLEGFV--WPDGDLDSLGGTEVVDDNHLLSEIDSAIVAAMQHGPAFLINTVGEDDEPEALIHVKDASEAT 147 (474) T ss_pred HHHHHHHhhhcccceE--CCCCCccchHHHHHHHhcChhHHHHHHHHHHHhhCceeEEEecCCCCCceeEEEEeccceEE Confidence 9999999999766654 443 34556789999999999999999999999999999999854 2 4889999999999 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) .+|.+..+...+++.+ +....+.. .+....|. .+..+++ ++... +.. |. .+...++++ T Consensus 148 ~~~D~~~~~~~~al~~-~~~~~~g~---~~~~~ly~-~~~~~~~----~~~~~----~~~------w~---~~~~~~~~g 205 (474) T protein:vir:81 148 GEWNRRRRGLNNLLSI-IDKDKEGK---VLSLALYL-DNETVTA----QRDKA----TLK------WQ---VDRDEHVYG 205 (474) T ss_pred EEEeCCCCcceeeeEE-EEEcCCCc---EEEEEEEe-CCcEEEE----EEcCc----cce------ee---eccCCCCCC Confidence 9986655554444432 22222222 22222221 1222222 11111 100 10 011224444 Q ss_pred cceEEEecCCccccccccCcCCcchh-hhHHHHHHHHHHHHHHHHHHHh--hccceeeechHHhccCCCCCCcccCcccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIF-DNAKTTMDFINTTYDEFMWEVK--MGQRRVAVPTQMIKTEYDTNGEKVTVKRE 310 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~-~~~~~lid~ld~~~S~~~~~~~--~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 310 (502) . |+++| .|+.+...|+|+|.+ ..+++|+|++|+++++.....+ ..+.+.+.-.. .....+.+|... .. T Consensus 206 v-PvV~~----~n~~~~~~~~G~s~i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~-~~~~~d~d~~~~---~~ 276 (474) T protein:vir:81 206 V-PAQVL----PYKPAPKRPFGQSRITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGAD-ESALKNADGTIK---SV 276 (474) T ss_pred c-ceEEe----cccccccCcCCccccchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCC-hhhccccccccc---ch Confidence 3 45554 466778899999988 5899999999999999988654 33333332000 000111111111 11 Q ss_pred ccccchhhccccCCCCcc-----ccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccc-cccccHHHHHHHHH Q lcl|NC_012753. 311 FETGHNVYEQFDSGDMDK-----GIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDG-KSMKTATEVVSEQS 384 (502) Q Consensus 311 ~~~~~~~~~~~~~~~~~~-----~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~-~~~~tAtei~~~~~ 384 (502) +.........+..+.... +..+..++ +...+.|.+.++.+++.++..++++++.||..+ ++..||.+|++.+. T Consensus 277 ~~~~~~~i~~~~~d~d~~~~~~~~~~~~q~~-~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~SaeAi~a~~~ 355 (474) T protein:vir:81 277 WEARLGRIKGLPDDADADIPQLARADVKQFP-AASPDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSAESYDASQY 355 (474) T ss_pred hhhhHHHHhcCCCcccccccccccccccccC-CCChhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHHH Confidence 111111111111111100 11123332 344678999999999999999999999999764 67789999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC--CCCHHHHH Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG--FAPKTMAI 462 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G--i~S~et~l 462 (502) .|..++.++++.|..+|++++++++.+..... .......+..+.+.|.+....+..+.++.+.+++++| +.+.++ + T Consensus 356 ~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~-~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~-~ 433 (474) T protein:vir:81 356 ELIAEAEGAVDDFTPALRKAFIRALAMKNKVA-IDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEV-G 433 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCC-ccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHH-H Confidence 99999999999999999999999887643211 0111233567899999999999999999999999987 344455 5 Q ss_pred HhcCCCCHHHHHHHHHHHHHhhhccc-CCCCCccccCCCCC Q lcl|NC_012753. 463 EKTLNVTKEQAQEIYQKINDETMVST-DSFRTSEEVDIYGE 502 (502) Q Consensus 463 ~~~~~~~deea~~el~ri~~E~~~~~-~~~~~~~~~~~~g~ 502 (502) ...+|+|++|+++..++.+++++... +...+....+=-++ T Consensus 434 ~~~lg~t~~~i~~~~~~~~~~~~~~~~~~l~~~~~~~~~aq 474 (474) T protein:vir:81 434 LELIGLTPQQARRAMADKRRVQGRGTLQALIDRSNNGATAQ 474 (474) T ss_pred HhhcCCCHHHHHHHHHHHHHHhHHHHHHHHHhcCCCCCCCC Confidence 66689999888776655554443211 11111000000111 No 69 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=100.00 E-value=2.4e-39 Score=232.35 Aligned_cols=398 Identities=11% Similarity=0.036 Sum_probs=262.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC--CCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS--NGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~--~~~~~~~~~~~~n~~k~ 78 (502) |.. +.|..++++. .....++....+||.|+++. .+... ......+.+..+|||++ T Consensus 1 ~~~-~~i~~L~~~~---------------------~~~~~r~~~~~~yY~g~~~~-~~~~~~~p~~~~~~~~~v~nw~~~ 57 (409) T protein:vir:94 1 MTE-KGIGYLRFKL---------------------SVHKRRAEMRYDQYAMKYVD-RFKGITIPQALSQQYRSILGWCAK 57 (409) T ss_pred CCH-HHHHHHHHHH---------------------HHHhHHHHHHHHHhcccCch-hhcChhhhHHHHHHHhhhcchhHH Confidence 211 0122222111 12346788888999999864 22221 22223345678899999 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEE Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQA 157 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~ 157 (502) +|+.+|+.+.-+. |+++|. .|+++++.|+|.....+++..|+.+|.+|+.+|-++ +.++|.+++|.+++.+|. T Consensus 58 iVds~a~rl~~~G--f~~~d~----~l~~i~~~N~ld~~~~~~~~~aliyG~sf~~v~~~~dg~~~i~~~sp~~~~~i~D 131 (409) T protein:vir:94 58 GVDSLADRLVFRE--FENDDF----TVNEIFEENNPDIFFDSAVLSSLIASCSFTYISKGENDAVRLQVIEAVNATGIID 131 (409) T ss_pred HHHHhHhhcccCc--ccCCch----HHHHHHHhcChhHHHHHHHHHHHHhcceeEEEecCCCCceEEEEeccceEEEEEe Confidence 9999999775443 566664 477889999999999999999999999999999874 679999999999998875 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) +..+...++ + ++...+ ..+ +.+....|. .+..++. ++. + |. | ....+++++||+ T Consensus 132 ~~~~~~~~a-~-~~~~~d-~~~-~~~~~~~~~-~~~~~~~----~~~-~----~~-------~-----~~~~n~~g~vPv 185 (409) T protein:vir:94 132 PITGLLTEG-Y-AVLERD-ENN-NVVLEAHFL-PDRTDYY----YRD-S----RN-------N-----ISIANPTGHPLL 185 (409) T ss_pred cCCCceeee-E-EEEEec-CCC-ceEEEEEEe-cCcEEEE----Eec-C----ce-------e-----EeeeCCCCCcce Confidence 544433232 2 222222 112 223322222 2222221 111 1 00 1 112367788998 Q ss_pred EEecCCccccccccCcCCcchh-hhHHHHHHHHHHHHHHHHHHHhh--ccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIF-DNAKTTMDFINTTYDEFMWEVKM--GQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~-~~~~~lid~ld~~~S~~~~~~~~--~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) +.|. |+.++..|+|+|.+ +.+++|+|++|++.++.....+. .+.+.+ +..+.++.+.. .+... T Consensus 186 V~f~----n~~~~~~~~G~s~I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-----~G~d~d~~~~~-----~~~~~ 251 (409) T protein:vir:94 186 VPII----HRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYV-----TGLSDDAEPME-----TWKAT 251 (409) T ss_pred EEec----cccccccccCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhhee-----EecCCCCcccc-----hhhhh Confidence 8885 45677889999999 57999999999999999986653 332322 21122222111 12221 Q ss_pred chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~ 394 (502) ......+..+..+.+..|.+++ ....+.|.+.++.+++++++.+++|++.||..+.+..||.+|++....|..++..++ T Consensus 252 ~~~i~~~~~d~dg~~~~v~q~~-~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~NpsSa~Al~a~~~~L~~~a~~k~ 330 (409) T protein:vir:94 252 VSSMLQFTKDEDGDKPTLGQFT-QPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQ 330 (409) T ss_pred HHHhhcCCCCCCCCCceEEecC-CCChhHHHHHHHHHHHHHhhhcCCCHHHhccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 1111222112122223344443 233578999999999999999999999999888777899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCC---HHHHHHHHHHHHhcC--CCCHHHHHHhcCCCC Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTD---RNAEFDYWSKMVAAG--FAPKTMAIEKTLNVT 469 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d---~~~~~~~~~~~~~~G--i~S~et~l~~~~~~~ 469 (502) +.|..+|++++++++.+.... +.....+..+.+.|.+..+.+ ..+.++.+.+++++| +++.++. ....|++ T Consensus 331 ~~fg~~~~~~~rla~~i~~~~---~~~~~~~~~~~v~W~p~~~~~~~~~a~~aDa~~Kl~~ag~~~~~~~~~-~~~lG~~ 406 (409) T protein:vir:94 331 RSLGAGLLNVAYLAACLRDDA---PYLREQFRKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDTI-RDLTGIE 406 (409) T ss_pred HHHHHHHHHHHHHHHHHhCCC---CccccccccceEEeccCCCcchHHHHHHHHHHHHHHHhcccccchhHH-HHHcCCC Confidence 999999999999888764321 222345568999999665555 456678889999998 6677775 4556998 Q ss_pred HHH Q lcl|NC_012753. 470 KEQ 472 (502) Q Consensus 470 dee 472 (502) +.+ T Consensus 407 ~~d 409 (409) T protein:vir:94 407 GGE 409 (409) T ss_pred CCC Confidence 865 No 70 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=100.00 E-value=5.1e-39 Score=230.54 Aligned_cols=398 Identities=11% Similarity=0.032 Sum_probs=261.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc--CCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD--SNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~--~~~~~~~~~~~~~n~~k~ 78 (502) |.. +.|..++++. .....++....+||.|+++. .+.. .......+.+..+|||++ T Consensus 1 ~~~-~~i~~L~~~~---------------------~~~~~r~~~~~~yY~g~~~~-~~~~~~~p~~~~~~~~~v~nw~~~ 57 (409) T protein:vir:16 1 MTE-KGIGYLRFKL---------------------SVHKRRAEMRYEQYAMKHVD-RFKGITIPQALSQQYRSILGWCAK 57 (409) T ss_pred CCH-HHHHHHHHHH---------------------HHHhHHHHHHHHHHhccCch-hhcchhhhHHHHHHHhhhcChhHH Confidence 221 1222222221 12346888899999999764 2222 222233345678899999 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-CceEEEEEcCCeEEEEEE Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQIRVSFVQATVFFPLQA 157 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~~i~~v~~~~~~Pi~~ 157 (502) +|+.+|+.+.-+. |+++|. .|+++++.|+|.....+++..|+++|.+|+.+|-++ +.++|.+++|.+++.++. T Consensus 58 iVds~a~rl~~~G--f~~~d~----~l~~i~~~N~ld~~~~~~~~~al~yG~sf~~v~~~~dg~~~i~~~sP~~~~~i~D 131 (409) T protein:vir:16 58 GVDSLADRLVFRE--FENDDF----TVNEIFEENNPDIFFDSTVLSALIASCSFTYISKGENDAVRLQVIEATNATGIID 131 (409) T ss_pred HHHHhHhhccccc--ccCcch----HHHHHHHhcChhHHHHHHHHHHHHhCceeEEEecCCCCceEEEEEcccceEEEee Confidence 9999999775443 556664 477889999999999999999999999999999864 679999999999999975 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) +..+...+++. . ...+.... -+....| +.+.+++ .++. + | .| ....+++++||+ T Consensus 132 ~~~~~~~~a~~-~-~~~d~~~~--~~~~~~~-~~~~~~~----~~~~-~----~-------~~-----~~~~~~~g~vPv 185 (409) T protein:vir:16 132 PITGLLTEGYA-V-LERDENNN--VVLEAHF-LPDRTDY----YYRD-S----R-------NN-----ISIANPTGNPLL 185 (409) T ss_pred cccccceeeeE-E-EEecCCCc--eEEEEEE-ecCcEEE----EEec-C----c-------cc-----cceecCCCCcce Confidence 54444333332 2 22221111 1222122 1122211 1111 1 0 01 112467888999 Q ss_pred EEecCCccccccccCcCCcchh-hhHHHHHHHHHHHHHHHHHHHhh--ccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIF-DNAKTTMDFINTTYDEFMWEVKM--GQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~-~~~~~lid~ld~~~S~~~~~~~~--~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) +.|. |+.++..|+|+|.+ +.+++|+|++|+++++.....+. .+.+.+ +..+.++.+.. .+... T Consensus 186 V~f~----n~~~~~~~~G~seI~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i-----~G~d~d~~~~~-----~~~~~ 251 (409) T protein:vir:16 186 VPII----HRPDAVRPFGRSRITRSGMYWQSNAKRTLERADVTAEFYSFPQKYV-----TGLSDDAEPME-----TWKAT 251 (409) T ss_pred EEec----ccccccccCCccccchhHHHHHHHHHHHHHHHHHHHHHhcChhhee-----EecCCCCCccc-----hhhhh Confidence 8885 56678899999998 57999999999999999886553 333322 21222221111 12221 Q ss_pred chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 315 ~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~ 394 (502) ......+..+..+.+..|.+++ ....+.|.+.++.+++++++.+++|++.||..+.+..||.+|++....|..++.+++ T Consensus 252 ~~~i~~~~~d~~g~~~~v~q~~-~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~NpsSa~Ai~a~~~~L~~ka~~k~ 330 (409) T protein:vir:16 252 VSSMLQFTKDEDGDKPTLGQFT-QPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSDNPSSVEAIKASHENLRLAGRKAQ 330 (409) T ss_pred hhHhhccCCCCCCCCceEEecC-CCChhHHHHHHHHHHHHHhhhcCCCHHHcccccCchhHHHHHHHHHHHHHHHHHHHH Confidence 1111112112222223344443 334578999999999999999999999999888777899999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCC---HHHHHHHHHHHHhcCC--CCHHHHHHhcCCCC Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTD---RNAEFDYWSKMVAAGF--APKTMAIEKTLNVT 469 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d---~~~~~~~~~~~~~~Gi--~S~et~l~~~~~~~ 469 (502) +.|..+|+++++.++.+... .+.....+..+.+.|.+..+.+ ..+.++.+.|++++|. +..++ +....|++ T Consensus 331 ~~fg~~l~~~~rla~~~~~~---~~~~~~~~~~~~v~W~~~~~~~~~s~a~~aDa~~Kl~~a~~~~~~~~v-~~~~~g~~ 406 (409) T protein:vir:16 331 RSLGAGLLNVAYLAACLRDD---VPYLREQFSKTKPKWEPLFEADASMLSLIGDGAIKLNQAIPEFINKDT-IRDLTGIK 406 (409) T ss_pred HHHHHHHHHHHHHHHHHhcC---CCccchhhccceEEecCCCCcchhhHHHHHHHHHHHHhhcccccchhH-HHHhccCC Confidence 99999999999988876432 1222334467899999766544 6788899999999973 33455 45556998 Q ss_pred HHH Q lcl|NC_012753. 470 KEQ 472 (502) Q Consensus 470 dee 472 (502) +.+ T Consensus 407 ~~d 409 (409) T protein:vir:16 407 GAE 409 (409) T ss_pred CCC Confidence 865 No 71 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=100.00 E-value=2.1e-37 Score=221.70 Aligned_cols=463 Identities=11% Similarity=0.069 Sum_probs=281.6 Q ss_pred Hhhcc----cccchhhhhccc---cccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh Q lcl|NC_012753. 14 SNYVI----TNQSLNSITDHP---KIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL 86 (502) Q Consensus 14 ~~~~~----~~~~l~~i~~~~---~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~ 86 (502) |+|.- .++.|..-.+-. =...+..++.+|+.+..||.|++-.|.-...-+.....+.+.+|.. .+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~--------~~ 72 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNG--------EK 72 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhh--------HH Confidence 33311 112221111100 1133567889999999999997655542211111112233444544 45 Q ss_pred hhcCcceEee---------CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-----ceEEEEEcCCeE Q lcl|NC_012753. 87 VFNEQATIRV---------DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD-----QIRVSFVQATVF 152 (502) Q Consensus 87 l~~ep~~i~~---------~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~-----~~~i~~v~~~~~ 152 (502) +++.+-.|.+ .++.+++.|..+++.+.+..++.+.-.+|+++|+++|++.||++ ++++..++|.++ T Consensus 73 ~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~ 152 (527) T protein:vir:10 73 LIEAKMRFLGQGLKWEFSKKDAKVDDAIKVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTY 152 (527) T ss_pred hhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCccee Confidence 5555555444 35678889999999999999999999999999999999999953 699999999999 Q ss_pred EEEEEcCC--CeEEEEEEEEEEEeeCCCceEE-EEEE--EEEEeCC-----eEEEEEEE--EecCCccccCceeecc-cc Q lcl|NC_012753. 153 FPLQANTQ--DVSSAAIVTKSTKTEGQKVKYY-SLIE--FHEWNKE-----TYTISNEL--YESESKTIIGQRVPLS-TL 219 (502) Q Consensus 153 ~Pi~~d~~--~~~~~~~~~~~~~~~~~~~~~y-t~~E--~h~~~~~-----~~~I~~~l--~~~~~~~~lG~~v~l~-~~ 219 (502) ||++.+.+ .+..+.++..+...++....+. .++- .|+.++. +|.|.|.. |...+-+.. .+.|+. +- T Consensus 153 f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~-~e~p~~~~~ 231 (527) T protein:vir:10 153 FPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDR-PESPLEPDD 231 (527) T ss_pred eeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccc-cccccchhh Confidence 99865433 3544544433322222222122 1111 1222222 23444321 111000000 112221 00 Q ss_pred cc----CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhc Q lcl|NC_012753. 220 YE----DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIK 295 (502) Q Consensus 220 ~~----~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~ 295 (502) ++ +.+.+.....++.+|++.| + |....++.||.|+++++++++|+||.+.|+..+.++.+...|++-..+-. T Consensus 232 ~~~~~~~~~l~~lp~pi~fiPvV~~-~---t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~ 307 (527) T protein:vir:10 232 IKKLSTLTEEEPLPEQITTLPVFHF-R---GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPP 307 (527) T ss_pred hhhhcCceeeecccCCCCccceEee-c---CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccc Confidence 11 1111111234566777766 2 44567899999999999999999999999999999988888888433221 Q ss_pred cCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccc-cccccc Q lcl|NC_012753. 296 TEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSF-DGKSMK 374 (502) Q Consensus 296 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~-~~~~~~ 374 (502) . +..|.. .+..+.+.....-++ ++-+..++.---.+.+...++.+.+.|...++++..+||. +.++.. T Consensus 308 v--d~~G~~--------~~~~VgPG~iweL~e-~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~ 376 (527) T protein:vir:10 308 R--DSRGNM--------VPWTISPLGMVEHGQ-NNKIYRVNGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAVAE 376 (527) T ss_pred c--cccCCc--------CccccCCceeEecCC-CcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCc Confidence 1 111211 111222111111111 1122333332245668888889999999999999999993 445567 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_012753. 375 TATEVVSEQSDTYQMRNSIATLVEKSLKELVI--SILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVA 452 (502) Q Consensus 375 tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~--~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~ 452 (502) |+.+++.+.+.|.+++.+++..|+...+++.+ +..|+.....+...+......+.+.|.+.+|.|.++.++++.++++ T Consensus 377 SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~ 456 (527) T protein:vir:10 377 SGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNSEKRFNQLLQLWE 456 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHH Confidence 99999999999999999999999988887654 2234433333333444444578999999999999999999999999 Q ss_pred cCCCCHHHHHHhc---CCCCHHHHHHHHHHHHHhhhccc------CCCCCccccCCCCC Q lcl|NC_012753. 453 AGFAPKTMAIEKT---LNVTKEQAQEIYQKINDETMVST------DSFRTSEEVDIYGE 502 (502) Q Consensus 453 ~Gi~S~et~l~~~---~~~~deea~~el~ri~~E~~~~~------~~~~~~~~~~~~g~ 502 (502) +|++|.+||+.++ .+..+ +++|++||.++.+.+. ....+.-.+|..|- T Consensus 457 aGi~S~~tAv~~L~~~~g~eD--~E~E~~~I~~era~~a~a~a~A~~~~~a~~~~~~g~ 513 (527) T protein:vir:10 457 AGLIPAKKLTEELSKIMGFEL--TEEDFKQATEDKKTQGIAQAEAADPFGAQMAAEQGI 513 (527) T ss_pred cCchhHHHHHHHHHhccCCCC--hHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCC Confidence 9999999997776 34443 6778888887754321 11112222333332 No 72 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=100.00 E-value=2.4e-37 Score=221.33 Aligned_cols=463 Identities=11% Similarity=0.069 Sum_probs=281.6 Q ss_pred Hhhcc----cccchhhhhccc---cccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh Q lcl|NC_012753. 14 SNYVI----TNQSLNSITDHP---KIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL 86 (502) Q Consensus 14 ~~~~~----~~~~l~~i~~~~---~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~ 86 (502) |+|.- .++.|..-.+-. =...+..++.+|+.+..||.|++-.|.-...-+.....+.+.+|.. .+ T Consensus 1 ~~~~~~~~~~~~~~~~g~~~~p~~v~~~d~~Rl~aY~l~~~~y~n~~~~~~~~lrg~~~~~~r~~~~ps~--------~~ 72 (527) T protein:vir:10 1 MGQDKRQYGSTQQLRAGEANFPNAVTDFDKARLASYRLYEDMYLTNTSDYQVILRGGDEGDQRPIYVPNG--------EK 72 (527) T ss_pred CCccccccCCCcCcCCccccCcccCCHHHHHHHHHHHHHHHHhcCchhheeeecCCccccccceeeehhh--------HH Confidence 33311 112221111100 1133567889999999999997655542211111112233444544 45 Q ss_pred hhcCcceEee---------CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-----ceEEEEEcCCeE Q lcl|NC_012753. 87 VFNEQATIRV---------DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD-----QIRVSFVQATVF 152 (502) Q Consensus 87 l~~ep~~i~~---------~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~-----~~~i~~v~~~~~ 152 (502) +++.+-.|.+ .++.+++.|..+++.+.+..++.+.-.+|+++|+++|++.||++ ++++..++|.++ T Consensus 73 ~~~~~~~~~~~g~~~~~~~~~e~v~~~lr~~~~~e~l~~~~~~~~r~~~vlGDg~f~l~wD~~k~~~~R~~v~~~DP~~~ 152 (527) T protein:vir:10 73 LIEAKMRFLGQGLKWEFSKKDAKVDDAIRVLFDRENWEQKFESLKRWTEIRGDYVLLLIGDDEKDEGSRLSLHEVDPSTY 152 (527) T ss_pred hhCCcceeeccCccccccchhHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccCCCcCCCceEeecCccee Confidence 5555555444 35678889999999999999999999999999999999999953 699999999999 Q ss_pred EEEEEcCC--CeEEEEEEEEEEEeeCCCceEE-EEEE--EEEEeCC-----eEEEEEEE--EecCCccccCceeecc-cc Q lcl|NC_012753. 153 FPLQANTQ--DVSSAAIVTKSTKTEGQKVKYY-SLIE--FHEWNKE-----TYTISNEL--YESESKTIIGQRVPLS-TL 219 (502) Q Consensus 153 ~Pi~~d~~--~~~~~~~~~~~~~~~~~~~~~y-t~~E--~h~~~~~-----~~~I~~~l--~~~~~~~~lG~~v~l~-~~ 219 (502) ||++.+.+ .+..+.++..+...++....+. .++- .|+.++. +|.|.|.. |...+-+.. .+.|+. +- T Consensus 153 f~~ed~d~~~~v~~v~~~~~~~~P~d~~~~~~~ar~~~~~~~l~~~g~~~~~G~~~yt~~~w~lg~w~d~-~e~p~~~~~ 231 (527) T protein:vir:10 153 FPYEDPRYPGQVLGVYLVDEYPHPDSEKKNEKCARVQKYMKTLDDDGKPVPGGAIKYTEELYEPGKWDDR-PESPLEPDD 231 (527) T ss_pred eeeecCCCCCceeeEEEeeeccCCccccccceehhhhhhhhhcCcccccccCcceeeeeceeeccccccc-cccccchhh Confidence 99865433 3544544433322222222122 1111 1222222 23443321 111000000 112221 00 Q ss_pred cc----CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhc Q lcl|NC_012753. 220 YE----DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIK 295 (502) Q Consensus 220 ~~----~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~ 295 (502) ++ +.+.+.....++.+|++.| + |....++.||.|+++++++++|+||.+.|+..+.++.+...|++-..+-. T Consensus 232 ~~~~~~~~~l~~lp~pi~fiPvV~~-~---t~p~~~~~WG~S~La~ll~l~deLn~~~Td~s~is~~sG~Pi~~~tg~~~ 307 (527) T protein:vir:10 232 IKKLSTLTEEEPLPEQITTLPVFHF-R---GHPIMNAMFGRSGLAGLESLIASVNQTMTDEDLIMVFGGLGFYATDSAPP 307 (527) T ss_pred hhhhcCceeeecccCCCCccceEee-c---CCCccccccChhhHhHHHHHHHHHhhhhhHHHHHHHHhCCceeeeccccc Confidence 11 1111111234566777766 2 44567899999999999999999999999999999988888888433221 Q ss_pred cCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccc-cccccc Q lcl|NC_012753. 296 TEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSF-DGKSMK 374 (502) Q Consensus 296 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~-~~~~~~ 374 (502) . +..|.. .+..+.+.....-++ ++-+..++.---.+.+...++.+.+.|...++++..+||. +.++.. T Consensus 308 v--d~~G~~--------~~~~VgPG~iweL~e-~ak~~~v~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~~~ 376 (527) T protein:vir:10 308 R--DSRGNM--------VPWTISPLGMVEHGQ-NNKIYRVNGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAVAE 376 (527) T ss_pred c--cccCCc--------CccccCCceeEecCC-CcceeeccchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCcCc Confidence 1 111211 111222111111111 1122333332245668888889999999999999999993 445567 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_012753. 375 TATEVVSEQSDTYQMRNSIATLVEKSLKELVI--SILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVA 452 (502) Q Consensus 375 tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~--~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~ 452 (502) |+.+++.+.+.|.+++.+++..|+...+++.+ +..|+.....+...+......+.+.|.+.+|.|.++.++++.++++ T Consensus 377 SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L~aye~v~~~d~~~~~~v~ivf~p~lP~D~~avie~v~tL~~ 456 (527) T protein:vir:10 377 SGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWLPAYEGVGIDDADKKLTVTITFRDPKPVNNEKRFAQLLELWE 456 (527) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHhhhcccCCCccccceEEEecccCCCCHHHHHHHHHHHHH Confidence 99999999999999999999999988887654 2234433333333444444578999999999999999999999999 Q ss_pred cCCCCHHHHHHhc---CCCCHHHHHHHHHHHHHhhhccc------CCCCCccccCCCCC Q lcl|NC_012753. 453 AGFAPKTMAIEKT---LNVTKEQAQEIYQKINDETMVST------DSFRTSEEVDIYGE 502 (502) Q Consensus 453 ~Gi~S~et~l~~~---~~~~deea~~el~ri~~E~~~~~------~~~~~~~~~~~~g~ 502 (502) +|++|.+||+.++ .+..+ +++|++||.++.+.+. ....+.-.+|..|- T Consensus 457 aGiiS~etAv~~L~~~~g~eD--~E~E~~~I~~era~~a~a~a~a~~~~~a~~~~~~g~ 513 (527) T protein:vir:10 457 AGLIPAKKLTEELSKIMGFEL--TEEDFRQATEDKKTQGIAQAEAADPFGAQMAAEQGI 513 (527) T ss_pred cCchhHHHHHHHHHhccCCCc--hHHHHHHHHHHHHHHhHHhhhhcCchhhhhccccCC Confidence 9999999997776 34443 6778888887754322 11112222333332 No 73 >protein:vir:7430 Length: 563 # NCBI annotation: gp7 # Family: family:all:6920 # MgeID: mge:147 # MgeName: Barnyard # Cross-refs: genbank:acc:NP_818545;genbank:gi:29566982;genbank:GeneID:1260216 Probab=100.00 E-value=3.6e-37 Score=220.37 Aligned_cols=464 Identities=11% Similarity=0.038 Sum_probs=274.5 Q ss_pred Hhhc-----ccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecc--hHHHHHHHHhhh Q lcl|NC_012753. 14 SNYV-----ITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLP--IGRTASKKVASL 86 (502) Q Consensus 14 ~~~~-----~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n--~~k~iv~~~a~~ 86 (502) |+|. ++...|-.--..|=...+..++.+|+.+..||.|+++.+.... .+ +.+..++ .++++|++++.+ T Consensus 1 m~~~~~q~~p~~~~fp~~~a~wV~~~D~~RlaaY~ly~d~y~n~~~el~~il-~G----~dr~~~~~ps~r~~V~~~~~~ 75 (563) T protein:vir:74 1 MPYNHKQYDPAKPFLRGGDDNIVDENDKNRVRAYDLYENIYLNSAETLKLVL-RG----DDSVPILMPSGRKIVEAVHRF 75 (563) T ss_pred CCccccccCCCcccccccccccCCHHHHHHHHHHHHHHHhhcCchhhhhhhc-CC----CceeeeccchHHHHHHHHHHh Confidence 3331 1112122222222234566899999999999999987654321 11 1234444 677999997755 Q ss_pred hhcCcceEeeCC--------HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-----CceEEEEEcCCeEE Q lcl|NC_012753. 87 VFNEQATIRVDN--------EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-----DQIRVSFVQATVFF 153 (502) Q Consensus 87 l~~ep~~i~~~d--------~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-----~~~~i~~v~~~~~~ 153 (502) | |++..|+|.. +.++.+|..+++.+++..++..+..+|.++|+++|++.||+ +++++..++|.++| T Consensus 76 L-g~~~~~~Ve~~~~de~~~~avq~~Lr~~~~~e~l~~~~~~~~r~a~vlGDgvf~l~wDp~K~~g~R~rv~~vDP~~~f 154 (563) T protein:vir:74 76 L-GVGFDYLVEPDMGDEGIRQSLNAYFRTTFKREAIKAKFTSNKRWGLIRGDAHFYIHADPNKKAGERISVDEVDPRQIF 154 (563) T ss_pred c-CCCcEEecCccccCcchHHHHHHHHHHHHHHhhhHHHHHHHHHhhhhhcceeEEEeeccccccCCCceEeecCCceee Confidence 5 9999998853 23567889999999999999999999999999999999995 37899999999999 Q ss_pred EEEEcCCCeEEEEEEE---EEEEe-eCCCceEEEEEEEEEEeCCeE---EEEEEEEecCCccccCce--eec-------- Q lcl|NC_012753. 154 PLQANTQDVSSAAIVT---KSTKT-EGQKVKYYSLIEFHEWNKETY---TISNELYESESKTIIGQR--VPL-------- 216 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~---~~~~~-~~~~~~~yt~~E~h~~~~~~~---~I~~~l~~~~~~~~lG~~--v~l-------- 216 (502) |+ .+.+.......+. .+... +.+++..-++--.++|++.+- .+.+.+ +...+|.- -++ T Consensus 155 p~-~dpd~v~g~~~v~v~~~~~~pdd~~~~~~r~~~~~~~lndeg~~~~~~~~da----e~w~lg~wd~r~~~~~~~~~~ 229 (563) T protein:vir:74 155 LI-EDGSTVVGFHMVDIVQDFRSPDDPSKKLARRRTFRRVRNDEGMFTGRISSEL----THWTLGNWDDRGAISDEQARR 229 (563) T ss_pred ec-cCCCCcccceeeecccCCCCCcchhccceeeeeeeeeeCCCCCccceeeecc----chhccccccccCccchhhhcc Confidence 94 4444442221111 11111 222222212212455665432 232211 00111100 000 Q ss_pred -ccccc---CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechH Q lcl|NC_012753. 217 -STLYE---DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQ 292 (502) Q Consensus 217 -~~~~~---~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~ 292 (502) ..++. +.+......-++.+|++.|+ |-...+++||.|+++++++++++||.+.|+.++.+..+...|+|-++ T Consensus 230 ~~~~~~~~~d~e~~~LP~pi~~iPiv~~~----tip~~~s~WG~S~La~ll~~~~eLn~~~Td~s~i~~~tG~pi~vl~~ 305 (563) T protein:vir:74 230 KEQVRSAQHDEEEEELPEPISQLPLYRWR----NKPPQNSSWGTSQLEGMETLAYALNQSLTDEDATIVFQGLGMYVTNA 305 (563) T ss_pred cchhhhhhhhchhhhccccccCccEEEcC----CCCCcccccchhhHHHHHHHHHHHhhhhhHHHHHHHhcCCCeEEecc Confidence 00000 11111111223455655542 44567899999999999999999999999999999999999999443 Q ss_pred HhccCCCCCCcccCccccccccchhhccc--cCCCCccccceeeeccccchHHHHHHHHHHHH-HHHHhcCCChhhcc-- Q lcl|NC_012753. 293 MIKTEYDTNGEKVTVKREFETGHNVYEQF--DSGDMDKGIGITDLTTDIRSDDYIKAINKGLS-LFEMQLGVSTGMFS-- 367 (502) Q Consensus 293 ~l~~~~~~~g~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~-~i~~~~g~s~~~~~-- 367 (502) .-..+ |..-. ..+.++.+.. ...+......+..++.---++.+..+++.+.. .+....+++..+|| T Consensus 306 ~~p~d----~~~g~-----~~~w~vgpG~i~El~~~~~~g~l~~v~g~~~l~~~q~Hm~~l~eral~~~s~tPavA~G~v 376 (563) T protein:vir:74 306 SAPVD----PNTGE-----LTDWNIGPMQIVEIAGNRNDNYFERVSGVQDVSPFQDHMKWIDEKGIAEGSGTPEVAIGRV 376 (563) T ss_pred ccccc----ccccc-----ccccccCCceeEeccCCccccceeeecchhhhHHHHHHHHHHHHHHHHhhccCcceeeccc Confidence 32222 21100 1111221111 11111112234444433233444444555555 45666789999999 Q ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHh-hcc-----cCCCccccc--ceEEEeCCC Q lcl|NC_012753. 368 FDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKEL----VISILELAKV-YNL-----YTGEIPTMD--EVSVDLDDG 435 (502) Q Consensus 368 ~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l----~~~il~~~~~-~~~-----~~~~~~~~~--~i~v~f~d~ 435 (502) +.+. ..|+.+++.+.+.|.++++.+++.+..++.++ ++..|..... +.. ..|+..... .++|+|.+. T Consensus 377 D~~~-~~SGiALeL~L~PL~a~~~ek~l~l~~~mr~~r~~~~~~lL~~~erl~~~g~~~~~~g~~~~~~~~~v~ivf~p~ 455 (563) T protein:vir:74 377 DVTS-AESGISLELQLKPLLAANEEKELEMIVVMDQFLHDWMTMWLPAYESDFQEQDGSRPFASADLLNECSVVCIFADP 455 (563) T ss_pred cccc-ccchhhhhhhhhHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHhhhhcccccccccccCCceEEEEEeCCC Confidence 4443 66888999999999999999999998888883 3333311111 000 011111122 367889999 Q ss_pred ccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC--CCCHHHHHHHHHHHHHhhhcc----cCCCCCccccCCCCC Q lcl|NC_012753. 436 VFTDRNAEFDYWSKMVAAGFAPKTMAIEKTL--NVTKEQAQEIYQKINDETMVS----TDSFRTSEEVDIYGE 502 (502) Q Consensus 436 i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~--~~~deea~~el~ri~~E~~~~----~~~~~~~~~~~~~g~ 502 (502) +|+|.++.++++..++++||+|.+||+.++- ||...+|+.|.++|+.++-.. .-..+.+.++-..|+ T Consensus 456 ~P~d~~~vv~~~~tl~~aGiiSretAv~~L~~~g~~~pdae~e~~~ie~~~i~~~~~a~a~ad~~~~~~a~~~ 528 (563) T protein:vir:74 456 MPVNKTQVTQDTLLLQQAHLILRKMAVAKLRSIGWEYPEVDDQGNALTDDDIADMLLAEAEADASLGLSAMDN 528 (563) T ss_pred CCccHHHHHHHHHHHHHcCchhHHHHHHHHHhCCCCCCcHHHHHhhcCHHHHHHHHHHHhhccCcccceeccc Confidence 9999999999999999999999999966652 666655777777665442111 011112222222222 No 74 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=99.82 E-value=8.1e-18 Score=114.29 Aligned_cols=423 Identities=11% Similarity=0.050 Sum_probs=211.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHH---HHHhcCCCCc-------cccccCC--C---cc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDN---LRYFAGDFDS-------VTYRDSN--G---SQ 65 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~---~~~Y~g~~~~-------~~~~~~~--~---~~ 65 (502) |||-.+ .+++...-..| ++-|.|.... |.+.... . .+ T Consensus 1 m~V~~~----------------------------hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~r 52 (452) T protein:vir:94 1 MPIETK----------------------------HPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAY 52 (452) T ss_pred CCCCCc----------------------------CHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHH Confidence 332111 22333322222 2334443211 1111100 0 11 Q ss_pred ccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe--CCceE Q lcl|NC_012753. 66 VKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYID--GDQIR 143 (502) Q Consensus 66 ~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d--~~~~~ 143 (502) ..+-.. .|+++.+++.+++++|.+||++++.+. .+.++.=..-++++..+..++..++.+|.+++.|=+. +++|. T Consensus 53 l~rA~~-~n~~~~t~~~~~G~vf~k~p~~~~p~~--l~~~~~D~~G~~L~~~~~~~~~~~l~~G~~~ilVD~p~~g~rPy 129 (452) T protein:vir:94 53 KQRALF-YSITSKTLSALSGMVLDQPPVITHPDA--MSKYFEDQSGIQFYEVFTRAVEETLLMGRVGVFIDRPLTGGDPY 129 (452) T ss_pred HhhccC-CchHHHHHHHHhchhhcCCceecccHH--HHHHHhcccCCCHHHHHHHHHHHHHhcCeEEEEEeeccCCCceE Confidence 111112 499999999999999999999877543 2333222344689999999999999999999998664 45799 Q ss_pred EEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe-eCCCc---eEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc Q lcl|NC_012753. 144 VSFVQATVFFPLQANTQDVSSAAIVTKSTKT-EGQKV---KYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL 219 (502) Q Consensus 144 i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~-~~~~~---~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~ 219 (502) +..++|++++=...+..+....+.+++.... ++... +..+.+-.++.+++.|+++ +|+..... .....+ T Consensus 130 ~~~~~~~~Ii~W~~~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~--~~~~~~~~----~~~~~~- 202 (452) T protein:vir:94 130 ISVYTTENILNWEEDEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQIT--VHETQDGK----VWELAK- 202 (452) T ss_pred EEEechhhhcCccccccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEE--EEEccCCc----eeeecc- Confidence 9999999998754444333334444443332 22111 1111111122446667663 34422211 110000 Q ss_pred ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCC Q lcl|NC_012753. 220 YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYD 299 (502) Q Consensus 220 ~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~ 299 (502) .......-+.++.+||+++-.. .| +...|.|-|.++..+--+.-..-|++.+-+......+.+ +....+ T Consensus 203 --~~~~~~~~~~l~~IP~v~~~~~-~~----~~~~~~pPLl~LA~ln~~hy~~~sd~~~~l~~~~~P~l~----~~g~~~ 271 (452) T protein:vir:94 203 --TSTIQNVGVTMDYIPFFCITPS-GL----SMTPAKPPMIDIVDINYSHYRTSADLEHGRHFTGLPTPW----ITGAES 271 (452) T ss_pred --ceeecCCCcccceeEEEEEcCC-CC----CCCCCccchHHHHHHHHHHhcchhHHHHHHHHcccceeE----eecCcC Confidence 0000111245678888876422 11 123467777777777777777777777777665655655 333333 Q ss_pred CCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHH Q lcl|NC_012753. 300 TNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEV 379 (502) Q Consensus 300 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei 379 (502) ..+..+.++.. +..+..+...++-++++.- .+.+.+.++.+-+++... +...+...+.+..|+++. T Consensus 272 ~~~i~iG~~~~----------~~lpe~~~~~~yie~~g~~-i~~~~~~l~~le~~m~~~---Ga~ll~~~~~~~~s~ea~ 337 (452) T protein:vir:94 272 QSTMHIGSTKA----------WVIPEVAAKVGFLEFTGQG-LQSLEKALSEKQAQLASL---SARLIDNSTRGSEATETV 337 (452) T ss_pred CCceEeccccc----------ccCCCCCCcceEEccCchh-HHHHHHHHHHHHHHHHHH---HHHhhccCCCcchHHHHH Confidence 22222222111 1222212233443444322 344555555555544221 122233333344455554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceE--EEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 380 VSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVS--VDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 380 ~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~--v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) ....+........+...++.+|+++++.+..+... .....++ -+|... ..+ .+.++.+.+++.+|.|| T Consensus 338 ~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~g~--------~~~~~v~~n~dF~~~-~~~-~~~~~al~~~~~~G~is 407 (452) T protein:vir:94 338 KLRYMSETASLKSVTRAVEALLNKAYSCIMDMESM--------GGTLNIKLNSAFLDS-KLT-AAELKAWVEAYLSGGIS 407 (452) T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCC--------CCceEEEeccccccc-cCC-HHHHHHHHHHHhcCCCc Confidence 44444434445555556677777777766655321 1112232 334332 223 46788888999999999 Q ss_pred HHHHHHhc--CCCCHHHHHHHHHHHHHhhhcc----cCCCCCccccC Q lcl|NC_012753. 458 KTMAIEKT--LNVTKEQAQEIYQKINDETMVS----TDSFRTSEEVD 498 (502) Q Consensus 458 ~et~l~~~--~~~~deea~~el~ri~~E~~~~----~~~~~~~~~~~ 498 (502) .+|++..+ .++-+ ++.|.+++..|.... .+++.+++.-- T Consensus 408 ~~t~~~~L~~~gvl~--~~~e~~~i~~E~~~~~~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 408 KEIYIHALKVGKVLP--PPGESMGVIPDPPAPEPSPSNTPPNPSSKA 452 (452) T ss_pred HHHHHHHHHhCCCCC--CccCHHHHHHHhhccCcccCCCCCCCccCC Confidence 99987754 24433 122334455443322 23333322222 No 75 >protein:vir:93630 Length: 776 # NCBI annotation: Bcep22gp51 # Family: family:all:487 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944280;genbank:gi:38640357;genbank:GeneID:2658279 Probab=99.74 E-value=4.8e-17 Score=110.05 Aligned_cols=457 Identities=9% Similarity=0.068 Sum_probs=224.7 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) -.+++++..++++. +...++-.....+..+||.|+++.-.... .-....+..++.|.-+-+| T Consensus 44 ~~~~~~l~~~~~~~-----------------~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p~~~~N~i~~~i 105 (776) T protein:vir:93 44 VELHSRLLSYYRQE-----------------LSRQQDNRAEMAVDEDYYDNIQWSQDEID-ELKERGQAPTVYNVISQSV 105 (776) T ss_pred HHHHHHHHHHHHHH-----------------HhhchHHHHHHHHHHHHhCCCCCCHHHHH-HHHhcCCceEEecchHHHH Confidence 22444444444431 11223344566678899999876421111 1112334568899999999 Q ss_pred HHHhhhhhcCcceEeeC-----CHHHHH----HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC----CceEEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVD-----NEVADA----FINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG----DQIRVSFV 147 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~-----d~~~~e----~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~----~~~~i~~v 147 (502) +...++.....+.+.+. |.+..+ .++.+.+.+++......++.++++.|.+|+.++||. +.++..++ T Consensus 106 ~~v~g~~~~nr~~~~~~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~~~af~d~~~~G~G~~~v~~d~~~~~~~~~~~~~ 185 (776) T protein:vir:93 106 NWIIGSEKRGRSDFKVLPRRKDGGKAAERKTALLKYLSDVNHTPFERSMAFEETTKAGIGWLESQVQDENDGEPIYAGAE 185 (776) T ss_pred HHHHHHHHhCCcceEEecCChhHHHHHHHHHHHHHHHHHhhcHHHHHHHHHHHhhhcCcceEEEEeeccCCCCceEeecc Confidence 99999999888777662 334444 555566678999999999999999999999999974 34667788 Q ss_pred cCCeEEEEE-EcCCCeEEEEEEEEEEEee--------------------------------------------------- Q lcl|NC_012753. 148 QATVFFPLQ-ANTQDVSSAAIVTKSTKTE--------------------------------------------------- 175 (502) Q Consensus 148 ~~~~~~Pi~-~d~~~~~~~~~~~~~~~~~--------------------------------------------------- 175 (502) +|..+|+=. ...-++..|-|+.+..+.+ T Consensus 186 ~p~~i~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 265 (776) T protein:vir:93 186 SWRNILWDSTYRRLDMDDCRYIFRVKWVDLDVMLAIFPERAAQLRAAAVDNFETWGTDDIDGDDAMDSPEYERSMNSVTA 265 (776) T ss_pred ChhheeeccccccCCHHHHhhhhhhccCCHHHHHHhcCCchHHHHHhhhhcccccchhcccccccccccccccccccccc Confidence 998877511 1111223333332211110 Q ss_pred ---CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCc----------------eeecccc---------ccC---CC Q lcl|NC_012753. 176 ---GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQ----------------RVPLSTL---------YED---LE 224 (502) Q Consensus 176 ---~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~----------------~v~l~~~---------~~~---l~ 224 (502) .......+++|+|..... ...++.+.+++.-+. .+.+... +.+ |. T Consensus 266 ~~~~~~~~~v~v~E~~~r~~~----~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~g~~~~~~~~~~~v~~~~~~g~~~l~ 341 (776) T protein:vir:93 266 GAVAYARKRVRMIEAWFRMPV----RVQRLKGRNSDFRGEVFDPNDERHVLEVESGRAVLAVSPMMRMHCAIMTTRDLMW 341 (776) T ss_pred cccccCCCeEEEEEEEEeeee----ehhhcccccccccceeecccchHHHHHhhcCceeehheeeeeeEEEEEecchhhh Confidence 000112333444321110 001111111110000 0000000 000 00 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~ 304 (502) .....-..++.||+++. ......+.+|.|++..+++.++.+|...|++.+-+ +..+++++++.+.......... T Consensus 342 ~~~~p~~~~~~Pfv~~~----~~~~~~~~~~~G~v~~~~d~Q~~~N~~~s~~~~~l--~~~~~~~~~gav~~~d~~~~~~ 415 (776) T protein:vir:93 342 AGPSPYRHNRYPFTPIW----GFRRARDGMPYGVIRFMRGMQDDVNKRLSKALYIL--STNKVLMEEGAVDDIDEFRREA 415 (776) T ss_pred ccCCCCCCCccceEEec----CceecccccccchHHhhhHHHHHHHHHHHHHHHhh--cCCceeeccccccchHHHHHhc Confidence 00000011345666553 23345667899999999999999999999998865 4556888766654321100000 Q ss_pred cCccccccccchhhccccCCCCcc-ccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHH Q lcl|NC_012753. 305 VTVKREFETGHNVYEQFDSGDMDK-GIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQ 383 (502) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~ 383 (502) .+++ .++. + ..+.. ...++ ..+.+. .++...++.....+...+|++...+|..++ ..|+.++..+- T Consensus 416 ~rp~-------~vi~-~--~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~i~~~tGi~~~~~G~~~n-~~Sg~ai~~~~ 482 (776) T protein:vir:93 416 ARPD-------AVMT-V--KNGKLGAVKMD-VDRDLA-PAHLELASRSIQMIQQVGGVTDEMLGRTTN-AVSGVAIQARQ 482 (776) T ss_pred ccCC-------ceee-e--CCccccccccc-cCcCcc-HHHHHHHHHHHHHHHHhhCcChHHhCCCcc-hhhHHHHHHHH Confidence 0111 1100 1 11111 11121 233443 568888999899999999999999997654 45788888777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----c---ccCCC-cccc----------------cceEEEeCCCccCC Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVY----N---LYTGE-IPTM----------------DEVSVDLDDGVFTD 439 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~----~---~~~~~-~~~~----------------~~i~v~f~d~i~~d 439 (502) .........+.+.|..+++++.+.++.+..-+ . +.+.. ...+ .++.|.=..+.+.= T Consensus 483 ~~~~~~~~~~~dn~~~~~~~~~~~~l~li~~~~~~~r~~ri~~~~~~~~~v~in~~~~~nd~~~~~~dv~v~~~~~~~s~ 562 (776) T protein:vir:93 483 EQGSVATNKLFDNLRLAFQQHGEKELSLIEQYMTEEKQFRITNSRGNPEYVTVNDGLPENDITRTKADFIIDEAEWRATM 562 (776) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcceEEEEeecCCCcceEEecccchhhhhccceeeEEEeecccchhH Confidence 77777777777777778888777777665432 1 11110 0000 01111111111111 Q ss_pred HHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccc----------------- Q lcl|NC_012753. 440 RNAEFDYWSKMVAAGFAPKTM------AIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEE----------------- 496 (502) Q Consensus 440 ~~~~~~~~~~~~~~Gi~S~et------~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~----------------- 496 (502) .++..+.++++. +.+.... .+..+-++.. +++.++++++......+....... T Consensus 563 r~~~~~~l~ql~--~~~~p~~~~~~~~~~~e~~d~p~--~~e~~~~l~~~~~~~~p~q~~~~~e~~~~qq~q~~~~q~q~ 638 (776) T protein:vir:93 563 RQAAVAELMEVI--GKMPPEIALTMLDLLVENMDIPN--RDELVKRIRAVNGQKDPDQDEPTPEEIAREQAQQQQQQYND 638 (776) T ss_pred HHHHHHHHHHHH--hhcChhhHHHHHHHHHHhcCccc--hHHHHHHHHHhhcccccchhhcchhHHHHHHHhhHHHHHHH Confidence 222333333333 2222221 1122222211 122223333222211111100000 Q ss_pred -----------cCCCCC Q lcl|NC_012753. 497 -----------VDIYGE 502 (502) Q Consensus 497 -----------~~~~g~ 502 (502) +...-. T Consensus 639 ~~~~a~~~~~qa~a~~~ 655 (776) T protein:vir:93 639 ALAIATLEEQQAKARKA 655 (776) T ss_pred HHhhhhhhHhhHHHHHH Confidence 000000 No 76 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=99.74 E-value=5.6e-16 Score=104.21 Aligned_cols=437 Identities=8% Similarity=-0.014 Sum_probs=216.0 Q ss_pred ccccchhhhhccccccCCHHHHH---HHHHHHHHhcCCCC-------cccccc--CCCcccccc--ceecchHHHHHHHH Q lcl|NC_012753. 18 ITNQSLNSITDHPKIAISPEEYN---RIMDNLRYFAGDFD-------SVTYRD--SNGSQVKRD--FNHLPIGRTASKKV 83 (502) Q Consensus 18 ~~~~~l~~i~~~~~~~~~~~~~~---~i~~~~~~Y~g~~~-------~~~~~~--~~~~~~~~~--~~~~n~~k~iv~~~ 83 (502) .....++++. ...+++.. +.+..+..|.|... -|.+.. .......+. -.-.|+++.+++.+ T Consensus 1 m~~~~~~~v~-----~~h~~y~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l 75 (513) T protein:vir:97 1 MADKDPKSPA-----TTSGAYDQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTL 75 (513) T ss_pred CCCCCCCCCC-----cCCHHHHHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHH Confidence 1112222211 11344433 33444455666411 111111 111111111 12248999999999 Q ss_pred hhhhhcCcceEeeCCHHHH-HHH-HHH-HhhccHHHHHHHHHHHHhhcCCEEEEEEEeC--C-----------------c Q lcl|NC_012753. 84 ASLVFNEQATIRVDNEVAD-AFI-NET-LKNDKFSKNFERYLESCLALGGLAMRPYIDG--D-----------------Q 141 (502) Q Consensus 84 a~~l~~ep~~i~~~d~~~~-e~l-~~~-~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~--~-----------------~ 141 (502) ++++|.+||+++.+..... +.| +++ ..-++++..++.++..++.+|.+++.|=+.. + + T Consensus 76 ~G~vf~k~p~~~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~r 155 (513) T protein:vir:97 76 SGKPFSEPIKLNEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKALCHVLIDMPRPAPREDGQPRTLADDRREGLR 155 (513) T ss_pred hhhhhhcCcccCcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCCCccchhHHhHHHHHhhccC Confidence 9999999998865443322 222 222 1225789999999999999999998883321 1 3 Q ss_pred eEEEEEcCCeEEEEEEc--CCC-eEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc Q lcl|NC_012753. 142 IRVSFVQATVFFPLQAN--TQD-VSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST 218 (502) Q Consensus 142 ~~i~~v~~~~~~Pi~~d--~~~-~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~ 218 (502) |-+..++|++++=...+ .++ ....+.+.+....++.-+. +..-.+..++.+.+++....-.+.. ..+..++.. T Consensus 156 Py~~~~~~e~IinW~~~~v~G~~~L~~v~l~E~~~~~Dgf~~-~~~~q~rvL~~g~~~v~r~~~~~~~--~~~e~~~~~- 231 (513) T protein:vir:97 156 PYWVMIKPECLLFARSEVINGVEVLQHVRIIEHYMEQDGFAE-VCKRRIRVLEPGLVQLWEPVKKSNA--QKEEWALAD- 231 (513) T ss_pred ceEEEecHhhhcCcceeccCcceeeeeEEEEEEEeecCCCcc-eEEEEEEEEeCceEEEEEeecCCCc--cccceEEec- Confidence 77999999999874322 222 2223333333332222111 1111223466676665432111111 111111100 Q ss_pred cccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCC Q lcl|NC_012753. 219 LYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEY 298 (502) Q Consensus 219 ~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~ 298 (502) ..-++++.+||+++-.. .| +..-|.|-|-++..+-.+.=...|++-+-+......+.+ +.... T Consensus 232 --------~g~~~l~~IP~v~~~~~-~~----~~~~~~pPLl~LA~ln~~hy~~~Sd~~~il~~~~~P~l~----~~G~~ 294 (513) T protein:vir:97 232 --------EWATGLNYVPLVTFYAD-RQ----GFMMGKPPLLDLAHLNVAHWQSASDQRHILTVSRFPILA----CSGAS 294 (513) T ss_pred --------CCCCcCCceeEEEEecC-CC----CCCCCccchHHHHHHHHHHHhhhhhHHHHHHhcccceee----eecCC Confidence 01245778898887532 11 222345555555555554445555555555544444444 22111 Q ss_pred CC--CCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccH Q lcl|NC_012753. 299 DT--NGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTA 376 (502) Q Consensus 299 ~~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tA 376 (502) +. ++..+.++ .. +..+..+...++-+++..- .+.+.+.++.+.+++. ..|. ..+ ...++++|| T Consensus 295 ~~~~~~i~iG~~--------~~--~~lpe~~~~~~yie~~g~~-i~~~~~~l~~le~qm~-~~Ga--~ll-~~~~~~~Ta 359 (513) T protein:vir:97 295 GEDSDPVVVGPN--------KV--LYNPDPAGRFYYVEHTGQA-IAAGRTDLKDLEEQMA-GYGA--EFL-KRKTGGQTA 359 (513) T ss_pred cCCCCceEeecc--------cc--ccCCCCCCcceeeccCchh-HHHHHHHHHHHHHHHH-HHHH--Hhh-ccCCccccH Confidence 11 11111111 10 1122112333444444321 2344555555555552 2222 222 234567899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_012753. 377 TEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFA 456 (502) Q Consensus 377 tei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~ 456 (502) ++.+...+........+...++.+|++.++.+..+.... .......++-+|..... ..+.++.+.++..+|.| T Consensus 360 ~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~wlg~~-----~~~~~v~in~dF~~~~~--~~~~~~al~~a~~~G~i 432 (513) T protein:vir:97 360 TARALDSAEATSDLSAMTGLFEDALAQALDITADWLRLG-----PNGGTVELVKDYDLEEM--DAPGLQALQVAREKRDI 432 (513) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-----CCccEEEeccccCcccC--CHHHHHHHHHHHhCCCC Confidence 999988888888888899999999999988877664210 00011223334543322 24567888899999999 Q ss_pred CHHHHHHhcC--CC-----CHH-HHHHHHHHHHHhhhcc----cCCCCCcc-ccCCCCC Q lcl|NC_012753. 457 PKTMAIEKTL--NV-----TKE-QAQEIYQKINDETMVS----TDSFRTSE-EVDIYGE 502 (502) Q Consensus 457 S~et~l~~~~--~~-----~de-ea~~el~ri~~E~~~~----~~~~~~~~-~~~~~g~ 502 (502) |.+|++..+- ++ +++ +.+++.+||.+..... .+...++. +++..|| T Consensus 433 s~~t~~~~L~r~gvl~~d~d~~~~~e~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 491 (513) T protein:vir:97 433 SRKTYLNGLRLRGVLPEDFDEDEDWEELMEEISEAMGRAGLDLDPAQKNPPEGGEGEGE 491 (513) T ss_pred CHHHHHHHHHhccCCCccCCHHHHHHHHHHhhhhccCCCCccccccCCCCCCCCCCCCC Confidence 9999877542 33 333 3345555555443221 11111222 2233333 No 77 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=99.70 E-value=1.3e-14 Score=96.65 Aligned_cols=448 Identities=10% Similarity=0.080 Sum_probs=195.7 Q ss_pred CC-hh------HHHHHHHHHHhhc-ccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccccccee Q lcl|NC_012753. 1 MG-II------QTIKNFIKRSNYV-ITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNH 72 (502) Q Consensus 1 m~-~~------~~ik~~i~~~~~~-~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~ 72 (502) |+ |= ..++.-++.+-.. .++..++.-...+-+..+.+...+ +.+.-| ..+..+- .- T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~--E~~~~Y-------------~~rl~rA-~~ 95 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDE--EQRRRY-------------ETYLQRA-IF 95 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCc--CCHHHH-------------HHHHhhc-cC Confidence 65 21 1111111111110 011111110000100000000000 000000 0011111 12 Q ss_pred cchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHH-HhhccHHHHHHHHHHHHhhcCCEEEEEEEe-CC---------- Q lcl|NC_012753. 73 LPIGRTASKKVASLVFNEQATIRVDNEVADAFINET-LKNDKFSKNFERYLESCLALGGLAMRPYID-GD---------- 140 (502) Q Consensus 73 ~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~-~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d-~~---------- 140 (502) .|+++.+++.+++++|.++|++++. +....+++++ ..-+++...+..++..++.+|.+++.|=+. .+ T Consensus 96 ~n~~~~tl~~l~G~vfrk~p~~~~p-~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~iLVD~P~~~~~~t~ade~~ 174 (535) T protein:vir:80 96 YNVTARTLDGMMGQVFSRDPIRQLP-PALEAIVEDIDGEGVSLDQQAKKALGYTMGFGRAAIFTDYPNVGRPVTVLEQKL 174 (535) T ss_pred CChhHHHHHHHhchhhcCCcceecc-HHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCcccHHHHHh Confidence 4899999999999999999988664 2233333222 112478899999999999999999988432 11 Q ss_pred ---ceEEEEEcCCeEEEEEEcC--CC-eEEEEEEEEEEEeeCC-----CceEEEEEEEEEEeCCeEEEEEEEEecCCccc Q lcl|NC_012753. 141 ---QIRVSFVQATVFFPLQANT--QD-VSSAAIVTKSTKTEGQ-----KVKYYSLIEFHEWNKETYTISNELYESESKTI 209 (502) Q Consensus 141 ---~~~i~~v~~~~~~Pi~~d~--~~-~~~~~~~~~~~~~~~~-----~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~ 209 (502) +|-+..++|++++=...+. ++ ....+.+.+.+..+++ ....|+.++. -.++.|+++ +|+.+.... T Consensus 175 ~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~--~~~G~y~v~--~~~~~~~~~ 250 (535) T protein:vir:80 175 GLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQL--NAEGNYQVE--RWRRETQEE 250 (535) T ss_pred cCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEe--cCCceEEEE--EEEeecCCc Confidence 3889999999998743332 22 2333334444333221 1123333332 134567765 343222111 Q ss_pred cCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeee Q lcl|NC_012753. 210 IGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV 289 (502) Q Consensus 210 lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v 289 (502) . ...... ........+.++.+||+++... .| +...|.+-|-++..+--+.=..-|++.+-+......+.+ T Consensus 251 ~--~~~~~~---~~~~~~g~~~l~~IPfv~~~~~-~~----~~~~~~pPLl~LA~lni~Hy~~ssd~~~il~~~~~P~l~ 320 (535) T protein:vir:80 251 M--YYSYSK---HVPTDGNGNPFKEIPFQFIGPL-DN----NADIDHPPLLDLCEVNIGHYRNSADYEEMAFVAGQPTAF 320 (535) T ss_pred c--ccccce---eecccCCCcccCeeEEEEeecC-CC----CCCCCccchHHHHHHHHHHhhchhHHHHHHHHhcCceee Confidence 0 000000 0111112255778888877422 11 122334444444433222222223333333333333332 Q ss_pred chHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccc Q lcl|NC_012753. 290 PTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD 369 (502) Q Consensus 290 ~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~ 369 (502) +....+...............-+. .+..+. +...++-.+++.--. .+.++.+..++.. ++...+. . T Consensus 321 ----i~G~~~~~~~~~~~~~~i~iG~~~--~~~lP~-~~~~~~~e~~~~~~a---~~~l~~~e~qM~~---lGa~ll~-~ 386 (535) T protein:vir:80 321 ----FTGLTKDWVEDVFKDFKVHLGSRA--IIPLPQ-GATAGILQITPNSVP---FEAMTHKESQMIA---MGANLLV-K 386 (535) T ss_pred ----eecCchhhhhcCCCCcceEecCcc--cccCCC-CCCcceeeeccchhH---HHHHHHHHHHHHH---HHHHhhc-c Confidence 111000000000000000000011 111222 223344445443211 2233333333222 1222232 2 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccce----EEEeCCCccCCHHHHHH Q lcl|NC_012753. 370 GKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEV----SVDLDDGVFTDRNAEFD 445 (502) Q Consensus 370 ~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i----~v~f~d~i~~d~~~~~~ 445 (502) ..++.||++.+...+........+...++.+|+++++.+..+.. .......+ +-+|... ..+ .+.++ T Consensus 387 ~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w~G-------~~~~~~~~~i~~n~dF~~~-~ld-~~~~~ 457 (535) T protein:vir:80 387 SGGNRTFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQFQT-------GIVNDETVEYNLNTDFPAA-RLT-PNERA 457 (535) T ss_pred CcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcC-------CccCCCceEEEeccccccc-cCC-HHHHH Confidence 34567888877666666677777788888888888877665421 11111222 2334332 223 35678 Q ss_pred HHHHHHhcCCCCHHHHHHhc--CCCC--HHHHHHHHHHHHHhhhcccCCCCCccc-----------------cCCCCC Q lcl|NC_012753. 446 YWSKMVAAGFAPKTMAIEKT--LNVT--KEQAQEIYQKINDETMVSTDSFRTSEE-----------------VDIYGE 502 (502) Q Consensus 446 ~~~~~~~~Gi~S~et~l~~~--~~~~--deea~~el~ri~~E~~~~~~~~~~~~~-----------------~~~~g~ 502 (502) .+.+++.+|.||.+|++..+ -++- +...++|..||+.|............+ +.--|. T Consensus 458 all~~~~~G~Is~et~~~~L~r~gvl~~~~~~eee~~ri~~E~~~~~~~~g~~~d~~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 458 ELILEWQQGAITFKEMRAGLRRAGVASEDDAKAETEGKATVEFIAKTAAAGKVGDAASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred HHHHHHhcCCCCHHHHHHHHHhCCCCCcccchHHHHHHHHhhhhhccccCCCCCCCCCCCCCcCcccCCccccccCCC Confidence 88899999999999987754 2432 112356778888874332211111111 111111 No 78 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=99.70 E-value=2.2e-14 Score=95.49 Aligned_cols=438 Identities=11% Similarity=0.047 Sum_probs=199.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCC-------cccccc---C----CCccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFD-------SVTYRD---S----NGSQV 66 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~-------~~~~~~---~----~~~~~ 66 (502) ||=+++. |+. -..+..+.+..++.+.|... -|.+.+ . ..... T Consensus 1 m~~V~~~---------------------hp~---y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~ 56 (501) T protein:vir:95 1 MPNVSFI---------------------RPE---LGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYE 56 (501) T ss_pred CCCCCCC---------------------CHH---HHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHH Confidence 5521110 000 01111222222222222210 011000 0 00011 Q ss_pred cccc--eecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHH-HhhccHHHHHHHHHHHHhhcCCEEEEEEEe--CC- Q lcl|NC_012753. 67 KRDF--NHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINET-LKNDKFSKNFERYLESCLALGGLAMRPYID--GD- 140 (502) Q Consensus 67 ~~~~--~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~-~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d--~~- 140 (502) .+.. .-.|+++.+++.+++++|.++|++++.. ....+++++ ..-+++...+..++..++.+|.+++.|=+. ++ T Consensus 57 ~rl~rA~~~n~~~~t~~~l~G~vf~k~p~~~~p~-~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~ 135 (501) T protein:vir:95 57 AYLKRAVFYNVARRTLFGLVGQVFMRDPVVKVPA-LLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGLLVDYPTTEAE 135 (501) T ss_pred HHhhccccCchHHHHHHHHhhhhhcCCcceeCcH-HHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeecCCCCc Confidence 1111 1248999999999999999999986432 233333332 112488999999999999999999988432 11 Q ss_pred -------------ceEEEEEcCCeEEEEEEcC--CC-eEEEEEEEEEEEeeCC-----CceEEEEEEEEEEeCCeEEEEE Q lcl|NC_012753. 141 -------------QIRVSFVQATVFFPLQANT--QD-VSSAAIVTKSTKTEGQ-----KVKYYSLIEFHEWNKETYTISN 199 (502) Q Consensus 141 -------------~~~i~~v~~~~~~Pi~~d~--~~-~~~~~~~~~~~~~~~~-----~~~~yt~~E~h~~~~~~~~I~~ 199 (502) +|.+..++|++++=...+. ++ ....+.+++.+...++ ....|+.++. -+++ +... T Consensus 136 ~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~--~~~g--~~~~ 211 (501) T protein:vir:95 136 GGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRL--DEEG--YYVH 211 (501) T ss_pred ccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEee--CCCc--eEEE Confidence 3789999999998743332 22 2333334444332221 1223333322 1223 3333 Q ss_pred EEEecCCcc-ccCceeeccccccCCCc---ceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHH Q lcl|NC_012753. 200 ELYESESKT-IIGQRVPLSTLYEDLEE---TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDE 275 (502) Q Consensus 200 ~l~~~~~~~-~lG~~v~l~~~~~~l~~---~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~ 275 (502) ++|+..... .-|..++........+. ...-+.++.+||+++-.. .|... -|.+ +|++-.+..++. T Consensus 212 ~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~~~~-~~~~~----~~~p------PLl~lA~lni~h 280 (501) T protein:vir:95 212 EIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFIGSE-NNDSN----PDNP------NFYDLASLNMAH 280 (501) T ss_pred EEEEecCCcccCcceecCCcccccceeeeeccCCCcCCeeeEEEEecC-CCCCC----CCcc------chHHHHHHHHHH Confidence 455532111 11222221111100000 011256778888876321 11111 1223 333333333333 Q ss_pred HHH--HHh----hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHH Q lcl|NC_012753. 276 FMW--EVK----MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAIN 349 (502) Q Consensus 276 ~~~--~~~----~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~ 349 (502) |.+ |++ .....+.+ +....+... ............+.. +..+ .+++.++-++++.--. .+.++ T Consensus 281 y~~ssd~~~~l~~~~~P~l~----i~G~~~~~~-~~~~~~~i~~G~~~~--~~lP-~~~~~~~ie~~~~~i~---~~~l~ 349 (501) T protein:vir:95 281 YRNSADYEESCYIVGQPTPV----LIGLTEEWV-TNVLKGSVNFGSRGG--IPLP-VGADAKLLQASENTML---KEAMD 349 (501) T ss_pred HhhhhHHHHHHHHcccceee----eeCCccccc-ccCCCCceeeccccc--ccCC-CCCceeEEecChhhHH---HHHHH Confidence 322 232 22222322 211111000 000000011111111 1111 1223344444443211 23344 Q ss_pred HHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceE Q lcl|NC_012753. 350 KGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVS 429 (502) Q Consensus 350 ~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~ 429 (502) .+.+++.. .| ...+ ..+.+++||++.....+........+...++.+|.++++.+..+... . .....++ T Consensus 350 ~l~~~m~~-~G--a~ll-~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w~g~---~----~~~~~v~ 418 (501) T protein:vir:95 350 TKERQMVA-LG--AKLV-EQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALKWAARWVGQ---A----DSGVKFE 418 (501) T ss_pred HHHHHHHH-HH--Hhhc-cCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHcCC---C----CCceEEE Confidence 44444322 12 1122 23446689999888888888888888888999999988877766321 1 1112233 Q ss_pred E--EeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhhhccc--CCCCCccccCCCCC Q lcl|NC_012753. 430 V--DLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEIYQKINDETMVST--DSFRTSEEVDIYGE 502 (502) Q Consensus 430 v--~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~el~ri~~E~~~~~--~~~~~~~~~~~~g~ 502 (502) + +|... ....+.++.+.+++.+|.||.+|++..+ -++.+++.++|.++|..|..... +.+.+...-.-+|. T Consensus 419 i~~df~~~--~~~~~~~~al~~~~~~G~is~~t~~~~L~~~~v~~~~~~~e~e~i~~~~~~~~~~~~~~~~~~~~~gg~ 495 (501) T protein:vir:95 419 LNTDFDIA--RMTPDERRSLVEEWQKGAITFEEMRTGLRKAGVATEDDSKAKEKIAKDTAEAMALATPANVPGDGSGGD 495 (501) T ss_pred Eecccccc--cCCHHHHHHHHHHHhCCCCcHHHHHHHHHhCCCCChhHHHHHHHHHhhhcCcccccccCCCCCCCcccc Confidence 3 23322 2234567888899999999999986644 35655445666677776654321 22222222222333 No 79 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=99.68 E-value=6.7e-15 Score=98.29 Aligned_cols=435 Identities=8% Similarity=0.019 Sum_probs=199.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccc--------cccCCC---cccccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVT--------YRDSNG---SQVKRD 69 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~--------~~~~~~---~~~~~~ 69 (502) |+|-+.=.++...... |++..+ .--..++....-|.=++.... +..... ...... T Consensus 14 m~V~~~hp~y~a~~~~-------------W~~~~d-~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~ 79 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQ-------------WLRNLD-CVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDL 79 (488) T ss_pred ecccccCHHHHHHhhh-------------hhHhhh-hhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhh Confidence 4433222222221111 110000 000111111111111100000 000000 000000 Q ss_pred ---c-eecchHHHHHHHHhhhhhcCcceEeeCCH-HHHHHHHHHH-hhccHHHHHHHHHHHHhhcCCEEEEEEEeC---- Q lcl|NC_012753. 70 ---F-NHLPIGRTASKKVASLVFNEQATIRVDNE-VADAFINETL-KNDKFSKNFERYLESCLALGGLAMRPYIDG---- 139 (502) Q Consensus 70 ---~-~~~n~~k~iv~~~a~~l~~ep~~i~~~d~-~~~e~l~~~~-~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~---- 139 (502) + .=.|+++.+++.+++++|.++|+++.++. ....+++++= .-++++..++.++..++.+|.+++.|=+.+ T Consensus 80 ~~~rA~~~n~~~~tl~~l~G~vfrk~p~~~~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~T 159 (488) T protein:vir:96 80 TWRLANYVNIVNPTMNAITGAVMRREPEFDTMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWGSRCGWLVRSHPESAT 159 (488) T ss_pred hhhccccCchhHHHHHHhcchhhccCceeccCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEecCCCcCC Confidence 1 12489999999999999999999988754 4444443321 125788999999999999999998885432 Q ss_pred --------CceEEEEEcCCeEEEEEEcC--CC-eEEEEEEEEEEEe-eCCCceEEEEEEEEEEeCCeEEEEEEEEecCCc Q lcl|NC_012753. 140 --------DQIRVSFVQATVFFPLQANT--QD-VSSAAIVTKSTKT-EGQKVKYYSLIEFHEWNKETYTISNELYESESK 207 (502) Q Consensus 140 --------~~~~i~~v~~~~~~Pi~~d~--~~-~~~~~~~~~~~~~-~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~ 207 (502) .+|-+..++|++++=...+. |+ ....+.+++.... ++.....-+.++.+++.++.|++.- +.. . T Consensus 160 ~ade~~~~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~--~~~--~ 235 (488) T protein:vir:96 160 MADWNKGKKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQE--VTD--D 235 (488) T ss_pred HHHHHHhcCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEE--Eec--C Confidence 14889999999998743332 22 2222333333322 2222112234555667777777642 221 1 Q ss_pred cccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhcccee Q lcl|NC_012753. 208 TIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRV 287 (502) Q Consensus 208 ~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i 287 (502) ...+..+|.. ..-+.++.+||+++... .| +...|.+-|-++..|--+.=..-|++-+-+......+ T Consensus 236 ~~~~e~~~~~---------~g~~~l~~IP~v~~~~~-~~----~~~~~~pPLldLA~lnl~Hy~~ssd~~~il~~~~~p~ 301 (488) T protein:vir:96 236 EYSDEWTPVL---------INSKQSDTIPFFLASSQ-SN----EWCIDSTPLTSLAEISLSIYVMNAYSNKAMILANEAK 301 (488) T ss_pred CcccceEeec---------CCCcccCeeEEEEEecC-CC----CCCCCCCchHHHHHHHHHHHhhhhHHHHHHHhcCCce Confidence 1112222211 01235677888877432 11 1112333333333321111111122211122222333 Q ss_pred eechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcc Q lcl|NC_012753. 288 AVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFS 367 (502) Q Consensus 288 ~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~ 367 (502) .| +...+...+..... -.....+..........+. +..++.....- ..+.++.+.+++.. . +...+. T Consensus 302 lv----~~~~~~~~~~~~~~---~~~g~~~~~~~~~~~~~g~--~~~~e~~~~~l-~~~~l~~l~~qm~~-~--Ga~l~~ 368 (488) T protein:vir:96 302 WM----VDMGDMNKTMASEM---NPLGFTLAGRMPYYVKNGD--VKVIQAQFSPE-TENKVEKLFEQAVK-V--GASLFT 368 (488) T ss_pred ee----eccCCCCccccccc---ccceeeecccccccccCCc--eeecCCchhHH-HHHHHHHHHHHHHH-H--hHhhcc Confidence 33 11111000000000 0001111111111111111 22223222211 13345555444421 2 222232 Q ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCC---ccCCHHHHH Q lcl|NC_012753. 368 FDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDG---VFTDRNAEF 444 (502) Q Consensus 368 ~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~---i~~d~~~~~ 444 (502) .++++||++.....+........+...++.+++++++.+..+... ..+.. ...+++|.-+.. ...+ ...+ T Consensus 369 --~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~g~---~~~~~-~~~~~~~~in~dF~~~~ld-~~~~ 441 (488) T protein:vir:96 369 --QQSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYFEG---TNLYV-NPDELVFKLNRDYFDVEVN-PQML 441 (488) T ss_pred --CCCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCC---CCCCc-CccceEEEeccCCCCccCC-HHHH Confidence 234579999888877777888888888999999988877765432 11111 122333332221 2223 4568 Q ss_pred HHHHHHHhcCCCCHHHHHHhcC--CC--CHHHHHHHHHHHHHhhhcc Q lcl|NC_012753. 445 DYWSKMVAAGFAPKTMAIEKTL--NV--TKEQAQEIYQKINDETMVS 487 (502) Q Consensus 445 ~~~~~~~~~Gi~S~et~l~~~~--~~--~deea~~el~ri~~E~~~~ 487 (502) +.+.++..+|.||.+|++..+- ++ .|-+.++|.+||+++-... T Consensus 442 ~al~~~~~~G~Is~~t~~~~L~~~gvl~~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 442 QVAYAAMMEGNLPQVSWFELLKRARVVRGDMSKEEFDEHIAELGFGM 488 (488) T ss_pred HHHHHHHhcCCCCHHHHHHHHHhCCcCCccCCHHHHHHHHhhcCCCC Confidence 8889999999999999877543 44 2223456777777543322 No 80 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=99.67 E-value=1.9e-15 Score=101.28 Aligned_cols=449 Identities=13% Similarity=0.102 Sum_probs=201.8 Q ss_pred CChh---HHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGII---QTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~---~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k 77 (502) |+-= +.+++.++.. + . -.+.+......++..||.|+...... . .+..++.+.-. T Consensus 10 ~~~~~~~~~~~~~~~~a------~------~----~~~~~~~~~~~~~~~~y~g~~~~~~~---~----~~s~~~~~~v~ 66 (705) T protein:vir:88 10 MDDEQVLRHLDQLVNDA------L------D----FNSSELSKQRSEALKYYFGEPFGNER---P----GKSGIVSRDVQ 66 (705) T ss_pred CCHHHHHHHHHHHHHHH------H------h----hhhhHHHHHHHHHHHHHhCCCCCccc---C----CCCccccHHHH Confidence 2211 2222222221 0 0 01122333556777899998554221 1 12233344444 Q ss_pred HHHHHHhh----hhhcCcceEee-----CCHHHHHHHHHH-----HhhccHHHHHHHHHHHHhhcCCEEEEEEEeC---- Q lcl|NC_012753. 78 TASKKVAS----LVFNEQATIRV-----DNEVADAFINET-----LKNDKFSKNFERYLESCLALGGLAMRPYIDG---- 139 (502) Q Consensus 78 ~iv~~~a~----~l~~ep~~i~~-----~d~~~~e~l~~~-----~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~---- 139 (502) ..|+.... .+|+.+..+.+ +|....+.+..+ .+.++....+..++.+|+..|.+++++||+. T Consensus 67 ~~v~~~~~~l~~~~~~~~~~~~~~p~~~~D~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~dal~~g~gi~kv~we~~~~~ 146 (705) T protein:vir:88 67 ETVDWIMPSLMKVFTSGGQVVKYEPDTAEDVEQAEQETEYVNYLFMRKNEGFKVMFDWFQDTLMMKTGVVKVYVEEVLKP 146 (705) T ss_pred HHHHHHHHHHHHhhcCCCceEEEeeCChhHHHHHHHHHHHHhHHHhhccchhHHHHHHHHHHhhcCCeEEEeccccccch Confidence 44444444 34454444444 344444444433 3345566778899999999999999999943 Q ss_pred ---------------------------------------------CceEEEEEcCCeEEEEEEcCCCeEEEEEEE-EEEE Q lcl|NC_012753. 140 ---------------------------------------------DQIRVSFVQATVFFPLQANTQDVSSAAIVT-KSTK 173 (502) Q Consensus 140 ---------------------------------------------~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~-~~~~ 173 (502) |.+++..|+|..|++= .+..+...+.|+. +++. T Consensus 147 ~~e~~~~~~~~~l~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~i~~V~p~d~~~d-p~a~~~~d~~~~~~~~~~ 225 (705) T protein:vir:88 147 TFERFSGLSEDMVADILSDPDTSILAQSVDDDGTYTIKIRKDKKKREIKVLCVKPENFLVD-RLATCIDDARFLCHREKY 225 (705) T ss_pred hhhhhccCChhhhhhhhhhhhhhcccccccccceeeeEEeeeeecCceeeeeccHHHceec-CCCCCcccCcEEEEEEec Confidence 5578899999998842 1222333443332 2211 Q ss_pred eeCCCc-eEE-----EEEEEEE---------------------------EeCCe-EEE-EEEEEecCCccccCceeeccc Q lcl|NC_012753. 174 TEGQKV-KYY-----SLIEFHE---------------------------WNKET-YTI-SNELYESESKTIIGQRVPLST 218 (502) Q Consensus 174 ~~~~~~-~~y-----t~~E~h~---------------------------~~~~~-~~I-~~~l~~~~~~~~lG~~v~l~~ 218 (502) ...+-. .+| ..+..+. +.+.. ..| .|+.|...+.+.-|...+..- T Consensus 226 t~~dl~~~g~~~~~~~~~~~~~~~~~~~~~e~~~~~~~d~~~~~~~~~~~~~~~~r~v~~~E~y~~~d~~~d~~~~~~~~ 305 (705) T protein:vir:88 226 TVSDLRLLGVPEDVIEELPYDEYEFSDSQPERLVRDNFDMTGQLQYNSGDDAEANREVWASECYTLLDVDGDGISELRRI 305 (705) T ss_pred cHHHHHhhcCChhHhhhhhcccccchhhhhhhccccccccccccccccccccCCceeEEEEEeeeEecccCCcceeeEEE Confidence 100000 000 0000000 00000 011 122222111111111111111 Q ss_pred cccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccC Q lcl|NC_012753. 219 LYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTE 297 (502) Q Consensus 219 ~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~ 297 (502) .+.+-. -.....++++||+.+.+ ....++.+|.|+++.++++++.+|..++++++.+.. ...++.|+.+++... T Consensus 306 ~~~g~~-il~~~~~~~~PF~~~~~----~p~~~~~~G~g~~~~~~d~Q~~~n~~~~~~~d~~~~~~~~~~~~~~g~v~~~ 380 (705) T protein:vir:88 306 LYVGDY-IISNEPWDCRPFADLNA----YRIAHKFHGMSVYDKIRDIQEIRSVLMRNIMDNIYRTNQGRSVVLDGQVNLE 380 (705) T ss_pred EEeCcc-ccccccCCCCCEEEecc----eeecCccccCChHHHHhHHHHHHHHHHHHHHHHHHhccCCceeccccccCcc Confidence 111100 00012245677876542 235578899999999999999999999999988753 555777776665321 Q ss_pred CCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccc---cccc Q lcl|NC_012753. 298 YDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDG---KSMK 374 (502) Q Consensus 298 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~---~~~~ 374 (502) . .....++..+ ....+ ..+..+++.-........++.+...+...+|++....|.++ .+.. T Consensus 381 d---~~~~~pg~vv----------~~~~~---~~i~~~~~~~~~~~~~~ll~~~~~~~~~~tGi~~~~~G~~~~~~~~~~ 444 (705) T protein:vir:88 381 D---LLTNEAAGIV----------RVKSM---NSITPLETPQLSGEVYGMLDRLEADRGKRTGITDRTRGLDQNTLHSNQ 444 (705) T ss_pred c---ccccCCCeeE----------EecCC---CccccccCCcCcHHHHHHHHHHHHHHHHhhCCchHHcCCCcccccchh Confidence 1 1111111110 00111 12333333222334566677777888889999988888543 3356 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhcccCCC-c-------c--------cccceEEEeCCCcc Q lcl|NC_012753. 375 TATEVVSEQSDTYQMRNSIATLVE-KSLKELVISILELAKVYNLYTGE-I-------P--------TMDEVSVDLDDGVF 437 (502) Q Consensus 375 tAtei~~~~~~l~~~~~~~~~~~~-~~l~~l~~~il~~~~~~~~~~~~-~-------~--------~~~~i~v~f~d~i~ 437 (502) ||+++.............+.+.|. .++++|++.++++..-+ +... . . ...++.++-..+ . T Consensus 445 Ta~~i~~~~~~~~~r~~~~~r~~a~~~~~~l~~~~~~li~~~--~~~~~~~ri~g~~v~v~~~~~~~~~~v~v~v~~~-~ 521 (705) T protein:vir:88 445 AAMSVNQLMTAAEQQIDLIARMFAETGVKRLFQLLHDHAIKY--QNQEEVFQLRGKWVAVNPANWRERSDLTVTVGIG-N 521 (705) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--CCCceEEeeccchhccchHhhccCCceEEeeccc-c Confidence 899888777777777777777774 57788887777765433 1111 0 0 001111111110 0 Q ss_pred CCHHHHHHHHHHHHhcCCCCHHHHHH---hcCC-CCHHHHHHHHHHHHHhhhcccCC--CCCccc-cCCCCC Q lcl|NC_012753. 438 TDRNAEFDYWSKMVAAGFAPKTMAIE---KTLN-VTKEQAQEIYQKINDETMVSTDS--FRTSEE-VDIYGE 502 (502) Q Consensus 438 ~d~~~~~~~~~~~~~~Gi~S~et~l~---~~~~-~~deea~~el~ri~~E~~~~~~~--~~~~~~-~~~~g~ 502 (502) .+.+...+.+..+. .....+. .+.+ .+..+..+.+.++.+........ ...+.. ...-.+ T Consensus 522 ~~~eq~~a~l~~ll-----~~~q~l~~~~~~~~~~~~~~~~~~~~el~e~~~~k~~~~~~~~~~~~e~~~~~ 588 (705) T protein:vir:88 522 MNKDQQMLHLMRIW-----EMAQAVVGGGGLGVLVSEQNLYNILKEVTENAGYKDPDRFWTNPNSPEALQAK 588 (705) T ss_pred chHHHHHHHHHHHH-----HHHHHhhcccchhhhcChHHHHHHHHHHHHhhhhhhHHHHhhhhhhHHHHHHH Confidence 11122222222211 1111111 1111 22222222222222111100000 000000 000000 No 81 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=99.60 E-value=4.2e-13 Score=88.47 Aligned_cols=435 Identities=11% Similarity=0.080 Sum_probs=202.8 Q ss_pred cccccchhhhhccccc-cCCHHHHHHHHHHH---HHhcCCCC------ccccccC-CCc--ccccc--ceecchHHHHHH Q lcl|NC_012753. 17 VITNQSLNSITDHPKI-AISPEEYNRIMDNL---RYFAGDFD------SVTYRDS-NGS--QVKRD--FNHLPIGRTASK 81 (502) Q Consensus 17 ~~~~~~l~~i~~~~~~-~~~~~~~~~i~~~~---~~Y~g~~~------~~~~~~~-~~~--~~~~~--~~~~n~~k~iv~ 81 (502) +....-. .-++ ...+++......|+ +-|.|..- -+.+... .+. ...+. -.-.|+++.+++ T Consensus 1 ~~~~~~~-----~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~ 75 (491) T protein:vir:95 1 MLTANGQ-----GSGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLS 75 (491) T ss_pred CcccCCc-----cCCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCChHHHHHH Confidence 1111111 1111 12344444444443 45777321 1111100 000 11111 122489999999 Q ss_pred HHhhhhhcCcceEeeCCHHHHHHHHHH-HhhccHHHHHHHHHHHHhhcCCEEEEEEEe--CC-----------ceEEEEE Q lcl|NC_012753. 82 KVASLVFNEQATIRVDNEVADAFINET-LKNDKFSKNFERYLESCLALGGLAMRPYID--GD-----------QIRVSFV 147 (502) Q Consensus 82 ~~a~~l~~ep~~i~~~d~~~~e~l~~~-~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d--~~-----------~~~i~~v 147 (502) .+++++|.+||++++.+. ...+++++ ..-++++..++.++..++.+|.+++.|=+. ++ +|-+..+ T Consensus 76 ~l~G~vfrk~p~~~~p~~-l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~ 154 (491) T protein:vir:95 76 GMVGSVMRKEPEINIPKE-LEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETAAATAAEQNAGLLNPTIAFY 154 (491) T ss_pred HHhchhhcCCceeeccHH-HHHHHhccCCCCCCHHHHHHHHHHHHHHcCeEEEEEecCCCcccCHHHHHHhcCCcEEEEe Confidence 999999999999876433 33333322 112578899999999999999999888442 11 4889999 Q ss_pred cCCeEEEEEEcC--CC-eEEEEEEEEEEEe-eC------CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeec- Q lcl|NC_012753. 148 QATVFFPLQANT--QD-VSSAAIVTKSTKT-EG------QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPL- 216 (502) Q Consensus 148 ~~~~~~Pi~~d~--~~-~~~~~~~~~~~~~-~~------~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l- 216 (502) +|++++=...+. ++ ....+.+.+.... +. +....|++++.- .++.|++ ++|+.... |..... T Consensus 155 ~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~--~~g~~~~--~v~r~~~~---g~~~~~~ 227 (491) T protein:vir:95 155 TTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDID--TDGNYRQ--RLFRFDAE---GGAQEEV 227 (491) T ss_pred chhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeec--CCCceEE--EEEEEcCC---Ccceeee Confidence 999998743222 22 3333444443322 21 122344444331 2344443 45553211 111100 Q ss_pred cccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc Q lcl|NC_012753. 217 STLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT 296 (502) Q Consensus 217 ~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~ 296 (502) ..+..+ .--+.++.+||+++-.. ++ +..-|.+-|-++..+--+.=..-|++-+-+......+.+ +.. T Consensus 228 ~~~~~~----~g~~~l~~IPfv~~~~~--~~---~~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~----~~G 294 (491) T protein:vir:95 228 VEIYPD----LGESLRGVIPFTFIGAT--NN---DATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLF----IYP 294 (491) T ss_pred eeeeec----CCCcccCeeEEEEEecC--CC---CCCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceee----eec Confidence 000101 11245677888877532 11 111233333333222111001112222222222333332 211 Q ss_pred CCCCCCcccCc--cccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccc Q lcl|NC_012753. 297 EYDTNGEKVTV--KREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK 374 (502) Q Consensus 297 ~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~ 374 (502) ..+........ ...+-.+.+. .+..+. ++..++-.+++.--. .+.|+.+-.++.. . +...+. .++.. T Consensus 295 ~d~~~~~~~~~~~~~~i~~g~~~--~~~lP~-~~~~~~ie~~~~~~~---~~~l~~~e~qm~~-~--Ga~l~~--~~~~~ 363 (491) T protein:vir:95 295 GDNLTPQSFKEANPNGIKFGSRC--GHNLGY-GGSAQLIQAGENNLA---RQNMLDKEQQAIQ-I--GAQLIT--PSQQI 363 (491) T ss_pred CcccCcchhhccCcceeEecCcC--CcCCCC-CCccceeecCcchHH---HHHHHHHHHHHHH-H--HHHhcc--CCcch Confidence 00000000000 0000001110 011111 222233333332111 2223333222211 1 122222 23468 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCCHHHHHHHHHHHHh Q lcl|NC_012753. 375 TATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNAEFDYWSKMVA 452 (502) Q Consensus 375 tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~~~~~~~~~~~ 452 (502) ||++.....+........+...++.+|.++++.+..+... ..... ..++-+|... +.+ .+.++.+.++.. T Consensus 364 Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~------~~~~~v~i~~n~dF~~~-~~~-~~~~~all~~~~ 435 (491) T protein:vir:95 364 TAESARIQRGADTSVMATIARNVSQAYTDALRWVAMMLGK------PEDSEVEFQLNMDFFLQ-PMT-AQDRAAWMADIN 435 (491) T ss_pred hHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCC------CCCCceEEEeecccccc-cCC-HHHHHHHHHHHh Confidence 9999888888888888888888999999988877766321 11111 1234455543 233 446888889999 Q ss_pred cCCCCHHHHHHhcC--CCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 453 AGFAPKTMAIEKTL--NVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 453 ~Gi~S~et~l~~~~--~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|.||.+|++..+- ++.+...+++.++|++|..+. +.-+-+-|| T Consensus 436 ~G~is~~t~~~~L~~~~vl~~~~e~~~~~ie~~~~~~------~~~~~~~~~ 481 (491) T protein:vir:95 436 AGLLPATAYYAALRKAGVTDWTDEDILNAIEDAPLPS------GAVTQVAGE 481 (491) T ss_pred cCCCCHHHHHHHHHhCCCCCccHHHHHHHHHhcCCCC------Ccccccccc Confidence 99999999887543 555555677888887776332 222334444 No 82 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=99.59 E-value=6.4e-13 Score=87.44 Aligned_cols=435 Identities=10% Similarity=0.072 Sum_probs=199.3 Q ss_pred cccccchhhhhccccc-cCCHHHHHHHHHHH---HHhcCCCC------ccccccCC-Cc--ccccc--ceecchHHHHHH Q lcl|NC_012753. 17 VITNQSLNSITDHPKI-AISPEEYNRIMDNL---RYFAGDFD------SVTYRDSN-GS--QVKRD--FNHLPIGRTASK 81 (502) Q Consensus 17 ~~~~~~l~~i~~~~~~-~~~~~~~~~i~~~~---~~Y~g~~~------~~~~~~~~-~~--~~~~~--~~~~n~~k~iv~ 81 (502) +.+..- ..-.+ ...+++......|+ +-|.|..- -+.+.... .. ...+. -.-.|+++.+++ T Consensus 1 ~~~~~~-----~~~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n~~~~tl~ 75 (489) T protein:vir:78 1 MLTENG-----QGSGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYNFTRRTLS 75 (489) T ss_pred CccCCC-----ccCCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCChHHHHHH Confidence 111111 11122 12344444444444 45777421 01111000 00 11110 112489999999 Q ss_pred HHhhhhhcCcceEeeCCHHHHHHHHHH-HhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-C------------ceEEEEE Q lcl|NC_012753. 82 KVASLVFNEQATIRVDNEVADAFINET-LKNDKFSKNFERYLESCLALGGLAMRPYIDG-D------------QIRVSFV 147 (502) Q Consensus 82 ~~a~~l~~ep~~i~~~d~~~~e~l~~~-~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~------------~~~i~~v 147 (502) .+++++|.+||++++.+. ...+++++ ..-++++..++.++..++.+|.+++.|=+.. + +|-+..+ T Consensus 76 ~l~G~vfrk~p~~~~p~~-l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~ 154 (489) T protein:vir:78 76 GMVGSVMRKEPEINIPKE-LEYLLKNADGSGVGLIQHAQDTLMEIDSVGRGGLLVDAPETGAATAAEQNAGLLNPTIAFY 154 (489) T ss_pred HHhchhhcCCcceeccHH-HHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEe Confidence 999999999999876433 33333322 1125788999999999999999998885421 1 5889999 Q ss_pred cCCeEEEEEEcC--CC-eEEEEEEEEEEEe-eCCC------ceEEEEEEEEEEeCCeEEEEEEEEecCCcc-ccCceeec Q lcl|NC_012753. 148 QATVFFPLQANT--QD-VSSAAIVTKSTKT-EGQK------VKYYSLIEFHEWNKETYTISNELYESESKT-IIGQRVPL 216 (502) Q Consensus 148 ~~~~~~Pi~~d~--~~-~~~~~~~~~~~~~-~~~~------~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~-~lG~~v~l 216 (502) +|++++=...+. |+ ....+.+++.... +... ...|++++.- .++.|+. ++|+..... ..+..++ T Consensus 155 ~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~--~~g~~~~--~~~r~~~~g~~~~~~~~- 229 (489) T protein:vir:78 155 TTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDID--SDGNYRQ--RLFRFDAEGGAQEDVVE- 229 (489) T ss_pred chhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecC--CCcceEE--EEEEeecCCcccceeeE- Confidence 999998743322 22 2233334443322 2111 1223333211 1233333 345432221 1111111 Q ss_pred cccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhcc Q lcl|NC_012753. 217 STLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKT 296 (502) Q Consensus 217 ~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~ 296 (502) +..+ ..-+.++.+||+++-... |. ..-|.+-|-++..|--+.=..-|++-+-+......+.+ +.. T Consensus 230 --~~~~----~g~~~l~~IPfv~~~~~~-~~----~~~~~pPLl~LA~lni~Hy~~ssd~~~~l~~~~~P~l~----i~G 294 (489) T protein:vir:78 230 --IYPD----LGESLRGVIPFTFIGATN-ND----ATIDDAPLLPLAELNIGHYRNSADNEESSFVVGQPTLF----IYP 294 (489) T ss_pred --Eecc----CCCCccCeeeEEEEecCC-CC----CCCCcCchHHHHHHHHHHhhhhhHHHHHHHHcccceee----eec Confidence 1111 112456788888775321 11 11123333333322111111112222223333333332 211 Q ss_pred CCCCCCcccCc--cccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccc Q lcl|NC_012753. 297 EYDTNGEKVTV--KREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK 374 (502) Q Consensus 297 ~~~~~g~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~ 374 (502) ..+........ ...+-.+.+.. +..+. ++..++-.+++.- -. .+.|+.+.+++. .. +...+. .++.. T Consensus 295 ~d~~~~~~~~~~~~~~i~~g~~~~--~~lp~-~~~~~~ie~~~~~--~~-r~~l~~le~qm~-~l--Ga~l~~--~~~~~ 363 (489) T protein:vir:78 295 GENLTPQAFKEANPNGIKFGSRRG--HNLGY-GGSAQLIQAGENN--LA-RQNMLDKEQQAI-QI--GAQLIT--PTQQI 363 (489) T ss_pred CccCCcccccccCccceeeCCccc--ccCCC-CCCcceeccCcch--HH-HHHHHHHHHHHH-HH--hhhhcc--CCcch Confidence 11100000000 00000011110 11111 1122233333321 11 233333333322 11 222232 23468 Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcC Q lcl|NC_012753. 375 TATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAG 454 (502) Q Consensus 375 tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~G 454 (502) ||++.+...+........+...++.+|.++++.+..+... .. +......++-+|... +.+ .+.++.+.++..+| T Consensus 364 Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w~G~---~~-~~~~~i~~n~dF~~~-~~d-~~~~~al~~~~~~G 437 (489) T protein:vir:78 364 TAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVMLGK---PE-DTEVEFRLNMDFFLE-PMT-AQDRAAWMADINAG 437 (489) T ss_pred hHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHcCC---CC-CCceEEEeecccCcc-cCC-HHHHHHHHHHHhcC Confidence 9999888888878888888888999999988877765321 10 001112234456543 223 44678888999999 Q ss_pred CCCHHHHHHhcC--CCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 455 FAPKTMAIEKTL--NVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 455 i~S~et~l~~~~--~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .||.+|++..+- ++-+.+.+++..||.++.. +.++.+-|+ T Consensus 438 ~is~~t~~~~L~~~gv~d~~~e~~~~ei~~~~~--------~~~~~~~g~ 479 (489) T protein:vir:78 438 LLPATAYYAALRKAGVTDWTDADIKDAVADQPL--------PVATEVQGE 479 (489) T ss_pred CCCHHHHHHHHHhCCCCCccHHHHHHHHhhcCC--------CcccCCccc Confidence 999999887542 4433333444456655422 233444555 No 83 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=99.50 E-value=4.5e-12 Score=82.81 Aligned_cols=457 Identities=12% Similarity=0.114 Sum_probs=194.8 Q ss_pred CChhH--------------HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccc Q lcl|NC_012753. 1 MGIIQ--------------TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQV 66 (502) Q Consensus 1 m~~~~--------------~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~ 66 (502) |+.=. +|-.++++.+...-... ..+..+|.= .-+.+..=.+...||.|.... ........ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r-~~~~~~w~~--~~~~~~~~~~~~~y~~~~~~~---~~~~~~~~ 74 (651) T protein:vir:80 1 MKLATTTTDKNRQTYDETHDVSSYVKKEYKRFCDAR-QVCEETWLE--AWGMYLSTPEAQDYLRDQVLR---SVGDVNAD 74 (651) T ss_pred CcccccccchhhhhhhhhHHHHHHHHHHHHHHHHHh-hhhhhhHHH--HHHhhcccHHHHHhhcccccc---ccCCCCCC Confidence 11100 11111111111000000 000000000 000000002334556553211 11112222 Q ss_pred cccceecchHHHHHHHHhhhhh----cCcceEee---CCHH----HHHHHHHHH----hhccHHHHHHHHHHHHhhcCCE Q lcl|NC_012753. 67 KRDFNHLPIGRTASKKVASLVF----NEQATIRV---DNEV----ADAFINETL----KNDKFSKNFERYLESCLALGGL 131 (502) Q Consensus 67 ~~~~~~~n~~k~iv~~~a~~l~----~ep~~i~~---~d~~----~~e~l~~~~----~~~~f~~~~~~~~~~~~~~G~~ 131 (502) .++++..|.....|+.+...|+ +.+.-+.+ +++. ..+.++.++ .+.+|...+..++.+++..|.+ T Consensus 75 ~rs~~~~~~v~~~ve~~~~~l~~~~~~~~~~~~~~p~~~~d~a~~~~~~~~~~~~~~l~~~~~~~~~~~~~~d~l~~G~~ 154 (651) T protein:vir:80 75 WRHKITTGKAFEAIETIHAYLMSATFPNKNWFDVVPAKPGQDNLLVSRLIKRYVQDKLTEGKFRAAYANFLRQLLITGNS 154 (651) T ss_pred CCccccChhHHHHHHHHHHHHHHhhcCCCceeEeccCCchhHHHHHHHHHHHHHHHHhhccCcHHHHHHHHHhhcccCce Confidence 3455666777666665554444 33333443 2222 334455554 3568999999999999999999 Q ss_pred EEEEEEeC--------------------------------CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCC-- Q lcl|NC_012753. 132 AMRPYIDG--------------------------------DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQ-- 177 (502) Q Consensus 132 ~~~~~~d~--------------------------------~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~-- 177 (502) ++++|||. +.++|+.|+|..+++ .....++..+.|+.+.+....+ T Consensus 155 i~kv~we~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~i~~v~p~~~~~-dp~a~~~~d~~~v~~~~~t~~~l~ 233 (651) T protein:vir:80 155 VLALPWRVETAEVKKKVQVRTPLFEDEPTFEVVSEEREVKSSPDFEVLDMFDCFY-DPNVTDPNRGAFIRKLTKTKADIL 233 (651) T ss_pred EEEEeecceeeeeehheeccccccccccceeeeccceeeeceeEEEEecHHHeee-cCCCcCccccceeeeeeeeHHHHH Confidence 99999962 357899999999986 2333455556665444332111 Q ss_pred ----CceEE----------------------------------------EEEEEEE---EeCCeEEEEEEEEecCCcccc Q lcl|NC_012753. 178 ----KVKYY----------------------------------------SLIEFHE---WNKETYTISNELYESESKTII 210 (502) Q Consensus 178 ----~~~~y----------------------------------------t~~E~h~---~~~~~~~I~~~l~~~~~~~~l 210 (502) .+.++ .++|+|. .++..+...+-.+. T Consensus 234 ~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~v~v~E~~~~~d~e~~~~~~~~v~~~------- 306 (651) T protein:vir:80 234 NLLSEGYYYGVDPLDVVEHKCKDTSDTKQDMLSTFQGVTTSLWSPHQNVELLEYWGDIHLENKTYHDVVVTIM------- 306 (651) T ss_pred HHHhcccccchhhHHHHhhhccccccCCccccccccCCCccccccccceEEEEEEEEeeccCCceEEEEEEEc------- Confidence 01100 0111110 00000000000000 Q ss_pred CceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-hccceeee Q lcl|NC_012753. 211 GQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-MGQRRVAV 289 (502) Q Consensus 211 G~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~~~~~i~v 289 (502) |..|- ...+... +...||+.++. ....++.||+|..+.+.+.+..+|.....+.+... .....+.| T Consensus 307 g~~il------~~~~~~~---~~~~Pf~~~~~----~~~~~~~yG~g~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v 373 (651) T protein:vir:80 307 GNEVL------RFEQNPY---WCGRPFVIGTY----IPTARQPYAMGALQPNLGMLHELNIITNQRLDNLELAIDQMYTL 373 (651) T ss_pred CcEEe------cccccCC---CCCCCeeeecc----eecCccccCCChHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEe Confidence 00000 0000000 11235655543 22457889999999999999999999999988775 45555566 Q ss_pred chHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeecccc-chHHHHHHHHHHHHHHHHhcCCChhhccc Q lcl|NC_012753. 290 PTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDI-RSDDYIKAINKGLSLFEMQLGVSTGMFSF 368 (502) Q Consensus 290 ~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i-r~e~~~~~l~~~l~~i~~~~g~s~~~~~~ 368 (502) +.+.+.+..+ ....++..+ . ....+ .+..+++.- ........++.+...+....|++.-..|. T Consensus 374 ~~d~~~~~~~---l~~~pg~vi-------~---~~~~~---~~~~l~~~~~~~~~~~~~l~~l~~~~~~~~gv~~~~~g~ 437 (651) T protein:vir:80 374 RSDGLLQPED---VYTEPGKVF-------L---VSDHG---DLQPLANQSSNFSITYQESSFLESTIDKNFGTGNYVGAN 437 (651) T ss_pred cCCccccHHH---hhcCCCceE-------E---ecCCC---CceeeccCcccchhHHHHHHHHHHHHHHHhcCChHHhCC Confidence 5443221100 111121111 0 11111 233333321 11223345666677778888888776665 Q ss_pred cc--cccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhc-------ccCCCcc-------cccceEEE Q lcl|NC_012753. 369 DG--KSMKTATEVVSEQSDTYQMRNSIATLVEK-SLKELVISILELAKVYN-------LYTGEIP-------TMDEVSVD 431 (502) Q Consensus 369 ~~--~~~~tAtei~~~~~~l~~~~~~~~~~~~~-~l~~l~~~il~~~~~~~-------~~~~~~~-------~~~~i~v~ 431 (502) +. .+..||+||....+.+......+-+.|.. .+..|++.++++..-+. +.+.... ...++++. T Consensus 438 ~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~ 517 (651) T protein:vir:80 438 AARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLVQQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKE 517 (651) T ss_pred CccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccceeecccccccccccccCccceeee Confidence 43 34579999988888877777777777765 67777777777654321 1111100 00123333 Q ss_pred eCCCccCC------HHHHHHHHHHHHhc-CCCC---H----HHH---HHhcCCCCHHH---------H-----HHHHHHH Q lcl|NC_012753. 432 LDDGVFTD------RNAEFDYWSKMVAA-GFAP---K----TMA---IEKTLNVTKEQ---------A-----QEIYQKI 480 (502) Q Consensus 432 f~d~i~~d------~~~~~~~~~~~~~~-Gi~S---~----et~---l~~~~~~~dee---------a-----~~el~ri 480 (502) ++- ++.. ....++.+.++.+. |-.+ . ... +.+.-|+.+.+ + ++.+... T Consensus 518 ~~i-v~~g~~~~~~r~~~~~~l~~~~q~~~~~p~~~~~~~~~~~~~~l~~~~g~~~~~~~l~~~~q~~~~~~~~~~~~q~ 596 (651) T protein:vir:80 518 VRL-VPIGSDHVIERKQYIEDRLTFIQAVAQVPEMGQLVDYKRILVDLLQHWGFEEPEAYLKQQDQQAPANPQEALLSQA 596 (651) T ss_pred eee-eeccHHHHHHHHHHHHHHHHHHHhhccCCccchhhhHHHHHHHHHHHcCCCCcHHhcCCCccchhhhhhHHHHhhH Confidence 221 1111 11223333333221 1111 1 000 11222321100 0 0000000 Q ss_pred HHhhhcccCCCCCccccC-CCCC Q lcl|NC_012753. 481 NDETMVSTDSFRTSEEVD-IYGE 502 (502) Q Consensus 481 ~~E~~~~~~~~~~~~~~~-~~g~ 502 (502) +...++.+... ...... .-|. T Consensus 597 ~~~~~~a~~~~-~~~~~~~~~~~ 618 (651) T protein:vir:80 597 KDVGGQAMSNM-LQNQLQADGGT 618 (651) T ss_pred HHHHHHHHHHH-HHHHHHHHHHH Confidence 00000000000 000000 0000 No 84 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=99.49 E-value=2.2e-13 Score=89.99 Aligned_cols=427 Identities=11% Similarity=0.060 Sum_probs=201.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCC-CCccccccC------CCccccccceec Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGD-FDSVTYRDS------NGSQVKRDFNHL 73 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~-~~~~~~~~~------~~~~~~~~~~~~ 73 (502) |.=+ +.....++ .-...-+..+. ..+=.|. .+...+... .-.....-+.+. T Consensus 1 ~~~~--------------~~a~~~~~------~~~a~~~~~~~--~~~g~~~~~d~~~~~~~~~~~~~~~~~l~~lY~~~ 58 (461) T protein:vir:80 1 MYSI--------------DKAKQAKI------DSKIVNRNDFM--VGHGKANSRDKLTRQTPGNGQKLDLKACENLYASN 58 (461) T ss_pred Cccc--------------hhhhhhhh------hhhhhhhhHHH--hhcCCcchhhhhhccccCcccccCHHHHHHHHHhC Confidence 2111 11111000 00000000000 0000011 111110000 000000122345 Q ss_pred chHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEE Q lcl|NC_012753. 74 PIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFF 153 (502) Q Consensus 74 n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~ 153 (502) .+++.+|+..|..++.+++.|++++++..+.+++.++.-++...+.+++.++..+|++++.+-..+++.+ .+...- T Consensus 59 ~l~r~iVd~~a~d~~r~g~~i~~~~~~~~~~~~~~~~~l~~~~~l~~~~~~~rl~G~a~i~i~v~d~~~~----~~~~~~ 134 (461) T protein:vir:80 59 SIAMNIVDIISEDMVRAGWSLKTDNKEMKKNIESKWRKLKTKDRFQKLYADKRLYGDGFLSIGVVSSNRE----QADLST 134 (461) T ss_pred CccchhhccchHHhhcCCeeeecCCHHHHHHHHHHHHHhhHHHHHHHHHHhhcccccEEEEEEeecCCcc----ccCccC Confidence 7889999999999999999999999888888888888888999999999999999999888866443321 122222 Q ss_pred EEEEcCC-CeEEEEEEEEEE-----EeeCCCceEEEEEEEEEEeCCeEEEEEEEEe--cCCccccCceeeccccccCCCc Q lcl|NC_012753. 154 PLQANTQ-DVSSAAIVTKST-----KTEGQKVKYYSLIEFHEWNKETYTISNELYE--SESKTIIGQRVPLSTLYEDLEE 225 (502) Q Consensus 154 Pi~~d~~-~~~~~~~~~~~~-----~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~--~~~~~~lG~~v~l~~~~~~l~~ 225 (502) ||..... ++.....+++.. ...+-....|-.-++ |+|...-.. .......|. . T Consensus 135 pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~-------y~i~~~~~~~~~~~~~~~~~-----------~- 195 (461) T protein:vir:80 135 AIDPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHFGEVEF-------FEVNRVSQLGEEILSGTTAS-----------T- 195 (461) T ss_pred CcccccccceeEEEeccccccchhhhcccCcCcccccceE-------EEEeccccccccccccccCc-----------c- Confidence 3322111 111111111110 000000000101111 111100000 000000010 0 Q ss_pred ceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCccc Q lcl|NC_012753. 226 TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKV 305 (502) Q Consensus 226 ~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~ 305 (502) ...+|- .| ++.|.+. ...+..+|+|++..+.+.+.+++++.-....=+...+..++-.. .+....+..... T Consensus 196 ~~~iH~-SR--ii~~~~~----~~~~~~~G~S~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~-~l~~~~~~~~~~- 266 (461) T protein:vir:80 196 SEQIHR-SR--IIHEQGL----RFEGETKGRSIFESLYDIITVMDTSLWSVGQILYDFAFKVYKTD-DIDALNKDDKAN- 266 (461) T ss_pred ceEEcc-cc--EEEecCC----CCCccccCcchHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecc-hHHhhhchHHHH- Confidence 011111 11 2233221 22234679999999999999999887655543322232332111 111111100000 Q ss_pred CccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-ccccccccccHHHHHHHHH Q lcl|NC_012753. 306 TVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~~tAtei~~~~~ 384 (502) ....+....+. ..+...+.+ .-++.++.+ .......++.+.+.|+..+++|... ||...++.+|+.+=. + T Consensus 267 -~~~~~~~~~~~-~g~~~~d~~--e~~e~~~~~--lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~~asge~D~---~ 337 (461) T protein:vir:80 267 -LTAMLDFMFRT-EALAIIKGD--EQLTKESTN--VSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGTLTGAQYDV---M 337 (461) T ss_pred -HHHHHHHhcCC-ceEEEEcCC--cceEEEecC--cCCHHHHHHHHHHHHhhhhcCCeeeeecccCCccccchHHH---H Confidence 00011110000 011111111 225555544 3456778888899999999999875 566666676766422 2 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHH-------HHHHHhcCCC Q lcl|NC_012753. 385 DTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDY-------WSKMVAAGFA 456 (502) Q Consensus 385 ~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~-------~~~~~~~Gi~ 456 (502) ..+..+..+| ..++..|++|+++|+.-. ........+...+++|.|+.-...++.+.++. ..+++.+|++ T Consensus 338 ~yyd~i~~~qe~~l~p~le~l~~~i~~s~--~~~~~~~~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~i 415 (461) T protein:vir:80 338 NYYARVSSIQENRLRPQLEYLTRLLMWAS--DDCGPSIDPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVL 415 (461) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHh--cccccccCccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCC Confidence 2345555555 457889999988877532 12122223334689999999888888777654 6677788999 Q ss_pred CHHHHHHhc---CCCCHH------HH-HHHHHHHHHhhhcccCCCCCccc Q lcl|NC_012753. 457 PKTMAIEKT---LNVTKE------QA-QEIYQKINDETMVSTDSFRTSEE 496 (502) Q Consensus 457 S~et~l~~~---~~~~de------ea-~~el~ri~~E~~~~~~~~~~~~~ 496 (502) |.+++...+ .+.++. .+ .+++++...+.. ....+.+ T Consensus 416 s~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~e~~~g 461 (461) T protein:vir:80 416 DPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYDAY----AKKNADG 461 (461) T ss_pred CHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccccc----cccCCCC Confidence 998875432 222211 11 122232222111 1111111 No 85 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=99.48 E-value=7.4e-12 Score=81.60 Aligned_cols=430 Identities=10% Similarity=0.014 Sum_probs=206.7 Q ss_pred CChhHHHHHHHH--HHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccc----ccCCC----------- Q lcl|NC_012753. 1 MGIIQTIKNFIK--RSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTY----RDSNG----------- 63 (502) Q Consensus 1 m~~~~~ik~~i~--~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~----~~~~~----------- 63 (502) |++++++-.++- +...+ .+.+...+-|.+-...... ..... T Consensus 1 mn~~dr~i~~~sP~~~~~R----------------------~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~~~~~~~~l 58 (502) T protein:vir:79 1 MAILDDVIGVFSPGWKAAR----------------------LRSRAVIQAYEAVKTTRTHKARRENRTADQLSQYGAVSL 58 (502) T ss_pred CchHhhHHhhcChHHHHHH----------------------HhhHHHHhhccccCcccccCCCCCCCChHHHHHHHHHHH Confidence 999999988862 11000 0111122234442211100 00000 Q ss_pred -ccccccceecchHHHHHHHHhhhhhcC-cceE----eeCC----HHHHHHHHHHHh----------hccHHHHHHHHHH Q lcl|NC_012753. 64 -SQVKRDFNHLPIGRTASKKVASLVFNE-QATI----RVDN----EVADAFINETLK----------NDKFSKNFERYLE 123 (502) Q Consensus 64 -~~~~~~~~~~n~~k~iv~~~a~~l~~e-p~~i----~~~d----~~~~e~l~~~~~----------~~~f~~~~~~~~~ 123 (502) .+..+-....++++-.++.+++.++|. .+++ ...+ ++.++.+++.|+ ..+|......++. T Consensus 59 r~RaRdl~rNn~~a~~av~~~~~nvVG~ggi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~D~~g~~~f~~~q~l~~r 138 (502) T protein:vir:79 59 REQARYLDNNHDLVIGVFDKLEERVVGKNGIIVEPHPVLRNGAIARDLAAEIRTRWSEWSVSPEVTGQFTRPMLERLMLR 138 (502) T ss_pred HHHHHHHHhcChHHHHHHHHHHHhhccCCceeeeeccCCCChhHHHHHHHHHHHHHHHhhcCcCccccCCHHHHHHHHHH Confidence 000111223468899999999999996 3333 2233 334444444443 2367777777888 Q ss_pred HHhhcCCEEEEEEEeCC---------ceEEEEEcCCeEEEEEEcCCC-eEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC Q lcl|NC_012753. 124 SCLALGGLAMRPYIDGD---------QIRVSFVQATVFFPLQANTQD-VSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE 193 (502) Q Consensus 124 ~~~~~G~~~~~~~~d~~---------~~~i~~v~~~~~~Pi~~d~~~-~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~ 193 (502) ..+.-|.+++++.+++. .+++..++|+.+ |...+.++ +...+ |+ +.. T Consensus 139 ~~~~dGE~f~~~~~~~~~~~~~g~~~~l~lq~iepd~l-~~~~~~~~~i~~GV-------------------e~---d~~ 195 (502) T protein:vir:79 139 TWLRDGEVFAQMVSGRINSLTPSAGVHFWLEALEPDFI-PMTSDESNRLNQGV-------------------FV---DDW 195 (502) T ss_pred HHHhCCceEEEEeecccCccCCCcccceEEEEecchhc-CCCCCCCCeeEeee-------------------EE---CCC Confidence 88899999999888542 258999999986 43233322 11111 11 001 Q ss_pred eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHH Q lcl|NC_012753. 194 TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTY 273 (502) Q Consensus 194 ~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~ 273 (502) +-.+-|.+++...++..+. . +.. ++..-++.+ ......+...|+|.|+.++..+..|+... T Consensus 196 Gr~~aY~i~~~hPgd~~~~--~----~~r---------vpA~~vlH~----f~~~r~gQ~RGis~lapvl~~l~~l~~~~ 256 (502) T protein:vir:79 196 GRPEKYLVYKSRPVSGRQM--E----TKE---------VDAERMLHL----KFVRRLHQMRGTSLLSGVLIRLSALKEYE 256 (502) T ss_pred CceEEEEEeecCCCCCccc--c----eeE---------echhheEEe----ecccCCccccCCchHHHHHHHHHHHhHHH Confidence 1112222232211110000 0 000 111111222 12234566789999999999999998654 Q ss_pred HHHHHHH-hhccceeeechHHhccCCCC-CCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHH Q lcl|NC_012753. 274 DEFMWEV-KMGQRRVAVPTQMIKTEYDT-NGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKG 351 (502) Q Consensus 274 S~~~~~~-~~~~~~i~v~~~~l~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~ 351 (502) +.-..-- -.+.-..|| +...+. ................+-+......-..+.-++.++|.-+..++..-+..+ T Consensus 257 dael~~a~i~A~~~~fi-----~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~~L~pGe~i~~~~p~~p~~~~~~f~~~~ 331 (502) T protein:vir:79 257 DSELTAARIAAALGMYI-----RKGDGQSYEPDGNGSKENERELTIQPGIIYDDLKPGEEIGMVKSDRPNPNLETFRNGQ 331 (502) T ss_pred HHHHHHHHHhhhheeee-----ecCCCcccccccCCCCCccccccccCCccccccCCCceeeeeCCCCCCCCHHHHHHHH Confidence 3322211 112222333 211111 110000000000000111110001011122377777877777888889999 Q ss_pred HHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccc--CCCcccccce Q lcl|NC_012753. 352 LSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKE-LVISILELAKVYNLY--TGEIPTMDEV 428 (502) Q Consensus 352 l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~-l~~~il~~~~~~~~~--~~~~~~~~~i 428 (502) ++.|....|+++..++.+-++ |=.++++.....-......+..|...+.+ +.+..+..+-+-+.. ++......-. T Consensus 332 lr~iaaglGi~ye~lt~D~s~--nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~ 409 (502) T protein:vir:79 332 LRAVAAGSRLSFSSTARNYNG--TYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAVASGVIRLPRDLDRSSLY 409 (502) T ss_pred HHHHHhhcCCCHHHHhccccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCCCCchhhc Confidence 999999999999998877543 23333433333334444444444433333 333333333222211 1111112223 Q ss_pred EEEe--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHH---hhhcccCCCC--CccccCCC- Q lcl|NC_012753. 429 SVDL--DDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKIND---ETMVSTDSFR--TSEEVDIY- 500 (502) Q Consensus 429 ~v~f--~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~---E~~~~~~~~~--~~~~~~~~- 500 (502) .+.| +.-.-+|+.++++....++.+|+.|.+..+++. |.+-+++.+++++-++ +.....+..+ .+..+..- T Consensus 410 ~~~W~~p~~~~iDP~Ke~~a~~~~i~~Gl~t~~~~~a~~-G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~ 488 (502) T protein:vir:79 410 TAVYSGPVMPWIDPVKEAEAWKIQIRGGAATESDWVRAG-GRNPDDVKRRRKAEIDENRKLDLVFDTDPASDKGGSSAAT 488 (502) T ss_pred ceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCCC Confidence 4555 445668999999999999999999999988775 7766655444333221 1111111100 00011111 Q ss_pred ---------CC Q lcl|NC_012753. 501 ---------GE 502 (502) Q Consensus 501 ---------g~ 502 (502) ++ T Consensus 489 ~~~e~~~~~~~ 499 (502) T protein:vir:79 489 KRQEPQHTDDQ 499 (502) T ss_pred CCCCCCCCCCC Confidence 11 No 86 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=99.46 E-value=1.1e-11 Score=80.67 Aligned_cols=480 Identities=10% Similarity=0.044 Sum_probs=221.0 Q ss_pred CChh---HHHHHH-HHHHh-hcccccchhhhhcc----c--cccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccccc Q lcl|NC_012753. 1 MGII---QTIKNF-IKRSN-YVITNQSLNSITDH----P--KIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRD 69 (502) Q Consensus 1 m~~~---~~ik~~-i~~~~-~~~~~~~l~~i~~~----~--~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~ 69 (502) |.=. ++|-.+ +||.- +..+......+..+ . .....++......+..+||.|+++.-.... .-....+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p 79 (711) T protein:vir:10 1 MAKKQKKSRVEQLYAKKAKVYAKNNDDDRALLATARERARDGATYWKDNWEAAEDDLKFLGGEQWPSQVRT-ERELEQRP 79 (711) T ss_pred CCcccccccccchhHHHHHhcccCcchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCCCCCHHHHH-HHHhcCCC Confidence 1100 011000 11110 00111111222111 0 112234455566788899999876422110 11123345 Q ss_pred ceecchHHHHHHHHhhhhhcCcceEeeC---------------------------CHHHHHHHHH----HHhhccHHHHH Q lcl|NC_012753. 70 FNHLPIGRTASKKVASLVFNEQATIRVD---------------------------NEVADAFINE----TLKNDKFSKNF 118 (502) Q Consensus 70 ~~~~n~~k~iv~~~a~~l~~ep~~i~~~---------------------------d~~~~e~l~~----~~~~~~f~~~~ 118 (502) .++.|.-+-+|+...++--...+.+.+. |....+.|+. +.+.++..... T Consensus 80 ~~~~N~i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~ 159 (711) T protein:vir:10 80 CLVNNVLPTFVDQVLGDQRQNRPAIKVSSTEVTRVPDAESGEDTTLKISNVAGKNDYELAEVFTGLIKNIEYNCDAETEY 159 (711) T ss_pred cEEEcchHHHHHHHhhhHhhCCcceEEecccccchhhhhhhhccccccccCCChhHHHHHHHHHHHHHHHHHhcChhHHH Confidence 7888999999999999999888877553 2345555555 44567888889 Q ss_pred HHHHHHHhhcCCEEEEEEEe-------CCceEEEEE-cCCeEEE--EEEcCCCeEEEEEEEEEEEeeC------------ Q lcl|NC_012753. 119 ERYLESCLALGGLAMRPYID-------GDQIRVSFV-QATVFFP--LQANTQDVSSAAIVTKSTKTEG------------ 176 (502) Q Consensus 119 ~~~~~~~~~~G~~~~~~~~d-------~~~~~i~~v-~~~~~~P--i~~d~~~~~~~~~~~~~~~~~~------------ 176 (502) ..+...++..|.+|+.+++| ++.++|..+ +|.++++ -.. .-+...+-|+.+..+.+. T Consensus 160 s~af~d~~~~G~G~~ev~~d~~~~d~~~~e~~i~~v~~p~~v~~Dp~a~-~~D~sDar~~~~~~~~~~~~~~~~yp~~a~ 238 (711) T protein:vir:10 160 DIAFQGAVESGMGYLRVRSDYLADDSFEQDLIIEAIQNQFSVTIDPDAK-KRDRSDMNWCLIDDTMSKEKFKALYPDATA 238 (711) T ss_pred HHHHHHhhhcCcceEEEEecccCCCCCCCCeEEeeecChhheeeCcccc-ccChhhhcceeeeecCCHHHHHHhCCchhh Confidence 99999999999999999875 256888888 5887653 111 112333444333322110 Q ss_pred ---------CC-----ceEEEEEEEEEEeCCeEEEEEEEEecCCcccc---------------Ccee------ecccc-c Q lcl|NC_012753. 177 ---------QK-----VKYYSLIEFHEWNKETYTISNELYESESKTII---------------GQRV------PLSTL-Y 220 (502) Q Consensus 177 ---------~~-----~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~l---------------G~~v------~l~~~-~ 220 (502) +. ....++.|++......+ .++...++... |..+ .--.+ | T Consensus 239 ~~~~~~~~~~~~~~~~~~~vrv~E~~~r~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~v~~ 314 (711) T protein:vir:10 239 EPVYEDSVADYDTWFTEKSVRVSEYFTREPVIR----EIALLSDGRSFWLDALEDIVDELLEAGISIVRTRKVKTFKTYW 314 (711) T ss_pred hhhhcccccccCcccCcceeeEEEEEeeeeeee----EEEeecCCceeccCcchhHHHHHHhcCchhhhhhhhceeeEEE Confidence 00 01122334432111111 11111110000 0000 00000 0 Q ss_pred cCCCcceeec-----CCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHh Q lcl|NC_012753. 221 EDLEETVTLN-----GLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMI 294 (502) Q Consensus 221 ~~l~~~~~~~-----~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l 294 (502) .-+.....+. ..++.||++|--.. .....+..+.|++.++++.++.+|...|.+++-+-. .+.+++++.+.+ T Consensus 315 ~~~~G~~~L~~~~p~~~~~~P~vp~~g~r--~~~d~~~~~~G~vr~~~d~Qr~~N~~~s~~~~~l~~~~~~~~~~~~gai 392 (711) T protein:vir:10 315 RKITGANVLEGPVEIPSTTIPVIPVWGKS--LIIKKKEIFRSIIRHSKDAQRMANYWDSAATETVALAPKAPFIGSEGNV 392 (711) T ss_pred EEEecceeecCCCCCCCCcccEEEEeeee--eccccccccchhhhhhhhhHHHHHHHHHHHHHHHHhcCCCceeecCccc Confidence 0000000011 11345555442110 111233334557999999999999999999998754 556778777666 Q ss_pred ccCCC-CCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc Q lcl|NC_012753. 295 KTEYD-TNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM 373 (502) Q Consensus 295 ~~~~~-~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~ 373 (502) .+... .......++. +..++... .....++..++.--..++...++.....+...+|++...+|..++ . T Consensus 393 ~~~~~~~~e~~~~~~~--------vi~~~~~~-~~~~~~~~~~~~~~~~~~~~ll~~~~~~i~~~tGi~~~~~G~~~n-~ 462 (711) T protein:vir:10 393 EGREDEWEQANTKNFS--------LLTYIPQY-QGDPGPRRQPPAAVPAAELTLGQNSVEKIKSTMGMYDASLGAMGN-E 462 (711) T ss_pred CChHHHHHhccccCCC--------eeEecccc-cCcCCccccCCCCCCHHHHHHHHHHHHHHHHHhCCChHHcCCCcc-c Confidence 43111 0000011111 01111111 111234444333234567788998899999999999999998754 4 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-------cccCCC-cccc-------------------- Q lcl|NC_012753. 374 KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVY-------NLYTGE-IPTM-------------------- 425 (502) Q Consensus 374 ~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~-------~~~~~~-~~~~-------------------- 425 (502) .||.+|..+..........+...+..+++++.+.++.+..-+ .+.+.. ...+ T Consensus 463 ~Sg~ai~~~q~qg~~~l~~~~dn~~~~~~~~g~~ll~li~~~~~~er~~rI~ged~~~~~v~ln~~~~~~~~G~~~~~nD 542 (711) T protein:vir:10 463 TSGRAIIARQRQGDRGSFAFIDNLTKSIRRVGKILVEMIPHIYDTERVVRLKFPDETEDFVKLNEQIFDEESGEWVTIHD 542 (711) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEEecCCCCcceEEecccccccccccceeeec Confidence 588888888777666666666667667776666666554332 111110 0000 Q ss_pred -----cceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH-----HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCcc Q lcl|NC_012753. 426 -----DEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM-----AIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSE 495 (502) Q Consensus 426 -----~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et-----~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~ 495 (502) ++|.|+=..+.+.-..+..+.++++. +.++.-. .+.++.++.. +++..++++.-..+..+..+... T Consensus 543 i~~g~~Dv~i~~~p~~~s~r~~~~~~l~ql~--~~~p~~~~~~~~~il~~~d~p~--~~el~e~lr~~~~~~~~~~~~~~ 618 (711) T protein:vir:10 543 LNVQKYDVVVTTGPAFATQRIEAAEAMIQFA--QAVPSAAAVMADLIAQNMDWPG--ADVIAERLKKIVPPNVLSKDERE 618 (711) T ss_pred cceeeeEEEEeeccCchhHHHHHHHHHHHHH--hhcchhhhHHHHHHHHhcCCCC--HHHHHHHHHhhcCcccCcchhhh Confidence 01111111111122223333333332 2222211 1223333322 22233333332222211111100 Q ss_pred c-------------------cCCCCC Q lcl|NC_012753. 496 E-------------------VDIYGE 502 (502) Q Consensus 496 ~-------------------~~~~g~ 502 (502) . ...-++ T Consensus 619 ~~qq~~~e~qq~~~~~q~~~~~~q~~ 644 (711) T protein:vir:10 619 AIEEDMPEQTEPTPEQQVEMAKSQAD 644 (711) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 000000 No 87 >protein:vir:104437 Length: 714 # NCBI annotation: putative phage portal protein # Family: family:all:487 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794061;genbank:gi:116222006;genbank:GeneID:4397502 Probab=99.45 E-value=1.2e-11 Score=80.51 Aligned_cols=461 Identities=11% Similarity=0.054 Sum_probs=217.6 Q ss_pred CC---------hhH-HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccc Q lcl|NC_012753. 1 MG---------IIQ-TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDF 70 (502) Q Consensus 1 m~---------~~~-~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~ 70 (502) |+ .-+ ...++..+. +..+.. .+.-.++......+..+||.|.++.-.-.. .-....+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~--------l~~~~~--~~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p~ 69 (714) T protein:vir:10 1 MKNEINTTAMKNDHGSTPRFSQRQ--------LLSLCS--DIDSQPLWRDAANKACAYYDGDQLAPEVIQ-VLKDRGQPM 69 (714) T ss_pred CCcCcCcccCCCcchhhhhhhHHH--------HHHHHH--HHhhhHHHHHHHHHHHHhhcCCCCCHHHHH-HHHhcCCCc Confidence 10 000 000000000 000000 111123344667788899999876421110 011133567 Q ss_pred eecchHHHHHHHHhhhhhcCcceEeeC----CH---HHHHHHHH----HHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Q lcl|NC_012753. 71 NHLPIGRTASKKVASLVFNEQATIRVD----NE---VADAFINE----TLKNDKFSKNFERYLESCLALGGLAMRPYIDG 139 (502) Q Consensus 71 ~~~n~~k~iv~~~a~~l~~ep~~i~~~----d~---~~~e~l~~----~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~ 139 (502) ++.|.-+.+|+...++--...+.+.+. ++ +..+.|+. +.+.++.......++..+++.|-+|+.+++|. T Consensus 70 ~~~N~i~~~v~~v~g~~~~nr~~~~v~pr~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~d~ 149 (714) T protein:vir:10 70 TIHNLIAPTVDGVLGMEAKTRTDLIVMSDDPNDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNS 149 (714) T ss_pred EEeccHHHHHHHHHHHHHhCCcceEEecCCCChhhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcccceEEeeecc Confidence 889999999999999999988888773 12 24555544 45567888899999999999999999999973 Q ss_pred ----CceEEEEEcCCeEEEEE-EcCCCeEEEEEEEEEEEee--------------------------------------- Q lcl|NC_012753. 140 ----DQIRVSFVQATVFFPLQ-ANTQDVSSAAIVTKSTKTE--------------------------------------- 175 (502) Q Consensus 140 ----~~~~i~~v~~~~~~Pi~-~d~~~~~~~~~~~~~~~~~--------------------------------------- 175 (502) +.+++..|+|.+++.=. ....+...+-|+.+..+.+ T Consensus 150 d~~~~~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fp~~a~~i~~~~~~~~~~~~~~~~~~~~~~~~ 229 (714) T protein:vir:10 150 EPFGPEFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLM 229 (714) T ss_pred CCCCCCeEEEecChhheeeccccccCChhhhhhhhhhccCCHHHHHHhcCCchhhhhccchhhcCcccchhhhhhccccc Confidence 46899999999987411 0011223333332211110 Q ss_pred -----------------CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccc--------cC---CCc-- Q lcl|NC_012753. 176 -----------------GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLY--------ED---LEE-- 225 (502) Q Consensus 176 -----------------~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~--------~~---l~~-- 225 (502) +...+.++++|+|... + .....+...+ |..+.+...- .+ +.. T Consensus 230 ~~~~~~~~~~~~~~~~~~~~~~rV~v~E~w~k~---~-~~~~~~~~~~----g~~~~~d~~~~~~~~~~~~g~~~~~~~~ 301 (714) T protein:vir:10 230 SAWEEYQSWDRQQNEWLQRERRRVLLQVVYYRT---F-ERLPVIELSN----GRVVAFDKNNLMQAVAVASGRVQVKVGR 301 (714) T ss_pred ccchhhcccccccccccccCcceEEEEEEEEeE---E-EEEEeecCCC----CCeeeeCccCHHHHHHHHhccceecccc Confidence 0011224455543211 1 1111111111 2211111000 00 000 Q ss_pred -----ceeecC----------C--CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceee Q lcl|NC_012753. 226 -----TVTLNG----------L--TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVA 288 (502) Q Consensus 226 -----~~~~~~----------~--~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~ 288 (502) ..++.| + ++.||+++.. ......+.|+| .+.++++.++.+|...|...+-+ +.++++ T Consensus 302 ~~rv~~~~~~g~~~L~~~~~p~p~~~fp~vP~~g--~~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~ 375 (714) T protein:vir:10 302 VSRIREAWFVGPHFIVDRPCSAPQGMFPLVPFWG--YRKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVI 375 (714) T ss_pred eeeEEEEEEecchhhhcCCCCCCCCceeeEEecc--eeeeccCccce--ehhhhhhHHHHHHHHHHHHHHHH--hCCcee Confidence 001111 0 1223332211 11112233554 68899999999999999988855 344555 Q ss_pred echHHhccCCC-CCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcc Q lcl|NC_012753. 289 VPTQMIKTEYD-TNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFS 367 (502) Q Consensus 289 v~~~~l~~~~~-~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~ 367 (502) +.++.+..... ....--+++. .-.+.............++..++.--..++...++.....|...+|++...+| T Consensus 376 ~~~gav~~~d~~~~e~~~rp~~-----vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG 450 (714) T protein:vir:10 376 MDEDATQLSDNDLMEQLERPDG-----IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLG 450 (714) T ss_pred eccccccccHHHHHHhccCCCC-----eEEecccccccCCccccccccCCCCCcHHHHHHHHHHHHHHHHhhCCCHHHcC Confidence 54433321100 0000000000 00111010111111223554443222446788899999999999999999999 Q ss_pred ccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----hc---ccCCCcc--cccceEEEe------ Q lcl|NC_012753. 368 FDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKV----YN---LYTGEIP--TMDEVSVDL------ 432 (502) Q Consensus 368 ~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~----~~---~~~~~~~--~~~~i~v~f------ 432 (502) ..++ ..|+.+|..+..........+-..+..+.+.+.+.++.+..- .+ +.+.... ....+.+++ T Consensus 451 ~~~n-a~SGvAI~~r~~qg~~~l~~~~dnl~~~~~~~g~~ll~li~~~~~~~rv~RI~~e~~~~~~~~~~~~n~~~~~~~ 529 (714) T protein:vir:10 451 QDSG-ATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGE 529 (714) T ss_pred CCcc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcccceeEeeccccCCcc Confidence 7644 457888877766655555555566666666666655554322 11 2111000 001111111 Q ss_pred ----------------CCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhhhcccCC Q lcl|NC_012753. 433 ----------------DDGVFTDRNAEFDYWSKMVAAGFAPKTM------AIEKTLNVTKEQAQEIYQKINDETMVSTDS 490 (502) Q Consensus 433 ----------------~d~i~~d~~~~~~~~~~~~~~Gi~S~et------~l~~~~~~~deea~~el~ri~~E~~~~~~~ 490 (502) ..+.+.-.++.++.++++..+ +.+++ .+.++-++.- +++.+++|++-.+...+. T Consensus 530 ~~nDi~~~~~dv~i~~~p~~~s~r~~~~~~l~ql~~~--~~p~~~~~~~~~~le~~d~p~--~~ei~~~ir~~~~~~~~~ 605 (714) T protein:vir:10 530 LTNDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSP 605 (714) T ss_pred ccccceeeeEEEEEeeccCcHHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCcC--HHHHHHHHHHHcCCCCCc Confidence 111112234444555555432 22221 1234444432 445566666554432211 Q ss_pred CCCccccCCCCC Q lcl|NC_012753. 491 FRTSEEVDIYGE 502 (502) Q Consensus 491 ~~~~~~~~~~g~ 502 (502) . ..--| T Consensus 606 ~------~~~~e 611 (714) T protein:vir:10 606 D------EMTPE 611 (714) T ss_pred c------ccCcc Confidence 1 11112 No 88 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=99.44 E-value=1.4e-11 Score=80.10 Aligned_cols=436 Identities=8% Similarity=-0.005 Sum_probs=204.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCc-----ccc-ccCC----------Cc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDS-----VTY-RDSN----------GS 64 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~-----~~~-~~~~----------~~ 64 (502) |++|+++-.++--. ..+. -.+-+...+-|.+-... +.. ...+ .. T Consensus 1 Mn~iDr~i~~~sP~------~a~~--------------R~~ar~~~~~y~aa~~~r~~~~~~~~~s~~~~i~~~~~~lr~ 60 (548) T protein:vir:95 1 MNLIDRLLEPLAPE------LVAR--------------RLAAREAIQAYEAARPGRTHKAKRQPLGADTSLQKSAVSMRE 60 (548) T ss_pred CchHHhHhhhcchH------HHHH--------------HHHhHHHhccccccCccccccccCCCCChHHHHHHHHHHHHH Confidence 99999888876210 0000 00011111223331110 000 0000 00 Q ss_pred cccccceecchHHHHHHHHhhhhhcC-cceEee----CCHH----HHH----HHHHHHhh------ccHHHHHHHHHHHH Q lcl|NC_012753. 65 QVKRDFNHLPIGRTASKKVASLVFNE-QATIRV----DNEV----ADA----FINETLKN------DKFSKNFERYLESC 125 (502) Q Consensus 65 ~~~~~~~~~n~~k~iv~~~a~~l~~e-p~~i~~----~d~~----~~e----~l~~~~~~------~~f~~~~~~~~~~~ 125 (502) +..+-..-.++++-+|+.+++.++|. ...+.. .|.. .++ .|++|.++ .+|......++... T Consensus 61 RaRdL~rNn~~a~~av~~~~~nvVG~~G~~i~p~~l~~d~~~a~~l~~~ie~~w~~Wa~~~D~~g~~~f~~lq~l~~R~~ 140 (548) T protein:vir:95 61 QCRKLDEDHDLVTGLLDRLEERVVGGSGIGVEPLPLRLDGSVHAELAMEIRSAWAEWSLSPETSGELTRPQVERLMCRTW 140 (548) T ss_pred HHHHHHhcChHHHHHHHHHHHhccCccccceeeeecCCCHHHHHHHHHHHHHHHHHhhcCccccccCCHHHHHHHHHHHH Confidence 01111122368899999999999984 333322 2322 233 33334322 35888888888889 Q ss_pred hhcCCEEEEEEEeCC---------ceEEEEEcCCeE-EEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeE Q lcl|NC_012753. 126 LALGGLAMRPYIDGD---------QIRVSFVQATVF-FPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETY 195 (502) Q Consensus 126 ~~~G~~~~~~~~d~~---------~~~i~~v~~~~~-~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~ 195 (502) +..|.++++..|+.. .+++..++|+.+ .|...+.+.+... +|+ +...- T Consensus 141 ~~dGE~f~~~~~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~i~~G-------------------IE~---D~~Gr 198 (548) T protein:vir:95 141 LRDGEGLAQKLMGRVPNYTFATSVPFALELLEPDYLPFSYNNLSKGIVQG-------------------IER---DTWRR 198 (548) T ss_pred HhCCceEEEeeecccccccCCcccceEEEEechhhcCCCCCCCCCceeee-------------------eEE---CCCCc Confidence 999999999988631 258999999986 2221111111111 121 11111 Q ss_pred EEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHH Q lcl|NC_012753. 196 TISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDE 275 (502) Q Consensus 196 ~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~ 275 (502) .+-|.++....++.. .. . .. ....-++..-++.+ ......+...|+|.|+.++..+..|+.-.+. T Consensus 199 p~aY~i~~~hPgd~~-~~---~----~~---~~~~rvpA~~VlHi----f~~~r~gQ~RGvs~lapvl~~l~~l~~y~da 263 (548) T protein:vir:95 199 KRAYHLLKDHPGNLQ-TL---G----GS---LAVKRVEAERIIHI----AYRKRIGQNRGVPMLHAVLIRLADLKDYEES 263 (548) T ss_pred eEEEEEeecCCCccc-cc---c----cc---cceeeechhHheec----ccccCCccccCcchHHHHHHHHHHHhHHHHH Confidence 222223332111100 00 0 00 00011111112221 2233456678999999999999999865543 Q ss_pred HHHHHh-hccceeeechHHhccC-CCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHH Q lcl|NC_012753. 276 FMWEVK-MGQRRVAVPTQMIKTE-YDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLS 353 (502) Q Consensus 276 ~~~~~~-~~~~~i~v~~~~l~~~-~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 353 (502) -..--. ..--..|| +.. ++..+..- ..........+-+......-..+.-|+.+++.-+..+|..-+..+++ T Consensus 264 el~~aki~A~~a~fi-----~~~~~~~~~~~~-~~~~~~~~~~~~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr 337 (548) T protein:vir:95 264 ERVAARISAALAMYI-----KKGNPDSYTVEP-GKDRKNRTIPIAPGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLR 337 (548) T ss_pred HHHHHHHhhhheeee-----ecCCCccccCCC-CcccccccccccCCccccccCCCceeeecCCCCCCCCHHHHHHHHHH Confidence 322211 11112222 211 11111000 00000000011111111100112237777777777788888999999 Q ss_pred HHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHhhccc--CCCcccccceEE Q lcl|NC_012753. 354 LFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKE-LVISILELAKVYNLY--TGEIPTMDEVSV 430 (502) Q Consensus 354 ~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~-l~~~il~~~~~~~~~--~~~~~~~~~i~v 430 (502) .|....|+|+..++.+.++ |=.++++.....-......+..|...+.+ +.+..+..+-+-+.. ++......-+.+ T Consensus 338 ~IAaglGipYe~ltgD~s~--nYSS~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~ 415 (548) T protein:vir:95 338 MIGAGTRSTYSSVSRAYDG--TYSAQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAA 415 (548) T ss_pred HHHhhcCCCHHHHhcccch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheee Confidence 9999999999998877543 33333333333333444444444433333 444444433222211 111111223556 Q ss_pred Ee--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhc---cc------------CCCCC Q lcl|NC_012753. 431 DL--DDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMV---ST------------DSFRT 493 (502) Q Consensus 431 ~f--~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~---~~------------~~~~~ 493 (502) .| +.-.-+|+.++++....++.+|+.|.+..+++. |.+-+++.+++++-.+.... .. .+... T Consensus 416 ~W~~P~~~~iDP~Kea~A~~~~i~~Gl~T~~~~~a~~-G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~ 494 (548) T protein:vir:95 416 VYQGPVMPWINPMHEANAWELLVKAGFADEAEVARAR-GRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVE 494 (548) T ss_pred eeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCC Confidence 66 444568999999999999999999999988874 77666654443332211100 00 00111 Q ss_pred c-cccCCCCC Q lcl|NC_012753. 494 S-EEVDIYGE 502 (502) Q Consensus 494 ~-~~~~~~g~ 502 (502) + .....+|- T Consensus 495 ~~~~~~~~~~ 504 (548) T protein:vir:95 495 AVQKVYLGVG 504 (548) T ss_pred chhhhccccc Confidence 1 11111111 No 89 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=99.42 E-value=5.1e-12 Score=82.51 Aligned_cols=401 Identities=10% Similarity=0.069 Sum_probs=196.7 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |.+.+...++.-.+|..-. . ....+..... ..+......| .+..+++.+| T Consensus 1 ~~~~D~~~~~~~~~g~~~~-~----~~~~~~~~~~----~~~~~l~a~Y---------------------~~~~l~~~~v 50 (437) T protein:vir:52 1 MKFFDGIKSLALKLGSKQE-Q----TYYSPSLSLT----DDLVQLEALW---------------------RDNWIANKVC 50 (437) T ss_pred CchhhhhHhHHhcCCCccc-c----ceeecCcccc----ccHHHHHHHH---------------------HhCchhhHHh Confidence 9888888887644332100 0 0000000000 0111222223 2346889999 Q ss_pred HHHhhhhhcCcceEeeCC--HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCce-----------EEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDN--EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQI-----------RVSFV 147 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d--~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-----------~i~~v 147 (502) +..|.-++.++..|++++ +...+.+++.++.-++...+.+++.++-.+|++++.+-.|+..+ .+.++ T Consensus 51 d~~a~d~~r~~~~i~~~d~~~~~~~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~~d~~~~~~pl~~~~~~~~~~v~ 130 (437) T protein:vir:52 51 IKRPEDMVRNWREIYSNDLNSKQLDLFTKFERSLKLRETLTKALQWSSLYGSVGLLVVTDSQNTSAPLKPTERLKRLIIL 130 (437) T ss_pred hcchHHhhcCCceEecCCCCHHHHHHHHHHHHhhcHHHHHHHHHHhcccccceEEEEEecCCCcccccccCCceeEEEEe Confidence 999999999999998865 33445788888877899999999999999999998887764321 12222 Q ss_pred cCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcce Q lcl|NC_012753. 148 QATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETV 227 (502) Q Consensus 148 ~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~ 227 (502) ++.++-|......+.. .- .|-..+. |+|. +.+. +..| |+..+ + T Consensus 131 ~~~~v~~~~~~~~dp~--------------s~-~fg~p~~-------y~v~-----~~~~---~~~i----H~SRi---i 173 (437) T protein:vir:52 131 PKWKISPTGTKDDDVL--------------SP-NFGRYSE-------YSIL-----GGSQ---SITV----HHSRL---I 173 (437) T ss_pred chhhcccccccccccc--------------cc-ccCcceE-------EEEe-----cCCc---ceeE----cccee---E Confidence 3322222111100000 00 0101111 1111 0000 0011 01100 0 Q ss_pred eecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceee-ec--hHHhccCCCCCCcc Q lcl|NC_012753. 228 TLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVA-VP--TQMIKTEYDTNGEK 304 (502) Q Consensus 228 ~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~-v~--~~~l~~~~~~~g~~ 304 (502) .+.| ...+. .....+|+|.+..+.+-|..++.+.-....=+...+..++ ++ .+.+.. +.... T Consensus 174 ~~~~--------~~~~~----~~~~~~G~s~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~---~~~~~ 238 (437) T protein:vir:52 174 ILNA--------NDAPL----SDNDIWGVSDLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAA---GMENE 238 (437) T ss_pred EecC--------ccCCC----ccccccCCchHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcC---CcHHH Confidence 1111 11111 1245679999999999999999776555443322233333 32 122221 11110 Q ss_pred cCcccc-ccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-ccccccccccHHHHHHH Q lcl|NC_012753. 305 VTVKRE-FETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSMKTATEVVSE 382 (502) Q Consensus 305 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~~tAtei~~~ 382 (502) .....+ +....+....+..+. +.-++.++.+ .......++...++|+..+++|... ||...+|.+|+.+=... T Consensus 239 ~~~~~~~~~~~~~~~~~~~~d~---~~~~e~~~~~--~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Glasge~D~~~ 313 (437) T protein:vir:52 239 VASVISAVQEIKSATNSLLLDA---ENEYDRKELT--FTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLASGDEDIQN 313 (437) T ss_pred HHHHHHHHHHhcCCCceEEEcC---CcceEEEecC--cCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccccccHHHHHH Confidence 000000 000001111111111 1225555443 3346677888888999999999865 56666676666643333 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHH-------HHHHHhcC Q lcl|NC_012753. 383 QSDTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDY-------WSKMVAAG 454 (502) Q Consensus 383 ~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~-------~~~~~~~G 454 (502) | +..+..+| ..++..|+.|+.+++.-+ .+.. ..+++|.|+.-...++.+.+++ ..+++++| T Consensus 314 y---yd~i~~~Qe~~l~p~le~l~~~i~~~~------~g~~--~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g 382 (437) T protein:vir:52 314 Y---HEAIRRLQETRLRPIFEIIDPLICNEL------FGGL--PADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQNG 382 (437) T ss_pred H---HHHHHHHHHHHHHHHHHHHHHHHHHHh------cCCC--CCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcC Confidence 3 34444444 467888888888765421 2222 2368999998777776665554 56667788 Q ss_pred CCCHHHHHHhc-----C-CCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 455 FAPKTMAIEKT-----L-NVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 455 i~S~et~l~~~-----~-~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++|...+...+ + .+++++.+.. + ..++.......+....+--.++ T Consensus 383 ~i~~~e~r~~L~~~g~~~~i~~~~~~~~-~--~~~~~~~~~~~~~~~~~~~~~~ 433 (437) T protein:vir:52 383 VLNEYQIANELRESGLFANISAEHIEEL-K--NADEFAGNFEEPEKMEGAQVQN 433 (437) T ss_pred CCCHHHHHHHHHhcCCCCCCCccccccc-c--CCCCCCCccCCCCCCCCCCCCC Confidence 88887765542 1 2444322111 0 0010000000000000111111 No 90 >protein:vir:2764 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612881;genbank:gi:20065798;genbank:GeneID:935623 Probab=99.41 E-value=2.7e-11 Score=78.53 Aligned_cols=458 Identities=10% Similarity=0.052 Sum_probs=217.8 Q ss_pred CChhHHHH-----HHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIK-----NFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik-----~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~ 75 (502) +..-++.. ++-++... .+.. .+...++-.....+..+||.|.++.-.-.. .-....+..++.|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~--------~~~~--~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p~~~~N~ 74 (714) T protein:vir:27 6 NTMATKNDNGATPRFSQRQLQ--------ALCS--DIDSQPKWRDAANKACAYYDGDQLPPEVLQ-VLKDRGQPMTIHNL 74 (714) T ss_pred ccccCCCCcchhHHHHHHHHH--------HHHH--HHHhhHHHHHHHHHHHHhhcCCCCCHHHHH-HHHhcCCCcEEecc Confidence 11111110 11111000 0000 111122334556677899999877421110 01123356788999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC----C-H--HHHHHH----HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC----C Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD----N-E--VADAFI----NETLKNDKFSKNFERYLESCLALGGLAMRPYIDG----D 140 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~----d-~--~~~e~l----~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~----~ 140 (502) -+.+|+...++--...+.+.+. + . +..+.| ..+.+.+++......+...+++.|-+|..+|++. + T Consensus 75 i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~ 154 (714) T protein:vir:27 75 IAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGP 154 (714) T ss_pred HHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCC Confidence 9999999999999988888773 2 2 244444 4455567888899999999999999999999874 4 Q ss_pred ceEEEEEcCCeEEEEE-EcCCCeEEEEEEEEEEEeeC------------------------------------------- Q lcl|NC_012753. 141 QIRVSFVQATVFFPLQ-ANTQDVSSAAIVTKSTKTEG------------------------------------------- 176 (502) Q Consensus 141 ~~~i~~v~~~~~~Pi~-~d~~~~~~~~~~~~~~~~~~------------------------------------------- 176 (502) .++|.+|+|.++++=. ....+...+-|+.+..+.+. T Consensus 155 ~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:27 155 EFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred CeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 6899999999987411 00122444544443322210 Q ss_pred -------------CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-c-------cC---CC-------c Q lcl|NC_012753. 177 -------------QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-Y-------ED---LE-------E 225 (502) Q Consensus 177 -------------~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~-------~~---l~-------~ 225 (502) ...+.++++|+|.. . ...+.++...+ |..+.+... . .+ +. . T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k---~-~~~~~~~~~~~----g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:27 235 YQSWDRQQNEWLQRERRRVLLQVVYYR---T-FERLPVIELSN----GRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred hccccccccccccccccEEEEEEEEEE---E-EEEEEeeccCC----CceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 00012233333210 0 00111111111 222211100 0 00 00 0 Q ss_pred ceeecC----------C--CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHH Q lcl|NC_012753. 226 TVTLNG----------L--TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQM 293 (502) Q Consensus 226 ~~~~~~----------~--~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~ 293 (502) ..+++| + ++.||+++.-- .....+.|+| .+.++++.++.+|...|...+-+ +.+++++.++. T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~--~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a 380 (714) T protein:vir:27 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGY--RKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDA 380 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeee--eeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCc Confidence 001111 0 12222222111 1111233554 68899999999999999988855 34444443332 Q ss_pred hccCCCC-CCcccCccccccccchhhccccCC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccc Q lcl|NC_012753. 294 IKTEYDT-NGEKVTVKREFETGHNVYEQFDSG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD 369 (502) Q Consensus 294 l~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~ 369 (502) +...... ...--+++. ...++.. .......++..++.--..++...++.....|...+|++...+|.. T Consensus 381 ~~~~d~~~~e~~arp~~--------vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~ 452 (714) T protein:vir:27 381 TQLSDNDLMEQIERPDG--------IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD 452 (714) T ss_pred ccccHHHHHHhccCCCC--------ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC Confidence 2111000 000001110 0111111 011112244333222345678889988999999999999999976 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hhh---cccCCCccc------------------ Q lcl|NC_012753. 370 GKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELA----KVY---NLYTGEIPT------------------ 424 (502) Q Consensus 370 ~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~----~~~---~~~~~~~~~------------------ 424 (502) ++ ..|+.+|.++...........-..++.+.+.+.+.++.+. ... .+.+..... T Consensus 453 ~n-a~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~ 531 (714) T protein:vir:27 453 SG-ATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELT 531 (714) T ss_pred cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceec Confidence 54 4578888777665554445455555555555555554433 221 122110000 Q ss_pred ------ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCC Q lcl|NC_012753. 425 ------MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM------AIEKTLNVTKEQAQEIYQKINDETMVSTDSFR 492 (502) Q Consensus 425 ------~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et------~l~~~~~~~deea~~el~ri~~E~~~~~~~~~ 492 (502) .++|.|+=..+.+.-.++..+.++++..+ +++.. .+.++-++.. +++.+++|++-.+...+... T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~~~ 607 (714) T protein:vir:27 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSPDE 607 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCccc Confidence 01122222222223345566666666532 33331 1334445533 45566677665443221111 Q ss_pred CccccCCCCC Q lcl|NC_012753. 493 TSEEVDIYGE 502 (502) Q Consensus 493 ~~~~~~~~g~ 502 (502) .. -| T Consensus 608 ~~------~e 611 (714) T protein:vir:27 608 MT------PE 611 (714) T ss_pred cc------hh Confidence 10 12 No 91 >protein:vir:9950 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859080;genbank:gi:32170835;genbank:GeneID:2653184 Probab=99.41 E-value=2.7e-11 Score=78.53 Aligned_cols=458 Identities=10% Similarity=0.052 Sum_probs=217.8 Q ss_pred CChhHHHH-----HHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIK-----NFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik-----~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~ 75 (502) +..-++.. ++-++... .+.. .+...++-.....+..+||.|.++.-.-.. .-....+..++.|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~--------~~~~--~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p~~~~N~ 74 (714) T protein:vir:99 6 NTMATKNDNGATPRFSQRQLQ--------ALCS--DIDSQPKWRDAANKACAYYDGDQLPPEVLQ-VLKDRGQPMTIHNL 74 (714) T ss_pred ccccCCCCcchhHHHHHHHHH--------HHHH--HHHhhHHHHHHHHHHHHhhcCCCCCHHHHH-HHHhcCCCcEEecc Confidence 11111110 11111000 0000 111122334556677899999877421110 01123356788999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC----C-H--HHHHHH----HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC----C Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD----N-E--VADAFI----NETLKNDKFSKNFERYLESCLALGGLAMRPYIDG----D 140 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~----d-~--~~~e~l----~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~----~ 140 (502) -+.+|+...++--...+.+.+. + . +..+.| ..+.+.+++......+...+++.|-+|..+|++. + T Consensus 75 i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~ 154 (714) T protein:vir:99 75 IAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGP 154 (714) T ss_pred HHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCC Confidence 9999999999999988888773 2 2 244444 4455567888899999999999999999999874 4 Q ss_pred ceEEEEEcCCeEEEEE-EcCCCeEEEEEEEEEEEeeC------------------------------------------- Q lcl|NC_012753. 141 QIRVSFVQATVFFPLQ-ANTQDVSSAAIVTKSTKTEG------------------------------------------- 176 (502) Q Consensus 141 ~~~i~~v~~~~~~Pi~-~d~~~~~~~~~~~~~~~~~~------------------------------------------- 176 (502) .++|.+|+|.++++=. ....+...+-|+.+..+.+. T Consensus 155 ~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:99 155 EFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred CeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 6899999999987411 00122444544443322210 Q ss_pred -------------CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-c-------cC---CC-------c Q lcl|NC_012753. 177 -------------QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-Y-------ED---LE-------E 225 (502) Q Consensus 177 -------------~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~-------~~---l~-------~ 225 (502) ...+.++++|+|.. . ...+.++...+ |..+.+... . .+ +. . T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k---~-~~~~~~~~~~~----g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:99 235 YQSWDRQQNEWLQRERRRVLLQVVYYR---T-FERLPVIELSN----GRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred hccccccccccccccccEEEEEEEEEE---E-EEEEEeeccCC----CceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 00012233333210 0 00111111111 222211100 0 00 00 0 Q ss_pred ceeecC----------C--CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHH Q lcl|NC_012753. 226 TVTLNG----------L--TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQM 293 (502) Q Consensus 226 ~~~~~~----------~--~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~ 293 (502) ..+++| + ++.||+++.-- .....+.|+| .+.++++.++.+|...|...+-+ +.+++++.++. T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~--~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a 380 (714) T protein:vir:99 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGY--RKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDA 380 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeee--eeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCc Confidence 001111 0 12222222111 1111233554 68899999999999999988855 34444443332 Q ss_pred hccCCCC-CCcccCccccccccchhhccccCC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccc Q lcl|NC_012753. 294 IKTEYDT-NGEKVTVKREFETGHNVYEQFDSG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD 369 (502) Q Consensus 294 l~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~ 369 (502) +...... ...--+++. ...++.. .......++..++.--..++...++.....|...+|++...+|.. T Consensus 381 ~~~~d~~~~e~~arp~~--------vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~ 452 (714) T protein:vir:99 381 TQLSDNDLMEQIERPDG--------IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD 452 (714) T ss_pred ccccHHHHHHhccCCCC--------ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC Confidence 2111000 000001110 0111111 011112244333222345678889988999999999999999976 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hhh---cccCCCccc------------------ Q lcl|NC_012753. 370 GKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELA----KVY---NLYTGEIPT------------------ 424 (502) Q Consensus 370 ~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~----~~~---~~~~~~~~~------------------ 424 (502) ++ ..|+.+|.++...........-..++.+.+.+.+.++.+. ... .+.+..... T Consensus 453 ~n-a~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~ 531 (714) T protein:vir:99 453 SG-ATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELT 531 (714) T ss_pred cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceec Confidence 54 4578888777665554445455555555555555554433 221 122110000 Q ss_pred ------ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCC Q lcl|NC_012753. 425 ------MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM------AIEKTLNVTKEQAQEIYQKINDETMVSTDSFR 492 (502) Q Consensus 425 ------~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et------~l~~~~~~~deea~~el~ri~~E~~~~~~~~~ 492 (502) .++|.|+=..+.+.-.++..+.++++..+ +++.. .+.++-++.. +++.+++|++-.+...+... T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~~~ 607 (714) T protein:vir:99 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSPDE 607 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCccc Confidence 01122222222223345566666666532 33331 1334445533 45566677665443221111 Q ss_pred CccccCCCCC Q lcl|NC_012753. 493 TSEEVDIYGE 502 (502) Q Consensus 493 ~~~~~~~~g~ 502 (502) .. -| T Consensus 608 ~~------~e 611 (714) T protein:vir:99 608 MT------PE 611 (714) T ss_pred cc------hh Confidence 10 12 No 92 >protein:vir:817 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050550;genbank:gi:9633447;genbank:GeneID:1262279 Probab=99.41 E-value=2.7e-11 Score=78.53 Aligned_cols=458 Identities=10% Similarity=0.052 Sum_probs=217.8 Q ss_pred CChhHHHH-----HHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIK-----NFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik-----~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~ 75 (502) +..-++.. ++-++... .+.. .+...++-.....+..+||.|.++.-.-.. .-....+..++.|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~--------~~~~--~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p~~~~N~ 74 (714) T protein:vir:81 6 NTMATKNDNGATPRFSQRQLQ--------ALCS--DIDSQPKWRDAANKACAYYDGDQLPPEVLQ-VLKDRGQPMTIHNL 74 (714) T ss_pred ccccCCCCcchhHHHHHHHHH--------HHHH--HHHhhHHHHHHHHHHHHhhcCCCCCHHHHH-HHHhcCCCcEEecc Confidence 11111110 11111000 0000 111122334556677899999877421110 01123356788999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC----C-H--HHHHHH----HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC----C Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD----N-E--VADAFI----NETLKNDKFSKNFERYLESCLALGGLAMRPYIDG----D 140 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~----d-~--~~~e~l----~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~----~ 140 (502) -+.+|+...++--...+.+.+. + . +..+.| ..+.+.+++......+...+++.|-+|..+|++. + T Consensus 75 i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~ 154 (714) T protein:vir:81 75 IAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGP 154 (714) T ss_pred HHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCC Confidence 9999999999999988888773 2 2 244444 4455567888899999999999999999999874 4 Q ss_pred ceEEEEEcCCeEEEEE-EcCCCeEEEEEEEEEEEeeC------------------------------------------- Q lcl|NC_012753. 141 QIRVSFVQATVFFPLQ-ANTQDVSSAAIVTKSTKTEG------------------------------------------- 176 (502) Q Consensus 141 ~~~i~~v~~~~~~Pi~-~d~~~~~~~~~~~~~~~~~~------------------------------------------- 176 (502) .++|.+|+|.++++=. ....+...+-|+.+..+.+. T Consensus 155 ~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:81 155 EFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred CeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 6899999999987411 00122444544443322210 Q ss_pred -------------CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-c-------cC---CC-------c Q lcl|NC_012753. 177 -------------QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-Y-------ED---LE-------E 225 (502) Q Consensus 177 -------------~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~-------~~---l~-------~ 225 (502) ...+.++++|+|.. . ...+.++...+ |..+.+... . .+ +. . T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k---~-~~~~~~~~~~~----g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:81 235 YQSWDRQQNEWLQRERRRVLLQVVYYR---T-FERLPVIELSN----GRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred hccccccccccccccccEEEEEEEEEE---E-EEEEEeeccCC----CceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 00012233333210 0 00111111111 222211100 0 00 00 0 Q ss_pred ceeecC----------C--CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHH Q lcl|NC_012753. 226 TVTLNG----------L--TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQM 293 (502) Q Consensus 226 ~~~~~~----------~--~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~ 293 (502) ..+++| + ++.||+++.-- .....+.|+| .+.++++.++.+|...|...+-+ +.+++++.++. T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~--~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a 380 (714) T protein:vir:81 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGY--RKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDA 380 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeee--eeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCc Confidence 001111 0 12222222111 1111233554 68899999999999999988855 34444443332 Q ss_pred hccCCCC-CCcccCccccccccchhhccccCC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccc Q lcl|NC_012753. 294 IKTEYDT-NGEKVTVKREFETGHNVYEQFDSG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD 369 (502) Q Consensus 294 l~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~ 369 (502) +...... ...--+++. ...++.. .......++..++.--..++...++.....|...+|++...+|.. T Consensus 381 ~~~~d~~~~e~~arp~~--------vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~ 452 (714) T protein:vir:81 381 TQLSDNDLMEQIERPDG--------IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD 452 (714) T ss_pred ccccHHHHHHhccCCCC--------ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC Confidence 2111000 000001110 0111111 011112244333222345678889988999999999999999976 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hhh---cccCCCccc------------------ Q lcl|NC_012753. 370 GKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELA----KVY---NLYTGEIPT------------------ 424 (502) Q Consensus 370 ~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~----~~~---~~~~~~~~~------------------ 424 (502) ++ ..|+.+|.++...........-..++.+.+.+.+.++.+. ... .+.+..... T Consensus 453 ~n-a~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~ 531 (714) T protein:vir:81 453 SG-ATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELT 531 (714) T ss_pred cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceec Confidence 54 4578888777665554445455555555555555554433 221 122110000 Q ss_pred ------ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCC Q lcl|NC_012753. 425 ------MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM------AIEKTLNVTKEQAQEIYQKINDETMVSTDSFR 492 (502) Q Consensus 425 ------~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et------~l~~~~~~~deea~~el~ri~~E~~~~~~~~~ 492 (502) .++|.|+=..+.+.-.++..+.++++..+ +++.. .+.++-++.. +++.+++|++-.+...+... T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~~~ 607 (714) T protein:vir:81 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSPDE 607 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCccc Confidence 01122222222223345566666666532 33331 1334445533 45566677665443221111 Q ss_pred CccccCCCCC Q lcl|NC_012753. 493 TSEEVDIYGE 502 (502) Q Consensus 493 ~~~~~~~~g~ 502 (502) .. -| T Consensus 608 ~~------~e 611 (714) T protein:vir:81 608 MT------PE 611 (714) T ss_pred cc------hh Confidence 10 12 No 93 >protein:vir:3296 Length: 714 # NCBI annotation: putative portal protein # Family: family:all:487 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049512;genbank:gi:9632518;genbank:GeneID:1262005 Probab=99.41 E-value=2.7e-11 Score=78.53 Aligned_cols=458 Identities=10% Similarity=0.052 Sum_probs=217.8 Q ss_pred CChhHHHH-----HHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIK-----NFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik-----~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~ 75 (502) +..-++.. ++-++... .+.. .+...++-.....+..+||.|.++.-.-.. .-....+..++.|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~--------~~~~--~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p~~~~N~ 74 (714) T protein:vir:32 6 NTMATKNDNGATPRFSQRQLQ--------ALCS--DIDSQPKWRDAANKACAYYDGDQLPPEVLQ-VLKDRGQPMTIHNL 74 (714) T ss_pred ccccCCCCcchhHHHHHHHHH--------HHHH--HHHhhHHHHHHHHHHHHhhcCCCCCHHHHH-HHHhcCCCcEEecc Confidence 11111110 11111000 0000 111122334556677899999877421110 01123356788999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC----C-H--HHHHHH----HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC----C Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD----N-E--VADAFI----NETLKNDKFSKNFERYLESCLALGGLAMRPYIDG----D 140 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~----d-~--~~~e~l----~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~----~ 140 (502) -+.+|+...++--...+.+.+. + . +..+.| ..+.+.+++......+...+++.|-+|..+|++. + T Consensus 75 i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~ 154 (714) T protein:vir:32 75 IAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGP 154 (714) T ss_pred HHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCC Confidence 9999999999999988888773 2 2 244444 4455567888899999999999999999999874 4 Q ss_pred ceEEEEEcCCeEEEEE-EcCCCeEEEEEEEEEEEeeC------------------------------------------- Q lcl|NC_012753. 141 QIRVSFVQATVFFPLQ-ANTQDVSSAAIVTKSTKTEG------------------------------------------- 176 (502) Q Consensus 141 ~~~i~~v~~~~~~Pi~-~d~~~~~~~~~~~~~~~~~~------------------------------------------- 176 (502) .++|.+|+|.++++=. ....+...+-|+.+..+.+. T Consensus 155 ~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:32 155 EFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred CeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 6899999999987411 00122444544443322210 Q ss_pred -------------CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-c-------cC---CC-------c Q lcl|NC_012753. 177 -------------QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-Y-------ED---LE-------E 225 (502) Q Consensus 177 -------------~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~-------~~---l~-------~ 225 (502) ...+.++++|+|.. . ...+.++...+ |..+.+... . .+ +. . T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k---~-~~~~~~~~~~~----g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:32 235 YQSWDRQQNEWLQRERRRVLLQVVYYR---T-FERLPVIELSN----GRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred hccccccccccccccccEEEEEEEEEE---E-EEEEEeeccCC----CceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 00012233333210 0 00111111111 222211100 0 00 00 0 Q ss_pred ceeecC----------C--CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHH Q lcl|NC_012753. 226 TVTLNG----------L--TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQM 293 (502) Q Consensus 226 ~~~~~~----------~--~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~ 293 (502) ..+++| + ++.||+++.-- .....+.|+| .+.++++.++.+|...|...+-+ +.+++++.++. T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~--~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a 380 (714) T protein:vir:32 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGY--RKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDA 380 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeee--eeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCc Confidence 001111 0 12222222111 1111233554 68899999999999999988855 34444443332 Q ss_pred hccCCCC-CCcccCccccccccchhhccccCC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccc Q lcl|NC_012753. 294 IKTEYDT-NGEKVTVKREFETGHNVYEQFDSG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD 369 (502) Q Consensus 294 l~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~ 369 (502) +...... ...--+++. ...++.. .......++..++.--..++...++.....|...+|++...+|.. T Consensus 381 ~~~~d~~~~e~~arp~~--------vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~ 452 (714) T protein:vir:32 381 TQLSDNDLMEQIERPDG--------IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD 452 (714) T ss_pred ccccHHHHHHhccCCCC--------ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC Confidence 2111000 000001110 0111111 011112244333222345678889988999999999999999976 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hhh---cccCCCccc------------------ Q lcl|NC_012753. 370 GKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELA----KVY---NLYTGEIPT------------------ 424 (502) Q Consensus 370 ~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~----~~~---~~~~~~~~~------------------ 424 (502) ++ ..|+.+|.++...........-..++.+.+.+.+.++.+. ... .+.+..... T Consensus 453 ~n-a~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~ 531 (714) T protein:vir:32 453 SG-ATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELT 531 (714) T ss_pred cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceec Confidence 54 4578888777665554445455555555555555554433 221 122110000 Q ss_pred ------ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCC Q lcl|NC_012753. 425 ------MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM------AIEKTLNVTKEQAQEIYQKINDETMVSTDSFR 492 (502) Q Consensus 425 ------~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et------~l~~~~~~~deea~~el~ri~~E~~~~~~~~~ 492 (502) .++|.|+=..+.+.-.++..+.++++..+ +++.. .+.++-++.. +++.+++|++-.+...+... T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~~~ 607 (714) T protein:vir:32 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSPDE 607 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCccc Confidence 01122222222223345566666666532 33331 1334445533 45566677665443221111 Q ss_pred CccccCCCCC Q lcl|NC_012753. 493 TSEEVDIYGE 502 (502) Q Consensus 493 ~~~~~~~~g~ 502 (502) .. -| T Consensus 608 ~~------~e 611 (714) T protein:vir:32 608 MT------PE 611 (714) T ss_pred cc------hh Confidence 10 12 No 94 >protein:vir:10117 Length: 714 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859247;genbank:gi:32171003;genbank:GeneID:2653343 Probab=99.41 E-value=2.7e-11 Score=78.53 Aligned_cols=458 Identities=10% Similarity=0.052 Sum_probs=217.8 Q ss_pred CChhHHHH-----HHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIK-----NFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik-----~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~ 75 (502) +..-++.. ++-++... .+.. .+...++-.....+..+||.|.++.-.-.. .-....+..++.|. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~--------~~~~--~~~~~~~~R~~a~~d~~fy~G~Qw~~~~~~-~l~~~g~p~~~~N~ 74 (714) T protein:vir:10 6 NTMATKNDNGATPRFSQRQLQ--------ALCS--DIDSQPKWRDAANKACAYYDGDQLPPEVLQ-VLKDRGQPMTIHNL 74 (714) T ss_pred ccccCCCCcchhHHHHHHHHH--------HHHH--HHHhhHHHHHHHHHHHHhhcCCCCCHHHHH-HHHhcCCCcEEecc Confidence 11111110 11111000 0000 111122334556677899999877421110 01123356788999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC----C-H--HHHHHH----HHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC----C Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD----N-E--VADAFI----NETLKNDKFSKNFERYLESCLALGGLAMRPYIDG----D 140 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~----d-~--~~~e~l----~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~----~ 140 (502) -+.+|+...++--...+.+.+. + . +..+.| ..+.+.+++......+...+++.|-+|..+|++. + T Consensus 75 i~~~v~~v~g~~~~nr~~~~v~p~~~~~~~~~~Ae~l~~~~~~~~~~~~~~~~~s~af~~~~~~G~G~~~~~~~~d~~~~ 154 (714) T protein:vir:10 75 IAPTVDGVLGMEAKTRTDLVVMSDEPDDETEKLAEAINAEFADACRLGNMNKARSDAYAEQIKAGLSWVEVRRNSDPFGP 154 (714) T ss_pred HHHHHHHHHhHHHhCCcceEEecCCCCchhHHHHHHHHHHHHHHHHhhchhHHHHHHHHHhhhcCcceEEeccccCCCCC Confidence 9999999999999988888773 2 2 244444 4455567888899999999999999999999874 4 Q ss_pred ceEEEEEcCCeEEEEE-EcCCCeEEEEEEEEEEEeeC------------------------------------------- Q lcl|NC_012753. 141 QIRVSFVQATVFFPLQ-ANTQDVSSAAIVTKSTKTEG------------------------------------------- 176 (502) Q Consensus 141 ~~~i~~v~~~~~~Pi~-~d~~~~~~~~~~~~~~~~~~------------------------------------------- 176 (502) .++|.+|+|.++++=. ....+...+-|+.+..+.+. T Consensus 155 ~i~i~~v~p~~v~~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~fP~~a~~i~~~~~~~~~~~d~~~~~~~~~~~~~~~~~ 234 (714) T protein:vir:10 155 EFKVSTVSRNEVFWDWLSREADLSDCRWLMRRRWMDTDEAKATFPGMAQVIDYAIDDWRGFVDTTVTEGQPSPLMSAWEE 234 (714) T ss_pred CeEEEecchhheeeccccccCChhhccceeeeecCCHHHHHHhcCCchhhhhhhhhhhccccccccccccccccccchhh Confidence 6899999999987411 00122444544443322210 Q ss_pred -------------CCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-c-------cC---CC-------c Q lcl|NC_012753. 177 -------------QKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-Y-------ED---LE-------E 225 (502) Q Consensus 177 -------------~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~-------~~---l~-------~ 225 (502) ...+.++++|+|.. . ...+.++...+ |..+.+... . .+ +. . T Consensus 235 ~~~~~~~~~~~~~~~~~rv~v~E~w~k---~-~~~~~~~~~~~----g~~~~~d~~~~~~~~~~~~g~~~~~~~~~~rv~ 306 (714) T protein:vir:10 235 YQSWDRQQNEWLQRERRRVLLQVVYYR---T-FERLPVIELSN----GRVVAFDKNNLMQAVAVASGRVQVKVGRVSRIR 306 (714) T ss_pred hccccccccccccccccEEEEEEEEEE---E-EEEEEeeccCC----CceEEeCccCHHHHHHHhhcchhhhccccceEE Confidence 00012233333210 0 00111111111 222211100 0 00 00 0 Q ss_pred ceeecC----------C--CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHH Q lcl|NC_012753. 226 TVTLNG----------L--TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQM 293 (502) Q Consensus 226 ~~~~~~----------~--~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~ 293 (502) ..+++| + ++.||+++.-- .....+.|+| .+.++++.++.+|...|...+-+ +.+++++.++. T Consensus 307 ~~~~~g~~~L~~~~~p~p~~~fp~vp~~g~--~~~~~g~~~G--~vr~~~d~Qr~~N~~~s~~~~~l--~~~~~~~~~~a 380 (714) T protein:vir:10 307 EAWFVGPHFIVDRPCSAPQGMFPLVPFWGY--RKDKTGEPYG--LISRAIPAQDEVNFRRIKLTWLL--QAKRVIMDEDA 380 (714) T ss_pred EEEEecCcccccCCCCCCCCceeEEEEeee--eeeccCceee--hhhhchhHHHHHHHHHHHHHHhh--cCCceeeecCc Confidence 001111 0 12222222111 1111233554 68899999999999999988855 34444443332 Q ss_pred hccCCCC-CCcccCccccccccchhhccccCC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccc Q lcl|NC_012753. 294 IKTEYDT-NGEKVTVKREFETGHNVYEQFDSG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD 369 (502) Q Consensus 294 l~~~~~~-~g~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~ 369 (502) +...... ...--+++. ...++.. .......++..++.--..++...++.....|...+|++...+|.. T Consensus 381 ~~~~d~~~~e~~arp~~--------vi~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~~tGv~~~~lG~~ 452 (714) T protein:vir:10 381 TQLSDNDLMEQIERPDG--------IIKLNPVRKNQKSVADVFRVEQDFQVASQQFQVMQESEKLIQDTMGVYSAFLGQD 452 (714) T ss_pred ccccHHHHHHhccCCCC--------ceeecccccccCCCCccccccCCCCccHHHHHHHHHHHHHHHHhhCCChHHcCCC Confidence 2111000 000001110 0111111 011112244333222345678889988999999999999999976 Q ss_pred ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----Hhh---cccCCCccc------------------ Q lcl|NC_012753. 370 GKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELA----KVY---NLYTGEIPT------------------ 424 (502) Q Consensus 370 ~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~----~~~---~~~~~~~~~------------------ 424 (502) ++ ..|+.+|.++...........-..++.+.+.+.+.++.+. ... .+.+..... T Consensus 453 ~n-a~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~~~~erv~RI~~e~~~~~~~~~v~in~~~~~~~~~ 531 (714) T protein:vir:10 453 SG-ATSGVAISNLVEQGATTLAEINDNYQFACQQVGRLLLAYLLDDLKKRRNHAVVINRDDRQRRQTIVLNAEGDNGELT 531 (714) T ss_pred cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEeccCCCcCcceEEeeccccCcceec Confidence 54 4578888777665554445455555555555555554433 221 122110000 Q ss_pred ------ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH------HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCC Q lcl|NC_012753. 425 ------MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM------AIEKTLNVTKEQAQEIYQKINDETMVSTDSFR 492 (502) Q Consensus 425 ------~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et------~l~~~~~~~deea~~el~ri~~E~~~~~~~~~ 492 (502) .++|.|+=..+.+.-.++..+.++++..+ +++.. .+.++-++.. +++.+++|++-.+...+... T Consensus 532 nDi~~~~~Dv~i~~~p~~~t~r~~~~~~l~~l~~~--~~p~~~~~~~~~~l~~~d~p~--~~el~~~ir~~~~~~~~~~~ 607 (714) T protein:vir:10 532 NDISRLNTHIALAPVQQTPAFKAQLAQRMSEVIQG--LPPQVQAVVLDLWVNLLDVPQ--KQEFVERIRAALGTPKSPDE 607 (714) T ss_pred ccceeeeEEEEEeeccCchHHHHHHHHHHHHHHhh--cCchhhhhHHHHHHHhcCCCC--HHHHHHHHHHHcCCCCCccc Confidence 01122222222223345566666666532 33331 1334445533 45566677665443221111 Q ss_pred CccccCCCCC Q lcl|NC_012753. 493 TSEEVDIYGE 502 (502) Q Consensus 493 ~~~~~~~~g~ 502 (502) .. -| T Consensus 608 ~~------~e 611 (714) T protein:vir:10 608 MT------PE 611 (714) T ss_pred cc------hh Confidence 10 12 No 95 >protein:vir:105619 Length: 772 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164304;genbank:gi:56692922;genbank:GeneID:3197230 Probab=99.40 E-value=2.8e-11 Score=78.47 Aligned_cols=473 Identities=11% Similarity=0.060 Sum_probs=221.3 Q ss_pred CChhHHHHHHHHHHhhcc----cccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVI----TNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~----~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |-|-+-+...++...... ....+..+.. .+...++......+..+||.|.++.-.-.. .-.......++.|.- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~q~~~r~~a~~d~~fy~G~QW~~~~~~-~l~~~g~p~~~~N~i 77 (772) T protein:vir:10 1 MQITENDRQYLNGLPPAGDTPLTVDEYADINY--EIEDQPAWRAVADKEMDYADGNQLDTELLR-RQQALGIPPAVEDLI 77 (772) T ss_pred CCcchhhHHhhccCCcccccccCHHHHHHHHH--HHhccHHHHHHHHHHHHhhcCCCCCHHHHH-HHHhcCCCcEEEcch Confidence 777666666665432111 1112222221 122344555667778899999877422110 011234467889999 Q ss_pred HHHHHHHhhhhhcCcceEeeC------CHHHHHHHHH----HHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC----ce Q lcl|NC_012753. 77 RTASKKVASLVFNEQATIRVD------NEVADAFINE----TLKNDKFSKNFERYLESCLALGGLAMRPYIDGD----QI 142 (502) Q Consensus 77 k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~----~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~----~~ 142 (502) +.+|+...++-....+.+.+. |....+.|+. +.+.+++......++..+++.|.+|..++++++ .+ T Consensus 78 ~~~v~~v~g~~~~nr~d~~v~Pr~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~Gw~e~~~~~d~~~~~i 157 (772) T protein:vir:10 78 GPALLSLQGYEAVTRTDWRVTPNGDVGGQEVADALNYRLNTAERQSGADRACSEAFRPQIACGIGWVEVSRESDPFKFPY 157 (772) T ss_pred HHHHHHHHHHHHhcCcceEEecCCCchHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHhhhcCceeEEeccccCCCCCCe Confidence 999999999999988888772 2344554444 455688999999999999999999999998643 47 Q ss_pred EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee----------------------------------------------- Q lcl|NC_012753. 143 RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE----------------------------------------------- 175 (502) Q Consensus 143 ~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~----------------------------------------------- 175 (502) +|.+|+|..++.=..-..+...|-++.+..+.+ T Consensus 158 ~i~~v~p~~v~~Dp~a~~D~sDar~~~~~~~~~~d~~~~~fp~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 237 (772) T protein:vir:10 158 RCRPIRRDEIHWDMKCGDDWEACRFLRRQRWLSPDRIALVFPEHAELIGMVGKYGSTWWGQPDLGMMEGGTSTGLHNAWN 237 (772) T ss_pred EEEeeCcccceecCCCCCCHHHhhhhhhhccCCHHHHHHhCCCchhHHHhhhhhcccccCcccccccccccccccccccc Confidence 899999998774111011233333222111100 Q ss_pred -------------CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc-----------------------c Q lcl|NC_012753. 176 -------------GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST-----------------------L 219 (502) Q Consensus 176 -------------~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~-----------------------~ 219 (502) +...+..+++|+|... ...+..+.+.+ |.-+.+.. + T Consensus 238 ~~~~~~~~~~~~~~~~~~rVrv~E~w~r~----~~~~~~~~~~~----g~~~~~~~~~~~~~~~l~~g~~~~~~~~~~rv 309 (772) T protein:vir:10 238 EARAWTVQEDHWYNPTSKEICLVELWYRR----WVQVHVLKSPD----GRVVEYDPNNLAHNIALASGRISPKKVTVSRV 309 (772) T ss_pred hhhccccccccccccCCceEEEEEEeeee----eeeeeeeccCC----CceEeeCcccHHHHHHHhhcccchheeeeeEE Confidence 0001223444533110 01111111111 11111100 0 Q ss_pred -ccCCCcceeec------CCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechH Q lcl|NC_012753. 220 -YEDLEETVTLN------GLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQ 292 (502) Q Consensus 220 -~~~l~~~~~~~------~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~ 292 (502) +.-+.....+. ..+..||+++.--. ....+.|+ +.+.++++.++.+|+..|...+-+-. .++....+ T Consensus 310 ~~~~~~g~~~L~~~~~p~~~~~fP~vP~~g~r--~~~~g~~~--G~vr~~kd~Qr~~N~~~S~~~~~l~~--~~~~~~~g 383 (772) T protein:vir:10 310 RRSYWLGPHCLHDGPTPYTHRHFPYVPFFGFR--EDATGIPY--GYVRGMKYAQDSLNSGVSKLRWGMSV--ARVERTKG 383 (772) T ss_pred EEEEEecceeeccCCCCCCCCccceEEEeeeE--eccCCccc--chhhhhhhHHHHHHHHHHHHHHHHhc--ccccccCC Confidence 00000000010 01223444332111 11122345 47899999999999999999886532 23433333 Q ss_pred HhccCCCCCCcccCccccccccchhhccccCCC-CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccc Q lcl|NC_012753. 293 MIKTEYDTNGEKVTVKREFETGHNVYEQFDSGD-MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGK 371 (502) Q Consensus 293 ~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~ 371 (502) .+.... . .+.. .......+ ..++... +..+..++..++.---.++...++.....|....|++...+|..+ T Consensus 384 av~~~d---~-~~~e--~~arp~~v-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGv~~~~lG~~~- 455 (772) T protein:vir:10 384 AVAMTD---A-QFRR--QIARPDAD-IVLDENHMAKPGARFDVKRDYTLTDQHFQMLQDNRATIERVSNITAGFQGRKG- 455 (772) T ss_pred Cccchh---H-HHHH--hccCCCCe-EEeCCccccCCCCCccccCCccccHHHHHHHHHHHHHHHHHhCCCHHHcCCCc- Confidence 222110 0 0000 00000011 1111111 111222443332222456888899999999999999999999654 Q ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hh---cccCCCcc-cccceEE---Ee-------- Q lcl|NC_012753. 372 SMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAK----VY---NLYTGEIP-TMDEVSV---DL-------- 432 (502) Q Consensus 372 ~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~----~~---~~~~~~~~-~~~~i~v---~f-------- 432 (502) +..|+.+|..+...........-..++.+.+...+.+|.+-. .. .+.+.... ....+.+ .+ T Consensus 456 na~SGvAi~~rq~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~~d~~~~~~~v~in~~~~d~~tg~~~ 535 (772) T protein:vir:10 456 TATSGIQEQQQIEQSNQSIGRIMDNFRAGRTLVGELLLAMIVEDIGQERTEVVIEGDAVTADRVVVLNEPQRDPQTGAAY 535 (772) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCCCCceEEeccceecccccccc Confidence 446888887776655555555555555555555554444332 21 12211100 0000000 00 Q ss_pred --CC-----------CccCC---HHHHHHHHHHHHhcCCCCHHHHH------HhcCCCCHHHHHHHHHHHHHhhhcccCC Q lcl|NC_012753. 433 --DD-----------GVFTD---RNAEFDYWSKMVAAGFAPKTMAI------EKTLNVTKEQAQEIYQKINDETMVSTDS 490 (502) Q Consensus 433 --~d-----------~i~~d---~~~~~~~~~~~~~~Gi~S~et~l------~~~~~~~deea~~el~ri~~E~~~~~~~ 490 (502) +| ..|.. .++..+.++++. +.++++... .++-++.- .++.+++|++-.....++ T Consensus 536 ~~NDi~~g~yDv~i~~~p~~~t~r~~~~~~m~ql~--~~~~P~~~~~~~~~~le~~D~p~--~~ei~~~ir~~~~~~~pe 611 (772) T protein:vir:10 536 LSNDLLRTRIKVALEDVPSTNSYRGQQLNAMSEAV--KSMPPQYQAAVLPFLVSLMDVPF--KRDVVEAIRAVDQQQTPE 611 (772) T ss_pred eeccceeeeEEEEeeccccchHHHHHHHHHHHHHH--hccChhHHHHHHHHHHhhcCCCC--hHHHHHHHHHHhccCChH Confidence 01 01111 234445555543 334554322 22333432 233344444332221111 Q ss_pred CC------------CccccCCCCC Q lcl|NC_012753. 491 FR------------TSEEVDIYGE 502 (502) Q Consensus 491 ~~------------~~~~~~~~g~ 502 (502) .. .....++-.. T Consensus 612 q~~~~~~q~~qq~~~~~~~el~~~ 635 (772) T protein:vir:10 612 QIQQQIDQAVQDALAKAGNDIKLR 635 (772) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Confidence 00 0000111110 No 96 >protein:vir:95449 Length: 584 # NCBI annotation: hypothetical protein ORF047 # Family: family:all:1548 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294640;genbank:gi:149408206;genbank:GeneID:5237016 Probab=99.35 E-value=5.5e-12 Score=82.31 Aligned_cols=441 Identities=12% Similarity=0.072 Sum_probs=199.9 Q ss_pred CChhH----------HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHH---HHHHHHhcCCCCccccccCCCcccc Q lcl|NC_012753. 1 MGIIQ----------TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRI---MDNLRYFAGDFDSVTYRDSNGSQVK 67 (502) Q Consensus 1 m~~~~----------~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i---~~~~~~Y~g~~~~~~~~~~~~~~~~ 67 (502) |+.-. .+..++.+.+.... ..+-..+ .+.++||..-... ...+-+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~----------------~~r~~~~~~w~el~~y~~a~~~~---~~~~~~~~~ 61 (584) T protein:vir:95 1 MSVKVAELNSLLVRDSSAQWVAYLWDRFN----------------NQRRQKIEEWKELRNYVFATDTT---TTSNQGLPW 61 (584) T ss_pred CCcchhhhhhhccccchHHHHHHHHHHHH----------------hhhchhhccCHHHHHHHHhhhhh---hhhhccccc Confidence 33211 11112111111000 0011111 2333455442111 111222222 Q ss_pred ccceecchHHHHH----HHHhhhhhcCcceEee-----CC--HHHHHHHHHHH----hhccHHHHHHHHHHHHhhcCCEE Q lcl|NC_012753. 68 RDFNHLPIGRTAS----KKVASLVFNEQATIRV-----DN--EVADAFINETL----KNDKFSKNFERYLESCLALGGLA 132 (502) Q Consensus 68 ~~~~~~n~~k~iv----~~~a~~l~~ep~~i~~-----~d--~~~~e~l~~~~----~~~~f~~~~~~~~~~~~~~G~~~ 132 (502) ++++.+|-...++ ..+-+.+|+..--+++ ++ ...++.++.+. .+.+|...+.+.+.++..+|.++ T Consensus 62 r~~~~~~k~~~~~~~i~~~l~~~~Fp~~~w~~~v~~~~~~~~~~~~~ai~~~i~dkl~e~~~~~~~~~~i~d~~~~G~~~ 141 (584) T protein:vir:95 62 KNSTTLPKLCQIRDNLHSNYFSSLFPNDDWLRWVGYGKGDSTKTKAKAIQAYMSNKCRESHFRTEVSKLIYDYIDYGNAF 141 (584) T ss_pred ccccchhHHHHHHHHHHHHHHHhhcCccceeeeecCCCchhhHHHHHHHHHHHhhhhhhccHHHHHHHHHHhhccCCceE Confidence 4455555444444 4444445543222222 12 22355555554 55689999999999999999999 Q ss_pred EEEEEeCC--------------ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeC--------CCceEEE------- Q lcl|NC_012753. 133 MRPYIDGD--------------QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEG--------QKVKYYS------- 183 (502) Q Consensus 133 ~~~~~d~~--------------~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~--------~~~~~yt------- 183 (502) ++++|..+ +++|+.++|..+|| --..+.+..+.|+.+.+...+ +...+|- T Consensus 142 ~k~~~~~~~~e~~e~~~v~~~~~prieriSP~d~~~-Dpsa~~i~d~~fivrs~~T~~~L~~l~~~~~~~~y~~d~v~~~ 220 (584) T protein:vir:95 142 ATVSFEAKYKEMTDGTLVPDYIGPRLVRISPLDIVF-NPLATSISDTFKIVRSVKTKGELMRLAQDEPEQSYWLEALKRR 220 (584) T ss_pred EEEeEeecceeeeccccccccccceEEeeChhheee-cCCCCCccchhhhhhhhhhHHHHHHHHhhcCccccchHHHHHH Confidence 99999754 58999999999886 122244556665543322100 0000110 Q ss_pred ----------------------------EEEEEEEeCCeEEEEEEEEecCCcc-ccC--ceeeccccccCCCccee---- Q lcl|NC_012753. 184 ----------------------------LIEFHEWNKETYTISNELYESESKT-IIG--QRVPLSTLYEDLEETVT---- 228 (502) Q Consensus 184 ----------------------------~~E~h~~~~~~~~I~~~l~~~~~~~-~lG--~~v~l~~~~~~l~~~~~---- 228 (502) ..|+|. ..+++-..|-+.-++ ..+ ...+.-.++.+ .... T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ey~~----~~~V~vl~~~g~~~~~~~~e~~~~~iv~v~~g--~~iIR~~~ 294 (584) T protein:vir:95 221 EEICRHLGGYSVEDFDKAAGFDVDGFGNLYEYYM----SDWVEILEFYGDYHDKETGELQTNRIITVVDR--STEVRNES 294 (584) T ss_pred HHhccCCCCCcccccccccccccccccccccccC----CceeEEEeecccccccccCCCcccceEEEEec--cEEEEeee Confidence 011110 001110001110000 000 00000000000 0000 Q ss_pred -ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCc Q lcl|NC_012753. 229 -LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTV 307 (502) Q Consensus 229 -~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~ 307 (502) ..-.+++||+...+ .....+.||.|+.+.+.++++.+|.+.-++++.+...-+. ++..++. .....+.+ T Consensus 295 np~~~~~~PF~~~~~----~p~~~s~yG~gi~~ll~d~Q~~lna~~r~~iDnl~l~~~p--v~k~~~~----~~~~~~~p 364 (584) T protein:vir:95 295 IPTWFGSAPIYHVGW----RFRPDNLWAMGPLDNLVGMQYRIDHLENAKADAVDLIIQP--PLKIIGE----VEEFVWGP 364 (584) T ss_pred cCCCCCCCCEEEEcc----eeeeccccCCCchhhhhhHHHHHhHHHHHHHHHHHHhcCc--ceeeccc----cchhcccC Confidence 01225667776543 2245688999999999999999999999999988654333 2222222 11223333 Q ss_pred cccccccchhhccccCCCCccccceeeeccccchHHHH---HHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 308 KREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYI---KAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~---~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) +..|..++ ...++.+.|. ..++. ..|+.+...++..+|+|+..-|..+.+.+||+++.+.-+ T Consensus 365 g~~~~~~~-------------~~~~q~~~p~--a~~~~s~~~~lq~~e~~me~~sGvp~~~~G~~~~~~~TAtg~s~l~n 429 (584) T protein:vir:95 365 GAEIHLDQ-------------GGDVQEIAKN--VNYIINADNQIQMLEDRMELYAGAPREAMGIRTPGEKTAFEVQQLGN 429 (584) T ss_pred CceeecCC-------------CCCcceecCc--hhhhhHHHHHHHHHHHHHHhhhCCChhhcccccchhhhHHHHHHHHH Confidence 33333321 1124444432 22333 336667778888999999999998888899999987777 Q ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhcccCCCc--ccccceEEEeCCCccC-----C-------------HHHH Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSL-KELVISILELAKVYNLYTGEI--PTMDEVSVDLDDGVFT-----D-------------RNAE 443 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l-~~l~~~il~~~~~~~~~~~~~--~~~~~i~v~f~d~i~~-----d-------------~~~~ 443 (502) ++-.-+..+.+.|..+| ++|+.++..+..- ++...+. ....++.+..-..+.. | .+.. T Consensus 430 aa~~~~r~~~~~f~~~ll~~l~~ll~~~~~~-nmd~~~~vr~~n~e~~~~~f~~i~r~Dl~g~~~~va~Ga~~~~~keq~ 508 (584) T protein:vir:95 430 AAGRIFQEKVTTFEVELLEPVLNAMLETATR-NMDGSDVIRVMDTDLGVKEFMSVTREDITANGKIRPIGARHFGKQAQD 508 (584) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hccccCceeeeccccccccccccChhhhccCeeEEeehhhHHHHHHHH Confidence 77777777888887666 8888888776432 2211111 0001111100001111 0 1111 Q ss_pred HHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhh-cccCCCCC-ccccCCCCC Q lcl|NC_012753. 444 FDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETM-VSTDSFRT-SEEVDIYGE 502 (502) Q Consensus 444 ~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~-~~~~~~~~-~~~~~~~g~ 502 (502) .+...+..++ +....+.|- ...+++.++-++.. .....+.. +....-==| T Consensus 509 ~q~l~~ilq~------~~~~~i~p~---~~~~~l~~~ladl~~~p~~~~~~~~~~~~~Q~~ 560 (584) T protein:vir:95 509 LQNLVGIFNS------QIGQMILPH---TSGKALATFVDDVTGLQGYEIFRPNVAVAEQAE 560 (584) T ss_pred HHHHHHHHHh------hhhhhcccc---chHHHHHHHHHHHhCCCcccccCCCcccchhHH Confidence 1111111111 111111111 12344555333322 11111111 000000000 No 97 >protein:vir:95821 Length: 763 # NCBI annotation: 94 kDa protein # Family: family:all:1548 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950537;genbank:gi:119952228;genbank:GeneID:5075648 Probab=99.30 E-value=1.5e-10 Score=74.39 Aligned_cols=457 Identities=10% Similarity=0.062 Sum_probs=188.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) =.+.+.++..+.. .+--. ..++.+...|.+||.+..+. ..+...| +..++.+.-...| T Consensus 26 ~~~~~~l~~~~~~----------------~~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~g----rs~vv~~~v~~~v 83 (763) T protein:vir:95 26 ELSLQALKADLDA----------------AKPSH-TAMMIKVKEWNDLMRIEGKA-KPPKVKG----RSQVQPKLVRRQA 83 (763) T ss_pred hHHHHHHHHHHHh----------------hhcch-hHHHHHHHHHHHhhhccccC-cccccCC----CccccCHHHHHHH Confidence 2222333333321 11112 23455667788876555332 1222222 2223333222222 Q ss_pred ----HHHhhhhhcCcceEee-----CCHHHHH----HHHHHH-hhccHHHHHHHHHHHHhhcCCEEEEEEEeC------- Q lcl|NC_012753. 81 ----KKVASLVFNEQATIRV-----DNEVADA----FINETL-KNDKFSKNFERYLESCLALGGLAMRPYIDG------- 139 (502) Q Consensus 81 ----~~~a~~l~~ep~~i~~-----~d~~~~e----~l~~~~-~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~------- 139 (502) -.+..-+++-+.-|.+ +|...++ +++-+| ..++=...+..+++.|+..|.+++++||+- T Consensus 84 e~~~~~l~~~f~~~~~~~~~~P~~~~D~~~A~q~t~~~n~~~~~~~~~~~~~~~~~~~~l~~~~gv~k~~W~~~~~~~~~ 163 (763) T protein:vir:95 84 EWRYSALTEPFLGSNKLFKVTPVTWEDVQGARQNELVLNYQFRTKLNRVSFIDNYVRSVVDDGTGIVRVGWNREIRKEKQ 163 (763) T ss_pred HHHHHHHHHhhcCCCcEEEEecCCcchHHHHHHHHHHHHHHHhhcCchhhHHHHHHHHHhhcCcceEEEeeeeeeeeeee Confidence 2344444554544555 3443333 555533 345556778899999999999999999951 Q ss_pred ------------------------------------------------------------------------CceEEEEE Q lcl|NC_012753. 140 ------------------------------------------------------------------------DQIRVSFV 147 (502) Q Consensus 140 ------------------------------------------------------------------------~~~~i~~v 147 (502) ++++|+.| T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~p~ie~V 243 (763) T protein:vir:95 164 EVPVFSLFPIQTQEQADALQQALQLRTDNPRGYEENVDEAIKESVRFFDETGQATYAVQTGTTTTEVEVPLANHPTVEML 243 (763) T ss_pred eehhhhhccccchhHHHHHHHHHHhhhhhhccccccccchhhhhhhhccccCcceeeecccceeEEEEEEecCceEEEee Confidence 23466778 Q ss_pred cCCeEEEEEEcC-CCeEEEEEE-EEEEEeeC---CCceEEEEEEE-----EE--------E--------eC-CeE-EEEE Q lcl|NC_012753. 148 QATVFFPLQANT-QDVSSAAIV-TKSTKTEG---QKVKYYSLIEF-----HE--------W--------NK-ETY-TISN 199 (502) Q Consensus 148 ~~~~~~Pi~~d~-~~~~~~~~~-~~~~~~~~---~~~~~yt~~E~-----h~--------~--------~~-~~~-~I~~ 199 (502) +|..+++= .+. .++..+-|+ .+.+.... ..+..|..++- +. + .+ ..- ...+ T Consensus 244 ~p~d~~iD-p~a~sD~~Da~~~~~~~~~t~~dL~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~V~v~ 322 (763) T protein:vir:95 244 NPENIIID-PSCQGDINKAMFAIVSFETCKADLLKEKDRYHNLNKIDWQSSAPVNEPDHATTTPQEFQISDPMRKRVVAY 322 (763) T ss_pred cHHHheec-CCCCCchhhCceEeeEEeccHHHHHhccCCccccchhcchhccccccccccccchhhccCCCcccceEEEE Confidence 88887741 111 123344442 22111100 00000110000 00 0 00 000 1111 Q ss_pred EEEecCCccccCceeeccccccC---CCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHH Q lcl|NC_012753. 200 ELYESESKTIIGQRVPLSTLYED---LEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEF 276 (502) Q Consensus 200 ~l~~~~~~~~lG~~v~l~~~~~~---l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~ 276 (502) +.|...+.+.-|.....--.+.+ +.-.......++.||+.+.. ....++.+|.|+++.++++++.+|..+++. T Consensus 323 E~y~~~d~~gdg~~~~~~v~~~g~~iL~~~~~p~~~~~~PFv~~~~----~p~~~~~~G~gi~~~~~d~Qr~~N~~~~~~ 398 (763) T protein:vir:95 323 EYWGFWDIEGNGVLEPIVATWIGSTLIRLEKNPYPDGKLPFVLIPY----MPVKRDMYGEPDAELLGDNQAVLGAVMRGM 398 (763) T ss_pred EeeeeeccCCcceeEEEEEEEEcCeeeecccccccCCCcCEEEecc----eeecCcccCCchHHHhhHHHHHHHHHHHHH Confidence 22221110000000000000110 00000001224567765543 124578899999999999999999999999 Q ss_pred HHHHh-hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec-cccchHHHHHHHHHHHHH Q lcl|NC_012753. 277 MWEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT-TDIRSDDYIKAINKGLSL 354 (502) Q Consensus 277 ~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~ 354 (502) .+.+. ..+.++.|+.+.+.... .....++.. ..+.. ..+.. ..++..+ +.+ -......++..... T Consensus 399 ~d~l~~~~~~~~~v~~gav~~~d---~~~~~pg~v-----~~v~~--g~~~~--~~~~~~~~p~~-~~~~~~~l~~~~~~ 465 (763) T protein:vir:95 399 IDLLGRSANGQRGMPKGMLDALN---SRRYREGED-----YEYNP--TQNPA--QMIIEHKFPEL-PQSALTMATLQNQE 465 (763) T ss_pred HHHHHhhcCCcEEeecccccchh---hhcccCCce-----EEeeC--CCChh--hhcccccCCCC-cchHHHHHHHHHHH Confidence 99885 46667888776653211 111111111 00000 00111 1111111 122 12333444455556 Q ss_pred HHHhcCCChhhccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-------ccCCCccc-- Q lcl|NC_012753. 355 FEMQLGVSTGMFSFDGKS-MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYN-------LYTGEIPT-- 424 (502) Q Consensus 355 i~~~~g~s~~~~~~~~~~-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~-------~~~~~~~~-- 424 (502) +...+|++....|.++.+ ..||+++....+........+.+.|..+++.+++.++.+..-+- +.+....+ T Consensus 466 ~e~~TGv~~~~~G~~~~~~~~tat~v~~l~qa~~~~~~~~~r~~~~~~k~l~~~~l~Li~q~~d~~rviRI~g~e~v~v~ 545 (763) T protein:vir:95 466 AESLTGVKAFAGGVTGESYGDVAAGIRGVLDAASKREMAILRRLAKGMSEIGNKIIAMNAVFLAEHEVVRITNEEFVTIK 545 (763) T ss_pred HHHhhCcchhhcCcCcccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCCCcEEEEeCCcccccc Confidence 677778887776654332 35777776655555555566677787788888888887755421 11111111 Q ss_pred ------ccceEEEeCCCccCCH-HHHHHHHHHHHh-cC-CCCHHHH---HHhcCCCCHHHHHHHHHHHHHhhhcccCCCC Q lcl|NC_012753. 425 ------MDEVSVDLDDGVFTDR-NAEFDYWSKMVA-AG-FAPKTMA---IEKTLNVTKEQAQEIYQKINDETMVSTDSFR 492 (502) Q Consensus 425 ------~~~i~v~f~d~i~~d~-~~~~~~~~~~~~-~G-i~S~et~---l~~~~~~~deea~~el~ri~~E~~~~~~~~~ 492 (502) ..++.|.- + +... .+..+..+.+.. .| .+..... +.......+ ..+..+.++.-+++..+... T Consensus 546 ~~~~~~~~DV~V~~--~-~as~~~q~~~~l~~ll~~l~~~~~~~~~~~il~~~~d~~~--~~~~~~~lr~~q~~~d~~~q 620 (763) T protein:vir:95 546 REDLKGNFDLEVDI--S-TAEVDNQKSQDLGFMLQTIGPNVDQQITLNILAEIADLKR--MPKLAHDLRTWQPQPDPVQE 620 (763) T ss_pred HHHhcCCcceEEec--c-cchHHHHHHHHHHHHHHHhccccChHHHHHHHHHHHhhhc--hhhhHHHHHhcCCCccchhh Confidence 11222221 1 1122 122222333221 11 1221110 000000000 00011111111111000000 Q ss_pred Ccccc-------CCCC-----------------C Q lcl|NC_012753. 493 TSEEV-------DIYG-----------------E 502 (502) Q Consensus 493 ~~~~~-------~~~g-----------------~ 502 (502) ..... ..-. + T Consensus 621 ~qaqle~~~~q~e~~~~~akaq~~qaqa~~~~aq 654 (763) T protein:vir:95 621 QLKQLAVEKAQLENEELRSKIRLNDAQAQKAMAE 654 (763) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00000 0000 0 No 98 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=99.22 E-value=1.5e-11 Score=79.94 Aligned_cols=401 Identities=11% Similarity=0.077 Sum_probs=180.2 Q ss_pred CChhHH--HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccc---ccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQT--IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTY---RDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~--ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~---~~~~~~~~~~~~~~~n~ 75 (502) |+++.. ..++ ..|..+...+ ....+-..-.-+.+..+ T Consensus 1 ~~~~~~d~~~~~--------------------------------------~~~~~~~~~~~~~~~~~~~~l~a~Y~~~~l 42 (427) T protein:vir:10 1 MKIVKHDGYNDI--------------------------------------FNGGADGSPKPFFMSDASYHVGSFYNDNAT 42 (427) T ss_pred CCccccchHHHH--------------------------------------hhcCCCCcccCccccCchHHHHHHHHcCch Confidence 222221 1111 1111111100 00011111112345578 Q ss_pred HHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEE Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPL 155 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi 155 (502) ++.+|+..|.-++.+...|+.+++. +.++..++.-++...+.+++.++-.+|++++.+-++++.+.- -|+ T Consensus 43 ~~~~Vd~~aed~~r~g~~i~g~~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~l~--------~p~ 112 (427) T protein:vir:10 43 AKRIVDVIPEEMVTAGFKMSGVKDE--KEFKSLWDSYKLDSSLVDLLCWARLYGGAAMVAIIKDNRMLT--------SQA 112 (427) T ss_pred hhhhhccchHHhhcCCccccCccHH--HHHHHHHHHhhHHHHHHHHHHhccccceeEEEEEecCCCccc--------ccc Confidence 8999999999999999988876433 456666666689999999999999999999988776543210 011 Q ss_pred EEcCCCeEEEEEEEEEEE-----eeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeec Q lcl|NC_012753. 156 QANTQDVSSAAIVTKSTK-----TEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLN 230 (502) Q Consensus 156 ~~d~~~~~~~~~~~~~~~-----~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 230 (502) ...+.+....++.+..- ..+-....|-.-+. |+|. +.+ ..-+..| |+.. -+.+. T Consensus 113 -~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~~-------y~v~-----~~~-~~~~~~i----H~SR---li~~~ 171 (427) T protein:vir:10 113 -KPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEPEI-------YKVS-----PGD-NMQPYLI----HHSR---VFIAD 171 (427) T ss_pred -CCCcceeEEEEechhcccccccccCccccccCcceE-------EEEe-----cCC-CCcceEE----cccc---EEEec Confidence 01122222222211000 00000000001111 1111 000 0000111 0111 01122 Q ss_pred CCCcceEEEecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccceee-ec--hHHhccCCCCCCcccC Q lcl|NC_012753. 231 GLTRPLFTYLKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQRRVA-VP--TQMIKTEYDTNGEKVT 306 (502) Q Consensus 231 ~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~~i~-v~--~~~l~~~~~~~g~~~~ 306 (502) |.+-|- . .......||.|++.. +.+-+..++++......=+.-.+..++ ++ .+++... +... ... T Consensus 172 g~~~p~---~------~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~-~~~~-~~~ 240 (427) T protein:vir:10 172 GERVAQ---Q------ARKQNQGWGASVLNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDD-DAQY-AAR 240 (427) T ss_pred CCCchh---h------hcccCCcccchhhhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCc-cchH-HHH Confidence 221110 0 112345789999864 667777777766554433322222222 21 1222111 1100 000 Q ss_pred ccc-cccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-cccccccc-ccHHHHHHHH Q lcl|NC_012753. 307 VKR-EFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSM-KTATEVVSEQ 383 (502) Q Consensus 307 ~~~-~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~-~tAtei~~~~ 383 (502) ... .+.........+...+ ....++.++.++ ......++...++|+..+|+|... ||...+|. +|+.+=...| T Consensus 241 ~r~~~~~~~~~~~~~~~l~~--~~e~~e~~~~~l--sgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~ny 316 (427) T protein:vir:10 241 LRLAQVDDNSGVGRAIGIDA--ETEEYDVLNSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETF 316 (427) T ss_pred HHHHHHHHhcCcccceeeec--CCCceeEEeccc--CChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHH Confidence 000 0000000111111111 112355555443 446677888889999999999775 56655554 4555544444 Q ss_pred HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHH-------HHHHHHhcCC Q lcl|NC_012753. 384 SDTYQMRNSI-ATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFD-------YWSKMVAAGF 455 (502) Q Consensus 384 ~~l~~~~~~~-~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~-------~~~~~~~~Gi 455 (502) .+. ++.+ +..++..|++|+.++++ ..++++.|+.-...++.+.++ .+.+++.+|+ T Consensus 317 yd~---i~~~Qe~~l~p~l~~l~~~i~~--------------s~~~~~~f~pL~~~s~kEkaei~~~~a~a~~~~~~~gv 379 (427) T protein:vir:10 317 YKL---VDRKREEDYRPLLEFLLPFIVD--------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI 379 (427) T ss_pred HHH---HHHHHHHHHHHHHHHHHHHhhc--------------CCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCC Confidence 333 3333 35688888888777552 136889999888777776654 4556677788 Q ss_pred CCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 456 APKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 456 ~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++.+++...+-...+++-......+..|..+...+.+++.+.+...| T Consensus 380 i~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d~ 426 (427) T protein:vir:10 380 IDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDE 426 (427) T ss_pred CCHHHHHHHHHhhhccccCCCCccccccccchhcCCCCCCCCCCCCC Confidence 88887654331111000000000000111111111222333333333 No 99 >protein:vir:77597 Length: 725 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:YP_063735;genbank:gi:51236726;genbank:GeneID:2944241 Probab=99.17 E-value=8.7e-10 Score=70.25 Aligned_cols=460 Identities=8% Similarity=-0.047 Sum_probs=215.3 Q ss_pred CC----hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MG----IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~----~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |+ .+.+++.+++.- +...++-.....+..+||.|.++.-.-.. ......+..+|.- T Consensus 1 m~d~~~~~~~~~~~~~~~-----------------~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~---~l~~q~rp~~N~i 60 (725) T protein:vir:77 1 MADNENRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRVSQWDDWLSQ---YTTLQYRGQFDVV 60 (725) T ss_pred CCchHHHHHHHHHHHHHH-----------------HHhhHHHHHHHHHHHHhhCCCCCCHHHHH---HHHhcCCCccccH Confidence 54 334444444431 11234445677788899999877432111 0111122356888 Q ss_pred HHHHHHHhhhhhcCcceEee-----CCHHHHHHHHHHH----hhccHHHHHHHHHHHHhhcCCEEEEEEEe---C----C Q lcl|NC_012753. 77 RTASKKVASLVFNEQATIRV-----DNEVADAFINETL----KNDKFSKNFERYLESCLALGGLAMRPYID---G----D 140 (502) Q Consensus 77 k~iv~~~a~~l~~ep~~i~~-----~d~~~~e~l~~~~----~~~~f~~~~~~~~~~~~~~G~~~~~~~~d---~----~ 140 (502) +-+|+...++--...+.+.+ +|...++.|+.++ +.++.......+...+++.|.+|+.+++| + + T Consensus 61 ~~~i~~v~g~~~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~ 140 (725) T protein:vir:77 61 RPVVRKLVSEMRQNPIDVLYRPKDGARPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) T ss_pred HHHHHHHHhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchhHHHHHHHHHhhcCcceeeeeecccCCCCCCC Confidence 89999999988888888777 2445555555544 45788888889999999999999998754 1 2 Q ss_pred ceEEEEE----cCCeEEEEEEcCCC----eEEEEEEEEEEEee------------------------------CCCceEE Q lcl|NC_012753. 141 QIRVSFV----QATVFFPLQANTQD----VSSAAIVTKSTKTE------------------------------GQKVKYY 182 (502) Q Consensus 141 ~~~i~~v----~~~~~~Pi~~d~~~----~~~~~~~~~~~~~~------------------------------~~~~~~y 182 (502) .++|..+ ++.++|. |..- ...+-++++..+.+ ..+.... T Consensus 141 ~~~i~~~~~~~~~~~v~~---Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~v 217 (725) T protein:vir:77 141 NQVIRREPIHSACSHVIW---DSNSKLMDKSDARHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTI 217 (725) T ss_pred ceeeEEeecccChhhcee---CchhhccChhhHHHHHHHhcCCHHHHHHHHhhCCcchhhcccccccccccccccCCCee Confidence 3444443 3544442 2211 11111111111100 0011123 Q ss_pred EEEEEEEEeCCeEEEEEEEEecCCccccCceeecc--c-------------------------c-ccCCCcceeecCC-- Q lcl|NC_012753. 183 SLIEFHEWNKETYTISNELYESESKTIIGQRVPLS--T-------------------------L-YEDLEETVTLNGL-- 232 (502) Q Consensus 183 t~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~--~-------------------------~-~~~l~~~~~~~~~-- 232 (502) ++.|+|+.....- .++...++ ..|..+.+. . + |.-+.+...+.+- T Consensus 218 rv~E~~~r~~~~~----~~~~~~~~-~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~rv~~~~~~g~~~l~~~~~ 292 (725) T protein:vir:77 218 QIAEFYEVVEKKE----TAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQL 292 (725) T ss_pred EEEEEEEEEEEee----EEEEecCC-CCcceeecChhhHHHHHHHhhhcCchhhhhcccceeeeeEeeecCceeeccCCc Confidence 3455544211110 11111110 011111110 0 0 0000111111110 Q ss_pred ---CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 233 ---TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 233 ---~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) ...||++|-- ......+.|++-|.+.++++.++.+|...|...+-+-. .+.+..+..+.+..... ..-.++ T Consensus 293 ~~~~~~P~vP~~g--~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~---~~~~~~ 367 (725) T protein:vir:77 293 IAGEHIPIVPVFG--EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH---MYDGND 367 (725) T ss_pred CCCCccceEEEee--eeeccCCcccccchhhhhhhHHHHHHHHHHHHHHHHHhccccccccchhhhhHHHH---HHHhcc Confidence 1123332210 01112456777789999999999999999999987643 34444555444321110 000000 Q ss_pred ccccccchhhccccCCCCcc-ccceee-eccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHH Q lcl|NC_012753. 309 REFETGHNVYEQFDSGDMDK-GIGITD-LTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDT 386 (502) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~-~~~i~~-~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l 386 (502) ..-....+. +...+|.. ...+.. -+++++ .++...++.....|...+|+....+|..++. .||.++..+.... T Consensus 368 ~~~~~~~~~---~~~~~g~~~~~~i~~~~~~~lp-~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~-~SG~ai~~rq~qg 442 (725) T protein:vir:77 368 DYPYYLLNR---TDENSGDLPTQPLAYYENPEVP-QANAYMLEAATSAVKEVATLGVDTEAVNGGQ-VAFDTVNQLNMRA 442 (725) T ss_pred CCceecccc---cccCCCcccccCccccCCCCch-HHHHHHHHHHHHHHHHHhCCCHHHhCCCchh-hHHHHHHHHHHHH Confidence 000000000 00111111 111222 244453 4566788888899999999999999977553 5777888777666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh----h---cccCCCcc-c------------------------ccceEEEeCC Q lcl|NC_012753. 387 YQMRNSIATLVEKSLKELVISILELAKV----Y---NLYTGEIP-T------------------------MDEVSVDLDD 434 (502) Q Consensus 387 ~~~~~~~~~~~~~~l~~l~~~il~~~~~----~---~~~~~~~~-~------------------------~~~i~v~f~d 434 (502) ......+-..++.+.+...+.+|.+-.- . ++.+.... . .+++.|+=.. T Consensus 443 ~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~rv~RI~~ed~~~~~v~in~~~~~~~~G~~~~~NDi~g~~Dv~v~~~p 522 (725) T protein:vir:77 443 DLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKDVQLMAEVVDLATGEKQVLNDIRGRYECYTDVGP 522 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecccccccccchhHhhhhhccceeeEEeecc Confidence 6666666666667776666655554221 1 11111100 0 0112221111 Q ss_pred CccCCHHHHHHHHHHHHhcC--CCCH-HHHHHhcCCCCH-HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 435 GVFTDRNAEFDYWSKMVAAG--FAPK-TMAIEKTLNVTK-EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 435 ~i~~d~~~~~~~~~~~~~~G--i~S~-et~l~~~~~~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +.+.=.++.++.++++..+. ..+. -..+.......+ +.+++.+++|++...+.....+.... -+ T Consensus 523 ~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~l~d~~~~~e~~erirkq~~~~~~~q~~~~~----e~ 590 (725) T protein:vir:77 523 SFQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPE----EQ 590 (725) T ss_pred chHHHHHHHHHHHHHHHHhccccchhHHHHHHHhhccccchHHHHHHHHHHhhhhhhhccCCCChh----hH Confidence 11111334445555554321 1111 112222223222 34566677777655443322211110 01 No 100 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=99.17 E-value=5.7e-11 Score=76.75 Aligned_cols=390 Identities=10% Similarity=0.097 Sum_probs=180.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc----CCCccccccceecchH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD----SNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~----~~~~~~~~~~~~~n~~ 76 (502) |.--+...+.+ .|-++.-++.. ......-.-+.+..++ T Consensus 1 ~~~~D~~~n~~--------------------------------------~gg~~~~~~~~~~~~~~~~~l~a~Y~~~~l~ 42 (422) T protein:vir:10 1 MVKTDSYANIF--------------------------------------LGGSDGSEIYGSLQNQAPTILASLYADNALV 42 (422) T ss_pred CccchhhHHHH--------------------------------------cCCCCCccccCcccccCHHHHHHHHHhChhh Confidence 33333333322 12111111000 0000011123455788 Q ss_pred HHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEE Q lcl|NC_012753. 77 RTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQ 156 (502) Q Consensus 77 k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~ 156 (502) +.+|+..|.-++.+...|+.+++. +.+..-++.-++...+.+++.++-.+|++++.+-..+++-.- =|+. T Consensus 43 ~~~Vd~~aed~~r~g~~i~~~~~~--~~~~~~~~~l~~~~~l~~a~~~~rl~G~a~i~i~v~d~~~~~--------~Pl~ 112 (422) T protein:vir:10 43 RRIIDTIPETALAAGFHIDGIDDE--PAFWSRWDDLEMTQNINDAWSWARLFGGAAIVAIVKDNRALT--------SPVR 112 (422) T ss_pred HHHHhhhhHHHhcCCccccCCCHH--HHHHHHHHHhhHHHHHHHHHHhhccccceEEEEEecCCCCcc--------cccc Confidence 999999999999999998876543 234444455678999999999999999998888764322100 0111 Q ss_pred EcCCCeEEEEEEEEEE-----EeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecC Q lcl|NC_012753. 157 ANTQDVSSAAIVTKST-----KTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNG 231 (502) Q Consensus 157 ~d~~~~~~~~~~~~~~-----~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 231 (502) . .+.+....++.+.. ...+-....|-.-+. |+|. +.+ ...+..| |+..+ +.+.| T Consensus 113 ~-~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~-------y~v~-----~~~-~~~~~~i----H~SRl---i~~~g 171 (422) T protein:vir:10 113 E-GAELETVRVYDRTQVKVQTREENPRNARFGEPLT-------YRIT-----TNE-SDMFYDV----HYSRI---HIIDG 171 (422) T ss_pred c-cCceeeEEeeccccccchhcccCccccccCcceE-------EEEe-----cCC-CCcceee----cccee---EEeCC Confidence 1 11111111111100 000000000101111 1111 100 0001111 11110 11222 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhh-HHHHHHHHHHHHHHHHHHHhhccceee-ec--hHHhccCCCCCCcccCc Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDN-AKTTMDFINTTYDEFMWEVKMGQRRVA-VP--TQMIKTEYDTNGEKVTV 307 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~-~~~lid~ld~~~S~~~~~~~~~~~~i~-v~--~~~l~~~~~~~g~~~~~ 307 (502) .+.|.. .......||.|.+.. +.+-+..++++......=+.-.+..++ ++ .+++.. + +..... T Consensus 172 ~~~p~~---------~~~~~~~~G~S~l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~---~-~~~~~~ 238 (422) T protein:vir:10 172 ERIPNV---------MRRQNDGWGRSVLSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDD---S-EGFGAA 238 (422) T ss_pred CCchhh---------hcccCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCC---c-cchHHH Confidence 221110 112345689999976 678888888766554443322233332 22 112211 1 111000 Q ss_pred cccc---cccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-cccccccc-ccHHHHHHH Q lcl|NC_012753. 308 KREF---ETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSM-KTATEVVSE 382 (502) Q Consensus 308 ~~~~---~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~-~tAtei~~~ 382 (502) ...+ .........+..... ..-++.++.++ ......++...++++..+|+|... ||...+|. +|+.+-... T Consensus 239 ~~r~~~~~~~~~~~~~~~l~~~--~e~~e~~~~~l--sgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~ 314 (422) T protein:vir:10 239 RLRLAQVDNNSGVGQAIGIDAE--SEEYSVLNSDI--GGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALET 314 (422) T ss_pred HHHHHHHHHhcCCccceeEecC--CcceEEEeccc--CChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHH Confidence 0000 000011111111111 12355555443 356777888899999999999775 56666564 355554444 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHH-------HHHHHHhcC Q lcl|NC_012753. 383 QSDTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFD-------YWSKMVAAG 454 (502) Q Consensus 383 ~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~-------~~~~~~~~G 454 (502) |. ..++.+| ..++..|+.|+.+|++ ..+++|.|+.-...++.+.++ ..++++.+| T Consensus 315 yy---d~i~~~Qe~~l~p~l~~l~~~i~~--------------s~~~~~~f~pL~~~sekekaei~~~~a~a~~~~~~~g 377 (422) T protein:vir:10 315 FH---KLVDRKRNAELLPILEFLIPFIVN--------------AEEWSVEFNPLAQESSKDKAEILEKNVNSIAALIAAG 377 (422) T ss_pred HH---HHHHHHHHHHHHHHHHHHHHHhcc--------------cCCcEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC Confidence 43 3444444 5678888888877652 136889999877777775544 456777889 Q ss_pred CCCHHHHHHhcC------CCCHHHHHHHHHHHHHhhhcccCCCCCccc Q lcl|NC_012753. 455 FAPKTMAIEKTL------NVTKEQAQEIYQKINDETMVSTDSFRTSEE 496 (502) Q Consensus 455 i~S~et~l~~~~------~~~deea~~el~ri~~E~~~~~~~~~~~~~ 496 (502) +++.+++...+- +..++-.+++.+. .++. ..|...++++ T Consensus 378 ~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~d 422 (422) T protein:vir:10 378 AMDIDEARDTLRTIAPEVKINDGSVETEVTI-SETS--NDPLEVPTDD 422 (422) T ss_pred CCCHHHHHHHhhhhcccccCCCCCCccccch-hhcC--CCCCCCCCCC Confidence 999877664441 1111101112221 1111 1222222333 No 101 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=99.16 E-value=1.3e-10 Score=74.87 Aligned_cols=437 Identities=11% Similarity=0.008 Sum_probs=196.9 Q ss_pred CChhHH---HHHHHHHHhhcccccch-------hhhhc--cccccCCH--HHHHHHHHHHHHhcCCCC--cc---ccc-c Q lcl|NC_012753. 1 MGIIQT---IKNFIKRSNYVITNQSL-------NSITD--HPKIAISP--EEYNRIMDNLRYFAGDFD--SV---TYR-D 60 (502) Q Consensus 1 m~~~~~---ik~~i~~~~~~~~~~~l-------~~i~~--~~~~~~~~--~~~~~i~~~~~~Y~g~~~--~~---~~~-~ 60 (502) |+++.+ .+.....+-..-..+.+ -...+ +.-++.+. ..+..+ --+.|-+. .+ .+. . T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~ 100 (537) T protein:vir:10 25 VGIFGAGDDEKPFTRAQLVHQTMMAIRDHAIAMMPKVDGSHPDMAMDGLDVEGGTF----SAYANPNLSEGLVLWYAQQA 100 (537) T ss_pred cCCCcccchhhHHHHHHhhhhccCCCCCccCcccccccccccchhccccccchhhh----hhhccccccchhhhhccccC Confidence 777753 33332221100000000 00000 01111110 000000 01111100 00 000 0 Q ss_pred CCCccccccceecchHHHHHHHHhhhhhcCcceEeeCCH-----HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEE Q lcl|NC_012753. 61 SNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNE-----VADAFINETLKNDKFSKNFERYLESCLALGGLAMRP 135 (502) Q Consensus 61 ~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~-----~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~ 135 (502) ..+...-.-+....+++.+|+..|.-++.++..|++++. ...+.|++.++.-++...+.++++++-.+|++++.+ T Consensus 101 ~~~~~l~a~Y~~~~l~r~iVd~~A~d~~r~~~~i~~~~~~~~~~~~~~~l~~~~~~l~~~~~l~~a~~~~rlyG~~~i~i 180 (537) T protein:vir:10 101 FIGHQMCALIATHWLVNKACSQMPRDAMRKGYKIISDDGNELDPKDAKFIDRYDRAFNIKKHAIQFVRKGRIFGIRIALF 180 (537) T ss_pred CccHHHHHHHHhCchhhhhhhhhhHHhhcCCceeecCCcccccHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEE Confidence 001111111345678999999999999999999998653 344567777777789999999999999999999887 Q ss_pred EEeC--CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEE-----E---EeeCCCceEEEEEEEEEEeCCeEEEEEEEEecC Q lcl|NC_012753. 136 YIDG--DQIRVSFVQATVFFPLQANTQDVSSAAIVTKS-----T---KTEGQKVKYYSLIEFHEWNKETYTISNELYESE 205 (502) Q Consensus 136 ~~d~--~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~-----~---~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~ 205 (502) ..+. +...-.-+.++.+-+ +......++.+. . ..++-....|-.-+.| +|. T Consensus 181 ~v~~~D~~~~~~Pl~~~~i~k-----g~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y-------~v~------- 241 (537) T protein:vir:10 181 KVDSPDPYYYEKPFNIDGVMP-----GAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYW-------LIN------- 241 (537) T ss_pred eecCcCCcccccccccccccc-----cceeEEEEechhhcccccchhhhccCCccccCCceee-------eec------- Confidence 7642 111111112221111 111111111100 0 0000000001011111 110 Q ss_pred CccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012753. 206 SKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR 285 (502) Q Consensus 206 ~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~ 285 (502) |..| |+..+ +.+.|.+.|- +.+ .....+|+|++..+.+-+..++++.-..+.=+..... T Consensus 242 -----g~~i----H~SRl---i~f~g~~~p~--~~~-------~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~ 300 (537) T protein:vir:10 242 -----GKKY----HRSHL---AIYINDEVVD--FLK-------PSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTKRQ 300 (537) T ss_pred -----CeEe----cceeE---EEecCCCCch--hhh-------cccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 1111 01000 1111111110 011 1123579999999999999999877666554433333 Q ss_pred eeeechHHhccCCCCCCcccCccc-cccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChh Q lcl|NC_012753. 286 RVAVPTQMIKTEYDTNGEKVTVKR-EFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTG 364 (502) Q Consensus 286 ~i~v~~~~l~~~~~~~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 364 (502) +++-- .++....+. ..+.... .+.........+..+ . ...-++.++.+ ..-....++...+.++..+|++.. T Consensus 301 ~v~k~-~~~~~l~~~--~~~~~r~~~~~~~r~n~g~~~id-~-e~e~~e~~~~~--lsgl~~~l~~~~~~iAa~~~IP~t 373 (537) T protein:vir:10 301 TVLKV-DAAQVLANK--QQFDETMSWWTATRDNYQVRVVD-K-DNEDVVQIDTT--LNDLDKVIMNQYQLVCAIARTPAP 373 (537) T ss_pred ceeee-chHHhhcCH--HHHHHHHHHHHhhcCCcceeEec-C-CCceeEEEecc--CCCHHHHHHHHHHHHHhhhCCCce Confidence 33311 111111111 0000000 000000000001111 1 11234444433 334566777888889999999877 Q ss_pred h-cccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHH Q lcl|NC_012753. 365 M-FSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNA 442 (502) Q Consensus 365 ~-~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~ 442 (502) . ||...+|. +|+.+=...| +..++.+|..++..|+.++.+|+... . + ...+++|.|+.-...++.+ T Consensus 374 ~L~G~sp~GlnatGe~D~~~y---yd~I~~~Qe~l~p~l~~l~~ll~~~~-----~-~---~~~~~~i~f~pL~~~s~kE 441 (537) T protein:vir:10 374 KMLGTVPTGFNSTGDYEEASY---HEECESTQDDMRPLIDRHHQLVCRSH-----L-R---KRIRVKVEFPPMDAPKESE 441 (537) T ss_pred eeccCCccccccchhHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHhc-----C-C---CCcceEEEeCCCCCCCHHH Confidence 4 66555554 4566444333 34455555568889988888776542 1 1 1346899999887788777 Q ss_pred HHH-------HHHHHHhcCCCCHHHHHHhc--------CCC----CHHHHHHHHHHHHHhhhc-----ccCCCCCccccC Q lcl|NC_012753. 443 EFD-------YWSKMVAAGFAPKTMAIEKT--------LNV----TKEQAQEIYQKINDETMV-----STDSFRTSEEVD 498 (502) Q Consensus 443 ~~~-------~~~~~~~~Gi~S~et~l~~~--------~~~----~deea~~el~ri~~E~~~-----~~~~~~~~~~~~ 498 (502) .++ +..+++.+|+||..++...+ .++ +++.++ + ..+..|... ..+.+..+.+.. T Consensus 442 kAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~~~~ed~e-~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 519 (537) T protein:vir:10 442 RADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPAMRPTDAE-D-IDVDDEGKPVRIIEDQPAPSEMFGAT 519 (537) T ss_pred HHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCCCChhhhh-c-ccCCccCCcCCCCCCCCCccccCCCC Confidence 665 47788899999998866553 122 111111 1 111122111 112222233333 Q ss_pred CCCC Q lcl|NC_012753. 499 IYGE 502 (502) Q Consensus 499 ~~g~ 502 (502) -.|+ T Consensus 520 ~~~~ 523 (537) T protein:vir:10 520 SSGE 523 (537) T ss_pred cccc Confidence 3444 No 102 >protein:vir:9263 Length: 725 # NCBI annotation: 1 # Family: family:all:487 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720327;genbank:gi:24371585;genbank:GeneID:955785 Probab=99.14 E-value=1.1e-09 Score=69.75 Aligned_cols=459 Identities=8% Similarity=-0.048 Sum_probs=209.2 Q ss_pred CC----hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MG----IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~----~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |+ .+.+++.+++.- +...++-.....+..+||.|.++.-.-.. ......+..+|.- T Consensus 1 m~d~~~~~~~~~~~~~~~-----------------~~~~~~~r~~a~~d~~fy~G~Qw~~~~~~---~l~~q~rp~~N~i 60 (725) T protein:vir:92 1 MADNENRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRISQWDDWLSQ---YTTLQYRGQFDVV 60 (725) T ss_pred CCchHHHHHHHHHHHHHH-----------------HHhhHHHHHHHHHHHHhhcCCCCCHHHHH---HHHhcCCCcccch Confidence 43 444555555432 11224445677788899999877422111 0111122346888 Q ss_pred HHHHHHHhhhhhcCcceEee-----CCHHHHHHHHHHH----hhccHHHHHHHHHHHHhhcCCEEEEEEEe---C----C Q lcl|NC_012753. 77 RTASKKVASLVFNEQATIRV-----DNEVADAFINETL----KNDKFSKNFERYLESCLALGGLAMRPYID---G----D 140 (502) Q Consensus 77 k~iv~~~a~~l~~ep~~i~~-----~d~~~~e~l~~~~----~~~~f~~~~~~~~~~~~~~G~~~~~~~~d---~----~ 140 (502) +.+|+...++--...+.+.+ +|...++.|+.++ +.++.......+...+++.|.+|+.+.+| + + T Consensus 61 ~~~i~~v~g~e~~nr~d~~v~P~~~~d~~~Ae~l~~~~~~~~~~~~~~~a~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~ 140 (725) T protein:vir:92 61 RPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNVAVREQIESGVGAWRLVTDYEDQSPTSN 140 (725) T ss_pred HHHHHHHHhhHHhCCcceEEecCCccHHHHHHHHHHHHHHHHHhhCchHHHHHHHHHHhhcCcceeeeeecccCCCCCCC Confidence 88999999988887877766 3445555555544 45788888889999999999999998754 1 2 Q ss_pred ceEEEEE----cCCeEEEEEEcCCC----eEEEEEEEEEEEee------------------------------CCCceEE Q lcl|NC_012753. 141 QIRVSFV----QATVFFPLQANTQD----VSSAAIVTKSTKTE------------------------------GQKVKYY 182 (502) Q Consensus 141 ~~~i~~v----~~~~~~Pi~~d~~~----~~~~~~~~~~~~~~------------------------------~~~~~~y 182 (502) .++|..+ |..++| +|..- ...+-++++..+.+ ..+.... T Consensus 141 ~~~i~~~~i~~~~~~V~---~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~v 217 (725) T protein:vir:92 141 NQVIRREPIHSACSHVI---WDSNSKLMDKSDSRHCTVIHSMSQNGWEDFAEKYDLDADDIPSFQNPNDWVFPWLTQDTI 217 (725) T ss_pred ceeeEEeeccCChhhcc---cCchhhccChhhHHHHHHHhcCCHHHHHHHHhhcCcchhhhhhcccCCcccccccCCCeE Confidence 3444443 233333 22211 11111111100000 0011123 Q ss_pred EEEEEEEEeCCeEEEEEEEEecCCccccCceeecc-----c----------------------c-ccCCCcceeecC--- Q lcl|NC_012753. 183 SLIEFHEWNKETYTISNELYESESKTIIGQRVPLS-----T----------------------L-YEDLEETVTLNG--- 231 (502) Q Consensus 183 t~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~-----~----------------------~-~~~l~~~~~~~~--- 231 (502) ++.|+|+.... .-.+|...++ ..|..+.+. . + |--+.+...+.+ T Consensus 218 rv~e~~~r~~~----~~~~~~~~d~-~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~ 292 (725) T protein:vir:92 218 QIAEFYEVVEK----KETAFIYQDP-VTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQL 292 (725) T ss_pred EEEEEEEEEEE----eeeEEeecCC-CCCceeecChhhHHHHHHHHhccCchhhhhccceeeeEeeeeecchhhhcCCCC Confidence 34454431110 0011111110 111111110 0 0 000000001111 Q ss_pred --CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-hccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 232 --LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 232 --~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) ....||++|-- ......+.|++-|.+.++++.++.+|...|...+-+- ..+.+..++.+.+..... ..-.++ T Consensus 293 ~~~~~~P~vP~~g--~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~S~~~~~~~~~~~~~~~~~~~~i~~~~~---~~~~~~ 367 (725) T protein:vir:92 293 IAGEHIPIVPVFG--EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH---MYDGND 367 (725) T ss_pred CCCCceeeEEEEe--eeeccCCcccccceeccchhHHHHHHHHHHHHHHHHHhccCcccccchhhhhHHHH---HHhccC Confidence 01123333210 0111245677668899999999999999999998773 444555565555432110 000000 Q ss_pred ccccccchhhccccCCCCc-cccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHH Q lcl|NC_012753. 309 REFETGHNVYEQFDSGDMD-KGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTY 387 (502) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~-~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~ 387 (502) .......+ .....++. ....++.+++.--..++...++.....|....|++...+|..++. .|+.+|..+..... T Consensus 368 ~~~~~~~~---~~~~~~g~~~~~~i~~~~~~~~p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n~-~SG~ai~~rq~qg~ 443 (725) T protein:vir:92 368 DYPYYLLN---RTDENNGEMPTQPLAYYENPEVPQANAYMLEAATAAVKEVATLGVDAEAVNGGQ-VAYDTVNQLNMRAD 443 (725) T ss_pred ccceeecc---ccccccccccccCCcccCCCCchHHHHHHHHHHHHHHHHHhCCCHHHhccCchh-hHHHHHHHHHHHHH Confidence 00000000 00001111 011233332222245677889999999999999999999986543 57777877766655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----HHhhc---ccCCCcc-c------------------------ccceEEEeCCC Q lcl|NC_012753. 388 QMRNSIATLVEKSLKELVISILEL----AKVYN---LYTGEIP-T------------------------MDEVSVDLDDG 435 (502) Q Consensus 388 ~~~~~~~~~~~~~l~~l~~~il~~----~~~~~---~~~~~~~-~------------------------~~~i~v~f~d~ 435 (502) .....+-..++.+.+...+.+|.+ ....+ +.+.... . .+++.|+=..+ T Consensus 444 ~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~~r~~RI~~edg~~~~v~in~~~~~~~~G~~~~~Ndi~g~~Dv~v~~~p~ 523 (725) T protein:vir:92 444 LETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGPS 523 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEecCCCCcceEEeccccccccccchhhhhccccceeeEEeeccC Confidence 555555555566666555544443 21111 1111000 0 01122211111 Q ss_pred ccCCHHHHHHHHHHHHhc-C-CCCH-HHHHHhcCCCCH-HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 436 VFTDRNAEFDYWSKMVAA-G-FAPK-TMAIEKTLNVTK-EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 436 i~~d~~~~~~~~~~~~~~-G-i~S~-et~l~~~~~~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+.=.++..+.++++..+ + +.+. -..+.......+ +-+++.+++|+....+.....+.. .| T Consensus 524 ~~s~r~~~~~~l~ql~~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~------~e 588 (725) T protein:vir:92 524 FQSMKQQNRAEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPET------PE 588 (725) T ss_pred hHHHHHHHHHHHHHHHHhcccchhHHHHHHHHHhhcccchHHHHHHHHHHhhhchhccCCccc------hh Confidence 111133444555555422 1 1111 011222222222 224445666665443332211111 12 No 103 >protein:vir:172 Length: 708 # NCBI annotation: portal protein # Family: family:all:487 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112077;genbank:gi:13559867;genbank:GeneID:920970 Probab=99.14 E-value=1.1e-09 Score=69.72 Aligned_cols=480 Identities=10% Similarity=-0.004 Sum_probs=200.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHH--HHhcCCCCccc---cccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNL--RYFAGDFDSVT---YRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~--~~Y~g~~~~~~---~~~~~~~~~~~~~~~~n~ 75 (502) |. ++....+++...+ |.. .....++-..+..+.. .||.|.++.-. .....+....++.++.|. T Consensus 1 ma--~~~~~~~~~~~~r-----~~~-----~~~~~~~~r~~~~~d~~f~~y~G~Qw~~~~~~~l~~~~q~~~rP~~~~N~ 68 (708) T protein:vir:17 1 MA--ETLEKKHERIMLR-----FDR-----AYSPQQEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINK 68 (708) T ss_pred Cc--hhHHHHHHHHHHH-----HHH-----HHhhhHHHHHHHHHHHHhhccCCCCCCHHHHHHHHhhhhhcCCCceEEcc Confidence 21 1112222211000 000 0112233344444443 57889776421 111122333467888999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHHH----HhhccHHHHHHHHHHHHhhcCCEEEEEEEe------- Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD------NEVADAFINET----LKNDKFSKNFERYLESCLALGGLAMRPYID------- 138 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~----~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d------- 138 (502) -+.+|+...++--...+.+.+. |.+.++.|+.+ .+.++.......+...+++.|.+|+.++.| T Consensus 69 i~~~i~~v~g~e~~nr~d~~v~p~~~~~d~~~Ae~l~~l~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~~~~d~~~e~d~ 148 (708) T protein:vir:17 69 VATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDP 148 (708) T ss_pred hHHHHHHHHhhHhhCCcceEEecCCCcchHHHHHHHHHHHHHHHHhcCchhHHhHHHHHhhhcccceeeeeecccccCCC Confidence 9999999999988878777662 33445555554 446788889999999999999999988542 Q ss_pred ---CCceEEEEE--cCCeEEEEEEcCC----CeEEEEEEEEEEEeeC---------------------------CCceEE Q lcl|NC_012753. 139 ---GDQIRVSFV--QATVFFPLQANTQ----DVSSAAIVTKSTKTEG---------------------------QKVKYY 182 (502) Q Consensus 139 ---~~~~~i~~v--~~~~~~Pi~~d~~----~~~~~~~~~~~~~~~~---------------------------~~~~~y 182 (502) +.+++|..+ |+.+++ ||.. +...|-++.+..+.+. -..... T Consensus 149 ~~~~~~i~i~~~~~~~~~v~---~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~~~~~~~d~v 225 (708) T protein:vir:17 149 MDDRQRIAIEPIYDPSRSVW---FDPDAKKYDKSDALWAFCMYSLSPEKYEAEYGKKPPASLDVTSMTSWEYDWFDADVI 225 (708) T ss_pred CCCccccceEeeccchhhee---cCccccccChhhhhhhhhhccCCHHHHHHhCccccchhhhhhhhccccccccCCCeE Confidence 124555554 445655 2221 2233322222111100 000112 Q ss_pred EEEEEEE--EeCCeEEEEE-------EEEecCCccc-------cCce----eecccc---ccCCCcceeecCC-----Cc Q lcl|NC_012753. 183 SLIEFHE--WNKETYTISN-------ELYESESKTI-------IGQR----VPLSTL---YEDLEETVTLNGL-----TR 234 (502) Q Consensus 183 t~~E~h~--~~~~~~~I~~-------~l~~~~~~~~-------lG~~----v~l~~~---~~~l~~~~~~~~~-----~~ 234 (502) .+.|+|. +....+.+-. ..|.+..... .|.. .+.... |--+-+.....+- .. T Consensus 226 rv~e~~~r~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~l~~~~~~p~~~ 305 (708) T protein:vir:17 226 YIAKYYEVRKESVDVISYRHPITGEIATYDSDQVEDIEDELAIAGFQEVARRSVKRRRVYVSVVDGDGFLEKPRRIPGEH 305 (708) T ss_pred EEEEEEEEeeeeeEEEEEecCccCceeeeCccchhhHHHHHHhcccccceeeeeeEEEEEEEeecccccccCCCCCCCCc Confidence 2233332 1111111110 0011110000 0100 000000 0000000000010 11 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCCCCCcccCccccccc Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFET 313 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~ 313 (502) .|+++|---.. .+ .+.|.--+.+.++++.++.+|...|.+.+-+-. .+...+++.+.+................+.. T Consensus 306 fP~vP~~g~r~-~~-d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~a~~g~~~~~~~~~~~~~~~~~ 383 (708) T protein:vir:17 306 IPLIPVYGKRW-FI-DDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEKHWEARNKKRPAFLP 383 (708) T ss_pred cceEEEecccc-cc-cCCCcccchhhhchhHHHHHHHHHHHHHHHHHhcCCcceeechhhhhhhHHhhhhcccchhhhhh Confidence 13332211000 11 122311256889999999999999999987743 4445556666554322111111111100000 Q ss_pred cchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 314 GHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 314 ~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~ 393 (502) ....-..+..-..+.......-.++++ .++...++.....|....|+++..+|..+ +.||.+|..+........... T Consensus 384 ~~~~~~~~g~v~~~a~~~~~~~~~~~~-~~~~~llq~~~~~i~~~tGi~d~~~G~~s--n~SG~Ai~~rq~qg~~~~~~~ 460 (708) T protein:vir:17 384 LREVRDKYGNIIAGATPAGYTQPAVMN-QALAALLQQTSADIQEVTGGSQAMQQMPS--NIAQETVNNLMNRADMASFIY 460 (708) T ss_pred hhccCCcccccccccCCcccCCCcccc-HHHHHHHHHHHHHHHHhcCCChHHccCcc--chHHHHHHHHHHHHHHHHHHH Confidence 000000000000011111112234454 57888899999999999999999998643 357878877766555555555 Q ss_pred HHHHHHHHHHHHHHHHHHHHh----hc---ccCCCcc----------------------c----ccceEEEeCCCccCCH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKV----YN---LYTGEIP----------------------T----MDEVSVDLDDGVFTDR 440 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~----~~---~~~~~~~----------------------~----~~~i~v~f~d~i~~d~ 440 (502) -..+..+.++..+.+|.+..- .+ +.+.... + .++|.|+=..+.+.-. T Consensus 461 ~Dnl~~~~~~~g~~lL~lI~~~y~~~R~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~~Dv~v~~~p~~~t~r 540 (708) T protein:vir:17 461 LDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVTVDVGPSYTARR 540 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeeecceeccCCCccceeeccceeeeeeEEEecccCchhHH Confidence 555555555544444433221 11 1111000 0 0011111111111222 Q ss_pred HHHHHHHHHHHhcCC-CCHHH-----HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCcc-cc---------------- Q lcl|NC_012753. 441 NAEFDYWSKMVAAGF-APKTM-----AIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSE-EV---------------- 497 (502) Q Consensus 441 ~~~~~~~~~~~~~Gi-~S~et-----~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~-~~---------------- 497 (502) ++..+.++++..+.. .-..+ .+.++-++.- +++.+++|+..........+... .. T Consensus 541 ~~~~~~l~qll~~~~~~~~~~~~~~~l~l~~~D~p~--~~ei~e~ir~~~~~~~~~~~~~~e~~q~~~q~qq~~q~q~~~ 618 (708) T protein:vir:17 541 DATVSVLTNVLSSMLPADPMRPAIQGIILDNIDGEG--LDDFKEYNRNQLLISGIAKPRNEKEQQIVQQAQMAAQSQPNP 618 (708) T ss_pred HHHHHHHHHHHHhcCCccchhHHHHHHHHHhcCCCC--hHHHHHHHHHHhhccccccCcchhhHHHHHHHHHHHHHHHHH Confidence 344445555543321 10111 1222223322 23344444443322111110000 00 Q ss_pred ---CCCC-------C Q lcl|NC_012753. 498 ---DIYG-------E 502 (502) Q Consensus 498 ---~~~g-------~ 502 (502) .... | T Consensus 619 ~~~eaqa~~~~~qAe 633 (708) T protein:vir:17 619 EMVLAQAQMVAAQAE 633 (708) T ss_pred HHHHHHHHHHHHHHH Confidence 0000 0 No 104 >protein:vir:105429 Length: 708 # NCBI annotation: gene 3 protein # Family: family:all:487 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958179;genbank:gi:41057281;genbank:GeneID:2716676 Probab=99.12 E-value=1.7e-09 Score=68.66 Aligned_cols=458 Identities=10% Similarity=0.010 Sum_probs=207.7 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhc--CCCCccc---cccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFA--GDFDSVT---YRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~--g~~~~~~---~~~~~~~~~~~~~~~~n~ 75 (502) ..+..++++++... ....++-..+..+-.+||. |+++.-. .....+....+..++.|. T Consensus 6 ~~~~~~~~~~~~~~-----------------~~~~~~~r~~~~~D~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~ 68 (708) T protein:vir:10 6 EKKHERIMLRFDRA-----------------YSPQKEVREKCIEATRFARVPGGQWEGATAAGTKLDEQFEKYPKFEINK 68 (708) T ss_pred HHHHHHHHHHHHHH-----------------HHhhHHHHHHHHHHHHhhcCCCCCCCHHHHHHHHHhhhhcCCCceEEcc Confidence 23333333333321 1122344455555566764 6655321 111122223456788999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHHH----HhhccHHHHHHHHHHHHhhcCCEEEEEEEe------- Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD------NEVADAFINET----LKNDKFSKNFERYLESCLALGGLAMRPYID------- 138 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~----~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d------- 138 (502) -+.+|+...++--...+.+.+. |.+..+.|+.+ .+.++.......++..+++.|-+|+.++.| T Consensus 69 i~~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~d~i~~G~Gw~~~~~d~~~e~d~ 148 (708) T protein:vir:10 69 VATELNRIIAEYRNNRITVKFRPGDREASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTSMLVNEYDP 148 (708) T ss_pred hHHHHHHHHHHHHhCCcceEEEcCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHhhhhcccceeeeeeccccccCC Confidence 9999999999999988888772 23445555554 446788889999999999999999988653 Q ss_pred ---CCceEEEEE--cCCeEE--EEEEcCCCeEEEEEEEEEEEeeC-----------------C----------CceEEEE Q lcl|NC_012753. 139 ---GDQIRVSFV--QATVFF--PLQANTQDVSSAAIVTKSTKTEG-----------------Q----------KVKYYSL 184 (502) Q Consensus 139 ---~~~~~i~~v--~~~~~~--Pi~~d~~~~~~~~~~~~~~~~~~-----------------~----------~~~~yt~ 184 (502) +.+++|..+ |...++ |-... -+...+-++.+..+.+. . ......+ T Consensus 149 ~~~~~~i~i~~~~~p~~~v~~Dp~a~~-~D~sDar~~~~~~~~~~d~~~~~~p~~a~~~~d~~~~~~~~~~~~~~d~v~v 227 (708) T protein:vir:10 149 MDDRQRIAIEPIYDPSRSVWFDPDAKK-YDKSDALWAFCMYSLSPEKYEAEYGKKPPTSLDVTSMTSWEYNWFGADVIYI 227 (708) T ss_pred CCCccccceEEeecchhhcccCccccc-cChhhhhhhhhccCCCHHHHHHhCCCCcccccccccCCCccccccCCCceEE Confidence 124444443 334444 21111 12333333322221100 0 0001122 Q ss_pred EEEEEE--eCCeEEEEEEEEecCCccccCceeeccccc----------cCCCc----------c--eeecC-----C--- Q lcl|NC_012753. 185 IEFHEW--NKETYTISNELYESESKTIIGQRVPLSTLY----------EDLEE----------T--VTLNG-----L--- 232 (502) Q Consensus 185 ~E~h~~--~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~----------~~l~~----------~--~~~~~-----~--- 232 (502) .|+|+. ....+.+ +. ++ ..|..+.+.+.. .+... . ..+.| . T Consensus 228 ~ey~~r~~~~~~~~~----~~--~~-~tg~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~r~~v~~~~~~g~~~le~~~~ 300 (708) T protein:vir:10 228 AKYYEVRKESVDVIS----YR--HP-ITGEIATYDSDQVEDIEDELAIAGFHEVARRSVKRRRVYVSVVDGDGFLEKPRR 300 (708) T ss_pred EEeeeEEEEEEEEEE----Ee--cC-CCCceeeecchhhhhHHHHHHhcccchhheeeeeeEEEEEEeecchhhhccCCC Confidence 333321 1111110 11 00 011111111000 00000 0 01111 0 Q ss_pred ---CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeee-chHHhccCCCCCCcccCcc Q lcl|NC_012753. 233 ---TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV-PTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 233 ---~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v-~~~~l~~~~~~~g~~~~~~ 308 (502) ...|+++|-- ...-..+.|.+-+.+.++++.++.+|+..|...+-+-..+..+++ +...+..... .+. T Consensus 301 ~p~~~fP~vP~~g--~r~~~d~~~~~yG~vr~~kd~Q~~~N~~~S~~~~~~a~~~~~~~i~~~~~i~~~~~----~~~-- 372 (708) T protein:vir:10 301 IPGEHIPLIPVYG--KRWFIDDIERVEGHIAKAMDPQRLYNLQVSMLADTAAQDPGQIPIVGMEQIRGLEK----HWE-- 372 (708) T ss_pred CCCCceeeEEEee--eeeccCCCcccceeecccchhHHHHHHHHHHHHHHHHhcCCcccccChhhhhhHHH----HHh-- Confidence 1113332210 000112344344678899999999999999999877544444433 2222211000 000 Q ss_pred ccccccchhhccccC---CCCcc---ccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHH Q lcl|NC_012753. 309 REFETGHNVYEQFDS---GDMDK---GIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSE 382 (502) Q Consensus 309 ~~~~~~~~~~~~~~~---~~~~~---~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~ 382 (502) .++.+...|..... .+|.- ......+++.--..++...++.....|....|+++..+|..+ + .||.+|..+ T Consensus 373 -~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~~~q~~~~~~~~~~l~q~~~~~i~~vsG~~~~~lG~~s-n-~SG~aI~~r 449 (708) T protein:vir:10 373 -ARNKKRPAFLPLREVRDKSGNIIAGATPAGYTQPAVMNQALAALLQQTSADIQEVTGGSQAMQQMPS-N-IAQETVNNL 449 (708) T ss_pred -hccccchhhhccccccccccccccccCCccccCCccchHHHHHHHHHHHHHHHHHhCcChhHccCcc-c-hHHHHHHHH Confidence 01111111111110 01110 001222233223456788899999999999999999999643 3 578888877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----h---cccCCCc-----------cc---------------ccceE Q lcl|NC_012753. 383 QSDTYQMRNSIATLVEKSLKELVISILELAKV----Y---NLYTGEI-----------PT---------------MDEVS 429 (502) Q Consensus 383 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~----~---~~~~~~~-----------~~---------------~~~i~ 429 (502) ...........-..++.+.+..-+.+|.+-.- . ++.+.+- .+ .++|. T Consensus 450 q~qg~~~l~~~~Dnl~~~~~~~g~~lL~li~~~y~~er~~RI~~edg~~~~v~in~~~~d~~~g~~~~~nDi~~g~yDv~ 529 (708) T protein:vir:10 450 MNRADMASFIYLDNMAKSLKRAGEVWLSMAREVYGSEREVRIVNEDGSDDIAVLSAQVVDRQTGAVVALNDLSVGRYDVT 529 (708) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEEecceeccCCCcceeeeeccceeeEEEE Confidence 66666555555566666666555544443322 1 1222110 00 01222 Q ss_pred EEeCCCccCCHHHHHHHHHHHHhcCC-CCHHH-----HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 430 VDLDDGVFTDRNAEFDYWSKMVAAGF-APKTM-----AIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 430 v~f~d~i~~d~~~~~~~~~~~~~~Gi-~S~et-----~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |+=..+.+.-.++..+.++++..+.. ..+.+ .+.++-++.- +++.+++|+...+......+.. .| T Consensus 530 i~~~p~~~s~r~~~~~~l~qll~~~~p~~~~~~~~~~~~l~~~D~p~--~~ei~erir~~~~~~~~~~~~~------~e 600 (708) T protein:vir:10 530 VDVGPSYTARRDATVSVLTNVLSSMLPTDPMRPAIQGIILDNIDGEG--LDDFKEYNRNQLLISGIAKPRN------EK 600 (708) T ss_pred EecccCchhHHHHHHHHHHHHHHhcCCCchhhHHHHHHHHHhcCCcC--hHHHHHHHHHhhcccccccccc------hh Confidence 22222333334566666666654422 11112 1223333332 3445555555433221111000 11 No 105 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=99.10 E-value=1.2e-10 Score=75.06 Aligned_cols=429 Identities=11% Similarity=0.055 Sum_probs=200.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhh-ccccccCCHHHHHHHH-----------------HHHHHhcCCCCccccccCC Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSIT-DHPKIAISPEEYNRIM-----------------DNLRYFAGDFDSVTYRDSN 62 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~-~~~~~~~~~~~~~~i~-----------------~~~~~Y~g~~~~~~~~~~~ 62 (502) |+=.-++.+|.....+.+........- ....+++++-....+. ....||... ... T Consensus 37 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ds~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~f~ 109 (765) T protein:vir:96 37 MIKLGKIRGWNVEPEKAPVIRSVKDFLEPGLSVAMDSAYGDGPTPAAKAAAGGQNPYVVPTMLQDWYNSQ-------GFI 109 (765) T ss_pred chhHHHHhhcccccccCCCCCCCCcccCcccceeccccccccccchHHHhhhccCccchhhHHHhhhccc-------CCc Confidence 666666777766654443332211110 0111222221111100 011111110 000 Q ss_pred CccccccceecchHHHHHHHHhhhhhcCcceEeeCCH----HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe Q lcl|NC_012753. 63 GSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNE----VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYID 138 (502) Q Consensus 63 ~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~----~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d 138 (502) +...-.-+.+..+++.+|+..|.-++.+...|+++++ ...++|++.++.-++...+.++++++-.+|++++.+-++ T Consensus 110 gyql~alY~~~~l~rkiVd~pAeDa~R~g~~I~~~~~e~~~~~~~~l~~~~~rl~v~~~l~ea~~~~RlyGga~i~i~i~ 189 (765) T protein:vir:96 110 GYQACAIISQHWLVDKACSMSGEDAARNGWELKSDGRKLSDEQSALIARRDMEFRVKDNLVELNRFKNVFGVRIALFVVE 189 (765) T ss_pred cHHHHHHHHhCchhhhhhhcchHHhhcCCceeecCccccCHHHHHHHHHHHHHhhHHHHHHHHHHHhhhceeeEEEEEec Confidence 1111112445688999999999999999999988643 344567777777788999999999999999999876654 Q ss_pred C--CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEE---------EeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCc Q lcl|NC_012753. 139 G--DQIRVSFVQATVFFPLQANTQDVSSAAIVTKST---------KTEGQKVKYYSLIEFHEWNKETYTISNELYESESK 207 (502) Q Consensus 139 ~--~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~---------~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~ 207 (502) . +.-.-.-++++.+-+ +.......+.++- ..+...-.||. -+. |+|. T Consensus 190 ~~D~~~l~~PL~~~~I~k-----g~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~-P~~-------y~i~--------- 247 (765) T protein:vir:96 190 SDDPDYYEKPFNPDGIAP-----GSYKGISQIDPYWAMPQLTAESTADPSAEHFYE-PDF-------WIIS--------- 247 (765) T ss_pred ccCcchhhcccccccccc-----ceeeEEEEechhhcccccchhccccccccccCc-cee-------eeec--------- Confidence 2 110001112221111 1111111111100 00000000110 011 1110 Q ss_pred cccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhcccee Q lcl|NC_012753. 208 TIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRV 287 (502) Q Consensus 208 ~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i 287 (502) |..| |+..+ +.+.|.+.|. +.+ .-...+|+|++..+.+-|..++++......=+...+.++ T Consensus 248 ---g~~I----H~SRl---i~~~g~~lpd--~lk-------~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v 308 (765) T protein:vir:96 248 ---GKKY----HRSHL---VVVRGPQPPD--ILK-------PTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSKRTST 308 (765) T ss_pred ---Ccee----ccceE---EEecCCCchh--hhc-------cccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccce Confidence 1111 01110 1122221110 111 112357999999999999999987755444333222233 Q ss_pred eechHHhccCCCCCCcccCccccccccchhhcc---ccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChh Q lcl|NC_012753. 288 AVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQ---FDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTG 364 (502) Q Consensus 288 ~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~ 364 (502) +-- .+++...+..+ +.. .+.. ...++. +..-+. ..-++.++.+ ..-....++...++|+..++++.. T Consensus 309 ~k~-~~~~~l~~~~~--l~~--r~~~-~~~~r~n~g~~~id~--ee~~e~~s~~--lsgl~d~l~~~~~~iAaas~IP~t 378 (765) T protein:vir:96 309 IHV-DVEKAIANEDA--FNA--RLAF-WIANRDNHGVKVIGI--DETMEQFDTN--LSDFDSVIMNQYQLVAAIAKTPAT 378 (765) T ss_pred eee-chHhhhccHHH--HHH--HHHH-HHHhcCCceeEEecC--CcceeEEecc--cCCHHHHHHHHHHHHHhhhCCCee Confidence 211 12221111110 000 0000 001110 111111 1235555544 345677888889999999999865 Q ss_pred h-cccccccc-ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHH Q lcl|NC_012753. 365 M-FSFDGKSM-KTATEVVSEQSDTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRN 441 (502) Q Consensus 365 ~-~~~~~~~~-~tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~ 441 (502) . ||....|. +|+.+=...|. ..++.+| ..++..|+.|+.++++. +.. ..+++|.|+.-...++. T Consensus 379 ~LfGqsp~GlnATGe~D~~nYy---D~I~s~Qe~~l~p~le~L~~li~~s--------~~i--~~d~~i~FnpL~~~sek 445 (765) T protein:vir:96 379 KLLGTSPKGFNATGEHETISYH---EELESIQEHIFDPLLERHYLLLAKS--------ESI--DVQLEIVWNPVDSTTSQ 445 (765) T ss_pred eeccCCcccccCcchHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHh--------cCC--CCcceEEeCCCCCCCHH Confidence 4 66554553 56653323333 3344444 56788999998887643 122 23689999998888877 Q ss_pred HHHHH-------HHHHHhcCCCCHHHHHHhc--------CCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 442 AEFDY-------WSKMVAAGFAPKTMAIEKT--------LNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 442 ~~~~~-------~~~~~~~Gi~S~et~l~~~--------~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +.+++ .++++.+|++|..+++..+ +.+++++++.+ .-+..|... +..-.+..+++-.|| T Consensus 446 EkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~-~~~~pe~~~-~~~~~~~~~~~~~~e 519 (765) T protein:vir:96 446 QQAELNNKKAATDEIYINSGVVSPDEVRERLRDDPRSGYNRLTDDQAETE-PGMSPENLA-ELEKAGAQSAKAKGE 519 (765) T ss_pred HHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccccCCCCCCCccccccc-cCCCccccc-cccCCCcccccccCc Confidence 66654 6678889999998877654 23444433211 001111100 000111111112222 No 106 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=99.10 E-value=2.2e-09 Score=68.02 Aligned_cols=434 Identities=9% Similarity=0.047 Sum_probs=205.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCc-----cccc--cCCC---------- Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDS-----VTYR--DSNG---------- 63 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~-----~~~~--~~~~---------- 63 (502) |+++++.-.+... .+...-....+.|.+-... +..+ .... T Consensus 8 ~~~~dr~i~~~~~-----------------------~~~~~~~~~~~~y~aa~~~r~~~~w~~~~~~~s~~~~i~~~~~~ 64 (505) T protein:vir:96 8 PSLAQRMVNWAWY-----------------------RYVEPQKNAARAFEAARRDRLGKAWLRRASRLSADEEIYADLAS 64 (505) T ss_pred cchhhcccchhhh-----------------------hhHHHHHHhhhhcccccCCCccccccCCCCCCChHHHHHHHHHH Confidence 6666555443211 1111222333445442111 1000 0000 Q ss_pred --ccccccceecchHHHHHHHHhhhhhc-CcceEeeC--------CHHHHHHHHHHHhh------------ccHHHHHHH Q lcl|NC_012753. 64 --SQVKRDFNHLPIGRTASKKVASLVFN-EQATIRVD--------NEVADAFINETLKN------------DKFSKNFER 120 (502) Q Consensus 64 --~~~~~~~~~~n~~k~iv~~~a~~l~~-ep~~i~~~--------d~~~~e~l~~~~~~------------~~f~~~~~~ 120 (502) .+..+-..-.++++-+++.+++.++| ..+++... ++..++.+++.|+. .+|...... T Consensus 65 lr~RaRdL~rNn~~a~~av~~~~~nvVG~~Gi~~~~~~~~~~~~~~~~~~~~ie~~w~~Wa~~~~~D~~g~~~f~~lq~l 144 (505) T protein:vir:96 65 LVQRAREQSINNPYAKRFYQLLKNNVIGPKGMTFQSRVKRRNGKPDDRANTLIEGNWQQWIKKGNCDVTGRYHFVTLLHL 144 (505) T ss_pred HHHHHHHHHhcChHHHHHHHHHHHHhcCCCcceeeecCCcccccccHHHHHHHHHHHHHhcCCcCcceeccCCHHHHHHH Confidence 00011122346889999999999999 46665442 45555555544432 247777777 Q ss_pred HHHHHhhcCCEEEEEEEeCC---ceEEEEEcCCeEE-EEEEc--CCC-eEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC Q lcl|NC_012753. 121 YLESCLALGGLAMRPYIDGD---QIRVSFVQATVFF-PLQAN--TQD-VSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE 193 (502) Q Consensus 121 ~~~~~~~~G~~~~~~~~d~~---~~~i~~v~~~~~~-Pi~~d--~~~-~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~ 193 (502) ++...+.-|.++++..+..+ .+++..++|+.+- |.-.. .+. +...+ +.+. . T Consensus 145 ~~r~~~~dGE~f~~~~~~~~~~~~~~lqliepd~l~~~~n~~~~~~~~i~~GI------e~d~----------------~ 202 (505) T protein:vir:96 145 WMETLARDGEVLVREHRGYPNKWGYALQILECDRLDLNYNADLQNGNRIRMSI------ELDA----------------W 202 (505) T ss_pred HHHHHhhCCceEEEEeecCCCCcceEEEEechhhcCCCCCcccCCcCeEEece------EECC----------------C Confidence 88888889998888877655 3689999999852 21000 111 11111 1111 0 Q ss_pred eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHH Q lcl|NC_012753. 194 TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTY 273 (502) Q Consensus 194 ~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~ 273 (502) +-.+-|.+++...++..... ... ...+..++.+-++. +......+...|+|.|+.++..+..++.-. T Consensus 203 Gr~~aY~i~~~hPgd~~~~~--------~~~-~~~~~rvpa~~vlH----~f~~~r~gQ~RGis~lapvl~~l~~l~~y~ 269 (505) T protein:vir:96 203 ERPVAYHLLVNHPGDNSYCY--------HYA-GQTYERVPADEIIH----TFVPWRPHQNRGIPWTHASMVELHHIGEYR 269 (505) T ss_pred CceEEEEEeecCCCcccccc--------ccc-cccccccCHhHhhh----hhcccCCccccCcchHHHHHHHHHHHhHHH Confidence 11111222222111110000 000 00001111111111 123334566789999999999999999655 Q ss_pred HHHHHHHh-hccceeeechHHhccCCCCCCcccC-ccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHH Q lcl|NC_012753. 274 DEFMWEVK-MGQRRVAVPTQMIKTEYDTNGEKVT-VKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKG 351 (502) Q Consensus 274 S~~~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~ 351 (502) ..-..--. .+.-..| ++.+.+..+.... ...... ..+ .......-..+.-|+.+++.-+..++..-+..+ T Consensus 270 dael~~a~i~A~~a~f-----i~~~~~~~~~~~~~~~~~~~--~~l-~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~ 341 (505) T protein:vir:96 270 KSEMIAAELGAKKVGF-----YEQDPEAYDQPPEDDQGEIV--EEV-EAGTYQLLPYGIRFKEHKIDHPHTNFGAFVKSS 341 (505) T ss_pred HHHHHHHHHhhhheee-----eecCCccCCCccccccCccc--ccc-CCceeeecCCCCeeeeeCCCCCCCCHHHHHHHH Confidence 43333222 1222223 3333222221100 000000 000 001111111223477788888888899999999 Q ss_pred HHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhcccCCC-cccccceE Q lcl|NC_012753. 352 LSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEK-SLKELVISILELAKVYNLYTGE-IPTMDEVS 429 (502) Q Consensus 352 l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~-~l~~l~~~il~~~~~~~~~~~~-~~~~~~i~ 429 (502) ++.|....|+|+..+..+-+++ |=.++++.....-......+..|.. .++.+.+..+..+-+.+...-. .....-.. T Consensus 342 lr~iaaglgi~ye~lt~D~s~~-nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~ 420 (505) T protein:vir:96 342 LRGVAAGMGPAYNRLAHDLEGV-NFSSLRSGELDERDLYKLLQFFVVTELLERVAGNLISMSLLTQALPLNMVDIDRLSQ 420 (505) T ss_pred HHHHHhhcCCCHHHHhcccccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCCCCccchhhce Confidence 9999999999999987664432 1112233333333344444444443 3333444334333222211100 01111123 Q ss_pred EEe--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHH---hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 430 VDL--DDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKIND---ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 430 v~f--~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~---E~~~~~~~~~~~~~~~~~g~ 502 (502) +.| +.-.-+|+.++++....++.+|+.|.+..++.. |.+-+++-+++++-++ |..-..+....+....--.| T Consensus 421 ~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~t~~~~~a~~-G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~ 497 (505) T protein:vir:96 421 YAFQPRGWDWVDPAKDSKAHSESIKNRTRSRSSIIRAA-GDDPEDVFDEIAWEEQLMRDKGVNPTPPEQESKDATTDE 497 (505) T ss_pred eeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCCCC Confidence 444 444557999999999999999999999988885 8776665444333221 11111111111111111111 No 107 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=99.06 E-value=2.2e-10 Score=73.51 Aligned_cols=401 Identities=11% Similarity=0.089 Sum_probs=185.2 Q ss_pred CChhHHHH--HHHHHHhhcccccchhhh-hccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQTIK--NFIKRSNYVITNQSLNSI-TDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~ik--~~i~~~~~~~~~~~l~~i-~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k 77 (502) |++|-+=| ...+. ..+.+. ........ ..+|.+. .+. ......-+.+..+++ T Consensus 1 ~~~~m~~~~~~~~~~-------D~~~~~~~~~~g~~~-----------~~~~~~~--~~~-----~~~l~~~Y~~~~l~~ 55 (435) T protein:vir:79 1 MGVFMSDKVKAITKE-------DGYNEIFGSKDGTFR-----------PNAFYMQ--RAA-----FKALSQFYEEDGMAR 55 (435) T ss_pred CCcccccccccchhh-------cchhhhhcccccccc-----------cCcccCC--cCC-----HHHHHHHHhcCchhh Confidence 77764322 11111 011110 00000000 0000000 000 000111234557889 Q ss_pred HHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEE Q lcl|NC_012753. 78 TASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQA 157 (502) Q Consensus 78 ~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~ 157 (502) .+|+..|.-++.+...|+.+++ .+.++..++.-++...+.+++.++-.+|++++.+-..+++.. ++ |+-. T Consensus 56 ~~Vd~~aed~~r~g~~i~g~~~--~~~~~~~~~~l~~~~~l~~a~~~~rl~G~~~i~i~~~d~~~~------~~--Pl~~ 125 (435) T protein:vir:79 56 RIVDVIPEEMVTPGFKVDGVKN--EKSFKSRWDELRLNAKIIDALSWSRLFGGSAILAVVADNKML------KS--PVKP 125 (435) T ss_pred hhhccchHHhhcCCceecCCCh--HHHHHHHHHHhhHHHHHHHHHHhhhccccEEEEEEecCCCCc------cc--cccc Confidence 9999999999999988876543 245666666667889999999999999998887766332211 11 2211 Q ss_pred cCCCeEEEEEEEEEEEe-------eCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeec Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKT-------EGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLN 230 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~-------~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~ 230 (502) .+.+....++.+. +. +...-.|+ .-+. |+|. . ....-+..|. +.. -+.+. T Consensus 126 -~g~i~~i~v~d~~-~i~~~~~~~dp~sp~fg-~P~~-------y~v~-----~-~~~~~~~~iH----~SR---li~~~ 182 (435) T protein:vir:79 126 -GAQLEDIRVYDRY-QITIHERETNARSVRYG-EPKL-------YKIS-----P-GGDIPEFFVH----YSR---ICIID 182 (435) T ss_pred -CCceeeEEeechh-hccchhhccCCcccccC-cceE-------EEEe-----c-CCCCCceEEc----cee---EEEec Confidence 1222222111110 00 00000000 0011 1111 0 0000011111 111 01122 Q ss_pred CCCcceEEEecCCccccccccCcCCcchh-hhHHHHHHHHHHHHHHHHHHHhhccceee-ec--hHHhccCCCCCCcccC Q lcl|NC_012753. 231 GLTRPLFTYLKPPGMNNKDINSPLGLSIF-DNAKTTMDFINTTYDEFMWEVKMGQRRVA-VP--TQMIKTEYDTNGEKVT 306 (502) Q Consensus 231 ~~~~~~f~~~~~~~~n~~~~~~p~G~S~~-~~~~~lid~ld~~~S~~~~~~~~~~~~i~-v~--~~~l~~~~~~~g~~~~ 306 (502) |.+.|. .-.....+||.|+| ..+.+-+..++++......=+.-.+..++ ++ ..++.. +.+ ... T Consensus 183 g~~~p~---------~~~~~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~---~~~-~~~ 249 (435) T protein:vir:79 183 GERVSN---------EKRRQNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDD---EEG-RYA 249 (435) T ss_pred CCcchh---------hhccccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcC---ccc-hHH Confidence 221110 01123467899987 68889888888877666554432333333 22 122211 111 100 Q ss_pred ccccc---cccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-cccccccc-ccHHHHHH Q lcl|NC_012753. 307 VKREF---ETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSM-KTATEVVS 381 (502) Q Consensus 307 ~~~~~---~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~-~tAtei~~ 381 (502) ....+ .........+...+.+ ..++.++.++ ......++...++++..+|+|... ||...+|. +|+.+-.. T Consensus 250 ~~~r~~~~~~~~~~~~~~~i~~~~--e~~e~~~~~l--sgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~ 325 (435) T protein:vir:79 250 ARLRLAQVDDESGVGKAIGIDATD--EEYEVLNSDV--SGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALE 325 (435) T ss_pred HHHHHHHHHHhcCCCCceeEecCC--cceEEEeccc--CCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHH Confidence 00000 0000111112222111 2355555443 456777888899999999999866 67666664 46655544 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHH-------HHHHHhcC Q lcl|NC_012753. 382 EQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDY-------WSKMVAAG 454 (502) Q Consensus 382 ~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~-------~~~~~~~G 454 (502) .|.+..... .+..++..|++|++++++ ..+++|.|+.-...++.+.++. +.+++.+| T Consensus 326 ~yyd~i~~~--Qe~~l~p~l~~l~~li~~--------------s~d~~~~f~pL~~~sekEkAei~~~~a~a~~~~~~~g 389 (435) T protein:vir:79 326 TFYKLIDRK--RVEDYKPILEFLLPFMIS--------------ETEWSIEFEPLSVPSDKDKAEIMAKNVESVVKLKAEQ 389 (435) T ss_pred HHHHHHHHH--HHHHHHHHHHHHHHHhhc--------------CCCCeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcC Confidence 444433332 245678888888776552 1368899999888888665544 44556677 Q ss_pred CCCHHHHHHhcC------CCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 455 FAPKTMAIEKTL------NVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 455 i~S~et~l~~~~------~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++.+++...+- +..++ +..++ ..+...+++...=+|| T Consensus 390 ~i~~~e~r~~L~~~~~~~~~~~~----~~~~~------~~~~d~~~~~~~e~g~ 433 (435) T protein:vir:79 390 AINLKETRDTLRSICPDLKIMDN----DNIEL------PEPEDLDPEPGQEGGL 433 (435) T ss_pred CCCHHHHHHHHHHhccccCCCCc----ccccC------CccccCCCCCCCCCCC Confidence 777766543321 11111 00111 0112223344444455 No 108 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=99.02 E-value=2.8e-09 Score=67.46 Aligned_cols=435 Identities=11% Similarity=0.035 Sum_probs=185.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhcccc--ccCCHHHHHHHH-------HHHHHhcCC--CCccccccCCCcccccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPK--IAISPEEYNRIM-------DNLRYFAGD--FDSVTYRDSNGSQVKRD 69 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~--~~~~~~~~~~i~-------~~~~~Y~g~--~~~~~~~~~~~~~~~~~ 69 (502) |...+++...-. .+..+.... ....... +.+..+-+..+- ....|+.+. .++..+....+...-.- T Consensus 66 ~~~~~~~~~~~~-~~~~~a~~~--a~~~~~~~~~~~~~Dgl~n~~~~lG~~~~~s~y~~~~~~~~~~~~~~f~gyql~al 142 (862) T protein:vir:99 66 VEISDSVNAKSV-SGKNFAMDS--AVRSAIKAITGFAMDDGGGAPVPIGAEGKQSSYAVPEALQDWYLSQGFIGHQACAL 142 (862) T ss_pred ccccccccchhh-hhhhhcchh--hcchhhhhhhhhhhhcchhhhhhccccccccccccchhccccccccCcccHHHHHH Confidence 111111111000 000000000 0000000 000000000000 000111110 01111111111111112 Q ss_pred ceecchHHHHHHHHhhhhhcCcceEeeCC------HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCce- Q lcl|NC_012753. 70 FNHLPIGRTASKKVASLVFNEQATIRVDN------EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQI- 142 (502) Q Consensus 70 ~~~~n~~k~iv~~~a~~l~~ep~~i~~~d------~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~- 142 (502) +.+..+++.+|+..|.-++.+.+.|.+.+ +...+.|++.++.-++...+.+++.++-.+|++++.+..+...+ T Consensus 143 Y~~~~larkiVd~pAeDatR~g~~I~~~~d~~e~~~e~~~~ie~~~~rL~v~~~l~eair~~RLyGga~ililv~~~D~~ 222 (862) T protein:vir:99 143 IAQHWLVDKACSLAGEDAIRNGWHLKSLGEGEEIDEESLEKFKAIDVEFKVKENLIEFNRFKNVFGIRVAIFVVDSEDPD 222 (862) T ss_pred HHhCchhhhhhhhhhHHHhhCCceEeecCcccccCHHHHHHHHHHHHHhhHHHHHHHHHHhcccccceEEEEEecCcCch Confidence 44568899999999999999999999842 34456777777777889999999999999999877665542111 Q ss_pred -EEEEEcCCeEEEEEEcCCCeEEEEEEEEEE---------EeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCc Q lcl|NC_012753. 143 -RVSFVQATVFFPLQANTQDVSSAAIVTKST---------KTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQ 212 (502) Q Consensus 143 -~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~---------~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~ 212 (502) .-.-++++.+ ..+.+....++.+.- ..+...-.|| .-+. |.|. |. T Consensus 223 ~LsqPLn~e~I-----~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yG-kP~~-------y~I~------------g~ 277 (862) T protein:vir:99 223 YYEKPFNPDGI-----TPGSYRGISQIDPYWMMPMLTAESTADPSSQFFY-EPEF-------WIIS------------GQ 277 (862) T ss_pred hhhcCcCcccc-----cccceeEEEEechhhhcccccccccccccccccC-Ccee-------eeec------------Ce Confidence 0011111110 011111111111100 0000000011 0111 1110 11 Q ss_pred eeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechH Q lcl|NC_012753. 213 RVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQ 292 (502) Q Consensus 213 ~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~ 292 (502) .| |+..+ +.+.|.+.| ++.. .....+|+|++..+.+.|..++.+......=+...+..++- -. T Consensus 278 ~I----H~SRl---iif~g~~vp---d~lk------~ay~f~G~SvLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~k-td 340 (862) T protein:vir:99 278 KY----HRSHL---IIARGPQPA---DILK------PTYIFGGIPLVQRIYERVYAAERTANEAPLLAMNKRTTAIH-TD 340 (862) T ss_pred ee----cccee---EEecCCCch---hhhh------ccCCccCccHHHHHHHHHHHHHHHHHHHHHHHHHhccceee-ch Confidence 11 01000 112221111 1111 11336899999999999999987765554333333333321 11 Q ss_pred HhccCCCCCCcccCccccccccchhhcc---ccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-ccc Q lcl|NC_012753. 293 MIKTEYDTNGEKVTVKREFETGHNVYEQ---FDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSF 368 (502) Q Consensus 293 ~l~~~~~~~g~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~ 368 (502) ++....+.. .+.. .+.. .+.++. +..-+. ..-++.++.++ ......++...++|+..++++... ||. T Consensus 341 ~l~~l~~ed--~l~~--r~~~-~~~~rdN~Gi~liD~--eEe~e~ls~sl--SGL~dll~~~~q~IAaas~IP~tiLfGq 411 (862) T protein:vir:99 341 TAKAIANED--KFIQ--RLMF-WVRYRDNHAVKVLGT--DETMEQFDTSL--ADFDAVIMGQYQLVASIAKTPATKLLGT 411 (862) T ss_pred hHhhhccHH--HHHH--HHHH-HHhccCcceeEEecC--CCceeEEeccc--CChHHHHHHHHHHHHhhhCCCceeeccc Confidence 222111100 0000 0000 011111 111111 12355554433 356677888888999999999874 676 Q ss_pred ccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHH- Q lcl|NC_012753. 369 DGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDY- 446 (502) Q Consensus 369 ~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~- 446 (502) ...|. +|+.+=...|.+...... ++.++..|+.|+.++.. .+ + ...+++|.|+.-...++.+.++. T Consensus 412 spaGlnATGE~D~~nYyD~I~s~Q--E~~L~P~LerL~~li~~--~l-----g---~~~d~~ieFnpL~~~sekEkAEi~ 479 (862) T protein:vir:99 412 APKGFNSTGEFETISYHEELESIQ--EHVYMPFLQRHYLISRL--SL-----G---IQHEIDVVMEPVASMTAQQQADLN 479 (862) T ss_pred CcccccCchHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHH--hc-----C---CCCcceEEeCCCCCCCHHHHHHHH Confidence 54553 566543333433333222 45688888887665432 11 1 12469999999888887777655 Q ss_pred ------HHHHHhcCCCCHHHHHHh--------cCCCCHHHHHHHHHHHHHhhhcc-----cCCCCCccccCC-------- Q lcl|NC_012753. 447 ------WSKMVAAGFAPKTMAIEK--------TLNVTKEQAQEIYQKINDETMVS-----TDSFRTSEEVDI-------- 499 (502) Q Consensus 447 ------~~~~~~~Gi~S~et~l~~--------~~~~~deea~~el~ri~~E~~~~-----~~~~~~~~~~~~-------- 499 (502) +++++.+|++|..+++.. ..+++++++++.- -+..|+... ......+.+... T Consensus 480 kk~Aea~~~lv~sGvispdEvR~~L~~~~~~g~~~l~ded~E~d~-~~~~e~~~~~e~~g~a~~~ap~de~~aga~~~~~ 558 (862) T protein:vir:99 480 KTKAEGGKVLIDGGVISPDEERNRIRDDKRSGYNRLTKEDAEETP-GASPENLAAYQKAGAAQETASAKETQAGAAVTTA 558 (862) T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHhcCCcCCCCCCcccccccC-CCCcccccccccCCcccccccccccccccCCccc Confidence 567888999999887765 2345555443110 000000000 000000000000 Q ss_pred --------------CCC Q lcl|NC_012753. 500 --------------YGE 502 (502) Q Consensus 500 --------------~g~ 502 (502) +|. T Consensus 559 e~d~~~~p~~~~~~~g~ 575 (862) T protein:vir:99 559 EGDQPNVQMVPSMKPGQ 575 (862) T ss_pred cCCcccccccCCCCCCC Confidence 000 No 109 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=99.00 E-value=6.3e-09 Score=65.53 Aligned_cols=435 Identities=11% Similarity=0.037 Sum_probs=198.8 Q ss_pred CChhHHHH-------HHHHHHhhcccccchhhhhccccc-cCCHHHH------HHHHHHHHHhcCCCCccccccCCCccc Q lcl|NC_012753. 1 MGIIQTIK-------NFIKRSNYVITNQSLNSITDHPKI-AISPEEY------NRIMDNLRYFAGDFDSVTYRDSNGSQV 66 (502) Q Consensus 1 m~~~~~ik-------~~i~~~~~~~~~~~l~~i~~~~~~-~~~~~~~------~~i~~~~~~Y~g~~~~~~~~~~~~~~~ 66 (502) |++.+.+- +.++.... ..-.-......|.. ..+.... .-....+.+|.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~--~a~~~~~~~~~w~~~~~s~~~~i~~~~~~lr~RaRdl~rN--------------- 63 (530) T protein:vir:38 1 MKIPSLVGPDGKTSLREYAGYHG--GGGGFGGQLRGWNPPSESADAALLPNYSRGNARADDLVRN--------------- 63 (530) T ss_pred CccceeecCccccchHHHhhhhc--ccCCCCCcccccccCCCCHHHHHHHHHHHHHHHHHHHHhc--------------- Confidence 65554222 11111100 00000111111111 1111110 011112222222 Q ss_pred cccceecchHHHHHHHHhhhhhcCcceEeeC------------CHHHHHHHHHHHh--------------hccHHHHHHH Q lcl|NC_012753. 67 KRDFNHLPIGRTASKKVASLVFNEQATIRVD------------NEVADAFINETLK--------------NDKFSKNFER 120 (502) Q Consensus 67 ~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~------------d~~~~e~l~~~~~--------------~~~f~~~~~~ 120 (502) .++++-+++.+++.++|..+++... +++.++.+++.|+ ..+|...... T Consensus 64 ------n~~a~~av~~~~~nvVG~Gi~~~~~p~~~~l~~~~~~~~~~~~~ie~~w~~W~~~~~~~~D~~g~~~f~~~q~l 137 (530) T protein:vir:38 64 ------NGYAANAVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFSRDVEAAWNEYAEDDFCGIDAERKRTFTMMIRE 137 (530) T ss_pred ------ChHHHHHHHHHHHHhhCCCceeeeccchhhcCCCHhHHHHHHHHHHHHHHHhhcCCCcEEeeeccCCHHHHHHH Confidence 3688999999999999987776542 2333444444442 1257777777 Q ss_pred HHHHHhhcCCEEEEEEEeCC-----ceEEEEEcCCeEE-EEEEcCCC-eEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC Q lcl|NC_012753. 121 YLESCLALGGLAMRPYIDGD-----QIRVSFVQATVFF-PLQANTQD-VSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE 193 (502) Q Consensus 121 ~~~~~~~~G~~~~~~~~d~~-----~~~i~~v~~~~~~-Pi~~d~~~-~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~ 193 (502) ++...+.-|.++++..+++. .+++..++|+.+- |.....++ +...+ +.+. .+ T Consensus 138 ~~r~~~~dGE~~~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GI------e~d~-~G-------------- 196 (530) T protein:vir:38 138 GVAMHAFNGELCVQATWDSDSTRLFRTQFKMVSPKRVSNPNNIGDTRNCRAGV------KIND-SG-------------- 196 (530) T ss_pred HHHHHhhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCeeEeee------EECC-CC-------------- Confidence 88888999999998888642 3789999998852 21111111 11111 1110 01 Q ss_pred eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHH Q lcl|NC_012753. 194 TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTY 273 (502) Q Consensus 194 ~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~ 273 (502) -.+-|.+++..-++..+. .|..++. ...++++-++.+. +....+...|+|.|+.++..+..|+.-. T Consensus 197 -r~~aY~i~~~~~~~~~~~------~~~~~~~---~~~v~a~~vlH~f----~~~r~gQ~RGis~lapvl~~l~~l~~y~ 262 (530) T protein:vir:38 197 -AALGYYVSDDGYPGWMAQ------NWTYIPR---ELPGGRPSFIHVF----EPMEDGQTRGANAFYSVMEQMKMLDTLQ 262 (530) T ss_pred -ceEEEEEeeccCCCcccc------ccceeee---eeccChhHeEeec----cccCCCcccCCchHHHHHHHHHHHhHHH Confidence 011111222110000000 0111111 1223444444433 2334567789999999999999998654 Q ss_pred HHHHHHHh-hccceeeechHHhccCCCCCC-ccc------Ccccc----ccccchhh---ccccCCCC-----cccccee Q lcl|NC_012753. 274 DEFMWEVK-MGQRRVAVPTQMIKTEYDTNG-EKV------TVKRE----FETGHNVY---EQFDSGDM-----DKGIGIT 333 (502) Q Consensus 274 S~~~~~~~-~~~~~i~v~~~~l~~~~~~~g-~~~------~~~~~----~~~~~~~~---~~~~~~~~-----~~~~~i~ 333 (502) ..-..--. .+.-..|| +...+..+ ... ..... +......+ ..+....+ ..+.-|+ T Consensus 263 dael~~a~i~A~~a~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~ 337 (530) T protein:vir:38 263 NTQLQSAIVKAMYAATI-----ESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSLN 337 (530) T ss_pred HHHHHHHHHhhhheeee-----eccCCccccccccccCCcccccccccccchhhhhcccccceeccCceeeecCCCCeee Confidence 33222111 11112222 21111100 000 00000 00000000 00001111 1122377 Q ss_pred eeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Q lcl|NC_012753. 334 DLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM--KTATEVVSEQSDTYQMRNSIATLVEKS-LKELVISILE 410 (502) Q Consensus 334 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~--~tAtei~~~~~~l~~~~~~~~~~~~~~-l~~l~~~il~ 410 (502) .+++.-+..+|..-+..+++.|....|+|+..+..+-+++ +|+-+. ....-......+..|... ++.+++..+. T Consensus 338 ~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYSS~R~~---~~e~~r~~~~~q~~~~~~~~~pi~~~wl~ 414 (530) T protein:vir:38 338 LQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARAS---ANESWAYFMGRRKFVASRQACQMFLCWLE 414 (530) T ss_pred eeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHHHHHHH---HHHHHHHHHHHHHHHHHHHhhHHHHHHHH Confidence 7888877788888899999999999999999987665433 233332 223333333444434332 2333332232 Q ss_pred HHHhhccc--CCCcc-c-------ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHH- Q lcl|NC_012753. 411 LAKVYNLY--TGEIP-T-------MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQK- 479 (502) Q Consensus 411 ~~~~~~~~--~~~~~-~-------~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~r- 479 (502) .+-.-+.. .+... + +..+....+.-.-+|+.++++....++.+|+.|.+..+++. |.+-+++.+++++ T Consensus 415 ~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~-G~D~~~v~~q~a~e 493 (530) T protein:vir:38 415 EAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR-GDDYQEIFAQQVRE 493 (530) T ss_pred HHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHH Confidence 22111111 11110 1 11233333555678999999999999999999999988875 7766655444333 Q ss_pred ---HHHhhhccc-CCCCCccccCCCCC Q lcl|NC_012753. 480 ---INDETMVST-DSFRTSEEVDIYGE 502 (502) Q Consensus 480 ---i~~E~~~~~-~~~~~~~~~~~~g~ 502 (502) +++--.... .....+..+...++ T Consensus 494 ~~~~~~~Gl~~~~~~~~~~~~~~~~~~ 520 (530) T protein:vir:38 494 SMERRAAGLNPPAWAAAAFEAGVKKSN 520 (530) T ss_pred HHHHHHcCCCCCCCcccccCCCCCCCC Confidence 222111000 00001111111111 No 110 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=98.95 E-value=4.5e-09 Score=66.32 Aligned_cols=442 Identities=13% Similarity=0.045 Sum_probs=184.2 Q ss_pred CChhHHHHH---------HHHHHhhcccc-cchhhhhccccccCCHHHH-HHHHHHHHHhcCCCCccccccCCCcccccc Q lcl|NC_012753. 1 MGIIQTIKN---------FIKRSNYVITN-QSLNSITDHPKIAISPEEY-NRIMDNLRYFAGDFDSVTYRDSNGSQVKRD 69 (502) Q Consensus 1 m~~~~~ik~---------~i~~~~~~~~~-~~l~~i~~~~~~~~~~~~~-~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~ 69 (502) ...-.+++. ++|..-..+.. .......-+--++++.-.- .+-..+.-+| +.... ..+-..-.- T Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~g~~~~~~~~~~~~~-~~~~~-----~~~~~l~a~ 90 (532) T protein:vir:94 17 LQQAQRVDAKRATHTSLGLATAHEIDPTAYSPYERNAAQNAMAMDYGLQTGRNGRNALSF-VEATS-----WPGFPTLAL 90 (532) T ss_pred hhhHhhhhhhhhhhhhhhhhhhhhhcccccccccccccccccccccccCccccccccccc-ccccc-----cchHHHHHH Confidence 111111111 11110000000 0000000000000000000 0000000000 10000 000000011 Q ss_pred ceecchHHHHHHHHhhhhhcCcceEeeCCH-----HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEE Q lcl|NC_012753. 70 FNHLPIGRTASKKVASLVFNEQATIRVDNE-----VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRV 144 (502) Q Consensus 70 ~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~-----~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i 144 (502) +....+++.+|+..|.-++.+.+.|+++++ ...+.|+..++.-++...+.+++.++-.+|++++.+-+++.+... T Consensus 91 Y~~~~l~r~~Vd~~aed~~r~~~~i~~~~~~~~~~~~~~~i~~~~~~l~v~~~l~~a~~~~rlyG~a~i~i~v~~~~~~~ 170 (532) T protein:vir:94 91 LAQLPEYRTMHETPADECVRAWGKITCSSKDELAADKATRITQKLEQYNVRTLVRTVVIHDQAYGGAHVFPHLKMDGDSV 170 (532) T ss_pred HHcCchhhhhhccchHHHhhCCceEeeCCccccchHHHHHHHHHHHhhhHHHHHHHHHHhhhcccceEEEEEeccCCccc Confidence 234578899999999999999999988542 333456666666678889999999999999999887765433211 Q ss_pred EEEcCCeEEEEEEcCCCeEEEEEEEEEE------EeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc Q lcl|NC_012753. 145 SFVQATVFFPLQANTQDVSSAAIVTKST------KTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST 218 (502) Q Consensus 145 ~~v~~~~~~Pi~~d~~~~~~~~~~~~~~------~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~ 218 (502) ..-+|-..-|.....+.......+.+.. ...+-....|-+-++|. +. + |..|. T Consensus 171 ~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~-------v~-------~----g~~iH--- 229 (532) T protein:vir:94 171 PADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWI-------AT-------S----GKKIH--- 229 (532) T ss_pred cccccccccccccccceeeEEEeechheecccccccccccccccCCceeEE-------Ec-------c----Ceeec--- Confidence 1111100001000111111111111100 00000000011111111 10 0 11111 Q ss_pred cccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCC Q lcl|NC_012753. 219 LYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEY 298 (502) Q Consensus 219 ~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~ 298 (502) +.. -+.+.|.+.|- +.+ .-...+|+|++..+.+-+..++.+....+.=+...+..++.. .+ .... T Consensus 230 -~SR---li~f~g~~~p~--~~~-------~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~-~~-a~~l 294 (532) T protein:vir:94 230 -SSR---IHTVVGRPVGD--MLK-------AAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLAT-DM-AQLL 294 (532) T ss_pred -cce---EEEecCCCchh--hhc-------cccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee-ch-HHhh Confidence 000 01111111110 000 112347999999999999999987655554232223233221 11 1111 Q ss_pred CCCCcccCccccccccchhhcc----ccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-cccccccc Q lcl|NC_012753. 299 DTNGEKVTVKREFETGHNVYEQ----FDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSM 373 (502) Q Consensus 299 ~~~g~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~ 373 (502) ...+..- ...++.. .+.++. +..+. ...-++.++.+ .......++...+.++..+|++... ||...+|. T Consensus 295 s~~~~~~-~~~r~~~-~~~~~~n~g~~~id~--~~e~~e~~~~~--lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~Gl 368 (532) T protein:vir:94 295 APGGAQS-LDARLQL-FNLYRDNRNIGALDK--GTEEIQQTNTP--LSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGL 368 (532) T ss_pred cchhHHH-HHHHHHH-HHhhcCCccceEEcC--CCceeEEEecc--cCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccc Confidence 1111100 0000000 011111 11111 11235555433 3446677888888999999998774 67665554 Q ss_pred -ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHH----- Q lcl|NC_012753. 374 -KTATEVVSEQSDTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDY----- 446 (502) Q Consensus 374 -~tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~----- 446 (502) +|+.+=+..| +..++.+| ..++..|+.|+..++... .+.. ..+++|.|+.-...++.+.++. T Consensus 369 nstGe~D~~~y---yd~I~s~Qe~~l~p~le~l~~~l~~s~------~g~~--~~d~~~~f~pL~~~s~kEkAei~~~~a 437 (532) T protein:vir:94 369 NASSDGEIRVW---YDFIAGYQATNLTPLMEWIIDLIQLSE------YGQI--DPGLAWEWSPLMELDDKELAEVRQLNA 437 (532) T ss_pred cccchHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHh------cCCC--CCCceEEeCCCCCCCHHHHHHHHHHHH Confidence 4555433333 34444444 556888888888776432 1222 3468999998777777765543 Q ss_pred --HHHHHhcCCCCHHHHHHhcC-----CCCH-----HHHHHHHHHHHHhhhcccCCCCCc------cccCCCCC Q lcl|NC_012753. 447 --WSKMVAAGFAPKTMAIEKTL-----NVTK-----EQAQEIYQKINDETMVSTDSFRTS------EEVDIYGE 502 (502) Q Consensus 447 --~~~~~~~Gi~S~et~l~~~~-----~~~d-----eea~~el~ri~~E~~~~~~~~~~~------~~~~~~g~ 502 (502) ..+++.+|++|.+++...+- ++.. ++.. +.+.+..|.......++.+ +.++.-+. T Consensus 438 ~a~~~~~~~Gvi~~~Evr~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 510 (532) T protein:vir:94 438 STDSTLMELGVIDAKMVQQRLAADPTSGYAGALGERDELD-DVEEIAKQLMAAALNPPATAPQTPNPQPDSEDD 510 (532) T ss_pred HHHHHHHhcCCCCHHHHHHHHhcCCccccccccccccccc-cccchhhhhcccccCCCCCCCCCCCCCCCCCCC Confidence 46778899999988655431 1111 1100 1112222221111111111 11111111 No 111 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=98.95 E-value=1.1e-08 Score=64.21 Aligned_cols=432 Identities=11% Similarity=0.071 Sum_probs=196.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCC--C----ccccccCCC----------- Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDF--D----SVTYRDSNG----------- 63 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~--~----~~~~~~~~~----------- 63 (502) ||-+.++.. ++....-.....||.|.. . .|....... T Consensus 3 ~p~~~~~~~--------------------------~~~~~~~~~~~~y~~~a~~~~~~~~~w~p~~~s~~~~~~~~~~~l 56 (533) T protein:vir:34 3 TPTIPTLLG--------------------------PDGMTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLPNFTRG 56 (533) T ss_pred Cchhhhhhc--------------------------ccccchHHHHHhhhhccCCCCCcccccccCCCCHHHHHHHHHHHH Confidence 222221111 111122233344444321 1 010000000 Q ss_pred -ccccccceecchHHHHHHHHhhhhhcCcceEeeC------------CHHHHHHHHHHH----hh----------ccHHH Q lcl|NC_012753. 64 -SQVKRDFNHLPIGRTASKKVASLVFNEQATIRVD------------NEVADAFINETL----KN----------DKFSK 116 (502) Q Consensus 64 -~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~------------d~~~~e~l~~~~----~~----------~~f~~ 116 (502) .+..+-....++++-+++.++++++|..+++... +++.++.++..| ++ .+|.. T Consensus 57 r~RaRdl~rNn~~a~~av~~~~~nvVG~Gi~~~~~p~~~~lg~~~~~~~~~~~~ie~~w~~w~~~~~~~~D~~g~~~f~~ 136 (533) T protein:vir:34 57 NARADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEFAEDDCCCIDVERKRTFTM 136 (533) T ss_pred HHHHHHHHhcChHHHHHHHHHHHHhhCCCceeeeccchhhcCCChhHHHHHHHHHHHHHHHhhcCccceeccccccCHHH Confidence 0000111223688999999999999988776552 223333333333 21 25777 Q ss_pred HHHHHHHHHhhcCCEEEEEEEeC--C---ceEEEEEcCCeEE-EEE-EcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEE Q lcl|NC_012753. 117 NFERYLESCLALGGLAMRPYIDG--D---QIRVSFVQATVFF-PLQ-ANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHE 189 (502) Q Consensus 117 ~~~~~~~~~~~~G~~~~~~~~d~--~---~~~i~~v~~~~~~-Pi~-~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~ 189 (502) ....++...+.-|.++++..|.. | .+++..++|+.+- |.. .+...+...+ |+-. T Consensus 137 ~q~l~~r~~~~dGE~f~~~~~~~~~g~~~~~~lq~ie~d~l~~~~~~~~~~~i~~GI-------------------e~d~ 197 (533) T protein:vir:34 137 MIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRISNPNNTGDSRNCRAGV-------------------QIND 197 (533) T ss_pred HHHHHHHHHHhCCceEEEeeeccCCCCccceEEEEechhhcCCCCCCCCCCceEeee-------------------EECC Confidence 77778888899999999988864 2 4689999998852 211 1111121111 1100 Q ss_pred EeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHH Q lcl|NC_012753. 190 WNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFI 269 (502) Q Consensus 190 ~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~l 269 (502) ..-.+-|.+++...++..+.. |..++ ....+++.-++.+. +....+...|+|.|+.++..+..+ T Consensus 198 ---~Gr~~aY~i~~~~~~~~~~~~------~~~~~---~~~~v~a~~VlH~f----~~~r~gQ~RGis~lapvl~~l~~l 261 (533) T protein:vir:34 198 ---SGAALGYYVSEDGYPGWMPQK------WTWIP---RELPGGRASFIHVF----EPVEDGQTRGANVFYSVMEQMKML 261 (533) T ss_pred ---CCCeEEEEEeecCCCCccccc------cceee---eeeccChhHeeeec----cccCCCcccCCchHHHHHHHHHHH Confidence 011122222222111110000 00000 01123333333332 233456678999999999999999 Q ss_pred HHHHHHHHHHHh-hccceeeechHHhccCCCCC-Cccc---------Cccc--c------ccccchh-hccccCCCCccc Q lcl|NC_012753. 270 NTTYDEFMWEVK-MGQRRVAVPTQMIKTEYDTN-GEKV---------TVKR--E------FETGHNV-YEQFDSGDMDKG 329 (502) Q Consensus 270 d~~~S~~~~~~~-~~~~~i~v~~~~l~~~~~~~-g~~~---------~~~~--~------~~~~~~~-~~~~~~~~~~~~ 329 (502) +.-...-..--. ...-..|| +...+.. +... .... . ++....+ ........-..+ T Consensus 262 ~~y~dael~~a~i~A~~a~fi-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pG 336 (533) T protein:vir:34 262 DTLQNTQLQSAIVKAMYAATI-----ESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPG 336 (533) T ss_pred HHHHHHHHHHHHHhhhheeee-----ecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCceeeecCCC Confidence 865433322111 11112222 2111100 0000 0000 0 0000000 000000000112 Q ss_pred cceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH Q lcl|NC_012753. 330 IGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLK-ELVISI 408 (502) Q Consensus 330 ~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~i 408 (502) .-|+.+++.-+..+|..-+..+++.|....|+|+..+..+-+++ |=.++++.....-......+..|...+. .+.+.. T Consensus 337 e~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~-nYSS~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~w 415 (533) T protein:vir:34 337 DSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQM-SYSTARASANESWAYFMGRRKFVASRQASQMFLCW 415 (533) T ss_pred CeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23677777777778888888999999999999999987765432 1112222222233333333333433332 233322 Q ss_pred HHHHHhhc---ccCCCccc-------ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHH Q lcl|NC_012753. 409 LELAKVYN---LYTGEIPT-------MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQ 478 (502) Q Consensus 409 l~~~~~~~---~~~~~~~~-------~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ 478 (502) +..+-+.+ +.++...+ +..+....+.-.-+|+.++++....++.+|+.|.+..+++. |.+-+++.++++ T Consensus 416 l~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~G~~s~~~~~a~~-G~D~~ev~~q~a 494 (533) T protein:vir:34 416 LEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEAGLSTYEKECAKR-GDDYQEIFAQQV 494 (533) T ss_pred HHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHH Confidence 32222111 11111100 11233333555678999999999999999999999988875 776666544444 Q ss_pred HHHHhhh---cccCCCCCccccCCCC----C Q lcl|NC_012753. 479 KINDETM---VSTDSFRTSEEVDIYG----E 502 (502) Q Consensus 479 ri~~E~~---~~~~~~~~~~~~~~~g----~ 502 (502) +-++... -..+.. +.....-| + T Consensus 495 ~e~~~~~~~gl~~~~~--~~~~~~s~~~~~~ 523 (533) T protein:vir:34 495 RETMERRAAGLKPPAW--AAAAFESGLRQST 523 (533) T ss_pred HHHHHHHhcCCCCCCC--CCcCccCCCCCCC Confidence 3221111 111110 10001101 1 No 112 >protein:vir:100920 Length: 725 # NCBI annotation: Gp1 # Family: family:all:487 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006406;genbank:gi:46358698;genbank:GeneID:2777070 Probab=98.94 E-value=1.2e-08 Score=63.96 Aligned_cols=464 Identities=9% Similarity=-0.016 Sum_probs=212.1 Q ss_pred CC----hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MG----IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~----~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |+ .+.+++.+++.- +...++-.....+..+||.|.++.-.-.. ......+..+|.- T Consensus 1 m~d~~~~~~~~~~~~~~~-----------------~~~~~~~R~~a~~d~~fy~G~QW~~~~~~---~l~~q~rp~~N~i 60 (725) T protein:vir:10 1 MADNENRLESILSRFDAD-----------------WTASDEARREAKNDLFFSRVSQWDDWLSQ---YTTLQYRGQFDVV 60 (725) T ss_pred CCchHHHHHHHHHHHHHH-----------------HHhhHHHHHHHHHHHHhhcCCCCCHHHHH---HHHhcCCCcccch Confidence 44 344444444431 11234455677888899999877422111 0111112246988 Q ss_pred HHHHHHHhhhhhcCcceEee-----CCHHHHHHHHHHH----hhccHHHHHHHHHHHHhhcCCEEEEEEEe---C----C Q lcl|NC_012753. 77 RTASKKVASLVFNEQATIRV-----DNEVADAFINETL----KNDKFSKNFERYLESCLALGGLAMRPYID---G----D 140 (502) Q Consensus 77 k~iv~~~a~~l~~ep~~i~~-----~d~~~~e~l~~~~----~~~~f~~~~~~~~~~~~~~G~~~~~~~~d---~----~ 140 (502) +.+|+...++--...+.+.+ +|...++.|+.++ +.++.......+...+++.|.+|+.+.+| + + T Consensus 61 ~~~v~~v~g~e~~nr~d~~v~p~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~ev~~d~~~~d~~~~ 140 (725) T protein:vir:10 61 RPVVRKLVSEMRQNPIDVLYRPKDGASPDAADVLMGMYRTDMRHNTAKIAVNIAVREQIEAGVGAWRLVTDYEDQSPTSN 140 (725) T ss_pred HHHHHHHHhhHHhCCcceEEecCCcchHHHHHHHHHHHHHHHHhcCcchHHhHHHHHHhhcCcceeeeeccccCCCCCCC Confidence 99999999998888888776 2445556555544 45778888889999999999999998654 1 1 Q ss_pred ceEEEEE----cCCeEEEEEEcCC----CeEEEEEEEEEEEeeC------------------------------CCceEE Q lcl|NC_012753. 141 QIRVSFV----QATVFFPLQANTQ----DVSSAAIVTKSTKTEG------------------------------QKVKYY 182 (502) Q Consensus 141 ~~~i~~v----~~~~~~Pi~~d~~----~~~~~~~~~~~~~~~~------------------------------~~~~~y 182 (502) .++|..+ |+.++| +|.. +...+-++.+..+.+. .+.... T Consensus 141 ~~~i~~~~i~~~~~~v~---~Dp~a~~~D~sDar~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~v 217 (725) T protein:vir:10 141 NQVIRREPIHSACSHVI---WDSNSKLMDKSDARHCTVIHSMSQNGWDDFAEKYDLDADNIPSFQNPNDWVFPWLTQDTI 217 (725) T ss_pred ceeeeeeecccCHhHcc---cCchhhccChhhhhhhhhhccCCHHHHHHHHHhCCCcccccccccccccccccccCCCeE Confidence 3344433 444444 2211 1222222212111110 011122 Q ss_pred EEEEEEEEeCCeEEEEEEEEecCCccccCceeecc-----ccc-----cCC------------------CcceeecC--- Q lcl|NC_012753. 183 SLIEFHEWNKETYTISNELYESESKTIIGQRVPLS-----TLY-----EDL------------------EETVTLNG--- 231 (502) Q Consensus 183 t~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~-----~~~-----~~l------------------~~~~~~~~--- 231 (502) ++.|+|+..... -.+|...+ ...|..+.+. .+. +++ .+...+.+ T Consensus 218 rv~E~~~r~~~~----~~~~~~~d-~~~g~~~~~~~~~~~~~~~~~~~~g~~~~~~r~~~~~kv~~~~~~g~~~l~~~~~ 292 (725) T protein:vir:10 218 QIAEFYEVVEKK----ETAFIYQD-PVTGEPVSYFKRDIKDVIDDLADSGFIKIAERQIKRRRVYKSIITCTAVLKDKQL 292 (725) T ss_pred EEEEEEEEEEEe----eEEEEecc-CCCCceeecchhhhHHHHHHhhcccchhhhhccceeeEEEEEeecchhhhcCCCC Confidence 334444321110 01111111 0112111110 000 000 00000111 Q ss_pred --CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-hccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 232 --LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 232 --~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) ....||++|-- ......+.|++-+.+.++++.++.+|...|...+-+- ..+....++.+.+..... ... .++ T Consensus 293 ~~~~~fP~vP~~g--~r~~~~g~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~e~-~~~--~~~ 367 (725) T protein:vir:10 293 IAGEHIPIVPVFG--EWGFVEDKEVYEGVVRLTKDGQRLRNMIMSFNADIVARTPKKKPFFWPEQIAGFEH-MYD--GND 367 (725) T ss_pred CCCCceeEEEEEe--eeeccCCcceeeeeeccchhHHHHHHHHHHHHHHHHHhcCCccccccHhhhhHHHH-HHh--ccC Confidence 01123333210 0111245666668899999999999999999998774 455555665555532110 000 000 Q ss_pred ccccccchhhccccCCCCcc-ccceeeec-cccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHH Q lcl|NC_012753. 309 REFETGHNVYEQFDSGDMDK-GIGITDLT-TDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDT 386 (502) Q Consensus 309 ~~~~~~~~~~~~~~~~~~~~-~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l 386 (502) ..-....+ .+...++.. ...+..++ +.+ ..++...++.....|...+|++...+|..++ ..|+.+|..+.... T Consensus 368 ~~~~~~~~---~~~~~~g~~~~~~i~~~~~~~~-p~~~~~ll~~~~~~i~~~tGi~~~~lG~~~n-~~SG~ai~~rq~qg 442 (725) T protein:vir:10 368 DYPYYLLN---RTDENNGEMPTQPLAYYENPEV-PQANAYMLEAATAAVKEVATLGVDAEAVNGG-QVAYDTVNQLNMRA 442 (725) T ss_pred Cceeeecc---cccccCcccccccCcccCCCCc-hHHHHHHHHHHHHHHHHHhCCCHHHhCcCch-hhHHHHHHHHHHHH Confidence 00000000 000011110 11222222 233 3467788999899999999999999987654 35677777776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH----hhc---ccCCCcc-c------------------------ccceEEEeCC Q lcl|NC_012753. 387 YQMRNSIATLVEKSLKELVISILELAK----VYN---LYTGEIP-T------------------------MDEVSVDLDD 434 (502) Q Consensus 387 ~~~~~~~~~~~~~~l~~l~~~il~~~~----~~~---~~~~~~~-~------------------------~~~i~v~f~d 434 (502) ......+-..++.+.+...+.+|.+-. ..+ +.+.... . .+++.|+=.. T Consensus 443 ~~~l~~~~Dnl~~~~~~~g~~lL~lI~~~~~~er~~RI~~edg~~~~v~in~~~~d~~~G~~v~~Ndi~g~~Dv~v~~~p 522 (725) T protein:vir:10 443 DLETYVFQDNLATAMRRDGEIYQSIVNDIYDVPRNVTITLEDGSEKEVQLMAEVVDLATGERQVLNDIRGRYECYTDVGP 522 (725) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceeEeccccccccccchhhhhccccceeEEEeecc Confidence 655555555666666665555554422 111 1111000 0 0122222111 Q ss_pred CccCCHHHHHHHHHHHHhc-C-CCCH-HHHHHhcCCCCH-HHHHHHHHHHHHhhhcccCCCCC-ccccC----------- Q lcl|NC_012753. 435 GVFTDRNAEFDYWSKMVAA-G-FAPK-TMAIEKTLNVTK-EQAQEIYQKINDETMVSTDSFRT-SEEVD----------- 498 (502) Q Consensus 435 ~i~~d~~~~~~~~~~~~~~-G-i~S~-et~l~~~~~~~d-eea~~el~ri~~E~~~~~~~~~~-~~~~~----------- 498 (502) +.+.=.++.++.++++..+ + ..+. -..+..+.+..+ +-+++.+++|+....+.....+. +...- T Consensus 523 ~~~s~r~~~~~~l~qll~~~~~~~~~~~~~l~~~~~~~d~~~~~e~~erirkq~~~~~~~~~~~~e~~q~~~e~qq~~~~ 602 (725) T protein:vir:10 523 SFQSMKQQNRSEILELLGKTPQGTPEYQLLLLQYFTLLDGKGVEMMRDYANKQLIQMGVKKPETPEEQQWLVEAQQAKQG 602 (725) T ss_pred CcHHHHHHHHHHHHHHHHhccccchhHHHHHHHHhhcCCchhHHHHHHHHHhhhhhhccCCccccchhHHHHHHHHHHHh Confidence 1111133445555555432 1 1111 112222222221 22445566666544332211111 11000 Q ss_pred -------------CCCC Q lcl|NC_012753. 499 -------------IYGE 502 (502) Q Consensus 499 -------------~~g~ 502 (502) .-++ T Consensus 603 q~~~e~~q~~~~~~~~q 619 (725) T protein:vir:10 603 QQDPAMVQAQGVLLQGQ 619 (725) T ss_pred hhHHHHHHHHHHHHHHH Confidence 0000 No 113 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=98.92 E-value=1.4e-08 Score=63.69 Aligned_cols=430 Identities=10% Similarity=0.064 Sum_probs=191.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCc-----cccccCC----------Ccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDS-----VTYRDSN----------GSQ 65 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~-----~~~~~~~----------~~~ 65 (502) |+++.+ .++ .+.. -.+-....+-|.|-... +.....+ ..+ T Consensus 1 m~~~~~--~~~-----a~~~------------------~~~~~~~~~~y~aa~~~~~~~~~~~~s~d~~~~~~~~~lr~R 55 (495) T protein:vir:10 1 MNMTPS--GYQ-----SLAS------------------GLLVPVGASAYEGASGGHRWQDIGDYGPDTAVASGIQTLRAR 55 (495) T ss_pred CCcccc--ccc-----ccch------------------hhhhHHHhhhhhccccCcccCCCCCCChhHHHHHHHHHHHHH Confidence 776664 111 0000 00001111223331110 0000000 000 Q ss_pred ccccceecchHHHHHHHHhhhhhcCcceEee--CCHHHHHHHHHHHh----------hccHHHHHHHHHHHHhhcCCEEE Q lcl|NC_012753. 66 VKRDFNHLPIGRTASKKVASLVFNEQATIRV--DNEVADAFINETLK----------NDKFSKNFERYLESCLALGGLAM 133 (502) Q Consensus 66 ~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~--~d~~~~e~l~~~~~----------~~~f~~~~~~~~~~~~~~G~~~~ 133 (502) ..+-....++++-+|+.+++.++|..++... ++++.++.+++.|+ ..+|......++...+.-|.+++ T Consensus 56 aRdl~rNn~~a~~av~~~~~~vVG~Gi~p~~~~~~~~~~~~ie~~w~~wa~~~D~~g~~~f~~lq~l~~r~~~~dGE~f~ 135 (495) T protein:vir:10 56 SHHNVRNNPWATNAVATWVAAAVGNGLTPRWRMKEQELRQELQELWGDWVNEADFDEVQSFYGLQALVVRTVINSGEAFV 135 (495) T ss_pred HHHHHhcChHHHHHHHHHHHhhcCCCcccccCCchHHHHHHHHHHHHHhhcCcccccccCHHHHHHHHHHHHHhCCceEE Confidence 1111122368899999999999998776554 56666655555442 23677777778888899999988 Q ss_pred EEEEeC---C---ceEEEEEcCCeE-EEEEEc---CC-CeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEE Q lcl|NC_012753. 134 RPYIDG---D---QIRVSFVQATVF-FPLQAN---TQ-DVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELY 202 (502) Q Consensus 134 ~~~~d~---~---~~~i~~v~~~~~-~Pi~~d---~~-~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~ 202 (502) +..+.+ | .+++..++|+.+ -|.-.. .+ .+...+-+ ...+ + .+-|.++ T Consensus 136 ~~~~~~~~~g~~~~~~lqliepd~l~~~~~~~~~~~g~~i~~GIe~----d~~G-r-----------------~vaY~i~ 193 (495) T protein:vir:10 136 IKKPRPLSEGLSVPLQLQIIEPDMLASDIPDETLPSGGYVKGGIRF----SNGG-K-----------------RKAYCFY 193 (495) T ss_pred EEeecccCCCCccceEEEEechhhcCCCCCCCCCCCCCEEEeceEE----CCCC-c-----------------eEEEEEe Confidence 876642 2 369999999986 242110 11 12222211 0011 1 1112222 Q ss_pred ecCCccc--cCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH-HH Q lcl|NC_012753. 203 ESESKTI--IGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM-WE 279 (502) Q Consensus 203 ~~~~~~~--lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~-~~ 279 (502) +...++. .+.... +.. ++..-++++.+ ...+...|+|.|+.+.. +..++...+.-. .. T Consensus 194 ~~hpgd~~~~~~~~~----~~r---------vpA~~vlH~f~-----~r~gQ~RGis~la~i~~-l~~l~~y~dael~~a 254 (495) T protein:vir:10 194 RNHPAESSLIGDPVD----TVW---------IKAEHVLHVTV-----LTVRSDAGAPWFQLLLR-LNELDQYEDAELVRK 254 (495) T ss_pred ecCCCcccccccccc----eee---------echhheEeccc-----cCCCcccCcchhHHHHH-HHHhhHHHHHHHHHH Confidence 2211110 000000 001 11111222211 13456679999887665 455554332211 11 Q ss_pred HhhccceeeechHHhccCCCCCCcccCccccccccchhhccc---cCCCCccccceeeeccccchHHHHHHHHHHHHHHH Q lcl|NC_012753. 280 VKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQF---DSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFE 356 (502) Q Consensus 280 ~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~ 356 (502) .-...-..||- ...++..+.........+........+ ....-..+.-|+.++|.-+..++..-+..+++.|. T Consensus 255 ~i~A~~~~fi~----~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~ia 330 (495) T protein:vir:10 255 KTAALFAAFIQ----EATADSTGGPTIGQPKRSKGGKRITGLNPGTLQYLQPGQEVKFSNPADVGTTYEPWLRYQLLSIA 330 (495) T ss_pred HHhhhheeeee----cCCCccccccccCccccccCcccceecCCceeeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHH Confidence 11111122221 111111111110000000000000001 00011122237778887777788888999999999 Q ss_pred HhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHH-HHHH-HHHHHHHHHHHHHHhhccc--CCCcc-cccceEEE Q lcl|NC_012753. 357 MQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIAT-LVEK-SLKELVISILELAKVYNLY--TGEIP-TMDEVSVD 431 (502) Q Consensus 357 ~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~-~~~~-~l~~l~~~il~~~~~~~~~--~~~~~-~~~~i~v~ 431 (502) ...|+|+..+..+-+++. =.++++.....-......+. .+-. .++.+.+..+..+-+-+.. ++... ...-..+. T Consensus 331 aglGi~Ye~ltgD~s~~n-YSS~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~~a~l~G~i~~p~~~~~~~~~~~~~ 409 (495) T protein:vir:10 331 KGYGITYEMLTGDLRGVN-YSSIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMDFAVASGAVVIPDYLQRRRYYNRVS 409 (495) T ss_pred hhcCCCHHHHhccccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCchhhhHhhhccc Confidence 999999999876654421 11222222222233333332 1222 2233333333332221111 11000 00112344 Q ss_pred e--CCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHh---hhcccCC---C--------CCcc Q lcl|NC_012753. 432 L--DDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKINDE---TMVSTDS---F--------RTSE 495 (502) Q Consensus 432 f--~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~E---~~~~~~~---~--------~~~~ 495 (502) | +.-.-+|+.++++....++.+|+.|.+..+++. |.+-+++.+++++=++. ..-..+. . .... T Consensus 410 w~~p~~~~vDP~Ke~~A~~~~i~~G~~s~~~~~a~~-G~D~~~v~~q~a~e~~~~~~~Gl~~~~~p~~~~~~~~~~~~~~ 488 (495) T protein:vir:10 410 WRTPRWEEVDPLKKHLADLGDVRAGFAPISDKQAER-GYDMEELFDMISDANQLIDEYDLRLDSDPRYVNGSGAEQKSVM 488 (495) T ss_pred cccCCccccChHHHHHHHHHHHHcCCCCHHHHHHHc-CCCHHHHHHHHHHHHHHHHHcCCCCCCCCCcCCCccCCCCCCC Confidence 4 444567999999999999999999999988876 77766665444431111 1110010 1 1111 Q ss_pred ccCCCCC Q lcl|NC_012753. 496 EVDIYGE 502 (502) Q Consensus 496 ~~~~~g~ 502 (502) +.....| T Consensus 489 ~~~~~~e 495 (495) T protein:vir:10 489 EAALNNE 495 (495) T ss_pred CCCCCCC Confidence 1111111 No 114 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=98.92 E-value=1.5e-08 Score=63.55 Aligned_cols=452 Identities=10% Similarity=0.009 Sum_probs=197.3 Q ss_pred CChhHHHHHHHH-----------HHhhcccccchhhhhccccc-cCCHHHHHHHHHHHHHhcCCCCccccccCCCccccc Q lcl|NC_012753. 1 MGIIQTIKNFIK-----------RSNYVITNQSLNSITDHPKI-AISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKR 68 (502) Q Consensus 1 m~~~~~ik~~i~-----------~~~~~~~~~~l~~i~~~~~~-~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~ 68 (502) |....+.-..+. ...|-. ......-...|.. ..+.... +....+--.. +..+ T Consensus 2 ~~~~~r~~~~~a~~~~~~~~~~~~~~y~g-A~~~~r~~~~w~~~~~s~~~~--~~~~~~~lr~-------------RaRd 65 (553) T protein:vir:63 2 TKVTVRKLSEVTSGRPEQSASLGGGGLEG-ASRLSRETVSWNPSLRSPDAL--INPLKRIADA-------------RGRD 65 (553) T ss_pred cchhhhhhcccccccchhhhhhhcccccc-cccCCCcccccccCCCChHHH--HHHHHHHHHH-------------HHHH Confidence 332222211111 111100 0000001111211 1111110 0000000000 0001 Q ss_pred cceecchHHHHHHHHhhhhhcCcceEeeC-------------CHHHHHHH----HHHHh----------hccHHHHHHHH Q lcl|NC_012753. 69 DFNHLPIGRTASKKVASLVFNEQATIRVD-------------NEVADAFI----NETLK----------NDKFSKNFERY 121 (502) Q Consensus 69 ~~~~~n~~k~iv~~~a~~l~~ep~~i~~~-------------d~~~~e~l----~~~~~----------~~~f~~~~~~~ 121 (502) -....++++-+|+.+++.++|..++.... ++..++.+ +.|.+ ..+|......+ T Consensus 66 L~rNn~~a~~av~~~~~nvVG~Gi~~~~~~~~~~l~g~~~~~~~~~~~~ie~~w~~wa~~~~~~~D~~g~~~f~~~q~l~ 145 (553) T protein:vir:63 66 MADNDGFTNGAVGYQRDSIVGAQYRLNSMPDINVIPGATEEWAEEYQTIVEAKFELYAESLACYIDNAAISTFTGLIRLG 145 (553) T ss_pred HHhcChHHHHHHHHHHHhhccCCceeeeccchhhhcCCCHHHHHHHHHHHHHHHHHhcCCccceeeccccCCHHHHHHHH Confidence 11223788999999999999987776542 12223333 33322 12577777778 Q ss_pred HHHHhhcCCEEEEEEEeC--C---ceEEEEEcCCeEE-EEEE-cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCe Q lcl|NC_012753. 122 LESCLALGGLAMRPYIDG--D---QIRVSFVQATVFF-PLQA-NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKET 194 (502) Q Consensus 122 ~~~~~~~G~~~~~~~~d~--~---~~~i~~v~~~~~~-Pi~~-d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~ 194 (502) +...+.-|.+++++.|.+ + .+++..++|+.+- |.-. +.+.+...+ |+- ... T Consensus 146 ~r~~~~dGE~~~~~~~~~~~~~~~~~~lq~ie~drl~~~~~~~~~~~i~~GV-------------------E~d---~~G 203 (553) T protein:vir:63 146 VVGYVKTGEVLATAEWDRAANRPYATCFQMVSTDRLSNPYQQLDTPTLRRGV-------------------QYD---KRG 203 (553) T ss_pred HHHHHhCCceEEEeeeccCCCCcccceEEEechhhcCCCCCCCCCCeeEeee-------------------EEC---CCC Confidence 888899999999988853 3 4688999998853 2111 111111111 110 011 Q ss_pred EEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHH Q lcl|NC_012753. 195 YTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYD 274 (502) Q Consensus 195 ~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S 274 (502) -.+-|.+++...++..+. .....-|..++. ...++++-++.+ ......+...|+|.|+.++..+..|+.-.+ T Consensus 204 r~vaY~i~~~hPgd~~~~-~~~~~~~~r~~~---~~~v~a~~vlH~----f~~~r~gQ~RGis~lapvl~~l~~l~~y~d 275 (553) T protein:vir:63 204 RPQGYWIQVAHPGDLYQM-APDMYKWKFVQQ---SKPWGRRQVIHI----LEPREPDQSRGIADIVSGLKDMRMAKRFKE 275 (553) T ss_pred ceEEEEeeccCCCccccc-cccccceeeecc---ccccChhHheec----ccccCCCcccCCchHHHHHHHHHHHhHHHH Confidence 122222333222111100 000000111111 112333333322 233345667899999999999999986554 Q ss_pred HHHHH-Hhhccceeeech-----HHhccCCCCCCcc--cCccccc------c-ccchh--hccccCCCCccccceeeecc Q lcl|NC_012753. 275 EFMWE-VKMGQRRVAVPT-----QMIKTEYDTNGEK--VTVKREF------E-TGHNV--YEQFDSGDMDKGIGITDLTT 337 (502) Q Consensus 275 ~~~~~-~~~~~~~i~v~~-----~~l~~~~~~~g~~--~~~~~~~------~-~~~~~--~~~~~~~~~~~~~~i~~~~~ 337 (502) .-..- .-.+--..||-. ..........+.. ....... + ..... ........-..+.-++.++| T Consensus 276 aeL~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~i~~L~pGe~i~~~~p 355 (553) T protein:vir:63 276 MSLQNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQIDGAKIPHLFPGTKLNLKPM 355 (553) T ss_pred HHHHHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeecCceeeecCCCCeeeecCC Confidence 33322 112222233311 0110000000000 0000000 0 00000 00000000011223677778 Q ss_pred ccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHh Q lcl|NC_012753. 338 DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM--KTATEVVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKV 414 (502) Q Consensus 338 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~--~tAtei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~ 414 (502) .-+..+|..-+..+++.|....|+|+..+..+-+++ +|+-+-.....+.+ ...+..|...+. .+.+..|..+-+ T Consensus 356 ~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYSS~R~~~~e~~r~~---~~~q~~~~~~~~~pi~~~wl~~a~l 432 (553) T protein:vir:63 356 GTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYSSIQAGIAMTRRFL---EGRKKMCADRLATEFFTLWLEEAIA 432 (553) T ss_pred CCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777788888899999999999999999887765432 23333333333333 333333433333 344433333222 Q ss_pred hcc--cCCCcc-----------cccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHH-- Q lcl|NC_012753. 415 YNL--YTGEIP-----------TMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKEQAQEIYQK-- 479 (502) Q Consensus 415 ~~~--~~~~~~-----------~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~deea~~el~r-- 479 (502) -+. ..+... .+..++...+.-.-+|+.++++....++.+|+.|.+..+++. |.+-+++.+++++ T Consensus 433 ~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~G~~t~~~~~a~~-G~D~~~v~~q~a~e~ 511 (553) T protein:vir:63 433 AGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDAGLSTYEREIARL-GGDFRKSFAQRARED 511 (553) T ss_pred cCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHcCCCCHHHHHHHh-CCCHHHHHHHHHHHH Confidence 111 111110 011223333444567999999999999999999999988887 7766655444333 Q ss_pred --HHHhhhc--ccC------------CCCCccccCCC---CC Q lcl|NC_012753. 480 --INDETMV--STD------------SFRTSEEVDIY---GE 502 (502) Q Consensus 480 --i~~E~~~--~~~------------~~~~~~~~~~~---g~ 502 (502) +++--.. ..+ .....+..+=. || T Consensus 512 ~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 512 ALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 2221110 000 00001111111 11 No 115 >protein:vir:3139 Length: 599 # NCBI annotation: hypothetical protein # Family: family:all:1548 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640321;genbank:gi:21234402;genbank:GeneID:956054 Probab=98.92 E-value=8.6e-09 Score=64.80 Aligned_cols=472 Identities=10% Similarity=-0.007 Sum_probs=194.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcC--CCCccccccCCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAG--DFDSVTYRDSNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g--~~~~~~~~~~~~~~~~~~~~~~n~~k~ 78 (502) |++-.+--+-.+.. +--.+.-...+...| -.....+-...++|+..+.= ..+-..--+.+.+|+ +++.+|-.-. T Consensus 1 m~~~~~~~~~~~~~-~~~~~~~~~~v~~~~-~~~~~~r~~~~~~w~e~~~yi~~~~tr~t~~~~~~w~--~s~t~~k~~~ 76 (599) T protein:vir:31 1 MSTDIKTLQKMLEG-RDDDRAFIDELVVLF-TNMENARAQKDREDKELMDYIDATDTRKTSNSKLPFK--NSTTINKLAH 76 (599) T ss_pred CccchHHHHHHhhc-cCchHHHHHHHHHHH-HhhhhhhhhhhcccHHHHHHHhhhcccccccCCCCcc--cccchHHHHH Confidence 76643222221111 000001001111111 12223333444445433211 111111112233343 3444444444 Q ss_pred HHHHHhhhhhcCc-ce---Eee-----CC--HHHHH----HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe----- Q lcl|NC_012753. 79 ASKKVASLVFNEQ-AT---IRV-----DN--EVADA----FINETLKNDKFSKNFERYLESCLALGGLAMRPYID----- 138 (502) Q Consensus 79 iv~~~a~~l~~ep-~~---i~~-----~d--~~~~e----~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d----- 138 (502) +++.+..++++-- |+ +.+ ++ ....+ +++.=+...+|...+...+.+-..+|-++..+-+. T Consensus 77 ~~~~l~a~~~~~~fp~~~w~d~~~~~~~~~~~~~~~~i~~yi~~Kl~e~~~~~~~~~~v~d~i~~G~~vat~~~er~~~~ 156 (599) T protein:vir:31 77 LHLMITTSYMEHLLPNRNWVDFVGFDNDSVNAEKREIARSYVRGKVEASNLEGVIERMVDDFAVRGFCVAHTRHVKRMTV 156 (599) T ss_pred HHHHHHHHHHhhhcCCccceEeeecCCchhHHHHHHHHHHHhhhhhhhcchHHHHHHHHhhhcccCceeEeeeEEEccee Confidence 5666555555421 11 222 11 12233 44444566788999999999999999887766532 Q ss_pred --CC-------ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeC--------CCceEEEE--------EEEEEE--e Q lcl|NC_012753. 139 --GD-------QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEG--------QKVKYYSL--------IEFHEW--N 191 (502) Q Consensus 139 --~~-------~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~--------~~~~~yt~--------~E~h~~--~ 191 (502) ++ +|+++.|+|..+|| --+.+.+..+.|+.+.+...+ ....+|.+ -.+|.. . T Consensus 157 ~~d~~v~~~~~~P~~ervsP~Di~~-Dp~A~si~d~~fivRs~~Tk~~L~~l~~~~~~~~y~~d~~~~~~~~~~~~~~~~ 235 (599) T protein:vir:31 157 TAENQVIKNYSGTVTERLSPSDVFW-DVTADSLPKAAKCIRQLYTLGSLKREIEEGTFPLMSMEDFQKLREERRTIREAL 235 (599) T ss_pred ecccccccccccceEEeecccceee-CCCCCCCCcceeeeehhhhHHHHHHHhccCCccccchHHHHHHHhhccCCCccc Confidence 21 58999999999987 123355666677666544211 11111210 001100 0 Q ss_pred CCeEEEEEE--------EEecCCccccCceeecccccc--------CCCc--ceeecC-------------CCcceEEEe Q lcl|NC_012753. 192 KETYTISNE--------LYESESKTIIGQRVPLSTLYE--------DLEE--TVTLNG-------------LTRPLFTYL 240 (502) Q Consensus 192 ~~~~~I~~~--------l~~~~~~~~lG~~v~l~~~~~--------~l~~--~~~~~~-------------~~~~~f~~~ 240 (502) -+.|.-.+. .+....+..-| .|..-+.|- ++.. ..++.| .++-||+.. T Consensus 236 ~d~~~~~~g~D~~~~d~~~~~~eY~~~~-~VevLeywGd~ydee~d~~~~~~ViTi~g~~~liR~e~np~~~g~~Pyvv~ 314 (599) T protein:vir:31 236 ADGYNGRRKFDSLHKKGYGSMMNYINEG-VVEVLTFMGDFYDEENDELWNNYEITVIDRKIIGRKQSKDTWDGSQNLHIA 314 (599) T ss_pred cchhhhhhhccccccccccchhhhcccc-hhhhhhhhhhhhcccCCccccceEEEEecCcEEeecccCCCCCCCCCeEEE Confidence 000000000 00000000000 111111111 0100 111112 122233321 Q ss_pred cCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCC--CcccCccccccccchhh Q lcl|NC_012753. 241 KPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTN--GEKVTVKREFETGHNVY 318 (502) Q Consensus 241 ~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~--g~~~~~~~~~~~~~~~~ 318 (502) . -.....+.||.+.+..+.++++.||.++....+.+...-+.+++ ...+-. ...+.|++.+...+ T Consensus 315 ~----~~P~~~~~yG~G~l~~~~gaQ~~lN~~~Ng~iD~~~~~l~p~l~------~~~dl~~eD~~~~P~~v~~~~d--- 381 (599) T protein:vir:31 315 V----YEFQKDTLCPIGPLHRLTGMQYKLDKRENFREDLHDRFLHPSLK------KVGDVREKGMRGGPNHVFEVEE--- 381 (599) T ss_pred E----eeeeccccCCCCCchhcchHHHHHHHHHHHhhhhhhhhhccccc------ccccccccCccCCCCcceeecC--- Confidence 1 11134578999999999999999999988888765433232332 222211 22233443333221 Q ss_pred ccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 319 EQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVE 398 (502) Q Consensus 319 ~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~ 398 (502) ...++.+.|....-+-..-+..+.......+|+++.+.|..+.|..||+++..+-...-.....+.+.|. T Consensus 382 ----------~~~vq~~~p~s~~~~a~~~is~~e~~mee~sGvp~~~~G~~~ag~~TA~~is~l~naa~~~~~~~vr~~e 451 (599) T protein:vir:31 382 ----------TGDVQYMTPPAEVLQPDNQLSITLQLMEDLSGAPKESIGQRTAGEKTKFEVQLLDQGQNKVFRRKVKKFE 451 (599) T ss_pred ----------CCccccccCchhhhhHHHHHHHHHHHHHHhhccchhhcCCcccchhhHHHHHHHHhhhhhhHHHHHHHHH Confidence 1123333333221122223444455567788999999998888889999998877777677777788886 Q ss_pred HHHHH-HHHHHHHHHHhhcccCCCc---ccc----cceEEEeCC------Ccc------CCHHHHHHHHHHHHhc----C Q lcl|NC_012753. 399 KSLKE-LVISILELAKVYNLYTGEI---PTM----DEVSVDLDD------GVF------TDRNAEFDYWSKMVAA----G 454 (502) Q Consensus 399 ~~l~~-l~~~il~~~~~~~~~~~~~---~~~----~~i~v~f~d------~i~------~d~~~~~~~~~~~~~~----G 454 (502) ..+-+ |++.++......--..+.+ .++ .=++|.=+| .++ ..++...+...+...+ + T Consensus 452 ~~~lepll~~l~e~~~~f~D~~~tiri~~~e~~~~~f~~i~redl~~~~~~v~~Ga~~v~ere~~~q~l~~il~~~~~q~ 531 (599) T protein:vir:31 452 RELLTPVLNDYLEQGRNHLDASDTIKTFNSELGTATFLDITADDLNLNGQMVAQGATLFAEKANTLQNLNAILGGPLGAA 531 (599) T ss_pred HHHHHHHHHHHHHHHHhhcccccceeeecccccceeeEEeehhhhhCCeeeeechhhHHHHHHHHHHHHHHHhcccCCCc Confidence 66544 8887776654210000000 000 001111000 011 1122222333333311 1 Q ss_pred C---CCHHH---HH---HhcCCCC---HH----HHHHH--HHH--HHHhhh-cccCCCCCccccCCCC Q lcl|NC_012753. 455 F---APKTM---AI---EKTLNVT---KE----QAQEI--YQK--INDETM-VSTDSFRTSEEVDIYG 501 (502) Q Consensus 455 i---~S~et---~l---~~~~~~~---de----ea~~e--l~r--i~~E~~-~~~~~~~~~~~~~~~g 501 (502) + ++... ++ ..+|.+. +. |.+.+ |.+ +++++. +...+..+.+-+|-.- T Consensus 532 ~~P~~~~k~l~~~l~~~~~l~~~~~~~~~va~~eqq~~~~m~Q~~lq~~~~~~~~~~~~~~~~~~~~~ 599 (599) T protein:vir:31 532 LAPHMSRTKLFNAVEYLGDLDAYGIFTFGIGVQEDQQLARMAQKSTQQTEETALTQEEVGGPTTDTGQ 599 (599) T ss_pred cchhhHHHHHHHHHHHHHhccccccCCCchhHHHHHHHHHHHHHHHHHhHhhhhhhhhcCCCCcccCC Confidence 1 22211 11 1222221 11 11222 111 111111 1111111111111111 No 116 >protein:vir:102668 Length: 547 # NCBI annotation: Hypothetical protein # Family: family:all:481 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_024419;genbank:gi:48696640;genbank:GeneID:2948135 Probab=98.90 E-value=1.7e-08 Score=63.19 Aligned_cols=444 Identities=9% Similarity=-0.018 Sum_probs=191.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccc-c-c--CCC--ccccccceecc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTY-R-D--SNG--SQVKRDFNHLP 74 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~-~-~--~~~--~~~~~~~~~~n 74 (502) |..-+.+|+| . .+..++-.....|+.+|.=-.+-+.. . . ..+ ......++--+ T Consensus 1 ~~~~~l~~r~--------------------~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~i~ds 59 (547) T protein:vir:10 1 MENSKIVKRL--------------------D-FLKTDRKNVEQIWDCIRKYIMPMRSDFFSDLRSEGSINWNQNREVFDS 59 (547) T ss_pred CCHHHHHHHH--------------------H-HHHHHhhHHHHHHHHHHHHhcccccccccCCCCCcccccccccccccc Confidence 4443333333 1 12223333444554443221111100 0 0 001 01122334446 Q ss_pred hHHHHHHHHhhhhhcC--cce-----EeeCC------HHHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 75 IGRTASKKVASLVFNE--QAT-----IRVDN------EVADA-------FINETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 75 ~~k~iv~~~a~~l~~e--p~~-----i~~~d------~~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) .+...|+.+|+.|.+- ||. +.+.| ..+.+ .+.+.+...+|...+.++..+..+.|.+.+. T Consensus 60 t~~~a~~~Las~L~~~ltPp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~i~~~l~~snf~~~~~~~~~~L~~~G~a~l~ 139 (547) T protein:vir:10 60 TAGDGLETLSSSLHGSLTSPATKWFELAFRDKELNSDDECRKWLENATHDVYSALQDSNFNLEANETYIDLCGYGNAIMV 139 (547) T ss_pred hHHHHHHHHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEE Confidence 7777788877777652 211 23322 22333 3445677789999999999999999999887 Q ss_pred EEEeC---CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee---------------------CCCceEEEEEEEEEE Q lcl|NC_012753. 135 PYIDG---DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE---------------------GQKVKYYSLIEFHEW 190 (502) Q Consensus 135 ~~~d~---~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~---------------------~~~~~~yt~~E~h~~ 190 (502) +-.|+ +.+++..+|..+++-- .|..+....+| +++...- .+.+.+... T Consensus 140 ~~~d~~~~~~~r~~~~pl~~~~v~-~d~~G~v~~i~-r~~~~t~~qi~~~fg~~~l~~~v~~~~~~~~~~~~~~------ 211 (547) T protein:vir:10 140 EEEDEDEEGSVVFQSSPIQDSYFE-EDSRGQVVNFY-RVFRWTPAQIYDRFGDEGTPEAIIKKAKEASNQAALK------ 211 (547) T ss_pred eccCCCCCCceeEEEeecceEEEe-eCCCcCeeeee-eeeeccHHHHHHhcCcccCCHHHHHHHhcCCCcccce------ Confidence 76664 4688999999997764 45444444433 3322100 011110001 Q ss_pred eCCeEEEEEEEEecCCcc--cc------CceeeccccccCCCc---ceeecCCCcceEEEecCCccccccccCcCCcchh Q lcl|NC_012753. 191 NKETYTISNELYESESKT--II------GQRVPLSTLYEDLEE---TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIF 259 (502) Q Consensus 191 ~~~~~~I~~~l~~~~~~~--~l------G~~v~l~~~~~~l~~---~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~ 259 (502) +.+.|.+|...+.. .. ....|..++|-...+ -....|+..-||++++-+ ...++.||+|-- T Consensus 212 ----~~v~~~v~~~~~~~~~~~~~~~~~~~~~p~~s~~~e~~~~~~~l~esg~~e~P~~~~Rw~----~~~ge~YGrgp~ 283 (547) T protein:vir:10 212 ----QEVVMCVFTRYDKKQNRNAGTVLAPTERPFGKKWILKEGAVQLGEEGGYYEMPAYAIRWR----KSAGSQWGFGPS 283 (547) T ss_pred ----EEEEEEEeeccCCCCCccccceeeccccceeEEEEEecCceeeeecCCcccCCeeeeeee----ecCCcccccchH Confidence 22223333211110 00 011222233211111 112233444566555432 234678999999 Q ss_pred hhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccc Q lcl|NC_012753. 260 DNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTD 338 (502) Q Consensus 260 ~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 338 (502) ..+.+-+..||..--..+...+. .+..+.||.+.+.. .....++... + . +....++.++.- T Consensus 284 ~~~l~D~k~L~~l~~~~l~~~~~~~~pp~~v~~~g~~~-----~~~~~pgg~~-----~-----~---~~~~~v~pl~~~ 345 (547) T protein:vir:10 284 HLALPDVLTANRYVELVLRSSEKVIDPAIMVTERGLIS-----DIDLGASGLT-----V-----V---RDMESMKPFESR 345 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCceecccccccc-----cceecCCeee-----e-----c---CCcccceeeecc Confidence 99999999999888777766653 34444454332211 1111111111 0 0 111233333322 Q ss_pred cchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcc Q lcl|NC_012753. 339 IRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV-EKSLKELVISILELAKVYNL 417 (502) Q Consensus 339 ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~ 417 (502) .+...-...++.+.+.|.... =...|...+....|||||....+.+.+..+-.-..+ ...|..|+.-++.++.-.+. T Consensus 346 ~~~~~~~~~i~~~~~rI~~af--~~d~~~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~ 423 (547) T protein:vir:10 346 ARFDVSSIQLTDLRSAVRRIY--YVDQLQMKDSPAMTATEVQVRYELMQRLLGPTLGRLENDFLSPMIQRTFNIRFRAGK 423 (547) T ss_pred cchHHHHHHHHHHHHHHHHHh--hhhhhhcCCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 222222333444444443321 111123334456799999988877776655533222 33444555444444443333 Q ss_pred cCCCcc-----cccceEEEeCCCccCCH----HHHHHHHHHHHh--cCC-------CCHHHHH---HhcCCC------CH Q lcl|NC_012753. 418 YTGEIP-----TMDEVSVDLDDGVFTDR----NAEFDYWSKMVA--AGF-------APKTMAI---EKTLNV------TK 470 (502) Q Consensus 418 ~~~~~~-----~~~~i~v~f~d~i~~d~----~~~~~~~~~~~~--~Gi-------~S~et~l---~~~~~~------~d 470 (502) .+.... ....++|++-..+-... .+.+.+..+.+. +++ +....++ ....|+ ++ T Consensus 424 lP~~p~~l~~~~~~~~~v~~is~Laraq~~~~~~~i~~~~~~v~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~ 503 (547) T protein:vir:10 424 LGELPSKLLESGKAAMDIVYTGPLSRAQKIDQAASIERWAGSTAQLAEINPEVLDIPDWDEMVRMLGSLLGAPQTLMRPK 503 (547) T ss_pred CCCCchhhhccCcceEEEEeccHHHHHHHHHHHHHHHHHHHHHHHhhccChhhhhcCCHHHHHHHHHHHhCCChhccCCH Confidence 332111 11245566543322211 111112222221 122 2223332 233454 44 Q ss_pred HHHHHHHHHHH-Hhhh----cc-------cCCCCCccccCCCCC Q lcl|NC_012753. 471 EQAQEIYQKIN-DETM----VS-------TDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea~~el~ri~-~E~~----~~-------~~~~~~~~~~~~~g~ 502 (502) +|+++..++-. +++. +. ++.. +.+++-|-=. T Consensus 504 eev~~~r~qr~~~~q~~~qaa~~~~~g~~m~~~-~~~~a~~~~~ 546 (547) T protein:vir:10 504 AKVTSIRKNRSQTQQKAEQAAIAEAEGNAMEAQ-GKGQAALKEN 546 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cCcccchhcc Confidence 55543322211 1111 00 0000 0000000000 No 117 >protein:vir:3520 Length: 720 # NCBI annotation: P19 # Family: family:all:487 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050980;genbank:gi:9633566;genbank:GeneID:1262313 Probab=98.87 E-value=2.3e-08 Score=62.49 Aligned_cols=469 Identities=10% Similarity=-0.010 Sum_probs=204.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhc--CCCCcccc---ccCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFA--GDFDSVTY---RDSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~--g~~~~~~~---~~~~~~~~~~~~~~~n~ 75 (502) |+ +.+++++++...++ .. .....++-..+..+.++||. |+++.-.- ++..-....++.+.+|. T Consensus 1 ma--~~~~~~l~~~~~~~-----~~-----~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~~~~~l~~~~~P~~~~N~ 68 (720) T protein:vir:35 1 MA--ETLQKRHEQIMRKF-----DR-----AHSPQEAVREKCLEATRFARVPGGQWEGATAAGSELGKHFEKYPKFEINK 68 (720) T ss_pred Cc--hHHHHHHHHHHHHH-----HH-----HHhhhHHHHHHHHHHHhhhccCCCCCCHHHHHHHHHHHhhCCCCeEEEcc Confidence 32 23334433321110 00 11112333445666778875 65553111 01001122345688899 Q ss_pred HHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHHH----HhhccHHHHHHHHHHHHhhcCCEEEEEEEe------C Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD------NEVADAFINET----LKNDKFSKNFERYLESCLALGGLAMRPYID------G 139 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~----~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d------~ 139 (502) -+.+|+..+++--...+.+.+. |...++.|+.+ .+.++.......+...+++.|-+|+.+++| + T Consensus 69 i~~~v~~v~g~~~~nr~d~~v~P~~~~~d~~~Ae~l~~~~~~~~~~~~~~~~~s~Af~~~i~~G~G~~~v~~d~~~~~d~ 148 (720) T protein:vir:35 69 ISTELNRIISEYRHNRITVKFRPGDKTASEALANKLNGLFRADYEETDGGEACDNAFDDGSTGGFGCFRLTTNLVNALDP 148 (720) T ss_pred HHHHHHHHHhHHHhCCCceEEEcCCCcchHHHHHHHHHHHHHHHHhcCchHHHhHHHHHhhhccceeEEeeecccccCCC Confidence 9999999999998888887762 33445555554 446788888899999999999999999874 1 Q ss_pred ----CceEEEEE--cCCeEEE--EEEcCCCeEEEEEEEEEEEee--------------------------CCCceEEEEE Q lcl|NC_012753. 140 ----DQIRVSFV--QATVFFP--LQANTQDVSSAAIVTKSTKTE--------------------------GQKVKYYSLI 185 (502) Q Consensus 140 ----~~~~i~~v--~~~~~~P--i~~d~~~~~~~~~~~~~~~~~--------------------------~~~~~~yt~~ 185 (502) +.+++..| |+.+++. -... -+...+-++.+..+.+ .-.....++. T Consensus 149 ~~~~~~i~i~~v~~~~~~v~~Dp~a~~-~D~sDar~~~~~~~~~~d~~~~~yp~~a~~~~~~~~~~~~~d~~~~~~v~i~ 227 (720) T protein:vir:35 149 MDERQRICLEPIYDPARSVWFDPDAKK-YDKSDAEWAFCMYSLSAEKYKAEYNKDPATLMSGIERSWDYDWYDVDVVYIA 227 (720) T ss_pred CcccceeeEecccCchhheeecccccc-cChhhhhhhhhhcCCCHHHHHHhCCCccccccccccccccccccCCCceEEE Confidence 13445443 3334432 1111 1122222221111100 0011123455 Q ss_pred EEEEEeCCeEEEEEEEEecCCccccCceeeccccc---------c-CCC-------c---ce--eecCC----------- Q lcl|NC_012753. 186 EFHEWNKETYTISNELYESESKTIIGQRVPLSTLY---------E-DLE-------E---TV--TLNGL----------- 232 (502) Q Consensus 186 E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~---------~-~l~-------~---~~--~~~~~----------- 232 (502) |+|...-...++ .++.... .|..+.+.+.. . +.. . .+ .+.|. T Consensus 228 E~~~~~~~~~~~--~~~~~~~---~g~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~v~~~~~~g~~~l~~~~~~p~ 302 (720) T protein:vir:35 228 KYYEVKKESVDV--VSFQNPL---TSETVTYDSDQLELVEDELADIGFIEAARRTIKRRRVYVSVVDGEGFLEKAQRIPG 302 (720) T ss_pred EeeEEEEEEEEE--EEeecCC---CCCeeecCCccHHHHHHHHhhhccccccccceeEEEEEEEeeccchhcccCCCCCC Confidence 554322211111 1111110 12211111000 0 000 0 00 01110 Q ss_pred CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCccc-Cccccc Q lcl|NC_012753. 233 TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKV-TVKREF 311 (502) Q Consensus 233 ~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~-~~~~~~ 311 (502) ...|+++|--- .....+.|..-+.+.++++.++.+|+..|.+.+-+-..+..+.. ...... .+-...+ +++ T Consensus 303 ~~fP~vP~~g~--r~~~d~~~~~~G~vr~~kd~Q~~~N~~~s~~~~~~~~~~~~~~~--~a~~~~-~~~~~~~a~~~--- 374 (720) T protein:vir:35 303 EHIPLIPVYGK--RWFIDDIERVEGHIAKAMDAQRLYNLQVSMLADSATQDTGSIPI--VGKSQI-KTLEKYWANRN--- 374 (720) T ss_pred CccceEEEEee--eeccCCCcccceeeecchhHHHHHHHHHHHHHHHHHcCCccccc--cCcchH-HHHHHHhhccc--- Confidence 11133322100 00112344344678999999999999999999876443332221 100000 0000000 000 Q ss_pred cccchhhccccCC---CCc---cccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHH Q lcl|NC_012753. 312 ETGHNVYEQFDSG---DMD---KGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSD 385 (502) Q Consensus 312 ~~~~~~~~~~~~~---~~~---~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~ 385 (502) .....|..++.. .|. ....+...++.-....+...++.-...|....|++...+|..++ .||.+|..+... T Consensus 375 -~~~~~~l~~~~~~~~~G~~~~~~~~~~~~~~~~~~~~~~~llq~~~~~i~~vsGi~~~~lG~~sn--~SG~Ai~~rq~q 451 (720) T protein:vir:35 375 -KNRPAFLPLNEIVDKQGNIIAPPTPVGYTQPQPLNQAMAALLQQTGADIQEVTGSSQAMQPMPSN--IAKETVNHLMHR 451 (720) T ss_pred -cccccccccccccccCcccccCCCcccccCCCCCchHHHHHHHHHHHHHHHHhCCChHHcCcccc--hHHHHHHHHHHH Confidence 000111111000 110 01123333433334567888888889999999999999997543 578888877655 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----HHhh---cccCCCc-c-------------------------cccceEEEe Q lcl|NC_012753. 386 TYQMRNSIATLVEKSLKELVISILEL----AKVY---NLYTGEI-P-------------------------TMDEVSVDL 432 (502) Q Consensus 386 l~~~~~~~~~~~~~~l~~l~~~il~~----~~~~---~~~~~~~-~-------------------------~~~~i~v~f 432 (502) ........-..+..+.+..-+.+|.+ .... .+.+... . ..++|.|+= T Consensus 452 g~~~~~~~~Dnl~~~~~~~g~~lL~lI~~~y~~er~~RI~~ed~~~~~v~~n~~~~d~~~g~~v~~NDi~~g~yDv~v~~ 531 (720) T protein:vir:35 452 SDMSSFIYLDNMAKSLKRAGEVWLSMAREVYGSDRQVRIVNADGTDDIALMSVVINDNQTGQVVAMNDLSSGRYDVTVDV 531 (720) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCcceEeechhhhccCCCceeeeecceeeeeEEEEec Confidence 55554445555555555544444433 2211 1222100 0 001222222 Q ss_pred CCCccCCHHHHHHHHHHHHhcCCCCHHH--------HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCcc-c------- Q lcl|NC_012753. 433 DDGVFTDRNAEFDYWSKMVAAGFAPKTM--------AIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSE-E------- 496 (502) Q Consensus 433 ~d~i~~d~~~~~~~~~~~~~~Gi~S~et--------~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~-~------- 496 (502) ..+-+.-.++..+.++++. +.+++.. .+.++-++.- +++.++++++...+.....+... . T Consensus 532 ~p~~~s~req~~~~m~qll--~~~~p~~~~~~~~~~~ile~~d~p~--~~e~~erirk~~~~~~~~~~~~~e~qq~~a~~ 607 (720) T protein:vir:35 532 GPSYTARRDATVSVLTNLL--AGMLPQDPMRQVLQGIILDNMEGEG--LDEFKEYNRKQLLTQGVVKPRNTEEEQMVAQM 607 (720) T ss_pred ccCcccHHHHHHHHHHHHH--HhcCCCchhHHHHHHHHHHhcCchh--HHHHHHHHHhhcchhcccCccChhHHHHHHHH Confidence 2222233445555555554 2233221 1233334432 34445566554332211111000 0 Q ss_pred -----------cCCCCC Q lcl|NC_012753. 497 -----------VDIYGE 502 (502) Q Consensus 497 -----------~~~~g~ 502 (502) .-.-++ T Consensus 608 qq~~qq~~~e~~~aqa~ 624 (720) T protein:vir:35 608 IQQAQQPNAELVAAQGV 624 (720) T ss_pred HHHHHhHhHHHHHHHHH Confidence 000000 No 118 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.86 E-value=2.4e-08 Score=62.33 Aligned_cols=404 Identities=12% Similarity=0.101 Sum_probs=175.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||++|+.|+.-..+.. .. .+.++ ....+...|.|-....-. .. ..+-+.+.--..++ T Consensus 1 M~~~~r~~~~~~~~~r~~-~~---------~~~~~-----~~~~~~~~~~g~~~~~~~--v~----~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQT-SQ---------VIELN-----KDDEKLLEWLGISPSTIS--VK----GKNALKVATVFACI 59 (432) T ss_pred CChHHHHHHhcCccccCc-cc---------ccccC-----CchHHHHHHhCCCcCccc--cc----hhhhhccHHHHHHH Confidence 999999999863111100 00 00111 111111222231111000 00 01122223223455 Q ss_pred HHHhhhhhcCcceEeeCC-----HHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDN-----EVADAFINETLKN-----DKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQ 148 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d-----~~~~e~l~~~~~~-----~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~ 148 (502) +..|+-+-+=|+.+--.+ +.....|..+|+. -.....+..++...+..|.+|+.+..+. |.+ .+..++ T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 139 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 139 (432) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 666666655566542211 1122234444432 1334455666777888999999998875 443 667778 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++-+...+.+.... +...+|. .-.+ |....+ ++ T Consensus 140 ~~~v~v~~d~~~~~~~------------~~~~~y~-----~~~~------------------g~~~~~-------~~--- 174 (432) T protein:vir:10 140 ASKVTVYIDDVGLLNS------------KTKMWYV-----VNTG------------------GQQRVL-------KP--- 174 (432) T ss_pred CceeEEEEcCcccccc------------cceEEEE-----EecC------------------CeEEEE-------cc--- Confidence 8776654222111100 0000010 0000 111100 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) --.++++.+.. .+...|+|.+..+...++....+-....+-|..+...=.+ |......+.... T Consensus 175 ------~eiih~r~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gi----l~~~~~l~~e~~--- 237 (432) T protein:vir:10 175 ------EEILHFKNGIT----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGL----VQYVGDLNEDAK--- 237 (432) T ss_pred ------ccEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE----EEcCCCCCHHHH--- Confidence 01244543211 2345689998888777776665444444445544322121 222111110000 Q ss_pred cccccc-chhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHH Q lcl|NC_012753. 309 REFETG-HNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSE 382 (502) Q Consensus 309 ~~~~~~-~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~ 382 (502) ..+... ...+...... --+.+.-++.++......++.+..+...++|+...|+|+..+|....+. +++.+.... T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~ 317 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ 317 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH Confidence 000000 0111110000 0011223555555555667778888889999999999999998754432 233322211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 383 QSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAI 462 (502) Q Consensus 383 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 462 (502) + .+.+|..++..|-...+..-+..........+.++++.-+..|..+.++...+++.+|+++.-+++ T Consensus 318 ~-------------~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R 384 (432) T protein:vir:10 318 F-------------YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEAR 384 (432) T ss_pred H-------------HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHH Confidence 1 223333333322221111001111112223355556666677999999999999999999999876 Q ss_pred HhcCCCCHHH-HHHH-----HHHHHH--hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 463 EKTLNVTKEQ-AQEI-----YQKIND--ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 463 ~~~~~~~dee-a~~e-----l~ri~~--E~~~~~~~~~~~~~~~~~g~ 502 (502) +. .|+..-+ .++. +..+.+ +. ...+...+.....-++| T Consensus 385 ~~-~g~~pi~ggD~~~~~~n~~~~~~~~~~-~~k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 385 SK-EDLPPEAGGDRLLVNGNMLPIDMAGQA-YLKGGDTNGEVSKEGNE 430 (432) T ss_pred HH-hCCCCCCCCCeEeecccccchhhcccc-ccCCCCCCCCCCCCCCC Confidence 54 3443210 1100 111110 00 00011111222222222 No 119 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.86 E-value=2.4e-08 Score=62.33 Aligned_cols=404 Identities=12% Similarity=0.101 Sum_probs=175.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||++|+.|+.-..+.. .. .+.++ ....+...|.|-....-. .. ..+-+.+.--..++ T Consensus 1 M~~~~r~~~~~~~~~r~~-~~---------~~~~~-----~~~~~~~~~~g~~~~~~~--v~----~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQT-SQ---------VIELN-----KDDEKLLEWLGISPSTIS--VK----GKNALKVATVFACI 59 (432) T ss_pred CChHHHHHHhcCccccCc-cc---------ccccC-----CchHHHHHHhCCCcCccc--cc----hhhhhccHHHHHHH Confidence 999999999863111100 00 00111 111111222231111000 00 01122223223455 Q ss_pred HHHhhhhhcCcceEeeCC-----HHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDN-----EVADAFINETLKN-----DKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQ 148 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d-----~~~~e~l~~~~~~-----~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~ 148 (502) +..|+-+-+=|+.+--.+ +.....|..+|+. -.....+..++...+..|.+|+.+..+. |.+ .+..++ T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 139 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 139 (432) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 666666655566542211 1122234444432 1334455666777888999999998875 443 667778 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++-+...+.+.... +...+|. .-.+ |....+ ++ T Consensus 140 ~~~v~v~~d~~~~~~~------------~~~~~y~-----~~~~------------------g~~~~~-------~~--- 174 (432) T protein:vir:10 140 ASKVTVYIDDVGLLNS------------KTKMWYV-----VNTG------------------GQQRVL-------KP--- 174 (432) T ss_pred CceeEEEEcCcccccc------------cceEEEE-----EecC------------------CeEEEE-------cc--- Confidence 8776654222111100 0000010 0000 111100 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) --.++++.+.. .+...|+|.+..+...++....+-....+-|..+...=.+ |......+.... T Consensus 175 ------~eiih~r~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gi----l~~~~~l~~e~~--- 237 (432) T protein:vir:10 175 ------EEILHFKNGIT----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGL----VQYVGDLNEDAK--- 237 (432) T ss_pred ------ccEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE----EEcCCCCCHHHH--- Confidence 01244543211 2345689998888777776665444444445544322121 222111110000 Q ss_pred cccccc-chhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHH Q lcl|NC_012753. 309 REFETG-HNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSE 382 (502) Q Consensus 309 ~~~~~~-~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~ 382 (502) ..+... ...+...... --+.+.-++.++......++.+..+...++|+...|+|+..+|....+. +++.+.... T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~ 317 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ 317 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH Confidence 000000 0111110000 0011223555555555667778888889999999999999998754432 233322211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 383 QSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAI 462 (502) Q Consensus 383 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 462 (502) + .+.+|..++..|-...+..-+..........+.++++.-+..|..+.++...+++.+|+++.-+++ T Consensus 318 ~-------------~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R 384 (432) T protein:vir:10 318 F-------------YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEAR 384 (432) T ss_pred H-------------HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHH Confidence 1 223333333322221111001111112223355556666677999999999999999999999876 Q ss_pred HhcCCCCHHH-HHHH-----HHHHHH--hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 463 EKTLNVTKEQ-AQEI-----YQKIND--ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 463 ~~~~~~~dee-a~~e-----l~ri~~--E~~~~~~~~~~~~~~~~~g~ 502 (502) +. .|+..-+ .++. +..+.+ +. ...+...+.....-++| T Consensus 385 ~~-~g~~pi~ggD~~~~~~n~~~~~~~~~~-~~k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 385 SK-EDLPPEAGGDRLLVNGNMLPIDMAGQA-YLKGGDTNGEVSKEGNE 430 (432) T ss_pred HH-hCCCCCCCCCeEeecccccchhhcccc-ccCCCCCCCCCCCCCCC Confidence 54 3443210 1100 111110 00 00011111222222222 No 120 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.86 E-value=2.4e-08 Score=62.33 Aligned_cols=404 Identities=12% Similarity=0.101 Sum_probs=175.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||++|+.|+.-..+.. .. .+.++ ....+...|.|-....-. .. ..+-+.+.--..++ T Consensus 1 M~~~~r~~~~~~~~~r~~-~~---------~~~~~-----~~~~~~~~~~g~~~~~~~--v~----~~~al~~~~v~~~i 59 (432) T protein:vir:10 1 MKIVDSVKKFFNFEKRQT-SQ---------VIELN-----KDDEKLLEWLGISPSTIS--VK----GKNALKVATVFACI 59 (432) T ss_pred CChHHHHHHhcCccccCc-cc---------ccccC-----CchHHHHHHhCCCcCccc--cc----hhhhhccHHHHHHH Confidence 999999999863111100 00 00111 111111222231111000 00 01122223223455 Q ss_pred HHHhhhhhcCcceEeeCC-----HHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDN-----EVADAFINETLKN-----DKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQ 148 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d-----~~~~e~l~~~~~~-----~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~ 148 (502) +..|+-+-+=|+.+--.+ +.....|..+|+. -.....+..++...+..|.+|+.+..+. |.+ .+..++ T Consensus 60 ~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 139 (432) T protein:vir:10 60 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 139 (432) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 666666655566542211 1122234444432 1334455666777888999999998875 443 667778 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++-+...+.+.... +...+|. .-.+ |....+ ++ T Consensus 140 ~~~v~v~~d~~~~~~~------------~~~~~y~-----~~~~------------------g~~~~~-------~~--- 174 (432) T protein:vir:10 140 ASKVTVYIDDVGLLNS------------KTKMWYV-----VNTG------------------GQQRVL-------KP--- 174 (432) T ss_pred CceeEEEEcCcccccc------------cceEEEE-----EecC------------------CeEEEE-------cc--- Confidence 8776654222111100 0000010 0000 111100 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) --.++++.+.. .+...|+|.+..+...++....+-....+-|..+...=.+ |......+.... T Consensus 175 ------~eiih~r~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gi----l~~~~~l~~e~~--- 237 (432) T protein:vir:10 175 ------EEILHFKNGIT----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGL----VQYVGDLNEDAK--- 237 (432) T ss_pred ------ccEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE----EEcCCCCCHHHH--- Confidence 01244543211 2345689998888777776665444444445544322121 222111110000 Q ss_pred cccccc-chhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHH Q lcl|NC_012753. 309 REFETG-HNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSE 382 (502) Q Consensus 309 ~~~~~~-~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~ 382 (502) ..+... ...+...... --+.+.-++.++......++.+..+...++|+...|+|+..+|....+. +++.+.... T Consensus 238 ~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~ 317 (432) T protein:vir:10 238 KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ 317 (432) T ss_pred HHHHHHHHHHhcccccCCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH Confidence 000000 0111110000 0011223555555555667778888889999999999999998754432 233322211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 383 QSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAI 462 (502) Q Consensus 383 ~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 462 (502) + .+.+|..++..|-...+..-+..........+.++++.-+..|..+.++...+++.+|+++.-+++ T Consensus 318 ~-------------~~~~l~P~~~~ie~~ln~kLl~~~~~~~g~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R 384 (432) T protein:vir:10 318 F-------------YTDTLQATLTMYEQEMTYKLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNEAR 384 (432) T ss_pred H-------------HHHHHHHHHHHHHHHHHHhhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHH Confidence 1 223333333322221111001111112223355556666677999999999999999999999876 Q ss_pred HhcCCCCHHH-HHHH-----HHHHHH--hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 463 EKTLNVTKEQ-AQEI-----YQKIND--ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 463 ~~~~~~~dee-a~~e-----l~ri~~--E~~~~~~~~~~~~~~~~~g~ 502 (502) +. .|+..-+ .++. +..+.+ +. ...+...+.....-++| T Consensus 385 ~~-~g~~pi~ggD~~~~~~n~~~~~~~~~~-~~k~~~~~~~~~~~~~~ 430 (432) T protein:vir:10 385 SK-EDLPPEAGGDRLLVNGNMLPIDMAGQA-YLKGGDTNGEVSKEGNE 430 (432) T ss_pred HH-hCCCCCCCCCeEeecccccchhhcccc-ccCCCCCCCCCCCCCCC Confidence 54 3443210 1100 111110 00 00011111222222222 No 121 >protein:vir:94599 Length: 641 # NCBI annotation: PfWMP4_39 # Family: family:all:1548 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762669;genbank:gi:115304377;genbank:GeneID:5142299 Probab=98.85 E-value=2.8e-08 Score=61.99 Aligned_cols=454 Identities=11% Similarity=0.071 Sum_probs=170.8 Q ss_pred CC---hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHH---HHHHHHHHhcCCCC-------ccccccCCCcccc Q lcl|NC_012753. 1 MG---IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYN---RIMDNLRYFAGDFD-------SVTYRDSNGSQVK 67 (502) Q Consensus 1 m~---~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~---~i~~~~~~Y~g~~~-------~~~~~~~~~~~~~ 67 (502) |+ +-+.|...++.. ..++-. +..+..+||..... .......+..... T Consensus 20 ~~~~~~~~~l~~~~~~~--------------------~~~R~~~e~~W~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (641) T protein:vir:94 20 LSTDRIGGVVISKWQES--------------------RDKRNTVENNWDETYELYRASAIDRQNTRARNFQTTGADDADW 79 (641) T ss_pred CCchhHHHHHHHHHHHH--------------------HHhhcchHHHHHHHHHHhhcchhhhhhcccccccccccchhcc Confidence 21 111122211111 001111 11122233332111 1111111112222 Q ss_pred ccceecchHHHHHHHHhhhhhc----CcceEee-----CCHHHH----HHHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 68 RDFNHLPIGRTASKKVASLVFN----EQATIRV-----DNEVAD----AFINETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 68 ~~~~~~n~~k~iv~~~a~~l~~----ep~~i~~-----~d~~~~----e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) ++++..+-+...|+.+++.|.+ .+.-|.+ +|.+.. ++++..+.+++|...+...+.+++.+|.++.+ T Consensus 80 r~ki~~~~~~~~~~~l~s~Lm~~~~p~~~wf~~~p~~~ed~~~A~~~~~~~~~~l~~~~~~~~~~~~~~d~~~~g~~iv~ 159 (641) T protein:vir:94 80 RHRINTGHTFEVVETLVAYFKGATFPSDDWFDLKGMVPELADAARVVKQLTKTKLEAASIRDIFETYVRNLVLYGVSTYR 159 (641) T ss_pred cccccchhHHHHHHHHhhHHhhhhcCCCceEEEecCCCChHHHHHHHHHHHHHHHhhcchHHHHHHHHHHHhhcCceEEE Confidence 4456666666666665555443 3333443 333333 45666677788999999999999999999999 Q ss_pred EEEeC-----------------------------CceEEEEEcCCeEEEEEEcCCCeEEEEEEE-EEEEee--C-CCceE Q lcl|NC_012753. 135 PYIDG-----------------------------DQIRVSFVQATVFFPLQANTQDVSSAAIVT-KSTKTE--G-QKVKY 181 (502) Q Consensus 135 ~~~d~-----------------------------~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~-~~~~~~--~-~~~~~ 181 (502) ++|+. ..+++..++|..+++ ..+.+.....|+. +.++.. . ....| T Consensus 160 ~~w~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~r~~~v~~~di~~--dps~~~~~~~f~~~r~t~~t~~~l~~eg~ 237 (641) T protein:vir:94 160 LGWDTSMERQFKRTFVETGDIFGGWEDVAVNRQRSELRIEPLSPYDVWL--DTSGGKNTGTFVRLRHTREELHELVTSGY 237 (641) T ss_pred eehhhHHHHhhhhhcccchhhcccccccceecccceeeEEecchhheee--cCCCCcccccceehhhhHHHHHHHHhcCC Confidence 98851 123455566666553 1233333334431 111100 0 00000 Q ss_pred EE--------EEEEEEEe--CCeEEE------EEEEEecCCccccCceeeccccccCCCcceee--cC---CCcceEEEe Q lcl|NC_012753. 182 YS--------LIEFHEWN--KETYTI------SNELYESESKTIIGQRVPLSTLYEDLEETVTL--NG---LTRPLFTYL 240 (502) Q Consensus 182 yt--------~~E~h~~~--~~~~~I------~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~--~~---~~~~~f~~~ 240 (502) |- ..++ .++ +...-+ ..+++.-.. +-.+...++..++........+ .+ +...||+.+ T Consensus 238 ~~~d~v~~~~~~~~-~~~~~d~~~d~~~~~~~~~~~~e~~g-d~~~d~~~~~~~~~~~~g~~il~~~~~~~~d~~Pf~~~ 315 (641) T protein:vir:94 238 YDLDLTQVEQYVDY-KFADPDTPKDVNGTDTSGWDIIEYYG-PLLVEGVQFWCVHAVFYGKQLIRLSDSKYWCGSPFVTT 315 (641) T ss_pred CChhhcchhhcccc-cccccccccccccccccccceeeeee-eeccCCCceeeEEEEEeCCEEeecccccccCcCCeEEe Confidence 00 0000 000 000000 000000000 0000001111111111111111 11 122355554 Q ss_pred cCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc-eeeechH-HhccCCCCCCcccCccccccccchhh Q lcl|NC_012753. 241 KPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQ-MIKTEYDTNGEKVTVKREFETGHNVY 318 (502) Q Consensus 241 ~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~-~l~~~~~~~g~~~~~~~~~~~~~~~~ 318 (502) +. ....++.||.|..+.+.+.++.+|...-..++.+...-+ .+.++.+ .+++ ..-...|+..+.. T Consensus 316 r~----~~~~~~~YG~gp~~~~l~dqk~ln~l~r~~ld~~~~~~~p~~~~~~~~~~~~----~~l~~~PG~ii~~----- 382 (641) T protein:vir:94 316 TL----LPDRDSVYGMSVLHPNLGALHVLNVLTNGRLDNLVLHINKMWTLVEDGILKR----EDVKAKPGAVFKV----- 382 (641) T ss_pred cc----eecCCcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeeecccccccc----ceeeccCCcceee----- Confidence 43 123478899999999999999999999888887754333 3323222 1111 0111112211111 Q ss_pred ccccCCCCccccceeeeccccch-HHHHHHHHHHHHHHHHhcCCCh--hhccccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 319 EQFDSGDMDKGIGITDLTTDIRS-DDYIKAINKGLSLFEMQLGVST--GMFSFDGKSMKTATEVVSEQSDTYQMRNSIAT 395 (502) Q Consensus 319 ~~~~~~~~~~~~~i~~~~~~ir~-e~~~~~l~~~l~~i~~~~g~s~--~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~ 395 (502) ...+.-..+..-+.++.. .+..+.++. .+....+.+. +......+...|||||..+.+......+.+.+ T Consensus 383 -----~~~~~v~pl~~~~~~~~~~~~~~~~~~~---~i~~~~~~~~~~~~~~~~~~~~~TAtEV~~~~~e~~~~l~~i~r 454 (641) T protein:vir:94 383 -----AQHGSLQPIDMGRQDFVVTYQEAQVQES---SVYRNTSTGPLIGNAAPRGGERVTAAEIQGVRDAGGNRLSSVHT 454 (641) T ss_pred -----CCCCcceeecCCccccchhHHHHHHHHH---HHHHhhhhhhhhcccccccchhccHHHHHHHHHHHHHHHHHHHH Confidence 111111111111112211 122222222 2322222221 11111122235999999888888888888878 Q ss_pred HHH-HHHHHHHHHHHHHHHhhcccC-------------CC-cccccceEEEeCCCccCCHHH------HHHHHHHHHhc- Q lcl|NC_012753. 396 LVE-KSLKELVISILELAKVYNLYT-------------GE-IPTMDEVSVDLDDGVFTDRNA------EFDYWSKMVAA- 453 (502) Q Consensus 396 ~~~-~~l~~l~~~il~~~~~~~~~~-------------~~-~~~~~~i~v~f~d~i~~d~~~------~~~~~~~~~~~- 453 (502) .|. ..|..|++-++.+...+.... +- ..+...++.+|.- ++..... .++...+..+. T Consensus 455 ~l~~e~l~pll~~~~~~~~~~~~~p~i~R~~~~~~~~~~~~~~~p~~L~~~~~i-v~l~~~q~~~~~~~i~~l~~~~~~~ 533 (641) T protein:vir:94 455 HIEDSSTLPLLNKVFSLLQQFYVTPETIRMYVPEEQMDGFFEVSPEYLHYPYKF-LALGANYVVERERMVTDLLQLLDIS 533 (641) T ss_pred HHHHHHHHHHHHHHHHHHHHhccchhhhhhhchhhhcccCCCCCccceeeeeeE-eecchhHHHHHHHHHHHHHHHHHHh Confidence 876 577778887777654431110 11 1122234444432 2333222 12222222211 Q ss_pred CCCCH-----------HHHHHhcCCC--CH------H-HHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 454 GFAPK-----------TMAIEKTLNV--TK------E-QAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 454 Gi~S~-----------et~l~~~~~~--~d------e-ea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |-.+. +. +.+..|+ .. + +.+....+.+ |..+.....-.+.++...++ T Consensus 534 a~~P~v~d~~d~~~~~~~-~~~~~g~~~p~~~ir~~~~~~~~~~~~~~-~~q~~~~~~a~~~~~~~~~~ 600 (641) T protein:vir:94 534 GRVPQIGQSLDYALILED-LLRQMRFTDPMRYIKKAEAPPAAPPIAPA-EPGALPPEMMNSVGGGLNDQ 600 (641) T ss_pred hcChhhhhcCCHHHHHHH-HHHHhCCCCchhhccCccCchhHHHHHHH-HHHHHHHHHHHHHHhhhHHH Confidence 11110 11 1111111 10 0 0000000000 00000011001111112222 No 122 >protein:vir:105520 Length: 706 # NCBI annotation: phage portal protein # Family: family:all:487 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516189;genbank:gi:89885992;genbank:GeneID:3964380 Probab=98.82 E-value=3.7e-08 Score=61.33 Aligned_cols=471 Identities=11% Similarity=-0.006 Sum_probs=207.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhc--CCCCccccc---cCCCccccccceecch Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFA--GDFDSVTYR---DSNGSQVKRDFNHLPI 75 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~--g~~~~~~~~---~~~~~~~~~~~~~~n~ 75 (502) |+= +-++++++....+. . .....++-..+..+..+||. |.++.-.-. ...+.-..+..++.|. T Consensus 1 m~e--~~~~~~~~~~~~~~-----~-----~~~~~~~~r~~~~~d~~f~~~~G~QW~~~~~~~l~~~~q~~grP~~~~N~ 68 (706) T protein:vir:10 1 MAE--SRQKQHERVMLRFD-----R-----AWSPQQVVREKCIEATRFVRVPGGQWEGATVAGTKLDEQFEKYPKFEINK 68 (706) T ss_pred CCc--chHHHHHHHHHHHH-----H-----HHHHHHHHHHHHHHHHHhhccCCccCCHHHHHHHHhhhhhcCCCceEecc Confidence 442 22223222111100 0 01112344555666777875 555432111 1112222456788999 Q ss_pred HHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHH----HHhhccHHHHHHHHHHHHhhcCCEEEEEEEe----C-- Q lcl|NC_012753. 76 GRTASKKVASLVFNEQATIRVD------NEVADAFINE----TLKNDKFSKNFERYLESCLALGGLAMRPYID----G-- 139 (502) Q Consensus 76 ~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~----~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d----~-- 139 (502) -+.+|+...++.-...+.+.+- |.+.++.|+. +.+.++.......+...+++.|-+|+.+..| + T Consensus 69 i~~~v~~v~g~~~~nr~~~~v~P~~~~~d~~~Ae~l~~l~~~~~~~~~~~~a~s~Af~d~i~~G~G~~ev~~d~~~~~d~ 148 (706) T protein:vir:10 69 VATELNRIISEYRNNRISVKFRPGDNAASEELANKLNGLFRADYEETDGGEACDNAFDDAATGGFGCFRLTTSFVNEYDP 148 (706) T ss_pred hHHHHHHHhhHHHhCCCceEEecCCCCchHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhhcCcceEEeeeccccccCC Confidence 9999999999998888877652 3345555554 4456788889999999999999999999654 1 Q ss_pred ----CceEEEEE-cC-CeEEEEEEcC----CCeEEEEEEEEEEEeeC--------------------------CCceEEE Q lcl|NC_012753. 140 ----DQIRVSFV-QA-TVFFPLQANT----QDVSSAAIVTKSTKTEG--------------------------QKVKYYS 183 (502) Q Consensus 140 ----~~~~i~~v-~~-~~~~Pi~~d~----~~~~~~~~~~~~~~~~~--------------------------~~~~~yt 183 (502) ..++|..+ +| +.++ +|. -+...+-++.+..+.+. ....... T Consensus 149 ~~~~~~i~i~~v~~p~~~v~---~Dp~a~~~D~sDar~~~~~~~~~~d~~~~~fp~~~~~~~~~~~~~~~~d~~~~d~~~ 225 (706) T protein:vir:10 149 MDERQRIAVEPIYDPARSVW---FDPDAKKYDKSDALWAFCMYSVSLEKYQSEYDKAPTSLDRVGSVSWQYDWFTPDVVY 225 (706) T ss_pred CCCCccceeeeeccchhcee---cCchhcccChhhcceEeeeecCCHHHHHHhcCCChhhhhhhccccccccccCCCcce Confidence 14555544 33 3443 222 23333333333321110 0000111 Q ss_pred EEEEEEEeCCeE-EEEEE--------EEecCCcccc-------Cc------eeecccc-ccCCCcceeecC-----CCcc Q lcl|NC_012753. 184 LIEFHEWNKETY-TISNE--------LYESESKTII-------GQ------RVPLSTL-YEDLEETVTLNG-----LTRP 235 (502) Q Consensus 184 ~~E~h~~~~~~~-~I~~~--------l~~~~~~~~l-------G~------~v~l~~~-~~~l~~~~~~~~-----~~~~ 235 (502) ..|+|+...... .+.+. .|........ |. .++--.+ |.-+.+...+.+ .++. T Consensus 226 ~~eyy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~v~~~~~~g~~~l~~~~p~~~~~~ 305 (706) T protein:vir:10 226 IAKYYEVRKESVDVISYRQPLTQEIATYDSEQIADIQDELEQAGFEEIGRRSVKRRRIYVAVVDGDGFLEKPRRIPGEHI 305 (706) T ss_pred ecccccccceeEEEEEeeccccCCceeeccchhhhhHHHHhhCCchhhhhcccceeeEEEEeeccccccccCCCCCCCcc Confidence 223222111110 00000 0000000000 00 0000000 000000001111 1233 Q ss_pred eEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeee-chHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 236 LFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV-PTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 236 ~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v-~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) ||++|--.... + .+++..-|.+.++++.++.+|...|.+.+-+-..+....+ +.+-+..... ... .++..-+.. T Consensus 306 P~vP~~g~r~~-~-d~~~~~~G~vr~~~d~Q~~~N~~~s~~~~~~~~~~~~~~~~~~~~i~~~~~-~~~--~~~~~~~~~ 380 (706) T protein:vir:10 306 PLIPVYGKRWF-I-DDVERVEGHIAKAMDPQRLYNLQVSMLADAAAQDPGQTPIVDMEQIRGLEQ-HWE--GRNRKRPAF 380 (706) T ss_pred ceEEEeecccc-c-cccCcccceeccchhhHHHHHHHHHHHHHHHHhcCCcccccchhHHHHHHH-Hhh--hcccccccc Confidence 44443211111 1 1222234568899999999999999999866433332221 1110100000 000 000000000 Q ss_pred chhhccccCCCCcc---ccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHH Q lcl|NC_012753. 315 HNVYEQFDSGDMDK---GIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRN 391 (502) Q Consensus 315 ~~~~~~~~~~~~~~---~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~ 391 (502) -.+......+|.- ...+..+++.--...+.+.++.....|....|+++..+|..++ .||.+|..+......... T Consensus 381 -l~~~~~~~~~g~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~vsGi~~~~lG~~sn--~SG~Ai~~rq~qg~~~~~ 457 (706) T protein:vir:10 381 -LPLRTVTDKTGNVVAPANVAGYTQAPVLNQALAALLQQTSADIQEVTGSSQAMQQMPSN--VARETVNSLLNRSDMASF 457 (706) T ss_pred -hhcccccCCCCcccccccccccCCCcchHHHHHHHHHHHHHHHHHHhCCCHHHcCCccc--hHHHHHHHHHHHHHHHHH Confidence 0000000001100 0011112221223467788888888999999999999986543 588888888777666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHh----h---cccCCCcc--------------------------cccceEEEeCCCccC Q lcl|NC_012753. 392 SIATLVEKSLKELVISILELAKV----Y---NLYTGEIP--------------------------TMDEVSVDLDDGVFT 438 (502) Q Consensus 392 ~~~~~~~~~l~~l~~~il~~~~~----~---~~~~~~~~--------------------------~~~~i~v~f~d~i~~ 438 (502) .+-..+..+.++.-+.+|.+..- . ++.+.... ..++|.|+=..+.+. T Consensus 458 ~~~Dnl~~~~~~~g~~lL~li~~~y~~~R~~RI~~ed~~~~~v~in~~~~d~~~G~~~~~nDi~~g~yDv~i~~~p~~~t 537 (706) T protein:vir:10 458 IYLDNMAKSLKRAGEIWLSMAREIYGSDREVRIVHEDGTDDIALMNAAVLDNQTGRVVALNDLSTGRYDVSVDVGPSYSA 537 (706) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcEEEEecCCCCccceeeccceeccccCceeeeecceeeeEEEEEecccCcch Confidence 66666777777665555554331 1 12221000 001222221223333 Q ss_pred CHHHHHHHHHHHHhcCC-CCHHH-----HHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 439 DRNAEFDYWSKMVAAGF-APKTM-----AIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 439 d~~~~~~~~~~~~~~Gi-~S~et-----~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) -.++..+.++++..++. ....+ .+.++-++.- +++.+++|+....+.....+. -.+ T Consensus 538 ~r~~~~~~m~el~~~~~p~~~~~~~l~~~~~~~~d~p~--~~e~~e~irk~~~~q~~~~~~------~~~ 599 (706) T protein:vir:10 538 RRDATVNALTQLLQGMLPQDPMRPALMGIIIDNMEGEG--LDDFKAFNRRQLLTQGIVKPR------NQQ 599 (706) T ss_pred HHHHHHHHHHHHHHhcCCcchhhHHHHHHHHhhcCccc--hHHHHHHHHHhhcccCCcccc------chh Confidence 35666667777765432 21222 1223333422 334455555433322111100 011 No 123 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.76 E-value=6.3e-08 Score=60.07 Aligned_cols=405 Identities=11% Similarity=0.093 Sum_probs=174.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhh-ccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSIT-DHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~-~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) |+||+++ +++... ..+..... ....+.++.. .+|.+-..... .... .++-+..+--... T Consensus 1 MG~f~~l---f~~~~~---~~~~~~~~~~~~~~~~~~~---------~~~~~~g~~~~-~~v~----~~~al~~~~v~~c 60 (422) T protein:vir:13 1 MGFLRGL---FNKKNN---NDEKRSNYDEDIGIDISDS---------NFWEKFGIKLN-FSVR----GKRALKENTVYVC 60 (422) T ss_pred Cchhhhh---hhccCC---ccchhhhhhhccccccCcc---------hhhhhccccCC-cccc----hhhhhccHHHHHH Confidence 9999876 322110 00000000 0000111100 11111000000 0000 0111223333455 Q ss_pred HHHHhhhhhcCcceEeeCCH-HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCe Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNE-VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATV 151 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~-~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~ 151 (502) |+..|+-+.+=|+.+--+.+ .....+..+|.. | ....-+..++...+..|.+|+.+..+. |+ ..+..++|++ T Consensus 61 i~~ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~ 140 (422) T protein:vir:13 61 TKIRAESIGKLSLKIYKDKEEYKEHELYYLLRYKPNPLMSSINFWKCLETQRTLKGNAYAYIERDRKGKIIGLYPINSDN 140 (422) T ss_pred HHHHHHhhhhCceEEEecCcccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcc Confidence 66666666666665522221 111123333321 2 223455566777888999998887765 44 4677788888 Q ss_pred EEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecC Q lcl|NC_012753. 152 FFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNG 231 (502) Q Consensus 152 ~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 231 (502) +-++..+.+......- .+| ++. ..+ |....+ + +. T Consensus 141 v~~~~~~~~~~~~~~~------------~~y-------------~~~-----~~~----g~~~~~---~---~~------ 174 (422) T protein:vir:13 141 VTKIIDDDNFLSSLSK------------VWY-------------VVT-----DKN----GKEHKL---L---PD------ 174 (422) T ss_pred eEEEEcCCcceeccce------------EEE-------------EEE-----eCC----CeEEEE---c---cc------ Confidence 8776433332210000 001 000 000 111000 0 00 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccc Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREF 311 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~ 311 (502) -.++++.+. ..+..+|+|.+.-+...|+....+-....+-|..+...-.+ |.....-+.... ..+ T Consensus 175 ----eiih~~~~~----~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~l~~e~~---~~~ 239 (422) T protein:vir:13 175 ----EMLHFIGDI----TLDGLIGIKPLDYLRCTIENGRATQEFINKFFKNGLSIKGI----VQYVGDLDEKAK---KIF 239 (422) T ss_pred ----ceEEEcCCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE----EEeCCCCCHHHH---HHH Confidence 012333211 12345789998888888775444433333344554322111 222111000000 000 Q ss_pred ccc-chhhccccCCC----CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHH Q lcl|NC_012753. 312 ETG-HNVYEQFDSGD----MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSD 385 (502) Q Consensus 312 ~~~-~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~ 385 (502) ... ...+....... -+.+..++.++......++.+..+....+|+...|+||..++....+. ++.++.... T Consensus 240 ~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~e~~~~~--- 316 (422) T protein:vir:13 240 KKEFESMSNGLENAHSISLLPFGYQFQPISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFNNLTEQQKD--- 316 (422) T ss_pred HHHHHHHhcCccccCCceecCCCceeeeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH--- Confidence 000 01111100000 011223555555556667788888889999999999999998755432 222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC-CCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 386 TYQMRNSIATLVEKSLKELVISILELAKVYNLYT-GEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 386 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~-~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) .++..|..+++.|-...+. .++. ........+.+++++-.-.|..+.++...+++.+|+|+.-++++. T Consensus 317 ----------f~~~~l~P~~~~ie~~l~~-~Ll~~~~~~~g~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~ 385 (422) T protein:vir:13 317 ----------FYVTTLQSSLTVYEQEIQD-KLFSQYETLQDVKAEFNVDTILRSDIKTRYEAYRIGIQGGFIEANEARRR 385 (422) T ss_pred ----------HHHHHHHHHHHHHHHHHHH-hhCChhhhcCCceEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 1233333333332221111 1111 111112234555555566788899999999999999999997654 Q ss_pred cCCCCH-HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 465 TLNVTK-EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 465 ~~~~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .|+.. +..++.+....-- ...........+++=+|+ T Consensus 386 -~gl~p~~ggD~~~~~~n~~-~l~~~~~~~~~~g~~~g~ 422 (422) T protein:vir:13 386 -ENLPPVEGGDRLLVNGNMI-PIEMAGEQYKKGGEKGGK 422 (422) T ss_pred -hCCCCCCCcCeeeeccCcc-chhhcccccccCCCcCCC Confidence 35433 1122111110000 001111223566677777 No 124 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.68 E-value=1.2e-07 Score=58.60 Aligned_cols=431 Identities=12% Similarity=0.099 Sum_probs=173.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccc---cCCHHHHHHHHHHHH-HhcCCCC-cccccc------CC-Cc---- Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKI---AISPEEYNRIMDNLR-YFAGDFD-SVTYRD------SN-GS---- 64 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~---~~~~~~~~~i~~~~~-~Y~g~~~-~~~~~~------~~-~~---- 64 (502) |+|+++++-..+-....+.+ + .+.+...+ ++..+.+++....+. -|.-... ...... .. +. T Consensus 5 ~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~k~~~~~~~a~~~~~~~~~~~~~~~~~r~~~~~~~~l~ 81 (551) T protein:vir:80 5 LGLFESIRLVGVNKSDAVKH--I-EVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLH 81 (551) T ss_pred hhhHHHhhhccCChhhcccc--c-ccccceeeecccccHHHHHHhhccCcceeecccccceecCcccccCccccChhHHH Confidence 99999887221111000000 0 00000011 222222222211110 1100000 000000 00 00 Q ss_pred cccccceecchHHHHHHHHhhhhhc-----------CcceEeeCC---------HHHHHHHHHHHhhc---------cHH Q lcl|NC_012753. 65 QVKRDFNHLPIGRTASKKVASLVFN-----------EQATIRVDN---------EVADAFINETLKND---------KFS 115 (502) Q Consensus 65 ~~~~~~~~~n~~k~iv~~~a~~l~~-----------ep~~i~~~d---------~~~~e~l~~~~~~~---------~f~ 115 (502) ...+.....++...+|+..|+-+.. -+..+.+.+ ....+.+.+++..- .|. T Consensus 82 ~~~~~~~~npiv~~~I~~ia~~IA~~~~~~~~~~~g~~~~i~~kd~~~~~~~~~~~~~~~i~~~l~~pn~~~~p~~~s~~ 161 (551) T protein:vir:80 82 GVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFS 161 (551) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhcCCCCceEEecccCcccChhHHHHHHHHHHHHHhcCCCCCCccchHH Confidence 0001122234555666666554432 122333322 12223445544321 234 Q ss_pred HHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC Q lcl|NC_012753. 116 KNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE 193 (502) Q Consensus 116 ~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~ 193 (502) ..+..++.+.+..|.+|+.+..|. |.+ .+..++|.++-++..+++......+ +|. . ... + T Consensus 162 ~f~~~lv~dlll~Gnay~~i~rd~~G~~~~L~~l~p~~V~v~~~~~g~~~~~~~-------------~y~--~--~~~-g 223 (551) T protein:vir:80 162 SFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPDNGN-------------RFV--Q--VID-Q 223 (551) T ss_pred HHHHHHHHHHHhcCCEEEEEEECCCCcEEEEEEeCCceeEEEECCccccccCce-------------EEE--E--EeC-C Confidence 455566777788899998888875 443 5777888888776433332211000 010 0 000 0 Q ss_pred eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHH Q lcl|NC_012753. 194 TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTY 273 (502) Q Consensus 194 ~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~ 273 (502) ..... | + +.+ .++|+.+- ..-....++|+|-+..+...|.....+- T Consensus 224 ~~~~~---~------------~--------~~e----------iiH~~~n~-~~~~~~~~~G~spi~~a~~~i~~~~a~~ 269 (551) T protein:vir:80 224 KIVAT---F------------N--------ARE----------MAFAVRNP-RSDIYATGYGYPELEIALKQFIAHENTE 269 (551) T ss_pred cEEEE---E------------c--------ccc----------eEEecccC-CCCcccccccccHHHHHHHHHHHHHHHH Confidence 00000 0 0 000 12233210 0011234579998888777776555443 Q ss_pred HHHHHHHhhccceeeechHHhccCCCCCCcccCccc--cccccc-hhhccccCCC-----CccccceeeeccccchHHHH Q lcl|NC_012753. 274 DEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR--EFETGH-NVYEQFDSGD-----MDKGIGITDLTTDIRSDDYI 345 (502) Q Consensus 274 S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~--~~~~~~-~~~~~~~~~~-----~~~~~~i~~~~~~ir~e~~~ 345 (502) .-..+-|..+...-.+ |...++.. ..... .+.... ..|....... .+.+.-++.++.....-++. T Consensus 270 ~~~~~~f~Ng~~p~gi----L~~~~~~~---lt~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~l~~~~~D~qfl 342 (551) T protein:vir:80 270 AFNDRFFSHGGTTRGI----LQIKAAQQ---QSQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVNMTPSARDMEFE 342 (551) T ss_pred HHHHHHHHcCCCcceE----EEEcCCCC---CCHHHHHHHHHHHHHHhcCccccCccccccCCCceEEEccCChhHHHHH Confidence 3333445655432111 21111100 01000 000000 0111100000 01122345555566677788 Q ss_pred HHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCccc Q lcl|NC_012753. 346 KAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIPT 424 (502) Q Consensus 346 ~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~ 424 (502) +..+...+.|+...|++|..+|+...+..++....+. ...++.... ..++..|..++..|-...+. .+.. . . T Consensus 343 e~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~---t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~-~L~~-~--~ 415 (551) T protein:vir:80 343 KWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSL---NEGNSAEKNQASKNKGLQPLLGFIEDFINK-HIVA-E--F 415 (551) T ss_pred HHHHHHHHHHHHHhcCCHHHcCccccccccccccccc---chhhHHHHHHHHHHHHHHHHHHHHHHHHHh-hhcc-c--c Confidence 9899999999999999999999755433222211111 111111111 22344455444444322221 1111 1 1 Q ss_pred ccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH--HHHHH-------------------HHHHHH-- Q lcl|NC_012753. 425 MDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK--EQAQE-------------------IYQKIN-- 481 (502) Q Consensus 425 ~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d--eea~~-------------------el~ri~-- 481 (502) ...+.+.|+.....+..+. ..+.+++.+|+|+.-++++.. |+.. +..+. +-++.+ T Consensus 416 ~~~~~f~f~~~~~~~~~~~-~~~~~~~~~g~lT~NE~R~~~-gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 493 (551) T protein:vir:80 416 GDKYTFQFVGGDIKSELES-VKILAEKAKVAMTVNEVRKEL-NLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSN 493 (551) T ss_pred CCceEEEeeccChhhHHHH-HHHHHHHhcCCcCHHHHHHHh-CCCCCCCCCceeecccccccccccccccCcchhhhhhc Confidence 2346778887666665443 345567778999999976653 4321 10000 001101 Q ss_pred -----Hhhh--cccCCCCCccccCCCCC Q lcl|NC_012753. 482 -----DETM--VSTDSFRTSEEVDIYGE 502 (502) Q Consensus 482 -----~E~~--~~~~~~~~~~~~~~~g~ 502 (502) +..+ ...+....+.+.+-.|+ T Consensus 494 ~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 521 (551) T protein:vir:80 494 LQMLQEQTGNRVSTDVEDIPDGKDTTGD 521 (551) T ss_pred cccccCcCCCCCCCCCCCCCCccccCCC Confidence 0000 00111122333334443 No 125 >protein:vir:95315 Length: 559 # NCBI annotation: putative head-to-tail-joining protein # Family: family:all:481 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512261;genbank:gi:89152428;genbank:GeneID:3952984 Probab=98.68 E-value=1.2e-07 Score=58.56 Aligned_cols=459 Identities=12% Similarity=0.091 Sum_probs=191.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccc-cc--cCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVT-YR--DSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~-~~--~~~~~~~~~~~~~~n~~k 77 (502) |.- +.+..+++.+. .+..++-.....|+.+|.=-.+-+. +. ..........++--+.+. T Consensus 1 m~~--~~~~~l~~r~~----------------~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~ 62 (559) T protein:vir:95 1 MAE--TTKERLNKQFA----------------QLESERQSFEPHWRELSDYINPRGSRFLTSEVNRNDRRNTRIIDSTGT 62 (559) T ss_pred CCh--hhHHHHHHHHH----------------HHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcccccccccccchHH Confidence 321 12222222111 1223333444445444322111110 00 111111222334446677 Q ss_pred HHHHHHhhhhhcCcc-------eEeeCC------HHHHHH-------HHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE Q lcl|NC_012753. 78 TASKKVASLVFNEQA-------TIRVDN------EVADAF-------INETLKNDKFSKNFERYLESCLALGGLAMRPYI 137 (502) Q Consensus 78 ~iv~~~a~~l~~ep~-------~i~~~d------~~~~e~-------l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~ 137 (502) ..|+.+|+.|.+-.. .+.+.| ..+.++ +.+.+...+|...+.++..+..+.|.+++.+-. T Consensus 63 ~a~~~Las~l~~~ltpp~~~WF~l~~~d~~~~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~Gta~l~~~~ 142 (559) T protein:vir:95 63 MAARTLASGMMSGITSPARPWFRLATPDPEMMDYGPVKLWLEAVQNRMNDMFNKSNLYQSLPQLYGSLGTYSTGAMAVLD 142 (559) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccCCccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeEeec Confidence 777777777765221 133332 223334 445677789999999999999999999987766 Q ss_pred eC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEE----------EEEEEEEEe-CCeEEEEEEEEecC Q lcl|NC_012753. 138 DG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYY----------SLIEFHEWN-KETYTISNELYESE 205 (502) Q Consensus 138 d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~y----------t~~E~h~~~-~~~~~I~~~l~~~~ 205 (502) |+ +.+++..++..+++-- .|..+....+| +++...-.+-...| ..++. .. +..+.|.|.+|.-. T Consensus 143 d~~~~~r~~~~~l~~~~v~-~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~~~~~~~~--~~~~~~v~v~~~V~pr~ 218 (559) T protein:vir:95 143 DDEDIIRTMPFPIGSYYLA-NSPRGSVDTCF-RKFSMTVRQLVQEFGLNNVSESVKSMWES--GTYEKWIEVMHSVYPNI 218 (559) T ss_pred CCCceeEEEEeecCeEEEe-eCCCCCeEEEE-EeEecCHHHHHHHcCcccCCHHHHHHHhc--CCCCCeEEEEEEEeccc Confidence 65 4578999999998764 55545444443 33221100000000 00000 00 11234444444221 Q ss_pred Ccc--cc-CceeeccccccCC--Cc--ceeecCCCcceEEEecCCccccccccCcCCcc-hhhhHHHHHHHHHHHHHHHH Q lcl|NC_012753. 206 SKT--II-GQRVPLSTLYEDL--EE--TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLS-IFDNAKTTMDFINTTYDEFM 277 (502) Q Consensus 206 ~~~--~l-G~~v~l~~~~~~l--~~--~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S-~~~~~~~lid~ld~~~S~~~ 277 (502) +.+ .. ....|..++|-.. .. .....|+..-||++++- +...++.||+| --..+.+-+..|+..--..+ T Consensus 219 ~~~~~~~~~~~~pf~s~~~e~~~~~~~~l~esg~~e~P~~~~Rw----~~~~ge~YGrg~P~~~al~d~k~L~~l~~~~l 294 (559) T protein:vir:95 219 DRDTSKLDSKNKPFKSVYYEVGGDNDKLLRESGFDEFPIMAPRW----EVNGEDVYGSSCPGMLALGPVKALQLLQKRKS 294 (559) T ss_pred cccccccccccceEEEEEEEecCCCceeeecCCcccCCccceee----eecCCccccccchHHHhhHHHHHHHHHHHHHH Confidence 111 11 1223343443111 11 11223444445555442 23457789999 58899999999998877776 Q ss_pred HHHhh-ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceee---eccccchHHHHHHHHHHHH Q lcl|NC_012753. 278 WEVKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITD---LTTDIRSDDYIKAINKGLS 353 (502) Q Consensus 278 ~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~---~~~~ir~e~~~~~l~~~l~ 353 (502) ...+. ....+.||.+.. .......|+.. .. + .+......++. .++++- .....++.+.+ T Consensus 295 ~~~~~~~~pp~~v~~~~~-----~~~~~l~pgg~-----~~---~--~~~~~~~~i~p~~~~~~~~~--~~~~~i~~~~~ 357 (559) T protein:vir:95 295 QLIDKATNPPMVAPTSLK-----NQRASLLPGDI-----TY---I--DQITGQDGFRPAYLVNPSTA--DLVADIQDTRQ 357 (559) T ss_pred HHHHHHhcCceecccccc-----ccceeeeccce-----ee---e--CCCCCcccceeecccccchH--HHHHHHHHHHH Confidence 66543 444455554421 11111222211 11 1 11111122332 233331 11122333333 Q ss_pred HHHHhcCCCh-hhccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCcc--cccceE Q lcl|NC_012753. 354 LFEMQLGVST-GMFSFDGKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYNLYTGEIP--TMDEVS 429 (502) Q Consensus 354 ~i~~~~g~s~-~~~~~~~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~--~~~~i~ 429 (502) .|....-... .++...+....|||||......+.+..+- ..+.-...|..|+.-++.++.-.+..+.... ....++ T Consensus 358 rI~~af~~d~~~~l~~r~~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~l~~~~i~ 437 (559) T protein:vir:95 358 IINSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDECLNPLIDRSFSMMVRKNMLPPPPDVMEGMPLK 437 (559) T ss_pred HHHHHhhhhhHHHhhcCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcccccCcceE Confidence 4433221110 12333445567999999888777766554 2222344455555555555544333322111 123466 Q ss_pred EEeCCCccCCH-HHH---HHHHHHHHh--cCC-------CCHHHHHH---hcCCC------CHHHHHHHHH-HHHHhh-- Q lcl|NC_012753. 430 VDLDDGVFTDR-NAE---FDYWSKMVA--AGF-------APKTMAIE---KTLNV------TKEQAQEIYQ-KINDET-- 484 (502) Q Consensus 430 v~f~d~i~~d~-~~~---~~~~~~~~~--~Gi-------~S~et~l~---~~~~~------~deea~~el~-ri~~E~-- 484 (502) |++--.+-.-. ... +....+.+. +++ +....++. ...|+ +++|+++.-+ |.++.+ T Consensus 438 v~~is~La~aqk~~~~~~i~~~~~~~~~laq~~Pevld~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~rqqr~~~qq~~ 517 (559) T protein:vir:95 438 VEYISVMAQAQKSIGLSSLASTVNFIGQLAQVKPEALDKLNVDQAIDAFADMSGVSPTVIVPQEQVEQARQQRAQQQQQQ 517 (559) T ss_pred EEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhhhcCCHHHHHHHHHHHhCCchhhcCCHHHHHHHHHHHHHHHHHH Confidence 66543221100 000 111111110 111 22233322 33444 3344332211 111110 Q ss_pred -h-----cccCCCCCccccCCCCC Q lcl|NC_012753. 485 -M-----VSTDSFRTSEEVDIYGE 502 (502) Q Consensus 485 -~-----~~~~~~~~~~~~~~~g~ 502 (502) . +++.....-.++..-|. T Consensus 518 q~~~~~~~aa~~~~~~~~~~~~~~ 541 (559) T protein:vir:95 518 QMMAMGMAAAQGVKTLSEAKTSDP 541 (559) T ss_pred HHHHHHHHHHHhhhccccccCCCh Confidence 0 00111111111111111 No 126 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.66 E-value=1.4e-07 Score=58.16 Aligned_cols=387 Identities=13% Similarity=0.082 Sum_probs=174.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |++|++|++++++-.. ...++... +..|..+... . ...-+.++--...| T Consensus 1 MG~~~~~~~~~~~~~~--------------~~~~~~~~------~~~~~g~~~~--~---------~~~al~~~~V~~~v 49 (411) T protein:vir:81 1 MGWWSRLTRFFRPRNE--------------TVDMTNPL------LLQWLGVDPD--T---------PRNQLSEATYFACL 49 (411) T ss_pred CchHHHHHhhccCccc--------------ccccchHH------HHHHhcCccc--C---------hhhhhccHHHHHHH Confidence 9999999988764211 11111111 2233333211 0 01112222223455 Q ss_pred HHHhhhhhcCcceEee--CC---HHHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEcC Q lcl|NC_012753. 81 KKVASLVFNEQATIRV--DN---EVADAFINETLKN-----DKFSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQA 149 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~--~d---~~~~e~l~~~~~~-----~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~~ 149 (502) +..|+-+-+=|+.+-- ++ +.....+..+|.. -....-+..++...+..|.+|+.+..++|++ .+..++| T Consensus 50 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~~~~l~~l~~ 129 (411) T protein:vir:81 50 KILSESLGKLPLKMYQKTERGIVKSDREELYNLLKLRPNPYMTSSVFWSTVEMNRNHYGNAYVWCQYSGPQLQALWILPS 129 (411) T ss_pred HHHHHhHhhCceeEEEecCCceeeecccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCceEEEEEECC Confidence 6666666555655411 11 1111123333321 1333445556667788899998888887664 4666788 Q ss_pred CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceee Q lcl|NC_012753. 150 TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTL 229 (502) Q Consensus 150 ~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~ 229 (502) +.+-++..+.+... .....+|+ +.. ..+ |..+.+ T Consensus 130 ~~v~~~~~~~~~~~------------~~~~~~~~-------------~~~----~~~----g~~~~~------------- 163 (411) T protein:vir:81 130 QYVTIVVDDRGLLG------------EKNAIWYR-------------YND----PYD----GKMYVF------------- 163 (411) T ss_pred ceEEEEEcCccccc------------ccceEEEE-------------EEe----cCC----ceEEEE------------- Confidence 87766533222110 00000110 000 000 111110 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR 309 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 309 (502) +.--.++|+.+. ..+..+|+|.+.-+...++....+..-..+-|..+...-.+ |.....-+.... . T Consensus 164 ---~~~eiih~k~~~----~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~l~~e~~---~ 229 (411) T protein:vir:81 164 ---RNDEILHFKTSV----TFDGITGLSVRDVLKHTVDGALESQKFMNNLYKTGLTGKAV----LEYTGDLNQEAR---D 229 (411) T ss_pred ---ccccEEEEcCCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCCCHHHH---H Confidence 000123444321 12345789988888777766665544444444554332111 222111100000 0 Q ss_pred cccccc-hhhccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHH Q lcl|NC_012753. 310 EFETGH-NVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQ 383 (502) Q Consensus 310 ~~~~~~-~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~ 383 (502) .+.... ..+..... .--+++.-++.++......++.+..+....+|+...|+||..+|...++. .++.+.. T Consensus 230 ~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~--- 306 (411) T protein:vir:81 230 RLVKGFEQFANGSKNAGKIIPVPLGMKLVPLDIKLTDSQFFELKKYTALQIAAAFGIKPNQINDYEKSSYASAEAQN--- 306 (411) T ss_pred HHHHHHHHHhcCccccCCceecCCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCchhHHHHH--- Confidence 000000 00111000 00011223555555555667778888889999999999999998765432 2232221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) ...++.+|..++..|..-.+..-+..........+.++++.-+-.|..+.++...+++.+|+|+.-++++ T Consensus 307 ----------~~f~~~~l~P~~~~ie~~l~~~ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~g~~t~NE~R~ 376 (411) T protein:vir:81 307 ----------LAFYVDTLLYVLKQYEEEITYKILSNDLISQGHYFKFNVNVILRADIKTQMDSLSTAVQNGIMTPNEARD 376 (411) T ss_pred ----------HHHHHHHHHHHHHHHHHHHHhhcCChhhcCCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 1123334444443333222211011111222234566666667788999999999999999999988765 Q ss_pred hcCCCCHH-HHHHHH-----HHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 464 KTLNVTKE-QAQEIY-----QKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~~~~~de-ea~~el-----~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) . .|+... ..++.+ ..+.. ......-+|| T Consensus 377 ~-~gl~p~~ggD~~~~~~n~~pl~~----------~~~~~~kgGd 410 (411) T protein:vir:81 377 Y-LDMPADDYGNNLMANGNYIPLSM----------LGANYGKGGD 410 (411) T ss_pred H-hCCCCCCCCCeeeeccCccchhh----------hhhhhccCCC Confidence 4 354331 111111 11110 0011112444 No 127 >protein:vir:1538 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052106;swissprot:trembl:q9t110;genbank:gi:9634032;uniprot:Q9T110;genbank:GeneID:1262384 Probab=98.61 E-value=2e-07 Score=57.36 Aligned_cols=446 Identities=9% Similarity=0.035 Sum_probs=183.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |.=-+ ++-+ .....++ .+. .+..++-.....|+.++.=-.+-+......+......++--+.+...+ T Consensus 1 m~~~~--~~~~-------~~~~~k~---r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 67 (535) T protein:vir:15 1 MADSK--RTGL-------GEDGAKA---TYD-RLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGL 67 (535) T ss_pred CCccc--hhcc-------chHHHHH---HHH-HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH Confidence 11000 0000 0000000 000 122233333444444332211111111111111222223334566677 Q ss_pred HHHhhhhhcC--cce----EeeCCH-------------HHHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 81 KKVASLVFNE--QAT----IRVDNE-------------VADAFI-------NETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 81 ~~~a~~l~~e--p~~----i~~~d~-------------~~~e~l-------~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) +.+|+.|.+- |++ +.+.+. .+.++| ...+..++|...+.++..+..+.|.+.++ T Consensus 68 ~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~ 147 (535) T protein:vir:15 68 NNLASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLY 147 (535) T ss_pred HHHHHHHHHhhcCCCcccccccChHHHhccCCCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEE Confidence 7777666541 221 232221 233344 33477789999999999999999998877 Q ss_pred EEEeCC-ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe--------------------eCCCceEEEEEEEEEEeCC Q lcl|NC_012753. 135 PYIDGD-QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKT--------------------EGQKVKYYSLIEFHEWNKE 193 (502) Q Consensus 135 ~~~d~~-~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~--------------------~~~~~~~yt~~E~h~~~~~ 193 (502) +-.+++ .+++..+|-.+++-- .|..+....+|. +++.. ..+...+|+++... .+++ T Consensus 148 ~~~~~~~~~~f~~~pl~~~~v~-~d~~G~vd~i~r-~~~~t~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~-~~~~ 224 (535) T protein:vir:15 148 LPEPEGSYNPMKLYRLSSYVVQ-RDAYGNVLQIVT-RDQIAFGALPEDVRSAVEKAGGEKKMDEMVDVYTHVYLD-EESG 224 (535) T ss_pred eecCCCCceeeEEEEcCeeEEe-eCCCCCeeEEEE-eEeecHHHHHHHHhHhhhccccccCCCCceeEEEEEEEe-cCCC Confidence 655654 578888988886654 555554445443 32211 01122344444221 1122 Q ss_pred eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHH Q lcl|NC_012753. 194 TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTY 273 (502) Q Consensus 194 ~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~ 273 (502) .|...++++ |..+++. .. ..+...-||++++- +...++.||+|-...+.+-+..|+..- T Consensus 225 ~~~~~~e~~--------g~~~~~~------~~---~~~~~~~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~ 283 (535) T protein:vir:15 225 DYLKYEEVE--------DVEIDGS------DA---TYPTDAMPYIPVRM----VRIDGESYGRSYCEEYLGDLRSLENLQ 283 (535) T ss_pred cEEEEEEee--------Ccccccc------cc---ccccccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHH Confidence 233222221 1112111 00 11223345554442 223467899999999999999999877 Q ss_pred HHHHHHHh-hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHH Q lcl|NC_012753. 274 DEFMWEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGL 352 (502) Q Consensus 274 S~~~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l 352 (502) -....-.+ ..+..+.||.+... +...+.+. ....+.. ...++...+ .+...-+.....+.++.+. T Consensus 284 ~~~l~~~~~~~~p~~lv~~~g~~-----~~~~l~~~-----~~g~~v~---g~~~~v~~~-~~~~~~~~~~~~~~i~~~~ 349 (535) T protein:vir:15 284 EAIVKMSMISAKVIGLVNPAGIT-----QPRRLTKA-----QTGDFVP---GRREDIDFL-QLEKQADFTVAKAVSDQIE 349 (535) T ss_pred HHHHHHHHHHhcCceeecccccc-----cchhcccC-----Cceeeec---CCcccceee-ecccccchhHHHHHHHHHH Confidence 66666553 44445555433221 11111110 0011111 111111111 1111112233334444444 Q ss_pred HHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEE Q lcl|NC_012753. 353 SLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL-VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVD 431 (502) Q Consensus 353 ~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~-~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~ 431 (502) +.|....=+. .+...++...|||||....+...+..+-.-.. =...|..|+.-++.+....+..+.. +...++++ T Consensus 350 ~~I~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~--p~~~v~~~ 425 (535) T protein:vir:15 350 ARLSYAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPEL--PKEAVEPT 425 (535) T ss_pred HHHHHHHhhh--hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--CccceeEE Confidence 4443332111 12223344579999998877776665552222 2333444554444444333333322 22335566 Q ss_pred eCCCccCC-HHHHHHHHHHHHh--cCC--------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhhhccc-- Q lcl|NC_012753. 432 LDDGVFTD-RNAEFDYWSKMVA--AGF--------APKTMAI---EKTLNV-------TKEQAQEIYQKINDETMVST-- 488 (502) Q Consensus 432 f~d~i~~d-~~~~~~~~~~~~~--~Gi--------~S~et~l---~~~~~~-------~deea~~el~ri~~E~~~~~-- 488 (502) |--++..- ....++...+... +++ +....++ ....|+ ++||+++..++.++.++... T Consensus 426 yis~La~aqr~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~~~~eev~~~~~q~~~~~~~~~~a 505 (535) T protein:vir:15 426 ISTGLEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGIENAA 505 (535) T ss_pred EecHHHHHHHHHHHHHHHHHHHHHHhcChhhhhccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHHH Confidence 54322111 0111111111110 111 1122222 222343 44555555443332221110 Q ss_pred ---CCCCCccccCCCCC Q lcl|NC_012753. 489 ---DSFRTSEEVDIYGE 502 (502) Q Consensus 489 ---~~~~~~~~~~~~g~ 502 (502) +.- ..+..-..+| T Consensus 506 ~~~g~~-~~~~~~~~p~ 521 (535) T protein:vir:15 506 ATGGAG-VGALATSSPE 521 (535) T ss_pred HHHHhh-ccchhccChH Confidence 100 0111112233 No 128 >protein:vir:103765 Length: 549 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024925;genbank:gi:48697195;genbank:GeneID:2846089 Probab=98.58 E-value=2.4e-07 Score=56.85 Aligned_cols=460 Identities=12% Similarity=0.051 Sum_probs=187.6 Q ss_pred CChh-HHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcC------CCCccccccCCCccccccceec Q lcl|NC_012753. 1 MGII-QTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAG------DFDSVTYRDSNGSQVKRDFNHL 73 (502) Q Consensus 1 m~~~-~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g------~~~~~~~~~~~~~~~~~~~~~~ 73 (502) |+== +++..-+++ ++. .+..++-.....|+.+|.= ..................++-- T Consensus 1 m~~d~~~~~~~l~~---------------r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~~~~d 64 (549) T protein:vir:10 1 MTNDDAKILQALNA---------------DHG-RMKEKRQSYEAVWNDVIDYLMPRLDKFGQLPRPDSEKGRERSQKMFD 64 (549) T ss_pred CCcchHHHHHHHHH---------------HHH-HHHHHhhhHHHHHHHHHHHhccccccccccCCCCCCccccccccccc Confidence 3220 011111111 000 2223333444445444322 1111111111112222233444 Q ss_pred chHHHHHHHHhhhhhcC--cce-----EeeCCHH------HHHHHHH-------HH--hhccHHHHHHHHHHHHhhcCCE Q lcl|NC_012753. 74 PIGRTASKKVASLVFNE--QAT-----IRVDNEV------ADAFINE-------TL--KNDKFSKNFERYLESCLALGGL 131 (502) Q Consensus 74 n~~k~iv~~~a~~l~~e--p~~-----i~~~d~~------~~e~l~~-------~~--~~~~f~~~~~~~~~~~~~~G~~ 131 (502) +.+...++.+|+.|.+- ||. +.+.++. +.++|++ ++ ...+|...+.++..+....|.+ T Consensus 65 stg~~a~~~LAs~l~~~ltpp~~~wF~l~~~~~~~~e~~~v~~~l~~ve~~~~~~~~~~~snf~~~~~~~~~~L~~~Gta 144 (549) T protein:vir:10 65 STAPLALRNFVAAMDSMITPATQLWHRLKTGNDALNEIASVKAYLQGVVRTLFAARYRWQGGFVTQMGATYQSIGLFGPG 144 (549) T ss_pred chHHHHHHHHHHHHHhhccCCCCccccccCCccchhhhhHHHHHHHHHHHHHHHHHhhhhcChHHHHHHHHHHHHhhcce Confidence 56778888888777752 222 3444322 2334443 22 2467999999999999999999 Q ss_pred EEEEEEeCC-ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeC--------CC--ceEEEEEEEEEEeCCeEEEEEE Q lcl|NC_012753. 132 AMRPYIDGD-QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEG--------QK--VKYYSLIEFHEWNKETYTISNE 200 (502) Q Consensus 132 ~~~~~~d~~-~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~--------~~--~~~yt~~E~h~~~~~~~~I~~~ 200 (502) ++.+-.|++ .+++..+|-.+++- ..|..+....+| +++...-. ++ ......++ .-.+..+.|.|. T Consensus 145 ~l~~~~~~~~~~~f~~~pl~~~~v-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~~~~--~~~~~~~~v~~~ 220 (549) T protein:vir:10 145 ALMIEHDVGKGIVYRNVPMQRLWF-AENNSGLIDKTH-VQWELTLRQAAQRFGRENLSPSMQSTLE--KDPEKSAIFYHA 220 (549) T ss_pred eeEEeecCCCeeEEEEEEcCeEEE-eeCCCCCeEEEE-EEeecCHHHHHHhcCcccCCHHHHHHhh--cCCCceEEEEEE Confidence 988766654 57888899998775 455555444444 33211000 00 00000000 001122334444 Q ss_pred EEecCCccc---cCceeeccccccCCCcc--eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHH Q lcl|NC_012753. 201 LYESESKTI---IGQRVPLSTLYEDLEET--VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDE 275 (502) Q Consensus 201 l~~~~~~~~---lG~~v~l~~~~~~l~~~--~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~ 275 (502) +|...+.+. -+.-.|..++|-..... ....|+..-||++++- + ...++.||+|--..+.+-+..|+..--. T Consensus 221 V~pr~~~~~~~~~~~~~pf~sv~~e~~~~~il~esg~~e~P~~~~Rw---~-~~~ge~YGrgp~~~~l~D~k~L~~l~~~ 296 (549) T protein:vir:10 221 VEPRADRDPRKLDGRNMQFASYWLDEGRDRIVQNSGFRTFPFAIGRF---Y-VGTDDVYGGSPAYDAMPDVRMANDMAKT 296 (549) T ss_pred eecCCCCCccccccccCceEEEEEEecCCEeeccCCcccCCcceeee---e-ecCCCccccchHHHHHHHHHHHHHHHHH Confidence 443222111 11223333333211111 1123344445555442 2 2346789999999999999999987766 Q ss_pred HHHHHh-hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHH Q lcl|NC_012753. 276 FMWEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSL 354 (502) Q Consensus 276 ~~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~ 354 (502) ...-.+ ..+..+.||.+... +.....++. ..+.. ... .....+..+..--+..-....++.+.+. T Consensus 297 ~l~~~~~~~~p~~~v~~~g~~-----~~~~l~pgg------~~~~~--~~~-~~~~~~~pl~~~~~~~~~~~~i~~~~~r 362 (549) T protein:vir:10 297 NIRGAQKLVDPPLLANEDGVL-----DGFDLRSGA------LNWGG--LND-KGEEMVKPLLTGKQAQIGIEFAQDTRQT 362 (549) T ss_pred HHHHHHHHhcCceeecccccc-----ccceeccCC------ccccc--cCC-CCccceeeeccccchhHHHHHHHHHHHH Confidence 665554 34455556544221 111111111 11100 011 1112233222111222222334444444 Q ss_pred HHHhcCCChhhccc-cccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccCCCcc----cccce Q lcl|NC_012753. 355 FEMQLGVSTGMFSF-DGKSMKTATEVVSEQSDTYQMRNSIATLV-EKSLKELVISILELAKVYNLYTGEIP----TMDEV 428 (502) Q Consensus 355 i~~~~g~s~~~~~~-~~~~~~tAtei~~~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~~~~~~~----~~~~i 428 (502) |....-... |+. ..+...|||||....+.+.+..+-.-..+ ...|.-|+.-+++++.-.+..+.... ....+ T Consensus 363 I~~af~~d~--~~~~~~~~~~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~R~~~il~r~g~lP~~p~~l~~~~~~~ 440 (549) T protein:vir:10 363 INQWFYVTL--FQILVDSGDMTATEVLQRAQEKGVLLAPTLGRTQSELLGPMIAREVDILAEAGQLPDMPQELIDAGADV 440 (549) T ss_pred HHHHHhhhh--hhhhcCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhhhcCCcee Confidence 433321111 111 12344799999988887777655432222 34444555444555444333332111 12235 Q ss_pred EEEeCCCccCC-HHHHH---HHHHHHHh--cCC-------CCHHHHHH---hcCCC------CHHHHHHHHHHHH----- Q lcl|NC_012753. 429 SVDLDDGVFTD-RNAEF---DYWSKMVA--AGF-------APKTMAIE---KTLNV------TKEQAQEIYQKIN----- 481 (502) Q Consensus 429 ~v~f~d~i~~d-~~~~~---~~~~~~~~--~Gi-------~S~et~l~---~~~~~------~deea~~el~ri~----- 481 (502) .|++--.+-.. ....+ .+..+.+. +++ +....++. ...|+ +++|+++..+.-+ T Consensus 441 ~i~yis~La~aq~~~~~~~i~~~~~~~~~laq~~Pe~ld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~~~~~qqq~ 520 (549) T protein:vir:10 441 DVEYDSPLNKAMRAGEGAAILQWLQQLGIVSQFDPAAAKVPNGARIARLLADYGGVPVEAMSTDEELQAQQAAEAQAAQM 520 (549) T ss_pred EEEeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhcCCHHHHHHHHHHhcCCCccccCCHHHHHHHHHHHHHHHHH Confidence 55553221110 00111 11111111 111 22223222 33443 4555544321111 Q ss_pred HhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 482 DETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 482 ~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++..++.+ ...+.+--.++ T Consensus 521 ~~~~~~a~--~a~~~a~~~~~ 539 (549) T protein:vir:10 521 QQMLAAAP--VAAGAIKDLSD 539 (549) T ss_pred HHHHHHHH--HHHHHHHhhhh Confidence 11111000 00011111111 No 129 >protein:vir:3361 Length: 535 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523332;genbank:gi:17570823;genbank:GeneID:927409 Probab=98.57 E-value=2.6e-07 Score=56.72 Aligned_cols=446 Identities=10% Similarity=0.053 Sum_probs=180.6 Q ss_pred hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHH Q lcl|NC_012753. 4 IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKV 83 (502) Q Consensus 4 ~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~ 83 (502) +...| +.+ +.....++-.+ .+..++-.....|+.++.=-.+-+.............++--+.+...++.+ T Consensus 1 m~~~~---~~~---~~~~~~~~r~~----~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~L 70 (535) T protein:vir:33 1 MADSK---RTG---LGEDGAKATYD----RLTNDRRAYETRAENCAQYTIPSLFPKESDNESTDYTTPWQAVGARGLNNL 70 (535) T ss_pred CChhh---hhc---cChhHHHHHHH----HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHH Confidence 11111 000 00000000000 122233333444444432211111111111111111222234556667776 Q ss_pred hhhhhcC--cce--E--eeCCH-------------HHHHH-------HHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE Q lcl|NC_012753. 84 ASLVFNE--QAT--I--RVDNE-------------VADAF-------INETLKNDKFSKNFERYLESCLALGGLAMRPYI 137 (502) Q Consensus 84 a~~l~~e--p~~--i--~~~d~-------------~~~e~-------l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~ 137 (502) |+.|.+- |+. | .+.+. ...++ +.+.+..++|...+.++.++..+.|.+++++-. T Consensus 71 aa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~~~~ 150 (535) T protein:vir:33 71 ASKLMLALFPMQSWMKLTISEYEAKQLVGDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFECLKQLIVAGNALLYLPE 150 (535) T ss_pred HHHHHHhhcCCCcccccccChHHHhccccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeEEeec Confidence 6666541 221 2 22221 12333 344477789999999999999999998887766 Q ss_pred eCC-ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEE-------------------eeCCCceEEEEEEEEEEeCCeEEE Q lcl|NC_012753. 138 DGD-QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTK-------------------TEGQKVKYYSLIEFHEWNKETYTI 197 (502) Q Consensus 138 d~~-~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~-------------------~~~~~~~~yt~~E~h~~~~~~~~I 197 (502) +++ .+++..+|-.+++- ..|..+....+|...... ...+...+|+++.+. .+++.+.. T Consensus 151 ~~~~~~~f~~~pl~~~~v-~~d~~G~vd~i~r~~~~t~~ql~~~~~~~~~~~~~~k~~~~~~~v~~~v~~~-~~~~~~~~ 228 (535) T protein:vir:33 151 PEGSYNPMKLYRLSSYVV-QRDAYGNVLQIVTRDQIAFGALPEDVRSAVEKSGGEKKMDEMVDVYTHVYLD-EESGDYLK 228 (535) T ss_pred CCCCceeeEEEEcCeeEE-eeCCCCCeeEEEeeEeecHHHHHHHhhhhhcccccccccccCCeEEEEEEee-CCCCcEEE Confidence 654 57888898888665 455555444544322111 011222344443221 11222332 Q ss_pred EEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH Q lcl|NC_012753. 198 SNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM 277 (502) Q Consensus 198 ~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~ 277 (502) .++++ |..++.. ... .+...-||++++- +...++.||+|-...+.+-+..|+..--... T Consensus 229 ~~~~~--------~~~~~~~------~~~---~~~~~~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 287 (535) T protein:vir:33 229 YEEVE--------DVEIDGS------DAT---YPTDAMPYIPVRM----VRIDGESYGRSYCEEYLGDLRSLENLQEAIV 287 (535) T ss_pred EEEEe--------Ccccccc------ccc---cccccCCceeeee----eecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 22221 1111111 000 1222344554442 2234678999999999999999998776666 Q ss_pred HHHh-hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHH Q lcl|NC_012753. 278 WEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFE 356 (502) Q Consensus 278 ~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~ 356 (502) .-.+ ..+..+.||.+... +...+.+. ....+.. ...++...+ .+...-+.....+.++.+.+.|. T Consensus 288 ~~~~~~~~p~~lv~~~g~~-----~~~~~~~~-----~~g~~v~---g~~~~v~~~-~~~~~~~~~~~~~~i~~~~~~I~ 353 (535) T protein:vir:33 288 KMSMISAKVIGLVNPAGIT-----QPRRLTKA-----QTGDFVP---GRREDIDFL-QLEKQADFTVAKAVSDQIEARLS 353 (535) T ss_pred HHHHHHhcCceeecccccc-----chhhcccC-----Cceeeec---CCcccceee-ecccccchhHHHHHHHHHHHHHH Confidence 6553 44445555433221 11111111 0111111 111111111 11111122333344444444443 Q ss_pred HhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCC Q lcl|NC_012753. 357 MQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL-VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDG 435 (502) Q Consensus 357 ~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~-~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~ 435 (502) ...=+. .+...++...|||||....+...+..+-.-.. =...|..|++-++.+....+..+.. +...++++|--+ T Consensus 354 ~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell~Pli~r~~~il~r~g~lP~~--p~~~v~~~yis~ 429 (535) T protein:vir:33 354 YAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPLVRVLLKQLQATSQIPEL--PKEAVEPTISTG 429 (535) T ss_pred HHHhhh--hcccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--CccceeEEEecH Confidence 332111 12223344579999998877776665552222 2333444554444444333333322 223356665432 Q ss_pred ccCC-HHHHHHHHHHHHh--cCC--------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhhhcccCCCCCc Q lcl|NC_012753. 436 VFTD-RNAEFDYWSKMVA--AGF--------APKTMAI---EKTLNV-------TKEQAQEIYQKINDETMVSTDSFRTS 494 (502) Q Consensus 436 i~~d-~~~~~~~~~~~~~--~Gi--------~S~et~l---~~~~~~-------~deea~~el~ri~~E~~~~~~~~~~~ 494 (502) +..- ....++...+... +++ +....++ ....|+ ++||+++..++..+.++.. . .-.. T Consensus 430 La~aqr~~~~~~l~~~~~~la~~~P~~~d~~id~d~~~~~~a~~~Gvp~~~i~~~~ee~~~~~~q~~~~~~~~-~-~~~~ 507 (535) T protein:vir:33 430 LEAIGRGQDLDKLERCISAWAALAPMQGDPDINLAVIKLRIANAIGIDTSGILLTDEQKQALMMQDAAQTGVE-N-AAAA 507 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhChhhhhccCCHHHHHHHHHHHcCCCHhHhcCCHHHHHHHHHHHHHHHHHH-H-HHHh Confidence 2111 0111111111110 111 1122222 222343 3444444333221111100 0 0001 Q ss_pred cccCCCCC Q lcl|NC_012753. 495 EEVDIYGE 502 (502) Q Consensus 495 ~~~~~~g~ 502 (502) .++.+-+- T Consensus 508 ~g~~~~~~ 515 (535) T protein:vir:33 508 GGAGVGAL 515 (535) T ss_pred hhhhhcch Confidence 11112111 No 130 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.57 E-value=2.7e-07 Score=56.58 Aligned_cols=400 Identities=13% Similarity=0.124 Sum_probs=171.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |++++++-++.++.. ....++ ..+ ..+..-+.|-.... ..... ..-+...--..+| T Consensus 1 M~~~~~~f~~~~r~~--~~~~~~---------~~~-------~~~~~~~~g~~~~~--~~v~~----~~al~~~~v~~~i 56 (429) T protein:vir:10 1 MDSVKKFFNFEKRQT--SQVIEL---------NKD-------DEKLLEWLGISPST--ISVKG----KNALKVATVFACI 56 (429) T ss_pred CchhhhhhcccccCc--cccccc---------CCC-------hHHHHHHhcCCCCc--ceech----hhhhccHHHHHHH Confidence 999999888766421 011111 111 11112222321110 00000 1112222223455 Q ss_pred HHHhhhhhcCcceEeeCC-----HHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDN-----EVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQ 148 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d-----~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~ 148 (502) +..|+-+-+=|..+--.+ +.....+..+|.. | ....-++.++...+..|.+|+.+..|. |.+ .+..++ T Consensus 57 ~~ia~~ia~l~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~i~ 136 (429) T protein:vir:10 57 KILSESVSKLPLKIYQEDEYGIQRGTKHYLNNLLRLRPNPYMSSMNFFGSLEAQKNLYGNSYANIEFDRKGKVQALWPID 136 (429) T ss_pred HHHHHhhccCceEEEEecCCceeeccccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEc Confidence 566665555455432111 1112234444431 1 233445566777888999999988875 443 677778 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++-+...+.+....-. ..+| ++ .. . |....+. +. T Consensus 137 ~~~v~v~~~~~~~~~~~~------------~~~~-------------~~-----~~-~----g~~~~~~------~~--- 172 (429) T protein:vir:10 137 ASKVTVYIDDVGLLNSKT------------KMWY-------------VV-----NT-G----GQQRVLK------PE--- 172 (429) T ss_pred CceeEEEEcCcccccccc------------eEEE-------------EE-----cc-C----CeEEEEc------cc--- Confidence 877665422211111000 0000 00 00 0 1111000 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhcc-ceeeechHHhccCCCCCCcccCc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQ-RRVAVPTQMIKTEYDTNGEKVTV 307 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~-~~i~v~~~~l~~~~~~~g~~~~~ 307 (502) -.++|+.+.. .+...|+|.+..+...++....+.....+-|+.+. ..-+ +.....-+.... T Consensus 173 -------evih~~~~~~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i-----l~~~~~l~~e~~-- 234 (429) T protein:vir:10 173 -------EILHFKNGIT----LDGLVGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGL-----VQYVGDLNEDAK-- 234 (429) T ss_pred -------cEEEecCCCC----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE-----EEcCCCCCHHHH-- Confidence 1234443211 23356889988888777766554444444455443 2322 221111100000 Q ss_pred ccccccc-chhhccccCCC----CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHH Q lcl|NC_012753. 308 KREFETG-HNVYEQFDSGD----MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVS 381 (502) Q Consensus 308 ~~~~~~~-~~~~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~ 381 (502) ..+... ...+....... -+.+..++.++.....-++.+..+....+|+...|+|+..+|...++. +++.+... T Consensus 235 -~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~ 313 (429) T protein:vir:10 235 -KVFRENFESMSSGLQNSHRIALMPVGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQ 313 (429) T ss_pred -HHHHHHHHHHhccccccCceeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH Confidence 000000 01111100000 011223555554445567778888889999999999999998654432 23333221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC-CCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH Q lcl|NC_012753. 382 EQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYT-GEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM 460 (502) Q Consensus 382 ~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~-~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et 460 (502) . .++.+|..++..|....+. .++. ........+.++++.-+..|..+.++...+++.+|+|+.-+ T Consensus 314 ~-------------f~~~~l~P~~~~ie~~ln~-kl~~~~~~~~g~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE 379 (429) T protein:vir:10 314 Q-------------FYTDTLQATLTMYEQEMTY-KLFLDSELDKGFYSKFNVDAILRADIKTRYEAYRTGIQGGFLKPNE 379 (429) T ss_pred H-------------HHHHHHHHHHHHHHHHHHH-hhcChhhcCCCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCHHH Confidence 1 1233333333333322211 1111 11112233555555556678999999999999999999998 Q ss_pred HHHhcCCCCH-HHHHHHH-----HHHHH-hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 461 AIEKTLNVTK-EQAQEIY-----QKIND-ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 461 ~l~~~~~~~d-eea~~el-----~ri~~-E~~~~~~~~~~~~~~~~~g~ 502 (502) +++.+ |+.. +..++.+ ..+.. .+....+...+....+=.+| T Consensus 380 ~R~~~-gl~p~~ggD~~~~~~n~~~~d~~~~~~~k~g~~~~~~~~~~~e 427 (429) T protein:vir:10 380 ARSKE-DLPPEAGGDRLLVNGNMLPIDMAGQAYLKGGDTNGEVSKEGNE 427 (429) T ss_pred HHHHh-CCCCCCCcCeeeecccccchhhccccccCCCCCCCCCCCCCCC Confidence 76653 4322 1111111 11110 00000011111111111122 No 131 >protein:vir:1785 Length: 555 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570351;genbank:gi:18640510;genbank:GeneID:932723 Probab=98.56 E-value=2.8e-07 Score=56.54 Aligned_cols=445 Identities=8% Similarity=0.032 Sum_probs=172.0 Q ss_pred HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh Q lcl|NC_012753. 7 IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL 86 (502) Q Consensus 7 ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~ 86 (502) +|.-.++.+. .+..++-.....|+.+|.=-.+-+.............++--+.+...++.+|+. T Consensus 1 m~~~~~~r~~----------------~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~~~Laa~ 64 (555) T protein:vir:17 1 MKHSAQAKYM----------------MLRADREDYLDSGRQSARLTLPYILTDEGHVQGGYLPTPWQSVGSKGVNVLASK 64 (555) T ss_pred ChhHHHHHHH----------------HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHHHHHHHH Confidence 2222222111 112233233444444432211111111111122222334446677788888877 Q ss_pred hhcC--cc-----eEeeCCH---------HHHH-----------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Q lcl|NC_012753. 87 VFNE--QA-----TIRVDNE---------VADA-----------FINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG 139 (502) Q Consensus 87 l~~e--p~-----~i~~~d~---------~~~e-----------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~ 139 (502) |.+- || ++.+.+. .... .+...+..++|...+.++..+....|.+++ |.++ T Consensus 65 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--y~~~ 142 (555) T protein:vir:17 65 LMLSLFPVNTSFFKLQINDAEIDNLGMDEQARSEIDLSLSRIERIVTQDIAESSDRVHLEMAMKHLIVTGNALL--YQGK 142 (555) T ss_pred HHHhhcCCCCcccccccCHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE--EecC Confidence 7652 22 1333321 1222 233345568999999999999999999764 6666 Q ss_pred CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee---CCCceEE------EEEEEEEEeCCeEEEEEEEEecCCcc-- Q lcl|NC_012753. 140 DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE---GQKVKYY------SLIEFHEWNKETYTISNELYESESKT-- 208 (502) Q Consensus 140 ~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~---~~~~~~y------t~~E~h~~~~~~~~I~~~l~~~~~~~-- 208 (502) +.++ .+|-.+++ +..|..+....+|........ ..-+..+ ...+ .-.+..+.+.|.++...... T Consensus 143 ~~~~--~~pl~~y~-v~~d~~G~vd~v~rk~~~t~~ql~~~fg~~~l~~~~~~~~~--~~~d~~~~~~~~~~~~~~~~~~ 217 (555) T protein:vir:17 143 KNLK--LYPLDRFV-VSRDGEGNVMEIVTEEQIDRSLLPEEFQKVGGLEGAPDSNA--VGEDGPKMGVTAPGGRDKGKSN 217 (555) T ss_pred Ccee--EEEcCeEE-EeeCCCcCeeEEEeeeeecHHHHHHHhhhccccchhhhhhh--ccccchhhhhhhhcccccCCCc Confidence 6544 45656644 455655554554432211100 0000000 0000 00000000101000000000 Q ss_pred ----------ccCceeecccc-ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH Q lcl|NC_012753. 209 ----------IIGQRVPLSTL-YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM 277 (502) Q Consensus 209 ----------~lG~~v~l~~~-~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~ 277 (502) ..|...-..++ -+.+.......|+..-||+.++-+ ...++.||+|--..+.+-+..|+..--... T Consensus 218 ~~~v~t~~~~~~~~~~~~~e~~~~~v~~~l~e~g~~e~P~i~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l 293 (555) T protein:vir:17 218 DALVYTYVCRKDGQVKWHQECDGKVIPGSNSSAPYTHNPWIPLRFN----IVDGEAYGRGRVEEFMGDLKSLEALSQAMV 293 (555) T ss_pred ceeEeecccccCCeeEEEEecCceeccccccccCcccCCeeeeeee----ecCCCccccchHHHHHHHHHHHHHHHHHHH Confidence 00000000000 000011111123334455555432 234678999999999999999998766666 Q ss_pred HHHh-hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec----cccc-hHHHHHHHHHH Q lcl|NC_012753. 278 WEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT----TDIR-SDDYIKAINKG 351 (502) Q Consensus 278 ~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~----~~ir-~e~~~~~l~~~ 351 (502) .-.+ ..+..+.||.+... +...+.+.+ ...+. .. ....++.++ .++. ..+-++.+..- T Consensus 294 ~~~~~~~~pp~lv~~~g~~-----~~~~l~~~~-----~g~v~---~g---~~~~v~~~~~~~~~~~~~~~~~i~~~~~~ 357 (555) T protein:vir:17 294 EGSAASAKVVFMVSPSATT-----KPQNLALAA-----NGAII---QG---RPDDVSVVQANKAADFRTVLEMIQKLEQR 357 (555) T ss_pred HHHHHHhCCceeecccccc-----CcceeecCC-----Cceee---cC---CcccceeeeccccchhhHHHHHHHHHHHH Confidence 5543 44555555443221 111111111 01110 01 111122221 1221 12223333333 Q ss_pred HHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccCCCcccccceEE Q lcl|NC_012753. 352 LSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV-EKSLKELVISILELAKVYNLYTGEIPTMDEVSV 430 (502) Q Consensus 352 l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v 430 (502) ++.+.+. + +..++...|||||..+.+...+..+-.-..+ ...|.-|+.-+++++.-.+..+.......++++ T Consensus 358 I~~aFm~--~-----~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~L~Pli~R~~~il~r~g~lP~~p~~~v~~~i 430 (555) T protein:vir:17 358 ISDAFLM--L-----QVRQSERTTATEVQATVQELNEQIGGIYSNLTTELLQPYLARKLHLLQKQRKLPQLPKDLVQPTV 430 (555) T ss_pred HHHHHhh--c-----CCCCcccchHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCCHhhhccce Confidence 3332221 1 1223445699999988877776666543333 344555666556665544443332222223333 Q ss_pred EeCCCccCCHHHHHHHHHHHHh--cCC---------CCHHHH---HHhcCCC-------CHHHHHHHHHHHHHhhhc--- Q lcl|NC_012753. 431 DLDDGVFTDRNAEFDYWSKMVA--AGF---------APKTMA---IEKTLNV-------TKEQAQEIYQKINDETMV--- 486 (502) Q Consensus 431 ~f~d~i~~d~~~~~~~~~~~~~--~Gi---------~S~et~---l~~~~~~-------~deea~~el~ri~~E~~~--- 486 (502) .=.- ......+.++...+.++ +.+ +....+ +...+|+ ++||+++..+..++++++ T Consensus 431 ~~~l-~~l~r~~~~~~l~~~~~~laq~~~~p~~~d~id~d~~~~~~a~~~Gv~p~~ivrs~eev~~~rq~~~~~~~q~~~ 509 (555) T protein:vir:17 431 VAGL-WGVGRGQDKQQLMEFITTLAQTMGPEIAMKYINPTEFIKRLAAAQGIDTLQLINSPETMKQLGDQQKQDMVQASL 509 (555) T ss_pred eehH-HHHHHHHHHHHHHHHHHHHHhhcCchhHhhcCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 2110 00011112222211110 011 222222 2344455 555554432221111110 Q ss_pred ------ccCCC---CCccccCCCCC Q lcl|NC_012753. 487 ------STDSF---RTSEEVDIYGE 502 (502) Q Consensus 487 ------~~~~~---~~~~~~~~~g~ 502 (502) .++.. .....+-.-|. T Consensus 510 ~~qa~~~~~~~~~~~~~~~~~~~~~ 534 (555) T protein:vir:17 510 INQAGQLAKTPMAEQAMQLIQQQQE 534 (555) T ss_pred HHHHHHHHhhhhhhhHHhccccchh Confidence 00100 00000000000 No 132 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=98.44 E-value=6.6e-07 Score=54.46 Aligned_cols=378 Identities=9% Similarity=0.017 Sum_probs=162.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccc-cccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQV-KRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-~~~~~~~n~~k~i 79 (502) |+|++. +++............ +..... -.....++.+.. +... .+.-+..+--..+ T Consensus 3 m~~~~~----~~~~~~~~~~~~~~~----~~~~~~------~~~~~~~~~~~~---------g~~v~~~~al~~~~v~~~ 59 (392) T protein:vir:74 3 LPILNF----INQTNDPPEAGSVQS----YFPDGN------DAQIMESLLGDN---------NEWVSARAALRNSDLFSI 59 (392) T ss_pred chhhhh----hhcccCccccccccc----ccccCc------hhhhhhhccCCC---------CcccchhhhhcchHHHHH Confidence 888854 443211111111000 000000 000111222210 1000 0111222223445 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQA 157 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~ 157 (502) |+.+|+-+-+=|+ .+........+++-...-....-....+...+..|.+|+.+..|. |.+ .+..++|+++-+... T Consensus 60 v~~ia~~ia~lp~--~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~ 137 (392) T protein:vir:74 60 ILQLSSDLAIVKI--NAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYF 137 (392) T ss_pred HHHHHHhhccCce--eeccchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEc Confidence 6666666644444 444443333443322222234445556678888999998887775 443 677777777655432 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) ..++.. ++++. ..++.. +..+.+ + ++ -+ T Consensus 138 ~~~~~~----~y~~~-----------------~~~~~~---------------~~~~~~---~---~~----------ev 165 (392) T protein:vir:74 138 EYENGM----YYNIT-----------------FDDPKI---------------EPILQA---P---QS----------DL 165 (392) T ss_pred CCCceE----EEEEE-----------------ecCCcc---------------ceeEEE---c---Cc----------cE Confidence 222110 11100 000000 000000 0 00 02 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchh Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNV 317 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~ 317 (502) ++|+.+.. .+...|+|-+..+...|+....+-.-..+-|+.+...-.+ |+...+. ... ...+. .-... T Consensus 166 ih~~~~~~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~i----l~~~~~~-~~~-~~~~~--~~~~~ 233 (392) T protein:vir:74 166 IHMKLLSI----DGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGV----LTVKGGG-LLS-DKDKA--SRSRS 233 (392) T ss_pred EEecCCCC----CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCC-Cch-HHHHH--HHHHH Confidence 33433211 1335699998888888755444433333345654432222 2211110 000 00000 00000 Q ss_pred hccccCCC----CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDSGD----MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 318 ~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~ 393 (502) +....... -+.+.-++.++.....-++.+..+....+|+...|++|..+|+.+....++++.+.. T Consensus 234 ~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~e~~~~~----------- 302 (392) T protein:vir:74 234 FMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISGM----------- 302 (392) T ss_pred HhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHH----------- Confidence 11100000 012223555665556667888888889999999999999998755444333333222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKE 471 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~de 471 (502) ++..|..+++.|..-.+.. +. ..+.+++..-+-.|..+.+..+.+++.+|+++..++.+.. -|+..+ T Consensus 303 ---~~~~l~p~~~~ie~~l~~~-l~-------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~near~~~~~~g~~pn 371 (392) T protein:vir:74 303 ---YASALNRYLRPAISELEYK-LS-------DHISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK 371 (392) T ss_pred ---HHHHHHHHHHHHHHHHHHh-cc-------chhcccchhhhcCCHHHHHHHHHHHHhCCCcCHHHHHHHHHhCCCCcc Confidence 2223333333222111110 10 0122222222335667788888999999999999876543 466655 Q ss_pred HHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 472 QAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 472 ea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |+.+. |.... .+++| -.| T Consensus 372 e~r~~------enl~~------~~~Gd-~~~ 389 (392) T protein:vir:74 372 DLPAP------ENTNK------KTTGQ-SNE 389 (392) T ss_pred ccchh------cCCCC------CCCCC-CCC Confidence 44321 21111 11111 122 No 133 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=98.42 E-value=7.4e-07 Score=54.20 Aligned_cols=438 Identities=12% Similarity=0.077 Sum_probs=179.5 Q ss_pred CChhHHHHHHHHHHhh-c----c-cccchhhhh-------ccc-cccCCHHHHHHHHHHHHHhc---CCCCccccccCCC Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNY-V----I-TNQSLNSIT-------DHP-KIAISPEEYNRIMDNLRYFA---GDFDSVTYRDSNG 63 (502) Q Consensus 1 m~~~~~ik~~i~~~~~-~----~-~~~~l~~i~-------~~~-~~~~~~~~~~~i~~~~~~Y~---g~~~~~~~~~~~~ 63 (502) -+|+++|..+.|+-.. + + -.|-|+.-- ... ...-++.+...-+.....-. |..+.+. ...+- T Consensus 8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~d~~~~~~~r~g~~~~~~~~g~~~~~e-pp~d~ 86 (648) T protein:vir:79 8 RGFWSRISLMWRDEDDDKEPLVLEESMQLGEAPGAMPKGGGGGGSAKRDPKMSLVKRIGLAIMDGGGGGRDFEE-PEFDF 86 (648) T ss_pred chhhhhhhhhccCccccccccccccccccCCCccccCCCCcccccccccchhHHHHHhHHHHHhhcCCcccccc-CCcCH Confidence 6889999998882110 0 0 001111000 000 00011222222221222111 2212111 11111 Q ss_pred ccccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHH---HHHHhh---ccHHHHHHHHHHHHhhcCCEEEEEEE Q lcl|NC_012753. 64 SQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFI---NETLKN---DKFSKNFERYLESCLALGGLAMRPYI 137 (502) Q Consensus 64 ~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l---~~~~~~---~~f~~~~~~~~~~~~~~G~~~~~~~~ 137 (502) ....+.....++....|+.+|.-+.+-|..+..++....+.. ...+.- ....+.+..++.+.+..|.+|+.+.. T Consensus 87 ~~l~~l~~~np~V~~aI~iia~~ia~l~~~i~~~~~~~~~~~~~~~ll~rPn~~~t~~~f~~~l~~~lll~GNAYveiiR 166 (648) T protein:vir:79 87 NEITSAYNTEGYVRQAVDKYIEMMFKADWDFVSKNPNAVEYIRMRFTLMAEATQIPTNQLFIEIAEDLVKYCNVVIAKSR 166 (648) T ss_pred HHHHHHHhcChHHHHHHHHHHHHHhhCcceEEecCCccchhhHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCeEEEEEe Confidence 111122223455677888888888887777766543222111 111121 13445566678888899999998877 Q ss_pred eCCceEEEEE---cC------CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCcc Q lcl|NC_012753. 138 DGDQIRVSFV---QA------TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKT 208 (502) Q Consensus 138 d~~~~~i~~v---~~------~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~ 208 (502) ++++.....+ .. ...+|+..+ .+. + .. +.. +.+....|..... T Consensus 167 d~~G~~~~~l~~~~~~~~~~v~~l~pl~p~--~v~---v----~~--d~~----------------g~~~~Y~y~~~g~- 218 (648) T protein:vir:79 167 AKDALPFQGMNVMGVGDSMPVAGYFPLNLA--SMK---V----KR--DKF----------------GMIKGWQQEQEGQ- 218 (648) T ss_pred cCCCccchhhhhhhhccccceeeeEeecCc--eeE---E----EE--cCC----------------CceeeeEEEecCC- Confidence 7644221111 11 122232110 000 0 00 000 0111111111100 Q ss_pred ccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceee Q lcl|NC_012753. 209 IIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVA 288 (502) Q Consensus 209 ~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~ 288 (502) +..+++. +. ..++|+.+ ...+.++|+|.+..+...|.....+-.-..+-|..+...-. T Consensus 219 --~~~~~~~------~~----------dIIHik~~----~~~d~~~GlSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P~g 276 (648) T protein:vir:79 219 --DKPQKFK------PE----------DIVHIYYK----REKGRAFGTPWLLPALDDIRALRQVEENVLRLVYRNLHPLW 276 (648) T ss_pred --ceeEEec------Cc----------cEEEEccC----CCCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccE Confidence 1111110 00 12344421 12345689999888877775544333222233454443322 Q ss_pred echHHhccCCCCCCcccCccccc-cccchhhccccCCCCccccceeeecccc--chHHHHHHHHHHHHHHHHhcCCChhh Q lcl|NC_012753. 289 VPTQMIKTEYDTNGEKVTVKREF-ETGHNVYEQFDSGDMDKGIGITDLTTDI--RSDDYIKAINKGLSLFEMQLGVSTGM 365 (502) Q Consensus 289 v~~~~l~~~~~~~g~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~i~~~~~~i--r~e~~~~~l~~~l~~i~~~~g~s~~~ 365 (502) + ++...+... ....+.. ..-...+......++........+.+.. ..-++++..+...++|+...|+||.. T Consensus 277 i----l~~~~~~~~--~e~~k~~~e~~~~~~~~~~i~gg~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~l 350 (648) T protein:vir:79 277 H----VKVGLEQEG--FGAEEGEVDLVRGEVENMDVEGGMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELM 350 (648) T ss_pred E----EEeCCCccc--hHHHHHHHHHHHHhcccccccccccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhH Confidence 2 321111110 0000000 0000111111111221111111112211 23357777788889999999999999 Q ss_pred cccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHH Q lcl|NC_012753. 366 FSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELV-ISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAE 443 (502) Q Consensus 366 ~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~-~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~ 443 (502) +|...++. .|+.+....++. .+.-.+..+...+...+ +.++ +.. .++........+.|.|++-...|..+. T Consensus 351 LG~~~~ss~stae~~~~~~~~---~i~~l~~~i~~~le~~~~~~ll-~e~---~l~~~l~~d~~ieF~~~~Llr~D~~~~ 423 (648) T protein:vir:79 351 MGRGGTASRSTGDNLSSDFKD---RIKALQKVMATFINEFMVKEIL-MEG---GFDPVLNPDDKVEFRFNEIDMDSKIKL 423 (648) T ss_pred cccCCCccchHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHh-hhh---hccccccccceEEEeecccchhhHHHH Confidence 99765433 344443333322 22223333333333211 1111 111 111122234567888888788888888 Q ss_pred HHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHH----HHHHhhhc--ccCCCCCccccCCCCC Q lcl|NC_012753. 444 FDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEIYQ----KINDETMV--STDSFRTSEEVDIYGE 502 (502) Q Consensus 444 ~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~el~----ri~~E~~~--~~~~~~~~~~~~~~g~ 502 (502) ++...+++.+|+||.-++++.. +|+.+.+-...+. ....+..+ ..+........+=.|| T Consensus 424 a~~~~~l~~~GilT~NEaR~~lGlpPi~~g~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~e 490 (648) T protein:vir:79 424 ENQAVFLYEHNAISEDEMRELIGRDPVDDGEGRAKMHLQMVTIAQATALAALAPTPAGGSSASASGD 490 (648) T ss_pred HHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCccccccccccchhccccccCCCCCCCCCCCCcccc Confidence 8889999999999999977653 2443321111111 11111111 1111111111111222 No 134 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.42 E-value=7.4e-07 Score=54.18 Aligned_cols=392 Identities=9% Similarity=0.067 Sum_probs=167.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||+++ +++-.- .. ..+ ...+..++.+......-..+.. ..-+...--...| T Consensus 1 Mg~f~~l---f~r~~~----~~----------~~~------~~~~~~~~~~~~~~~~g~~v~~----~~al~~~~v~~~i 53 (414) T protein:vir:44 1 MVFFSGL---FQRKSD----AP----------VTT------PAELADAIGLSYDTYTGKQISS----QRAMRLTAVFSCV 53 (414) T ss_pred Cchhhhh---hccCcc----Cc----------ccc------hhhHhHhhccCccccCCceech----hhhhccHHHHHHH Confidence 9999754 443100 00 001 1112233322222111111111 1112222234556 Q ss_pred HHHhhhhhcCcceEee-CCH----HHHHHHHHHHh-----hccHHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEcC Q lcl|NC_012753. 81 KKVASLVFNEQATIRV-DNE----VADAFINETLK-----NDKFSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQA 149 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~-~d~----~~~e~l~~~~~-----~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~~ 149 (502) +..|+-+.+=|+.+-- +++ .....+..+|. ......-+..++...+..|.+|+.+..+.|++ .+..++| T Consensus 54 ~~Ia~~ia~~p~~~~~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gna~~~i~~~~g~~~~L~~l~~ 133 (414) T protein:vir:44 54 RVLAESVGMLPCNLYHLNGSLKQRATGERLHKLISTHPNGYMTPQEFWELVVTCLCLRGNFYAYKVKAFGEVAELLPVDP 133 (414) T ss_pred HHHHHHhccCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcC Confidence 6666666665655421 111 11122233332 12334445556667778899988877666665 5666777 Q ss_pred CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceee Q lcl|NC_012753. 150 TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTL 229 (502) Q Consensus 150 ~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~ 229 (502) ..+-+...+++.. .|. ++ ..++... .++- . T Consensus 134 ~~v~~~~~~~~~~------------------~y~---~~-~~~g~~~----------------~~~~--------~---- 163 (414) T protein:vir:44 134 GCVVPKLNSSWEP------------------VYQ---VT-FPDGSTD----------------VLSQ--------E---- 163 (414) T ss_pred ceEEEEECCCCcE------------------EEE---EE-ecCceEE----------------EEcc--------c---- Confidence 7765543222211 010 00 0000000 0000 0 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR 309 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 309 (502) -.++|+.+. .+...|+|.+.-+...++....+-.-..+-|..+...-.+ +.....-+.... . T Consensus 164 ------evih~~~~~-----~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~l~~e~~---~ 225 (414) T protein:vir:44 164 ------DIWHVRTLT-----LDGLVGLNPIAYAREAISLAAATEEHGARLFSNGAVTSGV----LRTEQTLSDQAY---E 225 (414) T ss_pred ------cEEEecCCC-----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCCCHHHH---H Confidence 013344221 1235799988887777765554433333344543332122 222111110000 0 Q ss_pred ccccc-chhhccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHH Q lcl|NC_012753. 310 EFETG-HNVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQ 383 (502) Q Consensus 310 ~~~~~-~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~ 383 (502) .+... ...+..... .--+.+.-++.++.....-++.+..+....+|+...|++|..++...++. +++++.... T Consensus 226 ~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~- 304 (414) T protein:vir:44 226 RLKKDFEERHTGLGNAHRPMILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLG- 304 (414) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH- Confidence 00000 011111000 00011223555665556667888888888999999999999998754432 333333211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) .++.+|..+++.|-...+. .++.........+.|+++.-+..|..+.++...+++.+|++++-++++ T Consensus 305 ------------~~~~~l~P~~~~ie~~ln~-~L~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~ 371 (414) T protein:vir:44 305 ------------FINYSLVPYLTRIEQRINT-GLVRKSKQGVFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRD 371 (414) T ss_pred ------------HHHHHHHHHHHHHHHHHHh-hcCCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 1233444444433322221 112111111223455555556678889999999999999999999765 Q ss_pred hcCCCCHH-HHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 464 KTLNVTKE-QAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) . .|++.- ..++-+...........+......+.+-.++ T Consensus 372 ~-~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~d 410 (414) T protein:vir:44 372 L-EDMNPRPGGDVYLTPMNMTTKPSDGSKAGKQKDNANAD 410 (414) T ss_pred H-hCCCCCCCcceecccccccccCCccccCCCCCCCCCCC Confidence 4 455431 1111111111000000000000000000111 No 135 >protein:vir:107404 Length: 555 # NCBI annotation: Bbp21 # Family: family:all:481 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958690;genbank:gi:41179382;genbank:GeneID:2717198 Probab=98.38 E-value=9.6e-07 Score=53.57 Aligned_cols=463 Identities=12% Similarity=0.045 Sum_probs=191.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCc---cccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDS---VTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~---~~~~~~~~~~~~~~~~~~n~~k 77 (502) |-=-...+.+ ..++. .+..++...-..|+.+|.=-.+- +.............++--+.+. T Consensus 1 M~~~~~~~~l----------------~~r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~ 63 (555) T protein:vir:10 1 MAEQTERKLL----------------LSRWG-QLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGT 63 (555) T ss_pred CCCcccHHHH----------------HHHHH-HHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHH Confidence 2222211222 11111 22233333344444443221111 1111111122223344456677 Q ss_pred HHHHHHhhhhhcCcc-------eEeeCCH------HHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE Q lcl|NC_012753. 78 TASKKVASLVFNEQA-------TIRVDNE------VADA-------FINETLKNDKFSKNFERYLESCLALGGLAMRPYI 137 (502) Q Consensus 78 ~iv~~~a~~l~~ep~-------~i~~~d~------~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~ 137 (502) ..++.+|+.|.+-.. .+.+.+. .+.+ .+.+.|..++|...+.++..+..+.|.+.+.+-. T Consensus 64 ~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~ 143 (555) T protein:vir:10 64 RALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP 143 (555) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec Confidence 888888877765221 1333321 2333 3345677889999999999999999999987666 Q ss_pred eC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEE----------EEEEEEEeCCeEEEEEEEEecCC Q lcl|NC_012753. 138 DG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYS----------LIEFHEWNKETYTISNELYESES 206 (502) Q Consensus 138 d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt----------~~E~h~~~~~~~~I~~~l~~~~~ 206 (502) |+ +.+++..++..+++- ..|..+....+| +++...-.+-...|- .++.- ..+....|.|.+|--.+ T Consensus 144 d~~~~~rf~~~pl~~~~v-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~-~~~~~v~v~~~V~pr~~ 220 (555) T protein:vir:10 144 DFDAVVYHHSLTAGEYAI-AADNQGRVNTLY-REFQITVAQMVREFGKDKCSTTVQSLFDRG-ALEQWVTVIHAIEPRAD 220 (555) T ss_pred CCCceEEEEEeecceeEE-eeCCCCCEEEEE-EEEeccHHHHHHhcCcccCCHHHHHHHhcC-CCCceEEEEEEEeeccC Confidence 64 457888899999876 455555444443 332111000000000 00000 00011233344432111 Q ss_pred c--ccc-Cceeeccccc-c-CCCcc--eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 207 K--TII-GQRVPLSTLY-E-DLEET--VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE 279 (502) Q Consensus 207 ~--~~l-G~~v~l~~~~-~-~l~~~--~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~ 279 (502) . ... +.-.|..++| + +.... ....|+..-||++++-+ ...++.||+|--..+.+-+..|+..--..... T Consensus 221 ~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~----~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~ 296 (555) T protein:vir:10 221 RDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWA----LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQA 296 (555) T ss_pred cCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 1 111 1112333221 1 11111 12234444566655532 23467899999999999999999755555444 Q ss_pred Hhh-ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCcccccee-eeccccchHHHHHHHHHHHHHHHH Q lcl|NC_012753. 280 VKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGIT-DLTTDIRSDDYIKAINKGLSLFEM 357 (502) Q Consensus 280 ~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~ir~e~~~~~l~~~l~~i~~ 357 (502) .+. .+..+.||.+... ......|+.. ..+ . .+..+..+. .+++......-.+.++.+.+.|.. T Consensus 297 ~~~~~~pp~~v~~~~~~-----~~~~~~pgg~-----~~v---~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~ 361 (555) T protein:vir:10 297 IDYKSNPPLQLPVSAKN-----QDISTVPGGL-----SYV---D--AAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKA 361 (555) T ss_pred HHHHhcCceeecccccc-----ccceeccccc-----ccc---c--cCCCCcceecccccccchHHHHHHHHHHHHHHHH Confidence 433 3334445444311 1111111111 111 1 111111111 122222223333445555555543 Q ss_pred hcCCC-hhhccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCcc--cccceEEEeC Q lcl|NC_012753. 358 QLGVS-TGMFSFDGKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYNLYTGEIP--TMDEVSVDLD 433 (502) Q Consensus 358 ~~g~s-~~~~~~~~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~--~~~~i~v~f~ 433 (502) ..=.+ ...++..+....|||||....+...+..+- ..+.-...|.-|+.-++.++.-.+..+.... ...+++|++- T Consensus 362 af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yi 441 (555) T protein:vir:10 362 SFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFV 441 (555) T ss_pred HhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEec Confidence 32111 112333445567999999887777666655 2333344555555545555443333322111 1123556554 Q ss_pred CCccCCHH-HH---HHHHHHHHh--cCC-------CCHHHH---HHhcCCC------CHHHHHHHHHH-HHHhhhc---- Q lcl|NC_012753. 434 DGVFTDRN-AE---FDYWSKMVA--AGF-------APKTMA---IEKTLNV------TKEQAQEIYQK-INDETMV---- 486 (502) Q Consensus 434 d~i~~d~~-~~---~~~~~~~~~--~Gi-------~S~et~---l~~~~~~------~deea~~el~r-i~~E~~~---- 486 (502) -.+-.... .. +....+.+. +++ +....+ +....|+ +++|+++..+. .++++.+ T Consensus 442 s~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~ 521 (555) T protein:vir:10 442 SMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAA 521 (555) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Confidence 33221100 00 111111110 222 122222 3334454 34444332111 1111100 Q ss_pred c----cCCCCCccccCCCCC Q lcl|NC_012753. 487 S----TDSFRTSEEVDIYGE 502 (502) Q Consensus 487 ~----~~~~~~~~~~~~~g~ 502 (502) . ......-++++.-++ T Consensus 522 ~~~q~~~~~~~~~~~~~~~~ 541 (555) T protein:vir:10 522 LLNQGADTAAKLGSVDTSKQ 541 (555) T ss_pred HHHHHHHHHHHhcccccCcc Confidence 0 000111122222222 No 136 >protein:vir:98506 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996583;genbank:gi:45569514;genbank:GeneID:2767834 Probab=98.38 E-value=9.6e-07 Score=53.57 Aligned_cols=463 Identities=12% Similarity=0.045 Sum_probs=191.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCc---cccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDS---VTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~---~~~~~~~~~~~~~~~~~~n~~k 77 (502) |-=-...+.+ ..++. .+..++...-..|+.+|.=-.+- +.............++--+.+. T Consensus 1 M~~~~~~~~l----------------~~r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~ 63 (555) T protein:vir:98 1 MAEQTERKLL----------------LSRWG-QLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGT 63 (555) T ss_pred CCCcccHHHH----------------HHHHH-HHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHH Confidence 2222211222 11111 22233333344444443221111 1111111122223344456677 Q ss_pred HHHHHHhhhhhcCcc-------eEeeCCH------HHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE Q lcl|NC_012753. 78 TASKKVASLVFNEQA-------TIRVDNE------VADA-------FINETLKNDKFSKNFERYLESCLALGGLAMRPYI 137 (502) Q Consensus 78 ~iv~~~a~~l~~ep~-------~i~~~d~------~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~ 137 (502) ..++.+|+.|.+-.. .+.+.+. .+.+ .+.+.|..++|...+.++..+..+.|.+.+.+-. T Consensus 64 ~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~ 143 (555) T protein:vir:98 64 RALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP 143 (555) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec Confidence 888888877765221 1333321 2333 3345677889999999999999999999987666 Q ss_pred eC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEE----------EEEEEEEeCCeEEEEEEEEecCC Q lcl|NC_012753. 138 DG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYS----------LIEFHEWNKETYTISNELYESES 206 (502) Q Consensus 138 d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt----------~~E~h~~~~~~~~I~~~l~~~~~ 206 (502) |+ +.+++..++..+++- ..|..+....+| +++...-.+-...|- .++.- ..+....|.|.+|--.+ T Consensus 144 d~~~~~rf~~~pl~~~~v-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~-~~~~~v~v~~~V~pr~~ 220 (555) T protein:vir:98 144 DFDAVVYHHSLTAGEYAI-AADNQGRVNTLY-REFQITVAQMVREFGKDKCSTTVQSLFDRG-ALEQWVTVIHAIEPRAD 220 (555) T ss_pred CCCceEEEEEeecceeEE-eeCCCCCEEEEE-EEEeccHHHHHHhcCcccCCHHHHHHHhcC-CCCceEEEEEEEeeccC Confidence 64 457888899999876 455555444443 332111000000000 00000 00011233344432111 Q ss_pred c--ccc-Cceeeccccc-c-CCCcc--eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 207 K--TII-GQRVPLSTLY-E-DLEET--VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE 279 (502) Q Consensus 207 ~--~~l-G~~v~l~~~~-~-~l~~~--~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~ 279 (502) . ... +.-.|..++| + +.... ....|+..-||++++-+ ...++.||+|--..+.+-+..|+..--..... T Consensus 221 ~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~----~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~ 296 (555) T protein:vir:98 221 RDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWA----LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQA 296 (555) T ss_pred cCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 1 111 1112333221 1 11111 12234444566655532 23467899999999999999999755555444 Q ss_pred Hhh-ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCcccccee-eeccccchHHHHHHHHHHHHHHHH Q lcl|NC_012753. 280 VKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGIT-DLTTDIRSDDYIKAINKGLSLFEM 357 (502) Q Consensus 280 ~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~ir~e~~~~~l~~~l~~i~~ 357 (502) .+. .+..+.||.+... ......|+.. ..+ . .+..+..+. .+++......-.+.++.+.+.|.. T Consensus 297 ~~~~~~pp~~v~~~~~~-----~~~~~~pgg~-----~~v---~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~ 361 (555) T protein:vir:98 297 IDYKSNPPLQLPVSAKN-----QDISTVPGGL-----SYV---D--AAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKA 361 (555) T ss_pred HHHHhcCceeecccccc-----ccceeccccc-----ccc---c--cCCCCcceecccccccchHHHHHHHHHHHHHHHH Confidence 433 3334445444311 1111111111 111 1 111111111 122222223333445555555543 Q ss_pred hcCCC-hhhccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCcc--cccceEEEeC Q lcl|NC_012753. 358 QLGVS-TGMFSFDGKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYNLYTGEIP--TMDEVSVDLD 433 (502) Q Consensus 358 ~~g~s-~~~~~~~~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~--~~~~i~v~f~ 433 (502) ..=.+ ...++..+....|||||....+...+..+- ..+.-...|.-|+.-++.++.-.+..+.... ...+++|++- T Consensus 362 af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yi 441 (555) T protein:vir:98 362 SFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFV 441 (555) T ss_pred HhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEec Confidence 32111 112333445567999999887777666655 2333344555555545555443333322111 1123556554 Q ss_pred CCccCCHH-HH---HHHHHHHHh--cCC-------CCHHHH---HHhcCCC------CHHHHHHHHHH-HHHhhhc---- Q lcl|NC_012753. 434 DGVFTDRN-AE---FDYWSKMVA--AGF-------APKTMA---IEKTLNV------TKEQAQEIYQK-INDETMV---- 486 (502) Q Consensus 434 d~i~~d~~-~~---~~~~~~~~~--~Gi-------~S~et~---l~~~~~~------~deea~~el~r-i~~E~~~---- 486 (502) -.+-.... .. +....+.+. +++ +....+ +....|+ +++|+++..+. .++++.+ T Consensus 442 s~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~ 521 (555) T protein:vir:98 442 SMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAA 521 (555) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Confidence 33221100 00 111111110 222 122222 3334454 34444332111 1111100 Q ss_pred c----cCCCCCccccCCCCC Q lcl|NC_012753. 487 S----TDSFRTSEEVDIYGE 502 (502) Q Consensus 487 ~----~~~~~~~~~~~~~g~ 502 (502) . ......-++++.-++ T Consensus 522 ~~~q~~~~~~~~~~~~~~~~ 541 (555) T protein:vir:98 522 LLNQGADTAAKLGSVDTSKQ 541 (555) T ss_pred HHHHHHHHHHHhcccccCcc Confidence 0 000111122222222 No 137 >protein:vir:107822 Length: 555 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:481 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996631;genbank:gi:45580765;genbank:GeneID:2767898 Probab=98.38 E-value=9.6e-07 Score=53.57 Aligned_cols=463 Identities=12% Similarity=0.045 Sum_probs=191.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCc---cccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDS---VTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~---~~~~~~~~~~~~~~~~~~n~~k 77 (502) |-=-...+.+ ..++. .+..++...-..|+.+|.=-.+- +.............++--+.+. T Consensus 1 M~~~~~~~~l----------------~~r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~ 63 (555) T protein:vir:10 1 MAEQTERKLL----------------LSRWG-QLRTERESWMSHWKEISDYLLPRAGRFFVQDRNRGEKRHNNILDNTGT 63 (555) T ss_pred CCCcccHHHH----------------HHHHH-HHHHHhhHHHHHHHHHHHHhCcccccccCCCCCcchhcccccccccHH Confidence 2222211222 11111 22233333344444443221111 1111111122223344456677 Q ss_pred HHHHHHhhhhhcCcc-------eEeeCCH------HHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE Q lcl|NC_012753. 78 TASKKVASLVFNEQA-------TIRVDNE------VADA-------FINETLKNDKFSKNFERYLESCLALGGLAMRPYI 137 (502) Q Consensus 78 ~iv~~~a~~l~~ep~-------~i~~~d~------~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~ 137 (502) ..++.+|+.|.+-.. .+.+.+. .+.+ .+.+.|..++|...+.++..+..+.|.+.+.+-. T Consensus 64 ~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~e~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Lv~~G~a~l~~~~ 143 (555) T protein:vir:10 64 RALRVLAAGMMAGMTSPARPWFRLTTSIPELDESAAVKAWLANVTRLMLMIFAKSNTYRALHSMYEELGAFGTASSIVLP 143 (555) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceEEEEec Confidence 888888877765221 1333321 2333 3345677889999999999999999999987666 Q ss_pred eC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEE----------EEEEEEEeCCeEEEEEEEEecCC Q lcl|NC_012753. 138 DG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYS----------LIEFHEWNKETYTISNELYESES 206 (502) Q Consensus 138 d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt----------~~E~h~~~~~~~~I~~~l~~~~~ 206 (502) |+ +.+++..++..+++- ..|..+....+| +++...-.+-...|- .++.- ..+....|.|.+|--.+ T Consensus 144 d~~~~~rf~~~pl~~~~v-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~~~~~~~~~-~~~~~v~v~~~V~pr~~ 220 (555) T protein:vir:10 144 DFDAVVYHHSLTAGEYAI-AADNQGRVNTLY-REFQITVAQMVREFGKDKCSTTVQSLFDRG-ALEQWVTVIHAIEPRAD 220 (555) T ss_pred CCCceEEEEEeecceeEE-eeCCCCCEEEEE-EEEeccHHHHHHhcCcccCCHHHHHHHhcC-CCCceEEEEEEEeeccC Confidence 64 457888899999876 455555444443 332111000000000 00000 00011233344432111 Q ss_pred c--ccc-Cceeeccccc-c-CCCcc--eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 207 K--TII-GQRVPLSTLY-E-DLEET--VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE 279 (502) Q Consensus 207 ~--~~l-G~~v~l~~~~-~-~l~~~--~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~ 279 (502) . ... +.-.|..++| + +.... ....|+..-||++++-+ ...++.||+|--..+.+-+..|+..--..... T Consensus 221 ~~~~~~~~~~~p~~s~~~~~~~d~~~vl~esgy~e~P~i~~Rw~----~~~ge~YGrgp~~~~lgD~k~L~~l~~~~l~~ 296 (555) T protein:vir:10 221 RDPSKRDDRNMAWKSVYFEPGADETRTLRESGYRSFRALCPRWA----LVGGDIYGNSPAMEALGDVRQLQHEQLRKAQA 296 (555) T ss_pred cCcCCCCccccceEEEEEEeccCCccccccCCcccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHHHHHHHHH Confidence 1 111 1112333221 1 11111 12234444566655532 23467899999999999999999755555444 Q ss_pred Hhh-ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCcccccee-eeccccchHHHHHHHHHHHHHHHH Q lcl|NC_012753. 280 VKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGIT-DLTTDIRSDDYIKAINKGLSLFEM 357 (502) Q Consensus 280 ~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~ir~e~~~~~l~~~l~~i~~ 357 (502) .+. .+..+.||.+... ......|+.. ..+ . .+..+..+. .+++......-.+.++.+.+.|.. T Consensus 297 ~~~~~~pp~~v~~~~~~-----~~~~~~pgg~-----~~v---~--~g~~~d~~~~~~~~~~d~~~~~~~i~~~~~rI~~ 361 (555) T protein:vir:10 297 IDYKSNPPLQLPVSAKN-----QDISTVPGGL-----SYV---D--AAAPNGGIRTAFEVNLDLSHLLADIVDVRERIKA 361 (555) T ss_pred HHHHhcCceeecccccc-----ccceeccccc-----ccc---c--cCCCCcceecccccccchHHHHHHHHHHHHHHHH Confidence 433 3334445444311 1111111111 111 1 111111111 122222223333445555555543 Q ss_pred hcCCC-hhhccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCcc--cccceEEEeC Q lcl|NC_012753. 358 QLGVS-TGMFSFDGKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYNLYTGEIP--TMDEVSVDLD 433 (502) Q Consensus 358 ~~g~s-~~~~~~~~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~--~~~~i~v~f~ 433 (502) ..=.+ ...++..+....|||||....+...+..+- ..+.-...|.-|+.-++.++.-.+..+.... ...+++|++- T Consensus 362 af~~dlf~~l~~~~~~~~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v~yi 441 (555) T protein:vir:10 362 SFYADLFLMLANGTNPQMTATEVAERHEEKLLMLGPVLERMHNEILDPLIELTFQRMVEANILPPPPQEMQGVDLNVEFV 441 (555) T ss_pred HhhcchhhhccCCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEEEec Confidence 32111 112333445567999999887777666655 2333344555555545555443333322111 1123556554 Q ss_pred CCccCCHH-HH---HHHHHHHHh--cCC-------CCHHHH---HHhcCCC------CHHHHHHHHHH-HHHhhhc---- Q lcl|NC_012753. 434 DGVFTDRN-AE---FDYWSKMVA--AGF-------APKTMA---IEKTLNV------TKEQAQEIYQK-INDETMV---- 486 (502) Q Consensus 434 d~i~~d~~-~~---~~~~~~~~~--~Gi-------~S~et~---l~~~~~~------~deea~~el~r-i~~E~~~---- 486 (502) -.+-.... .. +....+.+. +++ +....+ +....|+ +++|+++..+. .++++.+ T Consensus 442 s~La~aq~~~~~~~i~~~l~~i~~laq~~P~vld~id~d~~~~~~a~~~Gvp~~~irs~eev~~~r~qr~~~~q~~~~a~ 521 (555) T protein:vir:10 442 SMLAQAQRAIATNSVDRFVGNLGAVAGIKPEVLDKFDADRWADTYADMLGIDPELIVPGNQVALIRKQRADQQQAAQQAA 521 (555) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHHHHHH Confidence 33221100 00 111111110 222 122222 3334454 34444332111 1111100 Q ss_pred c----cCCCCCccccCCCCC Q lcl|NC_012753. 487 S----TDSFRTSEEVDIYGE 502 (502) Q Consensus 487 ~----~~~~~~~~~~~~~g~ 502 (502) . ......-++++.-++ T Consensus 522 ~~~q~~~~~~~~~~~~~~~~ 541 (555) T protein:vir:10 522 LLNQGADTAAKLGSVDTSKQ 541 (555) T ss_pred HHHHHHHHHHHhcccccCcc Confidence 0 000111122222222 No 138 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.35 E-value=1.2e-06 Score=53.11 Aligned_cols=430 Identities=12% Similarity=0.097 Sum_probs=173.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHH----HhcC-------CCCccccccCCCc----- Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLR----YFAG-------DFDSVTYRDSNGS----- 64 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~----~Y~g-------~~~~~~~~~~~~~----- 64 (502) |+|++++.-..+-....+.+ + .+.+...+.+..-+...+.+... -|.- -...+..+....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~l~ 77 (547) T protein:vir:63 1 MGLFESIRLAGVNKSDAVKH--I-EVDDNYSIAIQQREQEQISKAMNNKEVAYSQPVIGSMSANPGFKTKPSIRNNQDLH 77 (547) T ss_pred CchhhhhhhhcCCccccccc--c-ccccccchhhhhhhHHHHHHhhcccchhhhchhhheeecccccccCCccCChhHHH Confidence 99999876644311110000 0 00000111111111111111100 0000 0001000000000 Q ss_pred cccccceecchHHHHHHHHhhhhhc--Cc---------ceEeeC---------CHHHHHHHHHHHhhc---------cHH Q lcl|NC_012753. 65 QVKRDFNHLPIGRTASKKVASLVFN--EQ---------ATIRVD---------NEVADAFINETLKND---------KFS 115 (502) Q Consensus 65 ~~~~~~~~~n~~k~iv~~~a~~l~~--ep---------~~i~~~---------d~~~~e~l~~~~~~~---------~f~ 115 (502) ...+.....++...+++..|+-+.+ .+ ..|.+. +....+.|.+++..- .+. T Consensus 78 ~l~~~~~~npiv~~~I~~~a~~ia~~~~~~~~~~~~~~~~ir~k~~~~~~~~~~~~~~~~l~~~l~~pn~~~~p~~~s~~ 157 (547) T protein:vir:63 78 GVLKKFGGNIILNAIINTRSNQVSMYCKPARHSEKGVGFEVRLKDLDKKPTSHDEATIKRIESFIEKTGVDNDINRDSFS 157 (547) T ss_pred HHHHHhhcCHHHHHHHHHHHHHHhhhhhhhhhhccCCCceeEecccccccChhhHHHHHHHHHHHHhhCCCCCCccchHH Confidence 0011122334555555544443321 11 122222 122223455544321 244 Q ss_pred HHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCC Q lcl|NC_012753. 116 KNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKE 193 (502) Q Consensus 116 ~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~ 193 (502) ..+..++.+.+..|.+|+.+..|. |. ..+..++|..+-++..+.+.... ...+|. . ..++ T Consensus 158 ~f~~~lv~d~ll~Gn~~~~i~rd~~G~~~~L~~l~p~~V~~~~~~~g~~~~-------------~~~~y~--~---~~~~ 219 (547) T protein:vir:63 158 SFVKKIVRDTYMYDQVNFEKVFNRNQSMVRFVAKDPTTIFFATTADGKIPD-------------NGNRFV--Q---VIDQ 219 (547) T ss_pred HHHHHHHHHHHhhCCEEEEEEECCCCcEEEEEEecCceeEEEECCcccccc-------------CceEEE--E---EcCC Confidence 556667778888999998888875 44 36778888888776333221100 011110 0 0000 Q ss_pred eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHH Q lcl|NC_012753. 194 TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTY 273 (502) Q Consensus 194 ~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~ 273 (502) .... .++ +.+ .++++.+- ..-....++|+|.+..+...|.....+- T Consensus 220 ~~~~---------------~~~--------~~e----------iih~r~n~-~~~~~~~~~G~Spi~~~~~~i~~~~~a~ 265 (547) T protein:vir:63 220 KIVA---------------TFN--------ARE----------MAFAVRNP-RSDIYATGYGYPELEIALKQFIAHENTE 265 (547) T ss_pred cEEE---------------Eec--------ccc----------EEEecccC-CCCcccccccccHHHHHHHHHHHHHHHH Confidence 0000 000 000 12232210 0111235679999888777776655444 Q ss_pred HHHHHHHhhccceeeechHHhccCCCCCCcccCcc-c-ccccc-chhhcccc------CCCCccccceeeeccccchHHH Q lcl|NC_012753. 274 DEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK-R-EFETG-HNVYEQFD------SGDMDKGIGITDLTTDIRSDDY 344 (502) Q Consensus 274 S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~-~-~~~~~-~~~~~~~~------~~~~~~~~~i~~~~~~ir~e~~ 344 (502) .-..+-|..+...-.+ |....+.. +... . .+... ...|.... .-. +.+.-++.++.....-++ T Consensus 266 ~~~~~~f~Ng~~p~gi----L~~~~~~~---ls~e~~~~lk~~~~~~~~G~~nagk~~vl~-~~g~~~~~l~~~~~d~qf 337 (547) T protein:vir:63 266 AFNDRFFSHGGTTRGI----LQIKAAQQ---QSQHALEIFKREWKNSLSGINGSWQIPVVS-AEDVKFVNMTPSARDMEF 337 (547) T ss_pred HHHHHHHHcCCCcceE----EEecCCCC---CCHHHHHHHHHHHHHHhcCccccccccccc-CCCceEEEcCCChhHHHH Confidence 3333445655432111 21111100 0000 0 00000 00111000 000 112235556666677788 Q ss_pred HHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCcc Q lcl|NC_012753. 345 IKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIP 423 (502) Q Consensus 345 ~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~ 423 (502) .+..+...++|+...|++|..+|+...+..++....+. ....+.... ..++..|..++..|-...+.. +.. .. T Consensus 338 le~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~---t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~-L~~-~~- 411 (547) T protein:vir:63 338 EKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSL---NEGNSAEKNQASKNKGLQPLLGFIEDFINKH-IVA-EF- 411 (547) T ss_pred HHHHHHHHHHHHHHhCCCHHHcCccccccccccccccc---chhhHHHHHHHHHHHHHHHHHHHHHHHHHhh-ccc-cc- Confidence 89888999999999999999999755432222211111 111122211 223455555555444333221 211 11 Q ss_pred cccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH--HHHHHH------------------------- Q lcl|NC_012753. 424 TMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK--EQAQEI------------------------- 476 (502) Q Consensus 424 ~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d--eea~~e------------------------- 476 (502) ...+.+.|+.....+..+ .....+++.+|+|+.-++++.. |+.. +..+.- T Consensus 412 -~~~~~~~f~~~~~~~~~~-~~~~~~~~~~g~lT~NE~R~~~-gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~ 488 (547) T protein:vir:63 412 -GDKYTFQFVGGDIKSELE-SVKILAEKAKVAMTVNEVRKEL-NLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQS 488 (547) T ss_pred -CCceEEEeeccccccHHH-HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCceeecccccccccccccccCCccccchh Confidence 134677887766666554 3445567788999999876553 4321 100000 Q ss_pred -HHHHHHhhh--cccCCCCCccccCCCCC Q lcl|NC_012753. 477 -YQKINDETM--VSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 -l~ri~~E~~--~~~~~~~~~~~~~~~g~ 502 (502) ++.+.+..+ ...+..+.+.+.+-.|+ T Consensus 489 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 517 (547) T protein:vir:63 489 NLQMLQEQTGNRVSTDVEDIPDGKDTTGD 517 (547) T ss_pred hccccccccCCCCCCCCCCCCCCcccCCC Confidence 000000000 01112222233333443 No 139 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.34 E-value=1.2e-06 Score=53.09 Aligned_cols=408 Identities=11% Similarity=0.087 Sum_probs=161.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |++++++. ++. ....+...... .+..+... -+..|.. ...+.... +.-+.+.---.. T Consensus 1 Mg~~~~l~---~~~----~~~~~~~~~~~--------~~~~~~~~-~~~~~~~------~~~g~~v~~~~al~~~~v~~~ 58 (457) T protein:vir:62 1 MGFWSALF---GRG----HSPALDAAEGR--------AWEPYDPS-IYNLGAT------ASSGERVTPHDALQVSAVFAS 58 (457) T ss_pred Cchhhhhh---ccc----ccccccccccc--------ccccchhh-hhhcccc------ccCCceechHHhhccHHHHHH Confidence 99999753 221 00000000000 00001001 0111110 00111111 011111111233 Q ss_pred HHHHhhhhhcCcceEeeC-CH---HHH-HHHHHHHh-h---ccHHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEcC Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVD-NE---VAD-AFINETLK-N---DKFSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQA 149 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~-d~---~~~-e~l~~~~~-~---~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~~ 149 (502) |+.+|+-+-+=|+.+--. +. ... ..+..++. . -....-+...+...+..|.+|+.+-.++|++ .+..++| T Consensus 59 i~~ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~ll~~pn~~~t~~~f~~~~~~~l~l~Gna~~~i~~~~g~~~~l~~l~p 138 (457) T protein:vir:62 59 VRLLSETIATLPLSTYSKRGGTRKEIDTPEWLDFPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWAGPNIAGLDVLDP 138 (457) T ss_pred HHHHHHhHhhCceEEEEecCCccccccchHHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcC Confidence 455555444445554221 11 111 11222222 1 1244456666777888899998887666664 4555666 Q ss_pred CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceee Q lcl|NC_012753. 150 TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTL 229 (502) Q Consensus 150 ~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~ 229 (502) .++-+.....+....-. ...+..... |....+..+. +. T Consensus 139 ~~v~v~~~~~~~~~~~~------------------~~~y~~~~~-----------------g~~~~~~~~~---~~---- 176 (457) T protein:vir:62 139 TKIHVHMVMVDGLRRKV------------------FEAYDIDAD-----------------GNEVLLGWFT---PR---- 176 (457) T ss_pred cceEEEEeccCCcccee------------------EEEEEEccC-----------------CceeEEEeeC---cc---- Confidence 66544322111110000 001111000 0000000000 00 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR 309 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 309 (502) -.++|+.+.. .+...|+|.+..+...|.....+-....+-|..+...-.| |.....-+.... . T Consensus 177 ------eiih~r~~~~----~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~ls~e~~---~ 239 (457) T protein:vir:62 177 ------DVLHIPGMML----PGDFVGCSPISYARESIGLALAAQKYGAHFFRNGAMPGAV----VEVPGTMSEEGL---A 239 (457) T ss_pred ------ceEEecCCCC----CCceecccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE----EEcCCCCCHHHH---H Confidence 1234443321 1235688988877777665554444333445543332111 222111110000 0 Q ss_pred ccccc-chhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 310 EFETG-HNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 310 ~~~~~-~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) .+... ...+...... --+.+.-++.++.....-++.+..+....+|+...|++|..+|+...+..++..+.-... T Consensus 240 ~~~~~~~~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~~ 319 (457) T protein:vir:62 240 RAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQNI 319 (457) T ss_pred HHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHHH Confidence 00000 0111100000 001122355555555566788888888999999999999999876655432322221111 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 385 DTY-QMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 385 ~l~-~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) ..+ .++.-....|+..|... +..........+.++++.-+-.|..+.++...+++.+|+|+.-++++ T Consensus 320 ~f~~~~l~P~~~~ie~~ln~~------------L~~~~~~~~~~i~fd~~~l~~~d~~~r~~~~~~~~~~G~~T~NE~R~ 387 (457) T protein:vir:62 320 AFTMFSLRPWLERIEAGFNRL------------LFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVRA 387 (457) T ss_pred HHHHHHHHHHHHHHHHHHHhh------------hcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 111 12222222222222221 11111111223555555666678899999999999999999999766 Q ss_pred hc--CCCCHHHHHHHH-----HHHH---Hhhhcc----------cCCCCCccccCCCCC Q lcl|NC_012753. 464 KT--LNVTKEQAQEIY-----QKIN---DETMVS----------TDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~--~~~~deea~~el-----~ri~---~E~~~~----------~~~~~~~~~~~~~g~ 502 (502) .. +++.+..+++.+ ..+. +.+... .+..+ ....+--|. T Consensus 388 ~~gl~pi~~g~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 445 (457) T protein:vir:62 388 AEDMTPLPDGLGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADD-EEPDNAEGD 445 (457) T ss_pred HhCCCCCCCCCcceeeeccccccccccccccccCCCccCCCCccCCCCC-CCCCCCCCC Confidence 53 234331111111 1111 000000 00000 000111111 No 140 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.31 E-value=1.4e-06 Score=52.65 Aligned_cols=378 Identities=9% Similarity=0.027 Sum_probs=164.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |++|++++.-..+ ...++.. ........+.+. . ..+.... .+-+..+---.. T Consensus 1 M~~f~~~~~~~~~------------------~~~~~~~--~~~~~~~~~~~~--~-----~~~~~v~~~~al~~~~v~~~ 53 (386) T protein:vir:49 1 MPIFNITNLATES------------------PPINQES--FFDIADSDFLAS--L-----NSSEWVSAENALKNSDLFSI 53 (386) T ss_pred CchhhhhccCCCC------------------cccchhh--hhhhhhcccccc--c-----cCCceechhhhhccHHHHHH Confidence 9999776542110 0011000 000010111110 0 0011110 011111222245 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEEcCCeEEEEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSFVQATVFFPLQA 157 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~v~~~~~~Pi~~ 157 (502) ++..|+-+.+=|+ .+.++.....+.+-........-...++...+..|.+|+.+..+.. . +.+.+++|+++-+... T Consensus 54 i~~ia~~ia~~p~--~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~i~~~~v~v~~~ 131 (386) T protein:vir:49 54 ISQLSNDLATAKI--TTSRKQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNDNGRDMKWEYLRPSQVSFNRL 131 (386) T ss_pred HHHHHHHhhhCce--eeccchhhhhhhccCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEecCceeEEEEc Confidence 5666666655454 4444444444333222223344555667777888999988877653 4 4677788877655433 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) +.+... ++++. ..+. ..|..+.+ + +. -. T Consensus 132 ~~~~~~----~y~~~-----------------~~~~---------------~~~~~~~~---~---~~----------ev 159 (386) T protein:vir:49 132 DNQNGL----YYNIT-----------------FDDP---------------HIAPKQHV---P---QN----------DI 159 (386) T ss_pred CCCceE----EEEEE-----------------EcCc---------------cccceeEE---c---cc----------cE Confidence 222211 00000 0000 00100000 0 00 12 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-ccccccc-cc Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFET-GH 315 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~~-~~ 315 (502) ++|+.+.+ .+..+|+|.+..+...++....+..-..+-|..+...-.+ |+........... ....+.. .. T Consensus 160 ih~~~~~~----~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i----l~~~~~~~~~~~~~~~~~~~~~~~ 231 (386) T protein:vir:49 160 LHFRLLSV----DGGLTSVSPLMALGREFNIQKASDKLTISALKNALNANGI----LKIKGGGLLDFKTKVSRSRQAMKQ 231 (386) T ss_pred EEecCCCC----CCccccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEE----EEeCCCCChHHHHHHHHHHHHhcc Confidence 34443211 2335799998888887765554433333344543332222 2221111110000 0000000 00 Q ss_pred hhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 316 NVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIAT 395 (502) Q Consensus 316 ~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~ 395 (502) +....+-. +.+.-++.++.....-++.+..+....+|+...|+||..+|.+..+..++..+...+..+. .-..+ T Consensus 232 n~g~~~vl---~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~~~~~~~~i---~~~l~ 305 (386) T protein:vir:49 232 MQGGPLVL---DDLEDFTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDGDQQSSLEMIYNIYFKSV---SRYLR 305 (386) T ss_pred CCCCceec---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccchHHHHHHHHHHHH---HHHHH Confidence 00000111 1222356666666666788888889999999999999999876554444544433222211 11111 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHH Q lcl|NC_012753. 396 LVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQA 473 (502) Q Consensus 396 ~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea 473 (502) .+...|++. +. ..+.++....+-.|..+.+....+++.+|++++-++++.+ .|+..+++ T Consensus 306 ~i~~~~~~~-------------l~------~~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~~~~~~~~ 366 (386) T protein:vir:49 306 PFVSEMSKK-------------LS------CEVDVDISPAVDPTGSNYISLINSMVKSGTLAQNQGLYILQQAEILPKEL 366 (386) T ss_pred HHHHHHHHH-------------hc------chhcccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHhhCCCCCCcC Confidence 111111110 00 1233333444555667788888899999999998877643 24433322 Q ss_pred HHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 474 QEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 474 ~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+.. .+......++|-.++ T Consensus 367 ----~~~~------~~~~~~~~gGd~~~~ 385 (386) T protein:vir:49 367 ----PDGK------NPNRTSLKGGEINEQ 385 (386) T ss_pred ----cchh------ccCCCCCCCCCCCCC Confidence 1111 111112223333333 No 141 >protein:vir:94572 Length: 535 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919010;genbank:gi:119637774;genbank:GeneID:5179332 Probab=98.30 E-value=1.5e-06 Score=52.52 Aligned_cols=445 Identities=10% Similarity=0.062 Sum_probs=181.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |..=.....+-+... ++-.+ .+..++-.....|+.+|.=-.+-+......+......++--+.+...+ T Consensus 1 ~~~~~~~~~~~~~~~--------~~r~~----~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 68 (535) T protein:vir:94 1 MASSQKREGFAENGA--------KAVYD----ALKNDRNSYETRAENCAKYTIPSLFPKDSDNASTDYTTPWQAVGARGL 68 (535) T ss_pred CCchhhhhhHHHHHH--------HHHHH----HHHHHhhHHHHHHHHHHHHhccccCCCCCCccccccCCcccccHHHHH Confidence 544333222211110 00000 122233333444444432221211111112222222334445666777 Q ss_pred HHHhhhhhcC--cc--eEe--eCCH-------------HHHHHHHH-------HHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 81 KKVASLVFNE--QA--TIR--VDNE-------------VADAFINE-------TLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 81 ~~~a~~l~~e--p~--~i~--~~d~-------------~~~e~l~~-------~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) +.+|+.|.+- |+ =|. +.+. .+.++|.+ .+..++|...+.++..+..+.|.+.+. T Consensus 69 ~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~ 148 (535) T protein:vir:94 69 NNLASKLMLALFPMQTWMKLTISEFEAKQLVAQPAELAKVEEGLSMVERILMNYIESNSYRVTLFETLKQLVVAGNALLY 148 (535) T ss_pred HHHHHHHHhhhcCCCCccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEe Confidence 7777666541 22 133 2221 23344433 466789999999999999999998876 Q ss_pred EEEeCC-ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe---------------eC---CCceEEEEEEEEEEeCCeE Q lcl|NC_012753. 135 PYIDGD-QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKT---------------EG---QKVKYYSLIEFHEWNKETY 195 (502) Q Consensus 135 ~~~d~~-~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~---------------~~---~~~~~yt~~E~h~~~~~~~ 195 (502) +-.+++ ..++..+|-.+++- ..|..+....+|....... +. ....+|+++... .++..| T Consensus 149 ~~~~~~~~~~f~~~pl~~y~v-~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~~~~~~v~v~~~v~~~-~~~~~~ 226 (535) T protein:vir:94 149 IPEPEGTYNPMKLYRLSSYVV-QRDAFGTVLQIVTLDKTAYAALPEDVRNSMDSSQEHKGDEMIDVYTHIYLD-EESGEY 226 (535) T ss_pred eccCcCcccceEEEEcCeEEE-eeCCCCCeEEEEeeeeccHHHhhHHHHHHHHhccccCCCceeEEEEEEEee-CCCCcE Confidence 655544 35677788777554 4565554444443221110 00 111233333221 112233 Q ss_pred EEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHH Q lcl|NC_012753. 196 TISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDE 275 (502) Q Consensus 196 ~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~ 275 (502) ...|.+ . |..++.. ....|+..-||++++-+ ...++.||+|--..+.+-+..|+..--. T Consensus 227 ~~~~e~----~----g~~~~~~---------~~~~g~~~~P~~~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l~~~ 285 (535) T protein:vir:94 227 LKYEEI----D----GVEVEGT---------DASYPVDACPYIPVRMV----RIDGESYGRSYCEEYLGDLRSLENLQEA 285 (535) T ss_pred EEEEEe----c----Ceeeccc---------cccCccccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHHHHH Confidence 222211 0 2222110 11123444555555432 2346789999999999999999976544 Q ss_pred HHHHH-hhccceeeech-HHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHH Q lcl|NC_012753. 276 FMWEV-KMGQRRVAVPT-QMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLS 353 (502) Q Consensus 276 ~~~~~-~~~~~~i~v~~-~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 353 (502) ...-. ...+....|+. +.++.. .+.+ .....+.. ...++ .++..++...+...-...++.+.+ T Consensus 286 ~l~~~~~a~~~~~lv~p~g~~~~~------~~~~-----~~~g~~v~---g~~~~-v~~~~~~~~~~~~~~~~~i~~~~~ 350 (535) T protein:vir:94 286 IVKMSMISAKVIGLVNPAGITQVR------RLTK-----AQTGDFVS---GRPED-ISFLQLEKAADFSVARAVSEQIEG 350 (535) T ss_pred HHHHHHHhccCCcccccccccchh------hccc-----CCCceeec---CCccc-ceeeecccccchhHHHHHHHHHHH Confidence 44332 22333333422 222111 0000 00000010 01111 111122222223333344444444 Q ss_pred HHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEe Q lcl|NC_012753. 354 LFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL-VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDL 432 (502) Q Consensus 354 ~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~-~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f 432 (502) .|....=+. .+...++...|||||....+...+..+-.-.. =...|..|+.-++.++.-.++.+...... +.+++ T Consensus 351 rI~~af~~~--~~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~ElL~Pli~r~~~il~r~g~lP~~p~~~--v~~~~ 426 (535) T protein:vir:94 351 RLSYAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQLPMVRVLLKQLQATNQIPELPKEA--VEPTI 426 (535) T ss_pred HHHHHHhHh--hhccCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhCCCCCCCChhh--ccceE Confidence 443322111 12223344469999998877776665542222 23334444444454443333333222222 33333 Q ss_pred CCCccCCHH---HHHHHHHHHHh--cCC--------CCHHHHHH---hcCCC-------CHHHHHHHHHHHHHhhhcc-- Q lcl|NC_012753. 433 DDGVFTDRN---AEFDYWSKMVA--AGF--------APKTMAIE---KTLNV-------TKEQAQEIYQKINDETMVS-- 487 (502) Q Consensus 433 ~d~i~~d~~---~~~~~~~~~~~--~Gi--------~S~et~l~---~~~~~-------~deea~~el~ri~~E~~~~-- 487 (502) --+ .... ..++...+..+ +++ +....++. ...|+ +++|++++.++.++.++.. T Consensus 427 vs~--la~l~r~~~~~~l~~~~~~laq~~P~~ld~~id~d~~~~~~a~~~Gvp~~~i~rs~eev~~~~~q~~~~~~~~~~ 504 (535) T protein:vir:94 427 STG--MEALGRGQDLDKLERCIAAWSALAPMQGDPDINIATIKLRIANAIGIDTSGILKTPEEKQQEMAEAAQGTAMQNA 504 (535) T ss_pred eeh--HHHHHHHHHHHHHHHHHHHHHhhChHHhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 211 1111 11111111111 111 22222222 23333 4566666554433322211 Q ss_pred ---cCCCCCccccCCCCC Q lcl|NC_012753. 488 ---TDSFRTSEEVDIYGE 502 (502) Q Consensus 488 ---~~~~~~~~~~~~~g~ 502 (502) .+... .+.+....+ T Consensus 505 ~~~~g~~~-~~~~~~~~~ 521 (535) T protein:vir:94 505 AASAGAGA-GTMATASPE 521 (535) T ss_pred HHHHHHhh-hcccccChH Confidence 11110 111222222 No 142 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.30 E-value=1.5e-06 Score=52.49 Aligned_cols=392 Identities=10% Similarity=0.040 Sum_probs=167.8 Q ss_pred cchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEee-CC- Q lcl|NC_012753. 21 QSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRV-DN- 98 (502) Q Consensus 21 ~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~-~d- 98 (502) |-|.++..+......... .-......+|.|.... ...... ..+-+..+--...|+..|+-+.+=|+.+-- .+ T Consensus 1 m~~~~~f~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~v~----~~~al~~~~v~~~i~~Ia~~ia~l~~~~~~~~~~ 74 (416) T protein:vir:12 1 MLLERMFEKRSGSSDHED-GFNNILLNMFGGRKTA-SGERVS----ESNSLVQPDIFACVNVLSDDIAKLPIHTYKRTDG 74 (416) T ss_pred CccchhcccccCccccCc-cchhHHHHhhcCcccc-cCceec----hhhhhccHHHHHHHHHHHHhhhhCceEEEEecCC Confidence 333333322211100000 0111234455443211 110000 011122222234566666666555654321 11 Q ss_pred -------HHHHHHHH-HHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEcCCCeEEEEEE Q lcl|NC_012753. 99 -------EVADAFIN-ETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQANTQDVSSAAIV 168 (502) Q Consensus 99 -------~~~~e~l~-~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~ 168 (502) ......|. +-...-....-...++...+..|.+|+.+..+. |.+ .+..++|+.+-++..+.++.. T Consensus 75 ~~~~~~~~~l~~~l~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~v~v~~~~~~~~~----- 149 (416) T protein:vir:12 75 GIERKPEHKSAHAVYARPNPYMTAFTWKKLMMTHVLTWGNAYSYIQFGSHGYPEALFPLRPDYTNAYVHPTTGML----- 149 (416) T ss_pred ccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEeCCCcEE----- Confidence 11122221 111112333455566777888999998887765 443 566677776654422222110 Q ss_pred EEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccc Q lcl|NC_012753. 169 TKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNK 248 (502) Q Consensus 169 ~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~ 248 (502) +|. +..+ |..+.+ ++ . .+++++.+. T Consensus 150 ------------~~~----~~~~-------------------g~~~~~---~~---~----------eiih~~~~~---- 174 (416) T protein:vir:12 150 ------------WYQ----TVLN-------------------GKAIEL---YD---Y----------EVLHFKGLS---- 174 (416) T ss_pred ------------EEE----EecC-------------------CeEEEe---cC---c----------cEEEecCcC---- Confidence 110 0001 111100 00 0 123333221 Q ss_pred cccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc-eeeechHHhccCCCCCCcccCccccccccch-hhccccCCCC Q lcl|NC_012753. 249 DINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQMIKTEYDTNGEKVTVKREFETGHN-VYEQFDSGDM 326 (502) Q Consensus 249 ~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~-~~~~~~~~~~ 326 (502) .+.+.|+|.+..+...++....+.....+-|+.+.. ..+ |......+.... ..+..... ...+....-- T Consensus 175 -~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i-----l~~~~~~~~e~~---~~~~~~~~~~~~~~~~~vl 245 (416) T protein:vir:12 175 -TDGIHGKSPIGVVREHIGAQAAATKYNAKLYKNEATPRGI-----LKVPAFLDEKPK---ENVRKEWKRVNKVENIAII 245 (416) T ss_pred -CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceE-----EecCCCCCHHHH---HHHHHHHHHHhcCCCeeec Confidence 124679999888888777655444444444565433 222 221111110000 00000000 0000000001 Q ss_pred ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 327 DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELV 405 (502) Q Consensus 327 ~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~ 405 (502) +.+..++.++.....-++.+..+...++|+...|+||..++....+. +++.+.... .++.+|..++ T Consensus 246 ~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~-------------f~~~~l~P~~ 312 (416) T protein:vir:12 246 DYGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIE-------------YVRNTLQPWI 312 (416) T ss_pred CCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHH-------------HHHHHHHHHH Confidence 12223556665566678888888889999999999999998655432 233333211 1233333333 Q ss_pred HHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH-HHHHHHHHH----- Q lcl|NC_012753. 406 ISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK-EQAQEIYQK----- 479 (502) Q Consensus 406 ~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d-eea~~el~r----- 479 (502) ..|-...+..-+..........+.+++++-+..|..+.++...+++.+|+++.-++++.+ |+.. +..++-+.. T Consensus 313 ~~ie~~l~~~l~~~~~~~~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-gl~Pi~ggd~~~~~~n~~~ 391 (416) T protein:vir:12 313 VNFEQELNVKLFLDHDQKSGHYVKFNIDSELRGDSKTQAEYLKTLHETGVLNKDEIRELL-ERNPIENGDKYISSLNYVF 391 (416) T ss_pred HHHHHHHHHhhcCchhhcCCceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcceeeecccccc Confidence 333222111001111112223466666777788999999999999999999999976653 4322 111111110 Q ss_pred HH--HhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 480 IN--DETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 480 i~--~E~~~~~~~~~~~~~~~~~g~ 502 (502) +. .++. ....-....++|=.+| T Consensus 392 ~~~~~~~~-~~~~~~~~~gge~~~~ 415 (416) T protein:vir:12 392 LDFLEEYQ-RLKAGGAMKGGDNKNE 415 (416) T ss_pred ccccchhh-ccccccccCCCCCcCC Confidence 00 0000 0000001222222222 No 143 >protein:vir:99672 Length: 532 # NCBI annotation: Head-to-tail joining protein # Family: family:all:481 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249587;genbank:gi:68299738;genbank:GeneID:3799987 Probab=98.22 E-value=2.5e-06 Score=51.31 Aligned_cols=443 Identities=9% Similarity=0.019 Sum_probs=172.6 Q ss_pred CChhHH---HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQT---IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~---ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k 77 (502) |-=..+ -++-+++.+. .+..++-.....|+.++.=-.+-+.............++--+.+. T Consensus 1 m~~~~~~~~~~~~~~~r~~----------------~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~ 64 (532) T protein:vir:99 1 MAEVEKTGFAADGAAAAYN----------------RLKNDRGAYETRAEDCATYTIPSVFPSATADGSTSYTTPWQSIGA 64 (532) T ss_pred CcchhhccccHHHHHHHHH----------------HHHHHhhHHHHHHHHHHHHhhhcccCCCCCcchhhccccccchHH Confidence 111000 0111111000 111122222333333322111111111111111122233345567 Q ss_pred HHHHHHhhhhhcC--cc-----eEeeCCHH-------------HHHHH-------HHHHhhccHHHHHHHHHHHHhhcCC Q lcl|NC_012753. 78 TASKKVASLVFNE--QA-----TIRVDNEV-------------ADAFI-------NETLKNDKFSKNFERYLESCLALGG 130 (502) Q Consensus 78 ~iv~~~a~~l~~e--p~-----~i~~~d~~-------------~~e~l-------~~~~~~~~f~~~~~~~~~~~~~~G~ 130 (502) ..++.+|+.|.+- || ++.+.+.. +.++| .+.+..++|...+.++..+..+.|. T Consensus 65 ~a~~~LAa~L~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~ 144 (532) T protein:vir:99 65 RGLNNLASKLMLALFPVGSSFFKLNVSELEVKQSITSPEELTEIATGLAMVERICMNYMESNSFRPTLHAAIKQLLVAGN 144 (532) T ss_pred HHHHHHHHHHHHhhcCCCCccccccCCHHHHhccCCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCc Confidence 7777777777652 22 12333321 23333 3456678999999999999999999 Q ss_pred EEEEEEEeC----CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeC-------------------CCceEEEEEEE Q lcl|NC_012753. 131 LAMRPYIDG----DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEG-------------------QKVKYYSLIEF 187 (502) Q Consensus 131 ~~~~~~~d~----~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~-------------------~~~~~yt~~E~ 187 (502) +.+.+-.++ ....+..+|-.+++- ..|..+....+|.......+. .....|+.+++ T Consensus 145 a~l~~~~~~~~~~~~~~f~~~pl~~y~v-~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~p~~~v~v~~~v~~ 223 (532) T protein:vir:99 145 VLLYIPSTEQVEGQSNAPKLYKLHNFVV-ERDAYDNVLQIVTEDKIARAALPEDVRKSLEDAQGDQNPSEEVTIYTHVYR 223 (532) T ss_pred EeEEecccccccCcccceEEEEcCeEEE-eeCCCCCeeeEeeeeeecHHhcChHHHHHhhccccccCCCcceEEEEEEEe Confidence 998776543 245677888877554 556555444444332211100 01112222221 Q ss_pred EEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHH Q lcl|NC_012753. 188 HEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMD 267 (502) Q Consensus 188 h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid 267 (502) .. ++..|...|.+ . |..+.+.. .. .++..-||++++- +...++.||+|--..+.+-+. T Consensus 224 ~~-~~~~~~~~~~~----~----g~~~~~~~------~~---~~~~e~P~~~~Rw----~~~~ge~YGrgp~~~~l~D~k 281 (532) T protein:vir:99 224 DP-EAMVFRSYQEI----D----GEIVAGTE------GE---YPLDSCPWIPVRL----IKMPNEDYGRSFVEEYLGDLK 281 (532) T ss_pred cC-CCCeeEEEEee----c----Cceecccc------cc---cccccCCceeeee----eecCCCccccchHHHHHHHHH Confidence 10 11112111111 0 11111110 00 1122335555443 223477899999999999999 Q ss_pred HHHHHHHHHHHHH-hhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHH Q lcl|NC_012753. 268 FINTTYDEFMWEV-KMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIK 346 (502) Q Consensus 268 ~ld~~~S~~~~~~-~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~ 346 (502) .|+..--....-. ...+....|+.+... +...+.+ .....+.. ...++...+. +...-+...-.. T Consensus 282 ~L~~l~~~~l~~~~~a~~~~~lv~p~g~~-----~~~~~~~-----~~~g~~v~---g~~~~i~~~~-~~~~~~~~~~~~ 347 (532) T protein:vir:99 282 SLENLYEAIVKMSMISSKVLFFVNPNGVT-----QIRRVAK-----ANTGDFVA---GRKQDVEVFQ-LEKYNDFQVAKA 347 (532) T ss_pred HHHHHHHHHHHHHHHHcCCCceecccccc-----chhhhcc-----CCCcceec---CCcccceeee-cccccchhHHHH Confidence 9997655444432 344444445322211 1111000 00000100 0111111111 111111222223 Q ss_pred HHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhcccCCCcccc Q lcl|NC_012753. 347 AINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA-TLVEKSLKELVISILELAKVYNLYTGEIPTM 425 (502) Q Consensus 347 ~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~ 425 (502) .++.+.+.|....=+. .+....+...|||||....+...+..+-.- +.=...|..|+.-++.++.-.++.+...... T Consensus 348 ~i~~~~~rI~~af~~~--~~~~~d~~r~TAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~ 425 (532) T protein:vir:99 348 TADDIEKRLSYAFMLN--SAVQRGGDRVTAEEIRYVAGELEDTLGGVYSLLSQELQLPLVKILLKELQATSKIPNLPKEA 425 (532) T ss_pred HHHHHHHHHHHHHhhh--hcccCCCCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCChhh Confidence 3334333343322111 122223344699999988777766655522 2223333444444444443333333222222 Q ss_pred cceEEEeCCCccCCHHHHHHHHHHHHh-----cCC-------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHh Q lcl|NC_012753. 426 DEVSVDLDDGVFTDRNAEFDYWSKMVA-----AGF-------APKTMAI---EKTLNV-------TKEQAQEIYQKINDE 483 (502) Q Consensus 426 ~~i~v~f~d~i~~d~~~~~~~~~~~~~-----~Gi-------~S~et~l---~~~~~~-------~deea~~el~ri~~E 483 (502) ..+.+.- -++..+.++.+..+.+ +.+ +....++ ....|+ ++||++++.++.+.+ T Consensus 426 ~~~~iv~----~is~Laraq~~~~l~~~~~~laq~~p~~~d~id~d~~~~~~a~~~GV~~~~i~r~~ee~~~~~~q~~~~ 501 (532) T protein:vir:99 426 VEPAIAT----GLEALGRGHDLNKLNVFIDYMIKLAGLQDDDINLLDVKMRLANSLGMDTTGLILTQQDKQAKMAEASTA 501 (532) T ss_pred cccceee----cchHHHHHHHHHHHHHHHHHHHhhcchhhhhCCHHHHHHHHHHHhCCChhhccCCHHHHHHHHHHHHHH Confidence 2222211 1222333222222110 111 2222222 233344 445555554433322 Q ss_pred hhcc------------cCCCCCccccCCCCC Q lcl|NC_012753. 484 TMVS------------TDSFRTSEEVDIYGE 502 (502) Q Consensus 484 ~~~~------------~~~~~~~~~~~~~g~ 502 (502) +++. ...-..+...++.-| T Consensus 502 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:99 502 AGMVTAGQQMGAAGGQAAAAMMQQQAGMPTQ 532 (532) T ss_pred HHHHHHHHHHHHHHHHhcchhHHhhcCCCCC Confidence 2211 011122222233233 No 144 >protein:vir:10447 Length: 536 # NCBI annotation: head-to-tail joining protein # Family: family:all:481 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848294;genbank:gi:30387485;genbank:GeneID:1733984 Probab=98.20 E-value=2.8e-06 Score=51.04 Aligned_cols=441 Identities=11% Similarity=0.086 Sum_probs=171.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |.= ++ .+ +....+++ .+. .+..++-.....|+.++.=-.+-+......+......++--+.+...+ T Consensus 1 m~~-~~-------~~--~~~~~~~~---r~~-~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 66 (536) T protein:vir:10 1 MAE-KR-------TG--LAEDGAKS---VYE-RLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGL 66 (536) T ss_pred Ccc-hh-------hc--hhHHHHHH---HHH-HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH Confidence 221 00 00 00000100 000 122223233344444432211111111111222222334445667777 Q ss_pred HHHhhhhhcC--cce--E--eeCCH-------------HHH-------HHHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 81 KKVASLVFNE--QAT--I--RVDNE-------------VAD-------AFINETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 81 ~~~a~~l~~e--p~~--i--~~~d~-------------~~~-------e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) +.+|+.|.+- |+. | .+.+. .++ +.+.+.+..++|...+.++.++..+.|.+++. T Consensus 67 ~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly 146 (536) T protein:vir:10 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) T ss_pred HHHHHHHHhhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEE Confidence 7777666541 211 2 22221 122 24445577789999999999999999988764 Q ss_pred EEEeCC-ce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe--------------eC------CCceEEEEEEEEEEeC Q lcl|NC_012753. 135 PYIDGD-QI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKT--------------EG------QKVKYYSLIEFHEWNK 192 (502) Q Consensus 135 ~~~d~~-~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~--------------~~------~~~~~yt~~E~h~~~~ 192 (502) +--+++ ++ .+..+|-.+++- ..|..+....+|. ++... .. +....|+.++.. .++ T Consensus 147 ~~e~~~~~~~~~~~~pl~~~~v-~~d~~G~vd~i~r-~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~V~~~-~~~ 223 (536) T protein:vir:10 147 LPEPEGSNYNPMKLYRLSSYVV-QRDAFGNVLQMVT-RDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLD-EAS 223 (536) T ss_pred EeeCCCCceeeEEEEEcCeEEE-eeCCCCCeeEEee-eeeccHHHHHHhhhhhhcccccccCcccceEEEEEEEEe-cCC Confidence 433332 33 467778777664 4555554444443 22110 01 111122222211 112 Q ss_pred CeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHH Q lcl|NC_012753. 193 ETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTT 272 (502) Q Consensus 193 ~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~ 272 (502) +.|...+++ + |..|+.. ....++..-||++++-+ ...++.||+|-...+.+-+..|+.. T Consensus 224 ~~~~~~~e~----~----g~~v~~~---------~g~~~f~~~P~i~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l 282 (536) T protein:vir:10 224 GEYLRYEEV----E----GMEVQGS---------DGTYPKEACPYIPIRMV----RLDGESYGRSYIEEYLGDLRSLENL 282 (536) T ss_pred CcEEEEEee----c----Ccccccc---------ccccccccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHH Confidence 222221111 1 2222111 11112333455554432 2347789999999999999999976 Q ss_pred HHHHHHH-Hhhccceeeech-HHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHH Q lcl|NC_012753. 273 YDEFMWE-VKMGQRRVAVPT-QMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINK 350 (502) Q Consensus 273 ~S~~~~~-~~~~~~~i~v~~-~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~ 350 (502) --....- ....+....|+. .+++.. ... ......+.. ...++ .++..++...+...-.+.++. T Consensus 283 ~~~~l~~~~~a~~~~~lv~p~g~~~~~------~~~-----~~~~g~~v~---g~~~~-v~~~~~~~~~~~~~~~~~i~~ 347 (536) T protein:vir:10 283 QEAIVKMSMISSKVIGLVNPAGITQPR------RLT-----KAQTGDFVT---GRPED-ISFLQLEKQADFTVAKAVSDA 347 (536) T ss_pred HHHHHHHHHHHhcCCcccCcccccchh------hhc-----cCCCcceec---CCccc-ceeeeccccccchHHHHHHHH Confidence 5555442 233443444422 222111 000 000000110 01111 111112221122222233444 Q ss_pred HHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhcccCCCccccc Q lcl|NC_012753. 351 GLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS----IATLVEKSLKELVISILELAKVYNLYTGEIPTMD 426 (502) Q Consensus 351 ~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~----~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~ 426 (502) +.+.|....-+. .+...++...|||||....+.+.+..+- ++.+|- ..|++-++.+..-.+..+...... T Consensus 348 ~~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell---~Pli~r~~~il~r~g~lP~~p~~~- 421 (536) T protein:vir:10 348 IEARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ---LPLVRVLLKQLQATQQIPELPKEA- 421 (536) T ss_pred HHHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHH---HHHHHHHHHHHHhCCCCCCCChhh- Confidence 444443322111 1222334446999999888777775544 444443 334444444443333333222222 Q ss_pred ceEEEeCCCcc-CCHHHHHHHHHHHHh--cCC--------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhhh Q lcl|NC_012753. 427 EVSVDLDDGVF-TDRNAEFDYWSKMVA--AGF--------APKTMAI---EKTLNV-------TKEQAQEIYQKINDETM 485 (502) Q Consensus 427 ~i~v~f~d~i~-~d~~~~~~~~~~~~~--~Gi--------~S~et~l---~~~~~~-------~deea~~el~ri~~E~~ 485 (502) +.+++--++. ......++......+ +++ +....++ ....|+ +++|++++.++-.++++ T Consensus 422 -v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~P~~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~ 500 (536) T protein:vir:10 422 -VEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMG 500 (536) T ss_pred -ccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCCchhhcCCHHHHHHHHHHHHHHHH Confidence 2333322111 111111222211111 121 2222222 223454 44555544432111111 Q ss_pred c----c-cCCCCCccccCCCCC Q lcl|NC_012753. 486 V----S-TDSFRTSEEVDIYGE 502 (502) Q Consensus 486 ~----~-~~~~~~~~~~~~~g~ 502 (502) . . ... ...+.+-..+| T Consensus 501 ~~~~a~~~~~-~~~~~~~~~~~ 521 (536) T protein:vir:10 501 MDNGAAALAQ-GMAAQATASPE 521 (536) T ss_pred HHHHHHHHHH-HHHHHHhcCch Confidence 0 0 000 01111122222 No 145 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.19 E-value=2.9e-06 Score=50.93 Aligned_cols=384 Identities=10% Similarity=0.043 Sum_probs=165.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||++.+.--+ ...+++ ..|..++.+.... .+ ... +.-+...--...| T Consensus 1 M~~f~~~~~~~~------------------~~~~~~------~~~~~~~~~~~~~-~~--v~~----~~al~~~~V~~~v 49 (397) T protein:vir:38 1 MPLLKLNKSHSQ------------------GFSLND------PDWVNFLTGGEAQ-KY--VSA----DTALKNSDIFSLI 49 (397) T ss_pred CcchhhhhcccC------------------cccCCc------hhhhhhhcCCcCC-ce--ech----HHhhccHHHHHHH Confidence 999986542211 001110 0122333321100 00 000 1112122223345 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQAN 158 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~d 158 (502) +..|+-+.+=| +.+++......+.+-...-....-++.++...+..|.+|+.+..|. |. +.+..++|..+-++... T Consensus 50 ~~ia~~ia~~p--~~~~~~~~~~l~~~PN~~~s~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~l~~l~~~~v~i~~~~ 127 (397) T protein:vir:38 50 MQLSGDLAMVR--YTSESDRSQSIISNPSVTANGYSFWQGMFAQLLLDGNCYAYRHKNTNGVDLSWEYLRPSQVQPMLLQ 127 (397) T ss_pred HHHHHHHhhCc--ccccccHHHHHHhcCCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 55555554434 4455555444443332222344455566777788899988887775 34 46777888877654322 Q ss_pred CCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEE Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFT 238 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~ 238 (502) .++.. .|. ++. .....|..+.+ + +. -.+ T Consensus 128 ~~~~~-----------------~y~-------------~~~------~~~~~~~~~~~---~---~~----------eii 155 (397) T protein:vir:38 128 DGSGL-----------------IYN-------------INF------DEPAIGYMENV---P---AA----------DVI 155 (397) T ss_pred CCceE-----------------EEE-------------EEe------ccccccceeEe---c---Cc----------cEE Confidence 22110 010 000 00000111100 0 00 123 Q ss_pred EecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhh Q lcl|NC_012753. 239 YLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVY 318 (502) Q Consensus 239 ~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~ 318 (502) +|+.+.. .+..+|+|.+..+...|.....+..-..+-|..+...-.+ +.......... . ..+....... T Consensus 156 h~~~~~~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i----l~~~~~~~~e~--~-~~~~~~~~~~ 224 (397) T protein:vir:38 156 HIRLLSK----NGGKTGISPLSALINEQQIKDASNELTLKALKQSVTASAV----LTIQKGGLLDA--E-TRIARSKEIS 224 (397) T ss_pred EecCCCC----CCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE----EEeCCCCCHHH--H-HHHHHHHHHH Confidence 4443322 1234699998888877765554444344445544332222 22211111100 0 0000000000 Q ss_pred ccccCCC----CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 319 EQFDSGD----MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 319 ~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~ 394 (502) ....... -+.+.-++.++.....-++.+..+...++|+...|+|+..+|...+...+..+.... T Consensus 225 ~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~e~~~~~------------ 292 (397) T protein:vir:38 225 KQIHNSDGPVVIDALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQSSITQISGQ------------ 292 (397) T ss_pred hcccccCCceecCCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHH------------ Confidence 1100000 012223555665556677888888999999999999999998765443322222221 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCH-H Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTK-E 471 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~d-e 471 (502) +..+|..++..|....+. .+. ...+..+.| .+-.|.++.++...+++.+|+|+.-++++.+ +|+.. + T Consensus 293 --~~~~l~P~~~~ie~~ln~-~l~-----~~~~~~~~~--~~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~~d 362 (397) T protein:vir:38 293 --YAKSLNRYVQAIVGELND-KLH-----ANISANIRF--AIDAMGDQYASTISSSVKGGTIAGNQARFILQNSGYLAKD 362 (397) T ss_pred --HHHHHHHHHHHHHHHHHH-hcc-----Chhcccccc--cccCCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCc Confidence 223333333333221111 111 111222333 3445788889999999999999999976643 23221 1 Q ss_pred HHHHH--HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 472 QAQEI--YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 472 ea~~e--l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ...-+ ............++..+.....-.|+ T Consensus 363 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~ 395 (397) T protein:vir:38 363 LPDPEKEPQQAIQLIQQEGGENDGNNSDERGSD 395 (397) T ss_pred cccccccccccccccccccCCCCCCCCCCCCCC Confidence 11001 00100011111111111111122222 No 146 >protein:vir:8883 Length: 543 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813772;genbank:gi:29366727;genbank:GeneID:1258836 Probab=98.18 E-value=3e-06 Score=50.90 Aligned_cols=445 Identities=9% Similarity=0.020 Sum_probs=172.1 Q ss_pred CChhH---HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQ---TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~---~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k 77 (502) |.=.. .-++-+++.+. .+..++-.....|+.+|.=-.+-+......+......++--+.+. T Consensus 1 ~~~~~~~~~~~~~~~~r~~----------------~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~ 64 (543) T protein:vir:88 1 MAETKREGLAEEGAKAVYE----------------RLKNDRVPYETRAENCAKVTIPSLFPKDSDNSSTDYTTPWQAVGA 64 (543) T ss_pred CcccccCcchHHHHHHHHH----------------HHHHHHhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHH Confidence 11100 01111111000 122233334444444432221211111111122222233335566 Q ss_pred HHHHHHhhhhhcC--cce----EeeCCH-------------HHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCE Q lcl|NC_012753. 78 TASKKVASLVFNE--QAT----IRVDNE-------------VADA-------FINETLKNDKFSKNFERYLESCLALGGL 131 (502) Q Consensus 78 ~iv~~~a~~l~~e--p~~----i~~~d~-------------~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~ 131 (502) ..++.+|+.|.+- |+. +.+.+. .+.+ .+.+.+..++|...+.++..+..+.|.+ T Consensus 65 ~a~~~Laa~l~~~ltP~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a 144 (543) T protein:vir:88 65 RGLNNLSAKVMLALFPLQSWMKLKVSEWQAKQLVSDPSQLAVVEQGLGMVERILMSYMEANSYRVTLFELIRQLALAGTA 144 (543) T ss_pred HHHHHHHHHHHHhhcCCCcccccccChHHHhcccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCce Confidence 6777776666541 222 222321 1222 3344566689999999999999999998 Q ss_pred EEEEEEeCCc-eE---EEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee------------------CCCceEEEEEEEEE Q lcl|NC_012753. 132 AMRPYIDGDQ-IR---VSFVQATVFFPLQANTQDVSSAAIVTKSTKTE------------------GQKVKYYSLIEFHE 189 (502) Q Consensus 132 ~~~~~~d~~~-~~---i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~------------------~~~~~~yt~~E~h~ 189 (502) ++.+--|++. ++ +..+|-.+ |-+..|..+....+|........ .+....|+.++... T Consensus 145 ~ly~~~~~~~~~~~~~~~~~pl~~-y~v~~d~~G~v~~i~r~~~~~~~~l~~~~~~~v~~~~~~~p~~~~~v~~~V~pr~ 223 (543) T protein:vir:88 145 LIYLPPPDASSNSYNPMKLYTLHN-HVVQRDAFGNVLQIVTLDKVAYAALPEDVRNSLSGGQEYKPEQELEVYTHIYIDD 223 (543) T ss_pred eeeeccCccccceecceEEeEcce-EEEeeCCCCCeeeeeeeeeccHHHHhHHhhHHHHHHhhcCCccceEEEEEEEeec Confidence 7644434332 22 33444444 34456666655555543221100 01111222222110 Q ss_pred EeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHH Q lcl|NC_012753. 190 WNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFI 269 (502) Q Consensus 190 ~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~l 269 (502) +.+.| ..|.-+. |..|+.. ...+. ...-||+.++- +...++.||+|--..+.+-+..| T Consensus 224 -~~~~~-~~~~~~~-------~~~v~~~------~~~~~---~~e~P~i~~Rw----~~~~ge~YGrgp~~~~l~D~k~L 281 (543) T protein:vir:88 224 -ESGDF-LSYQEIE-------GVEVDGS------DGQYP---QDALPWIAVRW----TKRDGEHYGRSHVEEYLGDLNSL 281 (543) T ss_pred -CCCcc-ccccccc-------CeeeecC------CCccc---cccCCceeeee----eecCCCccccchHHHHHHHHHHH Confidence 01111 1111000 1111111 00010 11234444442 22346789999999999999999 Q ss_pred HHHHHHHHHHHh-hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHH Q lcl|NC_012753. 270 NTTYDEFMWEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAI 348 (502) Q Consensus 270 d~~~S~~~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l 348 (502) |..--....-.+ ..+..+.||.+... +...+.+. ....+.. ...++...++ +...-+...-.+.+ T Consensus 282 ~~l~~~~l~~~~~~~~pp~~v~~~g~~-----~~~~~~~~-----~~g~~v~---g~~~~v~~~~-~~~~~~~~~~~~~i 347 (543) T protein:vir:88 282 ESLNEAMIKFAMISSKVVGLVNPNGIT-----QVRRLVKA-----QTGDFVA---GRKADIEFLQ-LEKTADFTVAKSVA 347 (543) T ss_pred HHHHHHHHHHHHHHhcCceeecccccc-----chhhcccC-----CCceeec---CCCCcceeee-cccccchhHHHHHH Confidence 987766665543 44555556443221 11111111 0111110 1111111111 11111222333444 Q ss_pred HHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhcccCCCcccccc Q lcl|NC_012753. 349 NKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL-VEKSLKELVISILELAKVYNLYTGEIPTMDE 427 (502) Q Consensus 349 ~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~-~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~ 427 (502) +.+.+.|....=+. .+...++...|||||....+...+..+-.-.. =...|..|+.-++.+....+..+..... . T Consensus 348 ~~~~~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~--~ 423 (543) T protein:vir:88 348 DAIEARLSYVFMLN--SAVQRSGERVTAEEIRYVASELEDTLGGVYSILSQELQLPIVRVLLNQLQATQQIPNLPQE--A 423 (543) T ss_pred HHHHHHHHHHHhhh--hhccCCCCcccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchh--c Confidence 44444443332111 12333344579999998877776666552222 2333444454444444333333332222 3 Q ss_pred eEEEeCCCc-cCCHHHHHHHHHHHHh-cCCCC---------HHHHHH---hcCCC-------CHHHHHHHHHHHHHhhhc Q lcl|NC_012753. 428 VSVDLDDGV-FTDRNAEFDYWSKMVA-AGFAP---------KTMAIE---KTLNV-------TKEQAQEIYQKINDETMV 486 (502) Q Consensus 428 i~v~f~d~i-~~d~~~~~~~~~~~~~-~Gi~S---------~et~l~---~~~~~-------~deea~~el~ri~~E~~~ 486 (502) +++++--.+ +......++.+.+..+ .|.++ ...++. ...|+ +++|++++.++-.++++. T Consensus 424 v~~~~vs~l~~l~r~~~~~~l~~~~~~v~~~~~p~vld~id~d~~~~~~a~~~Gv~~~~i~r~~~e~~~~~~q~~~q~~~ 503 (543) T protein:vir:88 424 VEPTVTTGAEALGRGQDLDKLTQFLNAVATVSQLNGDPDLNVNNIKLRLANAIGIDTAGLLLTEAEKAQAQSQEMLKQGG 503 (543) T ss_pred eeeeEEecHHHHHHHHHHHHHHHHHHHHHhccchhhhccCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHH Confidence 444432211 1111222222222111 12222 222222 22354 334443332221111111 Q ss_pred cc--CC--CCCccccCCCCC Q lcl|NC_012753. 487 ST--DS--FRTSEEVDIYGE 502 (502) Q Consensus 487 ~~--~~--~~~~~~~~~~g~ 502 (502) .. .. -....+.--.|+ T Consensus 504 ~~~~~~~~~~~~~~~~~~~~ 523 (543) T protein:vir:88 504 LNAAAGIGSGVAAQATASPE 523 (543) T ss_pred HHHHHHHhhchhhhhccChH Confidence 00 00 000111111222 No 147 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=98.11 E-value=4.4e-06 Score=49.97 Aligned_cols=401 Identities=12% Similarity=0.115 Sum_probs=161.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |++|+++|+++++.......... .+. .++ ...+.+ |-.. . ..+.... ..-+..+---.. T Consensus 7 mg~f~r~~~~~~~~~~~~~~~~~-~~~-----~~~--------~~~~~~-~~~~--~---~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:81 7 LGLFGQLKAMFVPPDPVDIGGGQ-TFT-----PVN--------ATARDL-GIII--S---DTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred cchhhhhhhhccccccccccccc-ccc-----cCc--------cchhhh-cccc--c---ccCcccchHhhhccHHHHHH Confidence 99999999998763211000000 000 000 000111 1000 0 0011100 011111112334 Q ss_pred HHHHhhhhhcCcceEee--CC---HHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRV--DN---EVADAFINETLKN--DK---FSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQ 148 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~--~d---~~~~e~l~~~~~~--~~---f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~ 148 (502) |+..|+-+-+=|+.+-- ++ +..+.-+..+|.. |. -..-...++...+..|.+|+.+..++|++ .+..++ T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnayv~i~~~~g~~~~L~~l~ 146 (432) T protein:vir:81 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLA 146 (432) T ss_pred HHHHHHhhhhCceeeEEecCCcceecccchHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEc Confidence 45555555444554311 11 1111123333321 22 22334445667788899988887776664 556677 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+.+-+...+.+.. +++++.. ++..+ .++ +++ T Consensus 147 ~~~v~v~~~~~g~~-----~y~~~~~-----------------~g~~~----------------~~~--------~~~-- 178 (432) T protein:vir:81 147 NDRLTITTDPKGNT-----AYRYRRT-----------------DGQMI----------------DIP--------KQQ-- 178 (432) T ss_pred CCceEEEECCCCcE-----EEEEEec-----------------CceEE----------------EEc--------ccc-- Confidence 77665542222211 0111110 01000 000 000 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-c Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-V 307 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~ 307 (502) +++++.+. .+...|+|-+..+...|+.....-.-..+-|..+...=.| +......+..... . T Consensus 179 --------iih~r~~~-----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gi----l~~~~~l~~e~~~~~ 241 (432) T protein:vir:81 179 --------IWKIMGYS-----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVY----YQIDRFLTDDQYDSF 241 (432) T ss_pred --------EEEecCCC-----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceE----EecCCCCCHHHHHHH Confidence 12233211 1123588887777666654443332222334443332111 2221111100000 0 Q ss_pred cccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHH Q lcl|NC_012753. 308 KREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDT 386 (502) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l 386 (502) ...+....+....+.. +.+.-++.++.....-++++..+...++|+...|++|..+|+...+. .+++.+.-..... T Consensus 242 ~~~~~~~~nag~~~vl---~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f 318 (432) T protein:vir:81 242 AKKVSGSVEAGRAPLL---EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGF 318 (432) T ss_pred HHHHhhhhcCCCceec---CCCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHH Confidence 0000000011111111 11223556666666677888888889999999999999998765432 2222222111111 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc Q lcl|NC_012753. 387 Y-QMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT 465 (502) Q Consensus 387 ~-~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~ 465 (502) + .++.-....++..|.. .+..........+.++++.-+..|..+.++...+++.+|+++.-++++.+ T Consensus 319 ~~~tl~P~~~~ie~~l~~------------kLl~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~t~NE~R~~~ 386 (432) T protein:vir:81 319 LTMTLSPWLRRIEQSIAL------------NLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREIE 386 (432) T ss_pred HHHHHHHHHHHHHHHHHh------------hccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh Confidence 1 1222222222222221 11111111112344444455677889999999999999999999976543 Q ss_pred --CCCCHHHHHHH--------HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 466 --LNVTKEQAQEI--------YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 466 --~~~~deea~~e--------l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++.... ... +..+.++.......-....+-+--.| T Consensus 387 glpp~~g~~-~~~~~~~~~~pl~~~~~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 387 GLPKLGGNA-AVLTVQSAMVPLDSIGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred CCCCCCCCc-ceEeecCcccchhhhccCCCCCCCCCCCCcccccccC Confidence 2332211 000 01111100000000000001111111 No 148 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.09 E-value=4.9e-06 Score=49.70 Aligned_cols=408 Identities=11% Similarity=0.081 Sum_probs=164.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH--HH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG--RT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~--k~ 78 (502) |+|++++....+. ..+........-..++. .-.+.+. ...+..... ...+..+ -. T Consensus 1 Mg~~~~l~~r~~~-------~~~~~~~~~~~~~~~~~--------~~~~~~~-------~~~g~~V~~-~~al~~~~V~~ 57 (457) T protein:vir:13 1 MGFWSALFGRGHS-------PALDGIEARAWEPYDPS--------IYNLGAV-------AASGETVTP-HDALQVSAVFA 57 (457) T ss_pred Cchhhhhhccccc-------ccccccccccccccchH--------HHhhccc-------ccCCceech-HHhhccHHHHH Confidence 9999877553222 11111110000011111 0001110 001111110 1111111 23 Q ss_pred HHHHHhhhhhcCcceEeeCC-----HHHHHHHHHHHhh-c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEc Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDN-----EVADAFINETLKN-D---KFSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQ 148 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d-----~~~~e~l~~~~~~-~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~ 148 (502) .|+..|+-+-+=|+.+--.+ +.....|...++. + .....+..++...+..|.+|+.+-.++|++ .+..++ T Consensus 58 ~v~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~l~~~ln~~~n~~t~~~f~~~~~~~lll~Gna~~~i~~~~g~~~~l~~l~ 137 (457) T protein:vir:13 58 SVRLLSETIATLPLSTYSKRGGSRKEIVTPEWLDYPNAEPGGMGRIDILSQTVLSLLLQGNAFLAVRWQGPNIVGLDVLD 137 (457) T ss_pred HHHHHHHhhccCceEEEEecCCcccccccchHHHhccccCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEc Confidence 45555555555565542211 1112233344432 1 233455666777788899998887776664 566666 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++-+.....+....-.| ..|..... |..+-...+. +. T Consensus 138 p~~v~v~~~~~~~~~~~~~------------------~~y~~~~~-----------------~~~~~~~~~~---~~--- 176 (457) T protein:vir:13 138 PTKIHVHMVMVDGLRRKVF------------------EAYDIDAD-----------------GNEVLLGWFT---PR--- 176 (457) T ss_pred cCceEEEEecCCCccceeE------------------EEEEEecC-----------------CceeeEEeeC---cc--- Confidence 6665543222111111111 01111000 0000000000 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) -+++++.+..+ +..+|+|.+..+...|.....+-.-..+-|..+...-.| |.....-+.... T Consensus 177 -------diih~~~~~~~----~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~ls~e~~--- 238 (457) T protein:vir:13 177 -------DVLHIPGMMLP----GDFVGCSPISYARESIGLALAAQKYGSKFFANGAMPGAV----VEVPGTMSEEGL--- 238 (457) T ss_pred -------ceEEecCCCCC----CccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceE----EEcCCCCCHHHH--- Confidence 12334333221 234688988777776665444433333334443332111 222111000000 Q ss_pred cccccc-chhhccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHH Q lcl|NC_012753. 309 REFETG-HNVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQ 383 (502) Q Consensus 309 ~~~~~~-~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~ 383 (502) ..+... ...+..... .--+.+.-++.++.....-++.+..+....+|+...|++|..+|+...+..++..+.-.. T Consensus 239 ~~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~eq~~ 318 (457) T protein:vir:13 239 ARAREAWRAANSGVDNAHRVALLTEGAKFSKVAMSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWGSGLAEQN 318 (457) T ss_pred HHHHHHHHHHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccccchHHHHH Confidence 000000 001110000 000112235556655556677888888889999999999999987665543332222111 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 384 SDT-YQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAI 462 (502) Q Consensus 384 ~~l-~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 462 (502) ... ..++.-....++..|.. .+....-.....+.++++.-+-.|..+.++...+++.+|+|+.-+++ T Consensus 319 ~~f~~~tl~P~~~~ie~~ln~------------~L~~~~~~~~~~i~fd~~~l~~~D~~~r~~~~~~~~~~G~~T~NE~R 386 (457) T protein:vir:13 319 IAFTMFSLRPWLERIEAGFNR------------LLFAETADRFRFVKFNLDEIKRGAPKERMELWSLGLQNGIYSIDEVR 386 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHH------------hhcCccccCceeEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 111 11222222222222222 11111111122356666666777889999999999999999998876 Q ss_pred Hhc--CCCCHHHHHHHHH-----HHHH----hhhcccCCCCCccc--------cCCC-----CC Q lcl|NC_012753. 463 EKT--LNVTKEQAQEIYQ-----KIND----ETMVSTDSFRTSEE--------VDIY-----GE 502 (502) Q Consensus 463 ~~~--~~~~deea~~el~-----ri~~----E~~~~~~~~~~~~~--------~~~~-----g~ 502 (502) +.. .++.+..+++.+. .+.+ +.+...+....+.+ .|-. +| T Consensus 387 ~~~gl~Pi~~g~~d~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~ 450 (457) T protein:vir:13 387 AAEDMTPLPDGLGEKYRVPLNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATE 450 (457) T ss_pred HHhCCCCCCCCcccceeeccccccccccccccccCCCCCCCCCccccCCCCCCCCCCccccCCC Confidence 542 2333321111111 1110 10000011100100 0000 11 No 149 >protein:vir:78696 Length: 542 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285446;genbank:gi:148724480;genbank:GeneID:5220167 Probab=98.08 E-value=5.1e-06 Score=49.60 Aligned_cols=450 Identities=9% Similarity=0.045 Sum_probs=178.7 Q ss_pred HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh Q lcl|NC_012753. 7 IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL 86 (502) Q Consensus 7 ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~ 86 (502) +|...|+.+. .+..++-.....|+.+|.=-.+-+.............++--+.+...++.+|+. T Consensus 1 mk~~a~~r~~----------------~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dstg~~a~~~Laa~ 64 (542) T protein:vir:78 1 MKGLAQARYS----------------AMRADREDFLDMARRCAALTLPYLLTEDGHASGGRLQQPYQSLGSKGVNALSSK 64 (542) T ss_pred ChhHHHHHHH----------------HHHHHhhHHHHHHHHHHHHhccccCCCCCCcccccccccccchHHHHHHHHHHH Confidence 2222222111 112233233444444432211111111111111112233335677788888777 Q ss_pred hhcC--cc-----eEeeCCH--------------HHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe Q lcl|NC_012753. 87 VFNE--QA-----TIRVDNE--------------VADA-------FINETLKNDKFSKNFERYLESCLALGGLAMRPYID 138 (502) Q Consensus 87 l~~e--p~-----~i~~~d~--------------~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d 138 (502) |.+- || ++.+.+. .+.. .+.+.+..++|...+.++..+..+.|.+++ |.+ T Consensus 65 l~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~ 142 (542) T protein:vir:78 65 LMLSLFPIQTSFFKLQINDAEIASVPELTPEVRSEIDMNLSKMEKMVMQQIAESSDRVQLTAAMKHLIVTGNVLV--FAG 142 (542) T ss_pred HHHhhcCCCCccccccCCHHHHHhhccCChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--Eec Confidence 7652 22 1233221 1222 334556678999999999999999999764 566 Q ss_pred CCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee-------CCCceEEEEEEEE-EEeCCeEEEEEEEEecCCcccc Q lcl|NC_012753. 139 GDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE-------GQKVKYYSLIEFH-EWNKETYTISNELYESESKTII 210 (502) Q Consensus 139 ~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~-------~~~~~~yt~~E~h-~~~~~~~~I~~~l~~~~~~~~l 210 (502) ++. ++.+|-.+++ +..|..+....+|. ++...- +............ .-.+..+.+.|.++.-.+.+.. T Consensus 143 ~~~--~~~~pl~~y~-v~~d~~G~vd~v~r-~~~~t~~ql~~~fg~~~l~~~~~~~~~~~~~~~~~v~~~v~pr~~~~~~ 218 (542) T protein:vir:78 143 KKT--LKVYPLDRYV-IERDGDGNVIEIIT-RELVDRSLLPAEFQKQSLLEGKDSNAVGEDGPKFGVAQGKGGRNDAEVF 218 (542) T ss_pred CCC--ceEEecceeE-EeeCCCCCeEEEee-eeecCHHHHHHhhccccCchHHHhhccccCCCeEEEEEEeecccCCccc Confidence 665 4456666644 45666555555543 322110 0000000000000 0011123344433332211100 Q ss_pred ---CceeeccccccCCC-----cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh- Q lcl|NC_012753. 211 ---GQRVPLSTLYEDLE-----ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK- 281 (502) Q Consensus 211 ---G~~v~l~~~~~~l~-----~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~- 281 (502) ....+..+++..+. ......|+..-||+.++-+ ...++.||+|-...+.+-+..|+..--....-.+ T Consensus 219 ~~~~~~~~~~s~~~e~~g~~v~~~~~e~g~~~~P~i~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~~~~ 294 (542) T protein:vir:78 219 TCCKLVDGQHRWHQECDGKEIKGSRSSSPLKHSPWLPLRFN----VVDGESYGRGRVEEFFGDLSSLDALTRSLIEGSAA 294 (542) T ss_pred cccccCCCeEEEEEEeccccccccccccccccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 00011111111111 1111224444455555432 2357789999999999999999987766665543 Q ss_pred hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCC Q lcl|NC_012753. 282 MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGV 361 (502) Q Consensus 282 ~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 361 (502) ..+..+.||.+... +...+. +.....+.. ...++...+. +....+...-.+.++.+.+.|....-+ T Consensus 295 a~~pp~lv~~~g~~-----~~~~~~-----~~~~g~iv~---g~~~~v~~~~-~~~~~~~~~~~~~i~~~~~rI~~aFl~ 360 (542) T protein:vir:78 295 AAKVVFMVSPSATT-----KPQSLA-----RAGTGAIIQ---GRAEDVSVVQ-ANKGADFRTVQEMIRDLSQRISDAFLI 360 (542) T ss_pred HhcCceeecccccc-----chhhcc-----cCCCceeec---CCccceeeee-cccccchhHHHHHHHHHHHHHHHHhcc Confidence 44555556443211 111100 001111110 1111111111 111112222333344444444333222 Q ss_pred ChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCC- Q lcl|NC_012753. 362 STGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV-EKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTD- 439 (502) Q Consensus 362 s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d- 439 (502) . ...++...|||||....+...+..+-.-..+ ...|.-++.-++.++.-.+..+... ..-+++++--++..- T Consensus 361 ~----~~~d~~rvTAtEV~~r~~E~~~~LG~v~~rl~~E~L~Pli~R~~~il~r~g~lP~~p--~~lv~~~~~s~La~~~ 434 (542) T protein:vir:78 361 L----NVRQSERTTATEVREVQMELDRQLSGIYGSLTVELLTPYLNRKLHLMQRSKQLPSLP--KGLVMPTVVAGLGGVG 434 (542) T ss_pred c----ccCCcccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCc--hhceeeeeechHHHHH Confidence 1 1122334699999988777766665533333 3334444444454444333333222 223555554332110 Q ss_pred HHHHHH---HHHHHHhc--C------CCCHHHHHH---hcCCCC-------HHHHHHHHHHHHHhhhccc--CCCCCcc- Q lcl|NC_012753. 440 RNAEFD---YWSKMVAA--G------FAPKTMAIE---KTLNVT-------KEQAQEIYQKINDETMVST--DSFRTSE- 495 (502) Q Consensus 440 ~~~~~~---~~~~~~~~--G------i~S~et~l~---~~~~~~-------deea~~el~ri~~E~~~~~--~~~~~~~- 495 (502) ....++ ...+.++. | .+....++. ...|++ +|+++++.++.++.+..+. ...-... T Consensus 435 r~~~~~~l~~~~~~i~~~~~p~~l~~~id~d~~~~~~a~~~Gvp~~~i~~s~e~~~~~~~q~q~~~~~~al~~~a~~~a~ 514 (542) T protein:vir:78 435 RGEDRAALIEFMQTVGQAMGPEALQQFIDPTEFLKRLAAASGIDTLNLVKSPETMANEAQQAQQQQMTASLMGQAGQLAK 514 (542) T ss_pred HHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHcCCCHhhccCCHHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 001111 11122211 1 012222222 224543 3555555444333222110 0000001 Q ss_pred -------------------ccCCCCC Q lcl|NC_012753. 496 -------------------EVDIYGE 502 (502) Q Consensus 496 -------------------~~~~~g~ 502 (502) .+-.-|| T Consensus 515 ~~~~~~~~~~~~a~~~~~~~~~~~~~ 540 (542) T protein:vir:78 515 SPIGEKMMQQINAPGQEAPAGPQTGE 540 (542) T ss_pred cccccchhhhcCCCCcCCCCCCcccc Confidence 1112222 No 150 >protein:vir:94709 Length: 522 # NCBI annotation: head to tail connector # Family: family:all:481 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338118;genbank:gi:77118196;genbank:GeneID:3707732 Probab=98.05 E-value=6e-06 Score=49.23 Aligned_cols=442 Identities=9% Similarity=0.054 Sum_probs=180.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |.--+.+-+ +.+.+-.+ .+..++-.....|+.+|.=-.+-+......+....+.++--+.+...+ T Consensus 1 ~~~~~~~~~-----------~~~~~r~~----~l~~~R~~~e~~w~e~~~y~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 65 (522) T protein:vir:94 1 MAEREGFAA-----------EGAKAVYD----RLKNGRQPYETRAQNCAAVTIPSLFPKESDNSSTEYTTPWQAVGARCL 65 (522) T ss_pred CcccchhhH-----------HHHHHHHH----HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH Confidence 333221100 00100000 112223233444444432221212112222222233334446677777 Q ss_pred HHHhhhhhcC--c--ceEee--CC-------------HHHHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 81 KKVASLVFNE--Q--ATIRV--DN-------------EVADAFI-------NETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 81 ~~~a~~l~~e--p--~~i~~--~d-------------~~~~e~l-------~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) +.+|+.|.+- | +=|.+ .+ ..+.++| .+.+..++|...+.++.++..+.|.+++. T Consensus 66 ~~Las~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~~~~snf~~~~~~~~~~L~~~G~a~l~ 145 (522) T protein:vir:94 66 NNLAAKLMLALFPQSPWMRLTVSEYEAKTLSQDSEAAARVDEGLAMVERVLMAYMETNSFRVPLFEALKQLIVSGNCLLY 145 (522) T ss_pred HHHHHHHHhhcCCCCcccccccchhhhhccCcccchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCcEeEe Confidence 7777776652 2 11222 21 1123333 34566789999999999999999998764 Q ss_pred EEEeC-Cc-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee-----------------CCCceEEEEEEEEEEeCCeE Q lcl|NC_012753. 135 PYIDG-DQ-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE-----------------GQKVKYYSLIEFHEWNKETY 195 (502) Q Consensus 135 ~~~d~-~~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~-----------------~~~~~~yt~~E~h~~~~~~~ 195 (502) +--+. +. ..+..+|-.+++ +..|..+....+|.......+ .+....|+.++ ..++.+ T Consensus 146 ~~~~~~~~~~~~~~~pl~~y~-v~~d~~G~vd~i~r~~~~~~~~l~~~~~~~~~~~~~~p~~~v~v~~~v~---~~~~~~ 221 (522) T protein:vir:94 146 IPEPEQGTYSPMRMYRLVSYV-VQRDAFGNILQIVTIDKVAFSALPEDVKSQLNADDYEPDTELEVYTHIY---RQDDEY 221 (522) T ss_pred eeccCCCceeeEEEEEcceEE-EeeCCCcCeEEEeeeeeccHHhcchHHHHHHhcccCCccceEEEEEEEE---eeCCce Confidence 33232 22 457778877755 455655554455433221110 01112222222 223333 Q ss_pred EEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHH Q lcl|NC_012753. 196 TISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDE 275 (502) Q Consensus 196 ~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~ 275 (502) ...+.+ . |..++..+ .. .++..-||++++-+ ...++.||+|--..+.+-+..|+..--. T Consensus 222 ~~~~~~----~----g~~~~~~~------~~---~~~~e~P~~~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l~~~ 280 (522) T protein:vir:94 222 LRYEEV----E----GIEVTGTD------GS---YPLTACPYIPVRMV----RLDGEDYGRSYCEEYLGDLNSLETITEA 280 (522) T ss_pred eEEeec----c----CceecccC------CC---CccccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHHHHH Confidence 221111 1 22222111 11 12333455544432 2346789999999999999999987776 Q ss_pred HHHHHh-hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec--cccchHHHHHHHHHHH Q lcl|NC_012753. 276 FMWEVK-MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT--TDIRSDDYIKAINKGL 352 (502) Q Consensus 276 ~~~~~~-~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~l~~~l 352 (502) ...-.+ ..+..+.||.+.... ...+.+. ....+.. ...+ .++.++ ..-+...-...++.+. T Consensus 281 ~l~~~~~~~~p~~~v~~~g~~~-----~~~~~~~-----~~g~~v~---g~~~---~v~~~~~~~~~~~~~~~~~i~~~~ 344 (522) T protein:vir:94 281 ITKMAKVASKVVGLVNPNGITQ-----PRRLNKA-----ATGEFVA---GRVE---DINFLQLTKGQDFTIAKSVADAIE 344 (522) T ss_pred HHHHHHHHhCCceeeccccccc-----chheecc-----CCceeec---CCcc---cceeeecccccchhHHHHHHHHHH Confidence 666553 455556664432211 1111110 0011110 1111 122111 1111222223344444 Q ss_pred HHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEE Q lcl|NC_012753. 353 SLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI-ATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVD 431 (502) Q Consensus 353 ~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~-~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~ 431 (502) +.|....-+. .++..++...|||||....+.+.+..+-. .+.-...|..|++-++.+..-.+..+.. +...++++ T Consensus 345 ~rI~~af~~~--~~~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~--p~~~v~v~ 420 (522) T protein:vir:94 345 QRLGWAFLLN--SAVQRNAERVTAEEIRYVAGELEATLGGVYSVQSQELQLPIVRVLMNQLQSAGMIPDL--PKEAVEPT 420 (522) T ss_pred HHHHHHHhhh--hhccCCCccccHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC--CcccEEee Confidence 4443332122 23333445579999998877777666552 2222334444554444444333333322 22235555 Q ss_pred eCCCccCC-HHHHHHHHHHHHh-cCCCCHH---------HH---HHhcCCC-------CHHHHHHHHHHHHHhhh--ccc Q lcl|NC_012753. 432 LDDGVFTD-RNAEFDYWSKMVA-AGFAPKT---------MA---IEKTLNV-------TKEQAQEIYQKINDETM--VST 488 (502) Q Consensus 432 f~d~i~~d-~~~~~~~~~~~~~-~Gi~S~e---------t~---l~~~~~~-------~deea~~el~ri~~E~~--~~~ 488 (502) +--++..- ....++.+.+..+ .+.++++ .+ +....|+ +++|+++++++..+.++ ... T Consensus 421 ~~s~La~~qr~~~~~~l~~~~~~ia~l~P~~~~~~id~d~~~~~~a~~~Gv~~~~ivr~~ee~~~~~~q~~~~~~~~~~~ 500 (522) T protein:vir:94 421 VSTGLEALGRGQDLEKLTQAVNMMTGLQPLSQDPDINLPTLKLRLLNALGIDTAGLLLTQDEKIQRMAEQSSQQAVVQGA 500 (522) T ss_pred EecHHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHcCCChhhccCCHHHHHHHHHHHHHHHHHHHHH Confidence 53221110 0011111111110 0012222 11 2233444 34455544443222111 111 Q ss_pred CCCCCccccCCCCC Q lcl|NC_012753. 489 DSFRTSEEVDIYGE 502 (502) Q Consensus 489 ~~~~~~~~~~~~g~ 502 (502) +......++.+..+ T Consensus 501 ~~~~~~~~a~~~~~ 514 (522) T protein:vir:94 501 SAAGANMGAAVGQG 514 (522) T ss_pred HHHHHHhhhhhhcc Confidence 11111111222211 No 151 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=98.05 E-value=6e-06 Score=49.21 Aligned_cols=390 Identities=13% Similarity=0.096 Sum_probs=165.7 Q ss_pred cchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEe-eCC- Q lcl|NC_012753. 21 QSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIR-VDN- 98 (502) Q Consensus 21 ~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~-~~d- 98 (502) |-|.+......-.++... ..+ -.+.|....-.+. . .++-+...---..|+..|+-+.+=|..+- -.+ T Consensus 1 m~f~~~~~~~~~~~~~~~-~~~----~~~~g~~~~~~~v--~----~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~~ 69 (409) T protein:vir:10 1 MLFRKGFKNQSQEISIDD-KKI----LEWLGINPSETYV--N----GKSCLKQATVFGCIRILSDNISKLPIKIYQKKDG 69 (409) T ss_pred CcccccccCcCCCCCCCh-HHH----HHHhcCCcCccee--c----hhhhhccHHHHHHHHHHHHhhhhCceEEEEecCC Confidence 223332222211111110 011 1122221110000 0 01122223334455566655555455441 111 Q ss_pred H--HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEEcCCeEEEEEEcCCCeEEEEEEE Q lcl|NC_012753. 99 E--VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSFVQATVFFPLQANTQDVSSAAIVT 169 (502) Q Consensus 99 ~--~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~ 169 (502) . ..+.-+..+|.. | ....-+...+...+..|.+|+.+..+.+ . ..+..++|+++-++..+.+... T Consensus 70 ~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~------ 143 (409) T protein:vir:10 70 IKRVPDHYLEYLLKLRPNPYMSSSDFWKCIEVQRNIYGNAYVALDFKKNGEIKGLYPLKSDGMKIFVDDTGLLN------ 143 (409) T ss_pred eeeccCchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCCceEEEEcCCcccc------ Confidence 1 111123333321 1 2334455567778889999998887754 4 3566777777655432211110 Q ss_pred EEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCcccccc Q lcl|NC_012753. 170 KSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKD 249 (502) Q Consensus 170 ~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~ 249 (502) ......| + |.. ..|....+ + .--.++++.+.. T Consensus 144 ------~~~~~~y-------------~-----~~~----~~g~~~~~---~-------------~~evih~r~~~~---- 175 (409) T protein:vir:10 144 ------SENNVWY-------------L-----YTD----DLGQRHKF---M-------------SDEILHFKGLTA---- 175 (409) T ss_pred ------ccceEEE-------------E-----EEe----CCceeEEe---c-------------cccEEEecCcCC---- Confidence 0000001 0 000 00111100 0 001234443211 Q ss_pred ccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc-chhhcccc----CC Q lcl|NC_012753. 250 INSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG-HNVYEQFD----SG 324 (502) Q Consensus 250 ~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~~~~~~----~~ 324 (502) +...|+|.+..+...++....+-....+-|..+...=.| |......+.... ..+... ...+.... .. T Consensus 176 -d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gi----l~~~~~l~~e~~---~~~~~~~~~~~~g~~n~~~~~ 247 (409) T protein:vir:10 176 -DGLAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGL----VQYAGDLNPEAE---EVFKENFERMSSGLKNAHRIA 247 (409) T ss_pred -CCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE----EEcCCCCCHHHH---HHHHHHHHHHhccccccCCce Confidence 235799998887777766554443333345544332112 222111110000 000000 01111110 00 Q ss_pred CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 325 DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKS-MKTATEVVSEQSDTYQMRNSIATLVEKSLKE 403 (502) Q Consensus 325 ~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~ 403 (502) --+.+..++.++.....-++.+..+...++|+...|+|+..+|...++ .+++.+... ..++.+|.. T Consensus 248 vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~-------------~f~~~~l~P 314 (409) T protein:vir:10 248 MLPIGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNR-------------EFYIDTLQS 314 (409) T ss_pred ecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHH-------------HHHHHHHHH Confidence 001222355666666667788888888999999999999999865443 223333221 122334444 Q ss_pred HHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH-HHHHHHHHHHHH Q lcl|NC_012753. 404 LVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK-EQAQEIYQKIND 482 (502) Q Consensus 404 l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d-eea~~el~ri~~ 482 (502) +++.|-...+..-+..........+.|+++.-.-.|..+.++...+++.+|++++-++++.+ |+.. +..++-+. . T Consensus 315 ~~~~ie~~ln~kL~~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~l-gl~p~~ggD~~~~--~- 390 (409) T protein:vir:10 315 ILNMYELEINYKLFLISEIKNGFYSKFNVDTILRADIKTRYESYKEAIQNGFKTPNEIRELE-EDEPLEGGDVLLI--N- 390 (409) T ss_pred HHHHHHHHHHHhhcCchhccCCcEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcCeeee--c- Confidence 44433322221101111122223466666666678999999999999999999999876643 4432 10111100 0 Q ss_pred hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 483 ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 483 E~~~~~~~~~~~~~~~~~g~ 502 (502) .+....... .....=+|| T Consensus 391 ~n~~~~~~~--~~~~~kgGe 408 (409) T protein:vir:10 391 GNMIPVKMA--GEQYSKGGE 408 (409) T ss_pred cCccchhhc--cccccccCC Confidence 010000000 111112355 No 152 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.04 E-value=6.1e-06 Score=49.17 Aligned_cols=378 Identities=9% Similarity=0.006 Sum_probs=159.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |+||+++++--+ .......... .... ........+.|. .+.... +.-+..+--... T Consensus 3 m~~f~~~~~~~~----~~~~~~~~~~----~~~~------~~~~~~~~~~~~---------~~~~v~~~~al~~~~v~~~ 59 (392) T protein:vir:10 3 LPILNFINQTND----PPEVGSVQSY----FPDG------NDAQIMESLLGD---------NNEWVSARAALRNSDLFSI 59 (392) T ss_pred chhhhhhhcccc----cccccccccc----cccC------chhhhhhhhcCC---------CCceechHHhhccHHHHHH Confidence 999986653211 1110100000 0000 000001111111 011100 001112222445 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQA 157 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~ 157 (502) |+.+|+-+-+=|+ .+.++.....+.+-...-....-+..++...+..|.+|+.+..|. |.+ .+..++|+.+-+... T Consensus 60 i~~ia~~ia~lp~--~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~ 137 (392) T protein:vir:10 60 ILQLSSDLAIVKI--NAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYF 137 (392) T ss_pred HHHHHHhhccCce--eeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEc Confidence 6666665554444 454444333333322222224445556778888999998887765 443 677777777655432 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) ..++.. ++++. ..+..+ +..+.+ . ++ -. T Consensus 138 ~~~~~~----~y~~~-----------------~~~~~~---------------~~~~~~---~---~~----------ei 165 (392) T protein:vir:10 138 EYENGM----YYNIT-----------------FDDPKI---------------EPILQA---P---QS----------DL 165 (392) T ss_pred CCCceE----EEEEE-----------------ecCccc---------------ceeEEE---c---cc----------cE Confidence 222211 10100 000000 000000 0 00 02 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchh Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNV 317 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~ 317 (502) ++++.+- ..+...|+|-+..+...|+....+-....+-|..+...-.+ |....+. ... ...+. .-... T Consensus 166 ih~~~~~----~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~~-~~~-~~~~~--~~~~~ 233 (392) T protein:vir:10 166 IHMKLLS----IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGV----LTVKGGG-LLS-DKDKA--SRSRS 233 (392) T ss_pred EEecCCC----CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCC-Cch-HHHHH--HHHHH Confidence 3343321 12335699998888888755444433333344544332111 2211110 000 00000 00000 Q ss_pred hccccCCCC----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDSGDM----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 318 ~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~ 393 (502) +.......+ +.+.-++.++.....-++.+..+...++|+...|+++..+|+......+.++.+. T Consensus 234 ~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~------------ 301 (392) T protein:vir:10 234 FMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISG------------ 301 (392) T ss_pred HhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHH------------ Confidence 111000000 1122355555555566788888888999999999999999875443322222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKE 471 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~de 471 (502) .++.+|..+++.|..-.+. .+. + .+.++...-.-.|..+.+..+.+++.+|+++..++.+.+ .|+..+ T Consensus 302 --f~~~~l~P~~~~ie~~l~~-~L~-----~--~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~ 371 (392) T protein:vir:10 302 --MYASALNRYLRPAISELEY-KLS-----D--HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK 371 (392) T ss_pred --HHHHHHHHHHHHHHHHHHH-hcc-----c--cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc Confidence 1223333333322221111 010 0 111222222234567777888899999999998876543 467655 Q ss_pred HHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 472 QAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 472 ea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |+.+. |... ..+++ -.+| T Consensus 372 e~r~~------e~l~------~~~~G-d~~~ 389 (392) T protein:vir:10 372 DLPAP------ENTN------KKTTG-QSNE 389 (392) T ss_pred ccchh------cCCC------CCCCC-CCCC Confidence 44321 1111 11111 1233 No 153 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.04 E-value=6.1e-06 Score=49.17 Aligned_cols=378 Identities=9% Similarity=0.006 Sum_probs=159.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |+||+++++--+ .......... .... ........+.|. .+.... +.-+..+--... T Consensus 3 m~~f~~~~~~~~----~~~~~~~~~~----~~~~------~~~~~~~~~~~~---------~~~~v~~~~al~~~~v~~~ 59 (392) T protein:vir:39 3 LPILNFINQTND----PPEVGSVQSY----FPDG------NDAQIMESLLGD---------NNEWVSARAALRNSDLFSI 59 (392) T ss_pred chhhhhhhcccc----cccccccccc----cccC------chhhhhhhhcCC---------CCceechHHhhccHHHHHH Confidence 999986653211 1110100000 0000 000001111111 011100 001112222445 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQA 157 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~ 157 (502) |+.+|+-+-+=|+ .+.++.....+.+-...-....-+..++...+..|.+|+.+..|. |.+ .+..++|+.+-+... T Consensus 60 i~~ia~~ia~lp~--~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~ 137 (392) T protein:vir:39 60 ILQLSSDLAIVKI--NAEKKKNQGIIDNPSTNANKHGFWQSMFAQLLLGGEAFAYRWRNANGADMKWEYLRPSQVNTYYF 137 (392) T ss_pred HHHHHHhhccCce--eeccchhhhHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEEcCceeEEEEc Confidence 6666665554444 454444333333322222224445556778888999998887765 443 677777777655432 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) ..++.. ++++. ..+..+ +..+.+ . ++ -. T Consensus 138 ~~~~~~----~y~~~-----------------~~~~~~---------------~~~~~~---~---~~----------ei 165 (392) T protein:vir:39 138 EYENGM----YYNIT-----------------FDDPKI---------------EPILQA---P---QS----------DL 165 (392) T ss_pred CCCceE----EEEEE-----------------ecCccc---------------ceeEEE---c---cc----------cE Confidence 222211 10100 000000 000000 0 00 02 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchh Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNV 317 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~ 317 (502) ++++.+- ..+...|+|-+..+...|+....+-....+-|..+...-.+ |....+. ... ...+. .-... T Consensus 166 ih~~~~~----~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~~-~~~-~~~~~--~~~~~ 233 (392) T protein:vir:39 166 IHMKLLS----IDGGKTGISPLYSLRRESKIQRASDRLTISSLNSSLNVPGV----LTVKGGG-LLS-DKDKA--SRSRS 233 (392) T ss_pred EEecCCC----CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCC-Cch-HHHHH--HHHHH Confidence 3343321 12335699998888888755444433333344544332111 2211110 000 00000 00000 Q ss_pred hccccCCCC----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDSGDM----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 318 ~~~~~~~~~----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~ 393 (502) +.......+ +.+.-++.++.....-++.+..+...++|+...|+++..+|+......+.++.+. T Consensus 234 ~~~~~~~g~~~vl~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~~~~~~~~------------ 301 (392) T protein:vir:39 234 FMKRSRSGGPVVLDDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQSSIQQISG------------ 301 (392) T ss_pred HhccccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHH------------ Confidence 111000000 1122355555555566788888888999999999999999875443322222221 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKE 471 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~de 471 (502) .++.+|..+++.|..-.+. .+. + .+.++...-.-.|..+.+..+.+++.+|+++..++.+.+ .|+..+ T Consensus 302 --f~~~~l~P~~~~ie~~l~~-~L~-----~--~~~~d~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~l~~~g~~p~ 371 (392) T protein:vir:39 302 --MYASALNRYLRPAISELEY-KLS-----D--HISVNMRPAIDPLGDNYLSTISTATRWGALAENQATFVLQEAGYIPK 371 (392) T ss_pred --HHHHHHHHHHHHHHHHHHH-hcc-----c--cccccchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHHHhcCCCcc Confidence 1223333333322221111 010 0 111222222234567777888899999999998876543 467655 Q ss_pred HHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 472 QAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 472 ea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |+.+. |... ..+++ -.+| T Consensus 372 e~r~~------e~l~------~~~~G-d~~~ 389 (392) T protein:vir:39 372 DLPAP------ENTN------KKTTG-QSNE 389 (392) T ss_pred ccchh------cCCC------CCCCC-CCCC Confidence 44321 1111 11111 1233 No 154 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=98.04 E-value=6.4e-06 Score=49.08 Aligned_cols=411 Identities=10% Similarity=0.060 Sum_probs=170.3 Q ss_pred CChhHHHHHH-HHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNF-IKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~-i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) |+|++.-|-- ++... +.+.++.+..+.+-....+.....+-..+......+ T Consensus 6 ~~~~~~~~~~~~~~~~----------------------------~~~~~~~~~~~~~~~pp~~~~~La~~~~~n~~v~sc 57 (540) T protein:vir:41 6 LSIKSLEKYRAIKGDT----------------------------DSQALKEDRFEEYVEPKVHPLVLLSLLQVNPYHASA 57 (540) T ss_pred cChhhccchhhhhccc----------------------------cccccccCCCCccccCCCCHHHHHHHHHhcHHHHHH Confidence 6665532221 11100 111112111111000000000011111233556788 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQA 157 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~ 157 (502) |+.+|+.+.+-|..+..++....+++-.. .-.+...+...+.+.+..|.+|+.+..+. |. ..+..++|..+-+.. T Consensus 58 I~~ia~~ia~~~~~i~~~~~~~~~~lpN~--~~t~~~f~~~~v~dlll~Gnayv~i~r~~~G~~~~L~~i~~~~V~v~~- 134 (540) T protein:vir:41 58 CSIKANDILRTGYLIDGDDGGVEELLRAC--RPSFEFILLQALEDLQVFNYCTLEVVRDDQGEPVRLDYIPAHTVRVHR- 134 (540) T ss_pred HHHHHHHHhcCCceEecCccchhhhccCC--CCCHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEeCCcceEEeE- Confidence 89999999999988887776655544211 12345556666777888999999887765 44 367778888765432 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) +..+ ++...+....+| +.. |.....+.. .+ |. ....+.. --. T Consensus 135 ~~~~---------~~~~~d~~~~~~----~~~-----~~~~~~~~~-~~----g~------------~~~~~~~---~eV 176 (540) T protein:vir:41 135 DGSR---------YMQTWDGIHVTY----FKD-----YRYEGEVNP-DN----GE------------DQDGVGA---NEI 176 (540) T ss_pred cCce---------eEeeecCceeee----eec-----ccccceeec-cc----cc------------cceeecc---cce Confidence 2111 111111111111 000 000000000 00 00 0000000 012 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccce---eeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRR---VAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~---i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) ++++.+. ..+..+|+|.+..+...+.....+..-..+-|..+... |.++..+.+...............+... T Consensus 177 iHir~~~----~~~~~~G~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~ 252 (540) T protein:vir:41 177 IFIHLPS----PICSYYGVPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGL 252 (540) T ss_pred EEecCCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHH Confidence 3454332 12456899988776665554443333223334554433 2222221110000000000000000000 Q ss_pred ---------chhhccccC-CCC--ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc---ccHHHH Q lcl|NC_012753. 315 ---------HNVYEQFDS-GDM--DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM---KTATEV 379 (502) Q Consensus 315 ---------~~~~~~~~~-~~~--~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~---~tAtei 379 (502) .+....+-. ..+ +.+.-++.++......++.+..+...++|+...|++|..+|....+. +++.+. T Consensus 253 ~~~~~~g~~~nag~~~vLe~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~ 332 (540) T protein:vir:41 253 IEDNFKYLKEAPHTPLVFSIPGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVA 332 (540) T ss_pred HHHHhccccccccceEEEecCCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHH Confidence 000000100 111 11223445555556667888888999999999999999998754322 334443 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHH Q lcl|NC_012753. 380 VSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKT 459 (502) Q Consensus 380 ~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~e 459 (502) ...+ ...++.-..+.++..|...+ + . .. .....|.|+..-....+ .+..+.+++.+|+++.- T Consensus 333 ~~~f--~~~tL~P~~~~ie~~ln~~L---~---------~-~~--~~~~~i~f~~~~ll~~D-~~~~~~~lv~~G~lT~N 394 (540) T protein:vir:41 333 RRTY--YESVVRPQQEIVSSVLTDFI---Q---------L-KL--DPGARFVFNEEILMESE-FVHNYALLVQCGVLTPS 394 (540) T ss_pred HHHH--HHHHHHHHHHHHHHHHHHhh---h---------h-cc--CCceEEEecchhhcchH-HHHHHHHHHhCCCCCHH Confidence 2222 11223333333444443321 0 0 01 12345667665444432 34456678889999999 Q ss_pred HHHHhcCCCC---HHHH------HHHHHHHHHh--hhc----------ccCCCC----------------CccccCCCCC Q lcl|NC_012753. 460 MAIEKTLNVT---KEQA------QEIYQKINDE--TMV----------STDSFR----------------TSEEVDIYGE 502 (502) Q Consensus 460 t~l~~~~~~~---deea------~~el~ri~~E--~~~----------~~~~~~----------------~~~~~~~~g~ 502 (502) +++..+.|+. |.-. ..+++..+.+ +.+ ..+... +...+++-+| T Consensus 395 E~Re~L~g~e~gdd~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 474 (540) T protein:vir:41 395 EVREKLFGLDGGPDMFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPLEDKKKKIDEVLSDFRAE 474 (540) T ss_pred HHHHHhCcCcCCCcccccccccccccccccccccCCCCccccccccchhcccccCccccccccccccccccccccccCCc Confidence 8876665542 2100 0111100000 000 000000 0001111111 No 155 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.02 E-value=7e-06 Score=48.84 Aligned_cols=384 Identities=10% Similarity=0.026 Sum_probs=163.7 Q ss_pred ccccceecchHHHHHHHHhhhhhcCcceEeeCC-----H---HHHHHHHH-HHhh--c-----------cHHHHHHHHHH Q lcl|NC_012753. 66 VKRDFNHLPIGRTASKKVASLVFNEQATIRVDN-----E---VADAFINE-TLKN--D-----------KFSKNFERYLE 123 (502) Q Consensus 66 ~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d-----~---~~~e~l~~-~~~~--~-----------~f~~~~~~~~~ 123 (502) .+.-....+....+|+..|+.+.+=|..+.... . ...+.+.. ++.. | -+...+..++. T Consensus 1 l~~l~~~n~~v~~ci~~ia~~ia~~p~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~pn~~~~~~~~~~~t~~~~~~~~~~ 80 (467) T protein:vir:31 1 MAELLEHNETHAKCVHAKSRYVAGFGINIIPHPEAEDPDRDGEQYERVWDFWFGDDSNWQVGPMESERATATNVLQTAWT 80 (467) T ss_pred ChhhhhcCHHHHHHHHHHHHhhhcCCeEEEEccCcccccchhhhhhhHHHHhhccCCCccccchhhHhhHHHHHHHHHHH Confidence 111111235667888999999988887764211 1 11122222 2211 1 23455666788 Q ss_pred HHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEE--EeCCeEEEEE Q lcl|NC_012753. 124 SCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHE--WNKETYTISN 199 (502) Q Consensus 124 ~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~--~~~~~~~I~~ 199 (502) ..+..|.+|+.+..+. |+ +.+..++|..+-+. .+..+. .........+|....... ...+.+...+ T Consensus 81 ~l~l~Gn~~i~~~r~~~G~~~~l~~l~~~~v~~~-~d~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 150 (467) T protein:vir:31 81 DYEAIGWLTIEILTQTDGTPTGLAYVPGHTIRKR-MDERGF---------VQLLEEKEKYFGVAGDRYQTNGNGDLDPVF 150 (467) T ss_pred HHHhcCCeEEEEEECCCCcEEEEEEeCCceeEee-eeccee---------EeecCCceeeEEeccccceeecccceeeee Confidence 8888999999888875 44 47888888887664 222111 111111111111000000 0000000000 Q ss_pred EEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 200 ELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE 279 (502) Q Consensus 200 ~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~ 279 (502) ........|..+ . ++.--.++|+.+.+ .+..+|+|.+..+...++. +.....+... T Consensus 151 ---~~~~~~~~~~~~-------------~---~~~~diih~r~~~~----~~~~~G~s~~~~~~~~i~~-~~~~~~~~~~ 206 (467) T protein:vir:31 151 ---VDADDGSTGTSV-------------S---NPANELIFKRNHSP----LYPHYGAPDIIPAVKTIRG-DSAAQDYNID 206 (467) T ss_pred ---eeecccccccee-------------E---eccccEEEecCCCC----CCCcccccHHHHHHHHHHH-HHHHHHHHHH Confidence 000000011111 0 01112345554322 2345799998887777654 3344444443 Q ss_pred -Hhhccce--e-eechHHhccCCCCCCcccCcccccccc---------------chhhccccCCCCc--ccccee--eec Q lcl|NC_012753. 280 -VKMGQRR--V-AVPTQMIKTEYDTNGEKVTVKREFETG---------------HNVYEQFDSGDMD--KGIGIT--DLT 336 (502) Q Consensus 280 -~~~~~~~--i-~v~~~~l~~~~~~~g~~~~~~~~~~~~---------------~~~~~~~~~~~~~--~~~~i~--~~~ 336 (502) |..+... | .++..++.. .... .....+... .+.........+. ...+++ .++ T Consensus 207 ~f~ng~~p~gil~~~~~~l~~----e~~~-~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls 281 (467) T protein:vir:31 207 FFENDGVPRIAIIVKGAELTE----KGRE-EMRNLIEDNNEDNHRTAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLT 281 (467) T ss_pred HHhccCCCceEEEecCcCCCH----HHHH-HHHHHHHhhhcchhhhhhhhhcccccccccccccCCCcccccceeEEecc Confidence 3544332 2 222211110 0000 000000000 0000000001110 011122 121 Q ss_pred ccc-chHHHHHHHHHHHHHHHHhcCCChhhccccccccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 337 TDI-RSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK--TATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAK 413 (502) Q Consensus 337 ~~i-r~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~--tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~ 413 (502) .-. ..-++.+..+...++|+...|++|..+|+..++.. ++.+....+ ...++.-..+.|+..|... T Consensus 282 ~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~lG~~~~~~~~s~~e~~~~~f--~~~~l~P~~~~ie~~ln~~--------- 350 (467) T protein:vir:31 282 VGIDEEASFLEFRGRNEHDILKVHDVPPVIAGVVESGAFSTDAEEQRKEF--AEETIQPKQHDFGELLYEL--------- 350 (467) T ss_pred ccChhhHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcccCHHHHHHHH--HHHHHHHHHHHHHHHHHHh--------- Confidence 111 23467788888899999999999999987544321 222221111 1122222222333333221 Q ss_pred hhcccC-CCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhhhcccCC Q lcl|NC_012753. 414 VYNLYT-GEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEIYQKINDETMVSTDS 490 (502) Q Consensus 414 ~~~~~~-~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~el~ri~~E~~~~~~~ 490 (502) ++. ........+.+++..-...|..+.++....++.+|+++.-++++.. .++.|++...-..-...-++...+. T Consensus 351 ---l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~Gl~pi~d~~~~~~~~~~~~~~~~~~~~ 427 (467) T protein:vir:31 351 ---VHKQGLDAPDWTIEFELAKPDTKLQDVEIASQRVQAMQGLLTVNELRDEFGFEPFPEEHVYGGETLVAEVTGGSGPG 427 (467) T ss_pred ---hcchhhccCCceEEEecchhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCcccccCCcccccccccccCCC Confidence 111 1112334577777888889999999999999999999999977653 2343321100000000000000000 Q ss_pred CCCccccCCCCC Q lcl|NC_012753. 491 FRTSEEVDIYGE 502 (502) Q Consensus 491 ~~~~~~~~~~g~ 502 (502) ....+...=.+| T Consensus 428 ~~~~~~~~~~~~ 439 (467) T protein:vir:31 428 GGIGDQIEQLVE 439 (467) T ss_pred CcccCcCCCCCC Confidence 000000000011 No 156 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=98.01 E-value=7.3e-06 Score=48.76 Aligned_cols=447 Identities=14% Similarity=0.144 Sum_probs=193.6 Q ss_pred Hhhcccccchhhhhccccc---cCCHHHHHHHHHHHHHhcCCC-CccccccCCCccccccceec---chHHHHHHHHhhh Q lcl|NC_012753. 14 SNYVITNQSLNSITDHPKI---AISPEEYNRIMDNLRYFAGDF-DSVTYRDSNGSQVKRDFNHL---PIGRTASKKVASL 86 (502) Q Consensus 14 ~~~~~~~~~l~~i~~~~~~---~~~~~~~~~i~~~~~~Y~g~~-~~~~~~~~~~~~~~~~~~~~---n~~k~iv~~~a~~ 86 (502) |...+...+++..-...+. ..+...-..+.-....|.|.. +...... +....-+++.++ +-.--.|+..++= T Consensus 1 ~~~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~~~~~~~~g~~~~~e~~~~-~~~eLI~~YR~ma~~pEvd~Av~eIVne 79 (537) T protein:vir:10 1 MAQQLFGFSLQRAKKVPKGPSFVQKDSLDGSQPIVGGGYFGYSVDFDGTIR-NDHELITRYREMVLNPECDSAVDDVVNE 79 (537) T ss_pred CccccccceeecccccccCCcccCCCcccccceeecccccccccccccccc-hHHHHHHHHHHHhhccchhhHHHHhhcc Confidence 3333333333222111110 000000001100001111110 0000000 000000111110 1111112222211 Q ss_pred h-----hcCcceEeeCC----H----HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC----c-eEEEEEc Q lcl|NC_012753. 87 V-----FNEQATIRVDN----E----VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD----Q-IRVSFVQ 148 (502) Q Consensus 87 l-----~~ep~~i~~~d----~----~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~----~-~~i~~v~ 148 (502) + ..+|+++.+++ + ...+..+.+++--+|++...+.+....+-|..|++.++|+. + ..+.+++ T Consensus 80 aiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fhKiid~k~pk~GI~ELr~lD 159 (537) T protein:vir:10 80 TICGNFDDVPISIDLHNLKQSEKIKKLIRSEFDEILRLLDFDNRAYEIFRRWYVDGRLFFHKVIDPKKPRQGLVELRYVD 159 (537) T ss_pred eeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeC Confidence 1 12566677664 2 24556677787779999999999999999999999999853 3 4788899 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-ccCCCcce Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-YEDLEETV 227 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~~~l~~~~ 227 (502) |..+-++..-........- ..+.... +++. ...|++-|.-... .....|-.+|-+.+ |.+ T Consensus 160 Pr~i~~vR~i~~~~~~~~~-----~~~~~~~-v~~~-------~~eyf~ynp~g~~-~~~~~~vkI~~dAI~y~h----- 220 (537) T protein:vir:10 160 PRKIRKVTEYEAKRPEALR-----TQDLNQQ-LTQQ-------SASYFLYNPKGLK-NSTNQGMKIAPDSIAYCH----- 220 (537) T ss_pred CccceeeEeecccCCccce-----EEeccee-eeec-------ccceeeecccccc-ccCCCceeccHhheeeec----- Confidence 9888766431110000000 0000000 0000 0111111100000 00111223332221 110 Q ss_pred eecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH--HHHhhccceee-ech---------HHh- Q lcl|NC_012753. 228 TLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT---------QMI- 294 (502) Q Consensus 228 ~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~---------~~l- 294 (502) .|+ . +...+..+|-|..+......|=-+-+.++ +-.++-..||| |+- .+| T Consensus 221 --SGl--~-------------d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr 283 (537) T protein:vir:10 221 --SGI--Q-------------DLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLR 283 (537) T ss_pred --ccc--e-------------eCCCCeeeeeehhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHH Confidence 011 0 11122334545444443333322222221 11233344454 211 011 Q ss_pred ---c-----cCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhc Q lcl|NC_012753. 295 ---K-----TEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMF 366 (502) Q Consensus 295 ---~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~ 366 (502) . .+-+...++++.++-+..-...|..- --+|+.+.-|+++..--...+ ++-++.+.+.+....++|.+.+ T Consensus 284 ~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLP-RReGgrgTEItTLpGgqnlge-m~DV~YF~kKLy~aLnVP~SRl 361 (537) T protein:vir:10 284 EVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLP-RREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRL 361 (537) T ss_pred HHHHhccceEEEeccCceecccchhhhhhhhhccc-ccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCcccc Confidence 0 01122233333332222222222211 113333344666554322222 2446667778888888887777 Q ss_pred cccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCCHHHH Q lcl|NC_012753. 367 SFDGKS-MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNAE 443 (502) Q Consensus 367 ~~~~~~-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~~ 443 (502) +.+++. ..-++||....-....-+.+++..|..-+.++++.-|.+-. ++...-+.. ..+.++|...--..+..+ T Consensus 362 ~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKg---iit~eeW~~i~~~I~~~f~~Dn~f~ElKe 438 (537) T protein:vir:10 362 ETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKG---ICSIEEWEEMKEHIQFDFIADNYFTELKE 438 (537) T ss_pred CCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---CCCHHHHHHHhhcceEEeeecchHHHHHH Confidence 765432 22345665555555667888888888888888887665432 222221111 357788866544444444 Q ss_pred HHHHHHHH---h------cCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhccc-CCCCCccccCCC-CC Q lcl|NC_012753. 444 FDYWSKMV---A------AGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVST-DSFRTSEEVDIY-GE 502 (502) Q Consensus 444 ~~~~~~~~---~------~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~-~~~~~~~~~~~~-g~ 502 (502) ++.+..-. + +-..|.+++.++..-.||+|.+++.+.|++|....- +.+....+++++ |+ T Consensus 439 ~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~~~~~~~~~~~~ 508 (537) T protein:vir:10 439 IEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQAMQAMEMGIGD 508 (537) T ss_pred HHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcccccccccCCCC Confidence 44433211 1 114589988778889999999999999999876432 222222233332 11 No 157 >protein:vir:78942 Length: 510 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522822;genbank:gi:158345057;genbank:GeneID:5687432 Probab=97.97 E-value=8.9e-06 Score=48.28 Aligned_cols=431 Identities=9% Similarity=0.023 Sum_probs=164.7 Q ss_pred HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh Q lcl|NC_012753. 7 IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL 86 (502) Q Consensus 7 ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~ 86 (502) +|..+++.+.+. + +-.....|+.++.=-.+-+......+......+.--..+...++.+|+. T Consensus 1 mk~~~~~~~~~l------------k------r~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~ 62 (510) T protein:vir:78 1 MKSTAAMLWEKL------------R------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAK 62 (510) T ss_pred ChhHHHHHHHHH------------h------ccchHHHHHHHHHhhccccccCCCCcccccccCcccchHHHHHHHHHHH Confidence 444444332211 0 0112233443332111111111111111111122224556677777766 Q ss_pred hhcC--cce-----EeeCCH-------------HHHHH-------HHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Q lcl|NC_012753. 87 VFNE--QAT-----IRVDNE-------------VADAF-------INETLKNDKFSKNFERYLESCLALGGLAMRPYIDG 139 (502) Q Consensus 87 l~~e--p~~-----i~~~d~-------------~~~e~-------l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~ 139 (502) |.+- ||. +.+.+. .+.++ +...+..++|...+.++..+....|.+.+ |.++ T Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~~ 140 (510) T protein:vir:78 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNALL--YRNS 140 (510) T ss_pred HHHhhcCCCCcccccCCChHHhhhcccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeEEE--EEeC Confidence 6652 221 333332 12333 33456678999999999999999998654 5555 Q ss_pred CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee-------------CCCceEEEEEEEEEEeCCeEEEEEEEEecCC Q lcl|NC_012753. 140 DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE-------------GQKVKYYSLIEFHEWNKETYTISNELYESES 206 (502) Q Consensus 140 ~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~-------------~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~ 206 (502) +.-++..+|-.+++ +..|..+....+|........ ..+...+. ...|-|.++.-.+ T Consensus 141 ~~~~~~~~pl~~y~-v~~d~~G~vd~i~rr~~~t~~~l~~~~~~~~~~~~~~~~~~~----------~v~v~~~V~~~~~ 209 (510) T protein:vir:78 141 DEATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKDLDDVYKQDLMRAGRNLSGSG----------SVDLYTHVQRRKG 209 (510) T ss_pred CCCeEEEEEcceeE-EeeCCCcCeeEEEeeeeccHHHHHHHhhHHhhhhhhccCCCc----------eEEEEEEEEeecC Confidence 44356677777744 456655554455433221100 00001111 1122222222111 Q ss_pred ccccCceeeccccccCCCcc--eeecC--CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-Hh Q lcl|NC_012753. 207 KTIIGQRVPLSTLYEDLEET--VTLNG--LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VK 281 (502) Q Consensus 207 ~~~lG~~v~l~~~~~~l~~~--~~~~~--~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~ 281 (502) . ..|..++|..+... ....+ ...-||++++-+ ...++.||+|--..+.+-+..|+..--....- .. T Consensus 210 ~-----~~~~~sv~~e~dg~~i~~~~~~~~~e~P~~~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~ 280 (510) T protein:vir:78 210 T-----AMDYAEMYHEIDGVRVGETGRWPIHLCPYIVPTWN----LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELE 280 (510) T ss_pred C-----CCcEEEEEEEecCeeeccccccccccCCeeeeeee----ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 0 01111111101000 00111 122355554432 23467899999999999999999765544443 23 Q ss_pred hccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec----cccc-hHHHHHHHHHHHHHHH Q lcl|NC_012753. 282 MGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT----TDIR-SDDYIKAINKGLSLFE 356 (502) Q Consensus 282 ~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~----~~ir-~e~~~~~l~~~l~~i~ 356 (502) ..+....|+.+.+ .+.....+ .....+.. .. ...++.++ .++. +.+-++.+..-|+... T Consensus 281 a~~~~~lv~p~g~-----~~~~~l~~-----~~~g~~v~---g~---~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~aF 344 (510) T protein:vir:78 281 SLEVLNLVDEAKG-----AVVDDYQD-----AEMGDYVP---GG---AEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAF 344 (510) T ss_pred hhcCCcccCCccc-----cchhhhcc-----CCCceeec---CC---cccccccccCcccchHHHHHHHHHHHHHHHHHH Confidence 3343334432211 11010000 00000000 00 11122221 1221 1223333333333332 Q ss_pred HhcCCChhhccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCC Q lcl|NC_012753. 357 MQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDG 435 (502) Q Consensus 357 ~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~ 435 (502) +. .+ ....+...|||||....+.+.+..+- ..+.-...|..|++-.+.++.-.++.+-.........|++-.. T Consensus 345 ~~-~l-----~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~is~ 418 (510) T protein:vir:78 345 MY-GA-----NQRDAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGLPA 418 (510) T ss_pred hh-cc-----ccCCCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccceeeecccH Confidence 21 11 11233346999999887776665554 2232333444444444444332222221111111222333221 Q ss_pred ccCCHH-HHHHHHHHHHh-cCC-------CCHHHH---HHhcCCC-------CHHHHHHHHHHHHHhh--hc----c--- Q lcl|NC_012753. 436 VFTDRN-AEFDYWSKMVA-AGF-------APKTMA---IEKTLNV-------TKEQAQEIYQKINDET--MV----S--- 487 (502) Q Consensus 436 i~~d~~-~~~~~~~~~~~-~Gi-------~S~et~---l~~~~~~-------~deea~~el~ri~~E~--~~----~--- 487 (502) +-.... +.+....+.++ .|- +....+ +....|+ |+||++++.++.++.. ++ + T Consensus 419 Laraq~~~~l~~~~q~l~~~~~~~q~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~~q~~~~~~~~~a~~~ 498 (510) T protein:vir:78 419 LSRSAAVQSMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEEQRRQAAQAQAAQETLLE 498 (510) T ss_pred HHHHHHHHHHHHHHHHHHHhcChhhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 00111111111 111 122222 2334455 4566655544321111 11 0 Q ss_pred cCCCCCccccCC Q lcl|NC_012753. 488 TDSFRTSEEVDI 499 (502) Q Consensus 488 ~~~~~~~~~~~~ 499 (502) ........-+|+ T Consensus 499 ~~~~~~~~~~g~ 510 (510) T protein:vir:78 499 GASDMTNALAGV 510 (510) T ss_pred hhhhhcccCCCC Confidence 111112222223 No 158 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=97.96 E-value=9.1e-06 Score=48.23 Aligned_cols=398 Identities=12% Similarity=0.086 Sum_probs=162.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCC-------HHHHHHHH--HHHHHhcCCCCccccccCCCcccc-ccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAIS-------PEEYNRIM--DNLRYFAGDFDSVTYRDSNGSQVK-RDF 70 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~-------~~~~~~i~--~~~~~Y~g~~~~~~~~~~~~~~~~-~~~ 70 (502) |++|+++++. +....-....+ ....++. .+....+. .+..|..+. ...+.... .+- T Consensus 1 Mgl~d~~r~~--~~~~~~~~~~~-----~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~-------~~~g~~v~~~~a 66 (431) T protein:vir:10 1 MGLFDFIRRE--KQPEAQARPHV-----EPSFQASTPTTSIPGETFEGLDDPRLKEYIRRG-------ELNGGTGRETRA 66 (431) T ss_pred CcchhhhhcC--ccccccccccc-----ccccccccccccccccccccccchHHHHhhccC-------ccCcceechhhh Confidence 9999997752 11000000000 0000000 00000000 011222111 01111111 111 Q ss_pred eecchHHHHHHHHhhhhhcCcceE-eeCCH---HHHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCCEEEEEEEeCCc Q lcl|NC_012753. 71 NHLPIGRTASKKVASLVFNEQATI-RVDNE---VADAFINETLKN--DK---FSKNFERYLESCLALGGLAMRPYIDGDQ 141 (502) Q Consensus 71 ~~~n~~k~iv~~~a~~l~~ep~~i-~~~d~---~~~e~l~~~~~~--~~---f~~~~~~~~~~~~~~G~~~~~~~~d~~~ 141 (502) +.+.--...|+..|+-+-+=|+.+ ..++. ..+.-+..+|.. |. -..-...++...+..|.+|+.+..|+|+ T Consensus 67 l~~~~V~~ci~~Ia~~iA~lp~~v~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~g~ 146 (431) T protein:vir:10 67 LRNMAVLRCVTLISGTIGMLPMNLISSDDSKQVLTDDPAHRLLKYKPNDWQTPMEFKSLMQLRALLDGESMARIVWSGNR 146 (431) T ss_pred hccHHHHHHHHHHHHhhccCceEEEEecCceeeeccchHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCc Confidence 222233455666666665556654 21211 112234444432 21 2233445567777889999998888765 Q ss_pred e-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccc Q lcl|NC_012753. 142 I-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLY 220 (502) Q Consensus 142 ~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~ 220 (502) + .+-.++|..+-+...+.+.+ +++++.. + |..+.+. T Consensus 147 ~~~L~pl~~~~v~~~~~~~~~~-----~y~~~~~-----------------~------------------g~~~~~~--- 183 (431) T protein:vir:10 147 PIRLIPMDRGSAKGRLTSTWQI-----VYDYTTP-----------------T------------------GDKIELP--- 183 (431) T ss_pred eEEEEEEcCceeEEEEcCCCeE-----EEEEEeC-----------------C------------------ceEEEEc--- Confidence 4 45556666655432221111 0010000 0 1111000 Q ss_pred cCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-HhhccceeeechHHhccCCC Q lcl|NC_012753. 221 EDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPTQMIKTEYD 299 (502) Q Consensus 221 ~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~~~l~~~~~ 299 (502) +.+ .++|+.+.. +...|+|.+.-+...|.-.. ....+... |..+...=.| |..... T Consensus 184 ---~~d----------ViHir~~~~-----dg~~G~spi~~~~~~i~~~~-~~~~~~~~~f~ng~~p~gi----l~~~~~ 240 (431) T protein:vir:10 184 ---ARE----------VFHLRDLSI-----DGVSGVSRVKLSGNALELAE-QAERAASRTFRTGVMAGGA----IEVPKE 240 (431) T ss_pred ---hhh----------EEEecCcCC-----CCcccccHHHHHHHHHHHHH-HHHHHHHHHHhccCCccEE----EecCCC Confidence 000 123432211 23568888877776665333 33334333 4544332222 322211 Q ss_pred CCCcccCcccccccc-chhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc- Q lcl|NC_012753. 300 TNGEKVTVKREFETG-HNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM- 373 (502) Q Consensus 300 ~~g~~~~~~~~~~~~-~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~- 373 (502) -+..... .+... ...+...... --+.+.-++.++.....-++.+..+....+|+...|+++..++...++. T Consensus 241 ls~e~~~---~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~ 317 (431) T protein:vir:10 241 LSDNAYG---RMKASVQENHTGSENAGSWMLLEEGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWG 317 (431) T ss_pred CCHHHHH---HHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCcc Confidence 1110000 00000 0111100000 0011223555555555667888888888999999999999999765432 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhc Q lcl|NC_012753. 374 KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAA 453 (502) Q Consensus 374 ~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~ 453 (502) ++..+....+ +..+..-..+.++..|.. .++.........+.++++.-+-.|..+.++...+++.+ T Consensus 318 sn~eq~~~~f--~~~tL~P~~~~ie~~ln~------------~Ll~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~ 383 (431) T protein:vir:10 318 SGIEQLAIFF--IQYGLSHWFVSWEQAAAR------------AFLPEKMLGQRQFKFNEGALLRGTLNDQAAFFSKALGA 383 (431) T ss_pred ccHHHHHHHH--HHHHHHHHHHHHHHHHHh------------hccChhhcCCceEEEechhhhccCHHHHHHHHHHHHhc Confidence 2222222111 111222222223322222 11111111123455555665777888989999898877 Q ss_pred CC----CCHHHHHHhc--CCCCHHHHHHHHHHHHHhhhcccCCCCCcccc Q lcl|NC_012753. 454 GF----APKTMAIEKT--LNVTKEQAQEIYQKINDETMVSTDSFRTSEEV 497 (502) Q Consensus 454 Gi----~S~et~l~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~ 497 (502) |+ |+.-++++.. +++.++..++-..-.-.....+.++. |... T Consensus 384 G~~~g~lT~NE~R~~~gl~p~~~~~gD~~~~p~n~~~~~~~~~~--p~~~ 431 (431) T protein:vir:10 384 GGQSPWMKQNEVREMLDLPRADDPVADQLRNPMTQKQKGSGDEP--PATT 431 (431) T ss_pred ccccCccCHHHHHHHhCCCCCCCccccceecccccccCCCCCCC--CCCC Confidence 76 8888765542 33433323222111111111111222 2222 No 159 >protein:vir:2198 Length: 536 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041995;swissprot:sw:p03728;genbank:gi:9627467;goa:P03728;uniprot:P03728;genbank:GeneID:1261033 Probab=97.93 E-value=1.1e-05 Score=47.87 Aligned_cols=442 Identities=11% Similarity=0.086 Sum_probs=170.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |.= ++ .+ +....+++ .+. .+..++-.....|+.++.=-.+-+......+......++--+.+...+ T Consensus 1 m~~-~~-------~~--~~~~~~~~---r~~-~lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~dst~~~a~ 66 (536) T protein:vir:21 1 MAE-KR-------TG--LAEDGAKS---VYE-RLKNDRAPYETRAQNCAQYTIPSLFPKDSDNASTDYQTPWQAVGARGL 66 (536) T ss_pred Ccc-hh-------hc--hhHHHHHH---HHH-HHHHHhhHHHHHHHHHHHHhcccccCCCCCcccccccccccccHHHHH Confidence 221 00 00 00000100 000 122223223344444332211111111111222222334445666777 Q ss_pred HHHhhhhhcC--cce--E--eeCCH-------------HHH-------HHHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 81 KKVASLVFNE--QAT--I--RVDNE-------------VAD-------AFINETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 81 ~~~a~~l~~e--p~~--i--~~~d~-------------~~~-------e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) +.+|+.|.+- |+. | .+.+. .++ +.+...+..++|...+.++.++..+.|.+++. T Consensus 67 ~~Laa~l~~~ltP~~~WFrl~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ly 146 (536) T protein:vir:21 67 NNLASKLMLALFPMQTWMRLTISEYEAKQLLSDPDGLAKVDEGLSMVERIIMNYIESNSYRVTLFEALKQLVVAGNVLLY 146 (536) T ss_pred HHHHHHHHHhhcCCCcccccccChhhhhccccchhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcEeEE Confidence 7777666541 211 2 22221 122 24445677789999999999999999988764 Q ss_pred EEEeCC-ce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe-------------e------CCCceEEEEEEEEEEeCC Q lcl|NC_012753. 135 PYIDGD-QI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKT-------------E------GQKVKYYSLIEFHEWNKE 193 (502) Q Consensus 135 ~~~d~~-~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~-------------~------~~~~~~yt~~E~h~~~~~ 193 (502) +--+++ ++ .+..+|-.+++- ..|..+....+|....... . .+...+|+.++.. .++. T Consensus 147 ~~e~~~~~~~~f~~~pl~~~~v-~~d~~G~vd~i~r~~~~t~~~l~~~fg~~~~~~~~~~~~~~~v~v~~~v~~~-~~~~ 224 (536) T protein:vir:21 147 LPEPEGSNYNPMKLYRLSSYVV-QRDAFGNVLQMVTRDQIAFGALPEDIRKAVEGQGGEKKADETIDVYTHIYLD-EDSG 224 (536) T ss_pred EeeCCCCceeeEEEEEcCeEEE-eeCCCCCeeEEeeeeeccHHHHHHhhhhhhcccccccccccceeEEEEEEEe-cCCC Confidence 433332 33 467778777664 4555554445543221110 0 0111223332211 1112 Q ss_pred eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHH Q lcl|NC_012753. 194 TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTY 273 (502) Q Consensus 194 ~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~ 273 (502) .+.+ |..-+ |.+|+..+ ...++..-||++++-+ ...++.||+|-...+.+-+..|+..- T Consensus 225 ~~~~----~~e~~----g~~v~~~~---------g~~~f~~~P~i~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l~ 283 (536) T protein:vir:21 225 EYLR----YEEVE----GMEVQGSD---------GTYPKEACPYIPIRMV----RLDGESYGRSYIEEYLGDLRSLENLQ 283 (536) T ss_pred cEEE----EeccC----Ceeecccc---------CccccccCCeeeeeee----ecCCCccccchHHHHHHHHHHHHHHH Confidence 2211 11101 22221110 1112333455554432 23477899999999999999999765 Q ss_pred HHHHHH-Hhhccceeeech-HHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHH Q lcl|NC_012753. 274 DEFMWE-VKMGQRRVAVPT-QMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKG 351 (502) Q Consensus 274 S~~~~~-~~~~~~~i~v~~-~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~ 351 (502) -....- ....+....|+. .+++.. ... ......+.. ...++ .++..++...+...-.+.++.+ T Consensus 284 ~~~l~~~~~a~~~~~lv~p~g~~~~~------~~~-----~~~~g~~v~---g~~~~-v~~~~~~~~~~~~~~~~~i~~~ 348 (536) T protein:vir:21 284 EAIVKMSMISSKVIGLVNPAGITQPR------RLT-----KAQTGDFVT---GRPED-ISFLQLEKQADFTVAKAVSDAI 348 (536) T ss_pred HHHHHHHHHHhcCCcccCcccccchh------hhc-----cCCCcceec---CCccc-ceeeeccccccchHHHHHHHHH Confidence 555442 233443444422 221111 000 000000110 01111 1111122211222222334444 Q ss_pred HHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccc Q lcl|NC_012753. 352 LSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS----IATLVEKSLKELVISILELAKVYNLYTGEIPTMDE 427 (502) Q Consensus 352 l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~----~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~ 427 (502) .+.|....=+. .+...++...|||||....+.+.+..+- ++.+|- ..|++-++.+..-.+..+...... T Consensus 349 ~~rI~~af~~~--~l~~~~~~r~TAtEV~~r~~E~~~~LG~v~~rl~~Ell---~Pli~r~~~il~r~g~lP~~p~~~-- 421 (536) T protein:vir:21 349 EARLSFAFMLN--SAVQRTGERVTAEEIRYVASELEDTLGGVYSILSQELQ---LPLVRVLLKQLQATQQIPELPKEA-- 421 (536) T ss_pred HHHHHHHHhhh--hcccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHH---HHHHHHHHHHHHhCCCCCCCChhh-- Confidence 44443322111 1222334446999999888777775544 444443 334444444443333333222222 Q ss_pred eEEEeCCCcc-CCHHHHHHHHHHHHh--cCC--------CCHHHHH---HhcCCC-------CHHHHHHHHHHHHHhhhc Q lcl|NC_012753. 428 VSVDLDDGVF-TDRNAEFDYWSKMVA--AGF--------APKTMAI---EKTLNV-------TKEQAQEIYQKINDETMV 486 (502) Q Consensus 428 i~v~f~d~i~-~d~~~~~~~~~~~~~--~Gi--------~S~et~l---~~~~~~-------~deea~~el~ri~~E~~~ 486 (502) +.+++--++. ......++......+ +++ +....++ ....|+ +++|++++.++-.++++. T Consensus 422 v~~~~vs~l~~l~r~~~~~~l~~~~~~la~~~Pe~ld~~id~d~~~~~~a~~~Gv~p~~~irt~eev~~~r~q~~~~~~~ 501 (536) T protein:vir:21 422 VEPTISTGLEAIGRGQDLDKLERCVTAWAALAPMRDDPDINLAMIKLRIANAIGIDTSGILLTEEQKQQKMAQQSMQMGM 501 (536) T ss_pred ccceEEecHHHHHHHHHHHHHHHHHHHHHhhchhhhcccCCHHHHHHHHHHHcCCChhhhcCCHHHHHHHHHHHHHHHHH Confidence 2333322111 111112222221111 121 2222222 223454 445554443321111110 Q ss_pred ----c-cCCCCCccccCCCCC Q lcl|NC_012753. 487 ----S-TDSFRTSEEVDIYGE 502 (502) Q Consensus 487 ----~-~~~~~~~~~~~~~g~ 502 (502) . ... .....+-..+| T Consensus 502 ~~~a~~~~~-~~~~~~~~~~~ 521 (536) T protein:vir:21 502 DNGAAALAQ-GMAAQATASPE 521 (536) T ss_pred HHHHHHHHH-HHHHHHhcChh Confidence 0 000 00111111222 No 160 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=97.92 E-value=1.1e-05 Score=47.78 Aligned_cols=382 Identities=13% Similarity=0.087 Sum_probs=164.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHH-hcCCCCccccccCCCccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRY-FAGDFDSVTYRDSNGSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~-Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~i 79 (502) |+||+++ +++.... ....... ... .....+ +.|.. .. ...-+.+.--... T Consensus 1 Mgl~~~~---f~~~~~~---~~~~~~~---~~~---------~~~~~~~~~g~~-------v~----~~~al~~~~v~~~ 51 (409) T protein:vir:84 1 MSLFTRI---FSGPSEE---RTLTKIS---GIP---------SPAEDWAMHGDR-------PG----ANSAMTLGAFYAC 51 (409) T ss_pred Cchhhhh---hcCCCcc---ccccccc---ccc---------cccchhhccCcc-------cc----hhhhhccHHHHHH Confidence 9999965 3321000 0000000 000 000011 11110 00 0111222223445 Q ss_pred HHHHhhhhhcCcceEeeC--C-HHHHHHHHHHHh-----hccHHHHHHHHHHHHhhcCCEEEEEEE-eC-Cc-eEEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVD--N-EVADAFINETLK-----NDKFSKNFERYLESCLALGGLAMRPYI-DG-DQ-IRVSFVQ 148 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~--d-~~~~e~l~~~~~-----~~~f~~~~~~~~~~~~~~G~~~~~~~~-d~-~~-~~i~~v~ 148 (502) |+.+|+-+.+=|+.+--. + +....-+.++|. .-.....+..++...+..|.+|+.+.+ +. |. ..+..++ T Consensus 52 v~~ia~~iA~lp~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~l~l~Gn~~~~i~~~~~~g~~~~L~~l~ 131 (409) T protein:vir:84 52 VTLLADTVASLSIDAYRKKDNVRIPVSPAPKLLESTPYPGLTWFDWLWMLMESLAVTGNAFGYISARDEANRPTAIMPIH 131 (409) T ss_pred HHHHHHhhhhCceEEEEecCCcccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEECCCCceEEEEEEc Confidence 666666665555543211 1 111112333332 123344555667778888998877654 33 44 3566677 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+.+-+....... +.++.. .|.. + |..++..+ T Consensus 132 p~~v~v~~~~~~~-----------------~~~~~~----------------~~~~-~----g~~~~~~d---------- 163 (409) T protein:vir:84 132 PDCIHVTDAKDED-----------------GDWIEP----------------VYRI-D----GKVVPNHR---------- 163 (409) T ss_pred CceeEEEEcCCCc-----------------ceEEEE----------------EecC-C----ceEEchhh---------- Confidence 7665443211111 111100 0100 0 11111110 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc-eeeechHHhccCCCCCCcccCc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQMIKTEYDTNGEKVTV 307 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~~l~~~~~~~g~~~~~ 307 (502) .++++.+.. .+..+|+|.+..+...++....+.....+-|..+.. ..+ |......+..... T Consensus 164 --------vih~~~~~~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi-----l~~~~~l~~e~~~- 225 (409) T protein:vir:84 164 --------IMHIKRYPV----AGCALGMSPIEKAASAIGLGLAAERYGLRWFRDSANPSGI-----LSSDADLTPDQVK- 225 (409) T ss_pred --------EEEecCCCC----CcccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEE-----EecCCCCCHHHHH- Confidence 233432211 133478998887777776665544444444554333 222 2221111110000 Q ss_pred cccccccchhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHH Q lcl|NC_012753. 308 KREFETGHNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQ 383 (502) Q Consensus 308 ~~~~~~~~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~ 383 (502) .+ ........... --+.+.-++.++......++.+..+....+|+...|+++..+|+...+..++..+.... T Consensus 226 --~~--~~~~~~~~~n~g~~~vl~~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~ 301 (409) T protein:vir:84 226 --QT--QKQWIQSHHNRRLPAVMSAGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQG 301 (409) T ss_pred --HH--HHHHHHHhccCCCeeecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHH Confidence 00 00000111000 00112235556655566778888888899999999999999987655443232222111 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 384 SDT-YQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAI 462 (502) Q Consensus 384 ~~l-~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 462 (502) ... ..++.-..+.++.+|... +. ....+.++++.-+-.|..+.++...+++.+|+++.-+++ T Consensus 302 ~~f~~~~l~P~~~~ie~~l~~~------------L~-----~g~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R 364 (409) T protein:vir:84 302 INFVRHTLLPWLRCIEQALDTF------------LP-----RGQFVKFNVDGLMRGDVTARFTAYQMGLQNGIWSVNEVR 364 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHh------------cc-----CCCeEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 111 122222222333333321 11 122466667776778899999999999999999999876 Q ss_pred HhcCCCCH-HHHHHHH-----HHH---HHhhhcccCCCCCccccCC Q lcl|NC_012753. 463 EKTLNVTK-EQAQEIY-----QKI---NDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 463 ~~~~~~~d-eea~~el-----~ri---~~E~~~~~~~~~~~~~~~~ 499 (502) +.. |+.. +..++-+ ..+ ...+....+...+..+++= T Consensus 365 ~~~-g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 365 AWE-DAPPIPEGDIHLQPMNFVPLGYVPPEEPAQEPQPNSATEGNK 409 (409) T ss_pred HHh-CCCCCCCcceeeecccccccccCCccccCcCCCCCCccCCCC Confidence 653 4332 1111111 111 1122222222222222222 No 161 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=97.91 E-value=1.1e-05 Score=47.69 Aligned_cols=372 Identities=10% Similarity=0.057 Sum_probs=152.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||++.. +.++- .. . ... ..+ ..+.....+... .. ... .+..++..--..+| T Consensus 1 Mg~~~~~~-~~~~~-~~---~---~~~-----~~~-------~~~~~~~~~~~~-~~--~v~----~~~al~~~~v~~~i 53 (385) T protein:vir:10 1 MGLLTPRN-FNKRK-AK---N---MVY-----PSN-------PAFFTTTVGGMQ-LS--YVS----ALSALQNTNVYSVI 53 (385) T ss_pred Cccccchh-ccccc-cc---c---ccc-----ccc-------hhhhhhhccccC-cc--ccC----HHHhhccHHHHHHH Confidence 99998742 21110 00 0 000 000 111122222100 00 001 11122233335567 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCe--EEEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATV--FFPLQAN 158 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~--~~Pi~~d 158 (502) +.+|+-+..=|+ .+.+......|++-...-....-...++...+..|.+|+.+..+. +..++++. +-+. T Consensus 54 ~~ia~~ia~~p~--~v~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~r~~----~~~~p~~~~~v~~~--- 124 (385) T protein:vir:10 54 NRIASDVASAHF--KTENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLVGQN----LEHIPNSDVQINYL--- 124 (385) T ss_pred HHHHHHHhhCce--eeeccchhhhhhcCCCCCCHHHHHHHHHHHhhhcCCeEEEEEcCc----eeEeecCCceEEEE--- Confidence 777777666564 454444444444322111233334445556667888888765332 11222221 1111 Q ss_pred CCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEE Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFT 238 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~ 238 (502) .+..+..|...+ .++...+ .++- .+ .+ T Consensus 125 ----------------~~~~~~~~~~~~----~~~~~~~---------------~~~~--------~e----------ii 151 (385) T protein:vir:10 125 ----------------PGNMGIVYTVLE----SNDRPQM---------------VLRQ--------DQ----------ML 151 (385) T ss_pred ----------------EcCCceEEEEEE----cCCceEE---------------EEcc--------cc----------EE Confidence 111111111000 0000000 0100 00 12 Q ss_pred EecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc-chh Q lcl|NC_012753. 239 YLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG-HNV 317 (502) Q Consensus 239 ~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~ 317 (502) +|+....+ ..+...|+|.+..+...|+....+-.-..+-|..+...-.+ +.........+-. ..+... ... T Consensus 152 hik~~~~~--~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gi----l~~~~~~~~~e~~--~~~~~~~~~~ 223 (385) T protein:vir:10 152 HFRLMPDP--QYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGK----LTISNYLSDGKDL--ESAREEFEKA 223 (385) T ss_pred EeccCCCC--cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE----EEeCCCCCCHHHH--HHHHHHHHHH Confidence 33321111 12334689998888888866554444344445554332222 2221111000000 000000 011 Q ss_pred hccccC---CCCccccceeeeccccchHHHH-HHHHHHHHHHHHhcCCChhhccccccccc---cHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDS---GDMDKGIGITDLTTDIRSDDYI-KAINKGLSLFEMQLGVSTGMFSFDGKSMK---TATEVVSEQSDTYQMR 390 (502) Q Consensus 318 ~~~~~~---~~~~~~~~i~~~~~~ir~e~~~-~~l~~~l~~i~~~~g~s~~~~~~~~~~~~---tAtei~~~~~~l~~~~ 390 (502) +..-+. .--+.+.-++.++......++. +..+...++|+...|+|+..++....+.. +....+..+ T Consensus 224 ~~~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~------- 296 (385) T protein:vir:10 224 NTGDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATY------- 296 (385) T ss_pred hCccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHH------- Confidence 100000 0001122355555555555653 67777789999999999999986433322 222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh--cCCC Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK--TLNV 468 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~--~~~~ 468 (502) ..+|..++..|..-.+. .+.. ..+.++++.-+..|..+.++...+++.+|+|+.-+++.. +.|+ T Consensus 297 -------~~~l~P~~~~ie~~l~~-~l~~------~~~~f~~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~p~ 362 (385) T protein:vir:10 297 -------LANLNSYVNPIVDELRL-KMNA------PDLELDIKDMLDVDDSALINQVSNLAKSGVLGAEQAQFILTRSGF 362 (385) T ss_pred -------HHHHHHHHHHHHHHHHH-hhCC------ceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcc Confidence 11222222222111111 1111 236666666677899999999999999999999887654 3455 Q ss_pred CHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 469 TKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 469 ~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .++. +.+....... .. +|=-|+ T Consensus 363 p~~~----~~~~~~~~~~-------~~-~g~~~d 384 (385) T protein:vir:10 363 LPDN----LPEFKPLTTQ-------VK-GGDEGD 384 (385) T ss_pred CCCC----CccccCcccc-------cC-CCCCCC Confidence 4322 1111111111 01 111111 No 162 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=97.89 E-value=1.2e-05 Score=47.51 Aligned_cols=378 Identities=11% Similarity=0.022 Sum_probs=159.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||++++..-++... ... ..+..++.+.... ...... ...-...+....+| T Consensus 1 Mg~f~~~~~~~~~~~~----~~~-------------------~~~~~~~~~~~~~-~~~~~~----~~~~~~~~~v~~~i 52 (406) T protein:vir:95 1 MGLFDRWRRTKRKSKI----RAD-------------------TGYVGLFMSGEDV-SFLVPG----YVRLSDNPEVRMAV 52 (406) T ss_pred Ccchhhhccccccccc----ccc-------------------chhhhhhccCccc-CccccC----HHHHhhcHHHHHHH Confidence 9999876544322100 000 0111222221110 000000 00112234446677 Q ss_pred HHHhhhhhcCcceEe-eCCH---HHHHHHHHHHh-----hccHHHHHHHHHHHHhhcCCEEEE--EEEeC-Cce-EEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATIR-VDNE---VADAFINETLK-----NDKFSKNFERYLESCLALGGLAMR--PYIDG-DQI-RVSFV 147 (502) Q Consensus 81 ~~~a~~l~~ep~~i~-~~d~---~~~e~l~~~~~-----~~~f~~~~~~~~~~~~~~G~~~~~--~~~d~-~~~-~i~~v 147 (502) +..|+-+..=|..+- .+++ .....+...|. .......+..++...+..|.+++. +-.+. |.+ .+..+ T Consensus 53 ~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~~l~~~PN~~~t~~~f~~~~~~~~ll~g~g~a~~~~~~~~~g~~~~l~~i 132 (406) T protein:vir:95 53 HKIADLISSMTIYLMQNTEDGDIRIRNELSRKIDITPYSLMTRKSWMYNIVYTMLLDGEGNSVVFPKYTADGLIDELVPL 132 (406) T ss_pred HHHHHhhccCceEEEEecCCcceeecchHHHHHhhccCCCCCHHHHHHHHHHHHHhcCCceEEEEEEECCCCcEEEEEEE Confidence 777777766665541 1111 11111222221 123344555566666767765443 33443 333 45556 Q ss_pred cCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcce Q lcl|NC_012753. 148 QATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETV 227 (502) Q Consensus 148 ~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~ 227 (502) +|..+-++..+ + .|.+.+ . |..++.. T Consensus 133 ~~~~v~~~~~~-~---------------------------------~~~~~~---~-------~~~~~~~---------- 158 (406) T protein:vir:95 133 TPSKVNFLDTP-D---------------------------------GYQVLY---G-------GQTFNYD---------- 158 (406) T ss_pred cCceeEEEEcC-C---------------------------------eEEEEe---c-------cEEEchh---------- Confidence 66555443211 1 011110 0 0111100 Q ss_pred eecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCc Q lcl|NC_012753. 228 TLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTV 307 (502) Q Consensus 228 ~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~ 307 (502) -.++|+.+. .......|+|.+..+...++....+.....+-+..+...-.+ +......+.... T Consensus 159 --------evih~~~~~---~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i----l~~~~~l~~e~~-- 221 (406) T protein:vir:95 159 --------EVLHFIYNP---DPERPYIGRGYRVVLKDIADNLKQATATKKSFMSGKYMPSLI----VKVDAATAELSS-- 221 (406) T ss_pred --------HEEEeeccC---CCCCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE----EEeCCCCCHHHH-- Confidence 013343211 111234688988888777776665544333444544443222 222111111000 Q ss_pred ccccccc-chhhcccc-------CCCCccccceeeec-cccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH Q lcl|NC_012753. 308 KREFETG-HNVYEQFD-------SGDMDKGIGITDLT-TDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE 378 (502) Q Consensus 308 ~~~~~~~-~~~~~~~~-------~~~~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte 378 (502) ..+... ...+.... ...+. .-++.+. .....-++.+..+....+|+...|+|+..+|..... + T Consensus 222 -~~~~~~~~~~~~g~~n~~~~~v~~~~~--~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~-----~ 293 (406) T protein:vir:95 222 -EEGRNAVFKKYLQATEAGQPWIIPAEL--LEVEQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLLGIGEFN-----R 293 (406) T ss_pred -HHHHHHHHHHhccccccCCceeecCCC--ccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCch-----H Confidence 000000 01111110 11111 1112221 123345677888888899999999999998743221 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPK 458 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~ 458 (502) - .++ ..++.+|..+++.|-...+. .++.. ....+.+++++-+..|..+.++...+++.+|+++. T Consensus 294 ~--~~~----------~~~~~~l~P~~~~ie~~l~~-~l~~~---~~~~~~fd~~~l~~~d~~~~~~~~~~l~~~G~~t~ 357 (406) T protein:vir:95 294 D--EYN----------NFINSTILPIAKGIEQELTR-KLLIS---PDLYFKFNPRSLYAYDLKELAEVGSNMYVRGIMEG 357 (406) T ss_pred H--HHH----------HHHHHHHHHHHHHHHHHHHH-hcCCC---CCcEEEeechhhhcCCHHHHHHHHHHHHhCCCcCH Confidence 1 111 12344455554444433221 11211 12346666666677888999999999999999999 Q ss_pred HHHHHhcCCCCHH-HHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 459 TMAIEKTLNVTKE-QAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 459 et~l~~~~~~~de-ea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .++++. .|++.- ..++-+....--.....+......+++-.|+ T Consensus 358 NE~R~~-~gl~p~~~gd~~~~~~n~~~~~~~~~~~~~k~g~~~~~ 401 (406) T protein:vir:95 358 NEVRDW-LGLSPKEGLSELVILENYIPLDKIGDQSKLKGGDNSGA 401 (406) T ss_pred HHHHHH-hCCCCCCCcceeeeccCccchhhcccccccCCCCCCCC Confidence 997665 355331 1111110000000001111122233333333 No 163 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=97.88 E-value=1.3e-05 Score=47.40 Aligned_cols=397 Identities=11% Similarity=0.098 Sum_probs=164.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |++++++|.++.+.. ..++..-.... .++ ..+..++..- . ..+.... +.-+.+.---.. T Consensus 7 ~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~~-------~~~~~~~~~~----s---~~g~~v~~~~al~~~~V~~~ 66 (432) T protein:vir:10 7 LGLLGQLKAMFVPPD----PVDIGGGQTFT--PVN-------ATARDLGIII----S---DTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred cchhhhhHhhcCCcc----ccccccccccc--cCc-------chhhhhcccc----c---ccCcccchhhhhcchHHHHH Confidence 999999999986421 11111000000 000 0000111000 0 0111100 111222222334 Q ss_pred HHHHhhhhhcCcceEee--CC---HHHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRV--DN---EVADAFINETLK--ND---KFSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQ 148 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~--~d---~~~~e~l~~~~~--~~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~ 148 (502) |+..|+-+-+=|+.+-- ++ +..+.-+..+|. -| .....+..++...+..|.+|+.+..++|++ .+..++ T Consensus 67 i~~Ia~~ia~lp~~~y~~~~~g~~~~~~~~l~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L~~l~ 146 (432) T protein:vir:10 67 VKLVSQAIAAMPLTMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLA 146 (432) T ss_pred HHHHHHhhhhCceeEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEc Confidence 55555555554655421 11 111122333332 12 223344556677788999998887776664 566778 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++-++..+.+.. +++++..+ + ..+.+ + +++ T Consensus 147 ~~~v~v~~~~~g~~-----~y~~~~~~---g-------------~~~~~-----------------~--------~~~-- 178 (432) T protein:vir:10 147 NDRLTITTDTKGNT-----AYRYRRTD---G-------------QMIDI-----------------P--------KQQ-- 178 (432) T ss_pred CCceEEEEcCCCcE-----EEEEEecC---c-------------eEEEE-----------------c--------Ccc-- Confidence 88776653222211 11111000 0 00000 0 000 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-HhhccceeeechHHhccCCCCCCcccC- Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPTQMIKTEYDTNGEKVT- 306 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~~~l~~~~~~~g~~~~- 306 (502) +++++.+. .+...|+|-+..+...|+..... .++... |..+...-.| +......+..... T Consensus 179 --------iih~~~~~-----~dg~~G~spi~~~~~~i~~~~~~-~~~~~~~f~ng~~~~gi----l~~~~~l~~e~~~~ 240 (432) T protein:vir:10 179 --------IWKIMGYS-----LDGENGLSAIRYGAQIFGTAIAA-EAQAARAFRNGQLQSVY----YQIDRFLTDDQYDS 240 (432) T ss_pred --------EEEecCCC-----CCCcccccHHHHHHHHHHHHHHH-HHHHHHHHhcCCCcceE----EecCCCCCHHHHHH Confidence 12232221 12245888877766666543322 333333 4443332222 3322111100000 Q ss_pred ccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHH Q lcl|NC_012753. 307 VKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSD 385 (502) Q Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~ 385 (502) ....+....+....... +.+..++.++.....-++++..+....+|+...|++|..+|....+. .++..+...... T Consensus 241 ~~~~~~~~~nag~~~vl---~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~t~~~~sn~e~~~~~ 317 (432) T protein:vir:10 241 FAKKVSGSVEAGRAPLL---EGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLG 317 (432) T ss_pred HHHHHhhhhhCCCceec---CCCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCcccccchHHHHHHH Confidence 00000000000000011 12223566666666677888888889999999999999998765432 222222211111 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 386 TY-QMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 386 l~-~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) .+ .++.-..+.++..|.. .++.........+.++.+.-+-.|..+.++...+++.+|+|+.-++++. T Consensus 318 f~~~tl~P~~~~ie~~ln~------------kL~~~~~~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~T~NE~R~~ 385 (432) T protein:vir:10 318 FLSMTLSPWLRRIEQSIAL------------NLLSPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEAREI 385 (432) T ss_pred HHHHHHHHHHHHHHHHHHh------------hhcCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHH Confidence 11 1222222233333322 1111111111223344344456788899999999999999999997665 Q ss_pred c--CCCCHHHHHHHH----HHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 465 T--LNVTKEQAQEIY----QKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 465 ~--~~~~deea~~el----~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) + +++..+...--+ .-+.. ....+. +.+. .+-..+ T Consensus 386 ~glppi~g~~~~~~~~~~~~pl~~--~~~~~~-~~~~-~~~~~~ 425 (432) T protein:vir:10 386 EGLPKLGGNAAVLTVQSAMVPLDS--IGLQAS-PEPA-SGLGNQ 425 (432) T ss_pred hCCCCCCCCcceEeecCcccchhh--hcccCC-CCCC-CCCCCc Confidence 3 234322100000 00110 000111 1111 111122 No 164 >protein:vir:7321 Length: 556 # NCBI annotation: hypothetical protein # Family: family:all:481 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848212;genbank:gi:30387383;genbank:GeneID:2641872 Probab=97.83 E-value=1.6e-05 Score=46.84 Aligned_cols=458 Identities=12% Similarity=0.085 Sum_probs=193.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccc-cc--cCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVT-YR--DSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~-~~--~~~~~~~~~~~~~~n~~k 77 (502) |.= ..+..+++.+. .+..++-.....|+.+|.=-.+-+. +. ..........++--+.+. T Consensus 1 m~~--~~~~~l~~r~~----------------~l~~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~~~~~~~~~~~dst~~ 62 (556) T protein:vir:73 1 MAE--TEKERLLKQLA----------------QLKNERTSFESHWLDLSDFINPRGSRFLTSDVNRDDRRNTKIVDPTGS 62 (556) T ss_pred CCh--hhHHHHHHHHH----------------HHHHHhhHHHHHHHHHHHHhccccCCcCCCCCCcchhhcCccccchHH Confidence 432 22222332211 1222333334444443321111010 00 111111222344456677 Q ss_pred HHHHHHhhhhhcCcc-------eEeeCCH------HHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEE Q lcl|NC_012753. 78 TASKKVASLVFNEQA-------TIRVDNE------VADA-------FINETLKNDKFSKNFERYLESCLALGGLAMRPYI 137 (502) Q Consensus 78 ~iv~~~a~~l~~ep~-------~i~~~d~------~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~ 137 (502) ..|+.+|+.|.+-.. ++.+.++ .+.+ .+.+.|...+|...+.++..+..+.|.+.+.+-. T Consensus 63 ~a~~~Las~l~~~ltpp~~~WF~l~~~d~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l~~~~ 142 (556) T protein:vir:73 63 MAQRILSSGMMSGITSPARPWFKLATPDPDMMDYGPVKIWLEVVQRRMNEVFNKSNLYQSLPVMYASLGTFGTGAMAVME 142 (556) T ss_pred HHHHHHHHHHHHhhcCCCCcccccccCcccccchHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCceeeeeee Confidence 777777777765221 1333332 2233 4445677789999999999999999999987666 Q ss_pred eC-CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee-------CCCc---eEEEEEEEEEEeCCeEEEEEEEEecCC Q lcl|NC_012753. 138 DG-DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE-------GQKV---KYYSLIEFHEWNKETYTISNELYESES 206 (502) Q Consensus 138 d~-~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~-------~~~~---~~yt~~E~h~~~~~~~~I~~~l~~~~~ 206 (502) |+ +.+++..++..+++- ..|..+....+| +++...- +... ..-..++. ...+..+.|.|.+|.... T Consensus 143 ~~~~~~r~~~~~l~~~~~-~~d~~G~vd~i~-r~~~~t~~ql~~~fg~~~l~~~v~~~~~~-~~~~~~~~v~~~V~pr~~ 219 (556) T protein:vir:73 143 DDQDVIRTMPFPIGSYYL-ANSPRGSVDTCI-RQFSMTVRQMVQEFGLDNVSTSVKGMWEN-GTYETWVEVNHCITPNVN 219 (556) T ss_pred cCCceEEEEEeecceeEE-eeCCCCCeEEEE-EEEeccHHHHHHHcCcccCCHHHHHHHhc-CCccceEEEEEEEecccc Confidence 64 457888999998775 455555444443 3322110 0000 00000000 000112344444443222 Q ss_pred ccc--c-Cceeeccccc-c-CCCc--ceeecCCCcceEEEecCCccccccccCcCCcch-hhhHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 207 KTI--I-GQRVPLSTLY-E-DLEE--TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSI-FDNAKTTMDFINTTYDEFMW 278 (502) Q Consensus 207 ~~~--l-G~~v~l~~~~-~-~l~~--~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~-~~~~~~lid~ld~~~S~~~~ 278 (502) .+. . +.-.|..++| + +... .....|+..-||++++- +...++.||+|. -..+.+-+..|+..--..+. T Consensus 220 ~~~~~~~~~~~p~~s~~~~~~~~~~~vl~esg~~e~P~~~~Rw----~~~~ge~YGrg~P~~~~lgD~k~L~~l~~~~l~ 295 (556) T protein:vir:73 220 RDSGKMDSKNKPYRSVYFESGGDSDKLLRESGFDEFPILAPRW----EVNGEDVYASSCPGMLALGQVKALQVEQKRKAQ 295 (556) T ss_pred ccccccCcccceEEEEEEEecCCCceecccCCcccCCceeeee----eecCCcccccCccHHHhHHHHHHHHHHHHHHHH Confidence 111 1 1223333332 1 1111 11223444445555542 234577899994 88999999999987777666 Q ss_pred HHhh-ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeee---ccccchHHHHHHHHHHHHH Q lcl|NC_012753. 279 EVKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDL---TTDIRSDDYIKAINKGLSL 354 (502) Q Consensus 279 ~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~---~~~ir~e~~~~~l~~~l~~ 354 (502) -.+. .+..+.||.++. .......++.. .+..+ .+....++.+ ++++ ....+.++.+.+. T Consensus 296 ~~~~~~~pp~~v~~~~~-----~~~~~~~pgg~------~~~~~----~~~~~~i~p~~~~~~d~--~~~~~~i~~~~~r 358 (556) T protein:vir:73 296 LIDKATNPPMVAPTSLK-----NQRVSLLPGDV------TYLDV----ISGQDGFKPAYLVNPNT--ADLLADIQDTRQT 358 (556) T ss_pred HHHHHhcCceecccccc-----ccceeeccCcc------ccccC----CCCccceeeeccccccH--HHHHHHHHHHHHH Confidence 5543 344455544421 11111111110 01100 1112233332 3332 2223334444444 Q ss_pred HHHhcCCC-hhhccccccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccCCCcc--cccceEE Q lcl|NC_012753. 355 FEMQLGVS-TGMFSFDGKSMKTATEVVSEQSDTYQMRNSI-ATLVEKSLKELVISILELAKVYNLYTGEIP--TMDEVSV 430 (502) Q Consensus 355 i~~~~g~s-~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~-~~~~~~~l~~l~~~il~~~~~~~~~~~~~~--~~~~i~v 430 (502) |....-.+ ...++..+....|||||......+.+..+-. .+.-...|..|+.-++.++.-.+..+.... ....++| T Consensus 359 I~~af~~d~~~~l~~~~~~r~TAtEv~~r~~E~~~~LG~v~~rl~~E~l~Pli~r~~~il~r~g~lP~~P~~l~~~~i~v 438 (556) T protein:vir:73 359 INSAYFVDLFMMLQNINTRSMPVEAVIEMKEEKLLMLGPVLERLNDEALNPLIDRVFSIMARKNMLPEPPDVLQGMPLRI 438 (556) T ss_pred HHHHhhcchhhhhccCCCCCccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhcCceeEE Confidence 43332111 0123334455679999998887777665542 222344455555555555544333332111 1234666 Q ss_pred EeCCCccCCHH-HH---HHHHHHHHh--cCC-------CCHHHHHH---hcCCC------CHHHHHHHHH-HHHHhh--- Q lcl|NC_012753. 431 DLDDGVFTDRN-AE---FDYWSKMVA--AGF-------APKTMAIE---KTLNV------TKEQAQEIYQ-KINDET--- 484 (502) Q Consensus 431 ~f~d~i~~d~~-~~---~~~~~~~~~--~Gi-------~S~et~l~---~~~~~------~deea~~el~-ri~~E~--- 484 (502) ++--.+-.... .. +....+.+. +++ +....++. ...|+ +++|+++.-+ |.++.+ T Consensus 439 ~yis~La~aqk~~~~~~i~~~~~~~~~laq~~Pe~~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~rq~r~~~qq~~~ 518 (556) T protein:vir:73 439 EYISVMAQAQKSIGLTSLSQTVGFIGQLAQFKPEALDKLDVDQAIDAFSEMSGVSPTVIVPQEQVQGIREERAKQAQAAQ 518 (556) T ss_pred EeecHHHHHHHHHHHHHHHHHHHHHHHHhccChhhHhcCCHHHHHHHHHHHcCCChhhcCCHHHHHHHHHHHHHHHHHHH Confidence 66433221100 01 111111111 111 22233222 33444 3444433211 111111 Q ss_pred -hcccCCCCCccccCCCCC Q lcl|NC_012753. 485 -MVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 485 -~~~~~~~~~~~~~~~~g~ 502 (502) .+.... ..+.+...++ T Consensus 519 ~~~~~~~--a~~~~~~~~~ 535 (556) T protein:vir:73 519 AMAMGQA--AAQGAKTLSE 535 (556) T ss_pred HHHHHHH--HHHHHHHhhh Confidence 111000 0111111111 No 165 >protein:vir:6322 Length: 510 # NCBI annotation: head-tail connector protein # Family: family:all:481 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877469;genbank:gi:33300841;uniprot:Q7Y2D5;genbank:GeneID:1482611 Probab=97.82 E-value=1.7e-05 Score=46.77 Aligned_cols=439 Identities=7% Similarity=-0.032 Sum_probs=168.0 Q ss_pred HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh Q lcl|NC_012753. 7 IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL 86 (502) Q Consensus 7 ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~ 86 (502) +|.-+++.+.+. + +-.....|+.++.=-.+-+......+......+.--+.+...++.+|+- T Consensus 1 mk~~~~~~~~~l------------k------R~~~e~~w~e~a~~tlP~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~ 62 (510) T protein:vir:63 1 MKTTAAMLWEKL------------R------DGSVEQRAIEFAKTTLPYLMVDPMSGSRGVVEHDFQSAGALLVNNLAAK 62 (510) T ss_pred ChhHHHHHHHHH------------h------ccchHHHHHHHHHhhccccCCCCCCccccccCCCccchHHHHHHHHHHH Confidence 333333322110 0 1112233433332111111111111111111222234566777777776 Q ss_pred hhcC--cce-----EeeCCH-------------HHHH-------HHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Q lcl|NC_012753. 87 VFNE--QAT-----IRVDNE-------------VADA-------FINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG 139 (502) Q Consensus 87 l~~e--p~~-----i~~~d~-------------~~~e-------~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~ 139 (502) |.+- ||. +.+.++ .+.+ .+...+..++|...+.++..+....|.+ ..|.++ T Consensus 63 l~~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~Li~~G~a--~l~~~~ 140 (510) T protein:vir:63 63 LARSLFPTGIPFFRSELTDAIRREADSRDTDITEVTAALARVDRKATQRLFQNASLAVLTQVIKLLIVTGNA--LLYRDS 140 (510) T ss_pred HHhhhcCCCCcccccCCChHHhhcccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhhCeE--EEEEcC Confidence 6652 221 333321 1233 3444666789999999999999998985 555677 Q ss_pred CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee---CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeec Q lcl|NC_012753. 140 DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE---GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPL 216 (502) Q Consensus 140 ~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~---~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l 216 (502) +..++..+|-.+++ +..|..+....+|........ .+-.....+-..+.-.+....|-|.+++.++ ...|. T Consensus 141 ~~~~~~~~pl~~y~-v~~d~~G~vd~i~rr~~~t~~~l~e~~~~~~~~~~~~~~~~~~v~v~~~V~~~~~-----~~~~~ 214 (510) T protein:vir:63 141 DAATVVAWSLRSYA-VRRDATGRWMDIVLKQRYKSKDLDEEYKQDLMRAGRNLSGSGSVDLYTHVQRKKG-----TAMEY 214 (510) T ss_pred CCcEEEEEEcceeE-EeeCCCcCeeEEEeeeeccHHHHhHHhhhhhhccccccCCCcceEEEEEEEeecC-----CCceE Confidence 76677788877755 455655544444433221100 0000000000000000111222222222111 01121 Q ss_pred cccccCCCcc--eeec--CCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-Hhhccceeeech Q lcl|NC_012753. 217 STLYEDLEET--VTLN--GLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPT 291 (502) Q Consensus 217 ~~~~~~l~~~--~~~~--~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~ 291 (502) .++|-..... .... +...-||++++-+ ...++.||+|--..+.+-+..|+..--....- ....+....|+. T Consensus 215 ~sv~~e~dg~~~~~~~~~~~~e~P~~~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~~a~~a~~~~~lv~p 290 (510) T protein:vir:63 215 AELYHEIDGVRVGKEGRWPIHLCPYIVPTWN----LAPGEHYGRGHVEDYIGDFAKLSLLSEKLGLYELESLEVLNLVDE 290 (510) T ss_pred EEEEEEecCceeccccccccccCceeeeeee----ecCCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCc Confidence 2221111100 0011 1223455554432 23467899999999999999999765544442 233444444433 Q ss_pred HHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec--cccchHHHHHHHHHHHHHHHHhcCCChhhcccc Q lcl|NC_012753. 292 QMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT--TDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD 369 (502) Q Consensus 292 ~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~ 369 (502) +.+. +.....+ .....+. . + ....+..++ +..+...-.+.++.+.+.|....=+ ++... T Consensus 291 ~g~~-----~~~~~~~-----~~~g~~v---~--g-~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~af~~---~l~~~ 351 (510) T protein:vir:63 291 AKGA-----VVDDYQD-----AEMGDYV---P--G-GAEAVRAYERGDYNKMAAIQQSLQAVVVRLNQAFMY---GANQR 351 (510) T ss_pred cccc-----chhhhcc-----CCCceee---c--C-CcccceeeecCcccchHHHHHHHHHHHHHHHHHHHh---hcccC Confidence 2111 0000000 0000000 0 0 111122222 1111222223333333333222101 11122 Q ss_pred ccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHH Q lcl|NC_012753. 370 GKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWS 448 (502) Q Consensus 370 ~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~ 448 (502) .+...|||||....+.+.+..+- ..+.-...|..|++-.+.++.-.++.+-.........|++- +....++... T Consensus 352 ~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~gl~p~p~~~~~~~~v~~i-----s~Laraq~~~ 426 (510) T protein:vir:63 352 DAERVTAEEVRITAEEAENTLGGTYSLLAENLQSPLAYVCLSEVDDALLQGLITKQHKPAIETGL-----PALSRSAAVQ 426 (510) T ss_pred CCCCcCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCchhcccceecch-----hHHHHHHHHH Confidence 33446999999887776665544 22223333444444444433322222211111111112221 2222222111 Q ss_pred ------HHH-hcCC-------CCHHHH---HHhcCCC-------CHHHHHHHHHHHHHhhhcc------c---CCCCCcc Q lcl|NC_012753. 449 ------KMV-AAGF-------APKTMA---IEKTLNV-------TKEQAQEIYQKINDETMVS------T---DSFRTSE 495 (502) Q Consensus 449 ------~~~-~~Gi-------~S~et~---l~~~~~~-------~deea~~el~ri~~E~~~~------~---~~~~~~~ 495 (502) +.+ ..|- +-...+ +....|+ ++||++++.++.+++.+++ . -..-..+ T Consensus 427 ~l~~~~q~l~~~~~~aq~~~~id~d~~~~~~a~~~Gv~p~~ivrs~eev~a~~~~~~qq~~~~~~~~~~~~~~a~~~~~~ 506 (510) T protein:vir:63 427 SMLNASQVIAGLAPIAQLDPRISLPKMMDTIWAAFSVDTSQFYKSADELQAEAEQQRQQAAQAQAAQETLLEGASDMTNA 506 (510) T ss_pred HHHHHHHHHHHhcCchhhhccCCHHHHHHHHHHHhCCChhHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 111 1111 112222 2333454 5566666544322222111 0 1111122 Q ss_pred ccCC Q lcl|NC_012753. 496 EVDI 499 (502) Q Consensus 496 ~~~~ 499 (502) -+|+ T Consensus 507 ~~g~ 510 (510) T protein:vir:63 507 LAGV 510 (510) T ss_pred ccCC Confidence 2222 No 166 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=97.74 E-value=2.3e-05 Score=46.02 Aligned_cols=375 Identities=12% Similarity=0.102 Sum_probs=147.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+|++++.+.+.-+-+..... ..+..... ...+. .+ .+......-...| T Consensus 1 mg~~~~~~~~~~~~~~~~~~~--~~~~~~~~-----------------------~~~~~--t~----~~~~~~~~v~~cv 49 (403) T protein:vir:10 1 MGFKSWITEKLNPGQRIIRDM--EPVSHRTN-----------------------RKPFT--TG----QAYSKIEILNRTA 49 (403) T ss_pred Ccchhhhhhccchhhhhhhcc--cccccccC-----------------------Ccccc--cH----HHHHHHHHHHHHH Confidence 999998887764221110000 00111000 00000 00 0011111122334 Q ss_pred HHHhhhhhcCcceEee------C-CHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRV------D-NEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQ 148 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~------~-d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~ 148 (502) +..|+-+..=|..+.- + +.....-+..+|.. | ....-...++..++..|.+|+.+ +.. .+..++ T Consensus 50 ~~Ia~~ia~~p~~v~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~~ll~Gnayi~~--~~~--~l~~l~ 125 (403) T protein:vir:10 50 NMVIDSAAECSYTVGDKYNIVTYANGVKTKTLDTLLNVRPNPFMDISTFRRLVVTDLLFEGCAYIYW--DGT--SLYHVP 125 (403) T ss_pred HHHHHHHhhCceeEeecccccccccccccchHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEE--eCc--eeEeec Confidence 4444444444443311 0 11111223344432 2 22233344566666778776543 332 233455 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) ++.+-. ..+.++. +| .+.+ .. .+. |.. . + T Consensus 126 ~~~~~v-~~~~~~~------------------~~-~~~~---~~---~~~---~~~------------~--------e-- 154 (403) T protein:vir:10 126 AALMQV-EADANKF------------------IK-KFIF---NN---QIN---YRV------------D--------E-- 154 (403) T ss_pred CcceEE-EEcCCce------------------EE-EEEe---cC---cee---ecc------------c--------c-- Confidence 544332 1222111 00 0000 00 000 000 0 0 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) .++|+.+..-........|+|.+.-+...++....+..-..+-|..|...-.| |+.....+..... T Consensus 155 --------iih~~~~~~~~~~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gi----l~~~~~l~~e~~~-- 220 (403) T protein:vir:10 155 --------IIFIKDNSYVCGTNSQISGQSRVATVIDSLEKRSKMLNFKEKFLDNGTVIGLI----LETDEILNKKLRE-- 220 (403) T ss_pred --------eEEecccccccCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE----EEeCCCCCHHHHH-- Confidence 01111111000011335688888777777765554443333445555433222 3322111111000 Q ss_pred cccccc-chhhcc-------ccCCCCccccceeeecc--ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH Q lcl|NC_012753. 309 REFETG-HNVYEQ-------FDSGDMDKGIGITDLTT--DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE 378 (502) Q Consensus 309 ~~~~~~-~~~~~~-------~~~~~~~~~~~i~~~~~--~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte 378 (502) .+... ...+.. +..+ .+..++.++. ....-++.+..+....+|+...|++|..+|.... ++..+ T Consensus 221 -~~~~~~~~~~~g~~n~g~~~vl~---~g~~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--sn~e~ 294 (403) T protein:vir:10 221 -RKQEELQLDYNPSTGQSSVLILD---GGMKAKPYSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLLDGGNN--ANIRP 294 (403) T ss_pred -HHHHHHHHHhCCcccCcceeecC---CCceeEEecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC--cCHHH Confidence 00000 011100 0011 1112344432 2234467888888899999999999999975332 12222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCC--ccCCHHHHHHHHHHHHhcCCC Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDG--VFTDRNAEFDYWSKMVAAGFA 456 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~--i~~d~~~~~~~~~~~~~~Gi~ 456 (502) .. +..++..|..++..|....+. .+ ...+.+++++- +-.|..+.++.+.+++.+|++ T Consensus 295 ~~-------------~~f~~~tl~P~~~~ie~~l~~-~L-------~~~~~~d~~~~~~l~~D~~~~~~~~~~~~~~G~l 353 (403) T protein:vir:10 295 NI-------------ELFYYMTIIPMLNKLTSSLTF-FF-------GYKITPNTKEVAALTPDKEAEAKHLTSLVNNGII 353 (403) T ss_pred HH-------------HHHHHHHHHHHHHHHHHHHHH-hc-------CceeeeccchhhhcccCHHHHHHHHHHHHhCCCc Confidence 11 111122222222222211111 00 12344455432 455888888889999999999 Q ss_pred CHHHHHHhc--CCCCHHHHHHHHHHHHH--hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 457 PKTMAIEKT--LNVTKEQAQEIYQKIND--ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 457 S~et~l~~~--~~~~deea~~el~ri~~--E~~~~~~~~~~~~~~~~~g~ 502 (502) +.-++++.. ++++++.+.+-+--... ......+...+.+++.-=|| T Consensus 354 T~NE~R~~~gl~pi~~~~~d~~~~p~n~~~~~~~~~~~e~~~~~~~~~g~ 403 (403) T protein:vir:10 354 TGNEARSELNLEPLDDEQMNKIRIPANVAGSATGVSGQEGGRPKGSTEGD 403 (403) T ss_pred CHHHHHHHhCCCCCCcccccccccccccccccccCCCCcCCCCCCCcCCC Confidence 999876653 45555444333211111 11111122222233333344 No 167 >protein:vir:100039 Length: 522 # NCBI annotation: T7-like head-to-tail connector # Family: family:all:481 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214201;genbank:gi:61806424;genbank:GeneID:3294719 Probab=97.69 E-value=2.9e-05 Score=45.49 Aligned_cols=431 Identities=11% Similarity=0.057 Sum_probs=167.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCcccccc--CCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRD--SNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~--~~~~~~~~~~~~~n~~k~ 78 (502) |++.++-..+-. ++-.....|+.++.=-.+-+.... ........+++--+.+.. T Consensus 1 m~~~~r~~~L~~------------------------~R~~~e~~w~e~~~~tlP~~~~~~~~~~~~~~~~~~~~dstg~~ 56 (522) T protein:vir:10 1 MKARERYNQLTT------------------------ARQMFLDKAVECSELTLPYLIDDDISSRPNHKSLTVPWQSVGAK 56 (522) T ss_pred CchHHHHHHHHH------------------------HhhHHHHHHHHHHHHhhhcccCCCCCCCcccccccccccchHHH Confidence 665544433322 121222333333211111110000 011111122333356677 Q ss_pred HHHHHhhhhhcC--cc-----eEeeCCHH------------HHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEE Q lcl|NC_012753. 79 ASKKVASLVFNE--QA-----TIRVDNEV------------ADAFI-------NETLKNDKFSKNFERYLESCLALGGLA 132 (502) Q Consensus 79 iv~~~a~~l~~e--p~-----~i~~~d~~------------~~e~l-------~~~~~~~~f~~~~~~~~~~~~~~G~~~ 132 (502) .++.+|+-|.+- || ++.+.+.. +.++| ...+..++|...+.++..+..+.|.++ T Consensus 57 a~~~LAa~l~~~ltpp~~~WF~l~~~d~~l~~~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~ 136 (522) T protein:vir:10 57 CCVTLAAKLMLAVLPPQTSFFKLQVRDDKLGEELDPQIRSELDLSFSKMERMIMDYIAASNDRVAVHQALKHLIVGGNAL 136 (522) T ss_pred HHHHHHHHHHHhhcCCCCccccccCChHHHhhhcChhhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCcee Confidence 777777776652 22 13333211 23333 344667899999999999999999977 Q ss_pred EEEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEE--------e-e---------CCCceEEEEEEEEEEeCC- Q lcl|NC_012753. 133 MRPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTK--------T-E---------GQKVKYYSLIEFHEWNKE- 193 (502) Q Consensus 133 ~~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~--------~-~---------~~~~~~yt~~E~h~~~~~- 193 (502) .|.+++.++ .+|-.+++ +..|..+....+|...... . + .+....++.++...+... T Consensus 137 --ly~~~~~~~--~~pl~~y~-v~~d~~G~vd~i~r~~~~t~~ql~~~fg~~~~~~~~~~~~~~~~~v~v~~~v~p~~~~ 211 (522) T protein:vir:10 137 --IFMGKDGLK--TFPLTRYV-INRDGDGNVLEIVTKELISRKVLDIELPEPKPNTGIDESSTTNDDVTIYTYVKLDKSS 211 (522) T ss_pred --EEEcCCCce--EEEcceEE-EeeCCCCCeeEEEeeeeccHHHHHHhcchhccchhhhcccCCCCceEEEEEEEeeccC Confidence 467776644 46666644 4456555444444322211 0 0 011111111111111111 Q ss_pred -eEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHH Q lcl|NC_012753. 194 -TYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTT 272 (502) Q Consensus 194 -~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~ 272 (502) .+...+.++ |..++ +.. ...|...-||++++-+ ...++.||+|--..+.+-+..|+.. T Consensus 212 ~~~~~~~~~~--------~~~~~------~~~---s~~g~~~~P~~~~Rw~----~~~ge~YGrgp~~~~l~D~k~L~~l 270 (522) T protein:vir:10 212 GRWVWHQEAF--------DKIIP------DSR---STAPKNASPWLPLRFN----TVDGEDYGRGRVEEFLGDLKSLDGL 270 (522) T ss_pred CceEEEEccC--------Ccccc------ccc---cccccccCCceeeeee----ecCCCccccchHHHHHHHHHHHHHH Confidence 111111110 11111 000 1123334455555432 2346789999999999999999977 Q ss_pred HHHHHHHH-hhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceee-eccccchHHHHHHHHH Q lcl|NC_012753. 273 YDEFMWEV-KMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITD-LTTDIRSDDYIKAINK 350 (502) Q Consensus 273 ~S~~~~~~-~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~ir~e~~~~~l~~ 350 (502) --....-. ...+..+.||.+..... ....+. ....+. ....+.-..++. ...++ ....+.++. T Consensus 271 ~~~~~~~~~~a~~p~~lv~~~~~~~~-----~~l~~~-----~~~~~v---~g~~~~v~~~~~~~~~d~--~~~~~~i~~ 335 (522) T protein:vir:10 271 SQSLIEGAAAASKVVFLVSPSSTTKP-----ATIAKA-----GNGAIV---QGRPEDVAVIQVGKTADF--STAANMATA 335 (522) T ss_pred HHHHHHHHHHhcCCceeecccccccc-----ccccCC-----CCccee---cCCCccceeecccccccc--hHHHHHHHH Confidence 65555544 34555556644332211 111000 011111 011111111110 01122 112233334 Q ss_pred HHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccCCCcccccc-e Q lcl|NC_012753. 351 GLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLV-EKSLKELVISILELAKVYNLYTGEIPTMDE-V 428 (502) Q Consensus 351 ~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~-~~~l~~l~~~il~~~~~~~~~~~~~~~~~~-i 428 (502) +.+.|....-+ +........|||||....+...+..+-.-..+ ...|.-|+.-++.++.-.++.+.......+ . T Consensus 336 ~~~ri~~aFl~----~~~~d~~rvTAtEV~~r~~E~~~~LGpv~~rl~~E~l~Pli~r~~~il~r~g~lP~~p~~~~~~~ 411 (522) T protein:vir:10 336 IEKRLLEAFLV----MNVRNAERVTAEEVRLTQLELEQQLGGIFSLLVIEFLIPYLNRTLLVLQRSNQIPKLPKDIVRPT 411 (522) T ss_pred HHHHHHHHHhh----ccCCCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCccccccc Confidence 43334332111 11223345699999988777776555522222 333344444455554433333322111111 1 Q ss_pred EEEeCCCccCCHHHHHHHHHHHHh--cCCCCHH---------HHH---HhcCCC-------CHHHHHHHHHHHHHhhh-- Q lcl|NC_012753. 429 SVDLDDGVFTDRNAEFDYWSKMVA--AGFAPKT---------MAI---EKTLNV-------TKEQAQEIYQKINDETM-- 485 (502) Q Consensus 429 ~v~f~d~i~~d~~~~~~~~~~~~~--~Gi~S~e---------t~l---~~~~~~-------~deea~~el~ri~~E~~-- 485 (502) .|++-..+ .....++.+....+ +.++.++ .++ ....|+ +++|++++-+..++.++ T Consensus 412 ~v~~is~L--araq~~~~l~~~~~~i~~~~~p~~~~~~id~d~~~~~~a~~~Gvp~~~ivrt~eev~~~~q~~q~~~~~~ 489 (522) T protein:vir:10 412 IVAGVNAL--GRGQDRESLTAFVGTIAQTLGPEALMQYLNPLEAIKRLAAAQGIDVLNLVKTEQQLAEEQQAAQQQAAQQ 489 (522) T ss_pred cccchhHH--HHHHHHHHHHHHHHHHHHhhCchhhhhcCCHHHHHHHHHHHhCCChhhhcCCHHHHHHHHHHHHHHHHHH Confidence 22222211 11111121111110 0111122 222 223443 44444333222111111 Q ss_pred ---cccCCCCCccccC----CCCC Q lcl|NC_012753. 486 ---VSTDSFRTSEEVD----IYGE 502 (502) Q Consensus 486 ---~~~~~~~~~~~~~----~~g~ 502 (502) ..+...-+....+ .-|. T Consensus 490 ~~~~~a~~~~~~~~~~~~~~~~~~ 513 (522) T protein:vir:10 490 SLVDQAGQMTGSPLMDPTKNPQLM 513 (522) T ss_pred HHHHHHHHHhcccccCccccHHHH Confidence 0001000000000 0010 No 168 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=97.67 E-value=3e-05 Score=45.34 Aligned_cols=373 Identities=11% Similarity=0.059 Sum_probs=160.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |++|.+.+.--.. .. ..... ...... + .++... ..+.... +.-+..+--... T Consensus 1 M~~f~~~~~~~~~----~~-~~~~~-----~~~~~~--------------~--~~~~~~-~~~~~v~~~~~~~~~~v~~~ 53 (386) T protein:vir:48 1 MPIFNITNLATES----PP-ISQGG-----FFDITD--------------P--DFLSTL-NGSEWVSAESALRNSDLFSI 53 (386) T ss_pred Ccccccccccccc----cc-ccccc-----cccccc--------------c--hhcccc-cCCceechhhhhcchHHHHH Confidence 9999865432110 00 00000 000000 0 000000 0011110 111122222345 Q ss_pred HHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEEcCCeEEEEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSFVQATVFFPLQA 157 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~v~~~~~~Pi~~ 157 (502) ++..|+-+-+=|+ .+.+......+.+-...-.....+..++...+..|.+++.+..|.+ . ..+..++|+.+-+... T Consensus 54 i~~ia~~ia~~p~--~~~~~~~~~l~~~pN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~ 131 (386) T protein:vir:48 54 INQLSNDLATVKL--TASRKQLQGIIDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRL 131 (386) T ss_pred HHHHHHhhccCce--eeccchhHHHhhcCCCCCCHHHHHHHHHHHhhhcCcEEEEEEECCCCcEEEEEEecCceeEEEEc Confidence 5666665555443 4555544444444333233444556667788888999888877654 3 3666677777654322 Q ss_pred cCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceE Q lcl|NC_012753. 158 NTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLF 237 (502) Q Consensus 158 d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f 237 (502) ..+.. .+|. +.. +....|..+.+ . + --+ T Consensus 132 ~~~~~-----------------~~y~-------------~~~------~~~~~~~~~~~---~---~----------~ev 159 (386) T protein:vir:48 132 DNKDG-----------------IYYN-------------ITF------DDPRIPPKQHV---P---Q----------GDV 159 (386) T ss_pred CCCce-----------------EEEE-------------EEe------cCccccceeEe---c---C----------ccE Confidence 21111 0110 000 00000110000 0 0 012 Q ss_pred EEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc-eeeechHHhccCCCCCCcccCccccccccch Q lcl|NC_012753. 238 TYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQMIKTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 238 ~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~ 316 (502) ++++.+.. .+..+|+|.+..+...+.....+.....+-|..+.. ..+ +........... ..+ .+ . T Consensus 160 ih~~~~~~----~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i-----i~~~~~~~~e~~---~~~-~~-~ 225 (386) T protein:vir:48 160 LHFKLLSV----DGGLTSVSPLMALSRELNIQKASDKLTLNSLKNALNANGI-----LKIKGGGLLDFK---TKL-SR-S 225 (386) T ss_pred EEecCCCC----CCceeeccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE-----EEeCCCCCHHHH---HHH-HH-H Confidence 44543322 234568998877776665555444333334444332 222 222211111000 000 00 0 Q ss_pred hhccccCCCC-----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH-HHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDM-----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE-VVSEQSDTYQMR 390 (502) Q Consensus 317 ~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte-i~~~~~~l~~~~ 390 (502) +.......+ +.+.-++.++.....-++++..+...++|+...|+||..+|+.+++. ++.+ .+.. T Consensus 226 -~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~e~~~~~~-------- 295 (386) T protein:vir:48 226 -RQAMKQMQGGPLVLDDLEEFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPENVVGGQGDQQ-SSLEMSLDL-------- 295 (386) T ss_pred -HHHhhcCCCCceecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc-cHHHHHHHH-------- Confidence 000000000 11223555555555667888888889999999999999998654432 2222 2211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCC Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNV 468 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~ 468 (502) ++.+|..++..|..-.+. .+.. ++.+++...+-.+....+..+.+++.+|++++-++++.+ .|+ T Consensus 296 ------~~~~l~P~~~~ie~~l~~-~l~~-------~~~~~~~~~~~~d~~~~~~~~~~l~~~g~~t~nE~r~~lg~~~~ 361 (386) T protein:vir:48 296 ------YNKAVSRYLRPFLSELSQ-KLSC-------DVDADILPAVDPTGSNSVSRINSMVKSGTLAQNQGLYILQQAEI 361 (386) T ss_pred ------HHHHHHHHHHHHHHHHHH-hhcc-------hhhcchhhhhccChHHHHHHHHHHHhCCCcCHHHHHHHhhcCCC Confidence 222233332222211110 0000 111222222334566777888899999999999887653 355 Q ss_pred CHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 469 TKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 469 ~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+.++ .+.. . .......++|-.|+ T Consensus 362 ~~~~~----~~~~--~----~~~~~~~gGd~~~~ 385 (386) T protein:vir:48 362 LPKEL----PEGE--N----PNKTTLKGGEINGE 385 (386) T ss_pred CCccc----hhhc--C----CCCCccCCCCCCCC Confidence 54432 2111 1 11112334444555 No 169 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=97.65 E-value=3.3e-05 Score=45.17 Aligned_cols=444 Identities=14% Similarity=0.142 Sum_probs=189.3 Q ss_pred hhcccccchhhhhcccccc---CCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceec---chHHHHHHHHhhhh- Q lcl|NC_012753. 15 NYVITNQSLNSITDHPKIA---ISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHL---PIGRTASKKVASLV- 87 (502) Q Consensus 15 ~~~~~~~~l~~i~~~~~~~---~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~---n~~k~iv~~~a~~l- 87 (502) ...+...+|++..+..+.. .+...-....-.--.|.|..-.+.-.-......-+++.++ +-.--.|+..++=+ T Consensus 1 m~~lfg~~i~~~~~~~~~~s~~~~~~~dg~~~i~~~~~~~~~~~~e~~~~~~~eLI~~YR~ma~~pEvd~Av~eIVneai 80 (533) T protein:vir:10 1 MSQLFGFSLERAKKAPKGPSFVQKDNLDGSQPVSGGGYYGYTVDFDGQVRNEYQLISRYREMVLQPECDSAVDDIVNETI 80 (533) T ss_pred CccccccccccccccccCCCCCCCCcccccceeecccccceeeecccccchHHHHHHHHHHHhhccchhhHHHHhhccee Confidence 1122222332221111110 0000000000000001111000000000000000011110 11111122221111 Q ss_pred ----hcCcceEeeCC--------HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC----c-eEEEEEcCC Q lcl|NC_012753. 88 ----FNEQATIRVDN--------EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD----Q-IRVSFVQAT 150 (502) Q Consensus 88 ----~~ep~~i~~~d--------~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~----~-~~i~~v~~~ 150 (502) ..+|+++.+++ +...+..+.+++--+|+++..+.+....+-|..|++..+|++ + ..+.+++|. T Consensus 81 v~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~pk~GI~ELr~lDPr 160 (533) T protein:vir:10 81 CGNFDDVPVSVELSNLKVSDKIKKLIREEFGEILRLLDFENRSYEIFRRWYVDGRLFYHKVIDPDNPQGGLIELRYIDPR 160 (533) T ss_pred eecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEecCCCccccceeeeecccc Confidence 12566677764 224556677777779999999999999999999999999853 3 468888998 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecC---CccccCceeecccc-ccCCCcc Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESE---SKTIIGQRVPLSTL-YEDLEET 226 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~---~~~~lG~~v~l~~~-~~~l~~~ 226 (502) .+-++..-... ..++.++ ++...+.. +.+.++.+|.-. ....-|-.+|-+.+ |. T Consensus 161 ~i~~vr~i~~~-----------~~~~~~~-~~~~~~v~-----~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~----- 218 (533) T protein:vir:10 161 KIRKINETEQK-----------RPEQLRG-LPLNQQLS-----PKSAEYFLYDPKGLKNSTTQGLKIAPDSICYV----- 218 (533) T ss_pred ceeeeeeeecc-----------CCCccce-eecchhhh-----ccceeeeeeccccccccCCCceecchhheeee----- Confidence 88775321000 0000000 00000000 011111122100 00111222332211 11 Q ss_pred eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH--HHHhhccceee-ech---------HHh Q lcl|NC_012753. 227 VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT---------QMI 294 (502) Q Consensus 227 ~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~---------~~l 294 (502) -+|+ . +.....=+|-|..+......|=-+-+.++ +-.++-..||| |+- .+| T Consensus 219 --hSGl--~-------------d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl 281 (533) T protein:vir:10 219 --HSGI--M-------------DLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYL 281 (533) T ss_pred --eccc--e-------------eCCCCceeccchHhHHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 0111 0 00111112333333333332222111111 11233344554 211 011 Q ss_pred ----c-----cCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh Q lcl|NC_012753. 295 ----K-----TEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM 365 (502) Q Consensus 295 ----~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 365 (502) . .+-+...++++.++-+..-...|..- --+|+.+.-|+++..--...+ ++-++.+.+.+....++|.+. T Consensus 282 r~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLP-RReGgrgTEItTLpGgqnLge-m~DV~YF~kKLY~aLnVP~SR 359 (533) T protein:vir:10 282 REVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLP-RREGGRGTEITTLPGGQNLGE-LEDVKYFQKKLYKSLNVPGSR 359 (533) T ss_pred HHHHHhccceEEEeccCceecccchhhhhHhhhccc-ccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccc Confidence 0 01122233333332222222222211 113333344666554322222 244666777888888888777 Q ss_pred ccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCCHHH Q lcl|NC_012753. 366 FSFDGKS-MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNA 442 (502) Q Consensus 366 ~~~~~~~-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~ 442 (502) ++.+++. ..-++||....-....-+.+++..|..-+.++++.-|.|-. ++...-+.. ..+.++|...--..+.. T Consensus 360 l~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKg---iit~eeW~~i~~~I~~~f~~Dn~f~ElK 436 (533) T protein:vir:10 360 LETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKG---VISIEEWDQMKEHIQYDYIADNYFAELK 436 (533) T ss_pred cCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---CCCHHHHHHHhhcceEeeeecchHHHHH Confidence 7765432 22345665555555667888888888888888887665432 222221111 34778886654444444 Q ss_pred HHHHHHHHH---h------cCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhccc-----------CCCCCccccCCCCC Q lcl|NC_012753. 443 EFDYWSKMV---A------AGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVST-----------DSFRTSEEVDIYGE 502 (502) Q Consensus 443 ~~~~~~~~~---~------~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~-----------~~~~~~~~~~~~g~ 502 (502) +++.+..-. + +-..|.+++.++..-.||+|.+++.+.|++|....- .....|.-+|..+| T Consensus 437 e~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~ 516 (533) T protein:vir:10 437 EIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAE 516 (533) T ss_pred HHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccc Confidence 444433211 1 124699998888889999999999999998864321 12223334444444 No 170 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=97.65 E-value=3.3e-05 Score=45.14 Aligned_cols=430 Identities=13% Similarity=0.108 Sum_probs=192.4 Q ss_pred CCh--hHHHHHHHHHHh-----------hcccccchhhhhcccccc------------------CC---HHHHHHHHHHH Q lcl|NC_012753. 1 MGI--IQTIKNFIKRSN-----------YVITNQSLNSITDHPKIA------------------IS---PEEYNRIMDNL 46 (502) Q Consensus 1 m~~--~~~ik~~i~~~~-----------~~~~~~~l~~i~~~~~~~------------------~~---~~~~~~i~~~~ 46 (502) |++ .+-.+-|++.-- .++.+-...+-...+.+. +. .....-|+.++ T Consensus 1 m~f~~~~lf~f~~~~de~~~~~~~~~~~~S~~~p~~dDGa~~i~~~~~~~~~~~~~~~q~~y~~~e~~~~~~~eLI~~YR 80 (523) T protein:vir:68 1 MKFNILSLFAPWAKMDERDYKDQEKENLESITSPKLDDGAKEYEVSENEAQQTYNAMFQRMFGSQEPGLKSTRELIDTYR 80 (523) T ss_pred CCCchhhhhhhhhhhhhhhhhhhhhccCCCccccCCCCcceeeeccccccccccchhhhhhhhccccccchHHHHHHHHH Confidence 766 444444444211 011111000000000000 00 01111222222 Q ss_pred HHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh-h----hcCcceEeeCCH--------HHHHHHHHHHhhcc Q lcl|NC_012753. 47 RYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL-V----FNEQATIRVDNE--------VADAFINETLKNDK 113 (502) Q Consensus 47 ~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~-l----~~ep~~i~~~d~--------~~~e~l~~~~~~~~ 113 (502) .+... +-.--.|+..++= + ..+|+++.+++- ...+..+.+++--+ T Consensus 81 ~ma~~----------------------pEvd~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~ 138 (523) T protein:vir:68 81 NLMTN----------------------YEVDNAVSEIVSDAIVYEDDTEVVSINLDNTKFSPNIKSMMLDEFNEVLNHLS 138 (523) T ss_pred HHhhc----------------------cchhhHHHHhhcceeeecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhc Confidence 22211 1111112221111 1 235667777642 24566677887779 Q ss_pred HHHHHHHHHHHHhhcCCEEEEEEEeCC----c-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCce-EEEEEEE Q lcl|NC_012753. 114 FSKNFERYLESCLALGGLAMRPYIDGD----Q-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVK-YYSLIEF 187 (502) Q Consensus 114 f~~~~~~~~~~~~~~G~~~~~~~~d~~----~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~-~yt~~E~ 187 (502) |+++..+.+....+-|..|++.++|+. + ..+..++|..+-++..- ..+...+. .++-+ T Consensus 139 F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i--------------~~~~~~g~~vi~~~-- 202 (523) T protein:vir:68 139 FQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREV--------------ITTTEAGVKIVKGY-- 202 (523) T ss_pred cchhhhHHHHhheeeeEEEEEEEeeCCCccccceeeeeeCCcceeEEEee--------------cCCCCcchhhhhhh-- Confidence 999999999999999999999999853 3 46888888876554221 11111110 11100 Q ss_pred EEEeCCeEEEEEEEEecC--Cc--cccCcee--eccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhh Q lcl|NC_012753. 188 HEWNKETYTISNELYESE--SK--TIIGQRV--PLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDN 261 (502) Q Consensus 188 h~~~~~~~~I~~~l~~~~--~~--~~lG~~v--~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~ 261 (502) ..|++-+.-+.+. ++ ...|+.| |-+. +++...+- +..+.. .=+|-|.. T Consensus 203 -----~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dA--------I~y~hSGL-----~d~~~~--------~i~gyLhk 256 (523) T protein:vir:68 203 -----KEYFIYDTSHESYACDGRIYEAGTKIKIPKAA--------IVYAHSGL-----VDCCGK--------NIIGYLHR 256 (523) T ss_pred -----hhheeeccccccccccccccCCCcceecchhh--------eeeeeccc-----eeCCCC--------ceeccchh Confidence 0111111000000 00 0011111 1111 11111000 000000 00233333 Q ss_pred HHHHHHHHHHHHHHHH--HHHhhccceee-ech---------HHhc---------cCCCCCCcccCccccccccchhhcc Q lcl|NC_012753. 262 AKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT---------QMIK---------TEYDTNGEKVTVKREFETGHNVYEQ 320 (502) Q Consensus 262 ~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~---------~~l~---------~~~~~~g~~~~~~~~~~~~~~~~~~ 320 (502) +......|=-.-+.++ +-.++-..||| |+- .+++ .+-+...++++.++-+..-...|.. T Consensus 257 AiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWL 336 (523) T protein:vir:68 257 AIKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWL 336 (523) T ss_pred hhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcc Confidence 3322222221111111 11233334444 211 0110 0112222333322222222222211 Q ss_pred ccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccc--cccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 321 FDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKS--MKTATEVVSEQSDTYQMRNSIATLVE 398 (502) Q Consensus 321 ~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~--~~tAtei~~~~~~l~~~~~~~~~~~~ 398 (502) - --+|+.+.-|+++..--...+. +-++.+.+.+....++|.+.+..++++ ..-++||....-....-+.+++..|. T Consensus 337 p-RReGgrgTEItTLpGgqnlgem-~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs 414 (523) T protein:vir:68 337 Q-RRDGKAVTEVDTLPGADNTGNM-EDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQHKFE 414 (523) T ss_pred c-ccCCCcccceeeccccCCcChH-HHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHHHHHH Confidence 1 1133333446666543222222 346667778888888887777443321 11244565444445567788888888 Q ss_pred HHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHHHhcCC Q lcl|NC_012753. 399 KSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAIEKTLN 467 (502) Q Consensus 399 ~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l~~~~~ 467 (502) .-+.++++.-|.+-. ++...-+.. ..+.++|...--..+..+++.+..-. ..+ ..|.+++.++... T Consensus 415 ~lf~~~Lk~qLilKg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr 491 (523) T protein:vir:68 415 EIFLDPLKTNLILKG---IITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQ 491 (523) T ss_pred HHHHHHHHHhhhhcc---CCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhc Confidence 888888887665432 222211111 34778886654444444444433211 112 4589998888889 Q ss_pred CCHHHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 468 VTKEQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 468 ~~deea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) .||+|.+++.+.|++|....--..+..+.-|| T Consensus 492 ~tDeei~~~~kqI~~E~k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 492 MSDEEIEQEAKQIEEESKEARFQDPDQEQEDF 523 (523) T ss_pred cCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 99999999999999998776655555666666 No 171 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=97.60 E-value=3.9e-05 Score=44.74 Aligned_cols=423 Identities=11% Similarity=0.121 Sum_probs=195.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+..-.--.+... +..+ ...+ .....-|+.++.++.. +-+. +--..|| T Consensus 47 ~~~~~~~~~~~~~---------~~~~--~~~~---~n~~eLI~~YR~ma~~--pEvd----------------~Av~eIv 94 (521) T protein:vir:10 47 IDTTAPKTAIVQS---------VLGY--APKI---QNTKDLINQYRSLSKY--HEVD----------------NAIDEII 94 (521) T ss_pred CCccccccchhhh---------hhcc--cccc---chHHHHHHHHHHHhhc--cchh----------------hHHHhhh Confidence 2211100000000 0000 0000 0122335555555433 1110 0001122 Q ss_pred HHHhhh-hhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC----c-eEEEE Q lcl|NC_012753. 81 KKVASL-VFNEQATIRVDNE--------VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD----Q-IRVSF 146 (502) Q Consensus 81 ~~~a~~-l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~----~-~~i~~ 146 (502) +...-+ -..+|+.+.+++- ...+..+.+++--+|+++..+.+....+-|..|++..+|++ + ..+.. T Consensus 95 neaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~ 174 (521) T protein:vir:10 95 NDAIVQEDNRDTVYLDLDKTDWNESVKEMVREEFRTILKLLKFEREGKRHFRRWYVDSRIYFHKMIDPARPKDGIKELRL 174 (521) T ss_pred cceEEecCCCceEEEEecCcccchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeeEEEEEEeeCCCccccceeeee Confidence 221111 1224666666532 34566677777779999999999999999999999999843 3 46888 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCce-EEE-EEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVK-YYS-LIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE 224 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~-~yt-~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~ 224 (502) ++|..+-++...... ...+. .++ ..|++..+... +.-|...+....+-++|.+. T Consensus 175 lDPr~i~~vr~i~k~--------------~~~~~~v~~~~~e~f~Y~~~~----~~~~~~~g~~~~~vkI~~da------ 230 (521) T protein:vir:10 175 LDPRNVEYYRVNLKS--------------NENGNDVYKGVKEFFTYGATE----DNRYNISGNSNNLVQIPIDA------ 230 (521) T ss_pred eCCcceeeeeeecCC--------------CCCcchhhccceeeeeeccCC----CceecCCCCCCcceeechhh------ Confidence 888876554321110 00110 000 01221111000 00011100001111122111 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH--HHHhhccceee-ech---------H Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT---------Q 292 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~---------~ 292 (502) +++...+- .+...+..+|-|..+......|=-+-+.++ +-.++-..||| |+- . T Consensus 231 --I~y~hSGL-------------~d~~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDvGnlpk~KAeq 295 (521) T protein:vir:10 231 --IVYSHSGK-------------VDIDGKTIVGYLHNVIKPANQLKMLEDAMVIYRITRAPERRVFYIDVGTMPNKKATQ 295 (521) T ss_pred --eeeecccc-------------eeCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHH Confidence 11111110 122233445555555444444432222222 11234444554 211 0 Q ss_pred Hh----c-----cCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCCh Q lcl|NC_012753. 293 MI----K-----TEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVST 363 (502) Q Consensus 293 ~l----~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~ 363 (502) ++ . .+-+...++++.++-+..-...|..- --+|+.+.-|+++..--...+ ++-++.+.+.+....++|. T Consensus 296 Yl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEI~TLpggqnlge-m~DV~YF~kkLy~aLnVP~ 373 (521) T protein:vir:10 296 HLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLM-RRDGKATTEVSTLPGAQSMGE-MDDVRWFNRKLYESMKIPL 373 (521) T ss_pred HHHHHHHhcCceEEEeccCceeccchhhhhhHhhhccc-ccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCc Confidence 11 0 01122223333222222222222211 113333344666554332222 2446667778888888887 Q ss_pred hhccccccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCC Q lcl|NC_012753. 364 GMFSFDGKS--MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTD 439 (502) Q Consensus 364 ~~~~~~~~~--~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d 439 (502) +.+..+++| .--++||....-....-+.+++..|..-+.++++.-|.+-. ++...-+.. ..+.++|...--.. T Consensus 374 sRl~~e~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKg---iit~eew~~i~~~I~~~f~~Dn~f~ 450 (521) T protein:vir:10 374 SRLPQEGAGVTFGAGNDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKG---KMSVSEWEEQAENIKVVFSKDSYYE 450 (521) T ss_pred cccCCCCCceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---CCCHHHHHHHhhcceEEeeecchHH Confidence 777665432 11234555444445567788888888888888887665432 222211111 34778886654444 Q ss_pred HHHHHHHHHH---HH--hcC------CCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 440 RNAEFDYWSK---MV--AAG------FAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 440 ~~~~~~~~~~---~~--~~G------i~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) +..+++.+.. +. ..+ ..|.+++.++....||+|.+++.+.|++|....--..+..+..|| T Consensus 451 ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~~~~~~~~p~~e~~df 521 (521) T protein:vir:10 451 EIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDGELKDSVYKNPEDPMEEF 521 (521) T ss_pred HHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCCCcchhhcC Confidence 4444443321 11 123 688899877888999999999999999998775444445555666 No 172 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=97.57 E-value=4.4e-05 Score=44.46 Aligned_cols=435 Identities=12% Similarity=0.133 Sum_probs=194.4 Q ss_pred CChhHHHHHHHHHHhh-----------cccccchhhhhcccc-----------------ccCC-HHHHHHHHHHHHHhcC Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNY-----------VITNQSLNSITDHPK-----------------IAIS-PEEYNRIMDNLRYFAG 51 (502) Q Consensus 1 m~~~~~ik~~i~~~~~-----------~~~~~~l~~i~~~~~-----------------~~~~-~~~~~~i~~~~~~Y~g 51 (502) |++.+-.+-|++.--. ++.+-...+-..... .... .....-|+.++.++.. T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 7777766666542110 000000000000000 0000 0111223333333221 Q ss_pred CCCccccccCCCccccccceecchHHHHHHHHh-hhhhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHHHH Q lcl|NC_012753. 52 DFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVA-SLVFNEQATIRVDNE--------VADAFINETLKNDKFSKNFERYL 122 (502) Q Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a-~~l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~~~ 122 (502) +-+. +--..||+... .=-..+|+.+.+++. ...+..+.+++--+|+++..+.+ T Consensus 81 --pEvd----------------~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~f 142 (516) T protein:vir:10 81 --PEVE----------------RAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLF 142 (516) T ss_pred --cchh----------------hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHH Confidence 1000 00011111111 001135666666542 24566677777779999999999 Q ss_pred HHHhhcCCEEEEEEEeC---CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEE--EeCCeEEE Q lcl|NC_012753. 123 ESCLALGGLAMRPYIDG---DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHE--WNKETYTI 197 (502) Q Consensus 123 ~~~~~~G~~~~~~~~d~---~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~--~~~~~~~I 197 (502) ....+-|..|++.+.|+ |=..+..++|..+.++..-.........+.. -| .|++. .....|.. T Consensus 143 R~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~----------~~--~e~~~Y~~~~~~~~~ 210 (516) T protein:vir:10 143 RRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVK----------GY--REFFIYTTGNEGYSY 210 (516) T ss_pred hhhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhh----------hh--hheeeeccCcccccc Confidence 99999999999988874 3357888999988876432110000011000 00 01111 00011111 Q ss_pred EEEEEecCCccccCceeecccc-cc--CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHH Q lcl|NC_012753. 198 SNELYESESKTIIGQRVPLSTL-YE--DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYD 274 (502) Q Consensus 198 ~~~l~~~~~~~~lG~~v~l~~~-~~--~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S 274 (502) .-..|...+ +-.+|-+.+ |. ||.+ ....+- +|-|..+......|=-+-+ T Consensus 211 ~g~~~~~~~----~ikI~~dAI~y~hSGL~d-----~~~~~i-------------------~syLhkAiKp~NQLkm~ED 262 (516) T protein:vir:10 211 NGRIFEPNT----RIKIPRSAVVYASSGLMD-----CSDRGI-------------------IGYLHNAVKPANQLKLLED 262 (516) T ss_pred ccceeCCCc----ceeechhheeeeccccee-----CCCCce-------------------eeeehhhhHhHHhhHHHHh Confidence 001111100 111221111 10 1110 001111 2223332222222211111 Q ss_pred HHH--HHHhhccceee-ech---------HHh---------ccCCCCCCcccCccccccccchhhccccCCCCcccccee Q lcl|NC_012753. 275 EFM--WEVKMGQRRVA-VPT---------QMI---------KTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGIT 333 (502) Q Consensus 275 ~~~--~~~~~~~~~i~-v~~---------~~l---------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 333 (502) .++ +-.++-..||| |+- .++ +-+-+...++++.++-+..-...|..- --+|+.+.-|+ T Consensus 263 AlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEIt 341 (516) T protein:vir:10 263 AMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLM-RRDGKSVTEVS 341 (516) T ss_pred hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccc-ccCCCCcccee Confidence 111 11233333444 211 000 001122233333322222222222211 11333334466 Q ss_pred eeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 334 DLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM---KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILE 410 (502) Q Consensus 334 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~---~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~ 410 (502) ++..--...+ ++-++.+.+.+....++|.+.+..++++. .-++||.-..-....-+.+++..|..-+.++++.-|. T Consensus 342 TLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLi 420 (516) T protein:vir:10 342 SLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLI 420 (516) T ss_pred eccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 6554322222 24466777788888888888876554321 2355665554455567788888888888888887665 Q ss_pred HHHhhcccCCCcccc--cceEEEeCCCccCCHHHHHHHHHH-------HH--hcCCCCHHHHHHhcCCCCHHHHHHHHHH Q lcl|NC_012753. 411 LAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNAEFDYWSK-------MV--AAGFAPKTMAIEKTLNVTKEQAQEIYQK 479 (502) Q Consensus 411 ~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~~~~~~~~-------~~--~~Gi~S~et~l~~~~~~~deea~~el~r 479 (502) +-. ++...-+.. ..+.++|...--..+..+++.+.. +. -+...|.+++.++....||+|.++|-+. T Consensus 421 lKg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~ 497 (516) T protein:vir:10 421 YKR---IITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQ 497 (516) T ss_pred hcc---CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHH Confidence 432 222211111 347788866544444444443332 11 2357899998888889999999999999 Q ss_pred HHHhhhcccCCCCCccccCC Q lcl|NC_012753. 480 INDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 480 i~~E~~~~~~~~~~~~~~~~ 499 (502) |++|....--.. +....|| T Consensus 498 I~~E~~~~~~~~-p~~~~~f 516 (516) T protein:vir:10 498 IEQEAGIKRFQN-PENEDDF 516 (516) T ss_pred HHHhhhCCCCCC-CCccccC Confidence 999976543221 1223444 No 173 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=97.57 E-value=4.4e-05 Score=44.46 Aligned_cols=435 Identities=12% Similarity=0.133 Sum_probs=194.4 Q ss_pred CChhHHHHHHHHHHhh-----------cccccchhhhhcccc-----------------ccCC-HHHHHHHHHHHHHhcC Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNY-----------VITNQSLNSITDHPK-----------------IAIS-PEEYNRIMDNLRYFAG 51 (502) Q Consensus 1 m~~~~~ik~~i~~~~~-----------~~~~~~l~~i~~~~~-----------------~~~~-~~~~~~i~~~~~~Y~g 51 (502) |++.+-.+-|++.--. ++.+-...+-..... .... .....-|+.++.++.. T Consensus 1 ~~~~~lf~f~~~~d~~~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~eLI~~YR~ma~~ 80 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEYDERLKLGHESIATPKKDDGATEIETREGEATYNAVMQQFFGIDNNISGTKDLINTYRQLINN 80 (516) T ss_pred CCchHhcccccchhhhHHhhhhcCCcCcccCCCCCCCceeeecCCCcccccceeeeeeccccccchHHHHHHHHHHHhhc Confidence 7777766666542110 000000000000000 0000 0111223333333221 Q ss_pred CCCccccccCCCccccccceecchHHHHHHHHh-hhhhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHHHH Q lcl|NC_012753. 52 DFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVA-SLVFNEQATIRVDNE--------VADAFINETLKNDKFSKNFERYL 122 (502) Q Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a-~~l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~~~ 122 (502) +-+. +--..||+... .=-..+|+.+.+++. ...+..+.+++--+|+++..+.+ T Consensus 81 --pEvd----------------~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~ik~kI~eeF~~Il~ll~F~~~~~~~f 142 (516) T protein:vir:10 81 --PEVE----------------RAVANIVNEAIVYERGHKVVSLDLDDTDFGSNVKEKILEEFDEVCRLLDASRKLDTLF 142 (516) T ss_pred --cchh----------------hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHH Confidence 1000 00011111111 001135666666542 24566677777779999999999 Q ss_pred HHHhhcCCEEEEEEEeC---CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEE--EeCCeEEE Q lcl|NC_012753. 123 ESCLALGGLAMRPYIDG---DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHE--WNKETYTI 197 (502) Q Consensus 123 ~~~~~~G~~~~~~~~d~---~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~--~~~~~~~I 197 (502) ....+-|..|++.+.|+ |=..+..++|..+.++..-.........+.. -| .|++. .....|.. T Consensus 143 R~WYVDgRi~fhKiid~~k~GI~Elr~lDPr~i~~vR~i~~~~~~~~~v~~----------~~--~e~~~Y~~~~~~~~~ 210 (516) T protein:vir:10 143 RRWYVDSRIFFHKIMPNPKKGIAELRRLDPRFMEYYREIVTSDIGGTTIVK----------GY--REFFIYTTGNEGYSY 210 (516) T ss_pred hhhhhcceEEEEEEecCccccceeeeeeCCcceeeEeeecccccccchhhh----------hh--hheeeeccCcccccc Confidence 99999999999988874 3357888999988876432110000011000 00 01111 00011111 Q ss_pred EEEEEecCCccccCceeecccc-cc--CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHH Q lcl|NC_012753. 198 SNELYESESKTIIGQRVPLSTL-YE--DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYD 274 (502) Q Consensus 198 ~~~l~~~~~~~~lG~~v~l~~~-~~--~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S 274 (502) .-..|...+ +-.+|-+.+ |. ||.+ ....+- +|-|..+......|=-+-+ T Consensus 211 ~g~~~~~~~----~ikI~~dAI~y~hSGL~d-----~~~~~i-------------------~syLhkAiKp~NQLkm~ED 262 (516) T protein:vir:10 211 NGRIFEPNT----RIKIPRSAVVYASSGLMD-----CSDRGI-------------------IGYLHNAVKPANQLKLLED 262 (516) T ss_pred ccceeCCCc----ceeechhheeeeccccee-----CCCCce-------------------eeeehhhhHhHHhhHHHHh Confidence 001111100 111221111 10 1110 001111 2223332222222211111 Q ss_pred HHH--HHHhhccceee-ech---------HHh---------ccCCCCCCcccCccccccccchhhccccCCCCcccccee Q lcl|NC_012753. 275 EFM--WEVKMGQRRVA-VPT---------QMI---------KTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGIT 333 (502) Q Consensus 275 ~~~--~~~~~~~~~i~-v~~---------~~l---------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 333 (502) .++ +-.++-..||| |+- .++ +-+-+...++++.++-+..-...|..- --+|+.+.-|+ T Consensus 263 AlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEIt 341 (516) T protein:vir:10 263 AMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLM-RRDGKSVTEVS 341 (516) T ss_pred hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccc-ccCCCCcccee Confidence 111 11233333444 211 000 001122233333322222222222211 11333334466 Q ss_pred eeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 334 DLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM---KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILE 410 (502) Q Consensus 334 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~---~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~ 410 (502) ++..--...+ ++-++.+.+.+....++|.+.+..++++. .-++||.-..-....-+.+++..|..-+.++++.-|. T Consensus 342 TLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLi 420 (516) T protein:vir:10 342 SLPGAQTMGD-MDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLI 420 (516) T ss_pred eccccCCcCh-HHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 6554322222 24466777788888888888876554321 2355665554455567788888888888888887665 Q ss_pred HHHhhcccCCCcccc--cceEEEeCCCccCCHHHHHHHHHH-------HH--hcCCCCHHHHHHhcCCCCHHHHHHHHHH Q lcl|NC_012753. 411 LAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNAEFDYWSK-------MV--AAGFAPKTMAIEKTLNVTKEQAQEIYQK 479 (502) Q Consensus 411 ~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~~~~~~~~-------~~--~~Gi~S~et~l~~~~~~~deea~~el~r 479 (502) +-. ++...-+.. ..+.++|...--..+..+++.+.. +. -+...|.+++.++....||+|.++|-+. T Consensus 421 lKg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~ 497 (516) T protein:vir:10 421 YKR---IITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQ 497 (516) T ss_pred hcc---CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHH Confidence 432 222211111 347788866544444444443332 11 2357899998888889999999999999 Q ss_pred HHHhhhcccCCCCCccccCC Q lcl|NC_012753. 480 INDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 480 i~~E~~~~~~~~~~~~~~~~ 499 (502) |++|....--.. +....|| T Consensus 498 I~~E~~~~~~~~-p~~~~~f 516 (516) T protein:vir:10 498 IEQEAGIKRFQN-PENEDDF 516 (516) T ss_pred HHHhhhCCCCCC-CCccccC Confidence 999976543221 1223444 No 174 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=97.57 E-value=4.4e-05 Score=44.46 Aligned_cols=371 Identities=11% Similarity=0.101 Sum_probs=154.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||++. .+-|+- ... ....+ -..+.....+... ..+ .. .+..+.+.--...+ T Consensus 1 Mg~~~~~-~~~k~~-----~~~--------~~~~~------~~~~~~~~~~~~~-~~~--v~----~~~~l~~~~v~~~i 53 (383) T protein:vir:10 1 MGLLTPK-NFSKRN-----AKN--------MVYPS------NPAFFTTTVGGMQ-LSY--VS----ALSALQNTNVYSVI 53 (383) T ss_pred CCccccc-cccccc-----ccc--------ccccc------chhhhhhhccCcc-ccc--cc----hhHhhcchHHHHHH Confidence 9999862 111110 000 00000 0011111111100 000 00 11122222234455 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCC Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQ 160 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~ 160 (502) +.+|+-+..-| |.+.+......|++-.........+..++...+..|.+|+.+. .+.. ..++++.+ T Consensus 54 ~~ia~~ia~~~--~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gn~~~~i~--~~~~--~~~p~~~~-------- 119 (383) T protein:vir:10 54 NRIASDVSSAH--FKTENTATLNRLESPSSLIGRFSFWQGALMQLCLSGNDYIPLV--GQNL--EHIPNSDV-------- 119 (383) T ss_pred HHHHHhhccCc--eeecccchhhhhhCCCCCCCHHHHHHHHHHHhhhcCCeEEEEE--cCce--eEeecCcc-------- Confidence 66666554444 4555544444444322222344455566777777888887653 2221 12222211 Q ss_pred CeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEe Q lcl|NC_012753. 161 DVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYL 240 (502) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~ 240 (502) .+ . ...+..+.+|...+. .+ |..+.+. +. -.++| T Consensus 120 ~v-------~--~~~~~~~~~~~~~~~----~~-----------------~~~~~~~------~~----------evih~ 153 (383) T protein:vir:10 120 QI-------N--YLPGNMGIVYTVLES----ND-----------------RPKMVLR------QD----------QMLHF 153 (383) T ss_pred eE-------E--EEEcCCceEEEEEEc----CC-----------------ceEEEEc------cc----------ceEEe Confidence 00 0 001111111110000 00 1011000 00 12334 Q ss_pred cCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccch-hhc Q lcl|NC_012753. 241 KPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHN-VYE 319 (502) Q Consensus 241 ~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~-~~~ 319 (502) +.+..+ -.+...|+|.+..+...++....+..-..+-|..+...-.+ |.........+-. ..+....+ .+. T Consensus 154 r~~~~~--~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i----l~~~~~~~~~e~~--~~~~~~~~~~~~ 225 (383) T protein:vir:10 154 RLMPDP--QYRYLIGRSPLESLQNALNLDDKASKSNMSAMENQINPAGK----LTISNYLSDGKDL--ESAREEFEKANT 225 (383) T ss_pred ccCCCC--cccccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE----EEeCCCCCCHHHH--HHHHHHHHHHhC Confidence 432111 11234699998888888877665555444445544332111 2211111000000 00000000 111 Q ss_pred cccCC---CCccccceeeeccccchHHH-HHHHHHHHHHHHHhcCCChhhccccccccc---cHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 320 QFDSG---DMDKGIGITDLTTDIRSDDY-IKAINKGLSLFEMQLGVSTGMFSFDGKSMK---TATEVVSEQSDTYQMRNS 392 (502) Q Consensus 320 ~~~~~---~~~~~~~i~~~~~~ir~e~~-~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~---tAtei~~~~~~l~~~~~~ 392 (502) .-+.. --+.+.-++.++......++ .+..+...++|+...|+||..+|....+.. ++.+.... T Consensus 226 ~~n~~~~~vl~~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~---------- 295 (383) T protein:vir:10 226 GDNSGRLMVLPDGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKAT---------- 295 (383) T ss_pred ccccCCccccCCCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHH---------- Confidence 00000 00122235566666555665 467778789999999999999986433322 22222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCH Q lcl|NC_012753. 393 IATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTK 470 (502) Q Consensus 393 ~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~d 470 (502) |..+|..+++.|-...+. .+. ...+.++++.-+..|..+.++.+.+++.+|+|+.-++++.+ .|+.. T Consensus 296 ----~~~~l~P~~~~ie~~l~~-~l~------~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~t~nE~R~~lg~~p~~~ 364 (383) T protein:vir:10 296 ----YLANLNSYVNPIVDELRL-KMN------APDLELDIKDMLDVDDSILINQVSNLAKSGVLGAEQAQFILTRSGFLP 364 (383) T ss_pred ----HHHHHHHHHHHHHHHHHH-hhC------CceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCcccC Confidence 222333333332221111 111 12467777777889999999999999999999999876643 23322 Q ss_pred HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 471 EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 471 eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+ +.+.+. +. ++.. +|+ T Consensus 365 ~d----~~~~~~------~~--~~~~---gGd 381 (383) T protein:vir:10 365 DN----LPEFKP------LT--NETK---GGD 381 (383) T ss_pred Cc----ccccCC------Cc--ccCC---CCC Confidence 11 111110 00 1111 233 No 175 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=97.52 E-value=5.1e-05 Score=44.13 Aligned_cols=390 Identities=14% Similarity=0.085 Sum_probs=158.7 Q ss_pred CChhHH--HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQT--IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~--ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~ 78 (502) |+||.+ +..-+++. +....+ .. ... ....+ -.|.+. ..... ..+..+..+--.. T Consensus 1 m~~~~~~~~~~~~~~~---~~~~~~----~~---~~~-----~~~~~-~~~~~~----~~~~v----~~~~a~~~~~v~~ 56 (412) T protein:vir:26 1 MNVIAKENIVTRIKKK---LIDNWI----DQ---STS-----KLYDF-SPWKNR----SFWGV----INNTLETNETIFS 56 (412) T ss_pred Cccchhhhhhhhhhhh---Hhhhhh----cc---ccc-----ccccc-cccCCc----ccccc----chhhhhccHHHHH Confidence 999955 33322211 000000 00 000 00000 000000 00000 0111222233344 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCe Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATV 151 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~ 151 (502) .|+..|+-+..=|+.+--..+.....+..+|.. | ....-...++..++..|.+|+.+..+. |. ..+..++|+. T Consensus 57 ~i~~ia~~iA~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~ 136 (412) T protein:vir:26 57 AITKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDV 136 (412) T ss_pred HHHHHHHhHhhCceeEeeccccccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCce Confidence 555566555555655422222233333333331 2 223344556778888999998887764 44 3666777877 Q ss_pred EEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecC Q lcl|NC_012753. 152 FFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNG 231 (502) Q Consensus 152 ~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 231 (502) +-+...+.++.. + +.+. .. + |..+.+ . +. T Consensus 137 v~v~~~~~~~~~---~-y~~~-----------------~~--------------~----g~~~~~---~---~~------ 165 (412) T protein:vir:26 137 VEMLIENQSREL---Y-YSIH-----------------AA--------------T----GNKLIV---H---NM------ 165 (412) T ss_pred eEEEEeCCCcEE---E-EEEE-----------------cC--------------C----ceEEEE---c---cc------ Confidence 766433322110 0 0000 00 0 111000 0 00 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccc Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREF 311 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~ 311 (502) -.++|+.+. ..+..+|+|.+.-+...++-...+..............| +......+.... T Consensus 166 ----evih~~~~~----~~~~~~G~s~i~~~~~~i~~~~a~~~~~~~~~~~~~~~i------~~~~~~l~~e~~------ 225 (412) T protein:vir:26 166 ----DMLHFKHIV----ASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFM------LKYGSNVGKEKR------ 225 (412) T ss_pred ----cEEEeCCCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHhcCCCCceE------EecCCCCCHHHH------ Confidence 013344321 123456888887776666544333221111111111112 221111111100 Q ss_pred cccchhhccccCCCC-----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHH Q lcl|NC_012753. 312 ETGHNVYEQFDSGDM-----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSD 385 (502) Q Consensus 312 ~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~ 385 (502) ..-...+.......+ +.+.-++.++......++.+..+....+|+...|+||..++...++. +++.+....+ T Consensus 226 ~~~~~~~~~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~f-- 303 (412) T protein:vir:26 226 QQVLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRFY-- 303 (412) T ss_pred HHHHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHHH-- Confidence 000000100000000 12223555555555667888888888999999999999998654432 2333322111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc Q lcl|NC_012753. 386 TYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT 465 (502) Q Consensus 386 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~ 465 (502) ++..|..++..|....+..=+..........+.+++++-+..|..+.++.+.+++.+|+++.-++++. T Consensus 304 -----------~~~~l~P~~~~ie~~ln~kLl~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~G~~t~NE~R~~- 371 (412) T protein:vir:26 304 -----------LQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREW- 371 (412) T ss_pred -----------HHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH- Confidence 12223333333222111100111111112234444455566789999999999999999999987665 Q ss_pred CCCCHH-HHHHHHHHHHHhhhcccCC----CCCccccCCCCC Q lcl|NC_012753. 466 LNVTKE-QAQEIYQKINDETMVSTDS----FRTSEEVDIYGE 502 (502) Q Consensus 466 ~~~~de-ea~~el~ri~~E~~~~~~~----~~~~~~~~~~g~ 502 (502) .|+..- ..++-+- .-+....+. .....+++-.+. T Consensus 372 ~gl~p~~ggD~~~~---~~n~~~~~~~~~~~~~~~gG~~n~~ 410 (412) T protein:vir:26 372 EDLPPVEGGDKPLI---SGDLYPIDTPLELRKSLKGGDKNVN 410 (412) T ss_pred hCCCCCCCcCeeee---cccccccccchhhcccccCCCCCcC Confidence 344331 1111110 000000000 011222222222 No 176 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=97.52 E-value=5.2e-05 Score=44.08 Aligned_cols=397 Identities=11% Similarity=0.089 Sum_probs=162.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |++|+++|.++..-.. .+...-.. .-.++. .+..++..- . ..+.... ..-+.+.---.. T Consensus 7 ~g~~~~~~~~~~~~~~----~~~~~~~~--~~~~~~-------~~~~~~~~~----~---~~g~~v~~~~a~~~~aV~~~ 66 (432) T protein:vir:97 7 LGLLGQLKAMFVPPDP----VDIGGGQT--FTPVNA-------TARDLGIII----S---DTGAAVNADAIMRLDAVAAC 66 (432) T ss_pred CchhhhhHhhcCCccc----cccccccc--cccCch-------hhhhhcccc----c---ccCcccchHhhhcchHHHHH Confidence 9999999999864211 11100000 000100 001111110 0 0111110 011112222334 Q ss_pred HHHHhhhhhcCcceEeeC--C---HHHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEc Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVD--N---EVADAFINETLK--ND---KFSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQ 148 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~--d---~~~~e~l~~~~~--~~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~ 148 (502) |+..|+-+-.=|+.+--. + +..+.-+..+|. -| ....-+..++...+..|.+|+.+..++|++ .+..++ T Consensus 67 v~~Ia~~ia~lp~~~y~~~~~g~~~~~~~pl~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~~~~~~g~~~~L~~l~ 146 (432) T protein:vir:97 67 VKLVSQAVAAMPLMMYMRTPDGRKEAVNHPLYTLLLDGPNSTQTAFDFWQVVVTRLLLDGTAYVRKVVTDGRIESLQYLA 146 (432) T ss_pred HHHHHHhhccCceEEEEecCCCcccccccHHHHHHHhcccccCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEEEEEEEc Confidence 455555554445543211 1 111112223332 12 222344455667788899998888877664 566778 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+.+-++....+.. +++++.. ++..+ .++- .+ T Consensus 147 p~~v~v~~~~~g~~-----~y~~~~~-----------------~g~~~----------------~~~~--------~~-- 178 (432) T protein:vir:97 147 NDRLTITTDTKGNT-----AYRYRRT-----------------DGQMI----------------DIPR--------QQ-- 178 (432) T ss_pred CcceEEEEcCCCcE-----EEEEEec-----------------CceEE----------------EEcc--------cc-- Confidence 87776653222211 1111100 00000 0000 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) +++++.+. .+...|+|-+.-+...++....+-....+-|..+...-.| |.....-+.... T Consensus 179 --------iih~r~~~-----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gi----l~~~~~l~~e~~--- 238 (432) T protein:vir:97 179 --------IWKIMGYS-----LDGENGLSAIRYGAQIFGTAIAAEAQAARAFRNGQLQSVY----YQIDRFLTDDQY--- 238 (432) T ss_pred --------EEEecCcC-----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccee----EecCCCCCHHHH--- Confidence 12233221 1224588887776666544333322222234444332222 322211110000 Q ss_pred ccccccchhhcccc-C---CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccc-cHHHHHHHH Q lcl|NC_012753. 309 REFETGHNVYEQFD-S---GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK-TATEVVSEQ 383 (502) Q Consensus 309 ~~~~~~~~~~~~~~-~---~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~-tAtei~~~~ 383 (502) ..+ ...+.... . .--+.+..++.++.....-++.+..+....+|+...|++|..+|....+.. ++..+.... T Consensus 239 ~~~---~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~~s~~e~~~ 315 (432) T protein:vir:97 239 DSF---SKKVSGSVEAGRAPLLEGGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQ 315 (432) T ss_pred HHH---HHHHhhhhcCCCceecCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCcccccchhHHHHH Confidence 000 01111000 0 000112235566665666778888888899999999999999987654321 222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) ... ++.+|..+++.|-...+. .++.........+.++++.-+-.|..+.++...+++.+|+++.-++++ T Consensus 316 ~~f----------~~~tl~P~~~~ie~~ln~-kLl~~~e~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~ 384 (432) T protein:vir:97 316 LGF----------LTMTLSPWLRRIEQSIAL-NLLTPAERRRYFADFDTSALLRADSAARSSYYSQLVNNGLMTRDEARE 384 (432) T ss_pred HHH----------HHHHHHHHHHHHHHHHhh-hccCccccCceEEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHH Confidence 111 222333333322221111 111111111123444444445678889999999999999999998765 Q ss_pred hc--CCCCHHHHH----HHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 464 KT--LNVTKEQAQ----EIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~--~~~~deea~----~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .. +++..+... .-+.-+.. ....+. +.+..++-..+ T Consensus 385 ~~glpp~~g~~~~~~~~~~~~pl~~--~~~~~~-~~~~~~~~~~~ 426 (432) T protein:vir:97 385 IEGLPKLGGNAAVLTVQSAMVPLDS--IGLQAS-PEPASGLGNQQ 426 (432) T ss_pred HhCCCCCCCCcceEeecccccchhh--hcccCC-CCCCCCCCCcc Confidence 43 233322100 00000110 000011 11111111111 No 177 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=97.51 E-value=5.4e-05 Score=44.00 Aligned_cols=373 Identities=12% Similarity=0.129 Sum_probs=155.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+|++++++++.+. ..... .+. .++.+... ...... ...+-+...-....+ T Consensus 1 MGl~~~~~~~~~~~--~~~~~-------------------~~~---~~~~~~~~-~~~~~v----t~~~al~~~~v~~~i 51 (394) T protein:vir:62 1 MGLRDRFSNYLFKK--AEKRG-------------------YLD---NVLGKSIR-YSGVYV----TDSNILQSSDVYELL 51 (394) T ss_pred CchhhhhhhhccCC--CCchh-------------------hhh---hhhhcccc-cCcccc----ChhhhhccHHHHHHH Confidence 99999988764321 00000 000 11111100 000000 011122333445566 Q ss_pred HHHhhhhhcCcceEeeCC-H-HHHHHHHHHHhh-c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDN-E-VADAFINETLKN-D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFP 154 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d-~-~~~e~l~~~~~~-~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~P 154 (502) +..|+-+-.=|+.+--.+ + .....+..++.. | ....-...++...+..|.+|+.+ +.+.+. .+..+.| T Consensus 52 ~~Ia~~iA~lp~~v~~~~g~~~~~~~~~~Ll~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i--~~~~~~----~~~~~~~ 125 (394) T protein:vir:62 52 QDISNQMVLADIVVEDEFGNEIKDDIALQILRNPNNYLTQSEFIKLMTNTYLLEGETFPIL--NGAQIH----LASNVFT 125 (394) T ss_pred HHHHHhhcccceEEEcCCCcccchhhHHHHhccCCCCCCHHHHHHHHHHHHHhcCCeEEEE--ecceee----ccccceE Confidence 666666666565543222 1 112223333332 1 22334444566677778877654 322211 0112222 Q ss_pred EEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCc Q lcl|NC_012753. 155 LQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTR 234 (502) Q Consensus 155 i~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~ 234 (502) .. + +++.++ +. .+ |..++-. T Consensus 126 ~~------------------~-~~~~~~-----~~--~~-----------------~~~~~~~----------------- 145 (394) T protein:vir:62 126 EL------------------D-DNLVEH-----FN--IG-----------------GHEIPPC----------------- 145 (394) T ss_pred EE------------------C-CceEEE-----Ee--eC-----------------CEEechh----------------- Confidence 11 0 011000 00 00 1111100 Q ss_pred ceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc Q lcl|NC_012753. 235 PLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG 314 (502) Q Consensus 235 ~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~ 314 (502) -+++++.+.. +..+|+|.+.-+...|.....+.....+-+..+...=++ |..........-. ...+... T Consensus 146 -eiih~r~~~~-----d~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i----l~~~~~~~~~~~~-~~~~~~~ 214 (394) T protein:vir:62 146 -MIRHVKNIGA-----DHLRGKGILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFL----LNLDAHINPQNGA-QSKLINA 214 (394) T ss_pred -heEEecCcCC-----CCccccChHHHHHHHHHHHHHHHHHHHHHHHccCCcceE----EEeCCCCCcCHHH-HHHHHHH Confidence 0234443211 224688988877777766555444333445554322111 2211111100000 0000000 Q ss_pred -chhhcccc------CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHH Q lcl|NC_012753. 315 -HNVYEQFD------SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTY 387 (502) Q Consensus 315 -~~~~~~~~------~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~ 387 (502) ...+.... .-..+.+--+..++......++.+..+....+|+...|+||..+|.... +++.+.. T Consensus 215 ~~~~~~g~~n~g~~~vl~~g~~~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~--sn~e~~~------- 285 (394) T protein:vir:62 215 ILDQLESIDEARSVKMIPLGKGYSIDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELIK--EDIEKAM------- 285 (394) T ss_pred HHHHhccccccCceeEeeCCCceeEEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCCC--cCHHHHH------- Confidence 01111100 0011111112344555556778888888899999999999999975332 1222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc-- Q lcl|NC_012753. 388 QMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT-- 465 (502) Q Consensus 388 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~-- 465 (502) +..++.+|..++..|....+. .++... ....+.|.|+.....+.++.++...+++.+|+|+.-++++.. T Consensus 286 ------~~~~~~~l~P~~~~ie~~l~~-kll~~~--~~~~~~~~fd~~~~~~~~~~~~~~~~~~~~g~~T~NE~R~~~gl 356 (394) T protein:vir:62 286 ------MYIHNKAVRPIMKNFEDHLSL-LFYAQN--SGKRIKFKINILDFVTYSNKTNIGYNLVRTAITSPDNVADMLGF 356 (394) T ss_pred ------HHHHHHHHHHHHHHHHHHHhh-hhcCcc--ccCceEEEechhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 111233344443333222221 111111 123577889888778888888889999999999999876643 Q ss_pred CCCCHHHHHHH-----HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 466 LNVTKEQAQEI-----YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 466 ~~~~deea~~e-----l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++++++.... +..+.+.++. ...+.+++- .| T Consensus 357 ~p~~~~~gd~~~~~~n~~~~~~~~~~----~~~~kgge~-~e 393 (394) T protein:vir:62 357 PKQNTKESQAIYISNDVTEIGKKEAT----DGSLGGGEE-NE 393 (394) T ss_pred CCCCCCCCCeeecccccccccccccc----cccCCCCCC-CC Confidence 23433333222 1121111111 111111111 22 No 178 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=97.44 E-value=6.8e-05 Score=43.44 Aligned_cols=434 Identities=11% Similarity=0.079 Sum_probs=166.4 Q ss_pred CChh---------HHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHH-HHHHhcCCCCc-cc----------cc Q lcl|NC_012753. 1 MGII---------QTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMD-NLRYFAGDFDS-VT----------YR 59 (502) Q Consensus 1 m~~~---------~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~-~~~~Y~g~~~~-~~----------~~ 59 (502) ||-+ +.|.++.+...|.+...+..+-+-+..+ .+.+.+++-.. ...-|...... .. .+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (574) T protein:vir:80 1 MPKWLDKALGIEKSSIEETRNMENYKMHLREIDTNVVNNEP-YSMESIEKGMNGKTTAYMQPIIGEMSVNPGYKTKPSIR 79 (574) T ss_pred CcchhhhhhccchhhHHHHHhhhhhccccchhhhhhhhccC-CCHHHHHHhHhhhcccccchhhhhccccccccCcCccC Confidence 4432 4666776666664333222221111111 11122221111 11111111100 00 00 Q ss_pred cCCCc-cccccceecchHHHHHHHHhhhhh-----------cCcceEeeCC---------HHHHHHHHHHHhh------- Q lcl|NC_012753. 60 DSNGS-QVKRDFNHLPIGRTASKKVASLVF-----------NEQATIRVDN---------EVADAFINETLKN------- 111 (502) Q Consensus 60 ~~~~~-~~~~~~~~~n~~k~iv~~~a~~l~-----------~ep~~i~~~d---------~~~~e~l~~~~~~------- 111 (502) +...- ..-+.....++...+++..++-+. +=|..|...+ ......|.+++.+ T Consensus 80 ~~~~~~~~l~~~~~~~iv~~~i~~~~~~V~~~~~~i~~~ia~lp~~i~~kd~~~~~~~~~~~~~~~l~~ll~~~~~~~nP 159 (574) T protein:vir:80 80 NSQDLHKTLKKFGNNIILNAIINTRSNQVSMYCKPARNSETGVGYEIRLKDIEAEPTSHDIANIKRIESFLENTAQFRDP 159 (574) T ss_pred CcccHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHHhhhccCceEEEEeccCCCccchhhhhhhHHHHHHhccCCCCCC Confidence 00000 000011111222333333322221 1233332211 1122345555532 Q ss_pred --ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEE Q lcl|NC_012753. 112 --DKFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEF 187 (502) Q Consensus 112 --~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~ 187 (502) ..|..-+..++...+..|.+|+.+..+. |. ..+..++|..+.+.....+.... ....||. T Consensus 160 ~~~s~~ef~~~lv~~lll~Gnayi~i~r~~~G~~~~L~pl~p~~V~v~~d~~~~~~~------------~~~~y~~---- 223 (574) T protein:vir:80 160 NRDNFTTFCKKLVRATYMYDQVNFEKVFDKDGNFIKFDTVDPTTIFLATNGEGKLIK------------NGERFVQ---- 223 (574) T ss_pred ccccHHHHHHHHHHHHHhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCcccccc------------CceEEEE---- Confidence 1234455666777888999998887775 44 35777888887775322211100 0001110 Q ss_pred EEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHH Q lcl|NC_012753. 188 HEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMD 267 (502) Q Consensus 188 h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid 267 (502) ...++. ... | + +.+ .++|+.+. +.......+|+|.+.-+...|. T Consensus 224 -~~~g~~-~~~---~------------~--------~~e----------iih~~~~~-~~~~~~~~~G~spi~~a~~~i~ 267 (574) T protein:vir:80 224 -VIDNRI-VAK---F------------N--------ERE----------LAFAVRNP-RADIEVGQYGYPELEIALKQFI 267 (574) T ss_pred -EeCCce-EEE---E------------c--------ccc----------EEEEeccC-CCCcccccccccHHHHHHHHHH Confidence 000000 000 0 0 000 12333211 1112234679999888877776 Q ss_pred HHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc--ccccc-chhhccccCCC-----Cccccceeeecccc Q lcl|NC_012753. 268 FINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR--EFETG-HNVYEQFDSGD-----MDKGIGITDLTTDI 339 (502) Q Consensus 268 ~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~--~~~~~-~~~~~~~~~~~-----~~~~~~i~~~~~~i 339 (502) ....+..-..+-|..+...-.| |....+. ...+.. .+... ...|....... .+.+.-++.++... T Consensus 268 ~~~~a~~~~~~~f~ng~~p~gi----l~~~~~~---~ls~e~~~~lk~~~~~~~~G~~n~g~~~vl~~~G~~~~~l~~s~ 340 (574) T protein:vir:80 268 AHENTEVFNDRFFSHGGTTRGI----LHVKTGQ---QQSQQALDIFRREWRSSLAGINGSWQIPVVSAEDVKFVNMTPSA 340 (574) T ss_pred HHHHHHHHHHHHHhccCCCceE----EEeCCCC---CCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEEccCCh Confidence 5555444334445554432111 2111110 011000 00000 01111100000 01122355566666 Q ss_pred chHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012753. 340 RSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI-ATLVEKSLKELVISILELAKVYNLY 418 (502) Q Consensus 340 r~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~-~~~~~~~l~~l~~~il~~~~~~~~~ 418 (502) ..-++.+..+...++|+...|++|..+|+...+..+++...... +.++... ...++.+|..+++.|-...+. .++ T Consensus 341 ~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n---~sn~E~~~~~f~~~tL~P~~~~ie~~ln~-~Ll 416 (574) T protein:vir:80 341 NDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLN---EGNSKEKMQASQNKGLQPLLRFIEDTVNT-YIV 416 (574) T ss_pred hHHHHHHHHHHHHHHHHHHhCCCHHHhccccccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHHHHHh-hhh Confidence 67788888888999999999999999997655432222211110 0111111 111233344443333222221 111 Q ss_pred CCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCH-HHHH--HHHHHHH----H---h--- Q lcl|NC_012753. 419 TGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTK-EQAQ--EIYQKIN----D---E--- 483 (502) Q Consensus 419 ~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~d-eea~--~el~ri~----~---E--- 483 (502) .. ....+.+.|+..-..+..+ .....+++.+|+|+.-+++..+ +++.. |+.- .-+..+. . + T Consensus 417 ~~---~~~~~~~~f~~~d~~~~~~-~~~~~~~~~~G~lT~NE~R~~lgl~Pi~gGD~~~~~~n~~~~~~~~~~~~~~~~~ 492 (574) T protein:vir:80 417 AE---FGEKYQFQFRGGDLSAQLD-KLKIIEQEGKVFRTVNEIRHDKGLEPIKGGDVILNGVHIQAIGQALQEEQLEYQR 492 (574) T ss_pred hh---cCCceEEEecccchhhHHH-HHHHHHHHhCCccCHHHHHHHhCCCCCCCCCEeeeccceeecccccccccCCccc Confidence 11 1124566777665554443 3344556778999999876653 23321 0000 0000000 0 0 Q ss_pred hhcccCCCCCccccCCCCC Q lcl|NC_012753. 484 TMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 484 ~~~~~~~~~~~~~~~~~g~ 502 (502) +.+..+....+.+.+.-+. T Consensus 493 ~~~~~~~~~~~~~~~~~~~ 511 (574) T protein:vir:80 493 SQDRLNRLLELSGGDVEQP 511 (574) T ss_pred hhccccccccccCCCCCCC Confidence 0000000001111111110 No 179 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=97.41 E-value=7.3e-05 Score=43.26 Aligned_cols=385 Identities=14% Similarity=0.082 Sum_probs=159.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+|++++|+.+..-+-. .+.....+ +.. |.++ .+.... .+..+..+--...+ T Consensus 4 ~~~~~~~k~~~~~~~~~---~~~~~~~~----------------~~~-~~~~----~~~~v~----~~~a~~~~~V~~ci 55 (409) T protein:vir:96 4 ENIVTRIKKKLIDNWID---QSASKLYD----------------FSP-WKNK----SFWGVI----NNTLETNETIFSAI 55 (409) T ss_pred ccchhhhhhHHhhhhhc---cccccccc----------------ccc-ccCc----cccccc----hhhHhhhHHHHHHH Confidence 89999999986432111 11100000 000 1111 000000 01111112223344 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEEcCCeEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSFVQATVFF 153 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~v~~~~~~ 153 (502) +..|+-+-.=|+.+--+.+.....+..+|.. | .-..-...++...+..|.+|+.+..+.. . ..+..++|+.+- T Consensus 56 ~~ia~~ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:96 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceEEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeE Confidence 5555555444544322222222233333321 2 2233445567778889999998877653 3 355556676655 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) ++..+.+.. .+|+ ++ .. + |..+.+ . +. T Consensus 136 v~~~~~~~~-----------------~~y~---~~-~~--------------~----g~~~~~---~---~~-------- 162 (409) T protein:vir:96 136 MLIENQSRE-----------------LYYS---IH-AA--------------T----GNKLIV---H---NM-------- 162 (409) T ss_pred EEEeCCCcE-----------------EEEE---EE-cC--------------C----ceEEEE---c---cc-------- Confidence 442221111 1111 00 00 0 111100 0 00 Q ss_pred cceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFET 313 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~ 313 (502) -.++|+.+. ..+..+|+|.+.-+...++....+..............|+. ....-+.... .. T Consensus 163 --evih~r~~~----~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~------~~~~l~~e~~------~~ 224 (409) T protein:vir:96 163 --DMLHFKHIV----ASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLK------YGSNVSTEKR------QQ 224 (409) T ss_pred --cEEEeCCCC----CCCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCceeEEe------cCCCCCHHHH------HH Confidence 013343211 12334688888777766664433322222211111111222 1111111000 00 Q ss_pred cchhhccccCCC-----CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHH Q lcl|NC_012753. 314 GHNVYEQFDSGD-----MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTY 387 (502) Q Consensus 314 ~~~~~~~~~~~~-----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~ 387 (502) -...+....... -+.+.-++.++......++.+..+....+|+...|+||..+|....+. +++++.... T Consensus 225 ~~~~~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~----- 299 (409) T protein:vir:96 225 VLEDFKQYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRF----- 299 (409) T ss_pred HHHHHHHHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH----- Confidence 000010000000 012223555665566667888888888999999999999998654332 233332211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC-CCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Q lcl|NC_012753. 388 QMRNSIATLVEKSLKELVISILELAKVYNLYT-GEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTL 466 (502) Q Consensus 388 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~-~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~ 466 (502) .++.+|..++..|-...+. .++. ........+.++.+.-+-.|..+.++...+++.+|+++.-++++.. T Consensus 300 --------f~~~~l~P~~~~ie~~l~~-~Ll~~~~~~~g~~i~fd~~~ll~~d~~~~~e~~~~~~~~G~~T~NE~R~~~- 369 (409) T protein:vir:96 300 --------YLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE- 369 (409) T ss_pred --------HHHHHHHHHHHHHHHHHHh-hcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh- Confidence 1223333333333221111 1111 1111122344444444567888999999999999999998876653 Q ss_pred CCCHH-HHHHHHHHHHHhhhcccC----CCCCccccCCCCC Q lcl|NC_012753. 467 NVTKE-QAQEIYQKINDETMVSTD----SFRTSEEVDIYGE 502 (502) Q Consensus 467 ~~~de-ea~~el~ri~~E~~~~~~----~~~~~~~~~~~g~ 502 (502) |+.+- ..++-+-. -+....+ ......++|-.+. T Consensus 370 g~~pi~ggD~~~~~---~n~~~~~~~~~~~~~~~gG~~n~~ 407 (409) T protein:vir:96 370 DLPPVEGGDKPLIS---GDLYPIDTPLELRKSLKGGDKNVN 407 (409) T ss_pred CCCCCCCcceeeec---ccccccccchhhcccccCCCCCcC Confidence 44321 01111110 0000000 0111233333333 No 180 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=97.37 E-value=8.2e-05 Score=42.97 Aligned_cols=412 Identities=10% Similarity=0.092 Sum_probs=164.2 Q ss_pred hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHH--HHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 3 IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRI--MDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 3 ~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i--~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) +|. +.+.+.++.+..++.-.......+ .....||.- ++ +.....+-..+.+....+| T Consensus 1 ~~~-------------~~~~i~s~~~~~~i~~~~~~s~~~~~~~~~~~~~p---p~-----~~~~la~l~~~n~~v~scI 59 (542) T protein:vir:41 1 MFN-------------YHLSIRSLEKYKAIKREEVESQALGETRFEEYVEP---KV-----NPLVLLSLLQVNPYHASAC 59 (542) T ss_pred Ccc-------------ccccccccccchhhhhccccccccccccCCccccC---CC-----CHHHHHHHHhhcHHHHHHH Confidence 111 122333333222211000000000 000011110 01 1011111122334567888 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--ccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKN--DKFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQ 156 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~ 156 (502) +.+|+.+.+-|..+.-++.. .+..++-+ ..+...+..++.+.+..|.+|+.+..|. |. ..+..++|..+.+. T Consensus 60 ~~ia~~IA~l~~~~~~~~~~---~l~~~lpN~~~s~~~f~~~~v~~lll~Gnayi~i~rd~~G~~~~L~~l~~~~v~v~- 135 (542) T protein:vir:41 60 SIKANDIIRTGYILEGDDEG---VVDEFIRACKPSFEYVLLRALEDLQVFNYCTLEVVRDDRGDPIRFEYIPSHTIRVH- 135 (542) T ss_pred HHHHHHHhhCceeeecccch---hhhhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEEcCcceEEE- Confidence 88898888878766544332 23333221 1344556667778888999999887775 44 46778888876653 Q ss_pred EcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcce Q lcl|NC_012753. 157 ANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPL 236 (502) Q Consensus 157 ~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~ 236 (502) .|.+. ++ .+. .+....++..+.+ .+. +.. .+ ...+..+ +.-- T Consensus 136 ~d~~~-----~~-~~~--~~~~~~~~~~y~~------~~~----~~~-~~-g~~~~~~------------------~~~e 177 (542) T protein:vir:41 136 KDGSR-----YR-QTW--DGVNITHFKDYRY------EGE----INP-ET-GEDQDSV------------------GANE 177 (542) T ss_pred EcCCe-----eE-eee--cCCcceeEEeecc------ccc----ccc-cc-ccccccc------------------Cccc Confidence 33221 11 111 1111111111000 000 000 00 0000000 0001 Q ss_pred EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-Hhhccce---eeechHHhccCCCCCCcccCccc--c Q lcl|NC_012753. 237 FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRR---VAVPTQMIKTEYDTNGEKVTVKR--E 310 (502) Q Consensus 237 f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~---i~v~~~~l~~~~~~~g~~~~~~~--~ 310 (502) .++|+.+. ..+..+|+|.+..+...+... .....+... |..+... +.++..+.+... ......+.. . T Consensus 178 IiHir~~~----~~~~~~Glspi~~~~~~i~~~-~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~--~~~~~~~e~~~~ 250 (542) T protein:vir:41 178 LVFIHIPS----PVCSYYGVPRYVSAAPAILAM-QKIDEYNYAFFDNYTIPSYVITVTGEFEDELE--EDPDGNPTGRTV 250 (542) T ss_pred EEEecCCC----CCCCcccccHHHHHHHHHHHH-HHHHHHHHHHHhccCCccEEEEeCCccccccc--cccccCHHHHHH Confidence 24455332 234568999988777666443 334444433 4544332 233322211000 000000000 0 Q ss_pred cccc-chhhcc--------ccCC-CC--ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc---cc Q lcl|NC_012753. 311 FETG-HNVYEQ--------FDSG-DM--DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM---KT 375 (502) Q Consensus 311 ~~~~-~~~~~~--------~~~~-~~--~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~---~t 375 (502) +... ...+.. +-.. .+ +.+..++.++......++.+..+...++|+...|+||..+|....+. ++ T Consensus 251 lk~~~~~~~~g~~~n~gk~~vL~~~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn 330 (542) T protein:vir:41 251 IQALIEDNFKHLKEAPHTPLVFSIPGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNF 330 (542) T ss_pred HHHHHHHHHhhhhcccCceeEeeccCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCccccccc Confidence 0000 000000 0000 11 11222444454555667888888889999999999999998764332 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCC Q lcl|NC_012753. 376 ATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGF 455 (502) Q Consensus 376 Atei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi 455 (502) +.+....+ ...++.-..+.++..|.+.+ .. ... ....+.|+..-....+ ..+....++++|+ T Consensus 331 ~Eq~~~~f--~~~tL~P~~~~ie~~ln~~L------------~~-~~~--~~~~~~f~~~~ll~~d-~~~~~~~~v~~Gi 392 (542) T protein:vir:41 331 AEVTRRTY--YESVVRPQQNIISSILTDFF------------QV-KFN--PKTRFKFNDETLLESD-SVRNCALLVQSGV 392 (542) T ss_pred HHHHHHHH--HHHHHHHHHHHHHHHHHhhc------------cc-ccC--CceEEEecchhhcchH-HHHHHHHHHhCCC Confidence 33322111 12222233333333333211 11 111 1344556543222222 3344566788999 Q ss_pred CCHHHHHHhcCCCCH--HH-H---------------HHH---HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 456 APKTMAIEKTLNVTK--EQ-A---------------QEI---YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 456 ~S~et~l~~~~~~~d--ee-a---------------~~e---l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++..++++++.|+.. +. . ..+ +.++++-.+...|.+..+...-+-+| T Consensus 393 lT~NE~Re~L~g~~pgdd~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~~~~ 460 (542) T protein:vir:41 393 LTPAEARERLFGLDGGPDIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKLSAE 460 (542) T ss_pred CCHHHHHHhhCCCCCCCccccccccccccccccCCcCCCCCchhhhhhcccccCccccccccccccch Confidence 999998766655422 10 0 000 00111100111111211111111112 No 181 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.34 E-value=9.1e-05 Score=42.74 Aligned_cols=411 Identities=11% Similarity=0.051 Sum_probs=150.9 Q ss_pred CChhHH--HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQT--IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~--ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~ 78 (502) |+=.-. ++..++.+-.....+.|..... .+..+ |-+-|..|+.. ..+.+.. ...-+.+..+++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~rd~l~~~~~----glg~~---r~~~~~~~g~~--~~~~~~~-----l~~~Yr~~~ia~~ 66 (449) T protein:vir:10 1 MTDKLTLAVNHALNDARMARARMGLMVPTM----GLDNK---RHSAWCEYGFP--ELVTYEN-----LYSLYRRGGIAHG 66 (449) T ss_pred CchhhHHHHhhhcchhHHHHHHHHHHHHHh----cCCcc---cchhhhhcCCc--ccCCHHH-----HHHHHhcCchhHH Confidence 321100 0000000000000000000000 00000 00011111000 0010000 0011223457889 Q ss_pred HHHHHhhhhhcCcceEeeCCH----H----HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCC Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNE----V----ADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQAT 150 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~----~----~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~ 150 (502) ||+..|+-+.-+-+.|.-.++ . ....+++++. .+++..+.++..++..+|++++.+-+++++..- T Consensus 67 iVd~~~d~~~~~~~~i~~g~~~~~~~~~~~~e~~~~~l~~-~~~~~~l~ea~~~~rl~Gga~i~i~v~d~~~l~------ 139 (449) T protein:vir:10 67 AVEKLVGKCWQTNPEIIEGDDADDSEDETSWEKKSKQVFT-NRLWRSFAEADRRRLVGRYAGILLHIRDEKDWN------ 139 (449) T ss_pred HHHhhhhhhhhcCcccccCccccchhhhHHHHHHHHHHHH-HHHHHHHHHHHHhhhccCcEEEEEEecCCCCCC------ Confidence 999999877665555432211 1 1234455443 477888999999988888888877675443211 Q ss_pred eEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEE--EEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 151 VFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSL--IEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 151 ~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~--~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) -|+- ..+++....++++. +... ..+++- -+.+ .....|+|...+. |... .... T Consensus 140 --~Pl~-~~~~i~~i~v~~~~-~i~~--~~~~~dp~sp~y-g~P~~y~v~~~~~--------g~~~----------~~~~ 194 (449) T protein:vir:10 140 --LPAT-KGRGLQKVSVSWAG-SLKV--AEWDTGINSKTY-GQPKLWKYTERLP--------NGSS----------RRVD 194 (449) T ss_pred --cccc-cCcceeeEEeeccc-cCCh--hhhhcCCCCCCC-CCceEEEEeeecc--------CCCc----------ccee Confidence 1331 12233333333321 0000 000000 0000 0011122221100 0000 0000 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH-----HHHhhc----cceeeechHHhccCCC Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM-----WEVKMG----QRRVAVPTQMIKTEYD 299 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~-----~~~~~~----~~~i~v~~~~l~~~~~ 299 (502) +| ..|+. .|. .....|+|.+..+-+-+-.++++--.+. +..+-. ...+-+ .++... . T Consensus 195 iH-~SRl~--~~~--------~~~~~g~~~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~-~~l~~~-~- 260 (449) T protein:vir:10 195 IH-PDRVF--ILG--------DYSEDAIGFLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDF-TNLASL-Y- 260 (449) T ss_pred ec-cceeE--eec--------CCCCCChhHHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhh-hhhhHH-h- Confidence 11 11211 110 0001156666655444434443321111 111100 000100 000000 0 Q ss_pred CCCcccCccccccccchhh---ccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-cccccccccc Q lcl|NC_012753. 300 TNGEKVTVKREFETGHNVY---EQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSMKT 375 (502) Q Consensus 300 ~~g~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~~t 375 (502) +.|.+-... .+......+ .....-+.+ .-++.++.. ..-....++...+.++..+|+|... ||...+|..| T Consensus 261 ~~~~e~~~~-~~~~~~~~~~~~~~~~~i~~~--~d~~~~~~~--~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glns 335 (449) T protein:vir:10 261 GVSIDELQD-KFNEVAGEINRGNDVLMTTQG--ATVTPLVTS--VADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSS 335 (449) T ss_pred hCCchHHHH-HHHHHHHHHhccchheeecCC--cceEEEecc--cCChhHHHHHHHHHHHHHhCCCeeeeeccCcccccc Confidence 011100000 000000000 000011111 124444332 2345566778888899999998654 7777766654 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHH------- Q lcl|NC_012753. 376 ATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWS------- 448 (502) Q Consensus 376 Atei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~------- 448 (502) ..+++ .-+..++.+|..++..|++|+..++... .+. ...+++|.|+.--..++.+.++... T Consensus 336 t~D~~----nyyd~i~~~Q~~l~p~le~l~~~l~~s~------~g~--~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~ 403 (449) T protein:vir:10 336 TEDQK----YFNARCQSRRVDLSFEIEDFCDKLIELK------IID--AVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQ 403 (449) T ss_pred chhHH----HHHHHHHHHHHhhhHHHHHHHHHHHHhh------cCC--CCCceeEEeCCCCCCCHHHHHHHHHHHHHHHH Confidence 33443 2445555566678999999988766442 112 2347999999988888877766544 Q ss_pred HHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 449 KMVAAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 449 ~~~~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++.+|.+ .-++.+|+++.+ ..+.....+. +.++.+--++ T Consensus 404 ~~~~ag~~---------~~~~~~EiR~~~---~~~~~~~~~~--~~e~~de~~~ 443 (449) T protein:vir:10 404 TMLGSGDN---------PAFSREEIRTAA---GYDNDDEEPL--GEEDGDEEDK 443 (449) T ss_pred HHHHcccc---------CCcCHHHHHHHh---cccCCCCCCC--CCCCCccccc Confidence 33333310 112334333221 1111000000 0000110011 No 182 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=97.28 E-value=0.00011 Score=42.36 Aligned_cols=375 Identities=9% Similarity=0.010 Sum_probs=157.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||+++..--. . .... ...+ ....+.+- +....... ....+..+--..++ T Consensus 1 Mg~f~~~~~~~~---~--~~~~--------~~~~----------~~~~~~~~--~~~~~~v~----~~~~l~~~~v~~~i 51 (382) T protein:vir:48 1 MPIFNLATESPP---D--NQGG--------FFDV----------VDSDFLAS--LKGNEWVS----AETALRNSDLFSII 51 (382) T ss_pred CccccccccCCc---c--cccc--------cccc----------hhhhcccc--ccCCcccc----hHhhhccHHHHHHH Confidence 999986532100 0 0000 0000 00001110 00000000 00111112223456 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQAN 158 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~d 158 (502) +..|+-+-+=|+ .+.+......+.+-...-....-+..++...+..|.+|+.+..|. |. +.+.+++|+.+-++..+ T Consensus 52 ~~ia~~ia~~~~--~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~l~l~Gna~~~i~rd~~G~~~~l~~i~~~~v~v~~~~ 129 (382) T protein:vir:48 52 NQLSNDLATVKL--ITSRKKLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLD 129 (382) T ss_pred HHHHHhhccCce--eeecchhhhhhhhcCCCCCHHHHHHHHHHHhhhcCCEEEEEEECCCCcEEEEEEEcCceeEEEEcC Confidence 666666655454 344433333333322222334555566777888899998887765 44 36777788776554332 Q ss_pred CCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEE Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFT 238 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~ 238 (502) .+... +| ++.. +....|..+.+. .--++ T Consensus 130 ~~~~~-----------------~y-------------~~~~------~~~~~~~~~~~~----------------~~evi 157 (382) T protein:vir:48 130 NKDGI-----------------YY-------------NITF------DDPRIPPKQHVP----------------QNDVL 157 (382) T ss_pred CCCeE-----------------EE-------------EEEe------cCccccceeEEc----------------CccEE Confidence 22211 11 0000 000011111100 00134 Q ss_pred EecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-cccccccc-ch Q lcl|NC_012753. 239 YLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFETG-HN 316 (502) Q Consensus 239 ~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~~~-~~ 316 (502) +|+.+. ..+..+|.|-+..+...++....+-.-..+-|..+...-.+ ++........... ....+... .+ T Consensus 158 h~~~~~----~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i----l~~~~~~~~e~~~~~~~~~~~~~~n 229 (382) T protein:vir:48 158 HFRLLS----VDGGMTSVSPLMALSRELDIQKASGNLTINSLKNALNANGI----LKIKGGGLLDFKTKLSRSRQAMKQM 229 (382) T ss_pred EecCCC----CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCCChHHHHHHHHHHHhhccC Confidence 454332 12446799998888888865554444444445654332222 2221111110000 00000000 00 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) ....+-. +.+.-++.++.....-++.+..+...++|+...|+||..+|....+..++.+.+.. T Consensus 230 ~g~~~vl---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~~~~~~~~~~-------------- 292 (382) T protein:vir:48 230 QGGPLVL---DDLEDFTPLEIKSNVSQLLKQADWTTGQFAKVYGIPDNVVGGQGDQQSSLEMSSDL-------------- 292 (382) T ss_pred CCCeeEc---CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccHHHHHHHH-------------- Confidence 0000111 11223555665666667888888889999999999999998654433222222211 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQ 474 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~ 474 (502) ++.+|..++..|..-.+. .++. .........++ .+.......+.+++.+|++++-++++.+ .||..+++ T Consensus 293 ~~~~l~p~~~~i~~~l~~-~l~~-~~~~~~~~~~~------~~~~~~~~~~~~l~~~g~~t~~e~r~~l~~~g~~~~~~- 363 (382) T protein:vir:48 293 YSKAVSRYLRPFLSELSQ-KLSC-DVDADIFPAVD------PTGSNYISRINSLVKTGTLAQNQGLYILQQAEILPKEL- 363 (382) T ss_pred HHHHHHHHHHHHHHHHHH-HhcC-hhhhhhhhhhc------cchhHHHHHHHHHhhcCccCHHHHHHHHhhCCCCCcch- Confidence 222222222222111111 0000 00000011111 2334556667788889999999887653 35544433 Q ss_pred HHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 475 EIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 475 ~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+. ++. .+...+||=-++ T Consensus 364 ---~~~--~~~-----~~~~~GGd~~~~ 381 (382) T protein:vir:48 364 ---PNG--ENP-----NSTLKGGEEDGQ 381 (382) T ss_pred ---hhh--hcC-----CCCCCCCCCCCC Confidence 211 111 011223332222 No 183 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=97.23 E-value=0.00012 Score=42.05 Aligned_cols=440 Identities=11% Similarity=0.113 Sum_probs=193.9 Q ss_pred CChhHHHHHHHH-----------HHhhcccccchhhhhcccc-----------------ccCC---HHHHHHHHHHHHHh Q lcl|NC_012753. 1 MGIIQTIKNFIK-----------RSNYVITNQSLNSITDHPK-----------------IAIS---PEEYNRIMDNLRYF 49 (502) Q Consensus 1 m~~~~~ik~~i~-----------~~~~~~~~~~l~~i~~~~~-----------------~~~~---~~~~~~i~~~~~~Y 49 (502) .++.+-+|-|.+ .-...+.+-...+-..... +.+. .....-|+.++.+. T Consensus 2 ~~~l~~~~~~~~~d~~~~~e~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~~~g~~~~~~~~e~~~~~~~eLI~~YR~ma 81 (521) T protein:vir:65 2 FSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNSPASSWNSLTQQFYSTDQKISTTKQLVNTYRGLM 81 (521) T ss_pred ccchhhhhhccCchhhHHHhhhccCCCcccCCCCCCCceeecccCCccccccccceeeeccccchhhhHHHHHHHHHHHh Confidence 222222232221 0000000000000000000 0000 01222233333332 Q ss_pred cCCCCccccccCCCccccccceecchHHHHHHHHh-hhhhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHH Q lcl|NC_012753. 50 AGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVA-SLVFNEQATIRVDNE--------VADAFINETLKNDKFSKNFER 120 (502) Q Consensus 50 ~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a-~~l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~ 120 (502) .. +-+. +--..||+... .=-..+|+++.+++. ...+..+.+++--+|+++..+ T Consensus 82 ~~--pEvd----------------~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~ 143 (521) T protein:vir:65 82 NN--HEVE----------------NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQD 143 (521) T ss_pred hc--cchh----------------hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhH Confidence 21 0000 00011111111 001125666666542 245566777777799999999 Q ss_pred HHHHHhhcCCEEEEEEEeCC---c-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEE Q lcl|NC_012753. 121 YLESCLALGGLAMRPYIDGD---Q-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYT 196 (502) Q Consensus 121 ~~~~~~~~G~~~~~~~~d~~---~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~ 196 (502) .+....+-|..|++..+|++ + ..+..++|..+-++...........-+. .-+.-+..|...+..|. T Consensus 144 ~fR~WYVDgRi~fhkiid~~pk~GI~ELr~lDPr~i~~vr~i~k~~~~~~~v~----------~~~~e~f~Y~~~~~~~~ 213 (521) T protein:vir:65 144 MFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIY----------KATKEYFIYTVGNSSYC 213 (521) T ss_pred HHhhhhhcceeEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCCccee----------cceeeeeeeecCCccee Confidence 99999999999999998742 3 5788899998887653211100000000 00110111111111221 Q ss_pred EEEEEEecCCccccCceeecccc-ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHH Q lcl|NC_012753. 197 ISNELYESESKTIIGQRVPLSTL-YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDE 275 (502) Q Consensus 197 I~~~l~~~~~~~~lG~~v~l~~~-~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~ 275 (502) .....|.. ..+-.+|-+.+ |. -+|+ .+.....=+|-|..+......|=-+-+. T Consensus 214 ~~g~~~~~----~~~vkI~~dAI~y~-------hSGl---------------~d~~~~~i~syLhkAiKp~NQLkm~EDA 267 (521) T protein:vir:65 214 AGGQVFSP----NSRVKIPRSAITYA-------HSGL---------------MDCDDKYIIGYLHRAVKPANQLKLLEDA 267 (521) T ss_pred ccceeecC----Ccceeechhheeee-------eccc---------------eeCCCCeeeecchhhhHhHHhhHHHHhh Confidence 11111111 01112222111 11 0111 0011111123333333333222211111 Q ss_pred HH--HHHhhccceee-ech---------H----Hhc---c--CCCCCCcccCccccccccchhhccccCCCCccccceee Q lcl|NC_012753. 276 FM--WEVKMGQRRVA-VPT---------Q----MIK---T--EYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITD 334 (502) Q Consensus 276 ~~--~~~~~~~~~i~-v~~---------~----~l~---~--~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 334 (502) ++ +-.++-..||| |+- . +.. + +-+...++++.++-+..-...|..- --+|+.+.-|++ T Consensus 268 lVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEItT 346 (521) T protein:vir:65 268 MVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQ-RRDGKAITDVTT 346 (521) T ss_pred HHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhccc-ccCCCCccceee Confidence 11 11234444554 211 0 110 0 0122233333322222222222211 113333444666 Q ss_pred eccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 335 LTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM---KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILEL 411 (502) Q Consensus 335 ~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~---~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~ 411 (502) +..--...+ ++-++.+.+.+....++|.+.++.++++. .-++||....-....-+.+++..|..-+.++++.-|.+ T Consensus 347 LpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLil 425 (521) T protein:vir:65 347 LPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLIL 425 (521) T ss_pred cccCCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 654322222 23466677788888888877765443211 23456655554555677888888888888888876654 Q ss_pred HHhhcccCCC-ccc-ccceEEEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHHHhcCCCCHHHHHHHHHHH Q lcl|NC_012753. 412 AKVYNLYTGE-IPT-MDEVSVDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAIEKTLNVTKEQAQEIYQKI 480 (502) Q Consensus 412 ~~~~~~~~~~-~~~-~~~i~v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l~~~~~~~deea~~el~ri 480 (502) -. ++... +.. ...+.++|...--..+..+++.+..-. ..+ ..|.+++.++....||+|.+++.+.| T Consensus 426 Kg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~k~I 502 (521) T protein:vir:65 426 KN---VITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQI 502 (521) T ss_pred hc---CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHH Confidence 32 22211 111 124778886654444444444433211 112 46999988888899999999999999 Q ss_pred HHhhhcccCCCCCccccCC Q lcl|NC_012753. 481 NDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 481 ~~E~~~~~~~~~~~~~~~~ 499 (502) ++|....--..+..+..|| T Consensus 503 ~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 503 EEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHhhhCCCCCCCcccccCC Confidence 9998776554555555666 No 184 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=97.21 E-value=0.00013 Score=41.92 Aligned_cols=389 Identities=15% Similarity=0.080 Sum_probs=159.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) -++++++|..+..-+..-....+.. +--|.......+ . .+..+...--...| T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~~~~v-----~----~~~~~~~~~V~~ci 55 (409) T protein:vir:93 4 ENIVTRIKKKLIDNWIDQSTSKLYD-------------------FSPWKNRSFWGV-----I----NNTLETNETIFSAI 55 (409) T ss_pred cchhhhhhhhhhhhhhccccccccc-------------------cccccCcccccc-----c----hhhhhccHHHHHHH Confidence 4677777776543211111111100 000000000000 0 01112222223445 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFF 153 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~ 153 (502) +..|+-+-.=|..+--.++.....+..+|.. | ....-...++...+..|.+|+.+..+. |. ..+..++|+.+- T Consensus 56 ~~Ia~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:93 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceeEeeccccccchHHHHHhhhcccCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEcCceeE Confidence 5555555555555422333233333334431 2 233334556777788899998887775 33 356667777765 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) ++..+.++.. +|. +. ..+ |..+.+ ++ . T Consensus 136 ~~~~~~~~~~-----------------~y~-------------~~-----~~~----g~~~~~---~~---~-------- 162 (409) T protein:vir:93 136 MLIENQSREL-----------------YYS-------------IH-----AAT----GNKLIV---HN---M-------- 162 (409) T ss_pred EEEeCCCcEE-----------------EEE-------------EE-----cCC----ceEEEE---cc---c-------- Confidence 5432222110 010 00 000 111100 00 0 Q ss_pred cceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-cccccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFE 312 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~ 312 (502) -.++++.+- ..+..+|+|.+.-+...++-...+..............|+. .....+..... ....|. T Consensus 163 --eVih~r~~~----~~~~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~------~~~~l~~e~~~~~~~~~~ 230 (409) T protein:vir:93 163 --DMLHFKHIV----ASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLK------YGSNVGKEKRQQVLEDFK 230 (409) T ss_pred --cEEEeCCCC----CCCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEe------cCCCCCHHHHHHHHHHHH Confidence 023343221 12334688887777666664443322111112222122222 11111111000 000010 Q ss_pred ccchhhccc-cCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHH Q lcl|NC_012753. 313 TGHNVYEQF-DSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMR 390 (502) Q Consensus 313 ~~~~~~~~~-~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~ 390 (502) ..+..- ...--+.+.-++.++......++.+..+....+|+...|+||..+|....+. +++.+... T Consensus 231 ---~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~--------- 298 (409) T protein:vir:93 231 ---QYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNR--------- 298 (409) T ss_pred ---HHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH--------- Confidence 001000 0000012223555555555667888888888999999999999998654432 23332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) ..++.+|..++..|-...+..=+..........+.++++.-+-.|..+.++.+.+++.+|+++.-++++.+ |+.. T Consensus 299 ----~f~~~~l~P~~~~ie~~l~~~Ll~~~~~~~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~~-g~~p 373 (409) T protein:vir:93 299 ----FYLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE-DLPP 373 (409) T ss_pred ----HHHHHHHHHHHHHHHHHHHhhcCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCC Confidence 11223333333333221111101111111222344444444567889999999999999999999876653 4433 Q ss_pred HH-HHHHHHHHHHhhhcccCC----CCCccccCCCCC Q lcl|NC_012753. 471 EQ-AQEIYQKINDETMVSTDS----FRTSEEVDIYGE 502 (502) Q Consensus 471 ee-a~~el~ri~~E~~~~~~~----~~~~~~~~~~g~ 502 (502) -+ .++-+- .-+....+. .....+++-.+. T Consensus 374 ~~ggD~~~~---~~n~~~~~~~~~~~~~~~gG~~n~~ 407 (409) T protein:vir:93 374 VEGGDKPLI---SGDLYPIDTPLELRKSLKGGDKNVN 407 (409) T ss_pred CCCcCeeee---cccccccccchhhcccccCCCCCcC Confidence 10 111110 000000000 011222222222 No 185 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=97.11 E-value=0.00017 Score=41.27 Aligned_cols=327 Identities=16% Similarity=0.109 Sum_probs=136.6 Q ss_pred hhcCcceEeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEcC Q lcl|NC_012753. 87 VFNEQATIRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQANT 159 (502) Q Consensus 87 l~~ep~~i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d~ 159 (502) +-.=|+.+.-+++....-+.++|.. | ....-....+...+..|.+|+.+..+. |.+ .+-.++|+.+-++..+. T Consensus 1 ia~lp~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~l~~~~v~~~~~~~ 80 (348) T protein:vir:93 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (348) T ss_pred CcccceEeEecCcCcccHHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCCceEEEEeCC Confidence 1122333322222222223333321 1 222333445667778999998887765 443 45555665554432221 Q ss_pred CCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 160 QDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) ++. .+|+ ++ .. + |..+.+ .+ . -.++ T Consensus 81 ~~~-----------------~~y~---~~-~~--------------~----g~~~~~---~~---~----------eiih 105 (348) T protein:vir:93 81 SRE-----------------LYYS---IH-AA--------------T----GNKLIV---HN---M----------DMLH 105 (348) T ss_pred CcE-----------------EEEE---EE-cC--------------C----CeEEEE---cc---c----------cEEE Confidence 111 0010 00 00 0 111100 00 0 0233 Q ss_pred ecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-ccccccccchhh Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFETGHNVY 318 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~~~~~~~ 318 (502) |+.+.. .+..+|+|.+.-+...++..+.+.......+..+...+ +......+..... ....|. ..+ T Consensus 106 ~r~~~~----~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i------~~~~~~l~~e~~~~~~~~~~---~~~ 172 (348) T protein:vir:93 106 FKHIVA----SNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFM------LKYGSNVSTEKRQQVLEDFK---QYY 172 (348) T ss_pred ecCCCC----CCceeeccHHHHHHHHHHHHHHHHHHHHHhcCCCceeE------EecCCCCCHHHHHHHHHHHH---HHh Confidence 443211 13346888877777666644433222122221111112 2111111100000 000010 111 Q ss_pred cc---ccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 319 EQ---FDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNSIA 394 (502) Q Consensus 319 ~~---~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~~~ 394 (502) .. +..- +.+..++.++.....-++.+..+...++|+...|+|+..++...++. +++.+.... T Consensus 173 ~n~~~~~vl--~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~------------ 238 (348) T protein:vir:93 173 EENGGILFQ--EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRF------------ 238 (348) T ss_pred hcCCCeeec--CCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH------------ Confidence 00 0000 12223555555555557888888889999999999999998654332 233322111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccC-CCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-H Q lcl|NC_012753. 395 TLVEKSLKELVISILELAKVYNLYT-GEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKE-Q 472 (502) Q Consensus 395 ~~~~~~l~~l~~~il~~~~~~~~~~-~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~de-e 472 (502) .++.+|.-+++.|-...+. .++. ........+.++++.-+-.|..+.++...+++.+|+++.-++++.. |+..- . T Consensus 239 -~~~~~l~P~~~~ie~~l~~-~l~~~~~~~~g~~i~fd~~~l~~~d~~~~a~~~~~~~~~G~~T~NE~R~~~-g~~p~~g 315 (348) T protein:vir:93 239 -YLQHTLLPIVKQYEEEFNR-KLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREWE-DLPPVEG 315 (348) T ss_pred -HHHHHHHHHHHHHHHHHHH-hhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHh-CCCCCCC Confidence 1222333333322221111 1111 1111122355555555667889999999999999999999976653 44321 0 Q ss_pred HHHHH-----HHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 473 AQEIY-----QKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 473 a~~el-----~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .++-+ ..+.. .........++|-.++ T Consensus 316 gD~~~~~~n~~~~~~----~~~~~~~~~gg~~n~~ 346 (348) T protein:vir:93 316 GDKPLISGDLYPIDT----PLELRKSLKGGDKNVN 346 (348) T ss_pred cCeEeeccccccccc----chhhcccccCCCCCcC Confidence 11111 11110 0000111233333333 No 186 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=97.10 E-value=0.00017 Score=41.25 Aligned_cols=389 Identities=9% Similarity=0.041 Sum_probs=162.8 Q ss_pred cchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEee-CCH Q lcl|NC_012753. 21 QSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRV-DNE 99 (502) Q Consensus 21 ~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~-~d~ 99 (502) |-|..+..+..-.. ..-...+...+.+............ ...+.++---..|+..|+-+.+=|+.+-- +++ T Consensus 1 ~~f~~~f~r~~~~~----~~~~~~~~~~~~~~~~~~~g~~v~~----~~~l~~~~v~~~i~~Ia~~iA~~p~~~~~~~~~ 72 (413) T protein:vir:48 1 MFFSGLFQRKSDAP----VTTPAELAEAIGLSYDTYTGKRISS----QRAMRLTAVYSCVRVLAESVGMLPCSLYKISGT 72 (413) T ss_pred CccchhhccCccCC----ccchHHHHHhhhcCcccccCceech----hhhhccHHHHHHHHHHHHhhhhCceEEEEecCC Confidence 22222222211100 0011122233333222211111111 11122233345566667666665655321 111 Q ss_pred ----HHHHHHHHHHh-----hccHHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEcCCeEEEEEEcCCCeEEEEEEE Q lcl|NC_012753. 100 ----VADAFINETLK-----NDKFSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQATVFFPLQANTQDVSSAAIVT 169 (502) Q Consensus 100 ----~~~e~l~~~~~-----~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~ 169 (502) ....-+..+|. .-....-...++...+..|.+|+.+..+.|++ .+..++|+.+-+.. +.+.. T Consensus 73 ~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~~~~g~~~~L~~l~~~~v~~~~-~~~~~------- 144 (413) T protein:vir:48 73 LKTRVVDERLHKLVSAKPNGYMTPQEFWELVIVCLCLRGNFYAYKVKALGEVVELLPIDPGCVEPKL-NSQWQ------- 144 (413) T ss_pred cceeecccHHHHHHHhhccCCCCHHHHHHHHHHHHhhcCceEEEEEeCCCcEEEEEEEcCceEEEEE-cCCce------- Confidence 11112333332 12334455556777888899988887776664 45556776655532 21111 Q ss_pred EEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCcccccc Q lcl|NC_012753. 170 KSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKD 249 (502) Q Consensus 170 ~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~ 249 (502) ..|. +...++... .++. . -+++++.+. T Consensus 145 ----------~~y~----~~~~~g~~~----------------~~~~--------~----------evih~~~~~----- 171 (413) T protein:vir:48 145 ----------PVYQ----VTFPDGSVD----------------VLTQ--------D----------EIWHVRTLT----- 171 (413) T ss_pred ----------EEEE----EEecCceEE----------------EEcc--------c----------cEEEecCcC----- Confidence 0010 000001000 0000 0 123343321 Q ss_pred ccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc-eeeechHHhccCCCCCCcccCcccccccc-chhhccccC---- Q lcl|NC_012753. 250 INSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQMIKTEYDTNGEKVTVKREFETG-HNVYEQFDS---- 323 (502) Q Consensus 250 ~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~---- 323 (502) .....|+|.+..+...|+.....-....+-|..+.. .-+ |......+.... ..+... ...+..... T Consensus 172 ~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gi-----l~~~~~~~~e~~---~~~~~~~~~~~~g~~n~g~~ 243 (413) T protein:vir:48 172 LDGLVGLNPIAYAREAISLAAATEEHGARLFGNGAVTSGV-----LRTEQKLTPDAY---ERLKKDFEERHTGLGNAHRP 243 (413) T ss_pred CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE-----EEeCCCCCHHHH---HHHHHHHHHHhcCccccCcc Confidence 123579998888888877555444333334554332 222 222111110000 000000 011111100 Q ss_pred CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 324 GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLK 402 (502) Q Consensus 324 ~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~ 402 (502) .--+.+.-++.++.....-++.+..+....+|+...|++|..++...++. +++.+.... .++.+|. T Consensus 244 ~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~~~-------------f~~~~i~ 310 (413) T protein:vir:48 244 MILEMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLG-------------FINYSLV 310 (413) T ss_pred eecCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHHHH-------------HHHHHHH Confidence 00012223555555555667788888889999999999999998754332 233333211 1223333 Q ss_pred HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHHHHHH Q lcl|NC_012753. 403 ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKE-QAQEIYQKIN 481 (502) Q Consensus 403 ~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~de-ea~~el~ri~ 481 (502) -++..|....+. .++.........+.++++.-+-.|..+.++...+++.+|+++.-++++. .|+..- ..++-+.... T Consensus 311 P~~~~ie~~l~~-~L~~~~~~~~~~~~fd~~~l~~~d~~~~~~~~~~~~~~g~~T~NE~R~~-~g~~p~~ggD~~~~~~n 388 (413) T protein:vir:48 311 PYLTRIEQRINT-GLVRESKQGKFYAKFNAGALLRGDMKSRFEAYATGINWGIYSPNDCRDL-EDMNPRPGGDVYLTPMN 388 (413) T ss_pred HHHHHHHHHHHh-hccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH-hCCCCCCCcceeecccc Confidence 333333222211 1111111112345555666566788999999999999999999987654 354321 1111111100 Q ss_pred HhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 482 DETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 482 ~E~~~~~~~~~~~~~~~~~g~ 502 (502) ............ ...+=..+ T Consensus 389 ~~~~~~~~~~~~-~~~~~~~~ 408 (413) T protein:vir:48 389 MTTSPSAGDDNG-KKKESGDA 408 (413) T ss_pred ccccccccccCC-CCCCCCCc Confidence 000000000000 00000000 No 187 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=97.09 E-value=0.00018 Score=41.17 Aligned_cols=426 Identities=12% Similarity=0.124 Sum_probs=190.8 Q ss_pred CCh--hHHHHHHHHHHhhcccccchhhhhccc--------------cccCC-----------------------HHHHHH Q lcl|NC_012753. 1 MGI--IQTIKNFIKRSNYVITNQSLNSITDHP--------------KIAIS-----------------------PEEYNR 41 (502) Q Consensus 1 m~~--~~~ik~~i~~~~~~~~~~~l~~i~~~~--------------~~~~~-----------------------~~~~~~ 41 (502) |++ ..-+|-|-+. -...+.+..... .+.++ .....- T Consensus 1 m~~~~L~~~~~w~~~-----de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eL 75 (524) T protein:vir:10 1 MKFNVLSLFAPWAKM-----DERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTREL 75 (524) T ss_pred CCCchhhHhhccccC-----cchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHH Confidence 776 3333333221 111111110000 00000 001111 Q ss_pred HHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh-----hhcCcceEeeCCH--------HHHHHHHHH Q lcl|NC_012753. 42 IMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL-----VFNEQATIRVDNE--------VADAFINET 108 (502) Q Consensus 42 i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~-----l~~ep~~i~~~d~--------~~~e~l~~~ 108 (502) |+.++.+... +-.--.|+..++= -..+|+.+.+++. ...+..+.+ T Consensus 76 I~~YR~ma~~----------------------pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~I 133 (524) T protein:vir:10 76 IDTYRNLMNN----------------------YEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFNDV 133 (524) T ss_pred HHHHHHHhhc----------------------cchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHH Confidence 2222222111 1111111111111 1135666666542 245667778 Q ss_pred HhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC----c-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCce-EE Q lcl|NC_012753. 109 LKNDKFSKNFERYLESCLALGGLAMRPYIDGD----Q-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVK-YY 182 (502) Q Consensus 109 ~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~----~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~-~y 182 (502) ++--+|+++..+.+....+-|..|++.++|+. + ..+..++|..+-++..- ..+..++. .+ T Consensus 134 l~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i--------------~~~~~~~~~vi 199 (524) T protein:vir:10 134 LNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREI--------------ITETEAGTKIV 199 (524) T ss_pred HHHhccchhhhHHHhhheeeeEEEEEEEeeCCCccccceeeeeeCCccceeeeee--------------ccCCCccchhh Confidence 87779999999999999999999999999853 3 46888888776554211 01111111 11 Q ss_pred EE-EEEEEEeCCeEEEEEEEEecCCcc--ccCcee--eccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcc Q lcl|NC_012753. 183 SL-IEFHEWNKETYTISNELYESESKT--IIGQRV--PLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLS 257 (502) Q Consensus 183 t~-~E~h~~~~~~~~I~~~l~~~~~~~--~lG~~v--~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S 257 (502) +- .|++..+. .+.-|.. ++. ..|+.| |-+. +++...+- +..+.. .=+| T Consensus 200 ~~~~e~f~Y~~-----~~~~y~~-~g~~~~~~~~ikI~~dA--------I~y~hSGL-----~d~~~~--------~i~g 252 (524) T protein:vir:10 200 KGYKEYFIYDT-----AHESYAC-DGRMYEAGTKIKIPKAA--------IVYAHSGL-----VDCCGK--------NIIG 252 (524) T ss_pred cchhhheeecc-----Ccccccc-CccccCCCcceecchhh--------eeeeeccc-----eeCCCC--------ceec Confidence 10 01111000 0000000 000 111111 1111 11110000 000000 0023 Q ss_pred hhhhHHHHHHHHHHHHHHHH--HHHhhccceee-ech---------HHh---------ccCCCCCCcccCccccccccch Q lcl|NC_012753. 258 IFDNAKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT---------QMI---------KTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 258 ~~~~~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~---------~~l---------~~~~~~~g~~~~~~~~~~~~~~ 316 (502) -|..+......|=-+-+.++ +-.++-..||| |+- .++ +-+-+...++++.++-+..-.. T Consensus 253 yLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlE 332 (524) T protein:vir:10 253 YLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTE 332 (524) T ss_pred cchhhhHHHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 33333322222221111111 11233334444 211 000 0011222333333222222222 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccc-c--cccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGK-S--MKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~-~--~~tAtei~~~~~~l~~~~~~~ 393 (502) .|..- --+|+.+.-|+++..--...+ ++-++.+.+.+....++|.+.+..+++ + ..-++||....-....-+.++ T Consensus 333 DyWLp-RReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rL 410 (524) T protein:vir:10 333 DYWLQ-RRDGKAVTEVDTLPGADNTGN-MEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIREL 410 (524) T ss_pred hhccc-ccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHH Confidence 22211 113333444666654322222 234666777888888888777743321 1 113556655554555677888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAI 462 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l 462 (502) +..|..-+.++++.-|.+-. ++...-+.. ..+.++|...--..+..+++.+..-. ..+ ..|.+++. T Consensus 411 R~rFs~~f~~~Lk~qLilKg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~ 487 (524) T protein:vir:10 411 QHKFEEVFLDPLKTNLLLKG---IITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAM 487 (524) T ss_pred HHHHHHHHHHHHHHhhhhcc---CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHH Confidence 88888888888887665432 222211111 35778886654444444444433211 112 45899988 Q ss_pred HhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 463 EKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 463 ~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) ++....||+|.+++.+.|++|....--..+..+.-|| T Consensus 488 k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 488 KDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 8888999999999999999998776555555666666 No 188 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=97.07 E-value=0.00018 Score=41.05 Aligned_cols=426 Identities=13% Similarity=0.122 Sum_probs=190.6 Q ss_pred CCh--hHHHHHHHHHHhhcccccchhhhhccc--------------cccCC-----------------------HHHHHH Q lcl|NC_012753. 1 MGI--IQTIKNFIKRSNYVITNQSLNSITDHP--------------KIAIS-----------------------PEEYNR 41 (502) Q Consensus 1 m~~--~~~ik~~i~~~~~~~~~~~l~~i~~~~--------------~~~~~-----------------------~~~~~~ 41 (502) |++ ..-+|-|-+. -...+.+..... .+.++ .....- T Consensus 1 m~~~~L~~~~~w~~~-----de~~~~~~~~~~~~S~~~p~~~Dga~e~~~~~~~~a~~~~g~~~~~~g~~e~~~~~~~eL 75 (524) T protein:vir:72 1 MKFNVLSLFAPWAKM-----DERNFKDQEKEDLVSITAPKLDDGAREFEVSSNEAASPYNAAFQTIFGSYEPGMKTTREL 75 (524) T ss_pred CCCchhhHhhccccC-----cchhhhhhhccCCccccCccCCCCceeeeecccccccccceeeeehhcccccccchHHHH Confidence 776 3333333221 111111110000 00000 001111 Q ss_pred HHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh-----hhcCcceEeeCCH--------HHHHHHHHH Q lcl|NC_012753. 42 IMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL-----VFNEQATIRVDNE--------VADAFINET 108 (502) Q Consensus 42 i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~-----l~~ep~~i~~~d~--------~~~e~l~~~ 108 (502) |+.++.+... +-.--.|+..++= -..+|+.+.+++. ...+..+.+ T Consensus 76 I~~YR~ma~~----------------------pEvd~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~I 133 (524) T protein:vir:72 76 IDTYRNLMNN----------------------YEVDNAVSEIVSDAIVYEDDTEVVALNLDKSKFSPKIKNMMLDEFSDV 133 (524) T ss_pred HHHHHHHhhc----------------------cchhhHHHHhhcceeEecCCCceEEEEecCcCcchHHHHHHHHHHHHH Confidence 2222222111 1111111111111 1135666666542 245667778 Q ss_pred HhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC----c-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCce-EE Q lcl|NC_012753. 109 LKNDKFSKNFERYLESCLALGGLAMRPYIDGD----Q-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVK-YY 182 (502) Q Consensus 109 ~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~----~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~-~y 182 (502) ++--+|+++..+.+....+-|..|++.++|+. + ..+..++|..+-++..- ..+..++. .+ T Consensus 134 l~ll~F~~~~~~~fR~WYVDgRi~fhKiid~k~pk~GI~Elr~lDPr~i~~vr~i--------------~~~~~~~~~vi 199 (524) T protein:vir:72 134 LNHLSFQRKGSDHFRRWYVDSRIFFHKIIDPKRPKEGIKELRRLDPRQVQYVREI--------------ITETEAGTKIV 199 (524) T ss_pred HHHhccchhhhHHHhhheeeeEEEEEEEEeCCCccccceeeeeeCCccceeeeee--------------ccCCCccchhh Confidence 87779999999999999999999999999854 3 46788888776554211 01111111 11 Q ss_pred EE-EEEEEEeCCeEEEEEEEEecCCcc--ccCcee--eccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcc Q lcl|NC_012753. 183 SL-IEFHEWNKETYTISNELYESESKT--IIGQRV--PLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLS 257 (502) Q Consensus 183 t~-~E~h~~~~~~~~I~~~l~~~~~~~--~lG~~v--~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S 257 (502) +- .|++..+. .+.-|.. ++. ..|+.| |-+. +++...+- +..+.. .=+| T Consensus 200 ~~~~e~f~Y~~-----~~~~y~~-~g~~~~~~~~ikI~~dA--------I~y~hSGL-----~d~~~~--------~i~g 252 (524) T protein:vir:72 200 KGYKEYFIYDT-----AHESYAC-DGRMYEAGTKIKIPKAA--------VVYAHSGL-----VDCCGK--------NIIG 252 (524) T ss_pred cchhhheeecc-----Ccccccc-CccccCCCcceecchhh--------eeeeeccc-----eeCCCC--------ceec Confidence 10 01111000 0000000 000 111111 1111 11110000 000000 0023 Q ss_pred hhhhHHHHHHHHHHHHHHHH--HHHhhccceee-ech---------HHh---------ccCCCCCCcccCccccccccch Q lcl|NC_012753. 258 IFDNAKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT---------QMI---------KTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 258 ~~~~~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~---------~~l---------~~~~~~~g~~~~~~~~~~~~~~ 316 (502) -|..+......|=-+-+.++ +-.++-..||| |+- .++ +-+-+...++++.++-+..-.. T Consensus 253 yLhkAiKp~NQLkmlEDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlE 332 (524) T protein:vir:72 253 YLHRAVKPANQLKLLEDAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTE 332 (524) T ss_pred cchhhhHhHHhhhHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHh Confidence 33333322222221111111 11233334444 211 000 0011222333333222222222 Q ss_pred hhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccc-c--cccHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 317 VYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGK-S--MKTATEVVSEQSDTYQMRNSI 393 (502) Q Consensus 317 ~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~-~--~~tAtei~~~~~~l~~~~~~~ 393 (502) .|..- --+|+.+.-|+++..--...+ ++-++.+.+.+....++|.+.+..+++ + ..-++||....-....-+.++ T Consensus 333 DyWLp-RReGgrgTEItTLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rL 410 (524) T protein:vir:72 333 DYWLQ-RRDGKAVTEVDTLPGADNTGN-MEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIREL 410 (524) T ss_pred hhccc-ccCCCcccceeeccccCCcCh-HHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHH Confidence 22211 113333444666654322222 234666777888888888877743321 1 113556655554555677888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHH Q lcl|NC_012753. 394 ATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAI 462 (502) Q Consensus 394 ~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l 462 (502) +..|..-+.++++.-|.+-. ++...-+.. ..+.++|...--..+..+++.+..-. ..+ ..|.+++. T Consensus 411 R~rFs~~f~~~Lk~qLilKg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~ 487 (524) T protein:vir:72 411 QHKFEEVFLDPLKTNLLLKG---IITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAM 487 (524) T ss_pred HHHHHHHHHHHHHHhhhhcc---CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHH Confidence 88888888888887665432 222211111 35778886654444444444433211 112 45899988 Q ss_pred HhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 463 EKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 463 ~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) ++....||+|.+++.+.|++|....--..+..+.-|| T Consensus 488 k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 488 KDILQMTDEEIEQEAKQIEEESKEARFQDPDQEQEDF 524 (524) T ss_pred HHHhccCHHHHHHHHHHHHHHhhcCCCCCCchhhhcC Confidence 8888999999999999999998776555555555666 No 189 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=96.88 E-value=0.00028 Score=40.06 Aligned_cols=350 Identities=12% Similarity=0.104 Sum_probs=146.7 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH--HH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG--RT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~--k~ 78 (502) |+++...++ +. .+.... + |..+..+. .... +... +....+..+ -. T Consensus 1 M~~~~~f~~---r~--~~~~~~-------~--------------~~~~~~~~-~~~~-----~~~v-~~~~al~~~av~~ 47 (359) T protein:vir:10 1 MSILNPFER---RS--SITPNN-------Y--------------YPFMVQNG-SIVP-----NSLV-DATEALKNSDLYA 47 (359) T ss_pred Ccccchhhc---cc--cCCCCc-------c--------------hhhhhccc-cccC-----Cccc-CHHHhhcchHHHH Confidence 888864332 10 000000 0 01111111 1100 0000 001111111 23 Q ss_pred HHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEEcCCeEEEEE Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSFVQATVFFPLQ 156 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~v~~~~~~Pi~ 156 (502) .|+..|+-+-+-|. ++++.....+.+-...-...+-...++...+..|.+|+.+..+.+ . ..+..++|+.+-+. T Consensus 48 cv~~ia~~ia~~p~---~~~~~~~~L~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~g~~~~l~~l~~~~v~i~- 123 (359) T protein:vir:10 48 VTSLISSDIAGTRF---IGNQVFTSVLNNPSHLTNAFSFWQTAILNLLLNGNVFLAILKGDNSLMKELRLIPSNAITID- 123 (359) T ss_pred HHHHHHHhhhcCcc---ccchHHHHHhhcccccCCHHHHHHHHHHhccccCceEEEEEECCCCeEEEEEEeCCceEEEE- Confidence 55666666655554 234444444433221112222334445566677889888877654 3 34556677665442 Q ss_pred EcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcce Q lcl|NC_012753. 157 ANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPL 236 (502) Q Consensus 157 ~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~ 236 (502) .+++. +++++.... .+. . ..++- . - T Consensus 124 ~~~~~-----~~y~~~~~~------------------~~~-~-------------~~~~~--------~----------e 148 (359) T protein:vir:10 124 LTDDT-----LTYEVNQFD------------------DYP-S-------------AKYNA--------S----------E 148 (359) T ss_pred EcCCe-----EEEEEEecC------------------Cce-E-------------EEEcc--------c----------c Confidence 22221 011110000 000 0 00000 0 0 Q ss_pred EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccch Q lcl|NC_012753. 237 FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHN 316 (502) Q Consensus 237 f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~ 316 (502) .++|+.+..+....+...|+|-+.-+...+.....+..-..+-|..+...-.+ |......-..+-. ..+..... T Consensus 149 vih~~~~~~~~~~~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gi----l~~~~~~l~~e~~--~~~~~~~~ 222 (359) T protein:vir:10 149 MIHVKIMAYGVDTLHNLVGHSPLESLTSEIGQQKEANRLSLSTLKGALNPTSV----VKVPQGTLSSEAK--DSIRKEFE 222 (359) T ss_pred eEEeccCCCCCCccCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE----EEeCCCCCCHHHH--HHHHHHHH Confidence 12344332232233445688888877777766555444444445554432111 2211100000000 00000001 Q ss_pred hhcccc-CC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHH-HHHHH Q lcl|NC_012753. 317 VYEQFD-SG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDT-YQMRN 391 (502) Q Consensus 317 ~~~~~~-~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l-~~~~~ 391 (502) .+.... .+ --+.+.-++.++.....-++.+..+....+|+...|+||..+|..++...|...++..+... ...+. T Consensus 223 ~~~~~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~~~e~~~~~~l~~~l~ 302 (359) T protein:vir:10 223 KANGGNNSGRVMVLDQSADFSTVSINADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQSSLDQIKDLYVNALNRFIE 302 (359) T ss_pred HHhCccccCCceecCCCcceeeecCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCcccccHHHHHHHHHHHHHHHHH Confidence 010000 00 00112234445544445568888888899999999999999986554444555554443322 22233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH Q lcl|NC_012753. 392 SIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKE 471 (502) Q Consensus 392 ~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~de 471 (502) -++.++...|..- + .++...-+-.|.+.......+++.+|+++.-++++.+ +... T Consensus 303 p~~~~l~~~l~~~----~-------------------~~~~~~~~~~d~~~~~~~~~~~~~~G~~t~NE~R~~l-~~~p- 357 (359) T protein:vir:10 303 PLISELRIKCDSS----I-------------------GVDMSPITDYSNSVFKADILNWVKEGIIEPTEAKTLL-ESKG- 357 (359) T ss_pred HHHHHHHHHhhhh----h-------------------cccchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCC- Confidence 3333333222210 0 0011111112334455667778889999998876643 3222 Q ss_pred HHH Q lcl|NC_012753. 472 QAQ 474 (502) Q Consensus 472 ea~ 474 (502) +- T Consensus 358 -v~ 359 (359) T protein:vir:10 358 -II 359 (359) T ss_pred -CC Confidence 11 No 190 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=96.88 E-value=0.00028 Score=40.05 Aligned_cols=439 Identities=14% Similarity=0.161 Sum_probs=191.7 Q ss_pred CC-----h-----hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccc Q lcl|NC_012753. 1 MG-----I-----IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDF 70 (502) Q Consensus 1 m~-----~-----~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~ 70 (502) ++ + -+.... | .+|+.++..-+ ..-.. -.....-|+.++.+... +-+. T Consensus 13 ~~~~~~S~vpp~~~~~~~~-i-~~g~~g~~v~~---~g~~~---~~n~~eLI~~YR~ma~~--pEVd------------- 69 (564) T protein:vir:10 13 EGQKGQSPVPPNDEASVST-V-AGGYFGTYVDT---SGGQN---SRNEYELIRRYRDMSLH--PEVD------------- 69 (564) T ss_pred ccCCCCCcccCCcCCChhh-h-hccccceeeec---ccccc---hhhHHHHHHHHHHHhhc--cchh------------- Confidence 00 0 001111 1 12221111100 00000 12344556666666533 1110 Q ss_pred eecchHHHHHHHHhhh-hhcCcceEeeCC--------HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC- Q lcl|NC_012753. 71 NHLPIGRTASKKVASL-VFNEQATIRVDN--------EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD- 140 (502) Q Consensus 71 ~~~n~~k~iv~~~a~~-l~~ep~~i~~~d--------~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~- 140 (502) +--..||+...-+ -..+|+.+.+++ +...+..+.+++--+|+++..+.+....+-|..|++..+|++ T Consensus 70 ---~Av~eIVneaIv~d~~~~pV~vdL~~~~~s~siK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRi~fHkiid~~~ 146 (564) T protein:vir:10 70 ---SAIDEIVNEFVVNDGDDKPVEVDLQNLEIGSGVKKKIRDEFNRILRMMNFNVNAHEIIRNWYVDGRSHYHKVIDLDN 146 (564) T ss_pred ---hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeeCCC Confidence 0001222221111 112455555553 225566777887779999999999999999999999999853 Q ss_pred ---c-eEEEEEcCCeEEEEEEcCCCe--EEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCc-- Q lcl|NC_012753. 141 ---Q-IRVSFVQATVFFPLQANTQDV--SSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQ-- 212 (502) Q Consensus 141 ---~-~~i~~v~~~~~~Pi~~d~~~~--~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~-- 212 (502) + ..+.+++|-.+=+++..-.+. ....++ .-+.. .-.|....|++..+... |.+......|. T Consensus 147 pk~GI~eLr~lDPr~i~~vr~i~~~~~~~~~~v~-k~~~~---~~~y~~~~Eyy~Ynp~~-------~~g~~~~~~~~~~ 215 (564) T protein:vir:10 147 PKKGILELRYIDSLKIRKVRQKLKDVDPNRKEIE-KGTAL---QYDYGDFIEYYIYNPKG-------FAGNIPMVTGSMD 215 (564) T ss_pred hhhhhhhhhhhcccceeeeeeeccccccccceee-eeeee---eccccccccceeecccc-------ccCcccccccccc Confidence 3 368888998876665221100 000000 00000 00011111222211111 11111111111 Q ss_pred -------eeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH--HHHhhc Q lcl|NC_012753. 213 -------RVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM--WEVKMG 283 (502) Q Consensus 213 -------~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~--~~~~~~ 283 (502) .+|.+.+ +|..-... ++....=+|-|..+......|=-+-+.++ +-.++- T Consensus 216 ~~~~~~ikI~~daI------------------~y~hSGL~---d~~~~~i~gyLhkAIKp~NQLkmlEDAlVIYRitRAP 274 (564) T protein:vir:10 216 WSNQEGIKIASDAI------------------AQSTSGLM---DLNKKMTLSFLHKAIKSLNQLRMIEDSLVIYRLSRAP 274 (564) T ss_pred cccccceeechhhc------------------ceecccce---eCCCCceeccchhhhHhHHhhHHHHhhHHHHhhhccc Confidence 1111111 11000000 00000012333333322222221111111 112343 Q ss_pred cceee-ech---------HHh---------ccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHH Q lcl|NC_012753. 284 QRRVA-VPT---------QMI---------KTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDY 344 (502) Q Consensus 284 ~~~i~-v~~---------~~l---------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~ 344 (502) ..||| |+- .+| +.+-+...++++.++-+..-...|..- --+|+.+.-|+++..--...+ T Consensus 275 eRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDyWLP-RReGgrgTEItTLpGgqnLge- 352 (564) T protein:vir:10 275 ERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDFWLP-RREGGRGTEITTLPGGQNLGE- 352 (564) T ss_pred cceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhhccc-ccCCCcccceeeccccCCcch- Confidence 44554 211 011 001122333333333222222222211 113333344666544322222 Q ss_pred HHHHHHHHHHHHHhcCCChhhccccccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCc Q lcl|NC_012753. 345 IKAINKGLSLFEMQLGVSTGMFSFDGKS--MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEI 422 (502) Q Consensus 345 ~~~l~~~l~~i~~~~g~s~~~~~~~~~~--~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~ 422 (502) +.-++.+.+.+....++|.+.+..++++ .--++||....-....-+.+++..|..-+.++++.-|.|-. ++...- T Consensus 353 m~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKg---iit~ee 429 (564) T protein:vir:10 353 LKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFAQLFHDILKTQLILKG---IITPED 429 (564) T ss_pred HHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---CCCHHH Confidence 2346667778888888887777765431 11244555444445567788888888888888887665432 222221 Q ss_pred ccc--cceEEEeCCCccCCHHHHHHHHHHHH---h------cCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhccc--- Q lcl|NC_012753. 423 PTM--DEVSVDLDDGVFTDRNAEFDYWSKMV---A------AGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVST--- 488 (502) Q Consensus 423 ~~~--~~i~v~f~d~i~~d~~~~~~~~~~~~---~------~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~--- 488 (502) +.. ..+.++|...--..+..+++.+..-+ + +-..|.+++.++..-.||+|.+++.+.|++|..... T Consensus 430 W~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~ 509 (564) T protein:vir:10 430 WDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTENEFKEIDKQMKSDIESGLAID 509 (564) T ss_pred HHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCC Confidence 111 34778886654444444444433211 1 124699998888889999999999999999865432 Q ss_pred -------------CCCCCccccCCCCC Q lcl|NC_012753. 489 -------------DSFRTSEEVDIYGE 502 (502) Q Consensus 489 -------------~~~~~~~~~~~~g~ 502 (502) +....|.+.+..|. T Consensus 510 P~e~~~~~~~~~~~~~~~p~~~~~~~~ 536 (564) T protein:vir:10 510 PIQVNMLDDMEKQNQAFAPELQAAQDD 536 (564) T ss_pred chhhhcCCCccCCCCcCCcchhhhccc Confidence 11111222233332 No 191 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=96.82 E-value=0.00032 Score=39.76 Aligned_cols=400 Identities=11% Similarity=0.019 Sum_probs=156.2 Q ss_pred HHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhc Q lcl|NC_012753. 10 FIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFN 89 (502) Q Consensus 10 ~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ 89 (502) ++...| +. ..+|+..+....-...|.+.+..-.................+.-..+|+.+|+-+-+ T Consensus 1 ~~~~~~-----~~----------~~~p~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~V~acV~~IA~~iA~ 65 (518) T protein:vir:78 1 MLLANG-----QT----------LSAPAMAELSPQMQDSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR 65 (518) T ss_pred CcccCc-----ee----------eccchhhhhhhhhhhcccccceeceecccccchhhHHhhhhHHHHHHHHHHHHhhcc Confidence 111111 11 112222111111112222211100000000000000111112224456666666655 Q ss_pred CcceEee-CC-H---HHHHHHHHHHhh-c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEc Q lcl|NC_012753. 90 EQATIRV-DN-E---VADAFINETLKN-D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQAN 158 (502) Q Consensus 90 ep~~i~~-~d-~---~~~e~l~~~~~~-~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d 158 (502) =|+.+-- ++ . .....+..++.. | ....-...++...+..|.+|+.+..+. |.+ .+..++|+.+-+.... T Consensus 66 lp~~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~Vtv~~~~ 145 (518) T protein:vir:78 66 LPVKCMFTSGDTETEEHDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNS 145 (518) T ss_pred CceEEEEEcCCccccccchHHHHHHhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEECCCceEEEEcC Confidence 5555421 11 1 111122333332 2 122334455666777899998887765 443 5666777766554322 Q ss_pred CCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEE Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFT 238 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~ 238 (502) .+... .+++ . ..++.+ +..+.+ +. . -.+ T Consensus 146 ~~~~~----~y~~-~----------------~~~~~~---------------~~~~~~---~~---~----------eIi 173 (518) T protein:vir:78 146 RTGRY----EYYF-Q----------------AGAGVG---------------TQLVSF---AD---D----------EVV 173 (518) T ss_pred CCCEE----EEEE-E----------------ecCCcc---------------ceeEEe---cC---C----------cEE Confidence 11111 0000 0 000000 000000 00 0 023 Q ss_pred EecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc-chh Q lcl|NC_012753. 239 YLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG-HNV 317 (502) Q Consensus 239 ~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~ 317 (502) +|+.+.++ +...|+|.+.-+...|.....+-....+-|..+...=.| |.....-+.... ..+... ... T Consensus 174 Hir~~~~d----g~~~G~Spi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gv----l~~~~~ls~e~~---~~~k~~~~~~ 242 (518) T protein:vir:78 174 PIRFFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLV----LRHEKRLSPEAQ---QRLREQFDRA 242 (518) T ss_pred EecCCCCC----cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEE----EecCCCCCHHHH---HHHHHHHHHH Confidence 34432211 223588888776666655544443333345554432111 222111000000 000000 011 Q ss_pred hccccCCC----CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDSGD----MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNS 392 (502) Q Consensus 318 ~~~~~~~~----~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~ 392 (502) +....... -+.+.-++.++.....-++.+..+....+|+...|++|..+|+...+. +++.+.... T Consensus 243 ~~G~~nag~~~vL~~G~~~~~l~~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~---------- 312 (518) T protein:vir:78 243 HAGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA---------- 312 (518) T ss_pred hcCcccCCceeEcCCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH---------- Confidence 11000000 011223455555555667888888888999999999999998765432 222222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCH Q lcl|NC_012753. 393 IATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTK 470 (502) Q Consensus 393 ~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~d 470 (502) .++.+|..++..|-...+. .+.. .......+.|+.+.-+..|..+.++...+++.+|+|+.-++++.. +++.+ T Consensus 313 ---f~~~tL~P~~~~ie~eln~-~L~~-~~~~~~~~~fd~~~Llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~gl~pie~ 387 (518) T protein:vir:78 313 ---FYRDTMAIPIARIQSAMDK-YVGQ-YWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDD 387 (518) T ss_pred ---HHHHHHHHHHHHHHHHHHH-hhcc-cccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 1122233322222211111 0111 111123455555566778999999999999999999999876543 23332 Q ss_pred HHHHHH-----HHHHHHh-----hhcccCCCCCccccCC--CCC Q lcl|NC_012753. 471 EQAQEI-----YQKINDE-----TMVSTDSFRTSEEVDI--YGE 502 (502) Q Consensus 471 eea~~e-----l~ri~~E-----~~~~~~~~~~~~~~~~--~g~ 502 (502) ....+. +..+..- +....+..+.+....+ .++ T Consensus 388 ~~gD~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 431 (518) T protein:vir:78 388 PKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQ 431 (518) T ss_pred CCCceeeecccceecccccccccCCCCCCCCCCCCccccccccc Confidence 222211 1111110 0011111111111111 011 No 192 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=96.76 E-value=0.00036 Score=39.49 Aligned_cols=389 Identities=15% Similarity=0.086 Sum_probs=156.9 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) =+|+++||..+...+.. .+..... .+.. |.++. +.... .+..+..+--...| T Consensus 4 ~~~~~~~k~~~~~~~~~---~~~~~~~----------------~~~~-~~~~~----~~~v~----~~~a~~~~~v~~~i 55 (409) T protein:vir:94 4 ENIVTRIKKKLIDNWID---QSASKLY----------------DFSP-WKNKS----FWGVI----NNTLETNETIFSAI 55 (409) T ss_pred cccchhhhhHHhhhhhc---CCccccc----------------cccc-ccCcc----ccccc----hhhhhccHHHHHHH Confidence 35677777765321110 0000000 0000 11110 00000 01111222223445 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFF 153 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~ 153 (502) +..|+-+-.=|+.+--..+.....+..+|.. | ....-...++...+..|.+|+.+..+. |. ..+..++|+.+- T Consensus 56 ~~Ia~~ia~lp~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~l~~~~v~ 135 (409) T protein:vir:94 56 TKLSNSMASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVE 135 (409) T ss_pred HHHHHhhhhCceeEeecccccchhHHHHHhhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCceeE Confidence 5555555555554422222222223333321 1 223334455677788999998887764 34 356667777765 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) ++..+.++.. +|+ +. .. + |..+.+ ++ . T Consensus 136 v~~~~~~~~~-----------------~y~---~~-~~--------------~----g~~~~~---~~---~-------- 162 (409) T protein:vir:94 136 MLIENQSREL-----------------YYS---IH-AA--------------T----GNKLIV---HN---M-------- 162 (409) T ss_pred EEEeCCCcEE-----------------EEE---EE-cC--------------C----ceEEEE---cc---c-------- Confidence 5433222110 010 00 00 0 111100 00 0 Q ss_pred cceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-cccccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFE 312 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~ 312 (502) -.++|+.+. ..+..+|+|.+.-+...++....+.......+..+...| +......+..... ....|. T Consensus 163 --dvih~r~~~----~~~~~~G~s~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i------~~~~~~l~~e~~~~~~~~~~ 230 (409) T protein:vir:94 163 --DMLHFKHIV----ASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFM------LKYGSNVGKEKRQQVLEDFK 230 (409) T ss_pred --cEEEecCCC----CCCccccccHHHHHHHHHHHHHHHHHHHHHhcCCCCeeE------EecCCCCCHHHHHHHHHHHH Confidence 023343221 123346888887776666644333221111122111112 2111111111000 000000 Q ss_pred ccchhhccc-cCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHH Q lcl|NC_012753. 313 TGHNVYEQF-DSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMR 390 (502) Q Consensus 313 ~~~~~~~~~-~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~ 390 (502) ..+..- ...--+.+.-++.++......++.+..+....+|+...|+||..+|....+. ++..+.... T Consensus 231 ---~~~~~~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~-------- 299 (409) T protein:vir:94 231 ---QYYEENGGILFQEPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF-------- 299 (409) T ss_pred ---HHhhcCCCeeecCCCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH-------- Confidence 001000 0000012223555555555667888888888999999999999998654332 223222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH Q lcl|NC_012753. 391 NSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK 470 (502) Q Consensus 391 ~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d 470 (502) .++.+|..++..|-...+..-+..........+.++.+.-+-.|..+.++...+++.+|+++.-++++. .|+.+ T Consensus 300 -----f~~~~l~P~~~~ie~~ln~~Ll~~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~-~g~~p 373 (409) T protein:vir:94 300 -----YLQHTLLPIVKQYEEEFNRKLLTKTDREKNRYFKFNVKSYLRADSATQAEVYFKAVRSGYYTINDIREW-EDLPP 373 (409) T ss_pred -----HHHHHHHHHHHHHHHHHHHhhCCcccccCcceEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH-hCCCC Confidence 122233333333322111100111111112234444444456788899999999999999999887654 34433 Q ss_pred HH-HHHHHHHHHHhhhcccCC----CCCccccCCCCC Q lcl|NC_012753. 471 EQ-AQEIYQKINDETMVSTDS----FRTSEEVDIYGE 502 (502) Q Consensus 471 ee-a~~el~ri~~E~~~~~~~----~~~~~~~~~~g~ 502 (502) -+ .++-+.. -+....+. .....++|-.|. T Consensus 374 ~~ggD~~~~~---~n~~~~~~~~~~~~~~kGG~~n~~ 407 (409) T protein:vir:94 374 VEGGDKPLIS---GDLYPIDTPLELRKSLKGGDKNVN 407 (409) T ss_pred CCCcCeEeec---ccccccccchhhcccccCCCCCcC Confidence 11 1111100 00000011 112233333333 No 193 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=96.74 E-value=0.00037 Score=39.40 Aligned_cols=388 Identities=9% Similarity=0.018 Sum_probs=160.9 Q ss_pred CChhHHHHHHHHHHhhcccccc---hhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQS---LNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGR 77 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~---l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k 77 (502) |+.|+-++ -++.|.-.-.... -++... .....+.-......+.....+. +. ..-...+... T Consensus 1 ~~~~~~~~-~~~~m~~F~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~----------~~---~~~~~~~~v~ 64 (413) T protein:vir:96 1 MPGVSEIR-KDKNLKFFNNKRSPTEESKAKD--EIPKAPQVVMTLPNFFKELISD----------GY---TKLSDSPEVR 64 (413) T ss_pred CCccchhh-hhhcCCccccCCCcchhhhhhc--cccccccccccchhhHhhhccc----------hh---HHHhhchHHH Confidence 88888655 2222211000000 000000 0000000000000111100000 00 0011124445 Q ss_pred HHHHHHhhhhhcCcceEeeCC----HHHHHHHHHHHh--hc---cHHHHHHHHHHHHhhcCCEEEEEEEeC-C-ce-EEE Q lcl|NC_012753. 78 TASKKVASLVFNEQATIRVDN----EVADAFINETLK--ND---KFSKNFERYLESCLALGGLAMRPYIDG-D-QI-RVS 145 (502) Q Consensus 78 ~iv~~~a~~l~~ep~~i~~~d----~~~~e~l~~~~~--~~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~-~~-~i~ 145 (502) .+|+..|+-+..-|+.+--.+ +.....+..++. -| ....-+..++...+..|.+|+.+..+. | .+ .+. T Consensus 65 ~cI~~ia~~ia~~~~~~~~~~~~~~~~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~i~r~~~g~~~~~L~ 144 (413) T protein:vir:96 65 MAVDCIADLVSNMTIQLMQNGETGDKRIKNDLSRVVDIEPNKYLSRKTFIQWLVRSMLLEGNGNAVVKPQVSGDKIIGLT 144 (413) T ss_pred HHHHHHHHhhccCceEEEEecCCCccccccHHHHHHHhccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCceEEEE Confidence 677777777776666542111 111222333332 12 234455667788888999999888874 3 33 577 Q ss_pred EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCc Q lcl|NC_012753. 146 FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEE 225 (502) Q Consensus 146 ~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~ 225 (502) .++|..+-+.. +.+.+ .|.. ..+++ .++- . T Consensus 145 ~l~~~~v~~~~-~~~~~------------------~y~~----~~~~~-------------------~~~~--------~ 174 (413) T protein:vir:96 145 PISPYKVTFNV-SDDDL------------------DYSI----TFDNK-------------------EYDP--------S 174 (413) T ss_pred EecCceeEEEE-cCCeE------------------EEEE----eecCc-------------------EEch--------h Confidence 77787765532 22210 1100 00000 0000 0 Q ss_pred ceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCccc Q lcl|NC_012753. 226 TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKV 305 (502) Q Consensus 226 ~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~ 305 (502) + .++|+.+. + ....-.|.|.+..+...+...........+-|..+...-.+ |......+.... T Consensus 175 e----------vih~k~~~-~--~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gi----l~~~~~l~~e~~ 237 (413) T protein:vir:96 175 T----------LLHFVLNP-S--IERPFIGTGYKVALKDIVGNLKQASVTKKGFMASEYMPNLI----VSVDSDSDELSD 237 (413) T ss_pred h----------EEEEeccC-C--CCCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE----EEeCCCCCHHHH Confidence 0 13343210 0 01122488887777776655554433333334554432222 222111111000 Q ss_pred Cccccccccc-hhhcc-------ccCCCCccccceeeec-cccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccH Q lcl|NC_012753. 306 TVKREFETGH-NVYEQ-------FDSGDMDKGIGITDLT-TDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTA 376 (502) Q Consensus 306 ~~~~~~~~~~-~~~~~-------~~~~~~~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tA 376 (502) ..+.... ..+.. +....+. ..++.+. .....-++++..+...++|+...|+|+..+|..... .+ T Consensus 238 ---~~~~~~~~~~~~g~~n~g~~~vl~~~~--~~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~--~~ 310 (413) T protein:vir:96 238 ---EEGRENFEEMYLKRKEAGKPWIIPEGM--VNVQQIKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLLGVGTYN--KD 310 (413) T ss_pred ---HHHHHHHHHHhcCccccCceeeecCCc--ccccccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCcch--HH Confidence 0000000 10110 1111111 1122221 123345677777788899999999999999743211 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_012753. 377 TEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFA 456 (502) Q Consensus 377 tei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~ 456 (502) +.. ..++.+|..+++.|....+.. ++. +...+.+++++-+..|..+.++...+++.+|++ T Consensus 311 ~~~---------------~~~~~~l~P~~~~ie~~ln~~-ll~----~~~~~~fd~~~ll~~d~~~~~~~~~~~~~~G~~ 370 (413) T protein:vir:96 311 EFN---------------NFINTKIMSIAQVIQQTYNKL-IVE----EDMYFSLNPRSLYNYSLTEMVSAGAQMTQLNAL 370 (413) T ss_pred HHH---------------HHHHHHHHHHHHHHHHHHHHh-hCC----CCcEEEEechhhhccCHHHHHHHHHHHHhCCCc Confidence 111 123334444444443332221 222 123466666676778989999999999999999 Q ss_pred CHHHHHHhcCCCCH-HHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 457 PKTMAIEKTLNVTK-EQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 457 S~et~l~~~~~~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) +.-++++. .|+.. +..++-+....-......+....+.++|= T Consensus 371 t~NE~R~~-~g~~p~~~gd~~~~~~n~~~~~~~~~~~~~~~~dt 413 (413) T protein:vir:96 371 RRNEFRNW-VGMPPDAEMDDLLVLENYLQQKDLVNQKKLIQDET 413 (413) T ss_pred CHHHHHHH-hCCCCCCCcceeeecccccchhhcccccCCCCCCC Confidence 99997654 35433 11221110000000000011111111111 No 194 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=96.66 E-value=0.00043 Score=39.04 Aligned_cols=449 Identities=12% Similarity=0.125 Sum_probs=187.9 Q ss_pred CChhHHHHHHHHHHhhcccccchh-hhhccccccC-CH-HHH-----H-HHHH-HHHHhcCCCCccccccCCCccccccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLN-SITDHPKIAI-SP-EEY-----N-RIMD-NLRYFAGDFDSVTYRDSNGSQVKRDF 70 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~-~i~~~~~~~~-~~-~~~-----~-~i~~-~~~~Y~g~~~~~~~~~~~~~~~~~~~ 70 (502) |++.+-.+-|.|.--... .+.++ +...-..+.. +. ..+ . .+-- ...||.....+ . .....-+++ T Consensus 1 ~~~~~lf~f~~~~d~~~~-~~~~~~~~~s~~~p~~~DGa~~i~~~~~~~~~~g~~~~~~d~~~~~-~----~~~~LI~~Y 74 (516) T protein:vir:10 1 MKFLDLFKFWDRVDQNEY-DERLKQGHESIATPKKDDGATEIEAREGESSYNALMQQFFGIDNNI-S----GTKDLINTY 74 (516) T ss_pred CCchHhcccccchhhHHH-HhhhcCCCCcccCCCCccCceeeecCcccccccceeeeeecccCcc-c----cHHHHHHHH Confidence 776665555543110000 00000 0000000000 00 000 0 0000 00000000000 0 000000000 Q ss_pred eec-chH--HHHHHHHhhh-----hhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 71 NHL-PIG--RTASKKVASL-----VFNEQATIRVDNE--------VADAFINETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 71 ~~~-n~~--k~iv~~~a~~-----l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) .++ ..| --.|+..++= -..+|+.+.+++- ...+..+.+++--+|+++..+.+....+-|..|++ T Consensus 75 R~ma~~pEvd~Av~eIvneaiv~d~~~~pV~l~l~~~e~s~sik~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fh 154 (516) T protein:vir:10 75 RQLTNNPEVERAVANIVNEAVVYEKGHKVVSLDLDDTEFSSSIKDKILEEFDEICRLLDASRKLDTLFRRWYIDSRIFFH 154 (516) T ss_pred HHhhhccchhHHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhHHHHhhhhcceEEEE Confidence 000 000 1111111111 1135666666642 24556677777779999999999999999999999 Q ss_pred EEEeC---CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEE-eCCeEEEEE-EEEecCCccc Q lcl|NC_012753. 135 PYIDG---DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEW-NKETYTISN-ELYESESKTI 209 (502) Q Consensus 135 ~~~d~---~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~-~~~~~~I~~-~l~~~~~~~~ 209 (502) .+.|+ |=..+..++|..+-++..--........+.. ...|++.. .+..++.-+ ..|...+ T Consensus 155 Kiid~~k~GI~elr~lDPr~i~~vR~i~~~~~~~~~v~~------------~~~e~~~Y~~~~~~~~~~g~~~~~~~--- 219 (516) T protein:vir:10 155 KIMPNPKEGIVELRRLDPRHVEYYREIVTSDVGGTSVVK------------GYREFFVYTTGNEGYAYNGRLFEPNT--- 219 (516) T ss_pred EEecCcccceeeeeeeCCcceeeEEeeecccCcchhhhh------------ceeeeeeeecCccceeccccccCCCC--- Confidence 88874 2357888999988876432110000000000 00111110 001111000 0111100 Q ss_pred cCceeecccc-ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH--HHHhhccce Q lcl|NC_012753. 210 IGQRVPLSTL-YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM--WEVKMGQRR 286 (502) Q Consensus 210 lG~~v~l~~~-~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~--~~~~~~~~~ 286 (502) +-.+|-+.+ |. -+|+. ++ ....=+|-|..+......|=-+-+.++ +-.++-..| T Consensus 220 -~ikI~~daI~y~-------hSGl~--d~-------------~~~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRR 276 (516) T protein:vir:10 220 -RIKIPRSAIVYA-------HSGLQ--DC-------------SDRGIVGYLHNAVKPANQLKLLEDALVIYRITRAPERR 276 (516) T ss_pred -ceecchhheeee-------ecCcc--cC-------------CCCceeceehhhhHhHHhhHHHHhhHHHHhhhccccce Confidence 111121111 10 01110 00 000002333333222222211111111 112333344 Q ss_pred ee-ech---------HHh---------ccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHH Q lcl|NC_012753. 287 VA-VPT---------QMI---------KTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKA 347 (502) Q Consensus 287 i~-v~~---------~~l---------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~ 347 (502) || |+- .++ +-+-+...++++.++-+..-...|..- --+|+.+.-|+++..--...+ ++- T Consensus 277 vFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEItTLpGgqnlge-m~D 354 (516) T protein:vir:10 277 VFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLM-RRDGKSVTEVTSLPGAQTMGE-MDD 354 (516) T ss_pred EEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhccc-ccCCCcccceeeccccCCcCh-HHH Confidence 44 211 000 001122233333322222222222211 113333344666554322222 244 Q ss_pred HHHHHHHHHHhcCCChhhcccccccc---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccc Q lcl|NC_012753. 348 INKGLSLFEMQLGVSTGMFSFDGKSM---KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPT 424 (502) Q Consensus 348 l~~~l~~i~~~~g~s~~~~~~~~~~~---~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~ 424 (502) ++.+.+.+....++|.+.+..++++. .-++||.-..-....-+.+++..|..-+.++++.-|.+-. ++...-+. T Consensus 355 V~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLilKg---Iit~eeW~ 431 (516) T protein:vir:10 355 VRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIYKK---IILESEWE 431 (516) T ss_pred HHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcC---CCCHHHHH Confidence 66777788888888888876554321 2345665444445567777888888888888776665432 22211111 Q ss_pred c--cceEEEeCCCccCCHHHHHHHHHH-------HH--hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCC Q lcl|NC_012753. 425 M--DEVSVDLDDGVFTDRNAEFDYWSK-------MV--AAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRT 493 (502) Q Consensus 425 ~--~~i~v~f~d~i~~d~~~~~~~~~~-------~~--~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~ 493 (502) . ..+.++|...--..+..+++.+.+ +. -+...|.+++.++....||+|.+++.+.|++|....--.. + T Consensus 432 ~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~~~~~~~~-p 510 (516) T protein:vir:10 432 EQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQIEKEANVKRFQN-P 510 (516) T ss_pred HHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHHHHHHHhhhCCCCCC-C Confidence 1 246777866544444444443332 11 2357899998888889999999999999999976532111 1 Q ss_pred ccccCC Q lcl|NC_012753. 494 SEEVDI 499 (502) Q Consensus 494 ~~~~~~ 499 (502) ....+| T Consensus 511 ~~e~~f 516 (516) T protein:vir:10 511 ENEDDF 516 (516) T ss_pred CccccC Confidence 222344 No 195 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=96.60 E-value=0.00048 Score=38.77 Aligned_cols=396 Identities=12% Similarity=0.091 Sum_probs=157.7 Q ss_pred HHhhcccc--cchhhh-hccccccCCHHHHHHHHHHHHHhcCCCCccccc---------------cCCCccccccceecc Q lcl|NC_012753. 13 RSNYVITN--QSLNSI-TDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYR---------------DSNGSQVKRDFNHLP 74 (502) Q Consensus 13 ~~~~~~~~--~~l~~i-~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~---------------~~~~~~~~~~~~~~n 74 (502) ..||..-- +.++.= ..+.++.+. --+...+.+.+... ...+..... ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~----------~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~al~ 69 (441) T protein:vir:94 1 MHWYNTDCYFVDFKSRKQSRKELVVV----------GIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD-IEAIR 69 (441) T ss_pred CccccCccccccccccccchhhhhcc----------ccccccccccccCCCcchHHHHHHhcccCcccccccch-hhhhc Confidence 11111100 000000 000000000 00000000000000 000000000 00011 Q ss_pred hH--HHHHHHHhhhhhcCcceEeeCCHH-HHHHHHHHHh--hccH---HHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EE Q lcl|NC_012753. 75 IG--RTASKKVASLVFNEQATIRVDNEV-ADAFINETLK--NDKF---SKNFERYLESCLALGGLAMRPYIDG-DQI-RV 144 (502) Q Consensus 75 ~~--k~iv~~~a~~l~~ep~~i~~~d~~-~~e~l~~~~~--~~~f---~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i 144 (502) .+ -..|+..|+-+-+=|..+.-+++. ....+-.+|. -|.+ ..-...++...+..|.+|+.+..+. |.+ .+ T Consensus 70 ~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L 149 (441) T protein:vir:94 70 HSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNL 149 (441) T ss_pred cHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 11 124555555555555544322221 1222333332 1222 2334455666788899999888775 443 67 Q ss_pred EEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC Q lcl|NC_012753. 145 SFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE 224 (502) Q Consensus 145 ~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~ 224 (502) ..++|+++-+...+.+.+ .+.++..+ +....+. ..|.. T Consensus 150 ~~i~~~~v~v~~d~~g~~-----~~~~~~~~----------------~~~~~~~-~~~~~-------------------- 187 (441) T protein:vir:94 150 TFRKTSEIELKSDARGRL-----YYFHQRID----------------SNGNNIE-RNVKF-------------------- 187 (441) T ss_pred EEEcCceeEEEECCCccE-----EEEEEEec----------------cCCceeE-EEEcc-------------------- Confidence 788888877653222211 00000000 0000000 00000 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-HhhccceeeechHHhccCCCCCCc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~~~l~~~~~~~g~ 303 (502) . -.++|+.+ ..+...|+|.+..+...|+. ......+... |+.+...-.| |......... T Consensus 188 ~----------dvih~k~~-----~~dg~~G~spl~~~~~~i~~-~~~~~~~~~~~f~ng~~p~gi----l~~~~~~~~~ 247 (441) T protein:vir:94 188 E----------DMLDIKFY-----SLDGINGLSLLDTLSRTIES-DNNGKDFLNNFLRNGTHAGGI----LKMKGVLDNK 247 (441) T ss_pred c----------cEEEeccC-----CCCCccccCHHHHHHHHHHH-HHHHHHHHHHHHhccCCCcEE----EEcCCCCCCH Confidence 0 01233322 11234688988877777763 3334444443 4544332122 2221111110 Q ss_pred ccC--ccccccccchhhccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHH Q lcl|NC_012753. 304 KVT--VKREFETGHNVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT 377 (502) Q Consensus 304 ~~~--~~~~~~~~~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt 377 (502) +-. ....|. ..+..... .--+++.-++.++.....-++.+..+...++|+...|+||..+|.+..+. +.+ T Consensus 248 e~~e~~r~~~~---~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~ 323 (441) T protein:vir:94 248 KARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SIT 323 (441) T ss_pred HHHHHHHHHHH---HHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHH Confidence 000 000010 11110000 00011223556666666677888888889999999999999998654432 223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 378 EVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 378 ei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) +....+. .+..-..+.++..|... +.. . .....+.++++.-+-.|..+.++...+++.+|+++ T Consensus 324 q~~~~~~---~tl~P~~~~ie~eln~k------------l~~-~-~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T 386 (441) T protein:vir:94 324 DANLDYL---STLKPYITCVCAELNFK------------FND-E-YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMN 386 (441) T ss_pred HHHHHHH---HHHHHHHHHHHHHHhhh------------ccc-c-ccCceEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 3222221 12222222232222221 111 1 11234555555657778899999999999999999 Q ss_pred HHHHHHhc--CCCCHHHHHH-----H---HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 458 KTMAIEKT--LNVTKEQAQE-----I---YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 458 ~et~l~~~--~~~~deea~~-----e---l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .-++++.. +|+..-+... - ++.+.+.+......-..+..+|=-+| T Consensus 387 ~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 387 IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 99976543 2332211000 0 01111111111111222223333333 No 196 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=96.60 E-value=0.00048 Score=38.77 Aligned_cols=396 Identities=12% Similarity=0.091 Sum_probs=157.7 Q ss_pred HHhhcccc--cchhhh-hccccccCCHHHHHHHHHHHHHhcCCCCccccc---------------cCCCccccccceecc Q lcl|NC_012753. 13 RSNYVITN--QSLNSI-TDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYR---------------DSNGSQVKRDFNHLP 74 (502) Q Consensus 13 ~~~~~~~~--~~l~~i-~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~---------------~~~~~~~~~~~~~~n 74 (502) ..||..-- +.++.= ..+.++.+. --+...+.+.+... ...+..... ...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~----------~lf~~~e~R~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~al~ 69 (441) T protein:vir:79 1 MHWYNTDCYFVDFKSRKQSRKELVVV----------GIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYKD-IEAIR 69 (441) T ss_pred CccccCccccccccccccchhhhhcc----------ccccccccccccCCCcchHHHHHHhcccCcccccccch-hhhhc Confidence 11111100 000000 000000000 00000000000000 000000000 00011 Q ss_pred hH--HHHHHHHhhhhhcCcceEeeCCHH-HHHHHHHHHh--hccH---HHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EE Q lcl|NC_012753. 75 IG--RTASKKVASLVFNEQATIRVDNEV-ADAFINETLK--NDKF---SKNFERYLESCLALGGLAMRPYIDG-DQI-RV 144 (502) Q Consensus 75 ~~--k~iv~~~a~~l~~ep~~i~~~d~~-~~e~l~~~~~--~~~f---~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i 144 (502) .+ -..|+..|+-+-+=|..+.-+++. ....+-.+|. -|.+ ..-...++...+..|.+|+.+..+. |.+ .+ T Consensus 70 ~~~V~~cv~~Ia~~iA~lp~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L 149 (441) T protein:vir:79 70 HSDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNL 149 (441) T ss_pred cHHHHHHHHHHHHhhccCceeeecCccccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEE Confidence 11 124555555555555544322221 1222333332 1222 2334455666788899999888775 443 67 Q ss_pred EEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC Q lcl|NC_012753. 145 SFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE 224 (502) Q Consensus 145 ~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~ 224 (502) ..++|+++-+...+.+.+ .+.++..+ +....+. ..|.. T Consensus 150 ~~i~~~~v~v~~d~~g~~-----~~~~~~~~----------------~~~~~~~-~~~~~-------------------- 187 (441) T protein:vir:79 150 TFRKTSEIELKSDARGRL-----YYFHQRID----------------SNGNNIE-RNVKF-------------------- 187 (441) T ss_pred EEEcCceeEEEECCCccE-----EEEEEEec----------------cCCceeE-EEEcc-------------------- Confidence 788888877653222211 00000000 0000000 00000 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-HhhccceeeechHHhccCCCCCCc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~~~l~~~~~~~g~ 303 (502) . -.++|+.+ ..+...|+|.+..+...|+. ......+... |+.+...-.| |......... T Consensus 188 ~----------dvih~k~~-----~~dg~~G~spl~~~~~~i~~-~~~~~~~~~~~f~ng~~p~gi----l~~~~~~~~~ 247 (441) T protein:vir:79 188 E----------DMLDIKFY-----SLDGINGLSLLDTLSRTIES-DNNGKDFLNNFLRNGTHAGGI----LKMKGVLDNK 247 (441) T ss_pred c----------cEEEeccC-----CCCCccccCHHHHHHHHHHH-HHHHHHHHHHHHhccCCCcEE----EEcCCCCCCH Confidence 0 01233322 11234688988877777763 3334444443 4544332122 2221111110 Q ss_pred ccC--ccccccccchhhccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHH Q lcl|NC_012753. 304 KVT--VKREFETGHNVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT 377 (502) Q Consensus 304 ~~~--~~~~~~~~~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt 377 (502) +-. ....|. ..+..... .--+++.-++.++.....-++.+..+...++|+...|+||..+|.+..+. +.+ T Consensus 248 e~~e~~r~~~~---~~~~G~~nag~~~vl~~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~ 323 (441) T protein:vir:79 248 KARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SIT 323 (441) T ss_pred HHHHHHHHHHH---HHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHH Confidence 000 000010 11110000 00011223556666666677888888889999999999999998654432 223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 378 EVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 378 ei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) +....+. .+..-..+.++..|... +.. . .....+.++++.-+-.|..+.++...+++.+|+++ T Consensus 324 q~~~~~~---~tl~P~~~~ie~eln~k------------l~~-~-~~~~~~~fd~~~llr~D~~~~~~~~~~~i~~G~~T 386 (441) T protein:vir:79 324 DANLDYL---STLKPYITCVCAELNFK------------FND-E-YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMN 386 (441) T ss_pred HHHHHHH---HHHHHHHHHHHHHHhhh------------ccc-c-ccCceEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 3222221 12222222232222221 111 1 11234555555657778899999999999999999 Q ss_pred HHHHHHhc--CCCCHHHHHH-----H---HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 458 KTMAIEKT--LNVTKEQAQE-----I---YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 458 ~et~l~~~--~~~~deea~~-----e---l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .-++++.. +|+..-+... - ++.+.+.+......-..+..+|=-+| T Consensus 387 ~NE~R~~~gl~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 387 IDEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 99976543 2332211000 0 01111111111111222223333333 No 197 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=96.57 E-value=0.0005 Score=38.67 Aligned_cols=400 Identities=11% Similarity=0.025 Sum_probs=155.6 Q ss_pred HHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhc Q lcl|NC_012753. 10 FIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFN 89 (502) Q Consensus 10 ~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ 89 (502) ++...|..+.+-++.. ....+. ..|.+....-.................+.-..+|+..|+-+-+ T Consensus 1 ~~~~~~~~~~~p~~~e------------~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~a~~~~~V~acV~~IA~~iA~ 65 (518) T protein:vir:10 1 MLLANGQTLSAPAMAE------------LSPQMQ---DSYYYAPAVGMQLERQFSLYGGIYKNQPWVRTVIAKRAQALAR 65 (518) T ss_pred CcccCceeecCchhhh------------hhhhhh---cccccccccceecccccchhhHHHhhhHHHHHHHHHHHHhhcc Confidence 2222233222222111 111111 1111110000000000000000011112224455555555544 Q ss_pred CcceEee---CCH--HHHHHHHHHHhh-c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEc Q lcl|NC_012753. 90 EQATIRV---DNE--VADAFINETLKN-D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQAN 158 (502) Q Consensus 90 ep~~i~~---~d~--~~~e~l~~~~~~-~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d 158 (502) =|+.+-- ++. .....+..++.. | ....-...++...+..|.+|+.+..+. |.+ .+..++|+.+-+.... T Consensus 66 lpl~l~~~~~~~~~~~~~~~~~~Ll~~PN~~~t~~~F~~~lv~~lll~Gnay~~i~r~~~G~~~~L~~l~p~~v~v~~~~ 145 (518) T protein:vir:10 66 LPVKCMFTSGDTETEESDTGYAKLLADPCEYLDPFAFWEWVASTLDIYGETYLAIQKNKSGTPEKLMPMHPSRVAIKRNS 145 (518) T ss_pred CceEEEEEcCCCceeccchHHHHHHcCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCCceEEEEcC Confidence 4544311 111 111223333332 2 222344455666778899998887765 443 5667777776554322 Q ss_pred CCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEE Q lcl|NC_012753. 159 TQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFT 238 (502) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~ 238 (502) .+... ++.+ ... ++.. +..+. +.. --.+ T Consensus 146 ~~~~~----~y~~-~~~----------------~~~~---------------~~~~~-------------~~~---~eVi 173 (518) T protein:vir:10 146 RTGRY----EYYF-QAG----------------AGVG---------------TQLVS-------------FAD---DEVV 173 (518) T ss_pred CCCEE----EEEE-Eec----------------CCcc---------------ceEEE-------------ecC---CcEE Confidence 11110 0000 000 0000 00000 000 0123 Q ss_pred EecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc-chh Q lcl|NC_012753. 239 YLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG-HNV 317 (502) Q Consensus 239 ~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~ 317 (502) +|+.+..+ +...|+|.+.-+...|.....+-....+-|..+...=.| |.....-+ .+-. ..+... ... T Consensus 174 Hir~~s~d----g~~~G~spi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gi----l~~~~~ls-~e~~--~~~k~~~~~~ 242 (518) T protein:vir:10 174 PIRFFNPD----GLERGLSLMESLKSTIFSEDSSRNATAAMWKNAGRPNLV----LRHEKRLS-EAAQ--QRLREQFDRA 242 (518) T ss_pred EecCCCCC----cccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEE----EecCCCCC-HHHH--HHHHHHHHHH Confidence 34433221 234688888776666655554444333445554332111 22111100 0000 000000 001 Q ss_pred hccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 318 YEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNS 392 (502) Q Consensus 318 ~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~ 392 (502) +..... .--+.+.-++.++.....-++.+..+....+|+...|++|..+|+...+. +++++....+ ...++.- T Consensus 243 ~~G~~nag~v~vL~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~lg~~~~~t~sn~eq~~~~f--~~~tL~P 320 (518) T protein:vir:10 243 HSGSSNTGKTMVVEEGMEPIPLQLTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRAF--YRDTMAI 320 (518) T ss_pred hcCccccCcceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHHH--HHHHHHH Confidence 110000 00011223455555555567888888889999999999999998755432 2222221111 1112222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCH Q lcl|NC_012753. 393 IATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTK 470 (502) Q Consensus 393 ~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~d 470 (502) ..+.++..|... +.. .......+.|+.+.-+..|..+.++...+++.+|+++.-++++.. +++.+ T Consensus 321 ~l~~ie~~ln~~------------L~~-~~~~~~~~~fd~~~llr~D~~~r~~~~~~~~~~G~lT~NE~R~~~Gl~pie~ 387 (518) T protein:vir:10 321 PIARIQSAMDKY------------VGQ-YWVRKNRMKFDIDDVIQPDWEAKSESTQKMVNSGVATPNEGREIMGLPRSDD 387 (518) T ss_pred HHHHHHHHHHHh------------hcc-cccCCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCC Confidence 222222222221 111 111123455555566678999999999999999999998876543 23332 Q ss_pred HHHHHH-----HHHHHHh-----hhcccCCCCCccccC-----------CCCC Q lcl|NC_012753. 471 EQAQEI-----YQKINDE-----TMVSTDSFRTSEEVD-----------IYGE 502 (502) Q Consensus 471 eea~~e-----l~ri~~E-----~~~~~~~~~~~~~~~-----------~~g~ 502 (502) +..++. +..+..- .....+..+.+.... ..|- T Consensus 388 ~~gD~~~~~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 440 (518) T protein:vir:10 388 PKADELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPTSVPGL 440 (518) T ss_pred CCCCeeeecccceecccccccccCCCCCCCCCCCCccccccccccccccCCCC Confidence 222221 1111100 000011111111111 0000 No 198 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=96.56 E-value=0.00051 Score=38.64 Aligned_cols=393 Identities=13% Similarity=0.071 Sum_probs=148.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCC-CccccccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSN-GSQVKRDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~-~~~~~~~~~~~n~~k~i 79 (502) |+|+++|.....+ . . +++-. +..|.. ........ ......-...++.-..+ T Consensus 1 Mg~~~~~~~~~~~--~--~---------------~~~~~--------~~~~~~-~~~~~~~~~~~~~~~~~~~~~~v~~~ 52 (423) T protein:vir:81 1 MGFLQKLGLAPSV--V--A---------------TPEPI--------ELVGPI-FESLKLSTKNMTVEQIWEDQPHLRTV 52 (423) T ss_pred CchhHhhcccccc--c--c---------------Ccccc--------cccccc-ccccccccchhhHHHHHHhhhHHHHH Confidence 9999998421100 0 0 00000 011110 00000000 00000000112233456 Q ss_pred HHHHhhhhhcCcceE-e--eCCH---HHHHHHHHHHhh-c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcC Q lcl|NC_012753. 80 SKKVASLVFNEQATI-R--VDNE---VADAFINETLKN-D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQA 149 (502) Q Consensus 80 v~~~a~~l~~ep~~i-~--~~d~---~~~e~l~~~~~~-~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~ 149 (502) |+.+|+-+-+=|..+ . .++. ..+..+.+++.. | .....+..++...+..|.+|+.+.-|.+.. T Consensus 53 i~~ia~~ia~lp~~~~~~~~dg~~~~~~~~~~~~ll~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~rd~~~~------- 125 (423) T protein:vir:81 53 TTFIARNVASLQLQAFERVEDGGRERVREGHLARVCKLANSDMTMYDLLERTMFDLCLYDEFFWLLPGDLGVD------- 125 (423) T ss_pred HHHHHHhHhhCceEEEEEecCCceeeeccchHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcC------- Confidence 666666666656543 1 1221 112233344432 1 233344445667778898888776664422 Q ss_pred CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceee Q lcl|NC_012753. 150 TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTL 229 (502) Q Consensus 150 ~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~ 229 (502) ...+|+....-. .+-+.. ..++.....|+.... ...-|..+.+. . T Consensus 126 ~~~~~l~p~~~~---~v~~~~--~~~~~~~~~Y~~~~~-------------------~~~~g~~~~~~-------~---- 170 (423) T protein:vir:81 126 TPTLDIRPIPVS---WVQRRA--YKDGWGSLDYIIIES-------------------GDNDGRSVKVP-------G---- 170 (423) T ss_pred cceEEEeecccc---eeeeee--ccCCCcceEEEEEEe-------------------cCCCceEEEEc-------c---- Confidence 111222110000 000000 001111111211110 00112222110 0 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-Hhhccc-eeeechHHhccCCCCCCcccCc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQR-RVAVPTQMIKTEYDTNGEKVTV 307 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~-~i~v~~~~l~~~~~~~g~~~~~ 307 (502) --.++++.+..+ +...|+|.+..+...+...... .++... |..+.. ..+ |.......+..... T Consensus 171 -----~evih~r~~~~~----~~~~G~spi~~~~~~i~~~~~~-~~~~~~~f~ng~~p~gv-----i~~~~~~~~~~l~~ 235 (423) T protein:vir:81 171 -----ERVIHRHGYNPK----TMKRGKSPVQSLRDILGEQIEA-AIFRAQMWRNGPRPGMV-----IMRDPESKAGKWDA 235 (423) T ss_pred -----cceEEecCCCCC----CccccccHHHHHHHHHHHHHHH-HHHHHHHHhccCCCceE-----EEecCcccCccCCH Confidence 012344433222 2336899888777766544433 333333 444332 222 22111111111110 Q ss_pred cc--cccccch-hhccccCCCC-----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHH Q lcl|NC_012753. 308 KR--EFETGHN-VYEQFDSGDM-----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATE 378 (502) Q Consensus 308 ~~--~~~~~~~-~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAte 378 (502) .. .+....+ .+.......+ +.+..++.++.....-++.+..+....+|+...|+||..+|+..++. ++.++ T Consensus 236 e~~~~~~~~~~~~~~~~~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~ 315 (423) T protein:vir:81 236 ESRTRFMANLRASFSPKSSDVGGTLLLEDGMKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVRE 315 (423) T ss_pred HHHHHHHHHHHHHhccccccCCcceecCCCceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHH Confidence 00 0000000 0000000000 11223555555445557778777888899999999999998754432 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc--cccceEEEeCCCccCCHHHHHHHHHHHH-hcCC Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIP--TMDEVSVDLDDGVFTDRNAEFDYWSKMV-AAGF 455 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~--~~~~i~v~f~d~i~~d~~~~~~~~~~~~-~~Gi 455 (502) ....+ ...+..-..+.++.+|... +....-. ....+.++++.-+-.|..+.++...+++ ++|+ T Consensus 316 ~~~~f--~~~~L~P~~~~ie~~l~~~------------L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~l~~~G~ 381 (423) T protein:vir:81 316 FRKAL--YGDNLGSWIRIIQDVMNLF------------LLPRVGIDNEKFYFEFNLEEKLRASFEEAAEIKRAAVGNVAW 381 (423) T ss_pred HHHHH--HHHHHHHHHHHHHHHHhhh------------hcCccccccCccEEEecchhhhccCHHHHHHHHHHHHhCCCC Confidence 22111 1112222222333333221 1111101 1112344444445668888777777766 4699 Q ss_pred CCHHHHHHhcCCCCH-HHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 456 APKTMAIEKTLNVTK-EQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 456 ~S~et~l~~~~~~~d-eea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++.-++++. .|+.. +..++-+.- .+... .+..+-.|| T Consensus 382 ~T~NE~R~~-~gl~p~~gGD~~~~p---~n~~~------~~~~~~~~~ 419 (423) T protein:vir:81 382 MTINEVRAM-DNLPSIDGGDDLARP---LNTEF------GDSEDAPGE 419 (423) T ss_pred cCHHHHHHH-hCCCCCCCcceeecc---ccccc------CccCCCCCC Confidence 999886554 35433 111111111 01000 111112223 No 199 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=96.55 E-value=0.00052 Score=38.58 Aligned_cols=439 Identities=11% Similarity=0.102 Sum_probs=196.4 Q ss_pred CChhHHHHHHHH----H----Hhh---cccccch--hhhhccc-----c----------ccCC---HHHHHHHHHHHHHh Q lcl|NC_012753. 1 MGIIQTIKNFIK----R----SNY---VITNQSL--NSITDHP-----K----------IAIS---PEEYNRIMDNLRYF 49 (502) Q Consensus 1 m~~~~~ik~~i~----~----~~~---~~~~~~l--~~i~~~~-----~----------~~~~---~~~~~~i~~~~~~Y 49 (502) .++.+-++.|.+ + +.. ++.+-.. .....+. . +.+. .....-|+.++.+. T Consensus 2 ~~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~P~~~dGa~~i~~~~~~~~~~~gg~~~~~~~~e~~~~~~~eLI~~YR~ma 81 (521) T protein:vir:81 2 FSRLKMLARWADFDNDKYEEQIKDKAESIAAPKNNDGATEVEINDNLPASAWNSLTQQFYSTDQKISTTKQLVNTYRGLM 81 (521) T ss_pred cchhhhhHhhcCchhhhHHhhhccCccccccCCCCCCceEecccCCCcceeecceeeeecccccchhhHHHHHHHHHHHh Confidence 344444444422 1 000 0000000 0000000 0 0000 12223344444443 Q ss_pred cCCCCccccccCCCccccccceecchHHHHHHHHh-hhhhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHH Q lcl|NC_012753. 50 AGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVA-SLVFNEQATIRVDNE--------VADAFINETLKNDKFSKNFER 120 (502) Q Consensus 50 ~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a-~~l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~ 120 (502) .. +-+. +--..||+... .=-..+|+++.+++. ...+..+.+++--+|+++..+ T Consensus 82 ~~--pEvd----------------~Av~eIVneaiv~d~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~ 143 (521) T protein:vir:81 82 NN--HEVE----------------NAVQNIVNDAIVFEEGHEVVSLNLEATGFSESVKERIHEEFKDLLNTIQFDRRGQD 143 (521) T ss_pred hc--cchh----------------hHHHHhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccchhhhH Confidence 22 1000 00011111111 001125666666542 245566777777799999999 Q ss_pred HHHHHhhcCCEEEEEEEeCC---c-eEEEEEcCCeEEEEEEcCCCeE-EEEEEEEEEEeeCCCceEEEEEEEEEEeCCeE Q lcl|NC_012753. 121 YLESCLALGGLAMRPYIDGD---Q-IRVSFVQATVFFPLQANTQDVS-SAAIVTKSTKTEGQKVKYYSLIEFHEWNKETY 195 (502) Q Consensus 121 ~~~~~~~~G~~~~~~~~d~~---~-~~i~~v~~~~~~Pi~~d~~~~~-~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~ 195 (502) .+....+-|..|++..+|++ + ..+..++|..+-++........ ...++ . -+.-+..|...+..| T Consensus 144 ~fR~WYVDgRi~fhkiid~~pk~GI~Elr~lDPr~i~~vr~i~k~~~~~~~v~-~----------~~~e~f~Y~~~~~~~ 212 (521) T protein:vir:81 144 MFRRWYVDSRIFFHKIIGKNPKDGIVELRQLDPRNLEYVREIITEDTPEGKIY-K----------ATKEYFIYTVGNSSY 212 (521) T ss_pred HHhhhhhcceEEEEEEEcCCccccceeeeeeCCcceeeeeeecccccCcccee-c----------ceeeeeeeecCCccc Confidence 99999999999999998742 3 5788899998887653221100 00000 0 011011111111112 Q ss_pred EEEEEEEecCCccccCceeecccc-ccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHH Q lcl|NC_012753. 196 TISNELYESESKTIIGQRVPLSTL-YEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYD 274 (502) Q Consensus 196 ~I~~~l~~~~~~~~lG~~v~l~~~-~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S 274 (502) ......|.. ..+-.+|-+.+ |. -+|+ .+.....=+|-|..+......|=-+-+ T Consensus 213 ~~~g~~~~~----~~~vkI~~dAI~y~-------hSGl---------------~d~~~~~i~syLhkAiKp~NQLkm~ED 266 (521) T protein:vir:81 213 CAGGQVFSP----NSRVKIPRSAITYA-------HSGL---------------MDCDDKYIIGYLHRAVKPANQLKLLED 266 (521) T ss_pred cccceeecC----Ccceeechhheeee-------eccc---------------eeCCCCeeeecchhhhHhHHhhHHHHh Confidence 111111111 01112222111 11 0111 000111112333333333322221111 Q ss_pred HHH--HHHhhccceee-ech---------H----Hhc---c--CCCCCCcccCccccccccchhhccccCCCCcccccee Q lcl|NC_012753. 275 EFM--WEVKMGQRRVA-VPT---------Q----MIK---T--EYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGIT 333 (502) Q Consensus 275 ~~~--~~~~~~~~~i~-v~~---------~----~l~---~--~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 333 (502) .++ +-.++-..||| |+- . +.. + +-+...++++.++-+..-...|..- --+|+.+.-|+ T Consensus 267 AlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEIt 345 (521) T protein:vir:81 267 AMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQ-RRDGKAITDVT 345 (521) T ss_pred hHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhccc-ccCCCccccee Confidence 111 11234444554 211 0 110 0 0122233333322222222222211 11333344466 Q ss_pred eeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccc-c--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 334 DLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGK-S--MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILE 410 (502) Q Consensus 334 ~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~-~--~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~ 410 (502) ++..--...+ ++-++.+.+.+....++|.+.++.+++ + ..-++||....-....-+.+++..|..-+.++++.-|. T Consensus 346 TLpGgqnlge-m~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLi 424 (521) T protein:vir:81 346 TLPGASGMSD-IDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLI 424 (521) T ss_pred ecccCCCCCh-HHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 6654322222 234666777888888888888854332 1 12345665555455567788888888888888887665 Q ss_pred HHHhhcccCCC-ccc-ccceEEEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHHHhcCCCCHHHHHHHHHH Q lcl|NC_012753. 411 LAKVYNLYTGE-IPT-MDEVSVDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAIEKTLNVTKEQAQEIYQK 479 (502) Q Consensus 411 ~~~~~~~~~~~-~~~-~~~i~v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l~~~~~~~deea~~el~r 479 (502) +-. ++... +.. ...+.++|...--..+..+++.+..-. ..+ ..|.+++.++..-.||+|.+++.+. T Consensus 425 lKg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~ 501 (521) T protein:vir:81 425 LKN---VITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQ 501 (521) T ss_pred hhc---CCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHH Confidence 432 22211 111 124778886654444444444433211 112 4589998778889999999999999 Q ss_pred HHHhhhcccCCCCCccccCC Q lcl|NC_012753. 480 INDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 480 i~~E~~~~~~~~~~~~~~~~ 499 (502) |++|....--..+..+..|| T Consensus 502 I~~E~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 502 IEEEANDPRFKQTPDEIEDF 521 (521) T ss_pred HHHHhhCCCCCCCcccccCC Confidence 99998776555555556666 No 200 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=96.52 E-value=0.00055 Score=38.46 Aligned_cols=372 Identities=10% Similarity=0.046 Sum_probs=157.9 Q ss_pred cchhhhhc--cccccCCHHHHHHHHHHHHHhcCCC--CccccccCCCccc-cccceecchHHHHHHHHhhhhhcCcceEe Q lcl|NC_012753. 21 QSLNSITD--HPKIAISPEEYNRIMDNLRYFAGDF--DSVTYRDSNGSQV-KRDFNHLPIGRTASKKVASLVFNEQATIR 95 (502) Q Consensus 21 ~~l~~i~~--~~~~~~~~~~~~~i~~~~~~Y~g~~--~~~~~~~~~~~~~-~~~~~~~n~~k~iv~~~a~~l~~ep~~i~ 95 (502) |.|.+... ......+ ...+.+-. ..+.... .+... .+..+...--..+|+..|+-+-+=|+. T Consensus 1 Mglf~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~-~~~~v~~~~al~~~~V~~~i~~Ia~~ia~l~~~-- 67 (384) T protein:vir:49 1 MPIFNITNLATESPPSN----------QDSFFDITDPEFLDALN-GSEWVSAETALKNSDLFSIISQLSNDLATAKIT-- 67 (384) T ss_pred CccccccccCccccccc----------chhhccccchhhccccc-CCceechhhhhccHHHHHHHHHHHHHHhhCcee-- Confidence 44332211 1111111 11111110 1111110 01111 011122222245566666666555544 Q ss_pred eCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEE Q lcl|NC_012753. 96 VDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTK 173 (502) Q Consensus 96 ~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~ 173 (502) +.++.....+.+-.......+-...++...+..|.+|+.+..|. |.+ .+..++|+.+-++..+.+... T Consensus 68 ~~~~~~~~l~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~g~~~~L~~l~~~~v~v~~~~~~~~~---------- 137 (384) T protein:vir:49 68 TSRKQLQGIVDNPSNNANRFNFYQSIFAQMLLGGEAFAYRWRNENGRDMKWEYLRPSQVSFNRLDNQNGL---------- 137 (384) T ss_pred eecchhhhhhhccCCCCCHHHHHHHHHHHhhhcCCeEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceE---------- Confidence 44444333333322222345555667778888999999888875 443 666677777655432222110 Q ss_pred eeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCc Q lcl|NC_012753. 174 TEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSP 253 (502) Q Consensus 174 ~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p 253 (502) +|+ + ...+ ...|..+.+ .. --+++++.+.. .+.. T Consensus 138 -------~y~---~-~~~~---------------~~~~~~~~~-------------~~---~eVih~~~~~~----~~~~ 171 (384) T protein:vir:49 138 -------YYN---I-TFDD---------------PRIPPKQHV-------------PQ---GDILHFRLLSV----DGGL 171 (384) T ss_pred -------EEE---E-EecC---------------ccccceeEe-------------cC---ccEEEecCCCC----CCce Confidence 110 0 0000 000111100 00 01234443221 1335 Q ss_pred CCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-cccccc-ccchhhccccCCCCccccc Q lcl|NC_012753. 254 LGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFE-TGHNVYEQFDSGDMDKGIG 331 (502) Q Consensus 254 ~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~ 331 (502) +|+|.+..+...++....+.....+-|..+...-.+ |+........... ...... ...+....+-. +.+.- T Consensus 172 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~i----l~~~~~~~~~~~~~~~~~~~~~~~n~~~~~vl---~~g~~ 244 (384) T protein:vir:49 172 TSVSPLMALGRELNIQKASDKLTLNALKNALNANGI----LKIKGGGLLDFKTKQSRSRQAMKQMQGGPLVL---DDLED 244 (384) T ss_pred eeccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCCChHHHHHHHHHHHhcccCCccceec---CCCce Confidence 688988777777765444443333445544332121 2221111000000 000000 00000000011 11223 Q ss_pred eeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 332 ITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTY-QMRNSIATLVEKSLKELVISILE 410 (502) Q Consensus 332 i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~-~~~~~~~~~~~~~l~~l~~~il~ 410 (502) ++.++.....-++++..+...++|+...|+|+..+|....+..|+..++..+...+ ..+.-+...+...|..-+ . T Consensus 245 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~~~~~~~~~~~i~~~l~pi~~~i~~~l~~~l----~ 320 (384) T protein:vir:49 245 FTPLEIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQSSLEMIYNIYFKAVSRFLRPFVSELSKKLSCEV----D 320 (384) T ss_pred EEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh----h Confidence 55555556667788888889999999999999999976655556655543333222 122222222222222110 0 Q ss_pred HHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhhhccc Q lcl|NC_012753. 411 LAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEIYQKINDETMVST 488 (502) Q Consensus 411 ~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~el~ri~~E~~~~~ 488 (502) . +.....-.+..........++.+|++++-++++.+ .|+..+|+.+. +. . T Consensus 321 -------------~------~~~~~~~~~~~~~~~~~~~l~~~~~~t~~e~~~~l~~~g~~~ne~r~~------~~---~ 372 (384) T protein:vir:49 321 -------------A------DILPAVDPTGSNYIGLINSMVKTGTLAQNQGLYVLQQAEILPKDLPEG------ET---D 372 (384) T ss_pred -------------h------hhhhhhhccchHHHHHHHHHhhcCcccHHHHHHHHhhCCCCChhHHHH------cC---C Confidence 0 00001111222333445567778999998877654 36655544332 11 1 Q ss_pred CCCCCccccCCC Q lcl|NC_012753. 489 DSFRTSEEVDIY 500 (502) Q Consensus 489 ~~~~~~~~~~~~ 500 (502) +..++.+..+.| T Consensus 373 ~p~~gGd~~~~~ 384 (384) T protein:vir:49 373 STLKGGETNEQY 384 (384) T ss_pred CCCCCCCCCCCC Confidence 223334444444 No 201 >protein:vir:103330 Length: 517 # NCBI annotation: head portal-like protein # Family: family:all:481 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039666;genbank:gi:125999995;genbank:GeneID:4818406 Probab=96.42 E-value=0.00064 Score=38.09 Aligned_cols=424 Identities=13% Similarity=0.076 Sum_probs=165.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |-+ ++..+..++..++. .+..++-.....|+.+|.=-.+-+......+.. ..++--+.+...+ T Consensus 1 ~~~--------------~~~~e~~~l~~r~~-~Lk~~R~~~e~~w~e~~~~~lP~~~~~~~~~~~--~~~~~dstg~~a~ 63 (517) T protein:vir:10 1 MDM--------------RFAGNKSKIPKLYE-QLVGKRSPFLSRAENYSRFTLPYLMADVNDDLS--SQNAWQDDGASAT 63 (517) T ss_pred Ccc--------------cccccHHHHHHHHH-HHHHhhhHHHHHHHHHHHHhccccccCCCCCcc--ccccccchHHHHH Confidence 222 22222222222111 122233333344444432221211111111111 1223234566777 Q ss_pred HHHhhhhhcC--cc-----eEeeCCHH-------------HHHH-------HHHHHhhccHHHHHHHHHHHHhhcCCEEE Q lcl|NC_012753. 81 KKVASLVFNE--QA-----TIRVDNEV-------------ADAF-------INETLKNDKFSKNFERYLESCLALGGLAM 133 (502) Q Consensus 81 ~~~a~~l~~e--p~-----~i~~~d~~-------------~~e~-------l~~~~~~~~f~~~~~~~~~~~~~~G~~~~ 133 (502) +.+|+-|.+- || ++.+.++. +.++ +...+..++|...+.++..+....|.+++ T Consensus 64 ~~LAa~l~~~ltpp~~~WF~l~~~~~~l~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 143 (517) T protein:vir:10 64 NFLSNKLSQVLFPAQRSFFRIDLTPEGIKQLDNEAMTQSTAQKLLSDVEKAAMLYGESLQFRPAVVEAFKHLIVTGNVMM 143 (517) T ss_pred HHHHHHHHHhhcCCCCccccccCCHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE Confidence 7777766652 22 13333321 2333 33456677999999999999999998754 Q ss_pred EEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee---------------------CCCceEEEEEEEEEEeC Q lcl|NC_012753. 134 RPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE---------------------GQKVKYYSLIEFHEWNK 192 (502) Q Consensus 134 ~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~---------------------~~~~~~yt~~E~h~~~~ 192 (502) |.+++...+..+|-.+++- ..|..+....+|.+...... ++....|+++++. .+ T Consensus 144 --y~~~~~~~~~~~pl~~y~v-~~d~~G~v~~ivrr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~--~~ 218 (517) T protein:vir:10 144 --YHPDKTSPIQAVPLHHYCV-RRDNNGTVLDIVFLQEKALETFEPSIRMAIQASRKGKQYKDKDNVKLYTHAKRT--KD 218 (517) T ss_pred --EEeCCCCcEEEEEcCeEEE-eeCCCcCeEEEEeeeeccHHHHHHHhhhhcchhhhhhccCCcCceEEEEEEEEe--CC Confidence 6676655677777777554 45555444444433221100 1111233333221 11 Q ss_pred CeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHH Q lcl|NC_012753. 193 ETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTT 272 (502) Q Consensus 193 ~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~ 272 (502) +.+.+ |..-++..+|. +..+. ...-||++++- + ...++.||+|--..+.+-+..|+.. T Consensus 219 ~~~~~----~~~~d~~~~~~-----------~s~y~---~~e~P~~~~Rw---~-~~~ge~YGrgp~~~~L~D~k~L~~l 276 (517) T protein:vir:10 219 GKYLI----RQSADDVPVGK-----------ESTVT---EDKSPFLILTW---K-RSYGEDYGRGMAEDHAGAFFVIQFL 276 (517) T ss_pred CceEE----EEEeCceeecc-----------ccccc---cccCCeeeeee---e-ecCCCCcccchHHHhHHHHHHHHHH Confidence 11111 11111111110 00010 12334554442 2 2346789999999999999999976 Q ss_pred HHHHHH-HHhhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec--cccchHHHHHHHH Q lcl|NC_012753. 273 YDEFMW-EVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT--TDIRSDDYIKAIN 349 (502) Q Consensus 273 ~S~~~~-~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~l~ 349 (502) --.... .....+..+.||.+.... ...+.+. ....+.. .. ...+..++ .-.......+.++ T Consensus 277 ~~~~~~~~~~a~~~~~lv~~~~~~~-----~~~l~~~-----~~g~~~~---g~---~~~v~~~~~~~~~d~~~~~~~i~ 340 (517) T protein:vir:10 277 SEALARGMALMADVKYLVKPGSYTD-----INQFVEG-----GSGAVLH---GV---EGDIHIVQLGKYADYTPIQAVLN 340 (517) T ss_pred HHHHHHHHHHhccCCcccCcccccc-----hhhccCC-----Ccccccc---CC---cccceeeecccccchhHHHHHHH Confidence 544443 334555556665443221 1111111 0011110 01 11122221 1111222223333 Q ss_pred HHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc Q lcl|NC_012753. 350 KGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS----IATLVEKSLKELVISILELAKVYNLYTGEIPTM 425 (502) Q Consensus 350 ~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~----~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~ 425 (502) .+.+.|....=+.. +....+...|||||....+...+..+- ++.+|-. .|++-++.... ....+. T Consensus 341 ~~~~rI~~af~~~~--l~~~~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~---Pli~r~~~~l~--~~l~~~---- 409 (517) T protein:vir:10 341 DYRQRIGRVFMMEA--MTRRDAERVTAYEIQRDAMLVEQSLGGVYSLFATTFQG---PLARWFMNGIS--SILTSK---- 409 (517) T ss_pred HHHHHHHHHHhhhh--hhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHH---HHHHHHHHHhh--hhcCCC---- Confidence 33333322221111 222223346999999887776665554 3444333 33333332221 111111 Q ss_pred cceEEEeCCCccCCHH---HHHHHHHH---HHh--cC-------CCCHHHHH---HhcCCC------CHHHHHHHHHHHH Q lcl|NC_012753. 426 DEVSVDLDDGVFTDRN---AEFDYWSK---MVA--AG-------FAPKTMAI---EKTLNV------TKEQAQEIYQKIN 481 (502) Q Consensus 426 ~~i~v~f~d~i~~d~~---~~~~~~~~---~~~--~G-------i~S~et~l---~~~~~~------~deea~~el~ri~ 481 (502) .+.++.--. .+.. ..++.+.+ .++ +. .+-...++ ....|+ +++|++++.+... T Consensus 410 -~v~~~~~s~--la~l~r~~~~~~i~~~~~~i~~~a~~~~~~~~~id~d~~~~~~a~~~Gvp~~~irs~~ev~~~~~~~~ 486 (517) T protein:vir:10 410 -NVSPTILTG--IEALGRMAELDKLGTFNGYVSMTAQWPEPLQQAIKWPDFTDWVQGQISANFPFFKTQDELNAEAQAQQ 486 (517) T ss_pred -Cccceeecc--HHHHHHHHHHHHHHHHHHHHHHhhcCChHHHhcCCHHHHHHHHHHHhCCChhhcCCHHHHHHHHHHHH Confidence 122221111 1111 11111111 111 00 01112222 223343 3456555544332 Q ss_pred Hhhhc--------------ccCCCCCccccC Q lcl|NC_012753. 482 DETMV--------------STDSFRTSEEVD 498 (502) Q Consensus 482 ~E~~~--------------~~~~~~~~~~~~ 498 (502) ++++. ......++.++- T Consensus 487 ~~~~~~~~~~~ag~~~~~~~~~~~~~~~~~~ 517 (517) T protein:vir:10 487 EQEATKYAAEQAGKAIPDMVKNGQINPQGGQ 517 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCCCCCCCC Confidence 22211 011122222222 No 202 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=96.42 E-value=0.00064 Score=38.09 Aligned_cols=397 Identities=12% Similarity=0.114 Sum_probs=164.1 Q ss_pred CC-----hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecc Q lcl|NC_012753. 1 MG-----IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLP 74 (502) Q Consensus 1 m~-----~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n 74 (502) |+ ++++++.-+.+ +... .+..+... + | ..+.|.... .+.... +.-+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~--~~g~-----------~~s~~~~~---~--~-~~~~~~~~~------~g~~v~~~~al~~~ 55 (437) T protein:vir:10 1 MKQGKQRALGRIKSSFLK--WLGV-----------PISLTDGS---F--W-SAWGGMGSS------SGETVTADSALQLS 55 (437) T ss_pred CCcchhhhhhhhHHhhhh--hcCC-----------cccCCchh---H--H-HhhcccccC------CCceechHhhhccH Confidence 55 22222222111 1001 11111100 0 1 122221110 011100 1112222 Q ss_pred hHHHHHHHHhhhhhcCcceE-eeC--CH---HHHHHHHHHHhh-----ccHHHHHHHHHHHHhhcCCEEEEEEEeCCce- Q lcl|NC_012753. 75 IGRTASKKVASLVFNEQATI-RVD--NE---VADAFINETLKN-----DKFSKNFERYLESCLALGGLAMRPYIDGDQI- 142 (502) Q Consensus 75 ~~k~iv~~~a~~l~~ep~~i-~~~--d~---~~~e~l~~~~~~-----~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~- 142 (502) --..+|+..|+-+.+=|..+ ..+ +. .....+..+|.. -....-...++..++..|.+|+.+..+.|++ T Consensus 56 ~v~~ci~~Ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~g~~~ 135 (437) T protein:vir:10 56 AVWSCVRLIAETIATLPLNLYQTKPDGTRVLAKQHRLYTVIHSQPNAENTAAEFWEVIVASMLLWGNGYARKLRSAGVLI 135 (437) T ss_pred HHHHHHHHHHHHHhhCceeEEEEcCCCceeeccccHHHHHhhccCCcCCCHHHHHHHHHHHHhhcCCeEEEEEecCCcEE Confidence 22335555555555445543 111 10 112223333321 1334445556777788999998888777664 Q ss_pred EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccC Q lcl|NC_012753. 143 RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYED 222 (502) Q Consensus 143 ~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~ 222 (502) .+..++|+.+-+...+++.. +|+ +... + |....+ T Consensus 136 ~L~~l~p~~v~i~~~~~g~~------------------~y~---~~~~-~------------------g~~~~~------ 169 (437) T protein:vir:10 136 GLELMLPQRTTVKRLTSGAL------------------QYT---YRNV-D------------------GTVSTL------ 169 (437) T ss_pred EEEEEcCcceEEEECCCCeE------------------EEE---EEec-C------------------ceEEEE------ Confidence 46667777665542221110 110 0000 0 100000 Q ss_pred CCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc-eeeechHHhccCCCCC Q lcl|NC_012753. 223 LEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQMIKTEYDTN 301 (502) Q Consensus 223 l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~~l~~~~~~~ 301 (502) ... -.++|+.+. .+...|+|.+.-+...+.....+-....+-|..+.. .-+ |.....-+ T Consensus 170 -~~~---------dIih~r~~~-----~d~~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi-----l~~~~~l~ 229 (437) T protein:vir:10 170 -AED---------DVFHVRGFS-----LDGLMGLTPIQYAREVLGNSTAANKTSASVFRNGLRPSGV-----LSTDQILQ 229 (437) T ss_pred -ccc---------cEEEecCcC-----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE-----EEcCCCCC Confidence 000 023344321 124679998877777766555444333444554433 222 22211111 Q ss_pred CcccCccccccccc-hhhcccc----CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccH Q lcl|NC_012753. 302 GEKVTVKREFETGH-NVYEQFD----SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTA 376 (502) Q Consensus 302 g~~~~~~~~~~~~~-~~~~~~~----~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tA 376 (502) ..... .+.... ..+.... ..--+.+..++.++.....-++.+..+....+|+...|++|..+|+...+..++ T Consensus 230 ~e~~~---~~~~~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~ 306 (437) T protein:vir:10 230 KEKRA---EIRTDLAEQFGGAMQAGKTMVLEAGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWG 306 (437) T ss_pred HHHHH---HHHHHHHHHhcCccccCcceeccCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccc Confidence 10000 000000 0111100 000012223555665666667888888889999999999999998765543322 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_012753. 377 TEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFA 456 (502) Q Consensus 377 tei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~ 456 (502) ..+...... .++.+|..++..|-...+. .++.........+.++++.-+..|..+.++...+++.+|+| T Consensus 307 sn~e~~~~~----------f~~~tl~P~~~~ie~~l~~-kll~~~e~~~~~~~fd~~~ll~~d~~~r~~~~~~~~~~G~~ 375 (437) T protein:vir:10 307 TGIEQQTLG----------FLTFTLRPWLTRIEQAARR-SLLRPGERDQFYAEFSVEGLLRADSAGRAAFYSTMTQNGLM 375 (437) T ss_pred chHHHHHHH----------HHHHHHHHHHHHHHHHHHh-hccCccccCceEEEEechhhhccCHHHHHHHHHHHHhCCCc Confidence 222211111 1233333333322221111 11111111122355555666777889999999999999999 Q ss_pred CHHHHHHhc--CCCCHH-H---HHHHHHHHHHhhhc-ccCCCCCccccCCCCC Q lcl|NC_012753. 457 PKTMAIEKT--LNVTKE-Q---AQEIYQKINDETMV-STDSFRTSEEVDIYGE 502 (502) Q Consensus 457 S~et~l~~~--~~~~de-e---a~~el~ri~~E~~~-~~~~~~~~~~~~~~g~ 502 (502) +.-++++.+ +++... + +..-+..+..-... ....-.+...++..|| T Consensus 376 T~NE~R~~~gl~pi~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (437) T protein:vir:10 376 TRDECRAKENLPPMGGNAAVLTVQSALLPIDKLGEHTTATAAQDALKAWLYQE 428 (437) T ss_pred CHHHHHHHhCCCCCCCCcceEeecCcccchhhccCcCCCcchhccccccCCCC Confidence 999876653 233221 1 01011112111000 0011111222333444 No 203 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=96.36 E-value=0.0007 Score=37.88 Aligned_cols=430 Identities=12% Similarity=0.140 Sum_probs=190.2 Q ss_pred CChhHH-HHHHHHHHhhcccccchhhhhccccccC-CHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHH Q lcl|NC_012753. 1 MGIIQT-IKNFIKRSNYVITNQSLNSITDHPKIAI-SPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRT 78 (502) Q Consensus 1 m~~~~~-ik~~i~~~~~~~~~~~l~~i~~~~~~~~-~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~ 78 (502) -..++. .....-.++ ......+. ...+ +.+.+.+|+.-...++-+... .. T Consensus 32 a~~i~~~~~~~~~~g~------~~~~~~~~-~~~~~~~eLI~~YR~ma~~pEvd~Av---------------------~e 83 (511) T protein:vir:56 32 AKEIHTNLLAPQLGHA------IIPSDAQS-EGTIPVKELIKSYRALAEYHEVDDAI---------------------QE 83 (511) T ss_pred ceEEecccccceecce------eccccccc-cCccchHHHHHHHHHHhhccchhhHH---------------------HH Confidence 000000 000000000 00000000 1111 123333333333322221100 11 Q ss_pred HHHHHh-hhhhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEEE Q lcl|NC_012753. 79 ASKKVA-SLVFNEQATIRVDNE--------VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSFV 147 (502) Q Consensus 79 iv~~~a-~~l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~v 147 (502) ||+... .=-..+|+.+.+++. ...+..+.+++--+|+++..+.+....+-|..|++...|+. + ..+.++ T Consensus 84 Ivne~iv~d~~~~pV~l~ld~~~~s~~iK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~k~GI~eLr~l 163 (511) T protein:vir:56 84 IVDEAIVYENDKEVVWLNLDNTDFSENIKAKINEEFDRVVSLLQMRKHGYKWFRKWYVDSRIYFHKILDKDNNIIELRPL 163 (511) T ss_pred hhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhhhhcceEEEEEEeccccceeehhhc Confidence 111111 001124556666542 35566777887779999999999999999999999999863 4 468888 Q ss_pred cCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEE-EEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcc Q lcl|NC_012753. 148 QATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYS-LIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEET 226 (502) Q Consensus 148 ~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt-~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~ 226 (502) +|..+-+|..--......+ ..++ ..|++..+...+......+.. +...-+-.+|.++ T Consensus 164 DPr~i~~vr~i~~~~~~~~-------------~v~~~~~ey~~Y~~~~~~~~~~~~~~-~~~~~~vkI~~da-------- 221 (511) T protein:vir:56 164 NPMKMELVREIQKETIDGV-------------EVVKGTLEYYVYKQSDYKMPSWMSAT-NRAQTSFRIPKDA-------- 221 (511) T ss_pred Ccccchhhhhhhccccccc-------------ccccceeeeeEecCCCcccCcccccc-cccccceeechhh-------- Confidence 8888766532100000000 0000 022222111111000000000 0000011111111 Q ss_pred eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH--HHHhhccceee-ech---------HHh Q lcl|NC_012753. 227 VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT---------QMI 294 (502) Q Consensus 227 ~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~---------~~l 294 (502) +++...+- +- . ....+..+|-|..+......|=-+-+.++ +-.++-..||| |+- .+| T Consensus 222 I~y~hSGL-----~d----~--~~~~g~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl 290 (511) T protein:vir:56 222 IVFAHSGL-----MR----G--CADDPYIIGYLDRAIKPANQLKMLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYV 290 (511) T ss_pred eeeecccc-----ee----c--cCCCCeeeccchhhhHHHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHH Confidence 11110000 00 0 01123345555554444443332222222 11234444554 211 011 Q ss_pred ----c-----cCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh Q lcl|NC_012753. 295 ----K-----TEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM 365 (502) Q Consensus 295 ----~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~ 365 (502) . .+-+...++++.++-+..-...|..- --+|+.+.-|+++..--...+ ++-++.+.+.+....++|.+. T Consensus 291 ~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEItTLpGgqnlge-m~DV~YF~kKLy~aLnVP~SR 368 (511) T protein:vir:56 291 NGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLP-RREGSKGTEVSTLPGGQSLGD-IEDVLYFNRKLYKAMRIPTSR 368 (511) T ss_pred HHHHHhcCceEEEeccCceeccchhhhhhHhhhccc-ccCCCCccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCccc Confidence 0 01122223333222222222222211 113333344666554322222 244666777888888888777 Q ss_pred cccccc--c--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceEEEeCCCccCC Q lcl|NC_012753. 366 FSFDGK--S--MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVSVDLDDGVFTD 439 (502) Q Consensus 366 ~~~~~~--~--~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~v~f~d~i~~d 439 (502) +..+++ + ..-++||....-....-+.+++..|..-+.++++.-|.+-. ++...-+.. ..+.++|...--.. T Consensus 369 l~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKg---iit~eeW~~i~~~I~~~f~~Dn~f~ 445 (511) T protein:vir:56 369 AASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKHQLIVNN---IITEEEWDANHEKLYVVFNQDSYFE 445 (511) T ss_pred ccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---CCCHHHHHHHhhcceEEeeecchHH Confidence 764321 1 11255665555555667888888888888888887665432 222211111 35778886654444 Q ss_pred HHHHHHHHHHHH-----hcC----CCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 440 RNAEFDYWSKMV-----AAG----FAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 440 ~~~~~~~~~~~~-----~~G----i~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) +..+++.+..-. ..+ ..|.+++.++....||+|.+++.+.|++|.... -+.. ++-|| T Consensus 446 ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~I~~E~k~~--~~~~-~e~~f 511 (511) T protein:vir:56 446 EAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSEIDEEETNP--RFQQ-DDQGF 511 (511) T ss_pred HHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHHHHHhhcCC--CCCC-cccCC Confidence 444444433211 113 459999888888999999999999999998763 2222 33444 No 204 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=96.30 E-value=0.00076 Score=37.67 Aligned_cols=396 Identities=12% Similarity=0.054 Sum_probs=156.0 Q ss_pred CCh----hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MGI----IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~~----~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |.+ ++.+ .|.-..+- ++-..|..+..++. +-.... =++ T Consensus 92 ~~~~~~~~~~l-~~~~~~~F-~Gy~~la~laQ~~e------yr~~~~------------------------------~ia 133 (695) T protein:vir:78 92 LDFNGTSMDAL-SFVTSSGF-PGFPTLVLLAQLPE------YRAMHE------------------------------VLA 133 (695) T ss_pred hcccccccccc-hhhhccCc-chHHHHHHHhhccc------hhhHHH------------------------------HHH Confidence 332 0000 01100000 00011111111100 000000 011 Q ss_pred HHHHHHHhhhhhcC-----------cceEee-CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc--- Q lcl|NC_012753. 77 RTASKKVASLVFNE-----------QATIRV-DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQ--- 141 (502) Q Consensus 77 k~iv~~~a~~l~~e-----------p~~i~~-~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~--- 141 (502) +..+++|-....+. ...+.. ++.+..+.|+.-++.-+....++++++++-.+|+++..+-.+++. T Consensus 134 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l 213 (695) T protein:vir:78 134 DECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM 213 (695) T ss_pred HHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCcccc Confidence 11111111111000 001111 233455677777777889999999999999999998776664321 Q ss_pred --e---E-----------EEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecC Q lcl|NC_012753. 142 --I---R-----------VSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESE 205 (502) Q Consensus 142 --~---~-----------i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~ 205 (502) | + +..++|-.+.|-..+......-- || .-++ |+| T Consensus 214 ~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spd--------------fg-kP~~-------y~V-------- 263 (695) T protein:vir:78 214 DTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADD--------------FY-KPST-------WWM-------- 263 (695) T ss_pred ccccccccccccCcceeeeEeecccccccchhhhccchhhc--------------cC-CCce-------EEE-------- Confidence 1 1 22222222222111100000000 00 0011 111 Q ss_pred CccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012753. 206 SKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR 285 (502) Q Consensus 206 ~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~ 285 (502) .|.+|-.+ .+ ..+.+.+.| -.+|+. ...+|+|....+.+-+++.+++......=+.- . T Consensus 264 ----~G~kIH~S----RL---~~f~g~plP--d~LKp~-------y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~--~ 321 (695) T protein:vir:78 264 ----IGTEVHAT----RL---HTIVSRPVG--DMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ--F 321 (695) T ss_pred ----eceEEeee----eE---EEecCCCch--hhhhcc-------cccCcccHHHHHHHHHHHHHHHHhHHHHHHHh--h Confidence 11111100 00 111111111 112221 23579999999999999888765444432211 1 Q ss_pred eeee-chHHhccCCCCCCcccCccccccccchhhcccc---CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCC Q lcl|NC_012753. 286 RVAV-PTQMIKTEYDTNGEKVTVKREFETGHNVYEQFD---SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGV 361 (502) Q Consensus 286 ~i~v-~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 361 (502) ++-+ -.+|.....+ |........+.. .+.|+... .-|.+ ..-++.+ ++...-....+....++++..+++ T Consensus 322 ~v~~lk~dla~~L~~--g~~~~l~~R~el-i~~~Rsn~G~~llDk~-~Eefeq~--stslSGLddVi~qf~q~VAgaa~I 395 (695) T protein:vir:78 322 SVSGILMDLAQALMP--GANVDLSMRAEL-INRYRDNRNILFLDKA-TEEFFQF--NTPLSGLDALQAQAQEQMSAVSHI 395 (695) T ss_pred hhHHHHHHHHHhhcC--hhHHHHHHHHHH-HHHhcCccceEEEecC-CcceEEE--ecccCCHHHHHHHHHHHHHhhhcC Confidence 1111 1111111111 111110000000 01111111 11111 1223333 344555667777888888888899 Q ss_pred Chhh-cccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCC Q lcl|NC_012753. 362 STGM-FSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTD 439 (502) Q Consensus 362 s~~~-~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d 439 (502) +... ||.+..|. +|+..=...|-+..... .++.++..|++++.+|..-. + |.. +.++++.|+---..+ T Consensus 396 PltkLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~ii~rS~-----~-G~i--dpdi~~~fnPL~qmt 465 (695) T protein:vir:78 396 PLIKLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVIVMIQLSL-----F-GAV--DPSIKWQWNALRELD 465 (695) T ss_pred chhhhhccCCccccccchhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHh-----c-CCC--CCcceEEeCCCCCcC Confidence 8654 78887775 67764333444433322 36778999999887765321 1 222 236889998655555 Q ss_pred HHHHHH-------HHHHHHhcCCCCHHHHHHhcC-----CCCH-HHHHHH--H---HHHHHhhhcccCCCCCccccCCCC Q lcl|NC_012753. 440 RNAEFD-------YWSKMVAAGFAPKTMAIEKTL-----NVTK-EQAQEI--Y---QKINDETMVSTDSFRTSEEVDIYG 501 (502) Q Consensus 440 ~~~~~~-------~~~~~~~~Gi~S~et~l~~~~-----~~~d-eea~~e--l---~ri~~E~~~~~~~~~~~~~~~~~g 501 (502) +.+.++ ....++.+|+|+..+...++- +... .+++.+ + ..|.......++......-++..| T Consensus 466 d~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (695) T protein:vir:78 466 DLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG 545 (695) T ss_pred HHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCC Confidence 544443 334555678888776544421 1100 000000 0 000000000000000000111111 Q ss_pred C Q lcl|NC_012753. 502 E 502 (502) Q Consensus 502 ~ 502 (502) - T Consensus 546 ~ 546 (695) T protein:vir:78 546 A 546 (695) T ss_pred C Confidence 1 No 205 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=96.28 E-value=0.00079 Score=37.60 Aligned_cols=382 Identities=13% Similarity=0.080 Sum_probs=160.7 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) =++++++|+|+++.-....... ..+. . ++ +..+..|.. .. .++-+.+.--...| T Consensus 14 ~g~~~~~~~~f~~~~~~~~~~~--~~~~--~--~~---------~~~~~~~~~-------v~----~~~al~~~~v~~cv 67 (424) T protein:vir:18 14 NGWWARLKSWFVGGRLVTPNQG--SQTG--P--VS---------AHGYLGDSS-------IN----DERILQISTVWRCV 67 (424) T ss_pred CchHHHHHhhccccccccccch--hhcc--c--cc---------ccccccccc-------cc----HHHhhccHHHHHHH Confidence 5677888888754211100000 0000 0 00 001111110 00 01112222223456 Q ss_pred HHHhhhhhcCcceE-eeC--CH--H--HHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATI-RVD--NE--V--ADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSF 146 (502) Q Consensus 81 ~~~a~~l~~ep~~i-~~~--d~--~--~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~ 146 (502) +..|+-+-+=|+.+ ..+ +. . .+.-+..+|.. | ....-...++...+..|.+|+.+..+.+ . +.+.. T Consensus 68 ~~Ia~~iA~lp~~vy~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~~ 147 (424) T protein:vir:18 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) T ss_pred HHHHHhhccCceEEEEeccCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 66666665555543 111 10 1 11223344432 1 2223344556678889999988877653 3 35666 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcc Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEET 226 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~ 226 (502) ++|..+-+. .+.+.. .|+ +..+ |..+.+ + +. T Consensus 148 l~~~~v~v~-~~~~~~------------------~y~----~~~~-------------------g~~~~~---~---~~- 178 (424) T protein:vir:18 148 LQSANMDVK-LVGKKV------------------VYR----YQRD-------------------SEYADF---S---QK- 178 (424) T ss_pred ecCcceEEE-EcCCeE------------------EEE----EEeC-------------------CeEEEe---c---cc- Confidence 677665542 222111 110 0000 000000 0 00 Q ss_pred eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH-HHHhhcc-ceeeechHHhccCCCCCCcc Q lcl|NC_012753. 227 VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM-WEVKMGQ-RRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 227 ~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~-~~~~~~~-~~i~v~~~~l~~~~~~~g~~ 304 (502) -.++++.+. .+...|+|.+.-+...|..... ..++. +-|..+. ..-+ |.........+ T Consensus 179 ---------eVihir~~~-----~dg~~G~spi~~~~~~i~~~~~-~~~~~~~~f~ng~~~~gi-----l~~~~~~l~~e 238 (424) T protein:vir:18 179 ---------EIFHLKGFG-----FTGLVGLSPIAFACKSAGVAVA-MEDQQRDFFANGAKSPQI-----LSTGEKVLTEQ 238 (424) T ss_pred ---------cEEEecCcC-----CCCcccccHHHHHHHHHHHHHH-HHHHHHHHHhccCCcceE-----EEeCCcCCCHH Confidence 013343221 1235688887776666554332 23333 3344433 2222 22111100000 Q ss_pred cCccccccccchhhccccCCCC-------ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHH Q lcl|NC_012753. 305 VTVKREFETGHNVYEQFDSGDM-------DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT 377 (502) Q Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~-------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt 377 (502) ... .-...+........ +.+.-++.++.....-++.+..+....+|+...|++|..+|+...+..++. T Consensus 239 ---~~~--~~~~~~~~~~~~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~s 313 (424) T protein:vir:18 239 ---QRS--QVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGS 313 (424) T ss_pred ---HHH--HHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCcccccc Confidence 000 00000011110000 112235555555556678888888889999999999999987665544332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 378 EVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 378 ei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) .+....... ++..|..++..|-.-.+. .++...-.....+.|+++.-+..|..+.++...+++.+|+|+ T Consensus 314 n~eq~~~~f----------~~~tl~P~~~~ie~~ln~-~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T 382 (424) T protein:vir:18 314 GIEQQNLGF----------LQYTLQPYISRWENSIQR-WLIPSKDVGRLHAEHNLDGLLRGDSASRAAFMKAMGESGLRT 382 (424) T ss_pred cHHHHHHHH----------HHHHHHHHHHHHHHHHHh-hcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcC Confidence 222111111 122333333322221111 111111112234666666667889999999999999999999 Q ss_pred HHHHHHhcCCCCH-HHHHH-----HHHHHHHhhhcccCCCCCcccc Q lcl|NC_012753. 458 KTMAIEKTLNVTK-EQAQE-----IYQKINDETMVSTDSFRTSEEV 497 (502) Q Consensus 458 ~et~l~~~~~~~d-eea~~-----el~ri~~E~~~~~~~~~~~~~~ 497 (502) .-++++. .|++. +..++ -+..+..-.....+. +. ++ T Consensus 383 ~NE~R~~-~gl~pi~ggD~~~~~~n~~~l~~~~~~~~~~--~n-~a 424 (424) T protein:vir:18 383 INEMRRT-DNMPPLPGGDVAMRQAQYVPITDLGTNKEPR--NN-GA 424 (424) T ss_pred HHHHHHH-hCCCCCCCcCeeeeccCccchhhhhccCCcc--cc-CC Confidence 9987654 34432 10111 111111100000111 11 11 No 206 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=96.20 E-value=0.00088 Score=37.33 Aligned_cols=396 Identities=12% Similarity=0.054 Sum_probs=155.7 Q ss_pred CCh----hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MGI----IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~~----~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |.+ .+.+ .|.-..+- ++-..|..+..++. +-.... =++ T Consensus 91 ~~~~~~~~~~l-~~~~~~~F-~Gy~~la~laQ~~e------yr~~~~------------------------------~ia 132 (694) T protein:vir:10 91 LDFNGTSMDAL-SFVTSSGF-PGFPTLVLLAQLPE------YRAMHE------------------------------VLA 132 (694) T ss_pred hccCcccccch-hhhhccCc-chHHHHHHHhhccc------hhhHHH------------------------------HHH Confidence 322 0000 11100000 00011111111100 000000 011 Q ss_pred HHHHHHHhhhhhcC-----------cceEee-CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc--- Q lcl|NC_012753. 77 RTASKKVASLVFNE-----------QATIRV-DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQ--- 141 (502) Q Consensus 77 k~iv~~~a~~l~~e-----------p~~i~~-~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~--- 141 (502) +..+++|-....+. ...+.. ++.+..+.|+.-++.-+....++++++++-.+|+++..+-.+++. T Consensus 133 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eaik~aRlfGGa~~~i~I~gdd~~l 212 (694) T protein:vir:10 133 DECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM 212 (694) T ss_pred HHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeecCcccc Confidence 11111111111000 011111 233455677777777889999999999999999998766664321 Q ss_pred --e---E-----------EEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecC Q lcl|NC_012753. 142 --I---R-----------VSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESE 205 (502) Q Consensus 142 --~---~-----------i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~ 205 (502) | + +..++|-.+.|-..+......-- || .-++ |+| T Consensus 213 ~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spd--------------fg-kP~~-------y~V-------- 262 (694) T protein:vir:10 213 DTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADD--------------FY-KPST-------WWM-------- 262 (694) T ss_pred ccccccccccccCcceeeeEeecccccccchhhhccchhhc--------------cC-CCce-------EEE-------- Confidence 1 1 22222222222111100000000 00 0011 111 Q ss_pred CccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012753. 206 SKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR 285 (502) Q Consensus 206 ~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~ 285 (502) .|.+|-.+ .+ ..+.+.+.| -.+|+. ...+|+|....+.+-+++.+++......=+.- . T Consensus 263 ----~G~~IH~S----RL---~~f~g~plP--d~LKp~-------y~~~G~Sv~q~~~e~V~~~~rT~~~v~~Li~~--~ 320 (694) T protein:vir:10 263 ----IGTEVHAT----RL---HTIVSRPVG--DMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ--F 320 (694) T ss_pred ----eceEEeee----eE---EEecCCCch--hhhhcc-------cccCcccHHHHHHHHHHHHHHHHhHHHHHHHh--h Confidence 11111100 00 111111111 112221 23579999999999999888665444432211 1 Q ss_pred eeee-chHHhccCCCCCCcccCccccccccchhhcccc---CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCC Q lcl|NC_012753. 286 RVAV-PTQMIKTEYDTNGEKVTVKREFETGHNVYEQFD---SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGV 361 (502) Q Consensus 286 ~i~v-~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 361 (502) ++-+ -.+|.....+ |........+.. .+.|+... .-|.+ ..-++.+ ++...-....+....++++..+|+ T Consensus 321 ~v~~lk~dla~~L~~--g~~~~l~~R~el-i~~~Rsn~G~~llDk~-~Eefeq~--stslSGLddVi~qf~q~VAgaa~I 394 (694) T protein:vir:10 321 SVSGILMDLAQALMP--GANVDLSMRAEL-INRYRDNRNILFLDKA-TEEFFQF--NTPLSGLDALQAQAQEQMSAVSHI 394 (694) T ss_pred hhHHHHHHHHHhhcC--hhHHHHHHHHHH-HHHhcCccceEEEecC-CcceEEE--ecccCCHHHHHHHHHHHHHhhhcC Confidence 1111 1111111111 111110000000 01111111 11111 1223333 344555667777888888888899 Q ss_pred Chhh-cccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCC Q lcl|NC_012753. 362 STGM-FSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTD 439 (502) Q Consensus 362 s~~~-~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d 439 (502) +... ||.+.+|. +|+..=...|-+..... .++.++..|++++.+|..-. + |.. +.++++.|+---..+ T Consensus 395 PltkLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~ii~rS~-----~-G~i--dp~i~~~fnPL~qmt 464 (694) T protein:vir:10 395 PLIKLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVIVMIQLSL-----F-GAV--DPSIKWQWNALRELD 464 (694) T ss_pred chhhhhccCcccccccchhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHh-----c-CCC--CCcceEEeCCCCCcC Confidence 8654 78887775 67764333444433322 36778999999887764321 1 222 236889998655555 Q ss_pred HHHHHH-------HHHHHHhcCCCCHHHHHHhcC-----CCCH-HHHHHH--H---HHHHHhhhcccCCCCCccccCCCC Q lcl|NC_012753. 440 RNAEFD-------YWSKMVAAGFAPKTMAIEKTL-----NVTK-EQAQEI--Y---QKINDETMVSTDSFRTSEEVDIYG 501 (502) Q Consensus 440 ~~~~~~-------~~~~~~~~Gi~S~et~l~~~~-----~~~d-eea~~e--l---~ri~~E~~~~~~~~~~~~~~~~~g 501 (502) +.+.++ ....++.+|+|+......++- +... .+++.+ + ..|.......++......-++..| T Consensus 465 d~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 544 (694) T protein:vir:10 465 DLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG 544 (694) T ss_pred HHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCc Confidence 544443 334555678888776544421 1100 000000 0 000000000000000011111111 Q ss_pred C Q lcl|NC_012753. 502 E 502 (502) Q Consensus 502 ~ 502 (502) - T Consensus 545 ~ 545 (694) T protein:vir:10 545 A 545 (694) T ss_pred c Confidence 1 No 207 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=96.18 E-value=0.00091 Score=37.27 Aligned_cols=396 Identities=12% Similarity=0.054 Sum_probs=155.5 Q ss_pred CCh----hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MGI----IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~~----~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |.+ ++.+ .|.-..+- ++-..|..+..++. +-.... =++ T Consensus 92 ~~~~~~~~~~l-~~~~~~~F-~Gy~~la~laQ~~e------yr~~~~------------------------------~ia 133 (695) T protein:vir:36 92 LDFNGTSMDAL-SFVTSSGF-PGFPTLVLLAQLPE------YRAMHE------------------------------VLA 133 (695) T ss_pred hcccccccccc-hhhhccCc-chHHHHHHHhhccc------hhhHHH------------------------------HHH Confidence 332 0000 01100000 00011111111100 000000 011 Q ss_pred HHHHHHHhhhhhcC-----------cceEee-CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc--- Q lcl|NC_012753. 77 RTASKKVASLVFNE-----------QATIRV-DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQ--- 141 (502) Q Consensus 77 k~iv~~~a~~l~~e-----------p~~i~~-~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~--- 141 (502) +..+++|-....+. ...+.- ++.+..+.|+.-++.-+....++++++++-.+|+++..+-.+++. T Consensus 134 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqik~L~~e~erL~V~~~l~eaik~aRlfGGa~~~i~i~gdd~~l 213 (695) T protein:vir:36 134 DECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM 213 (695) T ss_pred HHhhcccceecccchhhhhhccccccccccccCchHHHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEEeccCcccc Confidence 11111111100000 001111 223456677777777789999999999999999998776664321 Q ss_pred --e---E-----------EEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecC Q lcl|NC_012753. 142 --I---R-----------VSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESE 205 (502) Q Consensus 142 --~---~-----------i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~ 205 (502) | + +..++|-.+.|-..+......-- || .-++ |+| T Consensus 214 ~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spd--------------fg-kP~~-------y~V-------- 263 (695) T protein:vir:36 214 DTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADD--------------FY-KPST-------WWM-------- 263 (695) T ss_pred ccccccccccccCcceeeeEeecccccccchhhhccchhhc--------------cC-CCce-------EEE-------- Confidence 1 1 22222222222111100000000 00 0011 111 Q ss_pred CccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012753. 206 SKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR 285 (502) Q Consensus 206 ~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~ 285 (502) .|.+|-.+ .+ ..+.+.+.| -.+|+. ...+|+|....+.+-+++.+++......=+.- . T Consensus 264 ----~G~kIH~S----RL---~~f~g~plP--d~LKp~-------y~~~GiSv~q~~~e~V~~~~rT~~~v~~Li~~--~ 321 (695) T protein:vir:36 264 ----IGTEVHAT----RL---HTIVSRPVG--DMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVKQ--F 321 (695) T ss_pred ----eceEEeee----eE---EEecCCCch--hhhhcc-------cccCcccHHHHHHHHHHHHHHHHhHHHHHHHh--h Confidence 11111100 00 111111111 112221 23579999999999998888665444332210 1 Q ss_pred eeee-chHHhccCCCCCCcccCccccccccchhhcccc---CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCC Q lcl|NC_012753. 286 RVAV-PTQMIKTEYDTNGEKVTVKREFETGHNVYEQFD---SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGV 361 (502) Q Consensus 286 ~i~v-~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~ 361 (502) ++-+ -.+|..-..+ |........+.. .+.|+... .-|.+ ..-++.+ ++...-....+....++++..+|+ T Consensus 322 ~v~~lk~dla~aL~~--g~~~~l~~R~el-i~~~Rsn~G~~llDk~-~Eefeq~--stslSGLddVi~qf~q~VAgaa~I 395 (695) T protein:vir:36 322 SVSGILMDLAQALMP--GANVDLSMRAEL-INRYRDNRNILFLDKA-TEEFFQF--NTPLSGLDALQAQAQEQMSAVSHI 395 (695) T ss_pred hHHHHHHHHHHhhcC--hhHHHHHHHHHH-HHHhcCccceEEEecC-CcceEEE--ecccCCHHHHHHHHHHHHHhhhcC Confidence 1111 1111111111 111110000000 01111111 11111 1223333 344555667777888888888899 Q ss_pred Chhh-cccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCC Q lcl|NC_012753. 362 STGM-FSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTD 439 (502) Q Consensus 362 s~~~-~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d 439 (502) +... ||.+..|. +|+..=...|-+..... .++.++..|++++.+|..-. + |.. +.++++.|+---..+ T Consensus 396 PltkLfGqSPkGlNATGE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~ii~rS~-----~-G~i--dpdi~~~fnPL~qmt 465 (695) T protein:vir:36 396 PLIKLLGITPTGLNASSEGEIRVWYDYVRAY--QRNALQQLMNDVIVMIQLSL-----F-GAV--DPSIKWQWNALRELD 465 (695) T ss_pred chhhhhccCcccccccchhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHh-----c-CCC--CCcceEEeCCCCCcC Confidence 8654 78887775 67764333444433322 36778999999887764321 1 222 236889998655555 Q ss_pred HHHHHHH-------HHHHHhcCCCCHHHHHHhcC-----CCCH-HHHHHH--H---HHHHHhhhcccCCCCCccccCCCC Q lcl|NC_012753. 440 RNAEFDY-------WSKMVAAGFAPKTMAIEKTL-----NVTK-EQAQEI--Y---QKINDETMVSTDSFRTSEEVDIYG 501 (502) Q Consensus 440 ~~~~~~~-------~~~~~~~Gi~S~et~l~~~~-----~~~d-eea~~e--l---~ri~~E~~~~~~~~~~~~~~~~~g 501 (502) +.+.++. ...++.+|+|+..+...++- +... .+++.+ + ..|.......++......-++..| T Consensus 466 d~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 545 (695) T protein:vir:36 466 DLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYVQRLAEGGDTGAPGG 545 (695) T ss_pred HHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhhcCcccccccCCCCc Confidence 5444433 34555678877776544421 1100 000000 0 000000000000000011111111 Q ss_pred C Q lcl|NC_012753. 502 E 502 (502) Q Consensus 502 ~ 502 (502) - T Consensus 546 ~ 546 (695) T protein:vir:36 546 A 546 (695) T ss_pred c Confidence 1 No 208 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=96.18 E-value=0.00091 Score=37.26 Aligned_cols=416 Identities=12% Similarity=0.078 Sum_probs=191.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+.++ ..++.. ..+..+ .. .-.....-|+.++.+... +-+.. --..|| T Consensus 49 ~~~~~-------~~~~~q--~~y~~~----e~-~~~~~~eLI~~YR~ma~~--pEvd~----------------Av~eIV 96 (524) T protein:vir:10 49 EQNIP-------YNALMQ--QMFGSN----EP-EVKNTRELIDTYRNLMNN--YEVDN----------------AVQEIV 96 (524) T ss_pred ccccc-------chhhhh--hhhhcc----cc-hhhhHHHHHHHHHHHhhc--cchhh----------------HHHHhh Confidence 22111 111100 000000 00 011334456666666533 11100 001122 Q ss_pred HHHh-hhhhcCcceEeeCCH--------HHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCC----c-eEEEE Q lcl|NC_012753. 81 KKVA-SLVFNEQATIRVDNE--------VADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGD----Q-IRVSF 146 (502) Q Consensus 81 ~~~a-~~l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~----~-~~i~~ 146 (502) +... .=-..+|+++.+++- ...+..+.+++--+|+++..+.+....+-|..|++..+|++ + ..+.. T Consensus 97 neaiv~d~~~~pV~l~Ld~~~~s~siK~kI~eeF~~Il~ll~F~~~~~~~fR~WYVDgRi~fHkiid~~~pk~GI~Elr~ 176 (524) T protein:vir:10 97 SDAIVYEDDKEVVALNLDGTDFSQSIKDKILAEFSEVLNLLNFQRKGTDHFQRWYVDSRIFFHKIINPKKMKDGVQELRR 176 (524) T ss_pred cceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeceEEEEEEeeCCCccccceeeee Confidence 1111 011124566666542 35566777887779999999999999999999999999843 3 46888 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCce-EEEEEEEEEEeCCeEEEEEEEEecC--Ccc------ccCce--ee Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVK-YYSLIEFHEWNKETYTISNELYESE--SKT------IIGQR--VP 215 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~-~yt~~E~h~~~~~~~~I~~~l~~~~--~~~------~lG~~--v~ 215 (502) ++|..+-++..- . .+..++. .++- +-++.+|... ++. ..++. +| T Consensus 177 lDPr~i~~vr~i-------------~-~~~~~~~~vi~~-----------~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~ 231 (524) T protein:vir:10 177 LDPRQVQYIREI-------------V-TRMEDGVKIVDG-----------YREFFVYDTGHESYCADGRIYSAGTKVKIP 231 (524) T ss_pred eCCccceeeeee-------------c-ccCcccchhhcc-----------hhhheeecCCCcccccCcceecCCcceecc Confidence 888776554321 1 1111111 1110 0111111100 000 01111 11 Q ss_pred ccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH--HHHhhccceee-ech- Q lcl|NC_012753. 216 LSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM--WEVKMGQRRVA-VPT- 291 (502) Q Consensus 216 l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~--~~~~~~~~~i~-v~~- 291 (502) -+. +++...+- +..+. ..=+|-|..+......|=-+-+.++ +-.++-..||| |+- T Consensus 232 ~dA--------Ivy~~SGL-----~d~~~--------~~i~syLhkAiKp~NQLkm~EDAlVIYRitRAPeRRvFYIDVG 290 (524) T protein:vir:10 232 RAA--------VVYAHSGL-----LDCCG--------KNIIGYLQRAIKPANQLKLMEDAMVIYRITRAPDRRVFYIDTG 290 (524) T ss_pred hhh--------eeeeccCc-----ccCCC--------CceeccchHhhHHHHhhHHHHhhHHHHhhhccccceEEEEecC Confidence 111 11100000 00000 0002333333332222221111111 11234444554 211 Q ss_pred --------HHh---------ccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHH Q lcl|NC_012753. 292 --------QMI---------KTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSL 354 (502) Q Consensus 292 --------~~l---------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~ 354 (502) .+| +.+-+...++++.++-+..-...|..- --+|+.+.-|+++..--...+ ++-++.+.+. T Consensus 291 nlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEItTLpGgqnlge-m~DV~YF~kk 368 (524) T protein:vir:10 291 NMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQ-RRDGKAVTEVDTMPGATGMSD-MDDVLYFRTA 368 (524) T ss_pred CCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhccc-ccCCCCccceeeccccCCcCh-HHHHHHHHHH Confidence 011 001122223333222222222222111 113333344666554322222 2446667778 Q ss_pred HHHhcCCChhhcccccc-c--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc--cceE Q lcl|NC_012753. 355 FEMQLGVSTGMFSFDGK-S--MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTM--DEVS 429 (502) Q Consensus 355 i~~~~g~s~~~~~~~~~-~--~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~--~~i~ 429 (502) +....++|.+.+..++. + ..-++||....-....-+.+++..|..-+.++++.-|.+-. ++...-+.. ..+. T Consensus 369 Ly~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLilKg---iit~eew~~i~~~I~ 445 (524) T protein:vir:10 369 LYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLILKK---IITEDEWEREINNIK 445 (524) T ss_pred HHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---CCCHHHHHHHhhcce Confidence 88888888877753321 1 22355665555555567888888888888888887665432 222211111 3577 Q ss_pred EEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 430 VDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 430 v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) ++|...--..+..+++.+..-. ..+ ..|.+++.++....||+|.+++.+.|++|....--..+..+.-|| T Consensus 446 ~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 446 VTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEESKEARFQNPDEEEEDF 524 (524) T ss_pred EEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHhhcCCCCCCChhhhcC Confidence 8886654444444444433211 112 458998877888999999999999999998766555555556666 No 209 >protein:vir:80211 Length: 514 # NCBI annotation: putative head-tail connector protein # Family: family:all:481 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522882;genbank:gi:158345175;genbank:GeneID:5687474 Probab=96.14 E-value=0.00095 Score=37.15 Aligned_cols=429 Identities=7% Similarity=0.010 Sum_probs=161.9 Q ss_pred HHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccc--cccCCCccccccceecchHHHHHHHHhhhhh Q lcl|NC_012753. 11 IKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVT--YRDSNGSQVKRDFNHLPIGRTASKKVASLVF 88 (502) Q Consensus 11 i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~--~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~ 88 (502) ++. ....... ..++-.....|+.++.=-.+-+. .....+...+..+.--+.+...++.+|+-|. T Consensus 1 m~~--------~~~~l~~------k~~R~~~e~~w~e~a~~~lP~~~~~~~~~~~~~~~~~~~~dstg~~a~~~LAa~l~ 66 (514) T protein:vir:80 1 MRQ--------QASAMWA------EYRDSTAIRKAEDFAKFTIASLMVDPLDKTHQAEVVEYDFQSAGAFLVNNLTAKLA 66 (514) T ss_pred Ccc--------chHHHHH------HhhcchHHHHHHHHHHHhcccccCCCCCCcccccccccccchhHHHHHHHHHHHHH Confidence 110 0000000 11122234444444322111111 0011111111111112345666777776666 Q ss_pred cC--cc-----eEeeCCH-------------HHHHH-------HHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCc Q lcl|NC_012753. 89 NE--QA-----TIRVDNE-------------VADAF-------INETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQ 141 (502) Q Consensus 89 ~e--p~-----~i~~~d~-------------~~~e~-------l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~ 141 (502) +- || ++.++|+ .+.++ +...+..++|...+.++..+..+.|.+.+ |.+++. T Consensus 67 ~~ltpp~~~WF~l~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~~~~~ 144 (514) T protein:vir:80 67 LTLFPPGRPSFQIELDDTLQELAAANGIDQSELHSRTADLERRATRRLFVNASLSKLHRILKLLVVTGNALF--YREPGT 144 (514) T ss_pred hhhcCCCCcccccccCchhhhhccccchhHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEEE--EEecCC Confidence 52 22 1333321 13333 33446678999999999999999998764 456554 Q ss_pred eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe-------------------eCCCceEEEEEEEEEEeCCeEEEEEEEE Q lcl|NC_012753. 142 IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKT-------------------EGQKVKYYSLIEFHEWNKETYTISNELY 202 (502) Q Consensus 142 ~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~-------------------~~~~~~~yt~~E~h~~~~~~~~I~~~l~ 202 (502) -.+..+|-.+++ +..|..+....+|....... ..+....|+++++..-.++.+...| T Consensus 145 ~~~~~~pl~~y~-v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~v~v~~~v~~~~~~~~~~~sv~--- 220 (514) T protein:vir:80 145 GKMLVWTMQSYT-VRRTSHGDPAVVVLRQQMPFRELTPEIQADAQAKQIAKRDSDKCDLYTVIEWQPTPNGKRCAVW--- 220 (514) T ss_pred CcEEEEEcCeEE-EeeCCCcCeEEEEeeeeecHHHhhhhhhhhhhhhhccCCCCCceEEEEEEEeecCCCCeEEEEE--- Confidence 456667777755 44555444444443322110 0112234455444322222222222 Q ss_pred ecCCccccCceeeccccccCCCcceeecCC--CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 203 ESESKTIIGQRVPLSTLYEDLEETVTLNGL--TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEV 280 (502) Q Consensus 203 ~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~--~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~ 280 (502) ..-.+..+|. ..++ ..-||++++-+ ...++.||+|--..+.+-+..|++.--....-. T Consensus 221 ~e~~g~~i~~----------------es~y~~~e~P~i~~Rw~----~~~ge~YGrgp~~~al~D~k~L~~l~~~~l~~~ 280 (514) T protein:vir:80 221 HELEGKRVGP----------------ESSYPAHLCPYVPVAWN----VPDGEHYGRGYVEEYSGDFARLSILSERLGLYE 280 (514) T ss_pred Eeccceeecc----------------cCccccccCCeeeeeeE----ecCCCCcccchHHHHHHHHHHHHHHHHHHHHHH Confidence 1111111111 1121 12344444422 234678999999999999999996654444332 Q ss_pred -hhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeeccc--cchHHHHHHHHHHHHHHHH Q lcl|NC_012753. 281 -KMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTD--IRSDDYIKAINKGLSLFEM 357 (502) Q Consensus 281 -~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~--ir~e~~~~~l~~~l~~i~~ 357 (502) ...+....|+.+... +.....+. ....+. . +....+..++.. -....-.+.++.+.+.|.. T Consensus 281 ~~a~~~~~~v~~~g~~-----~~~~l~~~-----~~g~~v---~---g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI~~ 344 (514) T protein:vir:80 281 FEALSLLNLVDEAKGG-----AVDDYRDA-----ETGDFV---P---GQVGSVASYERGDYNKIAQASASVESIVMRLNR 344 (514) T ss_pred HHhcCCCceeCccccc-----chhhhccc-----CCceee---c---CCCccceeeecCcccchHHHHHHHHHHHHHHHH Confidence 334444455432211 11100100 000000 0 111122222211 1122222334444444432 Q ss_pred hcCCChhhccccccccccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHh--hcccCCCcccccceEEE Q lcl|NC_012753. 358 QLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS----IATLVEKSLKELVISILELAKV--YNLYTGEIPTMDEVSVD 431 (502) Q Consensus 358 ~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~----~~~~~~~~l~~l~~~il~~~~~--~~~~~~~~~~~~~i~v~ 431 (502) ..=+.. .. ..+...|||||....+...+..+- ++.+|-.. |++-.+.++.- .+..........++++. T Consensus 345 aFml~~-~~--rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~P---li~r~~~il~r~~~g~lP~~p~~l~~~~~v 418 (514) T protein:vir:80 345 AFMYTG-QV--RDAERVTVEEIRTVAEEAENLLGGVYSLLAETLQAP---LAYLTMYEASRGNGGMLLGIAQGVYRPSII 418 (514) T ss_pred HHhhhc-cC--CCCCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHH---HHHHHHHHHhhhccCCCCCCCchhhcceee Confidence 211111 11 122336999999887777665544 44444433 33333333221 11222222222233332 Q ss_pred eCCCccCCHHHH---HHHHHHHHh--cCC-------CCHHHHHH---hcCCC-------CHHHHHHHHHHHHHhhhcc-- Q lcl|NC_012753. 432 LDDGVFTDRNAE---FDYWSKMVA--AGF-------APKTMAIE---KTLNV-------TKEQAQEIYQKINDETMVS-- 487 (502) Q Consensus 432 f~d~i~~d~~~~---~~~~~~~~~--~Gi-------~S~et~l~---~~~~~-------~deea~~el~ri~~E~~~~-- 487 (502) -.= ........ +....+.++ +++ +-...++. ...|+ ++|+++.+.+|.++.+.+. T Consensus 419 s~l-a~l~r~~~~~~l~~~~~~i~~l~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~i~~~~e~~~~~~~~~~~~~~~~~~ 497 (514) T protein:vir:80 419 TGI-PALTRNIETANILRATQEASAIVPALVQLSKRFDPEKLVERIFANNSVDLSTLSKDPDVVAAEAEQEAALAQQQLD 497 (514) T ss_pred ecH-HHHHHHHHHHHHHHHHHHHHHHhccchhhhhcCCHHHHHHHHHHHhCCCHhhccCCHHHHHHHHHHHHHHHHHHHH Confidence 110 00011111 111111110 011 12223332 23444 3343333333332211111 Q ss_pred --cCCCCCccccCCCCC Q lcl|NC_012753. 488 --TDSFRTSEEVDIYGE 502 (502) Q Consensus 488 --~~~~~~~~~~~~~g~ 502 (502) ....-...+.++.=. T Consensus 498 ~~~~~~~~~~~~~~~~~ 514 (514) T protein:vir:80 498 VASGALAAETSAGVLTS 514 (514) T ss_pred HHHHHHHHhhhccccCC Confidence 000001111111111 No 210 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=96.14 E-value=0.00096 Score=37.14 Aligned_cols=383 Identities=12% Similarity=0.084 Sum_probs=159.6 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) =+++.++|+|++..-....... .... .+ + +...+.|. ... .+.-+.+.---..| T Consensus 14 ~g~~~~~~~~~~~~~~~~~~~~--~~~~--~~--~---------~~~~~~~~-------~v~----~~~al~~~~v~~cv 67 (424) T protein:vir:18 14 NGWWARLQSWFVGGRLVTPNQG--SQTG--PV--S---------AHGHLGDS-------SIN----DERILQISTVWRCV 67 (424) T ss_pred CchHHHHHhhhccccccccccc--cccc--cc--c---------cccccccc-------ccc----HHHhhccHHHHHHH Confidence 5677788887753211111000 0000 00 0 00001110 000 00111112222455 Q ss_pred HHHhhhhhcCcceE-eeC-CH---H--HHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCC-c-eEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATI-RVD-NE---V--ADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGD-Q-IRVSF 146 (502) Q Consensus 81 ~~~a~~l~~ep~~i-~~~-d~---~--~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~-~-~~i~~ 146 (502) +..|+-+-+=|+.+ ..+ +. . ...-|.++|.. | ....-....+...+..|.+|+.+..+.+ . +.+.. T Consensus 68 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~~G~~~~L~p 147 (424) T protein:vir:18 68 SLISTLTACLPLDVFETDQNDNRKKVDLSNPLARLLRYSPNQYMTAQEFREAMTMQLCFYGNAYALVDRNSAGDVISLLP 147 (424) T ss_pred HHHHHhhccCceEEEEeecCCceeeeccccHHHHHHhhccCCCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEE Confidence 55555555545543 111 11 0 11123343431 1 2233344556678888999988877654 3 35666 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcc Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEET 226 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~ 226 (502) ++|..+-+. .+.+.. .|+ + ..++.... ++ +. T Consensus 148 l~~~~V~v~-~~~~~~------------------~y~---~-~~~g~~~~-----------------~~--------~~- 178 (424) T protein:vir:18 148 LQSANMDVK-LVGKKV------------------VYR---Y-QRDSEYAD-----------------FS--------QK- 178 (424) T ss_pred ecCcceEEE-EcCCeE------------------EEE---E-EeCCeEEE-----------------ec--------cc- Confidence 677665542 222111 110 0 00000000 00 00 Q ss_pred eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-Hhhcc-ceee--echHHhccCCCCCC Q lcl|NC_012753. 227 VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQ-RRVA--VPTQMIKTEYDTNG 302 (502) Q Consensus 227 ~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~-~~i~--v~~~~l~~~~~~~g 302 (502) -.++++.+. .+...|+|-+.-+...++.. ....++... |..+. ...+ +|..++. T Consensus 179 ---------eIih~r~~~-----~dg~~G~spi~~~~~~i~~~-~a~~~~~~~~f~ng~~p~gil~~~~~~l~------- 236 (424) T protein:vir:18 179 ---------EIFHLKGFG-----FTGLVGLSPIAFACKSAGVA-VAMEDQQRDFFANGAKSPQILSTGEKVLT------- 236 (424) T ss_pred ---------cEEEecCcC-----CCCcccccHHHHHHHHHHHH-HHHHHHHHHHHHccCCcceEEEeCCcCCC------- Confidence 012333221 12356888887776666543 333333333 44433 3222 2221110 Q ss_pred cccCccccccccc-hhhccccCC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH Q lcl|NC_012753. 303 EKVTVKREFETGH-NVYEQFDSG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE 378 (502) Q Consensus 303 ~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte 378 (502) .+-. ..+.... ..+..-... --+.+.-++.++.....-++.+..+...++|+...|++|..+|+...+..+++. T Consensus 237 ~e~~--~~~~~~~~~~~~g~nag~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn 314 (424) T protein:vir:18 237 EQQR--SQVEENFKEIAGGPVKKRLWILEAGFSTSAIGVTPQDAEMMASRKFQVSELARFFGVPPHLVGDVEKSTSWGSG 314 (424) T ss_pred HHHH--HHHHHHHHHHhCCcccCCceeccCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccccccc Confidence 0000 0000000 011000000 001122355565555666788888888999999999999999876554432222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPK 458 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~ 458 (502) +....... ++.+|..++..|-...+. .++...-.....+.++++.-+..|..+.++...+++.+|+|+. T Consensus 315 ~eq~~~~f----------~~~tl~P~~~~ie~~l~~-~L~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~ 383 (424) T protein:vir:18 315 IEQQNLGF----------LQYTLQPYISRWENSIQR-WLIPAKDVGRIHAEHNLDGLLRGDSASRAAFMKAMGEAGLRTI 383 (424) T ss_pred HHHHHHHH----------HHHHHHHHHHHHHHHHHh-hcCCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 22111111 122333333322221111 1111111112335566666677899999999999999999999 Q ss_pred HHHHHhcCCCCH-HHHHHH-----HHHHHHhhhcccCCCCCcccc Q lcl|NC_012753. 459 TMAIEKTLNVTK-EQAQEI-----YQKINDETMVSTDSFRTSEEV 497 (502) Q Consensus 459 et~l~~~~~~~d-eea~~e-----l~ri~~E~~~~~~~~~~~~~~ 497 (502) -++++.. |+.. +..++- +..+.+-.....|. ..++ T Consensus 384 NE~R~~~-gl~pi~gGD~~~~~~n~~~l~~~~~~~~p~---~~ga 424 (424) T protein:vir:18 384 NEMRRTD-NLPPLPGGDVAMRQSQYVPITDLGTNKEPR---NNGA 424 (424) T ss_pred HHHHHHh-CCCCCCCcCeeeeccCccchHhhhccCCCc---cCCC Confidence 8876543 4332 101111 11111100011111 1111 No 211 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=96.12 E-value=0.00098 Score=37.07 Aligned_cols=427 Identities=9% Similarity=0.027 Sum_probs=164.3 Q ss_pred CChhHHHHHHHH--------------------HHhhcccccchhhhhccccccCCHH--HHHHHHHHHHHhcCCCC-ccc Q lcl|NC_012753. 1 MGIIQTIKNFIK--------------------RSNYVITNQSLNSITDHPKIAISPE--EYNRIMDNLRYFAGDFD-SVT 57 (502) Q Consensus 1 m~~~~~ik~~i~--------------------~~~~~~~~~~l~~i~~~~~~~~~~~--~~~~i~~~~~~Y~g~~~-~~~ 57 (502) -+|...+..+-| -..|.+.-...+.++.+.+|..+-. .-..-.+. --+.++.. .+. T Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~kk~~i~~pfkkk~~~~~~d~-f~~s~es~s~vt 103 (945) T protein:vir:10 25 SNIKANVDSLSRGKDYPGFKPLLTYRALAWNSTVVYSIIIFRKNQVLKKEKIIVPYNHQEPPFKFNL-FEYSPESLMYLP 103 (945) T ss_pred ccchhchhhhhcccCCCCcchhhhhhhhhccceeeeeeeeehhhhHHHhhcccccccccccchhhhh-hhccCccceecc Confidence 001111111100 0111111122222222222211100 00000000 01222211 010 Q ss_pred ccc-CCC----ccccccceecchHHHHHHHHhhhhhcCcceE--eeCCH---------HHHHHHHHHHhh-c------cH Q lcl|NC_012753. 58 YRD-SNG----SQVKRDFNHLPIGRTASKKVASLVFNEQATI--RVDNE---------VADAFINETLKN-D------KF 114 (502) Q Consensus 58 ~~~-~~~----~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i--~~~d~---------~~~e~l~~~~~~-~------~f 114 (502) ... ... .-..+.......-...|+..|+-+.+=|+.+ ..++. .....+..+++. | +| T Consensus 104 sls~pdaf~~vnVs~~~AlknsaV~scI~~IA~sIAsLPlklYrr~edG~~~~~~kk~~~~hpL~~LL~rPNp~mT~~eF 183 (945) T protein:vir:10 104 SISDPDAFFLINLFRKYRFNNDSKLIKVSEIPKKLTSKELEIYKHIEDKHVNYYLKRIRDARNILEFLERPDPYFSEVNS 183 (945) T ss_pred cccCccceeeehhhhhhhhccHHHHHHHHHHHhhhccCceEEEEecccCcccccccccccchHHHHHHhCCCcccChhHH Confidence 000 000 0000111111222445666666666666654 11111 112234444432 2 23 Q ss_pred HH-HHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEe Q lcl|NC_012753. 115 SK-NFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWN 191 (502) Q Consensus 115 ~~-~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~ 191 (502) ++ .++.++.+++..|.+|+.+..+. |.+ .+..++|.++-|...++++.. .++.. .. T Consensus 184 wqsFl~~Lv~dLLL~GNAYieIiRd~~G~ii~L~pLdPs~Vti~~ddDG~~~-----y~Yv~---------------~i- 242 (945) T protein:vir:10 184 WEYLLGMVLDDILTIDRGAIVKIRDEQGNLVAITPVDGTTIKPILSEDTGIV-----VGYVQ---------------EV- 242 (945) T ss_pred HHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEECCcceEEEEcCCCcEE-----EEEEE---------------ec- Confidence 33 33455678889999999887764 444 677788888877544333221 00000 00 Q ss_pred CCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHH Q lcl|NC_012753. 192 KETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINT 271 (502) Q Consensus 192 ~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~ 271 (502) ++..... |.. .+ .+. .++++..+ ....++|+|.+..+...+..... T Consensus 243 dG~~~~~---v~a--------------------~D-------vIl--hirn~s~D--G~~~GyGlSPIeaa~~aI~~alA 288 (945) T protein:vir:10 243 DGAIVAH---FDK--------------------RD-------VVL--FRQNLTPD--VYMYGYSLPPIEILYKVILSDIF 288 (945) T ss_pred CCceEEE---ecC--------------------Cc-------eEE--EeccCCCC--cccccCCchHHHHHHHHHHHHHH Confidence 0000000 000 00 011 11221111 12234688877766655544332 Q ss_pred HHHHHHHHHh-hcc-ce-ee-echHHhccCCCCCCcccCcc-c-cccccc-hhhccccCC---CCccccceeeeccccch Q lcl|NC_012753. 272 TYDEFMWEVK-MGQ-RR-VA-VPTQMIKTEYDTNGEKVTVK-R-EFETGH-NVYEQFDSG---DMDKGIGITDLTTDIRS 341 (502) Q Consensus 272 ~~S~~~~~~~-~~~-~~-i~-v~~~~l~~~~~~~g~~~~~~-~-~~~~~~-~~~~~~~~~---~~~~~~~i~~~~~~ir~ 341 (502) +-....+-|. .|. .+ |+ ++.... ......+ ...+. . .+.... ..+...... -.+.+.-++.++..... T Consensus 289 aek~aar~FskNGa~PsGILsvkg~~~-~d~k~~~-~LseEq~erlKe~wee~~sG~NnG~piVLdeGmef~pLs~s~~D 366 (945) T protein:vir:10 289 IDKGNLDYYRKGGSIPEGILAIEPPSY-KEGDIYP-QLSREQLESIQRQLQAIMMGDYTQVPILSGGKFTWIDFKGKRRD 366 (945) T ss_pred HHHHHHHHHHhCCCccceEEEecCccc-ccccccc-ccCHHHHHHHHHHHHHHhCCcccccceecCCCceEEEccCChhH Confidence 2222223333 332 22 22 211100 0000000 00000 0 000000 001000000 00122235556666667 Q ss_pred HHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCC Q lcl|NC_012753. 342 DDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTG 420 (502) Q Consensus 342 e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~ 420 (502) -++.+..+....+|+...|+||..+|+..+.. +++.+....+ ...+..-..+.++..|...+ .. T Consensus 367 aQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq~~~F--v~~tL~Pil~~IEqeLNrkL------------l~- 431 (945) T protein:vir:10 367 MQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVMASLT--KAKGLEPLMATISKGFDEVV------------SE- 431 (945) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHHHHHH--HHHHHHHHHHHHHHHHHHhc------------cc- Confidence 78888888889999999999999998765433 2222221111 11223333333343333321 00 Q ss_pred CcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHHHHH----------HHhhhcc-- Q lcl|NC_012753. 421 EIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKE-QAQEIYQKI----------NDETMVS-- 487 (502) Q Consensus 421 ~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~de-ea~~el~ri----------~~E~~~~-- 487 (502) ......+.+.|+.....+..+.++...+++.+|+|+.-++++.. |+..- .-+.-+-.. +..+... T Consensus 432 -~~eg~~i~fdFd~ldl~D~ksraEal~kli~sGiLTiNEvRe~l-GLpPIeGGD~lli~~nn~~P~d~~~ka~~ga~p~ 509 (945) T protein:vir:10 432 -FRNEKDIKLWFKEDDLEKERDWWNIIQGQLNTGFRSINEARMEK-GLEPVPWGDVPFSGLRNWKPEDEQAKAQQGAMPP 509 (945) T ss_pred -cccCceeEEEecchhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcceeeeccccccccccccccccCCCCc Confidence 11224578889888888999999999999999999999976543 33220 000000000 0000000 Q ss_pred -----cCCCCCccccCCC------CC Q lcl|NC_012753. 488 -----TDSFRTSEEVDIY------GE 502 (502) Q Consensus 488 -----~~~~~~~~~~~~~------g~ 502 (502) ....+.+.+++-- +| T Consensus 510 q~aq~~~dqp~~kGGe~dEns~~psE 535 (945) T protein:vir:10 510 QLAQAMADQPSQQGGGVDENSSVPSE 535 (945) T ss_pred ccccCCCCCCCCCCCCCCCCCCCCCc Confidence 0111111111111 11 No 212 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=400 Identities=11% Similarity=0.045 Sum_probs=157.1 Q ss_pred hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccc-cceecchHHHHHH Q lcl|NC_012753. 3 IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKR-DFNHLPIGRTASK 81 (502) Q Consensus 3 ~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~-~~~~~n~~k~iv~ 81 (502) +...|.+.+|+...... + ....|.+|. |.. +......+..... ..+..+.--..|+ T Consensus 1 ~~~~~~~~~~~~~~~~~--~------------------~~~~~~~~~-g~~--~~~~~~~~~~~~~~~a~~~~~v~~~v~ 57 (460) T protein:vir:10 1 MANRIIRALRELTGLDN--K------------------FNDAFIKYI-GQT--FTKYDNNGKTYLEQGYNINPDVYSCIS 57 (460) T ss_pred CchhHHHHHhhhhccCC--C------------------chHHHHHhh-ccc--cCCCccchhhhhHHHHhcchHHHHHHH Confidence 44455555554321111 0 112233333 211 1111111111111 1111122234556 Q ss_pred HHhhhhhcCcceEeeCC--HH-------------------------------HHHHHHHHHhh----ccHHHHHHHHHHH Q lcl|NC_012753. 82 KVASLVFNEQATIRVDN--EV-------------------------------ADAFINETLKN----DKFSKNFERYLES 124 (502) Q Consensus 82 ~~a~~l~~ep~~i~~~d--~~-------------------------------~~e~l~~~~~~----~~f~~~~~~~~~~ 124 (502) .+|+-+.+=|..+--.+ .. ....+..++.. -........++.. T Consensus 58 ~ia~~iA~lp~~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~~~~~ 137 (460) T protein:vir:10 58 QMAAKTVAVPYTIKVVKDTKAYQQLNNLNISTKGLYSFTQSLQKNRLDTKAFSETEKAFPLESPNPTQTWADIYSLYKTY 137 (460) T ss_pred HHHHhhhhCceEEEeccCCccchhhhhhhhhhhhhHHHHHHhhcchhhhcccchhHHHHHHhCCCCCCCHHHHHHHHHHH Confidence 66666655555432111 00 00111112211 1233444556678 Q ss_pred HhhcCCEEEEEEEeC-----Cce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEE Q lcl|NC_012753. 125 CLALGGLAMRPYIDG-----DQI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTIS 198 (502) Q Consensus 125 ~~~~G~~~~~~~~d~-----~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~ 198 (502) .+..|.+|+.+..+. |.+ .+..++|+.+-+...+.+.... ++ +... .|.+. T Consensus 138 lll~Gnay~~i~r~~~~~~~G~~~~L~~l~~~~v~v~~~~~~~~~~----~~-----------~~~~--------~~~~~ 194 (460) T protein:vir:10 138 MRLNGNCYFYLMSPDDGINAGVPSQMYVLPAHLIKIVLKDDINLLS----TD-----------SPIK--------SYMLI 194 (460) T ss_pred HhhcCCeEEEEEecCCCccCceeEEEEEEcCceEEEEEcCCCceee----ee-----------eeee--------EEEEe Confidence 888999988876642 333 4666777777654332221110 00 0000 01100 Q ss_pred EEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccc-cCcCCcchhhhHHHHHHHHHHHHHHHH Q lcl|NC_012753. 199 NELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDI-NSPLGLSIFDNAKTTMDFINTTYDEFM 277 (502) Q Consensus 199 ~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~-~~p~G~S~~~~~~~lid~ld~~~S~~~ 277 (502) . + |..+.+ .+ - -.++|+.+.+++... ...+|+|.+.-+...|.....+-.-.. T Consensus 195 -----~-~----g~~~~~-------~~------~---evih~r~~~~~~~~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~ 248 (460) T protein:vir:10 195 -----Q-G----DQFIEF-------NE------D---EVIHTKYANPNFDLQGSHLYGMSPIRAILRNINSQNSTIDNNV 248 (460) T ss_pred -----c-C----ceeEEe-------cc------c---ceEEEecCCCCcccccCccccccHHHHHHHHHHHHHHHHHHHH Confidence 0 0 111100 00 0 124455443333222 234699998887777766554443333 Q ss_pred HHHhhccceeeechHHhccCCCCCCcccCcccccccc-chhhccccC----CCCccccceeeeccccchHHHHHHHHHHH Q lcl|NC_012753. 278 WEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG-HNVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGL 352 (502) Q Consensus 278 ~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l 352 (502) +-|..+...-++ +......+.... ..+... ...+..... .--+.+.-++.++.....-++.+..+... T Consensus 249 ~~f~ng~~~~~i----~~~~~~l~~e~~---~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~ 321 (460) T protein:vir:10 249 KTMQNGGVFGFI----HGGSTGLTQPQA---DSLKQRLTEMDKSPDRLSQIAGASGEIAFTKISLNTDELKPFDYLKYDQ 321 (460) T ss_pred HHHhcCCCccee----eecCCCCCHHHH---HHHHHHHHHHhcCccccCCceecCCCceEEEccCChhHHHHHHHHHHHH Confidence 445554332222 221111110000 000000 000110000 00012223555555555667888888889 Q ss_pred HHHHHhcCCChhhccccccccc---cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceE Q lcl|NC_012753. 353 SLFEMQLGVSTGMFSFDGKSMK---TATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVS 429 (502) Q Consensus 353 ~~i~~~~g~s~~~~~~~~~~~~---tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~ 429 (502) .+|+...|+||..+|....+.. ++.+....+ ...++.-..+.++..|..- | +..........+. T Consensus 322 ~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f--~~~~l~P~~~~ie~~ln~k----l-------~~~~~~~~~~~i~ 388 (460) T protein:vir:10 322 KAICNALGWSDKLLNNNEGGGLNTGNLEEERKRV--VTDNIQPDLVILKQAFDKK----F-------IKRFKGYENAVIE 388 (460) T ss_pred HHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHH--HHHHHHHHHHHHHHHHHHh----h-------cCcccccCCceEE Confidence 9999999999999987544322 222221111 1112222333333333221 0 0111111222344 Q ss_pred EEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHH-----HHHHHHhhhcccCCCCCccc Q lcl|NC_012753. 430 VDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEI-----YQKINDETMVSTDSFRTSEE 496 (502) Q Consensus 430 v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~e-----l~ri~~E~~~~~~~~~~~~~ 496 (502) ++|+. .....+......+++.+|+++.-++++.. ++++++..++- +..+.+-.....++..+-.. T Consensus 389 ~d~~~--l~~l~~d~~~~~~~~~~g~~T~NE~R~~~g~~pi~~~~gD~~~~~~n~~~~~~~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 389 WDISE--LPEMQTDMVAMASWLNTIPVTPNEIRIAMKYETLNQDGMDIVFMPSNKVRIDDVSNNLIDSAFNQNQ 460 (460) T ss_pred eecch--hhhHHHHHHHHHHHHhCCCCCHHHHHHHhCCCCCCCCCCCeeeecccccchhhcccccCCCcccCCC Confidence 44433 21122334445567789999999876653 34443322211 11221111111111111111 No 213 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=95.98 E-value=0.0012 Score=36.67 Aligned_cols=381 Identities=13% Similarity=0.061 Sum_probs=157.1 Q ss_pred cccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcc---ccc-----cceecchHHHHHHHHhhhhhcCcceEeeCC-- Q lcl|NC_012753. 29 HPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQ---VKR-----DFNHLPIGRTASKKVASLVFNEQATIRVDN-- 98 (502) Q Consensus 29 ~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~---~~~-----~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d-- 98 (502) -.+..++.+- .-.....+.+.+-...+...+...-. ..+ ....-.-=.-.+.+-..-+++.+..|...+ T Consensus 1 v~~~~l~~e~-at~~~~~d~~~~~~~~l~~~~~~il~~a~~g~~~~y~~l~~D~~i~s~l~~rk~av~~~~w~i~p~~~~ 79 (488) T protein:vir:99 1 MEKPALGREI-ATSGDGRDITRPFISGLQVPNDSILQRRGGNDLRVYEEILSDAQVKTVWGQRQLAVVSREWKVEAGGDR 79 (488) T ss_pred CCccchhHHH-HHHHhhhhhhccccCCCCCCChHHHHhhccCCHHHHHHHhhChHHHHHHHHHHHHHhcCCceEEcCCCC Confidence 0111222111 11222222222211111100000000 000 000000012334555566677777775432 Q ss_pred ---HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe--CCce---EEEEEcCCeEEEEEEcCCCeEEEEEEEE Q lcl|NC_012753. 99 ---EVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYID--GDQI---RVSFVQATVFFPLQANTQDVSSAAIVTK 170 (502) Q Consensus 99 ---~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d--~~~~---~i~~v~~~~~~Pi~~d~~~~~~~~~~~~ 170 (502) ....+++.++|++-.|...+..++ +|..+|-+++-+.|. ++.+ ++.++++..|.+ +..+. T Consensus 80 ~~~~~~ae~v~~~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~~---d~~~~-------- 147 (488) T protein:vir:99 80 PIDQAAAEHLEQQLQRVGWDRVTSKML-FGVFYGYAVSELIYGRDDRYITLEAIKVRNRRRFRY---DQDGG-------- 147 (488) T ss_pred hHHHHHHHHHHHHHhCCCHHHHHHHHH-hhhhhcceeEEEEEeecCCeeeEeeeeeecccceee---cCCCc-------- Confidence 345678999998878988888777 488899888877774 3332 344444432221 11100 Q ss_pred EEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcc-eEEEecCCcccccc Q lcl|NC_012753. 171 STKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRP-LFTYLKPPGMNNKD 249 (502) Q Consensus 171 ~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~-~f~~~~~~~~n~~~ 249 (502) .++. ..+...-|.++| .| -|++.+ . ... T Consensus 148 ------------------------l~~~-----~~~~~~~g~~lp------------------~~~~~i~~~-~---~~~ 176 (488) T protein:vir:99 148 ------------------------LRLL-----TPNNMFEGEPCP------------------APYFWHFST-G---ADN 176 (488) T ss_pred ------------------------eEEe-----ccCCCCCccccc------------------cCceEEEEe-e---cCC Confidence 0000 000000011111 11 111111 1 112 Q ss_pred ccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCC-- Q lcl|NC_012753. 250 INSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDM-- 326 (502) Q Consensus 250 ~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~-- 326 (502) .++|+|.|.+..+--..---+..+..|+.=++- |.+ +.+ -+ ++.. +..-.....+ .+....+..+.. T Consensus 177 ~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P-~~i----gk-y~~~-~a~~~ek~~l---~~av~~~~~~~~~v 246 (488) T protein:vir:99 177 DDEPYGLGLAHWLYWPVFFKRNGIKFWLIFLDKFGMP-TAV----GR-YDDK-TATPEDKAKL---LAALHAIQTDSAII 246 (488) T ss_pred CCCcccchHHHHHHHHHHHHHhhHHHHHHHHHHcCCc-eee----ee-cCCC-CCCHHHHHHH---HHHHHHHhcCcEEE Confidence 467899999888766543333334444433332 333 222 12 2110 1100000000 011111111100 Q ss_pred -ccccceeeecc-ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH-H Q lcl|NC_012753. 327 -DKGIGITDLTT-DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLK-E 403 (502) Q Consensus 327 -~~~~~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~-~ 403 (502) ..+.-|+.++. .-..+.|...++.+-++|+..+ ++...-+++++|.-+..++. ..-....+..-.+.+...|+ + T Consensus 247 iP~~~~ie~~ea~~~~~~~~~~li~~~d~~Isk~i-LGqtlts~~~~Gs~a~~~vh--~~v~~d~~~aDa~~i~~tln~~ 323 (488) T protein:vir:99 247 MPAGMQAELLEAGRSGTADYKTLHDTMDATIAKVG-LGQVASTQGTPGRLGNDDLQ--ADVRLDLVKADADLICESFNLG 323 (488) T ss_pred ecCCceeEEeecCCCChHHHHHHHHHHHHHHHHHH-hhhhhcccccccchhhHHHH--HHHHHHHHHHHHHHHHHHHHHH Confidence 01112343332 1222345556665555665543 22211122222211111222 22234445556667777775 5 Q ss_pred HHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhc-CCCCHHHHHHhcCCCCHHHHHHHHHHHHH Q lcl|NC_012753. 404 LVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAA-GFAPKTMAIEKTLNVTKEQAQEIYQKIND 482 (502) Q Consensus 404 l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~-Gi~S~et~l~~~~~~~deea~~el~ri~~ 482 (502) |++.++.+ |+ ++ ...+.+.|...-+.|..+.++.+.+++.. |+--.+.++++.+|++.++-.+++. T Consensus 324 li~~l~~~----N~-~~----~~~p~~~~~~~e~edl~~~a~~~~~l~~~~G~~i~~~~i~e~~Gip~~~~~~~~~---- 390 (488) T protein:vir:99 324 PARWLTEW----NF-PG----AQPPRVYRVIEEPEDITAKAERDEKVFRMSGFRPTRGYVQETYGVEVESTQAEAT---- 390 (488) T ss_pred HHHHHHHh----Cc-CC----cCCceeEecCCCcccHHHHHHHHHHHHhhcCCCCCHHHHHHHcCCCCcccccccc---- Confidence 77766654 21 11 12245677777788888999999999985 8755566788888887642211111 Q ss_pred hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 483 ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 483 E~~~~~~~~~~~~~~~~~g~ 502 (502) ...+....+....--+. T Consensus 391 ---~~~~~~~~~~~~~~~~~ 407 (488) T protein:vir:99 391 ---APTPSTEFAEGDQPSDP 407 (488) T ss_pred ---cCCCcccCCCCCCCCCc Confidence 11111100000000001 No 214 >protein:vir:7017 Length: 515 # NCBI annotation: head portal protein # Family: family:all:481 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853590;genbank:gi:31711672;genbank:GeneID:1481798 Probab=95.90 E-value=0.0013 Score=36.44 Aligned_cols=430 Identities=9% Similarity=0.068 Sum_probs=166.8 Q ss_pred HHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh Q lcl|NC_012753. 7 IKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL 86 (502) Q Consensus 7 ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~ 86 (502) +.+-+-+ ..-...++..++. .+..++-.....|+.++.=-.+.+......+. ...+.--+.+...++.+|+- T Consensus 1 ~~~~~~~-----~~~~~~~l~~r~~-~Lk~~R~~~e~~w~e~~~~tlP~~~~~~~~~~--~~~~~~dstg~~a~~~LAa~ 72 (515) T protein:vir:70 1 MQDTILE-----YGGQRSKIPKLWE-KFSKKRSPYLDRAKHFAKLTLPYLMNNKGDNE--TSQNGWQGVGAQATNHLANK 72 (515) T ss_pred Ccchhhh-----hcCCHHHHHHHHH-HHHHhhhHHHHHHHHHHHHhcccccCCCCCcc--cccccccchHHHHHHHHHHH Confidence 1111111 1111111111111 12223333333444443222221111111111 11122234566677777766 Q ss_pred hhcC--cc-----eEeeCCH-------------HHHHH-------HHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC Q lcl|NC_012753. 87 VFNE--QA-----TIRVDNE-------------VADAF-------INETLKNDKFSKNFERYLESCLALGGLAMRPYIDG 139 (502) Q Consensus 87 l~~e--p~-----~i~~~d~-------------~~~e~-------l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~ 139 (502) |.+- || ++.+.+. ..+++ +...+..++|...+.++.......|.+++ |.|+ T Consensus 73 l~~~ltpp~~~WF~l~~~d~~~~~l~~~~~~~~~v~~~l~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l--~~d~ 150 (515) T protein:vir:70 73 LAQVLFPAQRSFFRVDLTAKGEKVLDDRGLKKTQLATIFARVETTAMKALEQRQFRPAIVEVFKHLIVAGNCLL--YKPS 150 (515) T ss_pred HHHhhcCCCCcccccccChhhhhccccchhHHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHHhHCeEEE--EEeC Confidence 6642 21 1232221 12223 33346678999999999999999998764 5565 Q ss_pred CceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe---------------------eCCCceEEEEEEEEEEeCCeEEEE Q lcl|NC_012753. 140 DQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKT---------------------EGQKVKYYSLIEFHEWNKETYTIS 198 (502) Q Consensus 140 ~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~---------------------~~~~~~~yt~~E~h~~~~~~~~I~ 198 (502) ... ++.+|-.+++ +..|..+....+|....... ..+...+|+++++ .+.++... T Consensus 151 ~~~-~~~~pl~~y~-v~~d~~G~v~~i~rr~~~t~~~l~~~f~~~~~~~~~~~~~~~~~~v~i~~~v~~---~~~~~~~~ 225 (515) T protein:vir:70 151 KGA-MSAVPMHHYV-VNRDTNGDLMDVILLQEKALRTFDPATRMAIEVGMKGKKCKEDDNVKLYTHAQY---AGEGFWKI 225 (515) T ss_pred CCC-eEEEEcCeEE-EeeCCCcCeeEEEeeeeccHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEEEEe---cCCCceEE Confidence 432 5567767744 45565554444443221100 1112234444433 22222211 Q ss_pred EEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 199 NELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMW 278 (502) Q Consensus 199 ~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~ 278 (502) |..-+...+|. +..+. ...-||++++- +...++.||+|--..+.+-+..|+..--.... T Consensus 226 ---~~e~d~~~~~~-----------es~y~---~~e~P~~~~Rw----~~~~ge~YGrgp~~~~l~D~k~L~~l~~~~l~ 284 (515) T protein:vir:70 226 ---NQSADDIPVGK-----------ESRIK---SEKLPFIPLTW----KRSYGEDWGRPLAEDYSGDLFVIQFLSEAMAR 284 (515) T ss_pred ---EEecCceeecc-----------ccccc---cccCCceeeee----eecCCCCcccchHHHhhHHHHHHHHHHHHHHH Confidence 11111111110 00110 12234444432 22346789999999999999999976665555 Q ss_pred HH-hhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec--cccchHHHHHHHHHHHHHH Q lcl|NC_012753. 279 EV-KMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT--TDIRSDDYIKAINKGLSLF 355 (502) Q Consensus 279 ~~-~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~l~~~l~~i 355 (502) -. ...+..+.||.+.... ...+.+. ....+.. +....+..++ +......-...++.+.+.| T Consensus 285 ~~~~a~~p~~lv~~~g~~~-----~~~l~~~-----~~g~iv~------g~~~~v~~~~~~~~~d~~~~~~~i~~~~~rI 348 (515) T protein:vir:70 285 GAALMADIKYLIRPGSQTD-----VDHFVNS-----GTGEVIT------GVAEDIHIVQLGKYADLTPISAVLEVYTRRI 348 (515) T ss_pred HHHHhcCCCeeeCcccccc-----hhhcccc-----CCceeec------CCcccceeeecCcccchhHHHHHHHHHHHHH Confidence 43 3455555564432211 1111110 0000000 1111122221 2112222233344444444 Q ss_pred HHhcCCChhhccccccccccHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEE Q lcl|NC_012753. 356 EMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRN----SIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVD 431 (502) Q Consensus 356 ~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~----~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~ 431 (502) ....=+.. +....+...|||||....+...+..+ .++.+|...| +..++ . +........ -+.++ T Consensus 349 ~~af~~~~--l~~rd~~rvTAtEV~~r~~E~~~~LGpv~srL~~Ell~Pl---i~r~~--~---~~~p~~P~~--~v~~~ 416 (515) T protein:vir:70 349 GVIFMMET--MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFAMTMQTPI---AMWGL--Q---EAGDSFTSE--LVDPV 416 (515) T ss_pred HHHHhhhh--hhccCCccccHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH---HHHHH--H---hhCCCCChh--hcccc Confidence 33221111 11112234699999877766655443 3444444333 22221 1 112211111 12222 Q ss_pred eCCCccCCHHHHHH------HHHHHHh--cCC-------CCHHHHH---HhcCC----C--CHHHHHHHHHHHHH-hhhc Q lcl|NC_012753. 432 LDDGVFTDRNAEFD------YWSKMVA--AGF-------APKTMAI---EKTLN----V--TKEQAQEIYQKIND-ETMV 486 (502) Q Consensus 432 f~d~i~~d~~~~~~------~~~~~~~--~Gi-------~S~et~l---~~~~~----~--~deea~~el~ri~~-E~~~ 486 (502) +-. +.+.....+ ...+.++ +++ +-...++ ....| + ++||++++.++.++ ++.+ T Consensus 417 ~vs--~l~~L~r~q~~~~i~~~~q~i~~~~~~~p~~~~~id~d~~~~~~a~~~g~p~~~~rs~eev~~~r~q~~~~~~~~ 494 (515) T protein:vir:70 417 IVT--GIEALGRMAELDKLANFAQYMSLPQTWPEPAQRAIRWGDYMDWVRGQISAELPFLKSEEEMQQEMAQQAQAQQEA 494 (515) T ss_pred eeh--hHHHHHHHHHHHHHHHHHHHHHHHhccChhHHhhCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHHHHHHHHH Confidence 211 111111111 1111111 111 1111111 12222 1 66777666443222 2222 Q ss_pred ccCCCCCccccCCCCC Q lcl|NC_012753. 487 STDSFRTSEEVDIYGE 502 (502) Q Consensus 487 ~~~~~~~~~~~~~~g~ 502 (502) .....-.....++-|+ T Consensus 495 ~~~~~~~~a~~~~~~~ 510 (515) T protein:vir:70 495 MLNEGVAKAVPGVIQQ 510 (515) T ss_pred HHHHhhhhhcccchhh Confidence 2222223333344444 No 215 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=95.86 E-value=0.0013 Score=36.32 Aligned_cols=426 Identities=13% Similarity=0.126 Sum_probs=158.6 Q ss_pred HHHHHHHHhhcccccchhhhhccccccCC-HHHHHHHHHHHHHhc-------CCCCccc------------cccC----C Q lcl|NC_012753. 7 IKNFIKRSNYVITNQSLNSITDHPKIAIS-PEEYNRIMDNLRYFA-------GDFDSVT------------YRDS----N 62 (502) Q Consensus 7 ik~~i~~~~~~~~~~~l~~i~~~~~~~~~-~~~~~~i~~~~~~Y~-------g~~~~~~------------~~~~----~ 62 (502) +.++++.. +.+-...-++++++.+|.=+ +.....|+..-++|. ++.+... ++.. . T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 79 (563) T protein:vir:99 1 MADLFKQF-RLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMK 79 (563) T ss_pred Chhhhhhh-hcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCC Confidence 34444321 11111122233333333111 122233333221111 1111100 0000 0 Q ss_pred Ccc---c-cccceecchHHHHHHHHhhhhhcCc-----------ceEeeC-------CH--HHHHHHHHHHh----h--- Q lcl|NC_012753. 63 GSQ---V-KRDFNHLPIGRTASKKVASLVFNEQ-----------ATIRVD-------NE--VADAFINETLK----N--- 111 (502) Q Consensus 63 ~~~---~-~~~~~~~n~~k~iv~~~a~~l~~ep-----------~~i~~~-------d~--~~~e~l~~~~~----~--- 111 (502) .+. . -+.....++...+++..++.+..-. ..|.+. .. .....|..++. + T Consensus 80 ~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p 159 (563) T protein:vir:99 80 NEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDV 159 (563) T ss_pred CcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCC Confidence 000 0 0001111334444444443332110 112111 11 11223444332 1 Q ss_pred --ccHHHHHHHHHHHHhhcCCEEEEEEEe--C-Cc-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEE Q lcl|NC_012753. 112 --DKFSKNFERYLESCLALGGLAMRPYID--G-DQ-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLI 185 (502) Q Consensus 112 --~~f~~~~~~~~~~~~~~G~~~~~~~~d--~-~~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~ 185 (502) ..|...+..++...+..|.+++.+.+. + |. +.+..++|..+-++..+.+.+.... ..|. T Consensus 160 ~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~-------------~~y~-- 224 (563) T protein:vir:99 160 DRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGG-------------KRFV-- 224 (563) T ss_pred CcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccc-------------eeEE-- Confidence 135566777788899999999887653 2 34 3677788888877543332211000 0010 Q ss_pred EEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHH Q lcl|NC_012753. 186 EFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTT 265 (502) Q Consensus 186 E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~l 265 (502) +...+... . .++ +.+ .+.+ ++++.. .....++|+|.+..+... T Consensus 225 --~~~~g~~~-~---------------~~~--------~~e-------vI~~--~~~~~~--d~~~~~~G~Spi~~a~~~ 267 (563) T protein:vir:99 225 --QVVDKRVV-A---------------SFT--------SRE-------LAMG--IRNPRT--ELSSSGYGLSEVEIAMKE 267 (563) T ss_pred --EEeCCcee-E---------------Eec--------Ccc-------eEEE--eccCCC--CcccCcccchHHHHHHHH Confidence 00000000 0 000 000 0111 111111 012246799998877777 Q ss_pred HHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc--cccccc-chhhccccCC-----CCccccceeeecc Q lcl|NC_012753. 266 MDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK--REFETG-HNVYEQFDSG-----DMDKGIGITDLTT 337 (502) Q Consensus 266 id~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~--~~~~~~-~~~~~~~~~~-----~~~~~~~i~~~~~ 337 (502) |.....+..-..+-|..+...-.+ |....+. ...+. ..+... ...+...... --+.+.-++.++. T Consensus 268 i~~~~~~~~~~~~~f~ng~~p~gi----L~~~~~~---~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~ 340 (563) T protein:vir:99 268 FIAYNNTESFNDRFFSHGGTTRGI----LQIRSDQ---QQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTP 340 (563) T ss_pred HHHHHHHHHHHHHHHHccCCCceE----EEeCCCC---CCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccC Confidence 765444444334445554332111 2211110 00000 000000 0111110000 0011223555565 Q ss_pred ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012753. 338 DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYN 416 (502) Q Consensus 338 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~ 416 (502) ....-++++..+...++|+...|++|..+|+...+..+++.-.+.. ....+.. ....++.+|..++..|-...+. . T Consensus 341 ~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~--~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~-~ 417 (563) T protein:vir:99 341 TANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTL--NEADPGKKQQQSQNKGLQPLLRFIEDLVNR-H 417 (563) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccch--hhccHHHHHHHHHHHHHHHHHHHHHHHHHh-h Confidence 5666778898899999999999999999997654322111100000 0011111 1122344444444433322221 1 Q ss_pred ccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH-HHHHHH------------------- Q lcl|NC_012753. 417 LYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK-EQAQEI------------------- 476 (502) Q Consensus 417 ~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d-eea~~e------------------- 476 (502) +... ....+.+.|.+.-+.+..+ +....+++.+|+|+.-++++. .|+.. +..+.- T Consensus 418 L~~~---~~~~~~~~f~r~D~~~~~e-~~~~~~~~~~G~lT~NE~R~~-~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~ 492 (563) T protein:vir:99 418 IISE---YGDKYTFQFVGGDTKSATD-KLNILKLETQIFKTVNEAREE-QGKKPIEGGDIILDASFLQGTAQLQQDKQYN 492 (563) T ss_pred hchh---cccccEEEeccCCHHHHHH-HHHHHHHhcCCccCHHHHHHH-hCCCCCCCcceeecccccccccccccccCCC Confidence 1111 1134567776654433322 223445678899999886554 34321 000000 Q ss_pred -------HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 -------YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 -------l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+.++. ....+....+...+--|+ T Consensus 493 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:99 493 DGKQKERLQMMMS--LLEGDNDDSEEGQSTDSS 523 (563) T ss_pred ccccchhhhhccc--ccCCCCCCCCCCCCCCCC Confidence 000000 000000000111111111 No 216 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=95.86 E-value=0.0013 Score=36.32 Aligned_cols=426 Identities=13% Similarity=0.126 Sum_probs=158.6 Q ss_pred HHHHHHHHhhcccccchhhhhccccccCC-HHHHHHHHHHHHHhc-------CCCCccc------------cccC----C Q lcl|NC_012753. 7 IKNFIKRSNYVITNQSLNSITDHPKIAIS-PEEYNRIMDNLRYFA-------GDFDSVT------------YRDS----N 62 (502) Q Consensus 7 ik~~i~~~~~~~~~~~l~~i~~~~~~~~~-~~~~~~i~~~~~~Y~-------g~~~~~~------------~~~~----~ 62 (502) +.++++.. +.+-...-++++++.+|.=+ +.....|+..-++|. ++.+... ++.. . T Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 79 (563) T protein:vir:95 1 MADLFKQF-RLGKDYGNNSTIAQVPIDEGLQANIKKIEQDNKEYQDLTKSLYGQQQAYAEPFIEMMDTNPEFRDKRSYMK 79 (563) T ss_pred Chhhhhhh-hcccccccccccceeeccCChhhhHhhhhccchhHHHHHhhhccCCCcchhhhHhhhcccccccccccCCC Confidence 34444321 11111122233333333111 122233333221111 1111100 0000 0 Q ss_pred Ccc---c-cccceecchHHHHHHHHhhhhhcCc-----------ceEeeC-------CH--HHHHHHHHHHh----h--- Q lcl|NC_012753. 63 GSQ---V-KRDFNHLPIGRTASKKVASLVFNEQ-----------ATIRVD-------NE--VADAFINETLK----N--- 111 (502) Q Consensus 63 ~~~---~-~~~~~~~n~~k~iv~~~a~~l~~ep-----------~~i~~~-------d~--~~~e~l~~~~~----~--- 111 (502) .+. . -+.....++...+++..++.+..-. ..|.+. .. .....|..++. + T Consensus 80 ~~~~l~~~l~~~~~n~i~~~~I~t~~~~vA~~~~~~~~~~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~p 159 (563) T protein:vir:95 80 NEHNLHDVLKKFGNNPILNAIILTRSNQVAMYCQPARYSEKGLGFEVRLRDLDAEPGRKEKEEMKRIEDFIVNTGKDKDV 159 (563) T ss_pred CcccHHHHHHHhhcchHHHHHHHHHHHHHHHHhhhhhhhcccccceeEEeecCCCcchhhhhhhHHHHHHhhhcCCCCCC Confidence 000 0 0001111334444444443332110 112111 11 11223444332 1 Q ss_pred --ccHHHHHHHHHHHHhhcCCEEEEEEEe--C-Cc-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEE Q lcl|NC_012753. 112 --DKFSKNFERYLESCLALGGLAMRPYID--G-DQ-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLI 185 (502) Q Consensus 112 --~~f~~~~~~~~~~~~~~G~~~~~~~~d--~-~~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~ 185 (502) ..|...+..++...+..|.+++.+.+. + |. +.+..++|..+-++..+.+.+.... ..|. T Consensus 160 ~~~t~~~f~~~lv~~lll~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~-------------~~y~-- 224 (563) T protein:vir:95 160 DRDSFQTFCKKIVRDTYIYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGG-------------KRFV-- 224 (563) T ss_pred CcchHHHHHHHHHHHHHhcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccc-------------eeEE-- Confidence 135566777788899999999887653 2 34 3677788888877543332211000 0010 Q ss_pred EEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHH Q lcl|NC_012753. 186 EFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTT 265 (502) Q Consensus 186 E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~l 265 (502) +...+... . .++ +.+ .+.+ ++++.. .....++|+|.+..+... T Consensus 225 --~~~~g~~~-~---------------~~~--------~~e-------vI~~--~~~~~~--d~~~~~~G~Spi~~a~~~ 267 (563) T protein:vir:95 225 --QVVDKRVV-A---------------SFT--------SRE-------LAMG--IRNPRT--ELSSSGYGLSEVEIAMKE 267 (563) T ss_pred --EEeCCcee-E---------------Eec--------Ccc-------eEEE--eccCCC--CcccCcccchHHHHHHHH Confidence 00000000 0 000 000 0111 111111 012246799998877777 Q ss_pred HHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc--cccccc-chhhccccCC-----CCccccceeeecc Q lcl|NC_012753. 266 MDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK--REFETG-HNVYEQFDSG-----DMDKGIGITDLTT 337 (502) Q Consensus 266 id~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~--~~~~~~-~~~~~~~~~~-----~~~~~~~i~~~~~ 337 (502) |.....+..-..+-|..+...-.+ |....+. ...+. ..+... ...+...... --+.+.-++.++. T Consensus 268 i~~~~~~~~~~~~~f~ng~~p~gi----L~~~~~~---~ls~e~~~~~~~~~~~~~~G~~nagk~~~vl~~G~~~~~l~~ 340 (563) T protein:vir:95 268 FIAYNNTESFNDRFFSHGGTTRGI----LQIRSDQ---QQSQHALENFKREWKSSLSGINGSWQIPVVMADDIKFVNMTP 340 (563) T ss_pred HHHHHHHHHHHHHHHHccCCCceE----EEeCCCC---CCCHHHHHHHHHHHHHHhccccccccceEEcCCCceEEeccC Confidence 765444444334445554332111 2211110 00000 000000 0111110000 0011223555565 Q ss_pred ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012753. 338 DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS-IATLVEKSLKELVISILELAKVYN 416 (502) Q Consensus 338 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~-~~~~~~~~l~~l~~~il~~~~~~~ 416 (502) ....-++++..+...++|+...|++|..+|+...+..+++.-.+.. ....+.. ....++.+|..++..|-...+. . T Consensus 341 ~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~--~~sn~e~~~~~f~~~tL~P~l~~ie~~ln~-~ 417 (563) T protein:vir:95 341 TANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTL--NEADPGKKQQQSQNKGLQPLLRFIEDLVNR-H 417 (563) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccch--hhccHHHHHHHHHHHHHHHHHHHHHHHHHh-h Confidence 5666778898899999999999999999997654322111100000 0011111 1122344444444433322221 1 Q ss_pred ccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH-HHHHHH------------------- Q lcl|NC_012753. 417 LYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK-EQAQEI------------------- 476 (502) Q Consensus 417 ~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d-eea~~e------------------- 476 (502) +... ....+.+.|.+.-+.+..+ +....+++.+|+|+.-++++. .|+.. +..+.- T Consensus 418 L~~~---~~~~~~~~f~r~D~~~~~e-~~~~~~~~~~G~lT~NE~R~~-~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~ 492 (563) T protein:vir:95 418 IISE---YGDKYTFQFVGGDTKSATD-KLNILKLETQIFKTVNEAREE-QGKKPIEGGDIILDASFLQGTAQLQQDKQYN 492 (563) T ss_pred hchh---cccccEEEeccCCHHHHHH-HHHHHHHhcCCccCHHHHHHH-hCCCCCCCcceeecccccccccccccccCCC Confidence 1111 1134567776654433322 223445678899999886554 34321 000000 Q ss_pred -------HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 -------YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 -------l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+.++. ....+....+...+--|+ T Consensus 493 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 523 (563) T protein:vir:95 493 DGKQKERLQMMMS--LLEGDNDDSEEGQSTDSS 523 (563) T ss_pred ccccchhhhhccc--ccCCCCCCCCCCCCCCCC Confidence 000000 000000000111111111 No 217 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=95.83 E-value=0.0014 Score=36.25 Aligned_cols=387 Identities=13% Similarity=0.100 Sum_probs=162.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccc-cceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKR-DFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~-~~~~~n~~k~i 79 (502) |+||.+.+ |+ ........+ ..+-....|-.. ..+..... .-+.+.---.. T Consensus 1 Mg~f~~~~---~r-~~~~~~~~~-------------------~~~~~~~~~~~~------~~~~~~~~~~al~~~~v~~c 51 (416) T protein:vir:45 1 MGIFYKNE---KR-DLQYNEDDL-------------------QMMVQTLPGFQG------TKLRQYKDIEAIRHSDIFTA 51 (416) T ss_pred CCcccccc---cc-cccCCCcch-------------------hHHHHHhccccc------cCccccchhhhhcchHHHHH Confidence 99986432 11 000000000 001111111000 00000000 00111101124 Q ss_pred HHHHhhhhhcCcceEeeCCHH-HHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCe Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEV-ADAFINETLKN--DK---FSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATV 151 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~-~~e~l~~~~~~--~~---f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~ 151 (502) |+..|+-+.+=|..+.-+++. ....+..+|.. |. ...-...++...+..|.+|+.+..+. |.+ .+..++|++ T Consensus 52 v~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~ 131 (416) T protein:vir:45 52 VMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSE 131 (416) T ss_pred HHHHHHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCce Confidence 555555555556554433321 12223333431 21 22334455666778899998888775 443 577788888 Q ss_pred EEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecC Q lcl|NC_012753. 152 FFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNG 231 (502) Q Consensus 152 ~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 231 (502) +-++..+++.+ .+.+...+. ...... ..| +.. T Consensus 132 v~v~~~~~g~~-----~~~~~~~~~----------------~~~~~~-~~~------------~~~-------------- 163 (416) T protein:vir:45 132 IELKSDARGRL-----YYFHQRIDS----------------NGNNIE-RNV------------KFE-------------- 163 (416) T ss_pred eEEEECCCccE-----EEEEEEecC----------------CCceeE-EEE------------ccc-------------- Confidence 76653222221 000000000 000000 000 000 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHH-HHhhccceeeechHHhccCCCCCCcccCc--c Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMW-EVKMGQRRVAVPTQMIKTEYDTNGEKVTV--K 308 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~-~~~~~~~~i~v~~~~l~~~~~~~g~~~~~--~ 308 (502) -.++++.+ ..+...|+|.+.-+...++..... ..+.. -|..+...-.| |+........+-.. . T Consensus 164 ----evihir~~-----~~d~~~G~s~i~~~~~~i~~~~~~-~~~~~~~f~ng~~~~gi----l~~~~~~~~~~~~~~~~ 229 (416) T protein:vir:45 164 ----DMLDIKFY-----SLDGINGLSLLDTLSRTIESDNNG-KDFLNNFLRNGTHAGGI----LKMKGVLDNKKARDRAR 229 (416) T ss_pred ----cEEEeccC-----CCCCccccCHHHHHHHHHHHHHHH-HHHHHHHHhccCCCcEE----EEeCCCCCCHHHHHHHH Confidence 01233322 112356889888877777644433 33433 34544332222 22211111100000 0 Q ss_pred ccccccchhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 309 REFETGHNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 309 ~~~~~~~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) ..|. ..+...... --+.+.-++.++......++.+..+...++|+...|+||..+|.+..+. +.++....| T Consensus 230 ~~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~~~~- 304 (416) T protein:vir:45 230 EEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDY- 304 (416) T ss_pred HHHH---HHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHH- Confidence 0010 011100000 0011223556666666677888888888999999999999998654432 222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) ..+|..++..|....+. .+.. . .....+.++++.-.-.|..+.++...+++.+|+|+.-+++.. T Consensus 305 -------------~~~l~P~~~~ie~~ln~-~l~~-~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~ 368 (416) T protein:vir:45 305 -------------LSTLKPYITCVCAELNF-KFND-E-YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQR 368 (416) T ss_pred -------------HHHHHHHHHHHHHHHhh-hccc-c-ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 12333333322221111 1111 1 112346666666567788999999999999999999997665 Q ss_pred c--CCCCHHHHHH-----H---HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 465 T--LNVTKEQAQE-----I---YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 465 ~--~~~~deea~~-----e---l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) + +++.+.+... - ++-+.+.+....+. .....=+|| T Consensus 369 ~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~---~~~~~kgGe 413 (416) T protein:vir:45 369 DGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRA---TDKKLKGGE 413 (416) T ss_pred hCCCCCCCCCcceEeecccccccccccccCcccccc---cccccCCCC Confidence 3 2332211100 0 01111111111111 111223444 No 218 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=95.83 E-value=0.0014 Score=36.25 Aligned_cols=387 Identities=13% Similarity=0.100 Sum_probs=162.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccc-cceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKR-DFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~-~~~~~n~~k~i 79 (502) |+||.+.+ |+ ........+ ..+-....|-.. ..+..... .-+.+.---.. T Consensus 1 Mg~f~~~~---~r-~~~~~~~~~-------------------~~~~~~~~~~~~------~~~~~~~~~~al~~~~v~~c 51 (416) T protein:vir:81 1 MGIFYKNE---KR-DLQYNEDDL-------------------QMMVQTLPGFQG------TKLRQYKDIEAIRHSDIFTA 51 (416) T ss_pred CCcccccc---cc-cccCCCcch-------------------hHHHHHhccccc------cCccccchhhhhcchHHHHH Confidence 99986432 11 000000000 001111111000 00000000 00111101124 Q ss_pred HHHHhhhhhcCcceEeeCCHH-HHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCe Q lcl|NC_012753. 80 SKKVASLVFNEQATIRVDNEV-ADAFINETLKN--DK---FSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATV 151 (502) Q Consensus 80 v~~~a~~l~~ep~~i~~~d~~-~~e~l~~~~~~--~~---f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~ 151 (502) |+..|+-+.+=|..+.-+++. ....+..+|.. |. ...-...++...+..|.+|+.+..+. |.+ .+..++|++ T Consensus 52 v~~Ia~~iA~~p~~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~i~~~~ 131 (416) T protein:vir:81 52 VMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLTFRKTSE 131 (416) T ss_pred HHHHHHhhccCceEEecCccccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCce Confidence 555555555556554433321 12223333431 21 22334455666778899998888775 443 577788888 Q ss_pred EEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecC Q lcl|NC_012753. 152 FFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNG 231 (502) Q Consensus 152 ~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 231 (502) +-++..+++.+ .+.+...+. ...... ..| +.. T Consensus 132 v~v~~~~~g~~-----~~~~~~~~~----------------~~~~~~-~~~------------~~~-------------- 163 (416) T protein:vir:81 132 IELKSDARGRL-----YYFHQRIDS----------------NGNNIE-RNV------------KFE-------------- 163 (416) T ss_pred eEEEECCCccE-----EEEEEEecC----------------CCceeE-EEE------------ccc-------------- Confidence 76653222221 000000000 000000 000 000 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHH-HHhhccceeeechHHhccCCCCCCcccCc--c Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMW-EVKMGQRRVAVPTQMIKTEYDTNGEKVTV--K 308 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~-~~~~~~~~i~v~~~~l~~~~~~~g~~~~~--~ 308 (502) -.++++.+ ..+...|+|.+.-+...++..... ..+.. -|..+...-.| |+........+-.. . T Consensus 164 ----evihir~~-----~~d~~~G~s~i~~~~~~i~~~~~~-~~~~~~~f~ng~~~~gi----l~~~~~~~~~~~~~~~~ 229 (416) T protein:vir:81 164 ----DMLDIKFY-----SLDGINGLSLLDTLSRTIESDNNG-KDFLNNFLRNGTHAGGI----LKMKGVLDNKKARDRAR 229 (416) T ss_pred ----cEEEeccC-----CCCCccccCHHHHHHHHHHHHHHH-HHHHHHHHhccCCCcEE----EEeCCCCCCHHHHHHHH Confidence 01233322 112356889888877777644433 33433 34544332222 22211111100000 0 Q ss_pred ccccccchhhccccCC----CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHH Q lcl|NC_012753. 309 REFETGHNVYEQFDSG----DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQS 384 (502) Q Consensus 309 ~~~~~~~~~~~~~~~~----~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~ 384 (502) ..|. ..+...... --+.+.-++.++......++.+..+...++|+...|+||..+|.+..+. +.++....| T Consensus 230 ~~~~---~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~~~~- 304 (416) T protein:vir:81 230 EEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITDANLDY- 304 (416) T ss_pred HHHH---HHhcCccccCceeecCCCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCc-cHHHHHHHH- Confidence 0010 011100000 0011223556666666677888888888999999999999998654432 222222222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHh Q lcl|NC_012753. 385 DTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEK 464 (502) Q Consensus 385 ~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~ 464 (502) ..+|..++..|....+. .+.. . .....+.++++.-.-.|..+.++...+++.+|+|+.-+++.. T Consensus 305 -------------~~~l~P~~~~ie~~ln~-~l~~-~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~~T~NE~R~~ 368 (416) T protein:vir:81 305 -------------LSTLKPYITCVCAELNF-KFND-E-YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNIDEIRQR 368 (416) T ss_pred -------------HHHHHHHHHHHHHHHhh-hccc-c-ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH Confidence 12333333322221111 1111 1 112346666666567788999999999999999999997665 Q ss_pred c--CCCCHHHHHH-----H---HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 465 T--LNVTKEQAQE-----I---YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 465 ~--~~~~deea~~-----e---l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) + +++.+.+... - ++-+.+.+....+. .....=+|| T Consensus 369 ~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~---~~~~~kgGe 413 (416) T protein:vir:81 369 DGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRA---TDKKLKGGE 413 (416) T ss_pred hCCCCCCCCCcceEeecccccccccccccCcccccc---cccccCCCC Confidence 3 2332211100 0 01111111111111 111223444 No 219 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=95.70 E-value=0.0016 Score=35.90 Aligned_cols=400 Identities=13% Similarity=0.100 Sum_probs=157.1 Q ss_pred hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHHHH Q lcl|NC_012753. 3 IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTASK 81 (502) Q Consensus 3 ~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~iv~ 81 (502) ....+.+.+.+....+. ..+..-. ..-+.++... -|..| .|... . .+.... +.-+.+.---.+|+ T Consensus 1 ~~~~l~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~-----~~~~~-~g~~~-~-----~g~~v~~~~al~~~~V~~~i~ 66 (434) T protein:vir:43 1 MSKSLGKVLSSATSAPR-SSLFGWG-GKTIRLTDGA-----FWSQF-LGRES-S-----SGKKVTVDKAMKLSAVWACVR 66 (434) T ss_pred Cccchhhhhhhcccccc-hhhhccc-ccccccCchH-----HHHHH-hcCCc-c-----CCceechhhhhccHHHHHHHH Confidence 01111111111101000 0000000 0001111111 12222 23211 0 111110 01111111124556 Q ss_pred HHhhhhhcCcceE-eeC--C---HHHHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCCEEEEEEEeCCce-EEEEEcC Q lcl|NC_012753. 82 KVASLVFNEQATI-RVD--N---EVADAFINETLKN--DK---FSKNFERYLESCLALGGLAMRPYIDGDQI-RVSFVQA 149 (502) Q Consensus 82 ~~a~~l~~ep~~i-~~~--d---~~~~e~l~~~~~~--~~---f~~~~~~~~~~~~~~G~~~~~~~~d~~~~-~i~~v~~ 149 (502) ..|+-+-+=|+.+ ..+ + ...+-.+..+|.. |. -..-...++...+..|.+|+.+..+.|++ .+..++| T Consensus 67 ~ia~~ia~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~~~~G~~~~L~~l~p 146 (434) T protein:vir:43 67 LISTSVAGLPLGVYERKADGSRVDARSFPLYDVVHNSPNDDMTAFQFWQAMVASMLLWGNAYAEIRRAAGRPAALDFLLP 146 (434) T ss_pred HHHHhhhhCceEEEEEcCCCccccccccHHHHHHhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeCCCcEEEEEEEcC Confidence 6666665555553 211 1 1112234444432 32 22444555667788899988877776664 5666777 Q ss_pred CeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceee Q lcl|NC_012753. 150 TVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTL 229 (502) Q Consensus 150 ~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~ 229 (502) +.+-+...+++.. .++++. .+ |..+.+. +. T Consensus 147 ~~v~~~~~~~g~~-----~y~~~~-----------------~~------------------g~~~~~~------~~---- 176 (434) T protein:vir:43 147 SRVDLECDENGRL-----KYFYTT-----------------KK------------------GARREIE------RT---- 176 (434) T ss_pred cceEEEEcCCCeE-----EEEEEe-----------------cC------------------ceEEEEc------cc---- Confidence 7766543222211 000000 00 1111000 00 Q ss_pred cCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc Q lcl|NC_012753. 230 NGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR 309 (502) Q Consensus 230 ~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~ 309 (502) -.++++.+. .+..+|+|.+.-+...+......-.-..+-|..+...-.+ |.....-+.... . T Consensus 177 ------eVih~~~~~-----~dg~~G~spi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gi----l~~~~~l~~e~~---~ 238 (434) T protein:vir:43 177 ------NMLHIPAFT-----LDGRIGLSAIRYGVDVFGSVMSAEDAANGTFKNGLLPTVA----FKVDRILQPAQR---E 238 (434) T ss_pred ------cEEEecCcC-----CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE----EecCCCCCHHHH---H Confidence 012333221 1234688887776666654443222222234443322111 222111111000 0 Q ss_pred cccccchhhccc-cCC---CCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHH Q lcl|NC_012753. 310 EFETGHNVYEQF-DSG---DMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSD 385 (502) Q Consensus 310 ~~~~~~~~~~~~-~~~---~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~ 385 (502) .+......+... +.. --+.+.-++.++......++++..+....+|+...|++|..+|....+..+++.+...... T Consensus 239 ~~r~~~~~~~g~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~ 318 (434) T protein:vir:43 239 EFREYVKSVSGAMNSGRSPVLEQGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLA 318 (434) T ss_pred HHHHHHHHhcCccccCCccccCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHH Confidence 010000111100 000 0011223555555556678888888889999999999999998755433222222111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc Q lcl|NC_012753. 386 TYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT 465 (502) Q Consensus 386 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~ 465 (502) .++.+|..++..|-.-.+. .+....-.....+.|+++.-+..|..+.++...+++.+|+++.-++++. T Consensus 319 ----------f~~~~L~P~~~~ie~~ln~-kL~~~~~~~~~~~~fd~~~llr~d~~~r~~~~~~~~~~G~~T~NE~R~~- 386 (434) T protein:vir:43 319 ----------FLTFSISSITNQIQQCVNK-RLLTAPERIRYYAEFSLEGFLKADSAGRAAWYSTMAQNGFMTRNEGRRK- 386 (434) T ss_pred ----------HHHHHHHHHHHHHHHHHHh-hcCChhhhcCceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH- Confidence 1223333333322221111 1111111112335555555567789999999999999999999987664 Q ss_pred CCCCH-HHHHH--------HHHHHHHhhh--------cccCCCCCccc Q lcl|NC_012753. 466 LNVTK-EQAQE--------IYQKINDETM--------VSTDSFRTSEE 496 (502) Q Consensus 466 ~~~~d-eea~~--------el~ri~~E~~--------~~~~~~~~~~~ 496 (502) .|+.. +..++ -++.+.+.+. ......+.|.+ T Consensus 387 ~gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 387 ENLPELPGGDILTVQSNLVPIDQLGQSNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred hCCCCCCCCCeEeeccCccchhhhhccCCCcchhhhhhccCCCCCCCC Confidence 33322 00100 0111111110 01111111222 No 220 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=95.54 E-value=0.0019 Score=35.52 Aligned_cols=397 Identities=12% Similarity=0.078 Sum_probs=155.4 Q ss_pred HHhhcccccchhhhhccccccCCHHHHHHHHHH--HHHhcCCCCccc---------------cccCCCccccccceecch Q lcl|NC_012753. 13 RSNYVITNQSLNSITDHPKIAISPEEYNRIMDN--LRYFAGDFDSVT---------------YRDSNGSQVKRDFNHLPI 75 (502) Q Consensus 13 ~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~--~~~Y~g~~~~~~---------------~~~~~~~~~~~~~~~~n~ 75 (502) ..||..- -.+-+...++. .+.... --+...+.+.+. .....+.... ....+.. T Consensus 1 ~~~~~~~-~~~~~~~~~~~--------~~~~~~~~~~f~~~e~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~al~~ 70 (441) T protein:vir:98 1 MHWYNTD-CYFVDFKSRKQ--------SRKELVVVGIFYKNEKRDLQYNEDDLQMMVQTLPGFQGTKLRQYK-DIEAIRH 70 (441) T ss_pred CceecCc-cceeccccccc--------hhhhhhccccccccccccccCCCcchHHHHHHhhcccccCccccc-hhhhhcc Confidence 1111110 00000000000 000000 000000000000 0000000000 0000111 Q ss_pred H--HHHHHHHhhhhhcCcceEeeCCHH-HHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEE Q lcl|NC_012753. 76 G--RTASKKVASLVFNEQATIRVDNEV-ADAFINETLKN--DK---FSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVS 145 (502) Q Consensus 76 ~--k~iv~~~a~~l~~ep~~i~~~d~~-~~e~l~~~~~~--~~---f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~ 145 (502) + -..|+..|+-+-+=|+.+.-+++. ....+-.+|.. |. ...-+..++..++..|.+|+.+..+. |. ..+- T Consensus 71 ~~V~acv~~Ia~~iA~lpl~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~G~~~~L~ 150 (441) T protein:vir:98 71 SDIFTAVMMIASDLARMPIRVTVNGQINYSDRIVNLLNTRPNPMYNGYIFKLVVFVSALLTSHGYIEITRDKTGEPMNLT 150 (441) T ss_pred HHHHHHHHHHHHhhccCceEEecCCcccccchHHHHHhcccccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEE Confidence 1 134556666555555554322211 12223333321 21 22344556677788899998887775 44 4677 Q ss_pred EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCc Q lcl|NC_012753. 146 FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEE 225 (502) Q Consensus 146 ~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~ 225 (502) .++|+.+-+...+.+.+ ++.++..+. ....+. ..| +- . T Consensus 151 ~i~~~~v~v~~~~~g~~-----~~~~~~~~~----------------~~~~~~-~~~------------~~--------~ 188 (441) T protein:vir:98 151 FRKTSEIELKLDARGRL-----YYFHQRIDS----------------NGNNIE-RNV------------KF--------E 188 (441) T ss_pred EEcCceeEEEECCCCcE-----EEEEEEecc----------------Ccceee-EEE------------cc--------c Confidence 88888887754332221 000000000 000000 000 00 0 Q ss_pred ceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-HhhccceeeechHHhccCCCCCCcc Q lcl|NC_012753. 226 TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPTQMIKTEYDTNGEK 304 (502) Q Consensus 226 ~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~~~l~~~~~~~g~~ 304 (502) -.++|+.+. .+...|+|.+.-+...|+.... ..++... |+.+...-.| |.........+ T Consensus 189 ----------dviHir~~~-----~dg~~G~spi~~~~~~i~~~~a-~~~~~~~~f~ng~~~~gi----l~~~~~~~~~e 248 (441) T protein:vir:98 189 ----------DMLDIKFYS-----LDGINGLSLLDTLSRTIESDNN-GKDFLNNFLRNGTHAGGI----LKMKGVLDNKK 248 (441) T ss_pred ----------cEEEeccCC-----CCCccccCHHHHHHHHHHHHHH-HHHHHHHHHhccCCCcEE----EEeCCCCCCHH Confidence 012333211 1234688888777776654433 3333333 4554332121 22211111100 Q ss_pred cC--ccccccccchhhccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH Q lcl|NC_012753. 305 VT--VKREFETGHNVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE 378 (502) Q Consensus 305 ~~--~~~~~~~~~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte 378 (502) -. ....|. ..+..... .--+++.-++.++.....-++.+..+....+|+...|+||..+|.+..+. +.++ T Consensus 249 ~~~~~~~~~~---~~~~G~~nag~~~vl~~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~-s~~q 324 (441) T protein:vir:98 249 ARDRAREEFH---KSFSGTKQAGKVVVLDESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANM-SITD 324 (441) T ss_pred HHHHHHHHHH---HHhcCccccCcceecCCCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCCCc-cHHH Confidence 00 000010 11110000 00012223556666666677888888889999999999999998654432 2333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPK 458 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~ 458 (502) ....|. .+..-..+.++.+|... +... .....+.++.+.-+-.|..+.++...+++.+|++++ T Consensus 325 ~~~~y~---~tl~P~~~~ie~~ln~~------------L~~~--~~~~~~~fd~~~llr~d~~~~~~~~~~~~~~G~~T~ 387 (441) T protein:vir:98 325 ANLDYL---STLKPYITCVCAELNFK------------FNDE--YVNREFKFDTTEIRVVDEKTQAEIDKINIDSGKMNI 387 (441) T ss_pred HHHHHH---HHHHHHHHHHHHHHHhh------------cccc--ccCceEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 222221 12222222222222221 1111 112345555555577888999999999999999999 Q ss_pred HHHHHhc--CCCCHHH--H---HH---HHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 459 TMAIEKT--LNVTKEQ--A---QE---IYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 459 et~l~~~--~~~~dee--a---~~---el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) -++++.. +++..-+ + .. -++.+.+.+......-+...++|=-+| T Consensus 388 NE~R~~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 388 DEIRQRDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred HHHHHHhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 9976543 2332211 0 00 001111111111111111222222333 No 221 >protein:vir:105641 Length: 516 # NCBI annotation: putative head-tail connector # Family: family:all:481 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425006;genbank:gi:83571754;uniprot:Q2WC46;genbank:GeneID:3837282 Probab=95.53 E-value=0.0019 Score=35.51 Aligned_cols=430 Identities=10% Similarity=0.068 Sum_probs=166.7 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |. .-.+. -.+....++...+. .+..++-.....|+.++.=-.+.+.... +......++--+.+...+ T Consensus 1 ~~------~~~~~----~~~~~~~~l~~r~~-~L~~~R~~~e~~w~e~a~~~lP~~~~~~--~~~~~~~~~~dstg~~a~ 67 (516) T protein:vir:10 1 MK------QSTDL----EYGGKRSKIPKLWE-KFSTKRSSFLDRAKHYSKLTLPYLMNDK--GDNETSQNGWQGVGAQAT 67 (516) T ss_pred CC------chhhH----hhhhHHHHHHHHHH-HHHHhhhHHHHHHHHHHHhhcccccCCC--CCcccccccccchHHHHH Confidence 10 00000 00000001111111 1222333334444444322222121111 111112223234566777 Q ss_pred HHHhhhhhcC--cc-----eEeeCCH-------------HHHHHH-------HHHHhhccHHHHHHHHHHHHhhcCCEEE Q lcl|NC_012753. 81 KKVASLVFNE--QA-----TIRVDNE-------------VADAFI-------NETLKNDKFSKNFERYLESCLALGGLAM 133 (502) Q Consensus 81 ~~~a~~l~~e--p~-----~i~~~d~-------------~~~e~l-------~~~~~~~~f~~~~~~~~~~~~~~G~~~~ 133 (502) +.+|+-|.+- || ++.+++. .+.++| ...+..++|...+.++.......|.++ T Consensus 68 ~~LAa~l~~~ltpp~~~WF~L~~~d~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~- 146 (516) T protein:vir:10 68 NHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCM- 146 (516) T ss_pred HHHHHHHHhhhcCCCCccccccCChhhHhhhhccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEe- Confidence 7777766652 22 1333321 133333 345667899999999999999999976 Q ss_pred EEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEe---------------------eCCCceEEEEEEEEEEeC Q lcl|NC_012753. 134 RPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKT---------------------EGQKVKYYSLIEFHEWNK 192 (502) Q Consensus 134 ~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~---------------------~~~~~~~yt~~E~h~~~~ 192 (502) .|.|+.. .+..+|-.+++ +..|..+....+|.+..... .......||++++ .+ T Consensus 147 -l~~d~~~-~~~~~pl~~y~-v~~d~~G~v~~ivrr~~~~~~~l~e~~~~~~~~~~~~~~~~~~~~~~i~t~v~~---~~ 220 (516) T protein:vir:10 147 -LYKPSKG-AISAIPMHHYV-VNRDTNGDLLDIILLQEKSLRTFDPATRAVVEVGLKGKKCKEDDSIKLYTHAKY---LG 220 (516) T ss_pred -EEecCCC-CeEEEEcCeEE-EeeCCCCCeEEEeeeecccHHHHHHHhhhhhhhhhhhhccCCCCceEEEEEEEe---cC Confidence 4566543 25566666654 45555544434443221100 0112223444433 33 Q ss_pred CeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHH Q lcl|NC_012753. 193 ETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTT 272 (502) Q Consensus 193 ~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~ 272 (502) ..+...|. ..+...+|. +..+ +...-||+.++-+ ...++.||+|--..+.+-+..|+.. T Consensus 221 ~~~~~~~~---~~d~~~~~~-----------~s~~---~~~e~P~~~~Rw~----~~~ge~YGrgp~~~~L~D~k~L~~l 279 (516) T protein:vir:10 221 EGFWELKQ---SADDIPVGK-----------VSKI---KSEKLPFIPLTWK----RSYGEDWGRPLAEDYSGDLFVIQFL 279 (516) T ss_pred CCceEEEE---eeCceeecc-----------cccc---ccccCCeeeeeee----ecCCCCcccchHHHhhHHHHHHHHH Confidence 33332221 111111110 0111 1223344444422 2347789999989999999999866 Q ss_pred HHHHHHHH-hhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCccccceeeec--cccchHHHHHHHH Q lcl|NC_012753. 273 YDEFMWEV-KMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT--TDIRSDDYIKAIN 349 (502) Q Consensus 273 ~S~~~~~~-~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~l~ 349 (502) --....-. ...+....||.+.... .....+. ....+.. . ....+..++ +..+...-...++ T Consensus 280 ~~~~l~~~~~a~~~~~lv~p~g~~~-----~~~l~~~-----~~g~~~~---g---~~~~v~~~q~~~~~d~~~~~~~i~ 343 (516) T protein:vir:10 280 SEAVARGAALMADIKYLIRPGAQTD-----VDHFVNS-----GTGEVVT---G---VEEDIHIVQLGKYADLTPISAVLE 343 (516) T ss_pred HHHHHHHHHHhcCCCcccCcccccc-----hhhhccC-----CCceeec---C---CcccceeeecCcccchHHHHHHHH Confidence 55555433 3444555554332211 1110010 0011110 1 111122221 2112232333344 Q ss_pred HHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc Q lcl|NC_012753. 350 KGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS----IATLVEKSLKELVISILELAKVYNLYTGEIPTM 425 (502) Q Consensus 350 ~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~----~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~ 425 (502) .+.+.|....=++. +....+...|||||....+.+.+..+- ++.+|...| +.-.+ . +..+.-.... T Consensus 344 ~~~~rI~~af~~~~--l~~rd~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~Pl---i~r~~--~---~~~p~~P~~l 413 (516) T protein:vir:10 344 VYTRRIGVVFMMET--MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQSPV---AMWGL--L---EAGDSFTSDL 413 (516) T ss_pred HHHHHHHHHHhhhh--hhccCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHH---HHHHH--H---hhCCCCChhh Confidence 44444433221211 121223346999999877776655544 444444333 22211 1 1111111111 Q ss_pred cceEEEeCCCccCCHHHHHHHHH------HHHh--cCCCCHH-----------HHHHhcCCC------CHHHHHHHHHHH Q lcl|NC_012753. 426 DEVSVDLDDGVFTDRNAEFDYWS------KMVA--AGFAPKT-----------MAIEKTLNV------TKEQAQEIYQKI 480 (502) Q Consensus 426 ~~i~v~f~d~i~~d~~~~~~~~~------~~~~--~Gi~S~e-----------t~l~~~~~~------~deea~~el~ri 480 (502) ..+++. -..+.....+... +.++ +++ ++. ..+....|+ ++||++++.+.- T Consensus 414 v~~~~v----~~i~~L~raq~~~~i~~~~q~i~~~~q~-~p~v~d~id~d~~~~~~a~~~gvp~~~irs~eev~~~r~~~ 488 (516) T protein:vir:10 414 VDPVII----TGIEALGRMAELDKLANFAQYMSLPLQW-PEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMEQEQEAQ 488 (516) T ss_pred cCccee----hhHHHHHHHHHHHHHHHHHHHHHHHhcC-ChHHHhhcCHHHHHHHHHHHhCCChhccCCHHHHHHHHHHH Confidence 122221 1112222221111 1110 111 111 112222232 556665553322 Q ss_pred HH-hhh-cccCCCCCccccCCCCC Q lcl|NC_012753. 481 ND-ETM-VSTDSFRTSEEVDIYGE 502 (502) Q Consensus 481 ~~-E~~-~~~~~~~~~~~~~~~g~ 502 (502) ++ ++. ...+..-..-++.+..| T Consensus 489 ~~~q~~~~~~~~~~~~~~~~~~~~ 512 (516) T protein:vir:10 489 MQAQQAQMLEEGVAKAVPGVIQQE 512 (516) T ss_pred HHHHHHHHHHHHhhhcccchhhhh Confidence 21 111 11222223344455555 No 222 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=95.33 E-value=0.0023 Score=35.06 Aligned_cols=390 Identities=13% Similarity=0.104 Sum_probs=160.1 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCcccc-ccceecchHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTA 79 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~i 79 (502) |-+.. .....+-+++.. .-|..+..+..-.. ...+.... +.-+.+.--..+ T Consensus 1 m~~~~--------------------~~~~~~~~~s~~-----~~w~~~~~~~~~~~---~~~g~~vt~~~al~~~~v~~~ 52 (421) T protein:vir:10 1 MFIPQ--------------------MFEGKKRSVSGG-----GFWEAMLGGVRSSH---SKAGVMITPETALALSAVRAC 52 (421) T ss_pred CCCcc--------------------hhcccccccCcc-----hhhHHHhhhhccCc---ccCCceechHHhhccHHHHHH Confidence 33322 222222222211 11323322211100 00111111 111122222345 Q ss_pred HHHHhhhhhcCcceE-eeC--CH---HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEE Q lcl|NC_012753. 80 SKKVASLVFNEQATI-RVD--NE---VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSF 146 (502) Q Consensus 80 v~~~a~~l~~ep~~i-~~~--d~---~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~ 146 (502) |+..|+-+-.=|+.+ ..+ +. ..+.-+..+|.. | ....-....+...+..|.+|+.+..+. |.+ .+-. T Consensus 53 i~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~~~i~r~~~G~~~~L~~ 132 (421) T protein:vir:10 53 VTLLAESVAQLPVELYRRDKNGGRQRATDHPIYDLIHSQPNKKDTSFEYFEQQQGLLGLEGNCYSIIDRDGKGYPKELIP 132 (421) T ss_pred HHHHHHhhccCceEEEEEcCCCceeecccchHHHHHhhcccCCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEE Confidence 555555555545543 111 11 111123333321 1 233344555677888899988887765 443 5666 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcc Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEET 226 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~ 226 (502) ++|+.+-++. +.++. .+|. + + ..|..+|..+ T Consensus 133 l~~~~v~v~~-~~~g~-----------------~~y~-------------~----~------~~g~~~~~~e-------- 163 (421) T protein:vir:10 133 INPKKVIVLK-GPDGM-----------------PYYE-------------I----P------EIGETLPMRM-------- 163 (421) T ss_pred ecCceEEEEE-CCCce-----------------EEEE-------------E----c------CCCcEEchhh-------- Confidence 6777766542 21111 0110 0 0 0011122110 Q ss_pred eeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHH-HHhhccceeeechHHhccCCCCCCccc Q lcl|NC_012753. 227 VTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMW-EVKMGQRRVAVPTQMIKTEYDTNGEKV 305 (502) Q Consensus 227 ~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~-~~~~~~~~i~v~~~~l~~~~~~~g~~~ 305 (502) .++++.+. .+...|+|.+.-+...|+.... ..++.. -|..+...=.+ |....+..+..- T Consensus 164 ----------iih~~~~~-----~d~~~G~spi~~~~~~i~~~~~-~~~~~~~~f~ng~~~~gi----l~~~~~~~~~~~ 223 (421) T protein:vir:10 164 ----------MHHVKVFS-----LDGYIGSSPIQTNADVLGLNLA-VEEHASAVFRRGATMSGV----IERPKEAPAIKS 223 (421) T ss_pred ----------EEEecCcC-----CCCcccccHHHHHHHHHHHHHH-HHHHHHHHHhcCCCccEE----EEecCccCccCC Confidence 12333321 1234688888877777754433 333333 34543322111 222211111100 Q ss_pred Cccc-ccccc-chhhccccC----CCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHH Q lcl|NC_012753. 306 TVKR-EFETG-HNVYEQFDS----GDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATE 378 (502) Q Consensus 306 ~~~~-~~~~~-~~~~~~~~~----~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAte 378 (502) .... .+... ...+..... .--+.+.-++.++.....-++.+..+...++|+...|+||..++....+. ++.++ T Consensus 224 ~e~~~~~~~~~~~~~~g~~n~~~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~ 303 (421) T protein:vir:10 224 QEKIDQLLAKWTDRYSGINNMFSVALLQEGMSYKQMSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMVQMLAKATNNNIEH 303 (421) T ss_pred HHHHHHHHHHHHHHhcCccccCcceecCCCceEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCcCCccccHHH Confidence 0000 00000 011111100 00022234666666666777888888889999999999999998755432 22222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPK 458 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~ 458 (502) ....+ ++.+|..++..|-...+. .+..........+.++.+.-+..|..+.++...+++.+|+|+. T Consensus 304 ~~~~f-------------~~~tl~P~~~~ie~~ln~-kL~~~~~~~~~~v~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~ 369 (421) T protein:vir:10 304 QGLQF-------------VMYTLLAWLKRHEGALQR-DLLLPSERRDLYIEFNVSGLLRGDQKSRYESYALGRQWGWLSV 369 (421) T ss_pred HHHHH-------------HHHHHHHHHHHHHHHHhh-hccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCH Confidence 21111 222333333322211111 1111111112234444445556788999999999999999999 Q ss_pred HHHHHhcCCCCH-HHHHHHHHHHH---Hhhh----cccCCCCCccccCCCCC Q lcl|NC_012753. 459 TMAIEKTLNVTK-EQAQEIYQKIN---DETM----VSTDSFRTSEEVDIYGE 502 (502) Q Consensus 459 et~l~~~~~~~d-eea~~el~ri~---~E~~----~~~~~~~~~~~~~~~g~ 502 (502) -++++.+ |+.. +..++-+.... .++. .......+.+.-++.+. T Consensus 370 NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~e~d~~~~~ 420 (421) T protein:vir:10 370 NDIRRME-NLPPIAGGDKYLTPLNMVDSAQIIPGDKKPTAQQMAEIDTILSR 420 (421) T ss_pred HHHHHHh-CCCCCCCcceeeeccccccccccccCCCCcccccCccccccccc Confidence 9977653 4322 11111110000 0010 00011112222223333 No 223 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=95.09 E-value=0.0028 Score=34.59 Aligned_cols=384 Identities=10% Similarity=0.039 Sum_probs=154.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhh-------hccccccCCHHHHHHHHHHHH--HhcCCCCccccccCCCcccc-ccc Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSI-------TDHPKIAISPEEYNRIMDNLR--YFAGDFDSVTYRDSNGSQVK-RDF 70 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i-------~~~~~~~~~~~~~~~i~~~~~--~Y~g~~~~~~~~~~~~~~~~-~~~ 70 (502) |+|++.++..-.. ........+.+. ......... .-.+-..+.+ .+.|-...+... .+.... ..- T Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~g~~~~~~~~--~~~~~t~~~~ 75 (409) T protein:vir:83 1 MGFWSNLFGIPSI-PDLPNDNGPVDYNPGDPDMVEFRGPEEE--PEARALPWIRPTAWSGYPESWATP--SWGSAQDKLR 75 (409) T ss_pred CchhhhhcccccC-CCcccccccccccCCCCceeeccCCCcc--hhhhhccccccccccccccccccc--CccccchhhH Confidence 9999998876110 000000111000 000011100 0011111211 111211111110 111111 112 Q ss_pred eecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--ccH--HHHHHHHHHHHhhcCCEEEEEE-EeC-Cce-E Q lcl|NC_012753. 71 NHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKN--DKF--SKNFERYLESCLALGGLAMRPY-IDG-DQI-R 143 (502) Q Consensus 71 ~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~~f--~~~~~~~~~~~~~~G~~~~~~~-~d~-~~~-~ 143 (502) +.+......|+..|+-+-+=|+.+--+++... .+..++.. |.+ ...|.+.+...+.+|.+|+.+. .+. |.+ . T Consensus 76 ~~~~~v~acV~~Ia~~iA~lpl~~~~~~~~~~-~~~~ll~~~PN~~~t~~~f~~~l~~~lllGnay~~~i~r~~~G~~~~ 154 (409) T protein:vir:83 76 TLIDVAWACIDLNASVLSSMPIYRMRNGRIID-SVAWMSNPDPEVYTSWQEFAKQLFWDFQLGEAFVLPMAHGSDGYPIR 154 (409) T ss_pred hhhHHHHHHHHHHHHhhccCceEEeeCCcccc-chhhhcccCCCCCCCHHHHHHHHHHHHhhCCcEEEEEEECCCCcEEE Confidence 22233345666677666555654322222211 12222321 211 1233344444455688877654 454 443 5 Q ss_pred EEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCC Q lcl|NC_012753. 144 VSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDL 223 (502) Q Consensus 144 i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l 223 (502) +..++|+.+-+...+ ++. .+ |++... . . T Consensus 155 L~pl~p~~v~v~~~~-~g~-----------------~~-------------y~~~~~-----~----------------~ 182 (409) T protein:vir:83 155 FRVVPPWLVNVELKK-GAR-----------------RE-------------YRIGGL-----N----------------V 182 (409) T ss_pred EEEECCcceEEEEcC-Cce-----------------EE-------------EEEccc-----c----------------C Confidence 666666654432111 110 00 111100 0 0 Q ss_pred CcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-HhhccceeeechHHhccCCCCCC Q lcl|NC_012753. 224 EETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPTQMIKTEYDTNG 302 (502) Q Consensus 224 ~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~~~l~~~~~~~g 302 (502) +. ..++++.+. ..+..+|+|-+.-+...|+... ...++... |..+... ..+|.....-+. T Consensus 183 ~~----------eiiHir~~~----~~~~~~G~spi~~~~~~i~~~~-a~~~~~~~~f~nga~p----~gil~~~~~ls~ 243 (409) T protein:vir:83 183 TD----------EILHIRYQG----NTADAHGHGPLESAAPRQVVIG-LLQKYVQNLAETGGVP----LYWLGVERRLSE 243 (409) T ss_pred cc----------ceEEeCCCC----CCCCcccccHHHHHHHHHHHHH-HHHHHHHHHHhcCCCc----ceEeecCCCCCH Confidence 00 122333211 1234468888877777776443 33444443 3443322 112332211111 Q ss_pred cccC-cccccccc--chhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc----cc Q lcl|NC_012753. 303 EKVT-VKREFETG--HNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM----KT 375 (502) Q Consensus 303 ~~~~-~~~~~~~~--~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~----~t 375 (502) .... ....+... .+....+...++. ...+.++..-..-++.+..+...++|+...|++|..+|....+. ++ T Consensus 244 e~~~~~~~~~~~~~~~nag~~~il~~g~--~~~~~~~~s~~d~q~le~r~~~~~eIa~~fgVPp~llg~~~~~~~~tysn 321 (409) T protein:vir:83 244 TEAVDLMDRWIESRSKYAGHPALVTGGA--TLNQAKSMSAQDLSLMELTQFNEARIAILLGVPPFLVGLPGATGSLTYSN 321 (409) T ss_pred HHHHHHHHHHHHhhCCccCccceecCCc--ccccccCCCHHHHHHHHHHHhhHHHHHHHhCCCHHHccCCCCcccccccc Confidence 0000 00000000 0000001111110 00111222333446778778888999999999999998644322 12 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCC Q lcl|NC_012753. 376 ATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGF 455 (502) Q Consensus 376 Atei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi 455 (502) .++....+ ...++.-..+.++.+|.. .++. ....+.++++.-+-.|..+.++...+++++|+ T Consensus 322 ~eq~~~~f--~~~tL~P~~~~ie~~l~~------------~Ll~----~~~~~~f~~~~llr~d~~~r~~~~~~~~~~G~ 383 (409) T protein:vir:83 322 IEQLFSFH--DRSSLRPKATAVMAALDR------------WALP----SPQHLELNRDDYTRPSLVERATAYKIMIEAGV 383 (409) T ss_pred HHHHHHHH--HHHHHHHHHHHHHHHHHH------------hhCC----CCcEEEeehhhhhccCHHHHHHHHHHHHhCCC Confidence 22222111 112222233333333332 1111 12246666666677888888888888999998 Q ss_pred CCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 456 APKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 456 ~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) |+.-++++. .++ +..+.-+..+++|| T Consensus 384 lT~NE~R~~-~gl-----------------pp~~ggd~l~~~gv 409 (409) T protein:vir:83 384 MEPNEARAM-ERL-----------------HSEAAAVRLSGGGV 409 (409) T ss_pred cCHHHHHHH-hCC-----------------CCCCCCcccCCCCC Confidence 888775432 233 22223334456666 No 224 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=95.02 E-value=0.0029 Score=34.46 Aligned_cols=436 Identities=13% Similarity=0.114 Sum_probs=189.9 Q ss_pred CChhH------HHHHH-----------HHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCC Q lcl|NC_012753. 1 MGIIQ------TIKNF-----------IKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNG 63 (502) Q Consensus 1 m~~~~------~ik~~-----------i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~ 63 (502) -.+-. ++-.+ +-.+|+... ...+ . ..-.....-|+.++.+... +-+. T Consensus 7 f~~~~~~~~~~~~~s~~~p~~ddg~~~~~~~g~~~~---~~~~----~-~~~~~~~eLI~~YR~ma~~--pEvd------ 70 (558) T protein:vir:10 7 FSIEETQKKSTSIISPVPKNNEDGVDNFISSGFYGQ---YVDI----E-GAYRSEYDLIRRYREMALH--PEAD------ 70 (558) T ss_pred chhhhhhhhccCCccccCCCccccccceeccceeee---eecc----c-chhhhHHHHHHHHHHHhhc--cchh------ Confidence 11111 00000 001111100 0000 0 0012334455666665433 1110 Q ss_pred ccccccceecchHHHHHHHHh-hhhhcCcceEeeCC--------HHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 64 SQVKRDFNHLPIGRTASKKVA-SLVFNEQATIRVDN--------EVADAFINETLKNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 64 ~~~~~~~~~~n~~k~iv~~~a-~~l~~ep~~i~~~d--------~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) +--..||+... .=-..+|+++.+++ +...+..+.+++--+|+++..+.+....+-|..|++ T Consensus 71 ----------~Av~eIVneaiv~d~~~~pV~i~Ld~~~~s~~iK~kI~eEF~~Il~ll~F~~~~~e~fR~WYVDgRiyfH 140 (558) T protein:vir:10 71 ----------GAIEDVVNEAIVSDLYDSPVEVELSNLNASNTLKKKIREEFRYIKEMMDFDKKSHEIFRNWYVDGRVFYL 140 (558) T ss_pred ----------hHHHHhhcceeEecCCCceEEEEecccCcchHHHHHHHHHHHHHHHHhccchhhhHHHhhheeeeEEEEE Confidence 00011222111 11112455566653 234556677777779999999999999999999999 Q ss_pred EEEeCC----c-eEEEEEcCCeEEEEEEc---CCCeEEEEEEEEEEEeeCCCce-EEE-EEEEEEEeCCeEEEEEEEEec Q lcl|NC_012753. 135 PYIDGD----Q-IRVSFVQATVFFPLQAN---TQDVSSAAIVTKSTKTEGQKVK-YYS-LIEFHEWNKETYTISNELYES 204 (502) Q Consensus 135 ~~~d~~----~-~~i~~v~~~~~~Pi~~d---~~~~~~~~~~~~~~~~~~~~~~-~yt-~~E~h~~~~~~~~I~~~l~~~ 204 (502) .++|+. + ..+.+++|..+-+|..- ........ +..+..+. ++. ..|++..+.+.. .+.+ T Consensus 141 Kiid~k~pk~GI~ELr~lDPr~i~~Vr~i~~~~~~~~~~~------~~~~~~~~~~~~~~~eyy~Y~~~~~-----~~~~ 209 (558) T protein:vir:10 141 KVIDTKNPQEGIQDLRYIDPLKIKFIRQEKRKPGNQDPAI------RVRSEQDVVPNPEFEEFYIYTPKVQ-----HPTG 209 (558) T ss_pred EEEeCCCccccceeeeeeCcccceeeeeecccccccccee------eeecccceeeccceeEeeeecCCcc-----cccc Confidence 999853 3 47888999888665421 11111111 11111111 110 012221111100 0000 Q ss_pred CCc-cccCce--eecccc-cc--CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHH- Q lcl|NC_012753. 205 ESK-TIIGQR--VPLSTL-YE--DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFM- 277 (502) Q Consensus 205 ~~~-~~lG~~--v~l~~~-~~--~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~- 277 (502) .++ -..++. +|-+.+ |. ||.. ...+.- +|-|..+......|=-+-+.++ T Consensus 210 ~~~~~~~~~~vkI~~dAI~y~hSGL~d-----~~~~~i-------------------~syLhkAIKp~NQLkmlEDAlVI 265 (558) T protein:vir:10 210 MVGQMGGKNSIKIAKDSITMCTSGLVD-----RNKNRV-------------------LSYLHKAIKALNQLRMIEDSLVI 265 (558) T ss_pred cceeecCCCceeechhheeeeccccee-----cCCCee-------------------eecchHhhHhHHhhHHHHhhHHH Confidence 000 001111 111111 10 1100 001111 2333333322222221111111 Q ss_pred -HHHhhccceee-ech---------HHh----c-----cCCCCCCcccCccccccccchhhccccCCCCccccceeeecc Q lcl|NC_012753. 278 -WEVKMGQRRVA-VPT---------QMI----K-----TEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLTT 337 (502) Q Consensus 278 -~~~~~~~~~i~-v~~---------~~l----~-----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 337 (502) +-.++-..||| |+- .+| . .+-+...++++.++-+..-...|..- --+|+.+.-|+++.. T Consensus 266 YRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLp-RReGgrgTEItTLpG 344 (558) T protein:vir:10 266 YRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLP-RREGGRGTEITTLPG 344 (558) T ss_pred HhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhccc-ccCCCCccceeeccc Confidence 11233344444 211 001 0 01122233333332222222222211 113333344666544 Q ss_pred ccchHHHHHHHHHHHHHHHHhcCCChhhccccccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_012753. 338 DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKS-MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYN 416 (502) Q Consensus 338 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~-~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~ 416 (502) --...+ +.-++.+.+.+....++|.+.++.+++. ..-++||....-....-+.+++..|..-+.++++.-|.+-. T Consensus 345 gqnLge-m~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKg--- 420 (558) T protein:vir:10 345 GQNLGE-LSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKN--- 420 (558) T ss_pred cCCcch-HHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc--- Confidence 322222 2346666778888888887777665432 22345665555555567788888888888888887665432 Q ss_pred ccCCCcccc--cceEEEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHHHhcCCCCHHHHHHHHHHHHHhhh Q lcl|NC_012753. 417 LYTGEIPTM--DEVSVDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAIEKTLNVTKEQAQEIYQKINDETM 485 (502) Q Consensus 417 ~~~~~~~~~--~~i~v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l~~~~~~~deea~~el~ri~~E~~ 485 (502) ++...-+.. ..+.++|...--..+..+++.+..-. ..+ ..|.+++.++....||+|.+++.+.|++|.. T Consensus 421 iit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k 500 (558) T protein:vir:10 421 IVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQ 500 (558) T ss_pred CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHh Confidence 222221111 35778886654444444444433211 112 4699998888889999999999999999865 Q ss_pred ccc---CCCCCccccCCC---CC Q lcl|NC_012753. 486 VST---DSFRTSEEVDIY---GE 502 (502) Q Consensus 486 ~~~---~~~~~~~~~~~~---g~ 502 (502) ..- |...++-.++.. |. T Consensus 501 ~~~~~~p~~~~~~~~~~~~~~~~ 523 (558) T protein:vir:10 501 KGIIPDPSQIDPITGEPLPQEGD 523 (558) T ss_pred CCCCCCccccChhhccccCccCC Confidence 421 222222222222 11 No 225 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=94.89 E-value=0.0032 Score=34.23 Aligned_cols=377 Identities=12% Similarity=0.070 Sum_probs=144.6 Q ss_pred cchhh-hhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH--HHHHHHHhhhhhcCcceEeeC Q lcl|NC_012753. 21 QSLNS-ITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG--RTASKKVASLVFNEQATIRVD 97 (502) Q Consensus 21 ~~l~~-i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~--k~iv~~~a~~l~~ep~~i~~~ 97 (502) |.|.. -.. . .+. .|...+... .+. ....+...... .+..+ -..|+..|+-+-.=|+.+--. T Consensus 1 m~~~~~~~~--~--~~~-------~~~~~~~~~--~~~-~~~~g~~~~~~--Al~~~~V~~cv~~ia~~iA~lp~~~~~~ 64 (417) T protein:vir:38 1 MKLFRGLAT--E--VDP-------HWADHLLDS--GVI-PSFRGGYLGIS--ALRNSDVLTAVSIVSGDVSRFPLVITDS 64 (417) T ss_pred Ccccccccc--C--CCc-------cchhhhccc--ccc-cccCCceechh--hcccHHHHHHHHHHHHhhccCeeEEEEc Confidence 11110 000 0 000 011111100 000 00011111111 11111 235566666665556554221 Q ss_pred C--HH-HHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC--Cce-EEEEEcCCeEEEEEEcCCCeEEEE Q lcl|NC_012753. 98 N--EV-ADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG--DQI-RVSFVQATVFFPLQANTQDVSSAA 166 (502) Q Consensus 98 d--~~-~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~-~i~~v~~~~~~Pi~~d~~~~~~~~ 166 (502) + .. ....+..+|.. | ....-....+..++..|.+|+.+..|. +.+ .+.+++|+++-+...+.+.+ T Consensus 65 ~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gn~y~~i~r~~~g~~~~~l~~l~p~~v~v~~~~~~~~---- 140 (417) T protein:vir:38 65 STDEVIDLANIEYLMNTKVNKRLSAYQWKFPMMVNAILTGNAYSRIVRDPITNEPAMFEFYAPSQTQVDTSDPDNI---- 140 (417) T ss_pred CCcceeccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCCEEEEEEEeCCceEEEEEcCCCeE---- Confidence 1 11 11123333321 2 222334445666778899998887764 333 45667787776543322211 Q ss_pred EEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccc Q lcl|NC_012753. 167 IVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMN 246 (502) Q Consensus 167 ~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n 246 (502) +|+ +...++.... .+ +. .+ .++|+.+- T Consensus 141 --------------~y~----~~~~~~~~~~---~~------------~~--------~d----------viH~r~~~-- 167 (417) T protein:vir:38 141 --------------IYR----FTPYNSSMQK---VC------------GF--------ED----------VIHWKFFS-- 167 (417) T ss_pred --------------EEE----EEEcCCcEEE---Ee------------cC--------cc----------eEEecCCC-- Confidence 110 0000111000 00 00 00 13344321 Q ss_pred cccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCC Q lcl|NC_012753. 247 NKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDM 326 (502) Q Consensus 247 ~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 326 (502) .+...|+|.+.-+...|.....+-.-..+-|+.+...=.| +.....- .+... ..-...+........ T Consensus 168 ---~d~~~G~s~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~i----l~~~~~l-----~~e~~-~~~~~~~~~~~~g~n 234 (417) T protein:vir:38 168 ---YDTIMGRSPLLSLGDEIGLQESGVSTLQKFFKSGLKGSII----KAKESRL-----SAEAR-QKIREDFERAQAGAD 234 (417) T ss_pred ---CCCccccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEE----EEeCCCC-----CHHHH-HHHHHHHHHHhcccc Confidence 1234688888777776655444433333344544332112 2221111 10000 000011111111100 Q ss_pred -------ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 327 -------DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEK 399 (502) Q Consensus 327 -------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~ 399 (502) +.+.-++.++.....-++++..+...++|+...|+||..+|.... .+++++.... .++. T Consensus 235 ~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~-~s~~e~~~~~-------------~~~~ 300 (417) T protein:vir:38 235 AGSPIIVDATMDYQPLEVDTNVLNLINSNNYSTAQIAKALRVPAYRLAQNSP-NQSVKQLADD-------------YIRN 300 (417) T ss_pred cCCceeccCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHhCCCCc-chhHHHHHHH-------------HHHH Confidence 112234445545455567787888889999999999999984332 2233332211 1233 Q ss_pred HHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHH- Q lcl|NC_012753. 400 SLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEI- 476 (502) Q Consensus 400 ~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~e- 476 (502) .|..+++.|..-.+. .++... ......|.|+..- .+. ..+....+++.+|+++.-++++.. +|+.+.++++- T Consensus 301 tl~P~~~~ie~~l~~-~Ll~~~--~~~~~~~~fd~~~-l~~-~~~~~~~~~~~~G~~T~NE~R~~~gl~pi~~g~~d~~~ 375 (417) T protein:vir:38 301 DLPFYFEPITSEFEL-KLLDDA--QRHQYCIGFDTKS-VNG-LPIADVNTAVNGGLWTGNEGRAELGKKPLKDPNMDRIQ 375 (417) T ss_pred HHHHHHHHHHHHHHh-hhcChh--hcccceEEechhh-hhH-HHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCCeee Confidence 344433333222111 111111 1123456776432 222 224456678889999999976653 33433222111 Q ss_pred -------HHHHHHhhhcccCC---CCCccccCC--CCC Q lcl|NC_012753. 477 -------YQKINDETMVSTDS---FRTSEEVDI--YGE 502 (502) Q Consensus 477 -------l~ri~~E~~~~~~~---~~~~~~~~~--~g~ 502 (502) +....+++...... -++...++- -|+ T Consensus 376 ~~~n~~~~d~~~~~~~~~~~~~kgg~~~~~~~~~~~~~ 413 (417) T protein:vir:38 376 STLNTVFLDQKEAYQAEHAAELKGGDTNAKGNQNGSGT 413 (417) T ss_pred ecccccccccccccccccccccCCCCCCCCCCCcCCCC Confidence 11111111111000 011111111 111 No 226 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=94.76 E-value=0.00061 Score=38.21 Aligned_cols=181 Identities=11% Similarity=-0.004 Sum_probs=82.5 Q ss_pred cCCcchhhhHHHHHH----HHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccccccchhhccccCCCCcc Q lcl|NC_012753. 253 PLGLSIFDNAKTTMD----FINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDK 328 (502) Q Consensus 253 p~G~S~~~~~~~lid----~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (502) -|.+. ++..+++ ++.+++..+ +..+. .+..+..++.+ T Consensus 1 V~k~~---~l~~~~~~~~~~~~~r~~~~-~~~~~----------------------------------~~~~~~ld~~~- 41 (201) T protein:vir:10 1 MWKAK---GLADLCDDSDGAARLRLAQV-DNNSG----------------------------------VGQAIGIDADS- 41 (201) T ss_pred Cccch---HHHHHhcCChHHHHHHHHHH-HHhhh----------------------------------hhhhheeecCC- Confidence 11111 2222221 111111111 00010 00001111111 Q ss_pred ccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-cccccccc-ccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHH Q lcl|NC_012753. 329 GIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSM-KTATEVVSEQSDTYQMRNSI-ATLVEKSLKELV 405 (502) Q Consensus 329 ~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~-~tAtei~~~~~~l~~~~~~~-~~~~~~~l~~l~ 405 (502) .-++.++.++ .-....+......++..+|+|... ||...+|. +|+..-...|.+ .+..+ ++.++..|++|+ T Consensus 42 -e~~e~~~~~l--sGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~d~~nyyd---~i~~~Qe~~l~p~le~l~ 115 (201) T protein:vir:10 42 -EEYNVLNSDI--GGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNTALETFYG---YVDRKRKAELLPLLEFLL 115 (201) T ss_pred -cceeeeecCc--CChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchhHHHHHHH---HHHHHHHHHHHHHHHHHH Confidence 1133333322 335556777788888899998554 77777775 355544333333 33333 366788888876 Q ss_pred HHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHH-------HHHHHhcCCCCHHHHHHhc-----CC-CCHHH Q lcl|NC_012753. 406 ISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDY-------WSKMVAAGFAPKTMAIEKT-----LN-VTKEQ 472 (502) Q Consensus 406 ~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~-------~~~~~~~Gi~S~et~l~~~-----~~-~~dee 472 (502) .++. ...+++|.|+.-...++.+.++. +.+++.+|++|..++...+ .+ ..++. T Consensus 116 ~~~~--------------~~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~ 181 (201) T protein:vir:10 116 PFIV--------------TEQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAAGIIDADEARDTLRAISTEVKIGEGS 181 (201) T ss_pred Hhhc--------------CCCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHhcCCcCCCCCCC Confidence 6432 12468999999877777665544 4566677888877765432 11 22222 Q ss_pred HHHHHHHHHHhhhcccCCCCCcccc Q lcl|NC_012753. 473 AQEIYQKINDETMVSTDSFRTSEEV 497 (502) Q Consensus 473 a~~el~ri~~E~~~~~~~~~~~~~~ 497 (502) ++.++..-..+.+.. .|.+- T Consensus 182 ~~~~~~~~e~~dp~~-----~~~~~ 201 (201) T protein:vir:10 182 IQTEVVINESEDPLD-----VSANN 201 (201) T ss_pred CCccccccccCCCCC-----CCCCC Confidence 333322211111111 11111 No 227 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=94.60 E-value=0.004 Score=33.74 Aligned_cols=410 Identities=11% Similarity=0.088 Sum_probs=163.8 Q ss_pred CChh-HHHHHHHHHHhh-cccccc---hhhh-hccccccCCHHHHHHHHHHHHHhcCCCCccccccCC-Cccccccceec Q lcl|NC_012753. 1 MGII-QTIKNFIKRSNY-VITNQS---LNSI-TDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSN-GSQVKRDFNHL 73 (502) Q Consensus 1 m~~~-~~ik~~i~~~~~-~~~~~~---l~~i-~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~-~~~~~~~~~~~ 73 (502) |+-| +.-.+-+++-.- ...+-. +.++ ..|+---+++.++.+|-.... .|+ ...+.... ....++.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~--~gd--~~~~~~L~~~m~e~D~~--- 73 (528) T protein:vir:10 1 MAAIVDIYGNPLRTQQLRKQQTAHLAGLAKEFANHPAKGLTPAKLAHILIEAE--QGH--LQAQAELFMDMEERDAH--- 73 (528) T ss_pred CCeeECCCCCccccccccchhhhhhhhhhhhhcccCCCCCCHHHHHHHHHhhh--CCC--HHHHHHHHHHHHhhChH--- Confidence 3322 111111110000 000000 0011 122222455666655544322 111 10000000 00000111 Q ss_pred chHHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCCEEEEEEEeC--Cce-- Q lcl|NC_012753. 74 PIGRTASKKVASLVFNEQATIRVD------NEVADAFINETLKND-KFSKNFERYLESCLALGGLAMRPYIDG--DQI-- 142 (502) Q Consensus 74 n~~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~~~~~-~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~-- 142 (502) + .-.+.+-..-+++.+..|... ++...+++++++.+- .|...+..++. |..+|-+++-+.|.. +.. T Consensus 74 -i-~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~i~~~ld-a~~~G~s~~Ei~w~~~~g~~~~ 150 (528) T protein:vir:10 74 -L-FAEMSKRKRAVLGLDWTIEPPRNASAAEKADAEYLHELLLDLEGIEDLMLDCMD-GVGHGYSAIELDWSLQGREWLP 150 (528) T ss_pred -H-HHHHHHHHHHHhcCCceEecCCCCCHHHHHHHHHHHHHHhCCccHHHHHHHHHh-hhhhcceeEEEEEeecCCceeE Confidence 1 233444455566777777542 234567788888663 47777765554 888998888776642 322 Q ss_pred -EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccccc Q lcl|NC_012753. 143 -RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYE 221 (502) Q Consensus 143 -~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~ 221 (502) ++.++++..|.. ...++. .+++. +...-|.++| T Consensus 151 ~~~~~r~~~~f~~--~~~~~~-------------------------------~l~~~-------~~~~~g~~l~------ 184 (528) T protein:vir:10 151 QAFDHRPQSWFQL--NPDDQD-------------------------------ELRLR-------DNSIAGEVLQ------ 184 (528) T ss_pred EEeeeecccceee--ccCCCc-------------------------------EEecc-------CCCCCceeec------ Confidence 333334322110 000000 01110 0000121111 Q ss_pred CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCCC Q lcl|NC_012753. 222 DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYDT 300 (502) Q Consensus 222 ~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~~ 300 (502) +. -|+.++. ....++|+|.|.+..+.-..---+..+..|+.=++- |... .+ .+ ++.+ T Consensus 185 ---~~---------k~iv~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~-~i----gk-y~~~ 242 (528) T protein:vir:10 185 ---PF---------GWIMHKP----RSRSGYVARSGLFRVLAWPYLFKHYSTADLAEMLEIYGLPI-RL----GK-YPPG 242 (528) T ss_pred ---CC---------CeEEEee----cCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCCe-EE----Ee-cCCC Confidence 00 0222221 123456778888777665554444444444443332 3322 22 11 2211 Q ss_pred CCcccCccccccccchhhccccCCCC---ccccceeeecc-ccchHHHHHHHHHHHHHHHHhcCCChhhc-ccccc---c Q lcl|NC_012753. 301 NGEKVTVKREFETGHNVYEQFDSGDM---DKGIGITDLTT-DIRSDDYIKAINKGLSLFEMQLGVSTGMF-SFDGK---S 372 (502) Q Consensus 301 ~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~-~~~~~---~ 372 (502) .... ....+ .+....+..+.+ ..+.-|+.++. .-..+.|...++.+-++|+..+ ++ +++ ++.++ | T Consensus 243 a~~~--ek~~L---~~al~~i~~~~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LG-qtlTs~~~~g~~g 315 (528) T protein:vir:10 243 TPDE--EKVTL---LRAVTGLGHAAAGIIPESMSIDFQEASKGSAEPFMAMMRWCDDSMSKAI-LG-GTLTSQTSESGGG 315 (528) T ss_pred CCHH--HHHHH---HHHHHHHhhCcEEEecCCceeEEeecCCCChhHHHHHHHHHHHHHHHHH-hh-hhhhccccccccc Confidence 1110 00000 011111111100 01112333332 1233446666666666665554 23 222 22111 1 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_012753. 373 MKTATEVVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMV 451 (502) Q Consensus 373 ~~tAtei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~ 451 (502) ....-++. ..-....+..-.+.+...|+ +|++.++.+ | +++......-+.+.|+..-+.|..+.++.+.+++ T Consensus 316 S~Alg~vh--~~v~~di~~aDa~~i~~tln~~li~~l~~~----N-~~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~ 388 (528) T protein:vir:10 316 AYALGQVH--NEVRHDLLAADARQLAATLSRDLLWPLLVL----N-RSGNLDARRAPRLVFDLKDRADLAAMATSLPPLV 388 (528) T ss_pred hhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----C-CCCCCCccccceEEecCCCcccHHHHHHHHHHHH Confidence 11111221 22233444555566777775 577776654 2 2222223334678888888899889999999999 Q ss_pred hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 452 AAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 452 ~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ..|+--.+.++.+.+|++..+-.+++..-+...+........+.....+.+ T Consensus 389 ~~G~~i~~~~i~e~~gip~p~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 439 (528) T protein:vir:10 389 KLGVQVPVNWVQEQLGIPLPANGEAVLGDQAGAGIAQLSRRPGPRIAALAQ 439 (528) T ss_pred hCCCCCCHHHHHHHhCCCCCCCCcccccCCCcccccccCcccccccccccc Confidence 999833444578878875432112221111111111111111111111112 No 228 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=94.48 E-value=0.0043 Score=33.55 Aligned_cols=397 Identities=12% Similarity=0.117 Sum_probs=157.7 Q ss_pred hhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHH-HHHhcCCCCccccccCCCcccc-ccceecchHHHHH Q lcl|NC_012753. 3 IIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDN-LRYFAGDFDSVTYRDSNGSQVK-RDFNHLPIGRTAS 80 (502) Q Consensus 3 ~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~-~~~Y~g~~~~~~~~~~~~~~~~-~~~~~~n~~k~iv 80 (502) +|+ ++|+.- . ..+....++..-+..+-.+ ..-|.|- ...+.... +.-+.+.--...| T Consensus 1 ~~~----~~~~~~-----~-----~~~~~~~~~~~~~~~~~~~~~~~~~g~-------~~~g~~v~~~~al~~~~V~~~v 59 (454) T protein:vir:93 1 MWN----LLRRTR-----K-----NQKSGRDVREAGWTSLFQAVAEPFAGA-------WQQGVKADPEAVLSFHAVFACI 59 (454) T ss_pred CCC----ccccCc-----c-----cccccccccchhhhhhhhhhhhhhcch-------hhcCcccChHHhhccHHHHHHH Confidence 121 221100 0 0000111111111111111 1112221 00111100 0111111112345 Q ss_pred HHHhhhhhcCcceEe-eC-C---H-HHHHHHHHHHhh-c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIR-VD-N---E-VADAFINETLKN-D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQ 148 (502) Q Consensus 81 ~~~a~~l~~ep~~i~-~~-d---~-~~~e~l~~~~~~-~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~ 148 (502) +..|+-+-+=|+.+- .+ + + .....+..++.. | ....-+..++...+..|.+|+.+-.+. |.+ .+..++ T Consensus 60 ~~Ia~~iA~lp~~~~~~~~~g~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gna~~~i~r~~~G~~~~L~~i~ 139 (454) T protein:vir:93 60 SLISQDIAKMRLRLMQTDAQGIRRETRRGDIARLCRRPNAQQNRIQFFELWLNAKLRHGNTVVLKIRNARGQIKELRILD 139 (454) T ss_pred HHHHHhhccCceEEEEeccCCccchhhhHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEECCCCcEEEEEEEc Confidence 555555555565541 11 1 1 111122333322 2 223445556667888999999888875 444 677778 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++=++..+++.+ + +++.. .. ..+ .|..+.+ . .. T Consensus 140 ~~~v~v~~~~~g~~----~-y~~~~-~~----------------~~~--------------~~~~~~~---~---~~--- 174 (454) T protein:vir:93 140 WNRVEPLVADDGEV----F-YRITP-DR----------------NCG--------------ITEAVTV---P---AR--- 174 (454) T ss_pred CcceEEEEcCCCcE----E-EEEEe-cc----------------ccc--------------cceeEEe---c---Cc--- Confidence 88776653332211 0 11100 00 000 0000000 0 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) -.++++.+. ..+..+|+|.+..+...+.....+-....+-|..+...-.+ |+....-+.... T Consensus 175 -------eViH~k~~~----~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi----l~~~~~l~~e~~--- 236 (454) T protein:vir:93 175 -------EVIHDRFNC----FFHPLIGLPPVYAAGLAATQGHHIQENSTSFFRNGGRPSGV----IEIPGSITEENA--- 236 (454) T ss_pred -------ceEEeccCC----CCCCceeccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE----EecCCCCCHHHH--- Confidence 023343221 12345688888877777764444433333334543332111 222111000000 Q ss_pred cccccc-chhhccccCCC---CccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccc-cHHHHHHHH Q lcl|NC_012753. 309 REFETG-HNVYEQFDSGD---MDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK-TATEVVSEQ 383 (502) Q Consensus 309 ~~~~~~-~~~~~~~~~~~---~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~-tAtei~~~~ 383 (502) ..+... ...+..-+.+. -+.+.-++.++.....-++.+..+....+|+...|+|+..+|...++.. ++++....+ T Consensus 237 ~~~~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~f 316 (454) T protein:vir:93 237 KKLKSNWDSGYTGENAGKTAILSNGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQY 316 (454) T ss_pred HHHHHHHHHHhcccccCCceeccCCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHHH Confidence 000000 01111000000 0122235555555556677888888889999999999999987554322 222221111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) ...++.=....++..|... +..+ ....+.+++++-+..|..+.++...+++.+|+|+.-+++. T Consensus 317 --~~~~l~P~~~~ie~~ln~~------------L~~~---~~~~~~f~~~~ll~~D~~~r~~~~~~~~~~G~~T~NE~R~ 379 (454) T protein:vir:93 317 --YSQCLQTLIESIELLLDEA------------LETG---ENESTEFDVTTLLRMDSERRMKTLGDAVKNTLLTPNEARK 379 (454) T ss_pred --HHHHHHHHHHHHHHHHHHh------------hcCC---CCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 1122222222233222221 1111 1234666666667788999999999999999999988665 Q ss_pred hcCCCCH----HHH--HH---HHHHHHHhhhc---------ccCCCCCccccCCCCC Q lcl|NC_012753. 464 KTLNVTK----EQA--QE---IYQKINDETMV---------STDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~~~~~d----eea--~~---el~ri~~E~~~---------~~~~~~~~~~~~~~g~ 502 (502) . .|+.. |+. .. -+..+.+.+.. ......+....| +|+ T Consensus 380 ~-~gl~pi~ggD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d-~~~ 434 (454) T protein:vir:93 380 R-ENLPPLAGGDALYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASD-GNK 434 (454) T ss_pred H-hCCCCCCCCCeeeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCC-CCC Confidence 4 23322 110 00 01111111100 011111112122 222 No 229 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=94.38 E-value=0.0046 Score=33.41 Aligned_cols=424 Identities=11% Similarity=0.055 Sum_probs=162.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhh---c----cccccCCHHHHHHHHHHHHHhcCC----CCccccccCCCccc--- Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSIT---D----HPKIAISPEEYNRIMDNLRYFAGD----FDSVTYRDSNGSQV--- 66 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~---~----~~~~~~~~~~~~~i~~~~~~Y~g~----~~~~~~~~~~~~~~--- 66 (502) |-|..-+++-+--... .++... + +.-.++.+-+.+ . ++.+.|- .....+........ T Consensus 1 ~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~g~~~~~~~~~~~~ 71 (535) T protein:vir:10 1 MAILKDLRNAFSLSNK-----KSTSYIELGDYDKDIVNKAIRPGRAS-A---RDTVDGIDIADGNVAGQYSVASISDVLS 71 (535) T ss_pred ChhhHHHHHHHHhhhh-----hhhhhHHHhhhhHHHHHhhhhhhhhh-h---hccccccccccCCcccccccCccccccC Confidence 7776666655422111 111110 0 001112222111 1 1222221 00000111000000 Q ss_pred ----cccceecchHHHHHH----HHhhhhh---------cCcceEe-e----CC--HHHHHHHHHHHhh--cc------H Q lcl|NC_012753. 67 ----KRDFNHLPIGRTASK----KVASLVF---------NEQATIR-V----DN--EVADAFINETLKN--DK------F 114 (502) Q Consensus 67 ----~~~~~~~n~~k~iv~----~~a~~l~---------~ep~~i~-~----~d--~~~~e~l~~~~~~--~~------f 114 (502) .+.....++...+++ ..|.|-+ +=|+.+. . +. ......|..+|.. |. | T Consensus 72 ~~~l~~~~~~~~~~~~~i~t~~~~va~~~~i~~~s~~~~~~~i~l~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~~~~~~ 151 (535) T protein:vir:10 72 TKKLLKAYADNDIVQAIIRTRTNQVLTYSNPSRYNRNGVGFKVELKDATKVMSKAQIKRAHEIEDFIYNTGSEYYEWRDT 151 (535) T ss_pred HHHHHHHhccChhHHHHHHHHHHHHHHHHHHHHHhcccCcceeEEEeccCCCcchhhhhhhHHHHHHHhCCCCCCChhHH Confidence 000011122233333 3332221 2222221 1 11 1122345555531 22 2 Q ss_pred H-HHHHHHHHHHhhcCC-EEEEEEEeC-Cce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEE Q lcl|NC_012753. 115 S-KNFERYLESCLALGG-LAMRPYIDG-DQI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEW 190 (502) Q Consensus 115 ~-~~~~~~~~~~~~~G~-~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~ 190 (502) . ..+..++..++.+|+ +|+.+..+. |++ .+..++|..+.+.....+... ..+| +.. T Consensus 152 ~~~~~~~lv~d~l~~~g~ay~~i~r~~~G~~~~L~~l~p~~V~v~~d~~~~~~---------------~~~~-----~~~ 211 (535) T protein:vir:10 152 FPRLLTKIINDMYVQDQINIERIFKNDSNELDHFNAVDASKVVISYSPRSKDQ---------------PRKF-----EQF 211 (535) T ss_pred HHHHHHHHHHHHHhhCCceEEEEEECCCCcEEEEEEeCCceeEEEEcCccccC---------------ceEE-----EEE Confidence 2 344556666777775 577776664 444 577788888776432211110 0011 000 Q ss_pred eCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHH Q lcl|NC_012753. 191 NKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFIN 270 (502) Q Consensus 191 ~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld 270 (502) .++.... .++ .. + .++|+.+- ..-.....+|+|.+..+...|.... T Consensus 212 ~~~~~~~---------------~~~--------~~-------e---iih~~~~~-~~~~~~~~~G~Spi~~~~~~i~~~~ 257 (535) T protein:vir:10 212 VSETKSV---------------KFS--------ER-------N---LTFINYWN-LSDTDRRGYGYSPVEASIPLIRAIY 257 (535) T ss_pred ecCceeE---------------EEC--------cc-------c---EEEEeccC-CCCcccccccccHHHHHHHHHHHHH Confidence 0000000 000 00 0 12333211 1111234579999888887776665 Q ss_pred HHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc--cccccc-hhhccc------cCCCCccccceeeeccccch Q lcl|NC_012753. 271 TTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR--EFETGH-NVYEQF------DSGDMDKGIGITDLTTDIRS 341 (502) Q Consensus 271 ~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~--~~~~~~-~~~~~~------~~~~~~~~~~i~~~~~~ir~ 341 (502) .+-.-..+-|..+...-.| |...... +....+.. .+.... ..+... ..-. +.+.-++.++..... T Consensus 258 aa~~~~~~~f~ng~~p~gi----L~~~~~~-~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~-~~g~~~~~l~~~~~D 331 (535) T protein:vir:10 258 DTEQFNARFFSQGGTTRGI----LVIDQDG-DAQANQMMLAGIRRQWTSQGSGLGGAWKIPILA-AKDAKFVNMTQNSRD 331 (535) T ss_pred HHHHHHHHHHhccCCccEE----EEecCCC-CcccCHHHHHHHHHHHHHHhcCccccccccccc-CCCceEEecCCChhH Confidence 4444334445655432111 2221111 01111000 000000 001100 0000 112234455555667 Q ss_pred HHHHHHHHHHHHHHHHhcCCChhhccccccccccHH--HHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_012753. 342 DDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT--EVVSEQSDTYQMRNSI-ATLVEKSLKELVISILELAKVYNLY 418 (502) Q Consensus 342 e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt--ei~~~~~~l~~~~~~~-~~~~~~~l~~l~~~il~~~~~~~~~ 418 (502) .++.+..+...++|+...|++|..+|+...+.-+.. .-...+.. ++... +..++.+|..++..|-...+. .++ T Consensus 332 ~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s---~~E~~~~~~~~~~L~P~l~~ie~~ln~-~Ll 407 (535) T protein:vir:10 332 MEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGS---TAKAKLESSKDKGLTPLLSFIEQVIND-KIM 407 (535) T ss_pred HHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhh---hHHHHHHHHHHHHHHHHHHHHHHHHhh-hcc Confidence 788888889999999999999999998654332111 11111111 11122 222344555555444333222 122 Q ss_pred CCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH-HHHHHHHHHHHHh-----hh---cccC Q lcl|NC_012753. 419 TGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK-EQAQEIYQKINDE-----TM---VSTD 489 (502) Q Consensus 419 ~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d-eea~~el~ri~~E-----~~---~~~~ 489 (502) .. ....+.+.|+.....|..+..+.. ++..+|.|+.-++++. .|+.. +.-+.-+-.+... +. ...+ T Consensus 408 ~~---~~~~~~f~f~~l~~~d~~~r~~~~-~~~~~g~lT~NE~R~~-~gl~piegGD~~~~~~~~~~~~~~~~~~~~~~p 482 (535) T protein:vir:10 408 RY---VDTDYRFSFTLGDAQDKLQEEQVW-KLKLANGYFINEYRKD-HGLKTVDGLDVPGFIGSAENFINATGFGQPNVP 482 (535) T ss_pred cc---cCCeEEEEeccccccCHHHHHHHH-HHHHcCCCCHHHHHHH-hCCCCCCCccccccccchhhcccccccccccCC Confidence 11 123577888887888877665544 4555777899886654 34322 0000000000000 00 0000 Q ss_pred CCCCccccC-------------CC---CC Q lcl|NC_012753. 490 SFRTSEEVD-------------IY---GE 502 (502) Q Consensus 490 ~~~~~~~~~-------------~~---g~ 502 (502) ....+.+.. .- |. T Consensus 483 ~~~~~~~~~~~~~~~q~~~~~~~~~~~g~ 511 (535) T protein:vir:10 483 DSSDDSGSTLGERERQERIQHSKDYEKGK 511 (535) T ss_pred CCCCCccccCCccccCcccccccccccCC Confidence 000001111 10 11 No 230 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=94.01 E-value=0.0057 Score=32.90 Aligned_cols=407 Identities=10% Similarity=0.081 Sum_probs=166.9 Q ss_pred CChhHHHHHHHH---HHhhcccccchhh-h------hccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccc Q lcl|NC_012753. 1 MGIIQTIKNFIK---RSNYVITNQSLNS-I------TDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDF 70 (502) Q Consensus 1 m~~~~~ik~~i~---~~~~~~~~~~l~~-i------~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~ 70 (502) -.+.+.+-.+-. ..|..+...++.. . +..+-.. -.....-|+.++.++.. T Consensus 17 ~~~~~~~~~~~~p~~~dG~s~i~~~~~~~~~~~~~~~~~~gg~-~~n~~eLI~~YR~ma~~------------------- 76 (533) T protein:vir:58 17 TNFLSPMYGMGAPHGAGGSSMIPINMYHPFATAGYASRFYGGI-EFNRFFLYDMYDRMDYT------------------- 76 (533) T ss_pred HHhhchhhcccCccCCCCCccccCCCCcchhhhhhhhhhhccc-cccHHHHHHHHHHhhcc------------------- Confidence 111111111101 1111111111000 0 0000000 00112233333333211 Q ss_pred eecchHHHHHHHHhh-----hhhcCcceEeeCCHH----HHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-- Q lcl|NC_012753. 71 NHLPIGRTASKKVAS-----LVFNEQATIRVDNEV----ADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-- 139 (502) Q Consensus 71 ~~~n~~k~iv~~~a~-----~l~~ep~~i~~~d~~----~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-- 139 (502) ++-.--.|+..++ --...|+.+.+++.+ .-+++.+++ +|+++..+.+....+.|..|++.-.++ T Consensus 77 --~pEVd~AideIvneaiv~d~~~~pV~v~l~~~e~s~~iK~kI~~ll---df~~~~~~~fR~WYVDGriy~Hkiik~~k 151 (533) T protein:vir:58 77 --DPLISTVLDIIADECTIPNENGNIVDVVTKDIELAKAILSYLDYVI---NIEKNAYPIIRNMIKYGDMFLHILEKGSD 151 (533) T ss_pred --CcchhhHHHhhhceeeEecCCCceeEeecccccccHHHHHHHHHHh---cchhhhhHHHHhhhhcceeEEEeccCCcc Confidence 0111111222111 112356666666533 334554444 699999999999999999999986542 Q ss_pred Cce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc Q lcl|NC_012753. 140 DQI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST 218 (502) Q Consensus 140 ~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~ 218 (502) ++| .+.+++|-.+=+++.- ..+ .|++ +-+..+........+..+|-.+ T Consensus 152 ~GI~elr~lDPr~i~~vr~~----------------~t~-------~eyy--------vy~~~~~~~~s~~~~~kI~~da 200 (533) T protein:vir:58 152 GTIEKFQVVSPYIFSKRYNP----------------ETD-------TWYY--------VITDVYRNVVSGYFNEDIPEED 200 (533) T ss_pred cchhhheecCCeeeEEEEee----------------ccc-------eEEE--------eecccccccccCccccccchhh Confidence 344 7888888776554210 000 0111 1111111111111111222111 Q ss_pred cccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh---hccceee-ech--- Q lcl|NC_012753. 219 LYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK---MGQRRVA-VPT--- 291 (502) Q Consensus 219 ~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~---~~~~~i~-v~~--- 291 (502) .+|+.-. ..+..++.++|-|..+......|=.+-+.++ -++ .-..||| |.- T Consensus 201 ------------------I~y~~SG---l~d~~~~~iisyLhkAiKp~NQLkmiEDAlV-IYRisRAPeRRvFYIDVGNl 258 (533) T protein:vir:58 201 ------------------VIHFSHK---IDTNFFPYGRSYLESARAIWNQLRLMEDALM-LYRVVRSVDRRVFYVDVGNV 258 (533) T ss_pred ------------------eeeeeec---cccCCCCceehhhhHHHHHHHHHHHHHHHHH-HHhhcCChhheEEEEeecCC Confidence 1111111 1233456677777776544444433222222 223 3333444 211 Q ss_pred ------HHh----c-----cCCCCCCcccCcccccc---ccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHH Q lcl|NC_012753. 292 ------QMI----K-----TEYDTNGEKVTVKREFE---TGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLS 353 (502) Q Consensus 292 ------~~l----~-----~~~~~~g~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~ 353 (502) .++ . .+-+...+++...+-+. .-...|.. .--+|+.+.-|+++...- . .-++-++.+.+ T Consensus 259 pk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWL-pRReGgrgTEI~TLpGg~-l-gemeDV~YF~k 335 (533) T protein:vir:58 259 PPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFI-PRRGDRRAVEIDILQGSK-V-DLAEDVEYMLN 335 (533) T ss_pred CccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhcc-cccCCCccceeeecCCCC-C-CcHHHHHHHHH Confidence 000 0 00111222221111111 00011110 011233334466665432 2 23455777788 Q ss_pred HHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeC Q lcl|NC_012753. 354 LFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLD 433 (502) Q Consensus 354 ~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~ 433 (502) .+....++|...++.+++.. -++||....-....-+.+++..|. +++.-.|.+ . ++....+..++|. T Consensus 336 kLy~ALnVP~sRl~~e~~fg-r~~eItRDEiKF~KFI~rLR~rF~----~ll~~qLil-------k-~iit~eew~~~f~ 402 (533) T protein:vir:58 336 RLISALKVPKAFIGYEGDVN-AKNTLATQDIKFNNTIKRIQGFFV----EELERMVRM-------N-KEFADQDFRLVMN 402 (533) T ss_pred HHHHHhCCCeeecCCCCCCc-cchhhhHHHHHHHHHHHHHHHHHH----HHHhccccc-------c-cCcchhheeeeee Confidence 88888899888887665432 244554333223334444444444 333322321 1 2334445677776 Q ss_pred CCccCCHHHHHHHHHHH---H--hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCC----CccccCCCCC Q lcl|NC_012753. 434 DGVFTDRNAEFDYWSKM---V--AAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFR----TSEEVDIYGE 502 (502) Q Consensus 434 d~i~~d~~~~~~~~~~~---~--~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~----~~~~~~~~g~ 502 (502) ..--..+..+++.+..- . ..+.+++.++.++..-.||| .+++.+.|++|....--..+ .-..+++-|| T Consensus 403 ~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk~yi~k~ILr~tde-i~~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~ 479 (533) T protein:vir:58 403 RSNSIVEGERFAVIEQRIGIAERLKGWVREDWIYSNILQIPYD-LKPQEEVAEAAGGGGLFDTGGFGEETTPADFLGE 479 (533) T ss_pred ccchHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHhcCChh-hhHHHHHHHHhhcCCCCCCCCcccccCCcccCcc Confidence 65433444444333321 1 23678888766677788885 44444667776443211000 0012222333 No 231 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=93.68 E-value=0.0067 Score=32.50 Aligned_cols=387 Identities=11% Similarity=0.054 Sum_probs=159.9 Q ss_pred cchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccc-cccceecchHHHHHHHHhhhhhcCcceE-e-eC Q lcl|NC_012753. 21 QSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQV-KRDFNHLPIGRTASKKVASLVFNEQATI-R-VD 97 (502) Q Consensus 21 ~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-~~~~~~~n~~k~iv~~~a~~l~~ep~~i-~-~~ 97 (502) |.|.+........ +.-.|.....+ .... ....+... ..+.+.+.--...|+..|+-+-+=|..+ . .+ T Consensus 1 m~~~~~~~~~~~~-------~~~~~~~~~~~-~~~~--~~~~g~~v~~~~al~~~~v~~~i~~ia~~ia~lp~~~~~~~~ 70 (419) T protein:vir:57 1 MFIPQFWKGRPSE-------NRVNWQVVPGG-MRSS--SSQAGVIITPETALALSAVRACVTLLAESVAQLPCVLYRRTE 70 (419) T ss_pred CcchhhhccCCcc-------ccccccccccc-cccc--cccCCceechHHhhccHHHHHHHHHHHHhhccCceEEEEEcC Confidence 3333332211100 00001110011 0000 00011111 0111222222455566666555545543 1 11 Q ss_pred CH--H--HHHHHHHHHh-----hccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEEcCCCeEEEE Q lcl|NC_012753. 98 NE--V--ADAFINETLK-----NDKFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQANTQDVSSAA 166 (502) Q Consensus 98 d~--~--~~e~l~~~~~-----~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~d~~~~~~~~ 166 (502) +. . ....|..+|. ......-....+...+..|.+|+.+..+. |. +.+..++|..+-+.. +.++. T Consensus 71 ~g~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~pl~~~~v~v~~-~~~g~---- 145 (419) T protein:vir:57 71 NGGREIAFDHPLHDLIRYQPNRKDTAFEYHEQTQGVLGLEGNSYSLIDRNGRGDITELIPINPHKVIVLK-GPDGM---- 145 (419) T ss_pred CCceeccccchHHHHHhhccccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEEcCcceEEEE-CCCce---- Confidence 11 1 1223444442 12334445556777788899988888775 44 366667777665532 11111 Q ss_pred EEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccc Q lcl|NC_012753. 167 IVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMN 246 (502) Q Consensus 167 ~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n 246 (502) .+|+ ..+. |..+|..+ .++++.+. T Consensus 146 -------------~~y~------~~~~-----------------~~~~~~~~------------------vih~r~~~-- 169 (419) T protein:vir:57 146 -------------PYYD------IPSI-----------------GEILPMRM------------------VHHIKSFS-- 169 (419) T ss_pred -------------EEEE------EcCC-----------------ceEEchhh------------------EEEecCcC-- Confidence 0110 0000 11111110 12333221 Q ss_pred cccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhcc-ceeeechHHhccCCCCCCcccCccc-ccccc-chhhcccc- Q lcl|NC_012753. 247 NKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQ-RRVAVPTQMIKTEYDTNGEKVTVKR-EFETG-HNVYEQFD- 322 (502) Q Consensus 247 ~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~-~~i~v~~~~l~~~~~~~g~~~~~~~-~~~~~-~~~~~~~~- 322 (502) .+..+|+|.+..+...|+....+-....+-|..+. ..-+ |......+........ .+... ...+.... T Consensus 170 ---~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi-----l~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n 241 (419) T protein:vir:57 170 ---LDGYIGTSPIQTNPDVLGLGIAVEQHAAQVFARGTTMSGV-----IERPFEAKAIASQAAVDAILAKWTERYGGVRN 241 (419) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHHccCCccEE-----EEecCcCCcccCHHHHHHHHHHHHHHhccccc Confidence 12457999888888777755444333333345433 2222 2211111110000000 00000 00000000 Q ss_pred ---CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 323 ---SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVE 398 (502) Q Consensus 323 ---~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~ 398 (502) ..--+.+..++.++.....-++.+..+...++|+...|++|..++....+. +++++.. ...++ T Consensus 242 ag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~-------------~~f~~ 308 (419) T protein:vir:57 242 AFSVGMLQEGMTYKQLSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQG-------------LQYVI 308 (419) T ss_pred cccceecCCCceEEEcCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHH-------------HHHHH Confidence 000012223555665566667888888888999999999999998655432 2222221 11123 Q ss_pred HHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCH-HHHHHH- Q lcl|NC_012753. 399 KSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTK-EQAQEI- 476 (502) Q Consensus 399 ~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~d-eea~~e- 476 (502) ..|..++..|-...+. .+..........+.|+++.-+..|..+.++...+++.+|+++.-++++. .|+.. +..++- T Consensus 309 ~~l~P~~~~ie~~l~~-~ll~~~~~~~~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~T~NE~R~~-~gl~p~~ggD~~~ 386 (419) T protein:vir:57 309 YTMLAILKRHESAMMR-DLLLPSERRDFYIEFNVSSLLRGDQKSRYESYALGRQWGWLSVNDIRRM-ENLTPIPGGDKYL 386 (419) T ss_pred HHHHHHHHHHHHHHHh-hccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH-hCCCCCCCcCeee Confidence 3344433333222111 1111111122345555556667789999999999999999999997654 34432 111111 Q ss_pred -------HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 -------YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 -------l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+.+.+... +.|....+.+....=. T Consensus 387 ~~~n~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 418 (419) T protein:vir:57 387 TPLNMVDSKALTGIGK-ATPQQLKDIEAILCTR 418 (419) T ss_pred eccccccccccccccC-CCcccCcchhhhhhcc Confidence 111111111 1111111111111101 No 232 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=93.61 E-value=0.007 Score=32.41 Aligned_cols=436 Identities=13% Similarity=0.146 Sum_probs=190.0 Q ss_pred CChhH---HHHHHHHHHhhcccccchhh----hhc--------cc--------------------cccCCHHHHHHHHHH Q lcl|NC_012753. 1 MGIIQ---TIKNFIKRSNYVITNQSLNS----ITD--------HP--------------------KIAISPEEYNRIMDN 45 (502) Q Consensus 1 m~~~~---~ik~~i~~~~~~~~~~~l~~----i~~--------~~--------------------~~~~~~~~~~~i~~~ 45 (502) |+++. -+|.|-+.--. ...+.+.+ +.. .. ..++ +....-|+.+ T Consensus 4 ~~~~~~l~~~~~~~~~d~~-~~~~~~~~~~~s~~~p~~~dGa~~i~~~~~~~~~~g~~~~~y~~~e~~~-~~~~eLI~~Y 81 (524) T protein:vir:98 4 LGFGNVLSFFKNFAREDEI-ELEQQLKNDTGSVAPPKNNDGAYEIETDLNNQKYAGVFQQFYSGQDPAI-QNKEQLINTY 81 (524) T ss_pred cchhhHHHHhhhhhhhhhh-hHhhhhcCCcccccCCCCCCCceeecCCCCcceecceeeeecccccccc-chHHHHHHHH Confidence 23332 23333221000 00000000 000 00 0000 0111222222 Q ss_pred HHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhh-hhcCcceEeeCCH--------HHHHHHHHHHhhccHHH Q lcl|NC_012753. 46 LRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASL-VFNEQATIRVDNE--------VADAFINETLKNDKFSK 116 (502) Q Consensus 46 ~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~-l~~ep~~i~~~d~--------~~~e~l~~~~~~~~f~~ 116 (502) +.+... +-+. +--..||+...-+ -..+|+.+.+++. ...+..+.+++--+|++ T Consensus 82 R~ma~~--pEvd----------------~Av~eIVneaIv~~~~~~pV~l~L~~~~~s~~iK~kI~eeF~~Il~ll~F~~ 143 (524) T protein:vir:98 82 RGIMSY--PEVE----------------NAVSEIIDDAIVNEQGKDIITMDLAKTNFSKAIQDKIVEEFDNVLNIYDFDN 143 (524) T ss_pred HHHhhc--cchh----------------hHHHhhhcceeEecCCCceEEEEecccccchHHHHHHHHHHHHHHHHhccch Confidence 222211 0000 0001222221111 1125666666643 24566677777779999 Q ss_pred HHHHHHHHHhhcCCEEEEEEEeCC---c-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEE-EEEEEEEe Q lcl|NC_012753. 117 NFERYLESCLALGGLAMRPYIDGD---Q-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYS-LIEFHEWN 191 (502) Q Consensus 117 ~~~~~~~~~~~~G~~~~~~~~d~~---~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt-~~E~h~~~ 191 (502) +..+.+....+-|..|++..+|++ + ..+..++|..+-+|..--.+. .++.. ..++ ..|++..+ T Consensus 144 ~~~~~fR~WYVDgRi~fhkiid~~~~kGI~ELr~lDPr~i~~vr~~~~~~-----------~~~~~-~v~~~~~e~f~Y~ 211 (524) T protein:vir:98 144 MGARLFRDWYVDSRIYFHKIMHKDESKGIRELRQLDPRCMELIRESITET-----------LDGGV-KVFRGYREFFVYS 211 (524) T ss_pred hhhHHHhhhhhcceeEEEEEEcCCCCcceeeeeeeCCccceeeeeccccc-----------cccch-hhccceeeeeeec Confidence 999999999999999999998753 2 468888998876653110000 00000 0010 01221110 Q ss_pred --CCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHH Q lcl|NC_012753. 192 --KETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFI 269 (502) Q Consensus 192 --~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~l 269 (502) ...|...-..|.. +. +-.+|-+.+ ++...+-.+ +.... +|-|..+......| T Consensus 212 ~~~~~~~~~g~~~~~-~~---~ikI~~dAI--------vy~hSGL~d-------------~~~~i-isyLhkAiKp~NQL 265 (524) T protein:vir:98 212 APKAGYTYNGQIYQA-NQ---KIKIPRSAI--------VYAHSGLED-------------CSNNI-IGYLHRAVKPANQL 265 (524) T ss_pred cCCCccccccceecC-CC---ceeechhhe--------eeeccCccc-------------CCCCe-eeehhHhhHhHHhh Confidence 1111110011110 00 111221111 111000000 00000 23333333222222 Q ss_pred HHHHHHHH--HHHhhccceee-ech-------------HHhcc-----CCCCCCcccCccccccccchhhccccCCCCcc Q lcl|NC_012753. 270 NTTYDEFM--WEVKMGQRRVA-VPT-------------QMIKT-----EYDTNGEKVTVKREFETGHNVYEQFDSGDMDK 328 (502) Q Consensus 270 d~~~S~~~--~~~~~~~~~i~-v~~-------------~~l~~-----~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 328 (502) =-+-+.++ +-.++-..||| |+- ++... +-+...++++.++-+..-...|..- --+|+. T Consensus 266 km~EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLp-RReGgr 344 (524) T protein:vir:98 266 RLLEDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLM-RRDGKA 344 (524) T ss_pred HHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhhhccc-ccCCCC Confidence 21111111 11234444554 211 01100 0122233333222222222222211 113333 Q ss_pred ccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccc--cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 329 GIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKS--MKTATEVVSEQSDTYQMRNSIATLVEKSLKELVI 406 (502) Q Consensus 329 ~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~--~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~ 406 (502) +.-|+++..--...+ ++-++.+.+.+....++|.+.+..++++ .--++||....-....-+.+++..|..-+.++++ T Consensus 345 gTEItTLpggqnlge-m~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~ 423 (524) T protein:vir:98 345 ITEVSTLPGGQNFSD-MDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLSDPLK 423 (524) T ss_pred ccceeeccccCCcCh-HHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 344666544322222 2346666777888888887777543221 1124455544444556777888888888888888 Q ss_pred HHHHHHHhhcccCCC-ccc-ccceEEEeCCCccCCHHHHHHHHHHHH-----hcC----CCCHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 407 SILELAKVYNLYTGE-IPT-MDEVSVDLDDGVFTDRNAEFDYWSKMV-----AAG----FAPKTMAIEKTLNVTKEQAQE 475 (502) Q Consensus 407 ~il~~~~~~~~~~~~-~~~-~~~i~v~f~d~i~~d~~~~~~~~~~~~-----~~G----i~S~et~l~~~~~~~deea~~ 475 (502) .-|.+-. ++... +.. ...+.++|...--..+..+++.+..-. ..+ ..|.+++.++....||+|.++ T Consensus 424 ~qLilKg---iit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~ 500 (524) T protein:vir:98 424 TNLIAKK---IITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDE 500 (524) T ss_pred Hhhhhhc---CCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHHHHH Confidence 7665432 22211 111 124778886654444444444433211 112 688999878888999999999 Q ss_pred HHHHHHHhhhcccCCCCCccccCC Q lcl|NC_012753. 476 IYQKINDETMVSTDSFRTSEEVDI 499 (502) Q Consensus 476 el~ri~~E~~~~~~~~~~~~~~~~ 499 (502) +.+.|++|....--..+..+..|| T Consensus 501 ~~k~I~~E~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 501 QAKLIEEESKEERFKNPEAEEENF 524 (524) T ss_pred HHHHHHHHHhCCCCcCCccccccC Confidence 999999998766555556666677 No 233 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=93.56 E-value=0.0071 Score=32.35 Aligned_cols=409 Identities=11% Similarity=0.048 Sum_probs=155.0 Q ss_pred CCh----hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH Q lcl|NC_012753. 1 MGI----IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG 76 (502) Q Consensus 1 m~~----~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~ 76 (502) |.+ ++.+ .|.-..+- ++-..|..+..++. +-.... =++ T Consensus 92 ~~~~~~~~~~l-~~~~~~~F-~Gy~~la~laQ~~e------yr~~~~------------------------------~ia 133 (698) T protein:vir:10 92 LDFNGTSMDAL-SFVTSSGF-PGFPTLVLLAQLPE------YRAMHE------------------------------VLA 133 (698) T ss_pred hcccccccccc-hhhhccCc-chHHHHHHHhhccc------hhhHHH------------------------------HHH Confidence 332 0000 01100000 00011111111100 000000 011 Q ss_pred HHHHHHHhhhhhcC-----------cceEee-CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeCCceEE Q lcl|NC_012753. 77 RTASKKVASLVFNE-----------QATIRV-DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDGDQIRV 144 (502) Q Consensus 77 k~iv~~~a~~l~~e-----------p~~i~~-~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i 144 (502) +..+++|-....+. ...+.- ++.+..+.|+.-++.-+....++++++++-.+|++.+.+-++++.... T Consensus 134 ~e~~R~w~~~~~~~~e~~~~~g~~~~~~~~~~~d~dqi~~L~~e~erl~V~~~l~eai~~aRlfGGa~~~i~I~gdd~~l 213 (698) T protein:vir:10 134 DECIRTWGEAIGGTKEKADTSGLAAGGNAASTSDGDQLKQINDEIERLRIRDAVRTTVIHDQAFGRAHPYFKIKGDDQIM 213 (698) T ss_pred HHhhcccceeccccchhhhhhcccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEEeecCcccc Confidence 11111111111010 001111 233455677777777789999999999999999998776665422000 Q ss_pred E---EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccccc Q lcl|NC_012753. 145 S---FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYE 221 (502) Q Consensus 145 ~---~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~ 221 (502) . +..+.. ...|......++.+. +.......+..-+.-+-.....|+| .|.+|.-+ T Consensus 214 ~~PL~~~~~~-----I~kGslKGL~ViDp~-~vtP~~~n~~dP~spdfgkP~~y~V------------~G~~IH~S---- 271 (698) T protein:vir:10 214 DTPLVPRPYT-----VPKGSFQGLRVVEPY-WVTPNNYNSINPVADDFYKPSTWWM------------IGSEVHAT---- 271 (698) T ss_pred cccccccccc-----ccCccceeeeeeccc-ccccchhhhccchhhccCCCceEEE------------ecceecce---- Confidence 0 001111 011111111111111 0000000000000000000001111 12221100 Q ss_pred CCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeee-chHHhccCCCC Q lcl|NC_012753. 222 DLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV-PTQMIKTEYDT 300 (502) Q Consensus 222 ~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v-~~~~l~~~~~~ 300 (502) .+ ..+.+.+.| -.+|+. ...+|+|....+.+-+++.+++......=+. +..+-+ -.+|-. ... T Consensus 272 RL---~~~vg~pvp--d~LKp~-------y~f~G~Sv~q~~~e~V~~~~rT~~~v~~Li~--~~~~~~l~~dla~-aL~- 335 (698) T protein:vir:10 272 RL---HTIVSRPVG--DMLKPT-------YSFAGISMTQLAMPYIDNWLRTRQSVSDIVK--QFSVSGILMDLAQ-ALT- 335 (698) T ss_pred eE---EEecCCCch--hhhcch-------hccCCccHHHHHHHHHHHHHHHhhhHHHHHH--HhhHHHHHHHHHH-hcC- Confidence 00 011111111 112221 2357999999999999998876654443221 011111 112111 111 Q ss_pred CCcccCccccccccchhhcccc---CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhh-cccccccc-cc Q lcl|NC_012753. 301 NGEKVTVKREFETGHNVYEQFD---SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGM-FSFDGKSM-KT 375 (502) Q Consensus 301 ~g~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~-~t 375 (502) +|+.......+.. .+.++... .-|+ +..-++.+ +....-....+....++++..++++... ||.+..|. +| T Consensus 336 ~g~~~~l~~R~el-i~~~Rsn~G~~llDk-~~Eefeq~--st~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNAT 411 (698) T protein:vir:10 336 PGANVDLSMRAEL-INRYRDNRNILFLDK-ATEEFFQF--NTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNAS 411 (698) T ss_pred ChhhHHHHHHHHH-HHHhcCccceEEEec-CCcceEEE--ecCcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCcc Confidence 1211111100000 01111111 1111 11223333 3444556677777788888888888654 88887775 67 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHH-------HH Q lcl|NC_012753. 376 ATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDY-------WS 448 (502) Q Consensus 376 Atei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~-------~~ 448 (502) +..=...|-+..... .++.++..|+.|+.+|..-. + |.. +.++++.|+---..++.+.++. .. T Consensus 412 GE~D~rnYYD~I~s~--Qe~~L~p~L~rl~~ii~rS~-----~-G~i--dp~i~~~fnPL~qmtd~EkAeI~~k~A~~d~ 481 (698) T protein:vir:10 412 SEGEIRVWYDYVRAY--QRNALQQLMNDVIVMIQLSL-----F-GAV--DPSIKWQWNALRELDDLEVAEARYKQAQSDV 481 (698) T ss_pred chhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHh-----c-CCC--CCcceEEeCCCCCcCHHHHHHHHhhhhHHHH Confidence 764333444433322 36778999999887765321 1 222 2368999986555555554443 33 Q ss_pred HHHhcCCCCHHHHHHhcC-----CCC---H----------HHHHHHHHHHH---HhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 449 KMVAAGFAPKTMAIEKTL-----NVT---K----------EQAQEIYQKIN---DETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 449 ~~~~~Gi~S~et~l~~~~-----~~~---d----------eea~~el~ri~---~E~~~~~~~~~~~~~~~~~g~ 502 (502) .++..|+++......++- +.. | ++++.++..++ +--....|+ ...+-.-|- T Consensus 482 ~~~~~gvI~~~evr~rL~~d~~s~Y~~~~d~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ 553 (698) T protein:vir:10 482 LYVQEQVIRPDQVAARLNTEPDGPYAGKLDANDDPGAPADDDIDGVLTYVQRMAEGGDTGAPT---APGGARAGA 553 (698) T ss_pred HHHHhcCCCHHHHHHHHhccCCCccccccCCcccCCCCCCCcchHHHhhhcCCcCCCCccccc---ccccccCCC Confidence 444567777766544321 111 0 01111111000 000000000 000011111 No 234 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=93.53 E-value=0.0073 Score=32.31 Aligned_cols=392 Identities=12% Similarity=0.093 Sum_probs=152.4 Q ss_pred CChhHHHHHHHHHHhh-cccccchh----hhhccccccCCHHHHHHHHHH----HHHhcCCCCccccccCCCccccccce Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNY-VITNQSLN----SITDHPKIAISPEEYNRIMDN----LRYFAGDFDSVTYRDSNGSQVKRDFN 71 (502) Q Consensus 1 m~~~~~ik~~i~~~~~-~~~~~~l~----~i~~~~~~~~~~~~~~~i~~~----~~~Y~g~~~~~~~~~~~~~~~~~~~~ 71 (502) =.|+..-.++++..-. .....++. .+..+.-..+.+.... +... .+.|+- .. .+.. T Consensus 3 ~~i~~~~g~~~~~~~~~~~~~~~ia~~~~~~~~~~~~~~~p~~~~-il~~~~~~~~~y~~------------m~-~D~~- 67 (491) T protein:vir:79 3 KGLWVSPTEFVKFGEPDKSLSSQIATRARSIDFFALGMYLPNPDP-VLKALGKDIRVYRE------------LR-ADAH- 67 (491) T ss_pred CeeeCCCCCcccccccchhHHHHHhhhccccccccccccCcchhH-HHhhccCCHHHHHH------------Hh-hChH- Confidence 0111111111111000 00000000 0001111111111111 1000 011100 00 0111 Q ss_pred ecchHHHHHHHHhhhhhcCcceEee--CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC--Cce---EE Q lcl|NC_012753. 72 HLPIGRTASKKVASLVFNEQATIRV--DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG--DQI---RV 144 (502) Q Consensus 72 ~~n~~k~iv~~~a~~l~~ep~~i~~--~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~---~i 144 (502) + .-.+.+-..-+++.+..|.. +++...+++.+++++-.|...+..++ +|..+|-+++-+.|+. +.. ++ T Consensus 68 ---i-~s~l~~Rk~av~~~~w~i~~~~~~~~~a~~i~e~l~~~~~~~~i~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l 142 (491) T protein:vir:79 68 ---V-GGCVRRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIATEML-DAVLYGYQPMEITWGKVGNYIVPIDV 142 (491) T ss_pred ---H-HHHHHHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeeEEee Confidence 1 22334445556677777765 34456789999998878888887776 4888998888777753 332 35 Q ss_pred EEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC Q lcl|NC_012753. 145 SFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE 224 (502) Q Consensus 145 ~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~ 224 (502) .++++..|.. +..+.. ++ +..+...-|.++| T Consensus 143 ~~r~~~~f~~---d~~~~l--------------------------------~l-----~~~~~~~~g~~lp--------- 173 (491) T protein:vir:79 143 VGKPADWFVY---DPENQL--------------------------------RF-----RSKEHWVQGEELP--------- 173 (491) T ss_pred eeecccceee---ccCCce--------------------------------EE-----eecCCCCCceeec--------- Confidence 5555543321 111100 00 0000000011111 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCCCCCc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~ 303 (502) +. -|+.++. ....++|+|.|.+..+--..---+..+..|+.=++- |... .+ .+ ++.+... T Consensus 174 ~~---------k~i~~~~----~~~~g~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~~G~P~-~i----gk-y~~~a~~ 234 (491) T protein:vir:79 174 AR---------KFLVPRQ----EATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPM-LV----GK-HPRSASD 234 (491) T ss_pred CC---------CeEEEEe----cCCCCCcccchhHHHHHHHHHHHHhhHHHHHHHHHHcCCCe-EE----Ee-cCCCCCH Confidence 00 1233321 123456888888888766554444444444433332 3322 22 12 2221111 Q ss_pred ccCccccccccchhhccccCCCC---ccccceeeecccc---chHHHHHHHHHHHHHHHHhcCCChhhccccccccccHH Q lcl|NC_012753. 304 KVTVKREFETGHNVYEQFDSGDM---DKGIGITDLTTDI---RSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT 377 (502) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt 377 (502) . ....+ .+....+..+.+ ..+.-|+.++..- ..+.|.+.++.+-++|+..+ ++. +++-+++|..... T Consensus 235 ~--ek~~l---~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LGq-tlTt~~~gs~a~~ 307 (491) T protein:vir:79 235 A--ETNLL---LDRLEDMVQDAVAVIPDDSSIEIKEAAGKSGSADVYERLLHFCRGEVSIAL-LGQ-NQTTEATSTRASA 307 (491) T ss_pred H--HHHHH---HHHHHHHhcCeEEEecCCceeEEEeccCCCCChhHHHHHHHHHHHHHHHHH-hhh-hhccCcccchhhH Confidence 0 00000 011111111100 0112244433221 12235555555445554433 221 1221222221122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 378 EVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 378 ei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) ++. ..-....+..-.+.+...|++|++-++.+ +. . ....+.+.|.+.-- ..+..++.+.+++..|+-- T Consensus 308 ~vh--~~v~~~i~~~D~~~i~~tln~li~~l~~~----N~---~--~~~~p~f~~~e~ee-~~~~~a~~~~~L~~~G~~i 375 (491) T protein:vir:79 308 QAG--LEVTDDIRDGDKAIVVEAMNMLIRWICDL----NF---D--GAARPVFDMWEQEQ-VDEIQAGRDEKLTRAGARF 375 (491) T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cC---C--CCCcceEeecCcCc-hhHHHHHHHHHHHhCCCcc Confidence 232 12233444455666778888888776654 21 1 11235566765332 2245678888999999855 Q ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 458 KTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 458 ~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) .+.++.+.+|+++.+-.++........... ........--+. T Consensus 376 ~~~~~~e~~Gip~~~~~e~~~~~~~~~~~~---~~~~~~~~~~~~ 417 (491) T protein:vir:79 376 TPAYFKRAYNLQDGDLDERPLPVSAVDAVG---AASFAEFEAPDQ 417 (491) T ss_pred CHHHHHHHhCCCCCCCCccccCcCcccccc---cccccccCCCCC Confidence 556688888886532222211111111000 000000000011 No 235 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=93.24 E-value=0.0083 Score=32.00 Aligned_cols=407 Identities=12% Similarity=0.098 Sum_probs=160.1 Q ss_pred CChhH-----HHH-HHHHHHhhcccccchhh-hhccccccCCHHHHHHHHHHHHHhcCCCCccccccCC-Ccccccccee Q lcl|NC_012753. 1 MGIIQ-----TIK-NFIKRSNYVITNQSLNS-ITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSN-GSQVKRDFNH 72 (502) Q Consensus 1 m~~~~-----~ik-~~i~~~~~~~~~~~l~~-i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~-~~~~~~~~~~ 72 (502) |+-|= -|+ .-+++. .....--+.+ +..|+---+++.++.+|-.... .| +...+.... ....++.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~-~~~~~~~~~~~~~~~~~~gltp~~l~~iLr~a~--~g--d~~~~~~L~e~m~e~D~~-- 73 (526) T protein:vir:99 1 MAQIVDVYGNPIRTQQLREP-QTSRLAGLAKEFAQHPAKGLTPAKLARILVEAE--QG--NLQAQAELFMDMEERDAH-- 73 (526) T ss_pred CCeeECCCCCccccccccch-hhhhhhhhhhhhcccCcCCCCHHHHHHHHHhhh--CC--CHHHHHHHHHHHHhhChH-- Confidence 32211 000 000000 0000000001 1122222445666555543221 01 100000000 00000111 Q ss_pred cchHHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCCEEEEEEEeC--Cce- Q lcl|NC_012753. 73 LPIGRTASKKVASLVFNEQATIRVD------NEVADAFINETLKND-KFSKNFERYLESCLALGGLAMRPYIDG--DQI- 142 (502) Q Consensus 73 ~n~~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~~~~~-~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~- 142 (502) + .-...+-..-+++.+..|.-. ++...+++++++.+- +|...+..++ +|..+|-+++-+.|+. +.. T Consensus 74 --i-~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~l-da~~~G~s~~Eivw~~~~g~~~ 149 (526) T protein:vir:99 74 --L-FAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDAL-DGIGHGYSCIELEWALQGREWM 149 (526) T ss_pred --H-HHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcccCHHHHHHHHH-HhhhhcceeEEEEEeecCCcee Confidence 1 122333344455666666542 234567888888763 5888887766 4888998888777753 222 Q ss_pred --EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccc Q lcl|NC_012753. 143 --RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLY 220 (502) Q Consensus 143 --~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~ 220 (502) .+.+.++..|.- +... ...+. ++. ...-|.++| T Consensus 150 ~~~l~~r~~~~f~~---~~~~--------------------------------~~~l~---~~~--~~~~g~~l~----- 184 (526) T protein:vir:99 150 PLAFHHRPQSWFQL---NPED--------------------------------QNELR---LRD--NSPAGEALQ----- 184 (526) T ss_pred EEEeeeecccceee---ccCC--------------------------------CcEEE---ecC--CCCCceeec----- Confidence 233333332110 0000 00000 000 000011111 Q ss_pred cCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-hccceeeechHHhccCCC Q lcl|NC_012753. 221 EDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-MGQRRVAVPTQMIKTEYD 299 (502) Q Consensus 221 ~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~~~~~i~v~~~~l~~~~~ 299 (502) +. -|++++. ....++|+|.|.+..+.-..--=+..+..|+.=++ .|....+. + ++. T Consensus 185 ----~~---------k~i~~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~ig-----k-y~~ 241 (526) T protein:vir:99 185 ----PF---------GWIIHRP----RARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLG-----K-YPP 241 (526) T ss_pred ----CC---------CeEEEee----cCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEE-----e-cCC Confidence 00 1222221 22346788888887765554433334444443333 23332222 1 222 Q ss_pred CCCcccCccccccccchhhccccCCCC---ccccceeeecc-ccchHHHHHHHHHHHHHHHHhcCCChhhcccccc--cc Q lcl|NC_012753. 300 TNGEKVTVKREFETGHNVYEQFDSGDM---DKGIGITDLTT-DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGK--SM 373 (502) Q Consensus 300 ~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~--~~ 373 (502) +.... ....+ .+....+..+.+ ..+.-|+.++. .-..+.|.+.++.+-++|+..+ ++...-++.++ +. T Consensus 242 ~a~~~--ek~~L---~~av~~i~~d~~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~g 315 (526) T protein:vir:99 242 GTADE--EKATL---LRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGG 315 (526) T ss_pred CCCHH--HHHHH---HHHHHHHhhCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccCcch Confidence 11110 00000 011111111100 01112333332 1233446666666666665554 22221121111 11 Q ss_pred ccHH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_012753. 374 KTAT-EVVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMV 451 (502) Q Consensus 374 ~tAt-ei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~ 451 (502) +-|. ++. ..-....+..-.+.+...|+ +|++.++.+ |. ++......-+.+.|+..-++|..+.++.+.+++ T Consensus 316 S~a~g~vh--~~v~~di~~aDa~~i~~tln~~Li~~l~~~----N~-~~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~ 388 (526) T protein:vir:99 316 AFALGQVH--NEVRHDLLASDARQLAATLSRDLLWPLLVL----NR-PGSPDVRRAPRLVFDLREQADITSMAQSIPALV 388 (526) T ss_pred hhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CC-CCcCCccccceEEeCCCCcccHHHHHHHHHHHH Confidence 1111 121 22223344445566777775 587776654 21 111122234677888888899899999999999 Q ss_pred hcCC-CCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 452 AAGF-APKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 452 ~~Gi-~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ..|+ +|.+ ++.+.+|+.+.+-.+.+- ........+....+...--+.+ T Consensus 389 ~~G~~i~~~-~i~e~~Gip~~~~~e~~l--~~~~~~~~~~~~~~~~~~~~~~ 437 (526) T protein:vir:99 389 NVGLEIPSA-WVYDKLGIPQPAKNEPVL--RSAAQPAILSRQHGQRVAALAT 437 (526) T ss_pred hCCCccCHH-HHHHHhCCCCCCCccccc--CCCCCCcccccccccccccccc Confidence 9998 5555 577778875422111211 0011110111101111000111 No 236 >protein:vir:96988 Length: 516 # NCBI annotation: 29 # Family: family:all:481 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654130;genbank:gi:108862014;genbank:GeneID:5075937 Probab=92.89 E-value=0.0096 Score=31.65 Aligned_cols=430 Identities=10% Similarity=0.078 Sum_probs=162.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |. +=++ .-+...-.++...+. .+..++-.....|+.++.=-.+.+... .+......++--+.+...+ T Consensus 1 ~~------~~~~----~~~~~~~~~l~~r~~-~L~~~R~~~e~~w~e~a~~~lP~~~~~--~~~~~~~~~~~dstg~~a~ 67 (516) T protein:vir:96 1 MK------QSID----LEYGGKRSKIPKLWE-KFSNKRSSFLDRAKHYSKLTLPYLMND--KGDNETSQNGWQGVGAQAT 67 (516) T ss_pred Cc------chhh----hhhhhhHHHHHHHHH-HHHHHhhHHHHHHHHHHHhhcccccCC--CCCccccCCcccchHHHHH Confidence 10 0000 000000111111111 122333333445554432221212111 1111112222234566777 Q ss_pred HHHhhhhhcC--cc-----eEeeCCH-------------HHHHH-------HHHHHhhccHHHHHHHHHHHHhhcCCEEE Q lcl|NC_012753. 81 KKVASLVFNE--QA-----TIRVDNE-------------VADAF-------INETLKNDKFSKNFERYLESCLALGGLAM 133 (502) Q Consensus 81 ~~~a~~l~~e--p~-----~i~~~d~-------------~~~e~-------l~~~~~~~~f~~~~~~~~~~~~~~G~~~~ 133 (502) +.+|+-|.+- || ++++++. .+.++ +...+..++|...+.++.......|.+++ T Consensus 68 ~~LAa~l~~~ltpp~~~WF~L~~~~~~~~~~~~~~~~~~~v~~~L~~ve~~~~~~l~~snf~~~~~~~~~~L~~~G~a~l 147 (516) T protein:vir:96 68 NHLANKLAQVLFPAQRSFFRVDLTAQGEKVLNQRGLKKTELATIFAQVETRAMKELEQRQFRPAVVEAFKHLIVAGSCML 147 (516) T ss_pred HHHHHHHHhhhcCCCCcccccccChhHHhhccccCchhHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHCeEeE Confidence 7777776652 22 1333321 13333 34456678999999999999999998764 Q ss_pred EEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEE---------------------eeCCCceEEEEEEEEEEeC Q lcl|NC_012753. 134 RPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTK---------------------TEGQKVKYYSLIEFHEWNK 192 (502) Q Consensus 134 ~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~---------------------~~~~~~~~yt~~E~h~~~~ 192 (502) |.|+.. .+..+|-.+++ +..|..+....+|...... ........|+++++ ++ T Consensus 148 --~~d~~~-~~~~~pl~~y~-v~~d~~G~v~~i~rr~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~v~v~~~v~~---~~ 220 (516) T protein:vir:96 148 --YKPSKG-AISAIPMHHYV-VNRDTNGDLLDIILLQEKALRTFDPATRAVVEVGLKGKKCKEDDSVKLYTHAKY---LG 220 (516) T ss_pred --EecCCC-CEEEEEcCeEE-EeeCCCCCeeeehhhhHhhHHHHHHhhhhhhhhhhhhhhcCCCCceEEEEeeee---eC Confidence 556543 25566666744 4556554443444221100 01112234444443 33 Q ss_pred CeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHH Q lcl|NC_012753. 193 ETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTT 272 (502) Q Consensus 193 ~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~ 272 (502) +.+...| ...+...+|. +..+ +...-||+.++-+ ...++.||+|--..+.+-+..|+.. T Consensus 221 ~~~~~~~---~~~d~~~~~~-----------es~~---~~~e~P~~~~Rw~----~~~ge~YGrgp~~~~L~D~k~L~~l 279 (516) T protein:vir:96 221 DGFWELK---QSADDIPVGK-----------VSKI---KSEKLPFIPLTWK----RSYGEDWGRPLAEDYSGDLFVIQFL 279 (516) T ss_pred CceeEEE---EEeCceeecc-----------cccc---ccccCCeeeeeee----ecCCCCcccchHHHhhHHHHHHHHH Confidence 3332222 1111111111 1111 1122344444422 2347789999989999999999866 Q ss_pred HHHHHHHHh-hccceeeechHH-hccCCCCCCcccCccccccccchhhccccCCCCccccceeeec--cccchHHHHHHH Q lcl|NC_012753. 273 YDEFMWEVK-MGQRRVAVPTQM-IKTEYDTNGEKVTVKREFETGHNVYEQFDSGDMDKGIGITDLT--TDIRSDDYIKAI 348 (502) Q Consensus 273 ~S~~~~~~~-~~~~~i~v~~~~-l~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~--~~ir~e~~~~~l 348 (502) --....-.. ..+....||.+. ++.. ...+. ....+. ... ...+..++ +..+...-...+ T Consensus 280 ~~~~l~~~~~a~~~~~lv~p~g~~~~~------~l~~~-----~~g~i~---~g~---~~~v~~~q~~~~~d~~~~~~~i 342 (516) T protein:vir:96 280 SEAVARGAALMADIKYLIRPGAQTDVD------HFVNS-----GTGEVV---TGV---EEDIHIVQLGKYADLTPISAVL 342 (516) T ss_pred HHHHHHHHHHhcCCccccCcccccchh------hhccC-----CCceee---cCC---cccceeeecCcccchhHHHHHH Confidence 555554332 344444554322 2111 11100 000000 011 11122221 211222223334 Q ss_pred HHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhhcccCCCccc Q lcl|NC_012753. 349 NKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNS----IATLVEKSLKELVISILELAKVYNLYTGEIPT 424 (502) Q Consensus 349 ~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~----~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~ 424 (502) +.+.+.|....=+.. +....+...|||||....+.+.+..+- ++.+|.. .|+.-++... .+ . .+ T Consensus 343 ~~~~~rI~~af~~~~--l~~r~~~rvTAtEV~~r~~E~~~~LGpv~~rl~~Ell~---Pli~r~l~~~-----~p-~-lp 410 (516) T protein:vir:96 343 EVYTRRIGVVFMMET--MTRRDAERVTAVEIQRDALEIEQNMGGVYSLFATTMQS---PVAMWGLLEA-----GE-S-FT 410 (516) T ss_pred HHHHHHHHHHHhhhh--hccCCCccccHHHHHHHHHHHHHHhhhHHHHHHHHHHH---HHHHHHHHhc-----CC-C-Cc Confidence 444444433221111 222233446999999887776665554 4444433 3333232211 11 1 11 Q ss_pred ccceEEEeCCCccCCHHHHHH------HHHHHHh--cCC-------CCHHHHH---HhcCCC------CHHHHHHHHHHH Q lcl|NC_012753. 425 MDEVSVDLDDGVFTDRNAEFD------YWSKMVA--AGF-------APKTMAI---EKTLNV------TKEQAQEIYQKI 480 (502) Q Consensus 425 ~~~i~v~f~d~i~~d~~~~~~------~~~~~~~--~Gi-------~S~et~l---~~~~~~------~deea~~el~ri 480 (502) ...+.++.-.. .+.....+ +..+.++ +++ +-...++ ....|+ ++||++++.+.- T Consensus 411 ~~~v~~~~vs~--l~~l~r~~~~~~i~~~~~~i~~~~~~~p~v~d~id~d~~~~~~a~~~Gvp~~~irs~eev~~~~~~~ 488 (516) T protein:vir:96 411 SDLVDPVIITG--IEALGRMAELDKLANFAQYMSLPLQWPEPVLAAVKWPDYMDWVRGQISAELPFLKSAEEMAQEQEAQ 488 (516) T ss_pred cccccceeech--HHHHHHHHHHHHHHHHHHHHHHHhcCChhHHhcCCHHHHHHHHHHHhCCCccccCCHHHHHHHHHHH Confidence 11222222111 11111111 1111110 111 1112222 222332 455554443322 Q ss_pred HHhhh-cccCCCCCccccCCCCC Q lcl|NC_012753. 481 NDETM-VSTDSFRTSEEVDIYGE 502 (502) Q Consensus 481 ~~E~~-~~~~~~~~~~~~~~~g~ 502 (502) .+.++ +..-..-+....+.=|- T Consensus 489 ~~~q~~~~~a~~~~~~~~~~~~~ 511 (516) T protein:vir:96 489 MQAQQAQMLEEGVAKAVPGVIQQ 511 (516) T ss_pred HHHHHHHHHHHHhhhhhhHHhhc Confidence 21111 10000000000011111 No 237 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=92.78 E-value=0.01 Score=31.54 Aligned_cols=387 Identities=11% Similarity=0.065 Sum_probs=150.2 Q ss_pred Hhhcccc---------cchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccc-cccceecchHHHHHHHH Q lcl|NC_012753. 14 SNYVITN---------QSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQV-KRDFNHLPIGRTASKKV 83 (502) Q Consensus 14 ~~~~~~~---------~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-~~~~~~~n~~k~iv~~~ 83 (502) |.|-..+ +-|+.+.......-.... +.-+ ++... .+.. .+... .++-+.+.--...|+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~lf~~~~~~~~~~~---~~~~--~~~~~--~~~~---~~~~vs~~~al~~~~v~~cv~~I 70 (424) T protein:vir:45 1 MLYCWWAHWLWPEGGRVLLDALFRSKSLENPSTP---ITGD--AVDTD--GLFR---ADVYVSPETAMKLAAVYSCIYVL 70 (424) T ss_pred CeeEeeeceecCcchhHHHHhhccccCCCCCccc---cchh--hhhhh--cccc---CCceechHHhhccHHHHHHHHHH Confidence 3221111 111111111110000000 0000 00000 0000 00000 00111111223345556 Q ss_pred hhhhhcCcceEee-C-CH---HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCe Q lcl|NC_012753. 84 ASLVFNEQATIRV-D-NE---VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATV 151 (502) Q Consensus 84 a~~l~~ep~~i~~-~-d~---~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~ 151 (502) |+-+-+=|+.+-- + +. ..+.-+.++|.. | ....-...++..++..|.+|+.+-.+. |.+ .+..++|.. T Consensus 71 a~~iA~lp~~v~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~v~~lll~Gna~~~i~r~~~G~~~~L~~l~~~~ 150 (424) T protein:vir:45 71 SSSLAQMPLHVMRRHKGKVEPARDHPAFYLVHDEPNTWQTSYKWRELKQRHILGWGNGYTWVKRNRRGEVISLDCCMPWE 150 (424) T ss_pred HHHHhhCceEEEEecCCceeecccchHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEEcCCCcEEEEEEecCce Confidence 6655555655421 1 11 111123333321 2 222344456777888899998887765 443 566677766 Q ss_pred EEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecC Q lcl|NC_012753. 152 FFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNG 231 (502) Q Consensus 152 ~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~ 231 (502) +-+. .+.++ ..|+. .-.++.. .++ + . T Consensus 151 v~i~-~~~~~------------------~~y~~----~~~~~~~-----------------~~~-----~---~------ 176 (424) T protein:vir:45 151 TTLM-NTGGR------------------YTYGL----YNEYGAF-----------------AIS-----P---D------ 176 (424) T ss_pred EEEE-EcCCe------------------EEEEE----EecCceE-----------------EEC-----c---c------ Confidence 5432 11111 01100 0000000 000 0 0 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHH-HhhccceeeechHHhccCCCCCCcccCcccc Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWE-VKMGQRRVAVPTQMIKTEYDTNGEKVTVKRE 310 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~-~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~ 310 (502) -.++++.+.. +...|+|.+..+...|.....+- .+... |..+...-.| |.....-+.... .. T Consensus 177 ----eVih~r~~~~-----d~~~G~spi~~~~~~i~~~~~~~-~~~~~~f~ng~~p~gi----l~~~~~l~~e~~---~~ 239 (424) T protein:vir:45 177 ----DMIHIRALGN-----NQKMGLSPIMQHAETIGMGMSGQ-KYTESFFSGNARPAGI----VSVKSGLNKESW---GW 239 (424) T ss_pred ----cEEEecCcCC-----CCcccccHHHHHHHHHHHHHHHH-HHHHHHHhccCCccEE----EEeCCCCCHHHH---HH Confidence 1233443211 23568888877776665443332 33333 4543332111 221111110000 00 Q ss_pred cccc-chhhccccCCCC-----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHHHH Q lcl|NC_012753. 311 FETG-HNVYEQFDSGDM-----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVSEQ 383 (502) Q Consensus 311 ~~~~-~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~ 383 (502) +... ...+.......+ +.+.-++.++.....-++.+..+....+|+...|++|..+|...++. +++.+.... T Consensus 240 ~~~~~~~~~~g~~~n~g~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~- 318 (424) T protein:vir:45 240 LKDQWQKASQALRRQENKTMLLPADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQ- 318 (424) T ss_pred HHHHHHHHhccccccCCceeEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH- Confidence 0000 011111100000 12223445554444557888888889999999999999998754432 233322111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC-CCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYT-GEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAI 462 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~-~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l 462 (502) .++..|..+++.|-.-.+. .+.. ........+.++.+.-+-.|..+.++...+++.+|+|+.-+++ T Consensus 319 ------------f~~~tL~P~~~~ie~~ln~-kLl~~~e~~~g~~i~fd~~~llr~d~~~r~~~~~~~~~~g~~T~NE~R 385 (424) T protein:vir:45 319 ------------FVRYTMMPWVTNWEQELNR-RLFTRAELAAGYYVRFNLTGLLRGTPQERAQFYHFAITDGWMSRNEAR 385 (424) T ss_pred ------------HHHHHHHHHHHHHHHHHHH-hcCChhhhcCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcCHHHHH Confidence 1222333333332221111 1111 1111122355555555667888999999999999999999976 Q ss_pred HhcCCCCH-HHHHHHHHHHH-----HhhhcccCCCCCccc Q lcl|NC_012753. 463 EKTLNVTK-EQAQEIYQKIN-----DETMVSTDSFRTSEE 496 (502) Q Consensus 463 ~~~~~~~d-eea~~el~ri~-----~E~~~~~~~~~~~~~ 496 (502) +. .|+.. +..++-+.-.. .+..+...+...+.+ T Consensus 386 ~~-~gl~pi~ggD~~~~~~n~~~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 386 AF-EDMNPVEGLDEMLVSVNAANPAGDFKPPKNDEGKTNE 424 (424) T ss_pred HH-hCCCCCCCcceeeecccccccccccCCCCCCCCCCCC Confidence 54 34432 11111111100 000000011111111 No 238 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=92.69 E-value=0.01 Score=31.47 Aligned_cols=418 Identities=10% Similarity=0.000 Sum_probs=158.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccc-cccCCHHH------HHHHHHHHHHhcCCCCccccccCCCccc-ccccee Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHP-KIAISPEE------YNRIMDNLRYFAGDFDSVTYRDSNGSQV-KRDFNH 72 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~-~~~~~~~~------~~~i~~~~~~Y~g~~~~~~~~~~~~~~~-~~~~~~ 72 (502) |+++++++.-.+.... +.+....... ....+... ..++. .+..|....+... .+... ....+. T Consensus 1 M~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~g~~~~~~~~--~g~~v~~~~a~~ 71 (466) T protein:vir:81 1 MRLIDRLLSTRGAAPR----MSIDDYAQMLNEFAFNGIGYGFGGGVPRIQ---QTLAGPSTELAPD--TFVGLATQAYQA 71 (466) T ss_pred CchhHHHhhccCcccc----cchhhhhhhhhhhhccccccccccccHHHH---HhhccccccccCc--cccccchhhhhc Confidence 9999999987653221 1111100000 00000000 00111 2222221111111 11111 112333 Q ss_pred cchHHHHHHHHhhhhhcCcceEeeCCH-----HHHHHHHHHHhh----ccHHHHHHHHHHHHhhcCCEEEEEEEeCCc-- Q lcl|NC_012753. 73 LPIGRTASKKVASLVFNEQATIRVDNE-----VADAFINETLKN----DKFSKNFERYLESCLALGGLAMRPYIDGDQ-- 141 (502) Q Consensus 73 ~n~~k~iv~~~a~~l~~ep~~i~~~d~-----~~~e~l~~~~~~----~~f~~~~~~~~~~~~~~G~~~~~~~~d~~~-- 141 (502) ++....+|+..|+-+.+=|..+--.++ .....+-.++.. .....-...++..++..|.+|+.+..++.+ T Consensus 72 ~~~v~~~i~~Ia~~ia~lp~~~~~~~~~~~~~~~~~~~~~L~~~PN~~~t~~~f~~~l~~~lll~Gnay~~i~r~~~g~l 151 (466) T protein:vir:81 72 NGPVFACMLVRQLVFSSVRFRWQRLRDGKPSDTFGSRDLQILETPWKGGTTQDMLSRMIQDADLAGNSYWTIVDGEFVRM 151 (466) T ss_pred cHHHHHHHHHHHHhhccCceEEEEecCCceeeccccHHHHHhhCCCCCCCHHHHHHHHHHHHHhcCCeEEEEEecCcccc Confidence 444566777777777666665532211 111223333332 123334455567778889999888765421 Q ss_pred --------eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCce Q lcl|NC_012753. 142 --------IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQR 213 (502) Q Consensus 142 --------~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~ 213 (502) ..+..++|+.+-+.....+.. ... | ++...+..... .... T Consensus 152 ~~~~~g~~~~l~~l~~~~v~~~~~~~~~~-~~~---------------y----~~~~~~~~~~~------------~~~~ 199 (466) T protein:vir:81 152 RPDWVDVVVEERMVRGGRGELGGGQLGWR-KVG---------------Y----LYTEGGRQSGN------------ESVG 199 (466) T ss_pred ccccCcceeEEEEecCcceEEEEcCCCce-EEE---------------E----EEEecCccccc------------ceee Confidence 233334444433321111100 000 0 00000000000 0000 Q ss_pred eeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHH Q lcl|NC_012753. 214 VPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQM 293 (502) Q Consensus 214 v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~ 293 (502) ++.. -.++|+... + ..+...|+|-+..+...|+....+-....+-|..+...=.| T Consensus 200 ~~~~------------------dviHir~~~-~--~~d~~~G~s~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gi---- 254 (466) T protein:vir:81 200 FLAE------------------DVVHFAPIP-D--PLASYRGMSWLTPILREIRADQAMSKHQAKFFDNGATVNLV---- 254 (466) T ss_pred eccc------------------cEEEEcCCC-C--cccccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceE---- Confidence 0000 012333210 0 12334688888877777754433333233334544332222 Q ss_pred hccCCCCCCcccCcccccccc-chhhcccc----CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccc Q lcl|NC_012753. 294 IKTEYDTNGEKVTVKREFETG-HNVYEQFD----SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSF 368 (502) Q Consensus 294 l~~~~~~~g~~~~~~~~~~~~-~~~~~~~~----~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~ 368 (502) |.....-+.... ..+... ...+.... ..--+.+.-++.++.....-++.+..+....+|+...|++|..+|. T Consensus 255 l~~~~~l~~e~~---~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lG~ 331 (466) T protein:vir:81 255 IKHNPMADPAAV---KKWADEVNSKHAGVDNAWKNLNLYPGADADVVGSNLQEIDFKNVRGGGETRIAAAAGVPPVIVGL 331 (466) T ss_pred EecCCCCCHHHH---HHHHHHHHHHhcCccccccceEcCCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccc Confidence 322211111000 000000 01111100 0001122235566655566778888888999999999999999986 Q ss_pred cccc-cccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCC--ccCCHHHHH Q lcl|NC_012753. 369 DGKS-MKTATEVVSEQSD-TYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDG--VFTDRNAEF 444 (502) Q Consensus 369 ~~~~-~~tAtei~~~~~~-l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~--i~~d~~~~~ 444 (502) ..+. ..|...+...... ...++.-..+.|+..|... +... .......+.|+.. +-.|..+.. T Consensus 332 ~~~~~~st~sn~eq~~~~f~~~tl~P~~~~ie~~l~~~------------L~~~--~~~~~~~~~f~~~~llr~d~~~r~ 397 (466) T protein:vir:81 332 SEGLAAATYSNYGQARRRLADGTAHPLWQNLSGCIGHV------------MPDM--GPDVRLWYDADDVPFLREDEKDAA 397 (466) T ss_pred ccCCCccccccHHHHHHHHHHHHHHHHHHHHHHHHHhh------------cCCc--ccCcceEEEecchhhhccCHHHHH Confidence 5432 2222112111111 1122233333333333321 1111 1112344556543 334544333 Q ss_pred H-------HHHHHHhcCCCCHHHHHHhcCCCCHH----HHHHHHHHHHHhhh-cccCCCCCccccCCCCC Q lcl|NC_012753. 445 D-------YWSKMVAAGFAPKTMAIEKTLNVTKE----QAQEIYQKINDETM-VSTDSFRTSEEVDIYGE 502 (502) Q Consensus 445 ~-------~~~~~~~~Gi~S~et~l~~~~~~~de----ea~~el~ri~~E~~-~~~~~~~~~~~~~~~g~ 502 (502) + ....++.+|+ .+.+++....+-+.. .-..-++-+...+. ...+..+...+++=.|- T Consensus 398 ~~~~~~~~~~~~~~~~g~-t~nE~r~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 398 DIQKVRAETINTLITAGY-EPESVVAAVNSGDLRLLKHTGLTSVQLLPPGVSASASSDTPTSGGADDNGN 466 (466) T ss_pred HHHHHHHHHHHHHHHcCC-ChhhccccccCCccccccCCCcchhhhcccccccccCCCCcccCCCCcCCC Confidence 2 2445667775 555544332211100 00000111111111 11122223344444444 No 239 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=92.62 E-value=0.011 Score=31.40 Aligned_cols=376 Identities=11% Similarity=0.056 Sum_probs=126.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+|+++++.|+.+.-...+.... ..+-.+.. ... ...+...--..+| T Consensus 1 Mg~~~~~~~~~~~~~~~~~~~~~----------------------~~~~~~~~-------~~~----~~~l~~~~v~~~v 47 (395) T protein:vir:40 1 MGFKSWVSGFFNEEQRTLNLTDT----------------------VWCSIPSE-------KLK----ELSIKKWAIDSCA 47 (395) T ss_pred CchHHHHHhhhcccccccccccc----------------------hhhccccc-------cch----hhhhhhHHHHHHH Confidence 99999999998653222211110 00000000 000 0001111123344 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPL 155 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi 155 (502) +..|+-+-.-|..+--+++....-+..+|.. | ....-....+..++..|.+|+.+. .++.. .+..+... T Consensus 48 ~~Ia~~ia~~p~~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~~~--~~~~~----~~~~~~~~ 121 (395) T protein:vir:40 48 NKIANTLSCAEVLTYEKGEEVRKKNWYMFNVEANQNQNATEFWKKAIYKLVYDNEALIFMQ--DEYIY----VADSFTKN 121 (395) T ss_pred HHHHHHHhhCceeeccCCccccchHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCceEEEEe--cCcee----ecCCcccc Confidence 5555544444544332333333334444432 1 112223334555666777775443 22221 11111110 Q ss_pred EEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC----cceeecC Q lcl|NC_012753. 156 QANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE----ETVTLNG 231 (502) Q Consensus 156 ~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~----~~~~~~~ 231 (502) ... .....+ .....++ ..+.+ ++ . .--|-|--|-.......+.. +......+. ......+ T Consensus 122 ~~~---~~~~~~--~~v~~~~---~~~~~-~~---~--~~evih~r~~~~~~~~~~~~--l~~~~~~~~~~~~~~~~~~~ 185 (395) T protein:vir:40 122 DKS---LYENTY--TEVTLKD---LTLKK-EF---K--ESEVLHLTLNNESIKSIIDG--FYLLYGDLLTAAVNKYKKLN 185 (395) T ss_pred ccc---ccccee--eeeeecC---ceeee-ee---c--cccEEEeecCCCCccccchh--HHHHHHHHHHHHHHHHHhcC Confidence 000 000000 0000000 00000 00 0 00111100100000000000 000000000 0000011 Q ss_pred CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccccc Q lcl|NC_012753. 232 LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREF 311 (502) Q Consensus 232 ~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~ 311 (502) ..++ ...++.+ -+.++ +..+.+...+++.+... .....+++| + T Consensus 186 ~~~~-~l~~~~~----------~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~~~v----l----------------- 228 (395) T protein:vir:40 186 SRKI-IVKLKAM----------FGQTP-EAEEKLRLMLSERMKKF----LAEGDSALP----V----------------- 228 (395) T ss_pred CCCc-eEEEecc----------cCCCH-HHHHHHHHHHHHHHHHh----hccCCceee----c----------------- Confidence 1111 0111000 00000 00111111111111111 001111111 0 Q ss_pred cccchhhccccCCCCccccceeeeccccchHHHHHH---HHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHH Q lcl|NC_012753. 312 ETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKA---INKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQ 388 (502) Q Consensus 312 ~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~---l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~ 388 (502) +.+..++.++.....-++.+. .+.+..+|+...|+||..++.+.+ +.++.. T Consensus 229 ---------------~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~~~s---n~e~~~-------- 282 (395) T protein:vir:40 229 ---------------EDGMEIDELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKGDTV---GLSEQV-------- 282 (395) T ss_pred ---------------CCCceEEeccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCCc---CHHHHH-------- Confidence 111124444444434444432 234467899999999998863221 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcccC-CCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc-- Q lcl|NC_012753. 389 MRNSIATLVEKSLKELVISILELAKVYNLYT-GEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT-- 465 (502) Q Consensus 389 ~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~-~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~-- 465 (502) ...++.+|..++..|..-.+. .++. ........+.++++.-+-.|..+.++...+++.+|+++.-++++.. T Consensus 283 -----~~f~~~~L~P~~~~ie~~l~~-kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~~g~ 356 (395) T protein:vir:40 283 -----NSFLMFSINPIAEMFTDEGNR-KFYGRDSVLERTYMKLDTTRIKVQDIQEIASSMDVLFHIGVNTIDDNLRMIGR 356 (395) T ss_pred -----HHHHHHHHHHHHHHHHHHHHH-hcCChhhhcCCceEEEechhhhccCHHHHHHHHHHHHhCCCCCHHHHHHHhCC Confidence 112333444444333222211 1111 1112223466666676778888999999999999999999876653 Q ss_pred CCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 466 LNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 466 ~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +|+...+.++-.-. .-............+++-.++ T Consensus 357 ~pi~~~~gD~~~~~--~n~~~~~~~~~~~kgge~~~~ 391 (395) T protein:vir:40 357 EPVMSPETQERFVT--KNYAPLGENEEDLKGGDINEN 391 (395) T ss_pred CCCCCCCCceeeec--cccccccccccccCCCCCCCC Confidence 23432222211110 000000111111222333333 No 240 >protein:vir:345 Length: 663 # NCBI annotation: virion structural protein # Family: family:all:3199 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203459;genbank:gi:15320615;genbank:GeneID:921720 Probab=92.45 E-value=0.011 Score=31.25 Aligned_cols=443 Identities=11% Similarity=0.037 Sum_probs=194.6 Q ss_pred CCh----------hHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccc Q lcl|NC_012753. 1 MGI----------IQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDF 70 (502) Q Consensus 1 m~~----------~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~ 70 (502) |+= -+.-|.|++++-...-+ -...-.+.+.+.+.|.|..+.-+. ....-|. T Consensus 1 m~~~~~~~~~~tpe~la~~W~~~I~~a~~~--------------~~~~h~r~~~~~k~y~~~~~~~~~-----~~~r~nl 61 (663) T protein:vir:34 1 MNESQPTDFADTPQGWAQRWQEEMSAAREP--------------LEKWHTQGKEIVKRYRDERDSAHD-----AETRWNL 61 (663) T ss_pred CCccccccchhcchhHHHHHHHHHHHHHhc--------------cchHHHHHHHHHHHhhccccCCCc-----cccccch Confidence 221 12355666553321111 113335778888888875433221 1122355 Q ss_pred eecchHHHHHHHHhhhhhcCcceEeeC------C----HHHHHHHHHHH------hhccHHHHHHHHHHHHhhcCCEEEE Q lcl|NC_012753. 71 NHLPIGRTASKKVASLVFNEQATIRVD------N----EVADAFINETL------KNDKFSKNFERYLESCLALGGLAMR 134 (502) Q Consensus 71 ~~~n~~k~iv~~~a~~l~~ep~~i~~~------d----~~~~e~l~~~~------~~~~f~~~~~~~~~~~~~~G~~~~~ 134 (502) +|.|+-..+-...| .+|.++|. + ..+.+.+.+.+ ++++|...+...+.+++..|.+.++ T Consensus 62 ~~sni~~i~P~iYa-----r~P~p~V~~rf~d~d~~~~r~ase~leR~~~~~~~~D~~~l~~~~~~~v~d~ll~~rG~~~ 136 (663) T protein:vir:34 62 FSTNIQTQMASLYG-----QTPKVSVSRRFADADDDVARVASELLERLLNTDIEKDSDTFQQALEYALQDRLLPGFGLCR 136 (663) T ss_pred hhhhHHHHhhhhhc-----CCCcceeeecccCcccchhhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhhccccceEE Confidence 66565554444444 46666552 2 23456666665 5567999999999999999988888 Q ss_pred EEEeC------------------C--------------ceEEEEEcCCeEEEE-EEcCCCeEEEEEEEEEEEee-----C Q lcl|NC_012753. 135 PYIDG------------------D--------------QIRVSFVQATVFFPL-QANTQDVSSAAIVTKSTKTE-----G 176 (502) Q Consensus 135 ~~~d~------------------~--------------~~~i~~v~~~~~~Pi-~~d~~~~~~~~~~~~~~~~~-----~ 176 (502) +.|.. + +++|.+|.-..|+-- ......+.-+++....++.+ + T Consensus 137 v~Ye~~~~~~~~~~~~~D~~~~~~~a~~~~~~e~~a~E~v~id~v~~~dfl~~pAr~W~ev~wva~r~~mtk~e~~~rf~ 216 (663) T protein:vir:34 137 IRYEVEWEEVAGVDAILDEATGAELAAAVPPTQRKAYECVETDYLHWQDVLWSPARVWHEVRWLAFRNLLDMREFNARFD 216 (663) T ss_pred EEeecccchhccccccCCCccccchhcccccchhhcccceeeeeechhhcccchhhccccccceeeeccCCHHHHHHhhc Confidence 76621 0 345555544444210 00111121122211110000 0 Q ss_pred CCc-----------------------eE---EEEEEEEEEeCCeEEEEEEEEe-cCCccccCceeeccccccCCCcceee Q lcl|NC_012753. 177 QKV-----------------------KY---YSLIEFHEWNKETYTISNELYE-SESKTIIGQRVPLSTLYEDLEETVTL 229 (502) Q Consensus 177 ~~~-----------------------~~---yt~~E~h~~~~~~~~I~~~l~~-~~~~~~lG~~v~l~~~~~~l~~~~~~ 229 (502) .++ .. --..|. |+..+ |++|- .+ |-.+.|.. -++...+ T Consensus 217 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~a~VwEI--WdK~~----~~V~w~~e-----g~~~~L~~----~~p~lgl 281 (663) T protein:vir:34 217 ADGSRNLWASVPKVGKPKDGKDGQSCHPWDRAEVWEI--WDKGG----RKVDWYVE-----GYSAVLDT----QPDPLGL 281 (663) T ss_pred CChhhhhhhhccCcCCccccCCCCCcchhcCcceeEE--EecCC----cEEEEEEc-----Ccceeccc----CCCCCCC Confidence 000 00 000111 11111 01110 01 11111111 0111222 Q ss_pred cCC---CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC Q lcl|NC_012753. 230 NGL---TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT 306 (502) Q Consensus 230 ~~~---~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~ 306 (502) .|+ ++|++.+. ..++-..+++|.-++++++++|.+-.++.-=.+..+.+-+.|.+. |.... T Consensus 282 ~~ffPcPrpl~~~~--------~~ds~ipvpd~~~y~~~~~E~n~~t~Rin~l~d~ikv~gvy~~~~--------g~~i~ 345 (663) T protein:vir:34 282 ESFFPCPKPLLANW--------TTDKVVPRPDFVLAQDLYKEIDLVSTRITLLERAIRVVGVYDKSS--------GLTIG 345 (663) T ss_pred CCCCCCccccccee--------cCCCeecCCcHHHHHHHHHHHHHHHHHHHHHHhhhhhceeecccc--------chhHH Confidence 333 44443332 223556789999999999999976655532224445555544221 11000 Q ss_pred ccccc-cccchhhccc-----cCCCCccccceeeeccccchHHHH---HHHHHHHHHHHHhcCCChhhccccccccccHH Q lcl|NC_012753. 307 VKREF-ETGHNVYEQF-----DSGDMDKGIGITDLTTDIRSDDYI---KAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT 377 (502) Q Consensus 307 ~~~~~-~~~~~~~~~~-----~~~~~~~~~~i~~~~~~ir~e~~~---~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt 377 (502) . .+ ...++....+ ..+.++....|..+..+--+.... ..-..+...+.+.+|++.-.=| ...-.+||| T Consensus 346 ~--~l~~a~~n~lvpV~~~~~~~~~gg~~k~I~~~pi~~~~~aI~~l~~~r~qir~d~~qITGiaDi~Rg-a~~a~ETat 422 (663) T protein:vir:34 346 R--LLSEAAQNDLIPVENWLTFADKGGLRGVVDWFPLEPVVAALTSLRDYRRELVDALHQVTGMADIMRG-ASDPRETAM 422 (663) T ss_pred H--HHHHhhCCCceecchhhhhhhhcCccchhhcccchhHHHHHHHHHHHHHHHHHHHHHHHhHHHHhhc-ccCcchhhH Confidence 0 00 0000111111 111122223344333332222222 2223455567888898843322 223345777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------hcccCCCcc-----------------cccceEEEeC Q lcl|NC_012753. 378 EVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKV-------YNLYTGEIP-----------------TMDEVSVDLD 433 (502) Q Consensus 378 ei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~-------~~~~~~~~~-----------------~~~~i~v~f~ 433 (502) |-..+.+-+-.++..++.++++..+++++..-++..- ..+.+...+ ....+.|.=+ T Consensus 423 AQ~IKsq~gS~RIqe~qdevqR~arDi~ql~AEIl~~~~~~etl~~m~~~elp~~~ei~~~~~~L~n~~~r~~~ldIe~d 502 (663) T protein:vir:34 423 AQGVKAKFGSIRLQRLQDEVARFASDIQRLKAEVIAEHYDVASILAQANAEFTFDKELAPKAAELIKSRFSMYRVEVKPE 502 (663) T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHhcCCCCcccchhHHHHHHhcCCCcceeeeeccC Confidence 7666667788999999999999999999877655331 112222222 2234455556 Q ss_pred CCccCCHHHHHHHHHHHHhc--------------CCCCHHHHHHhc-----CCCCH-HHHHHHHHHHH--HhhhcccCCC Q lcl|NC_012753. 434 DGVFTDRNAEFDYWSKMVAA--------------GFAPKTMAIEKT-----LNVTK-EQAQEIYQKIN--DETMVSTDSF 491 (502) Q Consensus 434 d~i~~d~~~~~~~~~~~~~~--------------Gi~S~et~l~~~-----~~~~d-eea~~el~ri~--~E~~~~~~~~ 491 (502) -.+..|+.++.+..++.+.+ +-+.... +.++ -++.. .+++.-++++. .|+++..+.. T Consensus 503 sT~~~D~~~eK~~~~E~l~~i~~~~qq~~pl~~q~p~~~p~-l~Ellk~~~~~f~~~~qie~ai~~~~~~~e~aa~~~~~ 581 (663) T protein:vir:34 503 AVSLQDFAALRNEKMEVLSGIASFMQGVAPLAQQVPGSAPF-LLQMLKWSVSGLRGSSTIEGVLDKAIAAAEEAQKQAAQ 581 (663) T ss_pred CCCcCChHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHH-HHHHHHHHhhcCChhhhHHHHHHHHHhhhHHHhhccCC Confidence 67778887766666554422 1011111 1111 11111 11222222222 2333332222 Q ss_pred CCccc--cCC--CCC Q lcl|NC_012753. 492 RTSEE--VDI--YGE 502 (502) Q Consensus 492 ~~~~~--~~~--~g~ 502 (502) +.+.. .+. .++ T Consensus 582 ~~pa~~~~~~k~~~~ 596 (663) T protein:vir:34 582 QSPAPQQPDPKVVAQ 596 (663) T ss_pred CCcccchhhHHHHHH Confidence 22221 111 112 No 241 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=92.38 E-value=0.012 Score=31.19 Aligned_cols=408 Identities=11% Similarity=0.082 Sum_probs=159.9 Q ss_pred CChh-HH----HH-HHHHHHhhcccccchhhh-hccccccCCHHHHHHHHHHHHHhcCCCCccccccCCC-cccccccee Q lcl|NC_012753. 1 MGII-QT----IK-NFIKRSNYVITNQSLNSI-TDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNG-SQVKRDFNH 72 (502) Q Consensus 1 m~~~-~~----ik-~~i~~~~~~~~~~~l~~i-~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~-~~~~~~~~~ 72 (502) |+-| +. |+ .-+++. ....--.+.++ ..|+---+++.++.+|-.... .| +...+..... ...++.. T Consensus 1 ~~~~~d~~g~p~~~~~~~~~-~~~~~~~~~~~~~~~~~~gltp~~l~~il~~a~--~g--d~~~~~~L~edm~e~D~~-- 73 (526) T protein:vir:79 1 MAQIVDVYGNPIRPQQLREP-QTSRLAGLAKEFAQHPAKGLTPAKLARILVEAE--QG--NLQAQAELFMDMEERDAH-- 73 (526) T ss_pred CCeeeCCCCCccCccccchh-hhhhhhhhhhhcccCCCCCcCHHHHHHHHHHhh--CC--CHHHHHHHHHHHHhhChH-- Confidence 3211 10 00 000000 00000001111 122222456666665544321 11 1111000000 0000111 Q ss_pred cchHHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCCEEEEEEEeC--Cce- Q lcl|NC_012753. 73 LPIGRTASKKVASLVFNEQATIRVD------NEVADAFINETLKND-KFSKNFERYLESCLALGGLAMRPYIDG--DQI- 142 (502) Q Consensus 73 ~n~~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~~~~~-~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~- 142 (502) + .-...+-..-+++.+..|.-. ++...+++++++.+- +|...+..++. |..+|-+++-+.|+. |.. T Consensus 74 --i-~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~~~~~i~~~ld-A~~~G~s~~Ei~w~~~~g~~~ 149 (526) T protein:vir:79 74 --L-FAEMSKRKRAILGLDWAVEPPRNASAAEKADADYLHELLLDLEGLEDLLLDALD-GIGHGYSCIELEWALQGREWM 149 (526) T ss_pred --H-HHHHHHHHHHHhCCCceEecCCCCChHHHHHHHHHHHHHhcccCHHHHHHHHHh-hhhhcceeEEEEEeecCCcee Confidence 1 223334445566667776542 235667888888753 58887776665 888998888777753 322 Q ss_pred --EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccc Q lcl|NC_012753. 143 --RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLY 220 (502) Q Consensus 143 --~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~ 220 (502) ++.+.++..|.. +..+ ..... ++ +...-|.++| T Consensus 150 ~~~l~~r~~~~F~~---~~~~--------------------------------~~~l~---~~--~~~~~g~~l~----- 184 (526) T protein:vir:79 150 PLAFHHRPQSWFQL---NPED--------------------------------QNELR---LR--DNSPAGEALQ----- 184 (526) T ss_pred EEEeeeecccceEe---ccCC--------------------------------CcEEE---ec--CCCCCceeec----- Confidence 233333322110 0000 00000 00 0000011111 Q ss_pred cCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-hccceeeechHHhccCCC Q lcl|NC_012753. 221 EDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-MGQRRVAVPTQMIKTEYD 299 (502) Q Consensus 221 ~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~~~~~i~v~~~~l~~~~~ 299 (502) +. -|++++. ....++|+|.|.+..+.-..--=+..+..|+.=++ .|....+. + ++. T Consensus 185 ----~~---------k~iv~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~F~E~yG~P~~ig-----k-y~~ 241 (526) T protein:vir:79 185 ----PF---------GWIIHRP----RARSGYVARSGLFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLG-----K-YPP 241 (526) T ss_pred ----CC---------ceEEEee----cCCcCCccccchHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEE-----e-cCC Confidence 00 1222221 22346778888877765544333333344443233 23322221 1 222 Q ss_pred CCCcccCccccccccchhhccccCCCC---ccccceeeecc-ccchHHHHHHHHHHHHHHHHhcCCChhhccccc--ccc Q lcl|NC_012753. 300 TNGEKVTVKREFETGHNVYEQFDSGDM---DKGIGITDLTT-DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDG--KSM 373 (502) Q Consensus 300 ~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~--~~~ 373 (502) +.... ....+ .+....+..+.. ..+.-|+.++. .-..+.|.+.++.+-++|+..+ ++...-++.+ ++. T Consensus 242 ~a~~~--ek~~L---~~av~~i~~da~~iiP~~~~ie~~ea~~~~~~~f~~li~~~d~~Isk~i-LGqtlTs~~~~g~~g 315 (526) T protein:vir:79 242 GTADE--EKATL---LRAVTGLGHAAAGIIPETMAIDFQQAAQGSSEPFLAMMRQSEDAISKAV-LGGTLTSTTSQSGGG 315 (526) T ss_pred CCCHH--HHHHH---HHHHHHHhcCcEEEecCCceeEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhccccccCcch Confidence 11110 00000 011111111100 01112333332 2233446666666666665554 2322112111 111 Q ss_pred ccH-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHH Q lcl|NC_012753. 374 KTA-TEVVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMV 451 (502) Q Consensus 374 ~tA-tei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~ 451 (502) +-| .++. ..-....+..-.+.+...|+ +|++.++.+ |.. +......-+.+.|+..-+.|..+.++.+.+++ T Consensus 316 S~a~g~vh--~~v~~di~~aDa~~i~~tln~~Li~~l~~~----N~~-~~~~~~~~p~~~~~~~e~eDl~~~a~~~~~L~ 388 (526) T protein:vir:79 316 AFALGQVH--NEVRHDILASDARQLAATLSRDLLWPLLVL----NRP-GSPDVRRAPRLVFDLREQADITSMAQSIPALV 388 (526) T ss_pred hhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCC-CcCCccccceEEeCCCCcccHHHHHHHHHHHH Confidence 111 1121 11223344445566777775 587776654 221 11122234677888888889889999999999 Q ss_pred hcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 452 AAGFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 452 ~~Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ..|+--.+.++.+.+|++..+-.+.+- ..+..+..+....+...--+.+ T Consensus 389 ~~G~~i~~~~i~e~~gip~~~~~e~~l--~~~~~~~~~~~~~~~~~~~~~~ 437 (526) T protein:vir:79 389 NVGLEIPSAWVYDKLGIPQPAKNEPVL--RPAAQPAILSRQHGQRVAALAT 437 (526) T ss_pred hCCCcCCHHHHHHHhCCCCCCCchhhc--cccCCccccccccccccccccc Confidence 999844445678878875422111111 1111111111101110000100 No 242 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=92.34 E-value=0.012 Score=31.16 Aligned_cols=367 Identities=12% Similarity=0.098 Sum_probs=129.0 Q ss_pred cch-hhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEeeCCH Q lcl|NC_012753. 21 QSL-NSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNE 99 (502) Q Consensus 21 ~~l-~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~ 99 (502) |.| ..+.....-. ..+..+..+. .. .....+....-..+|+..|+-+..-|..+--+++ T Consensus 1 Mg~f~~lf~~~~~~---------~~~~~~~~~~-------~v----~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~ 60 (395) T protein:vir:10 1 MSILEKIFKTRKDI---------TYMLDLDMIE-------DL----SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR 60 (395) T ss_pred CchhhhhhccCccc---------cccccchhcc-------cc----chhhhhhhHHHHHHHHHHHHhhccceeEeccCCc Confidence 222 2222211100 0000000010 00 0011222233345566666666555554433333 Q ss_pred HHHHHHHHHHhh--ccH--HHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee Q lcl|NC_012753. 100 VADAFINETLKN--DKF--SKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE 175 (502) Q Consensus 100 ~~~e~l~~~~~~--~~f--~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~ 175 (502) .....+..+|.. |.+ ...|.+.+...+.+|+.++.+..+++++.. +++-..-|. .. . T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~~~--~~~~~~~~~----------~~-------~ 121 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKELLI--ADSFYREEY----------AL-------Y 121 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCeEe--cCCccceeE----------ee-------c Confidence 222233333321 211 123333344444556655544333333211 111110110 00 0 Q ss_pred CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCC Q lcl|NC_012753. 176 GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLG 255 (502) Q Consensus 176 ~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G 255 (502) +.. +..+.. . .+.+.. .++-.+ +++++.+.... ..+| T Consensus 122 ~~~---~~~~~~---~--~~~~~~-------------~~~~~e------------------vih~~~~~~~~----~~~G 158 (395) T protein:vir:10 122 DDI---FKDVTV---K--DYTYQR-------------TFTMQE------------------VIYLKYNNNKV----THFV 158 (395) T ss_pred Ccc---eeEEEE---c--Cceeee-------------eecccc------------------EEEEccCCCCc----cccc Confidence 000 000000 0 000000 011000 11122111111 1246 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHhhccceeee--chHHhccCCCCCCcccCccccccccc-hhhccccCCCC-----c Q lcl|NC_012753. 256 LSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV--PTQMIKTEYDTNGEKVTVKREFETGH-NVYEQFDSGDM-----D 327 (502) Q Consensus 256 ~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v--~~~~l~~~~~~~g~~~~~~~~~~~~~-~~~~~~~~~~~-----~ 327 (502) .|.++.+..+++... ..+. ..+..+-++ +...+ ..... ..+.... ..+........ + T Consensus 159 ~spi~~~~~~~~~~~---~~~~---~~~~~~gii~~~~~~~------~~e~~---~~~~~~~~~~~~~~~~~~~~v~~l~ 223 (395) T protein:vir:10 159 ESLFEDYGKIFGRMI---GAQL---KNYQIRGILKSASSAY------DEKNI---EKLQAFTNKLFNTFNKNQLAIAPLI 223 (395) T ss_pred chHHHHHHHHHHHHH---HHHH---hcCCCceEEEeCCCCC------CHHHH---HHHHHHHHHHhccccccCcceEEcC Confidence 666666555554332 2222 233332222 11111 00000 0000000 01111000000 0 Q ss_pred cccceeeecc-----ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 328 KGIGITDLTT-----DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLK 402 (502) Q Consensus 328 ~~~~i~~~~~-----~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~ 402 (502) .+..++.++. +....++.+..+...++|+...|+||..++... +++++....+. ..++.-....++..|. T Consensus 224 ~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~---sn~e~~~~~~~--~~~l~P~~~~ie~~l~ 298 (395) T protein:vir:10 224 EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGET---ADLEKNTLVFE--KFCLTPLLKKIQNELN 298 (395) T ss_pred CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcc---cCHHHHHHHHH--HHHHHHHHHHHHHHHH Confidence 1111222221 223446888888889999999999999886322 22222211110 1112222222222222 Q ss_pred HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHH---- Q lcl|NC_012753. 403 ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEI---- 476 (502) Q Consensus 403 ~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~e---- 476 (502) .- ++... .....+.|+++.-+-.|..+.++...+++.+|+++.-++++.. +++.+.++++- T Consensus 299 ~k------------L~~~~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:10 299 AK------------LITQS-MYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred Hh------------hcChh-hhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 10 11100 0111234556666677888899999999999999999876653 34433222111 Q ss_pred -HHHHHH-hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 -YQKIND-ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 -l~ri~~-E~~~~~~~~~~~~~~~~~g~ 502 (502) +..++. +.....+....+.+++-.|. T Consensus 366 n~~~~~~~~~~~~~~~~~~~kgg~~~~~ 393 (395) T protein:vir:10 366 NYEKANSGENDEKEKDENTLKGGDEDES 393 (395) T ss_pred ccccccccccccCcccccccCCCCCCCC Confidence 111110 11111222223333333333 No 243 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=92.34 E-value=0.012 Score=31.16 Aligned_cols=367 Identities=12% Similarity=0.098 Sum_probs=129.0 Q ss_pred cch-hhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEeeCCH Q lcl|NC_012753. 21 QSL-NSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNE 99 (502) Q Consensus 21 ~~l-~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~ 99 (502) |.| ..+.....-. ..+..+..+. .. .....+....-..+|+..|+-+..-|..+--+++ T Consensus 1 Mg~f~~lf~~~~~~---------~~~~~~~~~~-------~v----~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~ 60 (395) T protein:vir:95 1 MSILEKIFKTRKDI---------TYMLDLDMIE-------DL----SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR 60 (395) T ss_pred CchhhhhhccCccc---------cccccchhcc-------cc----chhhhhhhHHHHHHHHHHHHhhccceeEeccCCc Confidence 222 2222211100 0000000010 00 0011222233345566666666555554433333 Q ss_pred HHHHHHHHHHhh--ccH--HHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee Q lcl|NC_012753. 100 VADAFINETLKN--DKF--SKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE 175 (502) Q Consensus 100 ~~~e~l~~~~~~--~~f--~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~ 175 (502) .....+..+|.. |.+ ...|.+.+...+.+|+.++.+..+++++.. +++-..-|. .. . T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~~~--~~~~~~~~~----------~~-------~ 121 (395) T protein:vir:95 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKELLI--ADSFYREEY----------AL-------Y 121 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCeEe--cCCccceeE----------ee-------c Confidence 222233333321 211 123333344444556655544333333211 111110110 00 0 Q ss_pred CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCC Q lcl|NC_012753. 176 GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLG 255 (502) Q Consensus 176 ~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G 255 (502) +.. +..+.. . .+.+.. .++-.+ +++++.+.... ..+| T Consensus 122 ~~~---~~~~~~---~--~~~~~~-------------~~~~~e------------------vih~~~~~~~~----~~~G 158 (395) T protein:vir:95 122 DDI---FKDVTV---K--DYTYQR-------------TFTMQE------------------VIYLKYNNNKV----THFV 158 (395) T ss_pred Ccc---eeEEEE---c--Cceeee-------------eecccc------------------EEEEccCCCCc----cccc Confidence 000 000000 0 000000 011000 11122111111 1246 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHhhccceeee--chHHhccCCCCCCcccCccccccccc-hhhccccCCCC-----c Q lcl|NC_012753. 256 LSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV--PTQMIKTEYDTNGEKVTVKREFETGH-NVYEQFDSGDM-----D 327 (502) Q Consensus 256 ~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v--~~~~l~~~~~~~g~~~~~~~~~~~~~-~~~~~~~~~~~-----~ 327 (502) .|.++.+..+++... ..+. ..+..+-++ +...+ ..... ..+.... ..+........ + T Consensus 159 ~spi~~~~~~~~~~~---~~~~---~~~~~~gii~~~~~~~------~~e~~---~~~~~~~~~~~~~~~~~~~~v~~l~ 223 (395) T protein:vir:95 159 ESLFEDYGKIFGRMI---GAQL---KNYQIRGILKSASSAY------DEKNI---EKLQAFTNKLFNTFNKNQLAIAPLI 223 (395) T ss_pred chHHHHHHHHHHHHH---HHHH---hcCCCceEEEeCCCCC------CHHHH---HHHHHHHHHHhccccccCcceEEcC Confidence 666666555554332 2222 233332222 11111 00000 0000000 01111000000 0 Q ss_pred cccceeeecc-----ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 328 KGIGITDLTT-----DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLK 402 (502) Q Consensus 328 ~~~~i~~~~~-----~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~ 402 (502) .+..++.++. +....++.+..+...++|+...|+||..++... +++++....+. ..++.-....++..|. T Consensus 224 ~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~---sn~e~~~~~~~--~~~l~P~~~~ie~~l~ 298 (395) T protein:vir:95 224 EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGET---ADLEKNTLVFE--KFCLTPLLKKIQNELN 298 (395) T ss_pred CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcc---cCHHHHHHHHH--HHHHHHHHHHHHHHHH Confidence 1111222221 223446888888889999999999999886322 22222211110 1112222222222222 Q ss_pred HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHH---- Q lcl|NC_012753. 403 ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEI---- 476 (502) Q Consensus 403 ~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~e---- 476 (502) .- ++... .....+.|+++.-+-.|..+.++...+++.+|+++.-++++.. +++.+.++++- T Consensus 299 ~k------------L~~~~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:95 299 AK------------LITQS-MYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred Hh------------hcChh-hhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 10 11100 0111234556666677888899999999999999999876653 34433222111 Q ss_pred -HHHHHH-hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 -YQKIND-ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 -l~ri~~-E~~~~~~~~~~~~~~~~~g~ 502 (502) +..++. +.....+....+.+++-.|. T Consensus 366 n~~~~~~~~~~~~~~~~~~~kgg~~~~~ 393 (395) T protein:vir:95 366 NYEKANSGENDEKEKDENTLKGGDEDES 393 (395) T ss_pred ccccccccccccCcccccccCCCCCCCC Confidence 111110 11111222223333333333 No 244 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=92.34 E-value=0.012 Score=31.16 Aligned_cols=367 Identities=12% Similarity=0.098 Sum_probs=129.0 Q ss_pred cch-hhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEeeCCH Q lcl|NC_012753. 21 QSL-NSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNE 99 (502) Q Consensus 21 ~~l-~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~ 99 (502) |.| ..+.....-. ..+..+..+. .. .....+....-..+|+..|+-+..-|..+--+++ T Consensus 1 Mg~f~~lf~~~~~~---------~~~~~~~~~~-------~v----~~~~~~~~~~v~~~i~~Ia~~iA~~p~~~~~~~~ 60 (395) T protein:vir:10 1 MSILEKIFKTRKDI---------TYMLDLDMIE-------DL----SQQAYVKRLAIDSCIEFVARAVAQSHFKVLEGNR 60 (395) T ss_pred CchhhhhhccCccc---------cccccchhcc-------cc----chhhhhhhHHHHHHHHHHHHhhccceeEeccCCc Confidence 222 2222211100 0000000010 00 0011222233345566666666555554433333 Q ss_pred HHHHHHHHHHhh--ccH--HHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEee Q lcl|NC_012753. 100 VADAFINETLKN--DKF--SKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTE 175 (502) Q Consensus 100 ~~~e~l~~~~~~--~~f--~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~ 175 (502) .....+..+|.. |.+ ...|.+.+...+.+|+.++.+..+++++.. +++-..-|. .. . T Consensus 61 ~~~~~~~~ll~~~PN~~~t~~~f~~~~~~~lll~g~~~~~~~~~~~~~~--~~~~~~~~~----------~~-------~ 121 (395) T protein:vir:10 61 IQKNDVYYKLNIKPNTDLSSDSFWQQVIYKLIYDNEVLIVVSDSKELLI--ADSFYREEY----------AL-------Y 121 (395) T ss_pred cccchHHHHHHhccCcCCCHHHHHHHHHHHHhhCCceEEEEecCCCeEe--cCCccceeE----------ee-------c Confidence 222233333321 211 123333344444556655544333333211 111110110 00 0 Q ss_pred CCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCC Q lcl|NC_012753. 176 GQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLG 255 (502) Q Consensus 176 ~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G 255 (502) +.. +..+.. . .+.+.. .++-.+ +++++.+.... ..+| T Consensus 122 ~~~---~~~~~~---~--~~~~~~-------------~~~~~e------------------vih~~~~~~~~----~~~G 158 (395) T protein:vir:10 122 DDI---FKDVTV---K--DYTYQR-------------TFTMQE------------------VIYLKYNNNKV----THFV 158 (395) T ss_pred Ccc---eeEEEE---c--Cceeee-------------eecccc------------------EEEEccCCCCc----cccc Confidence 000 000000 0 000000 011000 11122111111 1246 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHhhccceeee--chHHhccCCCCCCcccCccccccccc-hhhccccCCCC-----c Q lcl|NC_012753. 256 LSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAV--PTQMIKTEYDTNGEKVTVKREFETGH-NVYEQFDSGDM-----D 327 (502) Q Consensus 256 ~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v--~~~~l~~~~~~~g~~~~~~~~~~~~~-~~~~~~~~~~~-----~ 327 (502) .|.++.+..+++... ..+. ..+..+-++ +...+ ..... ..+.... ..+........ + T Consensus 159 ~spi~~~~~~~~~~~---~~~~---~~~~~~gii~~~~~~~------~~e~~---~~~~~~~~~~~~~~~~~~~~v~~l~ 223 (395) T protein:vir:10 159 ESLFEDYGKIFGRMI---GAQL---KNYQIRGILKSASSAY------DEKNI---EKLQAFTNKLFNTFNKNQLAIAPLI 223 (395) T ss_pred chHHHHHHHHHHHHH---HHHH---hcCCCceEEEeCCCCC------CHHHH---HHHHHHHHHHhccccccCcceEEcC Confidence 666666555554332 2222 233332222 11111 00000 0000000 01111000000 0 Q ss_pred cccceeeecc-----ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 328 KGIGITDLTT-----DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVEKSLK 402 (502) Q Consensus 328 ~~~~i~~~~~-----~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~~~l~ 402 (502) .+..++.++. +....++.+..+...++|+...|+||..++... +++++....+. ..++.-....++..|. T Consensus 224 ~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~~~---sn~e~~~~~~~--~~~l~P~~~~ie~~l~ 298 (395) T protein:vir:10 224 EGFDYEELSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYGET---ADLEKNTLVFE--KFCLTPLLKKIQNELN 298 (395) T ss_pred CCceeeeccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcCcc---cCHHHHHHHHH--HHHHHHHHHHHHHHHH Confidence 1111222221 223446888888889999999999999886322 22222211110 1112222222222222 Q ss_pred HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHH---- Q lcl|NC_012753. 403 ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEI---- 476 (502) Q Consensus 403 ~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~e---- 476 (502) .- ++... .....+.|+++.-+-.|..+.++...+++.+|+++.-++++.. +++.+.++++- T Consensus 299 ~k------------L~~~~-~~~~~~~f~~~~l~~~D~~~~~~~~~~~~~~G~lt~NE~R~~~g~~p~~~g~~d~~~~~~ 365 (395) T protein:vir:10 299 AK------------LITQS-MYLKDTRIEIVGVNKKDPLQYAEAIDKLVSSGSFTRNEVRIMLGEEPSDNPELDEYLITK 365 (395) T ss_pred Hh------------hcChh-hhcccceecchhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCceeeecc Confidence 10 11100 0111234556666677888899999999999999999876653 34433222111 Q ss_pred -HHHHHH-hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 -YQKIND-ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 -l~ri~~-E~~~~~~~~~~~~~~~~~g~ 502 (502) +..++. +.....+....+.+++-.|. T Consensus 366 n~~~~~~~~~~~~~~~~~~~kgg~~~~~ 393 (395) T protein:vir:10 366 NYEKANSGENDEKEKDENTLKGGDEDES 393 (395) T ss_pred ccccccccccccCcccccccCCCCCCCC Confidence 111110 11111222223333333333 No 245 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=91.82 E-value=0.014 Score=30.73 Aligned_cols=383 Identities=13% Similarity=0.123 Sum_probs=163.5 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+... +...+..... ....+++-.-+.+.....|-... .|.... |... . .+.. + .-.+ T Consensus 15 ~~~~~--~~~~~~ia~~------~~~~~~~~~~~~~~~~~~iLr~~---~~~~~~--y~~m---~-~D~~----i-~s~l 72 (491) T protein:vir:10 15 FGEPD--KSLSSQIATR------ARSIDFFALGMYLPNPDPVLKAL---GKDIRV--YREL---R-ADAH----V-GGCV 72 (491) T ss_pred cccCC--hHHHHHHHhh------hcccccccccCCccchHHHHHhc---CCCHHH--HHHH---h-hChH----H-HHHH Confidence 22211 1222221110 01122222233344444442211 010000 0000 0 0111 1 2334 Q ss_pred HHHhhhhhcCcceEee--CCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEe--CCce---EEEEEcCCeEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRV--DNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYID--GDQI---RVSFVQATVFF 153 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~--~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d--~~~~---~i~~v~~~~~~ 153 (502) ++-..-+++.+..|.. +++...+++.+++++-.|...+..++ +|..+|-+++-+.|. ++.. ++.++++..|. T Consensus 73 ~~Rk~av~~~~w~i~~~~~~~~~~e~v~e~l~~~~~~~~l~~~l-da~~~G~s~~Ei~w~~~~g~~~~~~l~~r~~~~f~ 151 (491) T protein:vir:10 73 RRRKAAVKALEWGLDRGKAKSRVAKSIADVFADLDLSRIVTEML-DAVLYGYQPMEITWGKVGNYIVPIDVVGKPADWFV 151 (491) T ss_pred HHHHHHHhCCCcEEecCCCCHHHHHHHHHHHhcCCHHHHHHHHH-HhhhhcceeEEEEEeecCCeeEEEEeeeeccccee Confidence 4445556677777764 34557789999998888999888776 588899888877774 3332 34444554332 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) + +..+.. + ++..+...-|.++| + T Consensus 152 ~---d~~~~l--------------------------------~-----~~~~~~~~~g~~l~---------~-------- 174 (491) T protein:vir:10 152 Y---DPENQL--------------------------------R-----FRSKDHWMQGEELP---------A-------- 174 (491) T ss_pred e---ccCCce--------------------------------E-----EecCCCCCCcceec---------C-------- Confidence 1 111100 0 00000000011111 1 Q ss_pred cceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCCCCCcccCcccccc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYDTNGEKVTVKREFE 312 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~ 312 (502) .-|+.++. ....++|+|.|.+..+.-..---+..+..|+.=.+- |...++. + ++.+.... ....+ T Consensus 175 -~k~i~~~~----~~~~~~p~g~gLl~~~~w~~~fK~~~~~~w~~f~E~yG~P~~ig-----k-y~~~a~~~--ek~~l- 240 (491) T protein:vir:10 175 -RKFLVPRQ----EATYLNPYGFPDLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVG-----K-HPRSASDG--EKNLL- 240 (491) T ss_pred -CCEEEEEe----cCCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEE-----e-cCCCCCHH--HHHHH- Confidence 01333332 123457889999988877665555555555543332 3332222 2 22211110 00000 Q ss_pred ccchhhccccCCCC---ccccceeeecccc---chHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHH Q lcl|NC_012753. 313 TGHNVYEQFDSGDM---DKGIGITDLTTDI---RSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDT 386 (502) Q Consensus 313 ~~~~~~~~~~~~~~---~~~~~i~~~~~~i---r~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l 386 (502) .+....+..+.. ..+..|+.++..- ..+-|.+.++.+-++|+..+ ++ ++++-+++|...+.++... -. T Consensus 241 --~~al~~~~~~a~~viP~~~~ie~~ea~~~~g~~~~y~~li~~~d~~Isk~i-LG-qtlTt~~~gs~a~~~vh~~--v~ 314 (491) T protein:vir:10 241 --LDCLEDMVQDAVAVVPDDSSIEIKEAAGKTGSADVYERLLHFCRGEVSIAL-LG-QNQTTEATSTRASAQAGLE--VT 314 (491) T ss_pred --HHHHHHHhcCcEEEecCCceeEEEecCCCCCChhHHHHHHHHHHHHHHHHH-hh-hhcccCcccchhHHHHHHH--HH Confidence 011111111100 0112344443321 12235555555555554433 12 2222222222222223211 22 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcC Q lcl|NC_012753. 387 YQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTL 466 (502) Q Consensus 387 ~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~ 466 (502) ...++.-.+.+...|++|++-++.+ +.. . . ..+.+.|... ..+..+.++.+.+++..|+--.+.++.+.+ T Consensus 315 ~di~~~D~~~i~~tln~li~~l~~~----N~~--~-~--~~p~f~~~~~-~e~~~~~a~~~~~L~~~G~~i~~~~i~e~~ 384 (491) T protein:vir:10 315 DDIRDGDKAVVSEAMNMLIRWICDL----NFD--G-A--DRPVFDMWEQ-EQVDEIQAGRDQKLTQAGARFTPAYFKRAY 384 (491) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh----cCC--C-C--CcceEEecCc-CchhHHHHHHHHHHHhCCCcCCHHHHHHHh Confidence 3334444566778888888776654 211 1 1 2356777653 333356788889999999855566788888 Q ss_pred CCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 467 NVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 467 ~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) |+.+.+-++.....+.. .+....-+++ T Consensus 385 Gip~~~~~~~~~~~~~~---------~~~~~~~~~~ 411 (491) T protein:vir:10 385 NLQDGDLDERPLPVSAV---------DTVGAASFAE 411 (491) T ss_pred CCCCCCcCccccccCCC---------CCcccccccc Confidence 88653222211110000 0111111122 No 246 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=91.05 E-value=0.018 Score=30.18 Aligned_cols=387 Identities=11% Similarity=0.002 Sum_probs=158.8 Q ss_pred cchhhhhccc--cccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEee-C Q lcl|NC_012753. 21 QSLNSITDHP--KIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRV-D 97 (502) Q Consensus 21 ~~l~~i~~~~--~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~-~ 97 (502) |.+....... ....+ ...|.....|-.....-..+.. ++-+.+.--..+|+.+|+-+-+=|+.+-- + T Consensus 1 ~~~~r~~~~~~~~~~~~------~~~~~~~~~g~~~s~~~~~vt~----~~al~~~~v~~~v~~ia~~iA~lp~~~~~~~ 70 (419) T protein:vir:14 1 MFFSRQLLSNLGQTQMS------AGGWVSALLGSSRSDSGQVVTP----ASALALTVLQNCVTLLAESIAQLPIELYERS 70 (419) T ss_pred CcccccccccccccccC------cchhhHHhhcCCCccCCcccch----HHhhccHHHHHHHHHHHHhhccCceEEEEec Confidence 2222211100 00111 0112222222111111000000 11122223345666667666665665422 1 Q ss_pred CH----HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEcCCeEEEEEEcCCCeEEEE Q lcl|NC_012753. 98 NE----VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQATVFFPLQANTQDVSSAA 166 (502) Q Consensus 98 d~----~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~~~~~~Pi~~d~~~~~~~~ 166 (502) ++ ..+..|..+|.. | ....-....+...+..|.+|+.+..+. |.+ .+-.++|+.+-+.. +.+.. T Consensus 71 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~l~pl~~~~v~v~~-~~~~~---- 145 (419) T protein:vir:14 71 GEDRKPATDHPLYSILKYEPNSWQTPFEYQEQSQVAVGLRGNSYSFIDRDSDGVIQGLYPLDNEAVTVMR-GSDLK---- 145 (419) T ss_pred CCccccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEecCceEEEEE-CCCce---- Confidence 11 111234444432 2 223334445777788899988887775 443 46666776665532 21111 Q ss_pred EEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccc Q lcl|NC_012753. 167 IVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMN 246 (502) Q Consensus 167 ~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n 246 (502) .+| ++. ... .++. + -+++++.+. T Consensus 146 -------------~~y-------------~~~-----~~~------~~~~---------~---------~i~h~~~~~-- 168 (419) T protein:vir:14 146 -------------PVY-------------RVR-----GSD------PMPQ---------R---------LVHHVRWMS-- 168 (419) T ss_pred -------------EEE-------------EEc-----cCc------ccch---------h---------heeEecCcC-- Confidence 001 000 000 0110 0 012233221 Q ss_pred cccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccc-eeeechHHhccCCCCCCcccCccc-cccccc-hhhcccc- Q lcl|NC_012753. 247 NKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQR-RVAVPTQMIKTEYDTNGEKVTVKR-EFETGH-NVYEQFD- 322 (502) Q Consensus 247 ~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~-~i~v~~~~l~~~~~~~g~~~~~~~-~~~~~~-~~~~~~~- 322 (502) .+..+|+|.+.-+...|+....+-....+-|..+.. ..+ |....+.++....... .+.... ..+.... T Consensus 169 ---~dg~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gi-----l~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n 240 (419) T protein:vir:14 169 ---INGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGV-----IERPKDAPALKDQASVDRITDGWNAKFGGSGN 240 (419) T ss_pred ---CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEE-----EEecCCCCcccCHHHHHHHHHHHHHHhcCccc Confidence 123578898887777776555443333334555433 222 2221111110000000 000000 0000000 Q ss_pred ---CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccc-cHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 323 ---SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMK-TATEVVSEQSDTYQMRNSIATLVE 398 (502) Q Consensus 323 ---~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~-tAtei~~~~~~l~~~~~~~~~~~~ 398 (502) ..--+.+..++.++.....-++.+..+....+|+...|++|..++...++.- +..+... ..++ T Consensus 241 ag~~~vl~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s~~E~~~~-------------~f~~ 307 (419) T protein:vir:14 241 AKKVALLQEGMTFRPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSL-------------QFVI 307 (419) T ss_pred cCCceecCCCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH-------------HHHH Confidence 0000112234455544445567787778889999999999999986544322 2222211 1123 Q ss_pred HHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCCHH-HHHHHH Q lcl|NC_012753. 399 KSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVTKE-QAQEIY 477 (502) Q Consensus 399 ~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~de-ea~~el 477 (502) ..|..++..|-...+. .++.........+.++++.-+-.|..+.++...+++.+|+++.-++++. .|+..- ..+.-+ T Consensus 308 ~~L~P~~~~ie~~l~~-kll~~~~~~~~~i~fd~~~l~r~d~~~~~~~~~~~~~~G~~T~NE~R~~-~gl~p~~gGD~~~ 385 (419) T protein:vir:14 308 YTLLPWVKRHEQAKTR-DLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDIRRL-ENMPPVKGGDIYL 385 (419) T ss_pred HHHHHHHHHHHHHHhh-hccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHH-hCCCCCCCcCeee Confidence 3333333333221111 1111111122345555556566788999999999999999999997654 343221 011111 Q ss_pred H-----HHHH-hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 478 Q-----KIND-ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 478 ~-----ri~~-E~~~~~~~~~~~~~~~~~g~ 502 (502) . .+.. ++.+.....+...+.+-.|- T Consensus 386 ~~~n~~~~~~~~~~~~~~~~~~~~~~~e~~~ 416 (419) T protein:vir:14 386 SPMNMVDASKPQQLPVGKSEPTKAAIDEIGR 416 (419) T ss_pred eccccccccccccccCCCCCCccccccchhc Confidence 1 0000 01111112222233333333 No 247 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=90.19 E-value=0.022 Score=29.65 Aligned_cols=269 Identities=16% Similarity=0.082 Sum_probs=114.5 Q ss_pred hhcCcceEeeCCHHHHHHHHHHHh-----hccHHHHHHHHHHHHhhcCCEEEEEEEeC-Cc-eEEEEEcCCeEEEEEEcC Q lcl|NC_012753. 87 VFNEQATIRVDNEVADAFINETLK-----NDKFSKNFERYLESCLALGGLAMRPYIDG-DQ-IRVSFVQATVFFPLQANT 159 (502) Q Consensus 87 l~~ep~~i~~~d~~~~e~l~~~~~-----~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~-~~i~~v~~~~~~Pi~~d~ 159 (502) +-+=|+.+--+++.....+..+|. ......-+..++...+..|.+++.+..+. |. +.+..++|+.+-+...+. T Consensus 1 ia~l~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~ll~~Gna~~~i~r~~~G~~~~l~~l~~~~v~v~~~~~ 80 (278) T protein:vir:78 1 MASLPLKMYEDYKVVNTEVSDLLTVSPNNSLSSFDFINQIETIRNEKGNAYVLIERDIYHQPSKLFLLNPDVVEMLIENQ 80 (278) T ss_pred CccceeEEEecCcccccHHHHHHHhcCCCCCCHHHHHHHHHHHHhhcCCEEEEEEECCCCcEEEEEEECCceeEEEEcCC Confidence 222233222222222222222222 12334456667778888999988887764 44 356667777665432222 Q ss_pred CCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEE Q lcl|NC_012753. 160 QDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTY 239 (502) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~ 239 (502) +.. .+|. ++ . .. |..+.+. +. -.++ T Consensus 81 ~~~-----------------~~y~-~~---~--~~----------------g~~~~~~------~~----------evih 105 (278) T protein:vir:78 81 SRE-----------------LYYS-IH---A--AT----------------GNKLIVH------NM----------DMLH 105 (278) T ss_pred Cce-----------------EEEE-EE---c--CC----------------ceEEEEc------cc----------cEEE Confidence 211 0010 00 0 00 1111000 00 0233 Q ss_pred ecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-ccccccccchhh Q lcl|NC_012753. 240 LKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFETGHNVY 318 (502) Q Consensus 240 ~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~~~~~~~ 318 (502) ++.+. .....+|+|.+..+...++....+...-...+..+...|+. .....+..... ....|.....-. T Consensus 106 ~~~~~----~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~------~~~~l~~e~~~~~~~~~~~~~~~~ 175 (278) T protein:vir:78 106 FKHIV----ASNMVQGISPIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLK------YGSNVGKEKRQQVLEDFKQYYEEN 175 (278) T ss_pred ECCCC----CCCCeeeccHHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEE------eCCCCCHHHHHHHHHHHHHHhccC Confidence 44321 12345799998888888876555443322233333333332 11111110000 000010000000 Q ss_pred ccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccc-cccHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 319 EQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKS-MKTATEVVSEQSDTYQMRNSIATLV 397 (502) Q Consensus 319 ~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~-~~tAtei~~~~~~l~~~~~~~~~~~ 397 (502) ..+..- +.+.-++.++......++.+..+...++|+...|+||..+|...++ -+|+.+... ..+ T Consensus 176 g~~~vl--~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~-------------~~~ 240 (278) T protein:vir:78 176 GGILFQ--EPGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNR-------------FYL 240 (278) T ss_pred CCceec--CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHH-------------HHH Confidence 000011 1122366666666777888888889999999999999999876543 334433221 112 Q ss_pred HHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCcc Q lcl|NC_012753. 398 EKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVF 437 (502) Q Consensus 398 ~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~ 437 (502) +.+|..++..|....+. .++... .......|.|+-+.. T Consensus 241 ~~~l~P~~~~i~~~ln~-~L~~~~-e~~~g~~~~f~~~~l 278 (278) T protein:vir:78 241 QHTLLPIVKQYEEEFNR-KLLTKT-DREKIGILNLTLNLI 278 (278) T ss_pred HHHHHHHHHHHHHHHHh-hcCChh-HhcCCceEEEecccC Confidence 22333333322222111 111110 011123455554333 No 248 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=90.15 E-value=0.022 Score=29.63 Aligned_cols=409 Identities=14% Similarity=0.136 Sum_probs=160.7 Q ss_pred CChhH-------HHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccC-CCcccccccee Q lcl|NC_012753. 1 MGIIQ-------TIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDS-NGSQVKRDFNH 72 (502) Q Consensus 1 m~~~~-------~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~-~~~~~~~~~~~ 72 (502) |+-|= ......+....... .--.....|+---+++.++.+|-.... .| +...+... .....++.. T Consensus 1 m~~~~d~~g~p~~~~~~~~~~~~~~~-~~~~~~~~~~~~gltp~~l~~iL~~a~--~g--d~~~~~~L~~dm~~~D~h-- 73 (512) T protein:vir:19 1 MGRILDISGQPFDFDDEMQSRSDELA-MVMKRTQEHPSSGVTPNRAAQMLRDAE--RG--DLTAQADLAFDMEEKDTH-- 73 (512) T ss_pred CcceeCCCCCccccccccccccchhc-ccchhhccccccCCCHHHHHHHHHHhh--CC--CHHHHHHHHHHHHhhChH-- Confidence 32111 00000000000000 000011122223456666665544322 01 11110000 000001111 Q ss_pred cchHHHHHHHHhhhhhcCcceEeeC------CHHHHHHHHHHHhhc-cHHHHHHHHHHHHhhcCCEEEEEEEe--CCce- Q lcl|NC_012753. 73 LPIGRTASKKVASLVFNEQATIRVD------NEVADAFINETLKND-KFSKNFERYLESCLALGGLAMRPYID--GDQI- 142 (502) Q Consensus 73 ~n~~k~iv~~~a~~l~~ep~~i~~~------d~~~~e~l~~~~~~~-~f~~~~~~~~~~~~~~G~~~~~~~~d--~~~~- 142 (502) + .-...+--.-+++.+..|.-. ++...+++++++.+- .|...+..++ +|..+|-+++-+.|. ++.. T Consensus 74 --i-~s~l~~Rk~av~~~~w~I~p~~~~~~~~~~~a~~v~~~l~~~~~f~~~~~~ll-dA~~~G~s~~Ei~w~~~~g~~~ 149 (512) T protein:vir:19 74 --L-FSELSKRRLAIQALEWRIAPARDASAQEKKDADMLNEYLHDAAWFEDALFDAG-DAILKGYSMQEIEWGWLGKMRV 149 (512) T ss_pred --H-HHHHHHHHHHHhCCCceEecCCCCCHHHHHHHHHHHHHHhcCCCHHHHHHHHH-hhhhhcceeeeeEeeeeCCcee Confidence 1 223334445566777777531 234567888888653 5888777665 488899888877664 3322 Q ss_pred --EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccc Q lcl|NC_012753. 143 --RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLY 220 (502) Q Consensus 143 --~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~ 220 (502) ++.++++..|.. +.+.. ..+. ++. ...-|.++| T Consensus 150 ~~~~~~r~~~~f~~---~~~~~--------------------------------~~lr---~~~--~~~~G~~l~----- 184 (512) T protein:vir:19 150 PVALHHRDPALFCA---NPDNL--------------------------------NELR---LRD--ASYHGLELQ----- 184 (512) T ss_pred eeeeeeecccccee---ccCCC--------------------------------cEEE---ecC--CCCCceeec----- Confidence 344455433221 00000 0000 000 000011111 Q ss_pred cCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhh-ccceeeechHHhccCCC Q lcl|NC_012753. 221 EDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKM-GQRRVAVPTQMIKTEYD 299 (502) Q Consensus 221 ~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~-~~~~i~v~~~~l~~~~~ 299 (502) + --|++++. +...++|+|.|.+..+--..---+..+..|+.=.+- |.+ +.+ -+ ++. T Consensus 185 ----~---------~k~i~~~~----~~~~g~p~g~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P-~~i----gk-y~~ 241 (512) T protein:vir:19 185 ----P---------FGWFMHRA----KSRTGYVGTNGLVRTLIWPFIFKNYSVRDFAEFLEIYGLP-MRV----GK-YPT 241 (512) T ss_pred ----C---------CceEEEec----cCCCCCcccccHHHHHHHHHHHHHHHHHHHHHHHHHcCCC-eeE----Ee-cCC Confidence 0 01222221 123467888888877655554444444444443332 332 222 11 221 Q ss_pred CCCcccCccccccccchhhccccCCCC---ccccceeeecc-ccchHHHHHHHHHHHHHHHHhcCCChhhcccccccccc Q lcl|NC_012753. 300 TNGEKVTVKREFETGHNVYEQFDSGDM---DKGIGITDLTT-DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKT 375 (502) Q Consensus 300 ~~g~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~-~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~t 375 (502) +.... ....+ .+....+..+.+ ..+.-|+.++. .-....|...++.+-++|+..+ ++...-+..+++.+. T Consensus 242 ~a~~~--ek~~L---~~al~~~~~~a~~iiP~~~~ie~~ea~~~~~~~y~~li~~~d~~Isk~i-LGqtlTs~~g~~Gs~ 315 (512) T protein:vir:19 242 GSTNR--EKATL---MQAVMDIGRRAGGIIPMGMTLDFQSAADGQSDPFMAMIGWAEKAISKAI-LGGTLTTEAGDKGAR 315 (512) T ss_pred CCCHH--HHHHH---HHHHHHHhhCcEEEecCCceEEEeecCCCCHHHHHHHHHHHHHHHHHHH-hhhhhcccccccchh Confidence 11110 00000 011111111100 01112333322 1222335555655555665543 121111121222111 Q ss_pred H-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhc Q lcl|NC_012753. 376 A-TEVVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAA 453 (502) Q Consensus 376 A-tei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~ 453 (502) | .++. ..-....+..-.+.+...|+ +|++-++.+ |+ +........+.+.|+..-+.|..+.++.+.+++ . T Consensus 316 a~~~vh--~ev~~di~~aDa~~i~~tln~~li~~l~~~----N~-~~~~~~~~~p~~~f~~~e~eDl~~~a~~~~~l~-~ 387 (512) T protein:vir:19 316 SLGEVH--DEVRREIRNADVGQLARSINRDLIYPLLAL----NS-DSTIDINRLPGIVFDTSEAGDITALSDAIPKLA-A 387 (512) T ss_pred hHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CC-CCCCCccccceEEecCCChhhHHHHHHHHHHHh-c Confidence 2 1222 22233445556667778885 687766654 21 111222234677888888888888888888876 7 Q ss_pred CCCCHHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCC---------CccccCCCCC Q lcl|NC_012753. 454 GFAPKTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFR---------TSEEVDIYGE 502 (502) Q Consensus 454 Gi~S~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~---------~~~~~~~~g~ 502 (502) |+--.+.++.+.+|++..+-.+.+............... .....|-.+. T Consensus 388 G~~i~~~~i~e~~Gip~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 445 (512) T protein:vir:19 388 GMRIPVSWIQEKLHIPQPVGDEAVFTIQPVVPDNGSQKEAALSAEDIPQEDDIDRMGV 445 (512) T ss_pred CCCCCHHHHHHHhCCCCCCCccccccCCCccccccccccccccccCCCchhhHhHHhh Confidence 875556678888887542111111111111111100000 0000000011 No 249 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=89.18 E-value=0.028 Score=29.11 Aligned_cols=374 Identities=11% Similarity=0.068 Sum_probs=138.0 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHh-cCCC-CccccccCCCcccccccee-cchHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYF-AGDF-DSVTYRDSNGSQVKRDFNH-LPIGR 77 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y-~g~~-~~~~~~~~~~~~~~~~~~~-~n~~k 77 (502) |+||.. +++-...-..... .++ .+.. ..+. ..+ ...++ .+--. T Consensus 1 Mg~~~~----f~~k~~~~~~~~~-----------------------~~~~~~~~~~~~~---~~~----~~~~~~~~~V~ 46 (403) T protein:vir:80 1 MGLFNF----FRRKTRSEPTNAI-----------------------SWFLTQEAYDTLA---IPG----YTRLSDNPEVR 46 (403) T ss_pred Cccccc----ccccccccccchh-----------------------hhhcccccccccc---cch----hhhhhhhHHHH Confidence 999853 3331110000000 011 1110 0000 000 00111 11112 Q ss_pred HHHHHHhhhhhcCcceEe-e-CC--HHHHHHHHHHHh--hccH---HHHHHHHHHHHhhc--CCEEEEEEEeC-Cce-EE Q lcl|NC_012753. 78 TASKKVASLVFNEQATIR-V-DN--EVADAFINETLK--NDKF---SKNFERYLESCLAL--GGLAMRPYIDG-DQI-RV 144 (502) Q Consensus 78 ~iv~~~a~~l~~ep~~i~-~-~d--~~~~e~l~~~~~--~~~f---~~~~~~~~~~~~~~--G~~~~~~~~d~-~~~-~i 144 (502) ..|+..|+-+.+=|+.+- - ++ ......+..+|. -|.. ..-+..++..++.. |-+++.+.+|. |.+ .+ T Consensus 47 ~~I~~ia~~iA~~p~~~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~v~~~ll~~~Gna~i~~~~~~~g~~~~L 126 (403) T protein:vir:80 47 MAVHKIAELISSMTIHLMQNTDNGDIRIKNELSRKIDINPYSLMTRKAWMYNIVYTMLLDGEGNSVVFPKYTTSGLIDEL 126 (403) T ss_pred HHHHHHHHhhhhCceEEEEecCCceeecCChHHHHHhccCCcCCCHHHHHHHHHHHHhhcCCccEEEEEEEcCCCcEEEE Confidence 345666665555555531 1 11 111222333333 1211 22233345555554 44666666665 333 45 Q ss_pred EEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCC Q lcl|NC_012753. 145 SFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLE 224 (502) Q Consensus 145 ~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~ 224 (502) ..++|..+-++..+++ +.+.+ .+ ..++. T Consensus 127 ~~l~p~~v~~~~~~~g----------------------------------~~~~y---~~-------~~~~~-------- 154 (403) T protein:vir:80 127 IPLAPSKVSFVDTDTG----------------------------------YQIWY---QG-------KAYNY-------- 154 (403) T ss_pred EEEcCCeeEEEEcCCc----------------------------------eEEEE---ee-------cccch-------- Confidence 5566665544321111 11110 00 00000 Q ss_pred cceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhcc-ceeeechHHhccCCCCCCc Q lcl|NC_012753. 225 ETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQ-RRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 225 ~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~-~~i~v~~~~l~~~~~~~g~ 303 (502) .+ .++|+.+. .......|.|.+.-+...+.....+-.....-|..+. ...+ +......... T Consensus 155 ~e----------iih~~~~~---~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~i-----l~~~~~~~~~ 216 (403) T protein:vir:80 155 DE----------VLHFIVNP---DPEKPYMGRGYRVVLKDIVNNLKQATTTKKSFMSGKYMPSLI-----VKVDAATAEL 216 (403) T ss_pred hh----------EEEEeccC---CCcCccccccHHHHHHHHHHHHHHHHHHHHHHHhccCCcceE-----EEeCCCCChH Confidence 00 12222110 0111224777766655555544432222222334332 2222 2111111100 Q ss_pred ccCccccccccchhhccc-cCCCC----ccccceeeec-cccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHH Q lcl|NC_012753. 304 KVTVKREFETGHNVYEQF-DSGDM----DKGIGITDLT-TDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTAT 377 (502) Q Consensus 304 ~~~~~~~~~~~~~~~~~~-~~~~~----~~~~~i~~~~-~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAt 377 (502) ... .....-...+... ..... ........++ .+...-++++..+....+|+...|+||..+|..... +++ T Consensus 217 ~~~--~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~~~ 292 (403) T protein:vir:80 217 SSE--EGRNAVFKKYLEASEAGQPWIIPAELLDVEQVKPLSLKDLAIHETVELDKRTVAGIFGVPAFLLGVGKYD--KDE 292 (403) T ss_pred HHH--HHHHHHHHHHhhhhhcCCeeeecccccccceeccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCcc--HHH Confidence 000 0000000001000 00000 0000122232 233345677778888889999999999998743221 121 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 378 EVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 378 ei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) .. . ..+.+|..++..|-...+. .++. .....+.++.+.-+..|..+.++...+++.+|+|+ T Consensus 293 ~~-----~----------f~~~~l~P~~~~ie~~l~~-kll~---~~~~~~~f~~~~ll~~d~~~~~~~~~~~~~~Gi~t 353 (403) T protein:vir:80 293 YN-----N----------FINSTILPIAKGIEQELTR-KLLI---SPDLYFKFNPRSLYAYDLKELAEVGSNMYVRGLME 353 (403) T ss_pred HH-----H----------HHHHHHHHHHHHHHHHHHH-hccC---CCCcEEEeechhhhccCHHHHHHHHHHHHhCCCcC Confidence 11 1 1223344433333222111 1111 11123334434456678889999999999999999 Q ss_pred HHHHHHhcCCCCHH-HHHHHHHHHH---HhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 458 KTMAIEKTLNVTKE-QAQEIYQKIN---DETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 458 ~et~l~~~~~~~de-ea~~el~ri~---~E~~~~~~~~~~~~~~~~~g~ 502 (502) .-++++. .|+.+- ..++.+.... .+.........+.+..+=.|+ T Consensus 354 ~NE~R~~-~gl~p~~ggd~~~~~~n~~pl~~~~~~~~~k~ge~~~~~~~ 401 (403) T protein:vir:80 354 GNEVRDW-LGLSPKEGLSELVILENYIPLDKIGDQNKLKGGEKGGADGQ 401 (403) T ss_pred HHHHHHH-hCCCCCCCCCeEeecccccchhhccchhhccCCCCCCCCCC Confidence 9997654 354331 1111110000 000000011111111122222 No 250 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=88.85 E-value=0.03 Score=28.95 Aligned_cols=292 Identities=8% Similarity=-0.016 Sum_probs=102.4 Q ss_pred EEEEEEcC-CC-eEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeecccc-ccCCCccee Q lcl|NC_012753. 152 FFPLQANT-QD-VSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTL-YEDLEETVT 228 (502) Q Consensus 152 ~~Pi~~d~-~~-~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~-~~~l~~~~~ 228 (502) ++-+-|.. ++ .....+.++ ... +. .+...... |..+.+..- ..+. T Consensus 1 v~Eivw~~~~g~~~~~~l~~r-----~~~--~~---~~f~~~~~-----------------~~l~~~~~~~~~g~----- 48 (355) T protein:vir:78 1 MFEQVYRIENGRARLGKLAWR-----PPR--TI---SRFDVAPD-----------------GGLVAIEQWGVFGK----- 48 (355) T ss_pred CeEEEEEeeCCeEEEeeeeec-----Ccc--ce---eeeeeccC-----------------CceeEEEecCCCCC----- Confidence 22222211 11 111111000 000 00 00000000 111110000 0000 Q ss_pred ecCCCcce--EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-h-ccceee-echHHhccCCCCCCc Q lcl|NC_012753. 229 LNGLTRPL--FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-M-GQRRVA-VPTQMIKTEYDTNGE 303 (502) Q Consensus 229 ~~~~~~~~--f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~-~~~~i~-v~~~~l~~~~~~~g~ 303 (502) .+.+-|+ |+.++. ....++|+|.|.+..+--..---...+..|+.=++ . ..-.+. .|. +.+. T Consensus 49 -~~~~lp~~kfi~~~~----~~~~g~p~G~gLlr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~--------~~~~ 115 (355) T protein:vir:78 49 -ATVRIPVDRLVVFVN----EREGANWLGQSLLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAP--------LPEA 115 (355) T ss_pred -CcceeccCCEEEEEe----CCCCCCccchhhHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecC--------CCCc Confidence 1111122 344332 22456799999988776544333333333333233 1 122222 211 1111 Q ss_pred ccCcccccc---cc-----chhhccccCCCC-----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccc- Q lcl|NC_012753. 304 KVTVKREFE---TG-----HNVYEQFDSGDM-----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFD- 369 (502) Q Consensus 304 ~~~~~~~~~---~~-----~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~- 369 (502) ......... .+ ......+..+.. ..+.-|+.++..-..-.+...++.+-++|+..+ ++....+.. T Consensus 116 ~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~i-LGqtlTs~~~ 194 (355) T protein:vir:78 116 IARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPHGANFTLTGVQGKLPEMDGPIRYHDEQIARAV-LAHFLTLGGD 194 (355) T ss_pred ccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecCCceEEEeecCCCcccHHHHHHHHHHHHHHHH-hhhhhccccC Confidence 000000000 00 000000000000 011123333222222234455555555665554 332211111 Q ss_pred ccccccH-HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHH Q lcl|NC_012753. 370 GKSMKTA-TEVVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYW 447 (502) Q Consensus 370 ~~~~~tA-tei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~ 447 (502) +++.+-| -++. ..-....++.-.+.+...|+ +|++.++.+ ++ +.. ..-+.+.|+. ...+..+.++.+ T Consensus 195 ~~gGS~Alg~vh--~~v~~~~~~aD~~~i~~~ln~~li~~l~~l----N~--~~~--~~~P~~~~~~-~~~~~~~~a~~~ 263 (355) T protein:vir:78 195 KSTGSYALGDTF--ASFFTGSLNAVMKHIADVTQQHVVEDLVDQ----NW--GPE--EPAPRLVPAQ-LGKEQPVTAEAI 263 (355) T ss_pred CccchhhHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cC--CCC--CCCCEEEecC-cChhHHHHHHHH Confidence 1211112 2222 22233444445566777775 577766654 21 111 1235677764 556667778899 Q ss_pred HHHHhcCCCCH----HHHHHhcCCCCHHHH-HHHHHHHHHhhhcccCCCCCccccC-CCCC Q lcl|NC_012753. 448 SKMVAAGFAPK----TMAIEKTLNVTKEQA-QEIYQKINDETMVSTDSFRTSEEVD-IYGE 502 (502) Q Consensus 448 ~~~~~~Gi~S~----et~l~~~~~~~deea-~~el~ri~~E~~~~~~~~~~~~~~~-~~g~ 502 (502) .+++..|+... ++++.+.+|+.+.+. ++++.- .++.... .....+..+. -.++ T Consensus 264 ~~l~~~G~~~~~~~~~~~~~e~~gip~p~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~ 322 (355) T protein:vir:78 264 RALVECGAFTADPELEKDLRARYGLPAPAERDDGADA-AAAKAAG-RRRAKRLPGQRQGAA 322 (355) T ss_pred HHHHhCCCccccHHHHHHHHHHhCCCCCCCCCcccCC-ccccccc-cccccccCCcccccc Confidence 99999998654 457788888754211 112111 1111111 1111111111 1123 No 251 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=86.23 E-value=0.047 Score=27.86 Aligned_cols=388 Identities=11% Similarity=0.004 Sum_probs=153.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |-+-...++-+ .. .-..+ ..|...+.|-.....-..+.. .+-+.+.--...| T Consensus 1 m~~~~~~~~~~----------------~~-~~~~~-------~~~~~~~~g~~~s~~~~~v~~----~~al~~~~v~~cv 52 (419) T protein:vir:80 1 MFFSRQLLSNL----------------GQ-TQPGS-------GGWVSALLGSARSEAGQVVTP----ASALSLTVLQNCV 52 (419) T ss_pred CCccccccccc----------------Cc-CCCCc-------chhhHHhhcccccccCcccCh----HHhhccHHHHHHH Confidence 21111100000 00 00111 112222222111110000100 1112222234456 Q ss_pred HHHhhhhhcCcceEee---CCHH--HHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC-Cce-EEEEEc Q lcl|NC_012753. 81 KKVASLVFNEQATIRV---DNEV--ADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG-DQI-RVSFVQ 148 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~---~d~~--~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~-~~~-~i~~v~ 148 (502) +..|+-+-+=|+.+-- ++.. .+..+..+|.. | ....-...++...+..|.+|+.+..+. |.+ .+-.++ T Consensus 53 ~~ia~~ia~lp~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~~~i~r~~~G~~~~L~~i~ 132 (419) T protein:vir:80 53 TLLAESIAQLPVELYERSGDDRKPATDHPLYSILKYEPNPWQTPFEYQEQSQVAVGLRGNSYSFIDRDQDGVIQGLYPLD 132 (419) T ss_pred HHHHHhhccCceEEEEecCCCcccccccHHHHHHHhhcccCCCHHHHHHHHHHHHhhcCCeEEEEEECCCCcEEEEEEec Confidence 6666666555665421 1111 11234444431 2 233444556667788899998887775 443 466667 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+.+-+.. +.+.. ..| ...+.. .++ .+ T Consensus 133 ~~~v~i~~-~~~~~-----------------~~y------~~~~~~------------------~~~---------~~-- 159 (419) T protein:vir:80 133 NEAVTVMK-GPDLK-----------------PMY------RVAGAD------------------PLP---------QR-- 159 (419) T ss_pred CceEEEEE-CCCce-----------------EEE------EEcCcc------------------ccc---------hh-- Confidence 76654431 11100 001 000000 010 00 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVK 308 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~ 308 (502) -..+++.+ ..+..+|+|.+..+...|+....+.....+-|..+...-.+ |....+..+...... T Consensus 160 -------~i~h~~~~-----~~d~~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gi----l~~~~~~~~~~~~~~ 223 (419) T protein:vir:80 160 -------LVHHVRWM-----SINGYTGLSPVLLHANAIGHAQAIQQYAGKSFMNGTALSGV----IERPTDAPALKDQAS 223 (419) T ss_pred -------heEEecCC-----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCccEE----EEecCCCCcccCHHH Confidence 01223322 12345789988777776655444332222334544332111 222111111100000 Q ss_pred c-cccccc-hhhcccc----CCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhcccccccc-ccHHHHHH Q lcl|NC_012753. 309 R-EFETGH-NVYEQFD----SGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSM-KTATEVVS 381 (502) Q Consensus 309 ~-~~~~~~-~~~~~~~----~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~-~tAtei~~ 381 (502) . .+.... ..+.... ..--+.+..++.++.....-++.+..+....+|+...|+++..+|...++. +++.+... T Consensus 224 ~~~~~~~~~~~~~g~~n~g~~~vl~~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~ 303 (419) T protein:vir:80 224 VDRITDGWNAKFGGSGNAKKVALLQEGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSL 303 (419) T ss_pred HHHHHHHHHHHhcCccccCCceecCCCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH Confidence 0 000000 0000000 000011223555555555556778888888999999999999998754432 23332221 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHH Q lcl|NC_012753. 382 EQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMA 461 (502) Q Consensus 382 ~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~ 461 (502) . .++..|..++..|-...+. .++.........+.++++.-+..|..+.++...+++.+|+++.-++ T Consensus 304 ~-------------f~~~~l~P~~~~ie~~l~~-kll~~~~~~~~~i~fd~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~ 369 (419) T protein:vir:80 304 Q-------------FVIYTLLPWVKRHEQAKTR-DLLLPSERKQYFIEYNLAGLLRGDQSSRYAAYAVGRQWGWLSINDI 369 (419) T ss_pred H-------------HHHHHHHHHHHHHHHHHhh-hccCccccCCeEEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHH Confidence 1 1222233332222211111 1111111122345555555566788999999999999999999997 Q ss_pred HHhcCCCCHH-HHHHHHH-----HHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 462 IEKTLNVTKE-QAQEIYQ-----KINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 462 l~~~~~~~de-ea~~el~-----ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++. .|+..- ..++-+- ....-+....+. +++... --.| T Consensus 370 R~~-~g~~p~~gGD~~~~~~n~~~~~~~~~~~~~~-~~~~~~-~~~~ 413 (419) T protein:vir:80 370 RRL-ENMPPVKGGDIYLSPMNMVDASKPQPIPMGK-TEPTKA-ALDE 413 (419) T ss_pred HHH-hCCCCCCCcceeeeccccccccccccccCCC-CCchhh-hHHH Confidence 654 344321 0111110 000000000000 000000 0011 No 252 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=85.24 E-value=0.054 Score=27.52 Aligned_cols=430 Identities=10% Similarity=0.084 Sum_probs=157.7 Q ss_pred hhHHHHHHHHHHhhcccccchhhhhcc-ccccCCHHHHHHHHHHHHHhcC----CCCc-cccc--cCCCc--ccccc--- Q lcl|NC_012753. 3 IIQTIKNFIKRSNYVITNQSLNSITDH-PKIAISPEEYNRIMDNLRYFAG----DFDS-VTYR--DSNGS--QVKRD--- 69 (502) Q Consensus 3 ~~~~ik~~i~~~~~~~~~~~l~~i~~~-~~~~~~~~~~~~i~~~~~~Y~g----~~~~-~~~~--~~~~~--~~~~~--- 69 (502) ..+++-+++.+.---.....+..+... ..+ .++..+++....-+.. +..- .... ...+. ...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~a~~~p~~~~~~~~~~~~~~p~~~ 77 (576) T protein:vir:96 1 MVTRLADIFKRLRLGRDYEDIIDTVPIDDGL---QANIRNIEEKSKELNKSLYGKQQAYAEPFLEVMDTNPEFRTKRSYM 77 (576) T ss_pred ChhhHHHHHHHHhccCccccchhhhhcccCh---hHHHHHhhhhhhhhccccCCccchhhcceeeeeecCCCccccCcch Confidence 445555554432200000000000000 011 1222233221111111 0000 0000 00000 00000 Q ss_pred ------------ceecchHHHHHHH----Hhhhhhc-------CcceEee-------CCHH--HHHHHHHHHh----h-- Q lcl|NC_012753. 70 ------------FNHLPIGRTASKK----VASLVFN-------EQATIRV-------DNEV--ADAFINETLK----N-- 111 (502) Q Consensus 70 ------------~~~~n~~k~iv~~----~a~~l~~-------ep~~i~~-------~d~~--~~e~l~~~~~----~-- 111 (502) ....++...+|+. .|.|.+- -+..|.. .+.. ....+...+. . T Consensus 78 ~~~~~~~~~l~~~~~npiv~~~I~~ia~~vA~~~~~~~~~~~~~~~~i~lk~~~~~~~~~~~~~~~~l~~~l~~~~~~~~ 157 (576) T protein:vir:96 78 KNSDNLHDVLKQFGNNPILNAIILTRSNQVAMYCQPSRYNERGLGFEVRMRDLDAEPGKKEKEEIKRIENFILNTGRDKD 157 (576) T ss_pred hhhhhhHHHHHHhhcCHHHHHHHHHHHHHHHhhhhhhhhccccccceeEEecCcCccchhhhHhhhhHHhhHhhccCCCC Confidence 0001223334444 4443210 0011111 1111 1222333332 1 Q ss_pred ---ccHHHHHHHHHHHHhhcCCEEEEEEEeC---Cc-eEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEE Q lcl|NC_012753. 112 ---DKFSKNFERYLESCLALGGLAMRPYIDG---DQ-IRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSL 184 (502) Q Consensus 112 ---~~f~~~~~~~~~~~~~~G~~~~~~~~d~---~~-~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~ 184 (502) ..+..-+..++...+..|.+++.+.++. |. +.+-.++|..+-++..+++... +. T Consensus 158 p~~~t~~~f~~~lv~dlll~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~-------------------~~ 218 (576) T protein:vir:96 158 IDRDSFQSFCRKIVRDTYTYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKII-------------------KG 218 (576) T ss_pred CccccHHHHHHHHHHHHHhcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCcee-------------------ee Confidence 1345566677778899999999887753 33 3577788888877643333211 00 Q ss_pred EEEEE-EeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHH Q lcl|NC_012753. 185 IEFHE-WNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAK 263 (502) Q Consensus 185 ~E~h~-~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~ 263 (502) ..++. ..++.... .++ +.++ +.+. +++.. ......+|+|.+..+. T Consensus 219 ~~~~~~~~~~~~~~---------------~~~--------~~di-------i~~~--~~~~~--d~~~~~~G~Spi~~a~ 264 (576) T protein:vir:96 219 GKRFVQVINKKVVA---------------SFT--------SREM-------AMGI--RNPRT--ELSSSGYGLSEVEIAM 264 (576) T ss_pred eeEEEEecCCceEE---------------Eec--------ccce-------EEEe--ecCCC--CcccCcccccHHHHHH Confidence 01100 00000000 000 0000 1111 11111 0123457999888777 Q ss_pred HHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCccc--cccccc-hhhccccCC-----CCccccceeee Q lcl|NC_012753. 264 TTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKR--EFETGH-NVYEQFDSG-----DMDKGIGITDL 335 (502) Q Consensus 264 ~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~--~~~~~~-~~~~~~~~~-----~~~~~~~i~~~ 335 (502) ..|.....+-.-..+-|..+...-.| |....+. ...+.. .+.... ..+...... --+.+.-++.+ T Consensus 265 ~~i~~~~~~~~~~~~~f~Ng~~p~gi----L~~~~~~---~ls~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~l 337 (576) T protein:vir:96 265 KQFIAYNNTETFNDRFFSHGGTTRGI----LQIKSEQ---QQSQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVNM 337 (576) T ss_pred HHHHHHHHHHHHHHHHHhccCCCceE----EEeCCCC---CCCHHHHHHHHHHHHHHhccccccccceeecCCCceEEec Confidence 77765554433333345554332111 2211111 011000 000000 111110000 00112235566 Q ss_pred ccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHh Q lcl|NC_012753. 336 TTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSI-ATLVEKSLKELVISILELAKV 414 (502) Q Consensus 336 ~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~-~~~~~~~l~~l~~~il~~~~~ 414 (502) +.....-++++..+...++|+...|++|..+|+...+.+++..-... .++..+... +..++.+|..+++.|....+. T Consensus 338 s~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s--~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~ 415 (576) T protein:vir:96 338 TPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNT--LNEADPGKKQQQSQNKGLQPLLRFIEDLINT 415 (576) T ss_pred cCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccc--cccccHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 66667778899999999999999999999999865443322111000 011111111 122344444444443332221 Q ss_pred hcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCH-HHH------------------ Q lcl|NC_012753. 415 YNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTK-EQA------------------ 473 (502) Q Consensus 415 ~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~d-eea------------------ 473 (502) .+... ...++.+.|.+.-+.+..+..+ ......+|+|+.-++++.+ +++.. |+. T Consensus 416 -~Ll~~---~~~~~~~~f~r~d~~~~~e~~~-~~~~~~~G~lT~NE~R~~~gl~piegGD~~~~~~~~~~~~~~~~~~~~ 490 (576) T protein:vir:96 416 -HIISE---YSDKYVFQFVGGDTKSELDKIK-ILQEEVKTYKTVNEARKEKGLKPIEGGDVLLDGSFIQSMSLNTQKEQY 490 (576) T ss_pred -hhchh---ccCceEEEeccCCHHHHHHHHH-HHHHHhcCccCHHHHHHHhCCCCCCCcceeccccccccccccccCCCC Confidence 11111 1234677787764444433333 2334557999998876543 22221 100 Q ss_pred -----HHHHHHHHH--hhhcccCCCCCccccCCCCC Q lcl|NC_012753. 474 -----QEIYQKIND--ETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 474 -----~~el~ri~~--E~~~~~~~~~~~~~~~~~g~ 502 (502) ++.+....+ +..........+.+-.--|+ T Consensus 491 e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~g~ 526 (576) T protein:vir:96 491 EDTKQKERFDMIQQFLNSPDDEEPQQESTEDKVDGR 526 (576) T ss_pred CCccccccccccccccCCCCCCCCCCCCCCCccccc Confidence 000000000 00000000001111111222 No 253 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=83.79 E-value=0.066 Score=27.07 Aligned_cols=365 Identities=11% Similarity=0.121 Sum_probs=135.4 Q ss_pred HhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcce Q lcl|NC_012753. 14 SNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQAT 93 (502) Q Consensus 14 ~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~ 93 (502) ||- |..+..+.. .. ...|.+. .+. ... ....+....-..+++..|+-+.+-|.. T Consensus 1 Mg~------f~~~f~~~~-~~-----------~~~~~~~--~~~--~~~----~~~a~~~~~v~~~i~~ia~~ia~~p~~ 54 (385) T protein:vir:95 1 MGL------FDSVFKRHS-EL-----------SWMYDLE--FLQ--DKS----KKAYLKQIALNTVVEMVARTISQSEFR 54 (385) T ss_pred Cch------hhhhhccCc-cc-----------ccccchh--hhh--ccc----hhhhhhhHHHHHHHHHHHHHHccccee Confidence 211 122221110 00 0011110 000 000 011122222345667777766666665 Q ss_pred EeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEE Q lcl|NC_012753. 94 IRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIV 168 (502) Q Consensus 94 i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~ 168 (502) +--++.....-+..+|.. | ........++...+..|.+|+.+..+++.+ ++..+.+. .........+ T Consensus 55 ~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gna~i~~~~~~~~~-----~~~~~~~~--~~~~~~~~~~- 126 (385) T protein:vir:95 55 VMKNNTKEKGTLYYLLNVRPNRNQNAVDFWQKFIFKLIMDNEVLVVKNDEGHFF-----VADDFEKE--DELGLYSHRF- 126 (385) T ss_pred eeecCccccchHHHHHhcccCcCCCHHHHHHHHHHHHhhcCceEEEEecCCCee-----eccccccc--cccccccccc- Confidence 432333333334444431 1 223344555666777787776543232221 11111110 0000000000 Q ss_pred EEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccc Q lcl|NC_012753. 169 TKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNK 248 (502) Q Consensus 169 ~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~ 248 (502) +. +.. ..+.+.. .++-.+ .++|+.+..... T Consensus 127 -------------~~-~~~-----~~~~~~~-------------~~~~~e------------------iih~~~~~~~~~ 156 (385) T protein:vir:95 127 -------------TN-VLV-----NDFEFKR-------------VFTMDD------------------VIYLKYNNQKLD 156 (385) T ss_pred -------------ee-eee-----cccceee-------------eecccc------------------EEEecCCCCCcc Confidence 00 000 0000000 000000 112222222211 Q ss_pred cccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccccc-chhhccccCCC-- Q lcl|NC_012753. 249 DINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFETG-HNVYEQFDSGD-- 325 (502) Q Consensus 249 ~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~~~-- 325 (502) . +|.|.+..+..+++ ..++...+ ..+..-+++ ++.....+.... ..+... ...+....... T Consensus 157 ~----~G~s~~~~~~~~i~---~~~~~~~~--~~~~~g~l~----~~~~~~~~~e~~---~~~~~~~~~~~~g~~~~~~~ 220 (385) T protein:vir:95 157 A----FSLGLFEDYGEIFG---RMIDLQML--NNQIRGILK----VDATKFYNKEKQ---KELQAYIDTLFDAFQNNTIA 220 (385) T ss_pred c----ccchHHHHHHHHHH---HHHHHHHh--cCCCceEEE----eCCccCCCHHHH---HHHHHHHHHHhhhhhhcCCc Confidence 1 25555554444432 22222211 112222222 110000000000 000000 00001000000 Q ss_pred ---Cccccceeeecc------ccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 326 ---MDKGIGITDLTT------DIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATL 396 (502) Q Consensus 326 ---~~~~~~i~~~~~------~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~ 396 (502) -+.+..++.++. .....++.+..+...++|+...|+||..++. ..+++.+.. ... T Consensus 221 i~~l~~g~~~~~l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~---~~sn~e~~~-------------~~~ 284 (385) T protein:vir:95 221 VVPLTEGLAYEEHSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG---EMADLEKTI-------------ESY 284 (385) T ss_pred eEEcCCCceeEeecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC---CCcCHHHHH-------------HHH Confidence 011111333221 1124578888888899999999999999852 222232211 112 Q ss_pred HHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhcCCCC---HHHH Q lcl|NC_012753. 397 VEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKTLNVT---KEQA 473 (502) Q Consensus 397 ~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~~~~~---deea 473 (502) ++.+|..++..|....+. .+....-.....+.++++.-+..|..+.++...+++.+|+|+.-++++.. |+. ++.. T Consensus 285 ~~~~l~P~~~~ie~~l~~-~L~~~~~~~~~~~~fd~~~l~~~D~~~~~~~~~~~~~~g~lt~NE~R~~~-g~~p~~~~~g 362 (385) T protein:vir:95 285 LQFCINPLLRKIEAELNS-KFFYQDEYLNDDMHIKVVGIDKRDPLKLSEAIDKLVASGTFTRNQVRIMT-GEEPADDPEL 362 (385) T ss_pred HHHHHHHHHHHHHHHHHh-hcCChhhcccceEEEechhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCC Confidence 233333333333322221 11111111222466666777778889999999999999999999876653 443 2222 Q ss_pred HHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 474 QEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 474 ~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++-+-. .+.. ..+...+++-.|| T Consensus 363 d~~~~~---~n~~---~~~~~kgge~~~e 385 (385) T protein:vir:95 363 DKFIIT---KNLQ---SADAFKGGESNEE 385 (385) T ss_pred ceeeec---ccce---ecccccCCCCCCC Confidence 211110 0001 1123455666666 No 254 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=78.31 E-value=0.12 Score=25.72 Aligned_cols=375 Identities=14% Similarity=0.106 Sum_probs=148.2 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchH--HH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIG--RT 78 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~--k~ 78 (502) |++|.+-+. . ..+... -+..+..|... -.+-...+ +..+ -. T Consensus 1 m~~f~~~~~-----------~-----------~~~~~~-----~~~~~~~~~~~-~~~~~~~A---------l~~~~V~~ 43 (406) T protein:vir:97 1 MSFFQPLGT-----------S-----------KVSYDD-----YISSVLAGDVS-QKYLGVSA---------LKNSDILT 43 (406) T ss_pred CccccccCC-----------C-----------CCCcch-----HHHHHhcCCCC-cccccchh---------hccHHHHH Confidence 888753100 0 001000 02223333211 00100001 1111 12 Q ss_pred HHHHHhhhhhcCcceEeeCCH--HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC--Cc-eEEEEEc Q lcl|NC_012753. 79 ASKKVASLVFNEQATIRVDNE--VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG--DQ-IRVSFVQ 148 (502) Q Consensus 79 iv~~~a~~l~~ep~~i~~~d~--~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~-~~i~~v~ 148 (502) .|+..|+-+..=|+.+.-.+. .....+..+|.. | ....-...++...+..|.+|+.+..++ |. ..+..++ T Consensus 44 ~i~~Ia~~iA~lp~~~~~~~g~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~l~l~Gnay~~i~r~~~~g~~~~L~~i~ 123 (406) T protein:vir:97 44 ATSIIAGDIARFPLVKKDVNGDIIHDEDINYLLNVKSTSNASARTWKFAMAVNAILTGNSFSRILRDPKTNQALQFQFYR 123 (406) T ss_pred HHHHHHHhhhhCeeEEEecCccccccchHHHHhhccCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCCCeEEEEEEEC Confidence 344444444333544332221 112234444431 2 223445556777778899998887763 44 3666777 Q ss_pred CCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCccee Q lcl|NC_012753. 149 ATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVT 228 (502) Q Consensus 149 ~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~ 228 (502) |+++-+...+.+.+ .|+.. ...+ |..+.+ + +.+ T Consensus 124 p~~v~v~~~~~~~~------------------~y~~~---~~~~------------------~~~~~~---~---~~e-- 156 (406) T protein:vir:97 124 PSETTVEETDNHEI------------------VYTFT---DMLT------------------AKQVKC---F---AHD-- 156 (406) T ss_pred CCeeEEEEcCCceE------------------EEEEE---ecCC------------------ceEEEE---c---ccc-- Confidence 77765532222211 11100 0000 111100 0 000 Q ss_pred ecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhcc-ceeeechHHhccCCCCCCcccCc Q lcl|NC_012753. 229 LNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQ-RRVAVPTQMIKTEYDTNGEKVTV 307 (502) Q Consensus 229 ~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~-~~i~v~~~~l~~~~~~~g~~~~~ 307 (502) .++|+.+. .+...|+|.+.-+...|+....+..-..+-|+.+. ..+++ ......+.... T Consensus 157 --------vih~r~~~-----~dg~~G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~-----~~~~~l~~e~~-- 216 (406) T protein:vir:97 157 --------VIHWKFFS-----HDTILGRSPLLSLGDEIDLQTGGINTLIKFFKDGFSSGILT-----MKGAQLSGDAR-- 216 (406) T ss_pred --------EEEecCCC-----CCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEE-----ecCCCCCHHHH-- Confidence 12333221 12235888887777666643333332222344332 22222 11111000000 Q ss_pred cccccccchhhccccCCCC-------ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHH Q lcl|NC_012753. 308 KREFETGHNVYEQFDSGDM-------DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVV 380 (502) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~-------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~ 380 (502) ..+ ...+.....++. +.+..++.++.....-++.+..+...++|+...|+||..+|....+..++...+ T Consensus 217 -~~~---~~~~~~~~~g~n~g~~~vl~~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~~~e~~~~ 292 (406) T protein:vir:97 217 -QRA---RQEFEKMREGSVGGSPLVFDSTMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQSVAQLME 292 (406) T ss_pred -HHH---HHHHHHHhcccccCceeecCCCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcchHHHHHH Confidence 000 011111111100 122234555555555567777777789999999999999985433321121111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHH Q lcl|NC_012753. 381 SEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTM 460 (502) Q Consensus 381 ~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et 460 (502) . .++.+|..++..|-.-.+. .+...... ....|.|+- ..+....++...+++.+|+++.-+ T Consensus 293 ~--------------f~~~~l~P~~~~ie~~l~~-kll~~~~~--~~~~i~fd~--~~~~~~~~~~~~~~~~~g~~T~NE 353 (406) T protein:vir:97 293 D--------------YVTNDLPFYFDAITSELGL-KTLNDKDR--RLYHIEFDT--RSVTGRNVDEIVKLVNNQILTPNQ 353 (406) T ss_pred H--------------HHHHHHHHHHHHHHHHHhh-hhcChhhc--cceeEEEec--CccchhhHHHHHHHHhCCCcCHHH Confidence 1 1223333333322221111 11111111 123345542 223445566777888999999999 Q ss_pred HHHhc--CCCCHHHHHHHH-----HHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 461 AIEKT--LNVTKEQAQEIY-----QKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 461 ~l~~~--~~~~deea~~el-----~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) ++..+ +++.+...++-+ ..+..-+....+.-....+++..|| T Consensus 354 ~R~~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~~~~~~~~~~gg~~~~~ 402 (406) T protein:vir:97 354 GLVELGKQKSTDPNMDRYQSSLNYVFLDKKEEYQDKVGIKGKGGEVNAE 402 (406) T ss_pred HHHHhCCCCCCCCCCCeEeeccCccchhcccccccccccccCCCCCCCC Confidence 77654 233221111110 1111100001111123345555666 No 255 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=72.27 E-value=0.18 Score=24.60 Aligned_cols=366 Identities=11% Similarity=0.075 Sum_probs=127.8 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+|+++++..- ...+... +.+. .+... .. ...+...--..+| T Consensus 1 Mgl~d~~~~~~--------~~~~~~~----------------------~~~~--~~~~~--~~----~~~l~~~~v~~~i 42 (395) T protein:vir:96 1 MGILDFFSFKK--------SGTLSDD----------------------DSGS--TTSEK--LT----NVVLKEDALYKCV 42 (395) T ss_pred CcchhhhcCCC--------Ccccccc----------------------cccc--chhhh--cc----hhhhhhHHHHHHH Confidence 99997664310 0111000 0000 00000 00 0001111113345 Q ss_pred HHHhhhhhcCcceEeeCCHH--HHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEV--ADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFF 153 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~--~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~ 153 (502) +..|+-+-.=|+.+--+++. ....+..+|+. | ....-...++...+..|.+|+.+..+.+. +.++. + T Consensus 43 ~~Ia~~ia~lp~~v~~~~~~~~~~~~~~~lL~~~PN~~~t~~~f~~~l~~~lll~Gna~~~~~~~~~~-----~~~~~-~ 116 (395) T protein:vir:96 43 NYLARIISKSTFRIKAPEKLTENQKDWLYWINTKANPNQSASQFWVEVVQKLLVDGETLIFVIPGKGI-----YVADA-F 116 (395) T ss_pred HHHHHhhccceeEEEeCCccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEcCCce-----ecCCc-c Confidence 55555555555544332211 12234444431 2 22333444566666678877666544321 11111 1 Q ss_pred EEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCC Q lcl|NC_012753. 154 PLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLT 233 (502) Q Consensus 154 Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~ 233 (502) +... .+... .|..+.. ..|.+... ++-.+ T Consensus 117 ~~~~---~~~~~---------------~~~~v~~-----~~~~~~~~-------------~~~~d--------------- 145 (395) T protein:vir:96 117 TQDK---KLSGN---------------KFKVSRV-----QGQTYEKI-------------FTFDQ--------------- 145 (395) T ss_pred cccc---ccccc---------------eeeeeee-----ccceeeeE-------------eccCc--------------- Confidence 1100 00000 0000000 01111110 00000 Q ss_pred cceEEEecCCccccccccCcCCcc---hhhhHHHHHHHHHHHHH--HHH-HHHhhccceeeechHHhccCCCCCCcccCc Q lcl|NC_012753. 234 RPLFTYLKPPGMNNKDINSPLGLS---IFDNAKTTMDFINTTYD--EFM-WEVKMGQRRVAVPTQMIKTEYDTNGEKVTV 307 (502) Q Consensus 234 ~~~f~~~~~~~~n~~~~~~p~G~S---~~~~~~~lid~ld~~~S--~~~-~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~ 307 (502) .++|+.+... ..+.+.+ .+..+..+.-++...-+ .+. +-+..+.....+ +........... T Consensus 146 ---vih~k~~~~~----~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-- 212 (395) T protein:vir:96 146 ---VIYLKNDNSD----LMLKVESLWEEYGELLGHVINNQKIANQIRFTMTPPKDKVRERAQ----ENSDGGRQPKSD-- 212 (395) T ss_pred ---eEEecccCCc----cccccccccchHHHHHHHHHHHHHHHHHHHHHhhhccccccccee----eccCchhhHHHH-- Confidence 1223322110 0111112 22233333322211111 111 112222111111 211111110000 Q ss_pred cccccccchhhccccCCCC-----ccccceeeeccccchH------HHHHHHHHHHHHHHHhcCCChhhccccccccccH Q lcl|NC_012753. 308 KREFETGHNVYEQFDSGDM-----DKGIGITDLTTDIRSD------DYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTA 376 (502) Q Consensus 308 ~~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e------~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tA 376 (502) ..+ -...+.......+ +.+..++.++..-..- ++.+.....+++|+...|+||..++.+. ++. T Consensus 213 -~~~--~~~~~~~~~~~~~~v~~l~~g~~~~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~~~---sn~ 286 (395) T protein:vir:96 213 -KDF--FKRTIEKIRTESVVGIPVTANTNYEEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDI---ADN 286 (395) T ss_pred -HHH--HHHHHHHhhcCCcceEEccCCceeEecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCC---ccH Confidence 000 0011111111110 1111233333322222 3334444557889999999999986322 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCC Q lcl|NC_012753. 377 TEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFA 456 (502) Q Consensus 377 tei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~ 456 (502) .+.. +..++.+|..++..|-...+. .+.... .......|+|+.-+..|..+.++...+++.+|++ T Consensus 287 e~~~-------------~~f~~~~L~P~~~~ie~~l~~-~Ll~~~-e~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~ 351 (395) T protein:vir:96 287 QKNY-------------ELLLEGPIESLITNIVDGLEY-AIFDKS-ETLEGSFIKVTGLKNYDLFSISSQADKLISSGFV 351 (395) T ss_pred HHHH-------------HHHHHHHHHHHHHHHHHHHHh-hcCChh-hhcCceeEeecchhccCHHHHHHHHHHHHhCCCc Confidence 2221 112233333333333221111 111111 1112345777777888999999999999999999 Q ss_pred CHHHHHHhc--CCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 457 PKTMAIEKT--LNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 457 S~et~l~~~--~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +.-++++.. +|+++.+.++-+. ..+.. ..+..+++-.+| T Consensus 352 T~NE~R~~~gl~pi~~~~gD~~~~---~~N~~----~~~~~gge~~~~ 392 (395) T protein:vir:96 352 FIDEVREEIGLPELPDGLGKVLYM---TKNYE----SVLERGGEVDEE 392 (395) T ss_pred CHHHHHHHhCCCCCCCCCCceeee---cccce----echhccCCCCCC Confidence 998876653 3443332221110 00000 001122233333 No 256 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=70.25 E-value=0.21 Score=24.28 Aligned_cols=398 Identities=11% Similarity=0.058 Sum_probs=151.4 Q ss_pred hhccccccCCHHHHHHHHHH-----HHHhcC-CC-Ccccccc---C-CCccccccceecchHHHHHHHHhhhhhcCcceE Q lcl|NC_012753. 26 ITDHPKIAISPEEYNRIMDN-----LRYFAG-DF-DSVTYRD---S-NGSQVKRDFNHLPIGRTASKKVASLVFNEQATI 94 (502) Q Consensus 26 i~~~~~~~~~~~~~~~i~~~-----~~~Y~g-~~-~~~~~~~---~-~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i 94 (502) .+++..++...-+..++--. .+.|.- ++ +.+.... . .....++.. + .-.+++-...+++-+.+| T Consensus 1 ~~~~~~~~~p~~~~g~~~~~~~~~~~~~~~~~e~~~~lr~~~~~~ly~~m~e~D~~----i-~s~l~~rk~av~~~~w~v 75 (469) T protein:vir:10 1 MTERVKTAAPVSEAGYVFGSGVVDGWTVWDPFEQTPELQWPQSVAVYSRMDNEDSR----V-TSLLEAISLPIRSTPWRI 75 (469) T ss_pred CCCcccCCCCccchhhhhhcccccchhhccccccccccccccchHHHHHHHhhChH----H-HHHHHHHHHHHhcCCceE Confidence 12222223222222222110 011110 00 0110000 0 000000100 1 223333445566666666 Q ss_pred eeC--CHHHHHHHHHHHhh-----------------ccHHHHHHHHHHHHhhcCCEEEEEEEeC------CceE---EEE Q lcl|NC_012753. 95 RVD--NEVADAFINETLKN-----------------DKFSKNFERYLESCLALGGLAMRPYIDG------DQIR---VSF 146 (502) Q Consensus 95 ~~~--d~~~~e~l~~~~~~-----------------~~f~~~~~~~~~~~~~~G~~~~~~~~d~------~~~~---i~~ 146 (502) .-. +++..+++.+.+.. ..|...+.+.+..+..+|-+++-+.|.. |... +.+ T Consensus 76 ~p~~~~~e~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~w~~~l~~~l~~a~~~G~s~~Eivw~~~~~~~dG~~~~~~l~~ 155 (469) T protein:vir:10 76 RANGASDEVTEFVSRNLMVPIDGEDDVRNPGRSRGRFSWAEHLEEVTSPTLQFGHAVFEQVYRPRNQSPDGRFWLRKLAP 155 (469) T ss_pred ecCCCCHHHHHHHHHHHHhhhhhhhhhhhhhhhhccccHHHHHHHHHHHhhhhCceeeeeeeecccccCCCceeeeeeee Confidence 432 33333444333321 2466777777888888998888777752 2222 222 Q ss_pred EcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcc Q lcl|NC_012753. 147 VQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEET 226 (502) Q Consensus 147 v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~ 226 (502) .++..+--..++.++... . ++. ..+ ........+.. + T Consensus 156 rp~~~i~~~~~~~~~~l~--~--------------~~~---~~~---~~~~~~~~~~~-~-------------------- 192 (469) T protein:vir:10 156 RPQWTISKFNVAPDGGLE--S--------------IEQ---IAP---PARTRGSLYVA-N-------------------- 192 (469) T ss_pred cCcccceeeeeccCCcee--e--------------eee---cCc---ccccccccccC-C-------------------- Confidence 222221111111111000 0 000 000 00000000000 0 Q ss_pred eeecCCCcce--EEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-hccceeeechHHhccCCCCCCc Q lcl|NC_012753. 227 VTLNGLTRPL--FTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-MGQRRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 227 ~~~~~~~~~~--f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~~~~~i~v~~~~l~~~~~~~g~ 303 (502) ..+.+-|+ |+.++.. ...++|+|.|.+..+--..---+..+..++.=.+ .|... .| .+ ++.+... T Consensus 193 --~~~~~lp~~k~i~~~~~----~~~g~p~g~gLlr~~~~~~~fK~~~~~~w~~f~EryG~P~-~v----gk-y~~~a~~ 260 (469) T protein:vir:10 193 --IAPPEIPVNRLVVYTRN----KRPGQWQGKSILRSAYKHWLLKDKLLRIEAATAERNGMGI-PV----GT-ASSATDE 260 (469) T ss_pred --CCccccccCcEEEEEec----CCCCCcccchhHHHHHHHHHHHHHHHHHHHHHHHHcCCcc-eE----Ee-cCCCCCH Confidence 00111111 3333322 2457799999998876654443434444443333 23322 12 11 1111110 Q ss_pred ccCccccccccchhhccccCCCC-----ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHH Q lcl|NC_012753. 304 KVTVKREFETGHNVYEQFDSGDM-----DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATE 378 (502) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAte 378 (502) .....+ .+....+..+.. ..+.-|+.++..-....|...++.+-++|+..+ ++....++..+|..+..+ T Consensus 261 --~ek~~l---~~a~~~~~~g~~a~~iip~~~~ie~~ea~g~~~~~~~li~~~d~~Isk~i-LG~tlTs~~~gGS~a~~~ 334 (469) T protein:vir:10 261 --DEVRKM---AALARSVRGGINAGVGLAQGQILELLGVSGNLPDIRRAIEGHDRSIALSG-LAHFLNLDGKGGSYALAS 334 (469) T ss_pred --HHHHHH---HHHHHHHhcCCceEEEccCCceEEEeecCCCchHHHHHHHHHHHHHHHHH-hcccccccCccchhhHHH Confidence 000000 111111111100 012234444444444456666666666665544 222211221222111122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCC Q lcl|NC_012753. 379 VVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAP 457 (502) Q Consensus 379 i~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S 457 (502) +. ..-....++.-.+.+...|+ +|++-++.+ ++ + . ...-+.+.|+.. ..+.+..++.+++++.+|++. T Consensus 335 vh--~ev~~d~~~sDa~~i~~tln~~li~~l~~l----N~-g-~--~~~~P~~~~~~~-e~~~~~~a~~i~~l~~~G~~~ 403 (469) T protein:vir:10 335 VL--EDPFTQAVHAYATSICRIANQHIIEDLVDI----NF-G-V--DTPAPVLTFDPI-GSRQDLTAAAVKLLYDAGVFD 403 (469) T ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----cC-C-C--CCCccEEEecCC-CCcHHHHHHHHHHHHhcCCcc Confidence 22 22233445555566777885 577766654 21 1 1 122356778653 455667788899999999842 Q ss_pred ----HHHHHHhcCCCCHHHHHHHHHHHHHhhhcccCCCCC-ccccCCCCC Q lcl|NC_012753. 458 ----KTMAIEKTLNVTKEQAQEIYQKINDETMVSTDSFRT-SEEVDIYGE 502 (502) Q Consensus 458 ----~et~l~~~~~~~deea~~el~ri~~E~~~~~~~~~~-~~~~~~~g~ 502 (502) .+.++.+.+|+...+-.+.+..-.+.. +.+.... +....--|+ T Consensus 404 ~~~~~~~~~~e~~gip~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 451 (469) T protein:vir:10 404 DDPAVKRAIRQRFNLPSELNDTPSAEPEEPA--AVPNQSAAPARTRSSGN 451 (469) T ss_pred CccccHHHHHHHhCCCCCCCCcccccchhcc--cCCCCCccccccCCCCC Confidence 356678888875422112221111111 1111111 111111111 No 257 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=62.95 E-value=0.33 Score=23.25 Aligned_cols=359 Identities=12% Similarity=0.061 Sum_probs=117.7 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||+++ +++... ...+. +.+ .+.+ .. . ...+...--..+| T Consensus 1 Mg~f~~l---~~~~~~-------~~~~~------~~~----------~~~~----~~-----~----~~~l~~~~v~~~i 41 (376) T protein:vir:78 1 MGFFSEL---FKRNKE-------IEWMW------DLD----------FLED----KT-----T----KVYLKKMALNTCV 41 (376) T ss_pred Cchhhhh---hccCCc-------ccccc------chh----------hccc----cc-----h----hhhhhhHHHHHHH Confidence 9999864 332100 00000 000 0000 00 0 0011111223455 Q ss_pred HHHhhhhhcCcceEeeCCHHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEE Q lcl|NC_012753. 81 KKVASLVFNEQATIRVDNEVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPL 155 (502) Q Consensus 81 ~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi 155 (502) +..|+-+..-|..+--++......+..+|.. | ....-...++...+..|.+|+.+..++++... ..+|+ T Consensus 42 ~~Ia~~ia~~p~~~~~~~~~~~~~l~~ll~~~PN~~~t~~~f~~~~~~~lll~Gn~~~~~~r~~~~~~~------~~~~~ 115 (376) T protein:vir:78 42 KHIARTIAKSDFRLKNGETSVRDKLYYKLNIRPNTDMSSSSFWEKVIYKLIYDNECLIVLSDTDDFLIA------DSYVR 115 (376) T ss_pred HHHHHhhcccceeeccccccccchHHHHHhhccccCCCHHHHHHHHHHHHhHcCcEEEEEEeCCCeeec------cceee Confidence 5555555554544322222222333333421 2 22233344455556667777666555543211 12332 Q ss_pred EEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcc---eeecCC Q lcl|NC_012753. 156 QANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEET---VTLNGL 232 (502) Q Consensus 156 ~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~---~~~~~~ 232 (502) .. .......+. . ....+ |+....+. .--|-|--|-..+....+.. +......+... ....+. T Consensus 116 ~~--~~~~~~~~~-~-~~~~~-----~~~~~~~~----~~evih~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 180 (376) T protein:vir:78 116 KE--FAFFPDVFE-G-VTVKD-----YRYNRNFS----MDDVIFLEYGNERLSAFTDG--MFEDYGELFGKMIRAQMRNF 180 (376) T ss_pred cc--cceeeeeee-e-eeeec-----ceeeeeec----cccEEEeccCCCCchhhhhH--HHHHHHHHHHHHHHHHHhcC Confidence 11 001000000 0 00000 00000000 00111100000000000000 00000000000 000000 Q ss_pred CcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccCcccccc Q lcl|NC_012753. 233 TRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVTVKREFE 312 (502) Q Consensus 233 ~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~~~~~~~ 312 (502) ..++...+..+.. .+. +. .+.+...|.....-....+.+|++ + T Consensus 181 ~~~~~~~~~~~~~----------~~~-e~----~~~~~~~~~~~~~g~~~~~~~v~~----l------------------ 223 (376) T protein:vir:78 181 QIRGAVNFKMAGV----------ADK-DK----QTKLQEYIDKVYASFNNNEIAIVP----Q------------------ 223 (376) T ss_pred CCceeEEEccCCC----------CCH-HH----HHHHHHHHHHHhccccccCcceEE----c------------------ Confidence 1111111111100 000 00 011111111110000111111111 0 Q ss_pred ccchhhccccCCCCccccceeeecc---c--cchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHH Q lcl|NC_012753. 313 TGHNVYEQFDSGDMDKGIGITDLTT---D--IRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTY 387 (502) Q Consensus 313 ~~~~~~~~~~~~~~~~~~~i~~~~~---~--ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~ 387 (502) +.+..++.++. + ....++.+..+....+|+...|+||..++.+.+ +.++.. T Consensus 224 --------------~~g~~~~~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~~~s---~~e~~~------- 279 (376) T protein:vir:78 224 --------------LEGFNYEEFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHGDMA---DLSNNM------- 279 (376) T ss_pred --------------CCCceEEeeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCC---CHHHHH------- Confidence 00111222211 1 122367888888899999999999999974322 222211 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc-- Q lcl|NC_012753. 388 QMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT-- 465 (502) Q Consensus 388 ~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~-- 465 (502) ...++.+|..++..|-...+. .++.. ....+.+.+..-+-.|..+.++...+++.+|+++.-++++.. T Consensus 280 ------~~f~~~~l~P~~~~ie~~l~~-kll~~---~~~~~~~~~~~ll~~d~~~~~~~~~~~~~~G~~t~NE~R~~lg~ 349 (376) T protein:vir:78 280 ------KAYMEYCIDPLTKKLEDELNA-KLFTF---SEFLAGEHIKIIHKKDIIENAEAVDKLVASGSFNRNEVRELLGA 349 (376) T ss_pred ------HHHHHHHHHHHHHHHHHHHHh-hhCCc---ccceecccchhhcccCHHHHHHHHHHHHhCCCcCHHHHHHHhCC Confidence 111222333333322221111 01110 111222333334556888889999999999999998876543 Q ss_pred CCCCHHHHHHHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 466 LNVTKEQAQEIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 466 ~~~~deea~~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +++.+.++++-+.. . +-..++-.|| T Consensus 350 ~p~~~g~~d~~~~~--------~----n~~~~~~~~e 374 (376) T protein:vir:78 350 ERVDNPELDKYLIT--------K----NYQSADEGGE 374 (376) T ss_pred CCCCCCCCceeeec--------c----Cceehhcccc Confidence 23333222111110 0 1111112233 No 258 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=56.92 E-value=0.45 Score=22.51 Aligned_cols=368 Identities=10% Similarity=0.057 Sum_probs=127.0 Q ss_pred cchhhhhccccc-cCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEeeCC- Q lcl|NC_012753. 21 QSLNSITDHPKI-AISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDN- 98 (502) Q Consensus 21 ~~l~~i~~~~~~-~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d- 98 (502) |.|.+.....+. .+. ..+.+. .... ......+...--..+|+.+|+-+.+=|+.+--.+ T Consensus 1 MGlf~~~~~~~~~~~~-----------~~~~~~--~~~~------~~~~~~~~~~~v~~~I~~ia~~iA~lp~~~~~~~~ 61 (395) T protein:vir:98 1 MGILDFFSFKKSGTLS-----------DDDSGS--TTSE------KLTNVVLKEDALYKCVNYLARIISKSTFRLKTPEK 61 (395) T ss_pred CcchhhhcCCCccccc-----------ccccch--hhhh------hcchhhhhhHHHHHHHHHHHHHHhhCceeEEecCC Confidence 333332221110 000 000010 0000 0000111222223455666666655565442222 Q ss_pred H-HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEE Q lcl|NC_012753. 99 E-VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVSFVQATVFFPLQANTQDVSSAAIVTKST 172 (502) Q Consensus 99 ~-~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~ 172 (502) + ....-+..+|.. | ....-...++...+..|.+|+.+-.+.+.+ .|+.+..+...... . T Consensus 62 ~~~~~~~~~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnayi~~~~~~~~~-----~~~~~~~~~~~~~~----~------ 126 (395) T protein:vir:98 62 LTENQKDWLYWINTKANPNQSASQFWVEVIQKLLVDGETLIFVIPGKGIY-----VADSFTQDKKISGS----Q------ 126 (395) T ss_pred cccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeCCcee-----cCCcccccccccCc----c------ Confidence 1 112223344432 2 223334445666777788887665443221 12211111000000 0 Q ss_pred EeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccC Q lcl|NC_012753. 173 KTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINS 252 (502) Q Consensus 173 ~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~ 252 (502) |..+. . ..|.+.. .|.. . -.++|+.+..+ .. T Consensus 127 ---------~~~~~---~--~~~~~~~-~~~~--------------------~----------evih~k~~~~~----~~ 157 (395) T protein:vir:98 127 ---------FKVSR---V--QGQTYEK-TFTF--------------------D----------QVIYLKNDNSD----LM 157 (395) T ss_pred ---------cceee---e--cCceeee-EecC--------------------c----------cEEEecCCCCC----cc Confidence 00000 0 0111100 0000 0 01233322111 11 Q ss_pred cCCcchhhhHHHHH-HHHHHHHHHH-HHHHhhccceeeechHHhccCCCCCCcccC-ccccccccchhhccccCCC---- Q lcl|NC_012753. 253 PLGLSIFDNAKTTM-DFINTTYDEF-MWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFETGHNVYEQFDSGD---- 325 (502) Q Consensus 253 p~G~S~~~~~~~li-d~ld~~~S~~-~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~~~~~~~~~~~~~~---- 325 (502) +.+.+.+..+..++ .+++...... .+-+..+...-.++.... ...+.... ....+ -...+.+..... T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~--~~~~~~~~~~~~~~v~ 231 (395) T protein:vir:98 158 SKVESLWEEYGELLGHVINNQKIANQIRFTMIPPKDKVRERAQE----NSDGGRQSKSDKDF--FKRTVEKIRTESVVGI 231 (395) T ss_pred ccccchhhhHHHHHHHHHHHHHHHHHHHHhhccccccccccccc----cCCcHHHHHHHHHH--HHHHHhhhhcCCccee Confidence 12222222222222 1122111111 111111111111110000 00000000 00000 000011110000 Q ss_pred -Cccccceeeec------cccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 326 -MDKGIGITDLT------TDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVE 398 (502) Q Consensus 326 -~~~~~~i~~~~------~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~ 398 (502) .+.+..++.++ .....+++.+..+....+|+...|+|+..++.+. ++.++....+ ...++.-..+.++ T Consensus 232 ~l~~g~~~~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l~~~~---sn~e~~~~~f--~~~tl~P~~~~ie 306 (395) T protein:vir:98 232 PVTANTNYEEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHGDI---ADNQKNYELL--LEGPIESLITNIV 306 (395) T ss_pred ecCCCceeEecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHhcCCc---ccHHHHHHHH--HHHHHHHHHHHHH Confidence 01111122222 1234567888888888999999999999986321 1222211111 1122222222233 Q ss_pred HHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHH Q lcl|NC_012753. 399 KSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEI 476 (502) Q Consensus 399 ~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~e 476 (502) .+|..- ++.. ........|+|++-+..|..+.++...+++.+|+++.-++++.. +|++++..++- T Consensus 307 ~~l~~k------------ll~~-~~~~~g~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~NE~R~~~g~~Pi~~~~gD~~ 373 (395) T protein:vir:98 307 DGLEYA------------IFDK-SETLQGSFIKVTGLKNYDLFSISNQADKLISSGFVFIDEVREEIGLPELPDGLGKVL 373 (395) T ss_pred HHHHHh------------cCCh-hhhcCcceeeehhhhccCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCCCcee Confidence 333220 1111 01112345778877888999999999999999999999976653 34544333222 Q ss_pred HHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 477 YQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 477 l~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) +.. -+.... +..+++--++ T Consensus 374 ~~~---~n~~~~----~~~gge~~~~ 392 (395) T protein:vir:98 374 YMT---KNYESV----LERGGEVDEE 392 (395) T ss_pred eec---ccceec----ccccCCCCCC Confidence 111 000000 0111111111 No 259 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=54.82 E-value=0.49 Score=22.26 Aligned_cols=434 Identities=9% Similarity=-0.018 Sum_probs=148.2 Q ss_pred CCh--hHHHHHHHHHHhh-----cccccchhhhh----ccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCc----- Q lcl|NC_012753. 1 MGI--IQTIKNFIKRSNY-----VITNQSLNSIT----DHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGS----- 64 (502) Q Consensus 1 m~~--~~~ik~~i~~~~~-----~~~~~~l~~i~----~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~----- 64 (502) +|- ++.++.+.+.-.+ ......+.-+- -+....-+++.-.+++....++++.+..+...-.... T Consensus 46 ~p~~~~~~L~~~~e~~~~~~~~i~~~~~~iag~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~n~~~t~ 125 (651) T protein:vir:99 46 NPPYNPDRLAAFLELNETLATGIRKKSRYEVGFGFDLVPAQGVDGDDASDAQREVARNFWRGRSSRWQTGPNQAKTPATP 125 (651) T ss_pred CCCCCHHHHHHHHhcChHHHHHHHHHhhhhhccCceeeecccCCCCccchHHHHHHHHHhhccchhhcccccccCCCCCH Confidence 221 1222222221000 00000000000 0011122333334444444544443332211100000 Q ss_pred --ccc---ccceecchH--HHHHHHHh---hhhhcCcceEee--CCHHHHHHHHHHHhh--ccH--HHHHHHHHHHHhhc Q lcl|NC_012753. 65 --QVK---RDFNHLPIG--RTASKKVA---SLVFNEQATIRV--DNEVADAFINETLKN--DKF--SKNFERYLESCLAL 128 (502) Q Consensus 65 --~~~---~~~~~~n~~--k~iv~~~a---~~l~~ep~~i~~--~d~~~~e~l~~~~~~--~~f--~~~~~~~~~~~~~~ 128 (502) ... ..++..+++ ..|.+..+ .+-.-.+..+.+ ++......+..++.. |.. ...+...+ ..... T Consensus 126 ~~i~~~~~~Dle~tGna~ieiIrn~~g~pv~L~~lp~~~~Rv~~~~~~~~~~~~~ll~~~pn~~~~~~~~~~~~-q~~~~ 204 (651) T protein:vir:99 126 ERVKELARQDYHGVGWLALEMLTDIEGRPVGLAYVPARTVRVRRPQNRFDQPRHPEEGRYVDGDVADIASRGYV-QIRNG 204 (651) T ss_pred HHHHHHHHHHHHHHhhHhhhhhhcCccchhhhhhcChhheeeecccccccchhhhhhhcccccccchhHHHHHH-HHHhc Confidence 000 001111111 01111000 000001111222 222222222222221 111 11112222 23334 Q ss_pred CCEEEEEEEeCC-ce-EEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCC Q lcl|NC_012753. 129 GGLAMRPYIDGD-QI-RVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESES 206 (502) Q Consensus 129 G~~~~~~~~d~~-~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~ 206 (502) +..|++++-+.. .+ .+....++.+.++.......... ....+ ...|.+.. + + T Consensus 205 ~~~~~~~~g~~~~~~~~~~~~~~~~v~~~~~~d~~~~~~-----~~~~~----------------~~~g~~~~--~---~ 258 (651) T protein:vir:99 205 NRRYFGEAGDRYRGQEVVIDESGDEPTIRYREDEESERE-----PIFVD----------------RETGDVTT--G---D 258 (651) T ss_pred CcceEEEeeccccceeeeeccCCcceeEEeccCcceeee-----eeccc----------------ceeeeEEE--c---C Confidence 566676664432 22 23334444444433222111000 00000 00111100 0 0 Q ss_pred ccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccce Q lcl|NC_012753. 207 KTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRR 286 (502) Q Consensus 207 ~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~ 286 (502) ..+. . .+. .--.++|+.+. ..+..+|+|.+..+...+.....+-.-..+-|..+... T Consensus 259 --~~~~-~-------------~~~---~~eViHir~~~----~~~g~~G~spl~~a~~~i~~a~~a~~~~~~~f~NG~~p 315 (651) T protein:vir:99 259 --ANGL-E-------------NRP---ANELIFIPNPS----ILEDDYGVPDWVSAIRTISADEAAKDYNRDFFDNDTIP 315 (651) T ss_pred --CCce-e-------------Eec---ccceEEecCCC----CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCC Confidence 0000 0 000 00124454332 12456899998888877754443333333345554432 Q ss_pred ee---echHHhccCCCCCCcccCcccccccc-chhhccccCCCC------ccccc--eeeecccc-chHHHHHHHHHHHH Q lcl|NC_012753. 287 VA---VPTQMIKTEYDTNGEKVTVKREFETG-HNVYEQFDSGDM------DKGIG--ITDLTTDI-RSDDYIKAINKGLS 353 (502) Q Consensus 287 i~---v~~~~l~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~~~~------~~~~~--i~~~~~~i-r~e~~~~~l~~~l~ 353 (502) -. +|...+.. .... .....|... .+.++.+-...+ ..+.+ ++.++... ...++.+..+.... T Consensus 316 ~gil~~~~~~ls~----e~~~-~lr~~~~~~~~nagk~~vL~~~~~~~~~~~~~g~~~~pls~~~~~D~qfle~r~~~~~ 390 (651) T protein:vir:99 316 RMVIKVTGGELSE----ESKR-DLRQMLNGLREESHRAVVLEVEKFQSQLDEDVEIELEPMGQGISEEMDFRQFREKNEH 390 (651) T ss_pred ceEEEecCCCCCH----HHHH-HHHHHHHHHhccCCceEEeecccccccccccCCceEEEcCcCchhhHHHHHHHHHHHH Confidence 11 22111100 0000 000000000 000000000000 00113 33333322 24577888888899 Q ss_pred HHHHhcCCChhhcccccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEe Q lcl|NC_012753. 354 LFEMQLGVSTGMFSFDGKSM-KTATEVVSEQSDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDL 432 (502) Q Consensus 354 ~i~~~~g~s~~~~~~~~~~~-~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f 432 (502) +|+...|++|..+|+..++. +|+.+....+ ...+..-..+.++..|... | +..........+.+.| T Consensus 391 eIa~afgVPp~~lG~~~~~~~sn~E~~~~~f--~~~tL~P~~~~ie~eln~k----L-------l~~~e~~~~~~i~~ef 457 (651) T protein:vir:99 391 EIAKVLEVPPVKIGVTDSANRSNSDQQDKDF--ALEVIQPEQHTFAEWLYQI----I-------HQQALGVTDWTIEYEL 457 (651) T ss_pred HHHHHhCCCHHHhccCCCCCcccHHHHHHHH--HHHHHHHHHHHHHHHHHHh----h-------cCccccccCceEEEEe Confidence 99999999999998765443 2333322111 1122222223333333221 0 1111111223455666 Q ss_pred CC--CccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCCHHHHHHHHHHHHHhhh-------cccCCCCCccccCCCC Q lcl|NC_012753. 433 DD--GVFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVTKEQAQEIYQKINDETM-------VSTDSFRTSEEVDIYG 501 (502) Q Consensus 433 ~d--~i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~deea~~el~ri~~E~~-------~~~~~~~~~~~~~~~g 501 (502) +. -+-.|..+.++....++++|+|+.-++++.. +++.++....-+..++.... ...+....+..-+..+ T Consensus 458 ~~~~llr~D~~~~~e~~~~~i~~G~~T~NE~R~~lglppi~~~~gd~~l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~ 537 (651) T protein:vir:99 458 RGADQPKQEAQLAEQRVRAMRLAGVGLVDEAREELGLDPLGEPYGEMTLSEFEAEVAGDVAGGGETEAVHEPPEENKIGE 537 (651) T ss_pred ccchhhhccHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCccccccccccccccccccccCCCCcccccCcccccccc Confidence 54 4456888888888999999999999976653 34444333222222111100 0001111111111222 Q ss_pred C Q lcl|NC_012753. 502 E 502 (502) Q Consensus 502 ~ 502 (502) + T Consensus 538 ~ 538 (651) T protein:vir:99 538 R 538 (651) T ss_pred c Confidence 2 No 260 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=40.09 E-value=0.99 Score=20.61 Aligned_cols=350 Identities=12% Similarity=0.063 Sum_probs=122.4 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||.+++.+-+..... ....+.. |.+.. .. +.......+| T Consensus 1 Mg~f~~~~~~~~~~~~~-~~~~~~~-----------------------~~~~~--~~-------------~~~~~v~~~v 41 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNN-DTQRVTA-----------------------WQNEA--VE-------------YTSAFVTNIH 41 (378) T ss_pred CCccccchhcccccccC-Ccceeee-----------------------eccch--hH-------------HHHHHHHHHH Confidence 99999998875431110 0000000 00000 00 0001123344 Q ss_pred HHHhhhhhcCcceE-eeC--C----H---HHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeCCceEEE Q lcl|NC_012753. 81 KKVASLVFNEQATI-RVD--N----E---VADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDGDQIRVS 145 (502) Q Consensus 81 ~~~a~~l~~ep~~i-~~~--d----~---~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~~~~~i~ 145 (502) +..|+-+.+=|+.+ .-. + . ..+.-|.++|.. | ....-....+..++..|.+|+.+.++++.-++ T Consensus 42 ~~IA~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~~~~~g~~- 120 (378) T protein:vir:94 42 NKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSPKGERNSMDFWRKVIKKLLSAPYVDLYAVFDDNTGEL- 120 (378) T ss_pred HHHHhhhhhCceeeEEEcccCcccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeeCCCceE- Confidence 55555554445442 111 0 0 011223344432 1 22334445567778889888876665432111 Q ss_pred EEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCc Q lcl|NC_012753. 146 FVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEE 225 (502) Q Consensus 146 ~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~ 225 (502) ++++...+.+. +..+ + |-| +++.....-| .-|+......+.. T Consensus 121 -------~~l~p~~~~~~--------~~~~----------d----------iiH--~~~~~~~~~g-~s~l~~~~~~i~~ 162 (378) T protein:vir:94 121 -------LDLLFADDKKE--------YKPE----------E----------LVR--LTSPFYINED-TSILDNALASIQT 162 (378) T ss_pred -------EEEEecCCeeE--------eeee----------e----------eEE--ecCcCCccch-hHHHHHHHHHHHH Confidence 11111111110 0000 0 000 0000000000 0011110000000 Q ss_pred ceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCccc Q lcl|NC_012753. 226 TVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKV 305 (502) Q Consensus 226 ~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~ 305 (502) . ... ..|.++ ++.+. .++. +..+.+.+.+...++..... ....++.| | T Consensus 163 ~-~~~--~~~~gi-l~~~~----------~l~~-~~~~~~~~~~~~~~~~~~~~--~~~g~~~v----l----------- 210 (378) T protein:vir:94 163 K-LEQ--GKLRGL-LKINA----------FLDI-DNTQEYREKALTTIKNMQEG--SSYNGLTP----V----------- 210 (378) T ss_pred H-Hhc--ccccce-eeeCC----------cCCH-HHHHHHHHHHHHHHHHhhcc--ccccccee----c----------- Confidence 0 000 111111 11110 0010 11222223333222221110 00001111 0 Q ss_pred CccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHH Q lcl|NC_012753. 306 TVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSD 385 (502) Q Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~ 385 (502) +.+.-++.++.+..+.+. ..++.+..+|+...|+||..++. |..+... T Consensus 211 ---------------------~~g~~~~~l~~~~~~~~~-~~~~~~~~~Ia~~fgVP~~~l~~------~~se~~~---- 258 (378) T protein:vir:94 211 ---------------------DNKTEIVELKKDYSVLNK-DEIDLIKSELLTGYFMNENILLG------TASQEQQ---- 258 (378) T ss_pred ---------------------CCCceEEEccCChhhhhH-HHHHHHHHHHHHHhCCCHHHhcC------ChHHHHH---- Confidence 011113333333223332 34456677899999999988841 1112111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccC------C-CcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCH Q lcl|NC_012753. 386 TYQMRNSIATLVEKSLKELVISILELAKVYNLYT------G-EIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPK 458 (502) Q Consensus 386 l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~------~-~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~ 458 (502) +..++.+|..++..|..-.+. .++. + .......+.|+++.-...|..+.++...+++.+|+|+. T Consensus 259 --------~~f~~~tL~P~~~~ie~~l~~-~Ll~~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~~~~~~~~~~~G~~T~ 329 (378) T protein:vir:94 259 --------IYFYNSTIIPLLIQLEKELTY-KLISTNRRRVVKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQ 329 (378) T ss_pred --------HHHHHHHHHHHHHHHHHHHHh-hcCChhHhhhhhhcccccceeecchhhhhcCHHHHHHHHHHHHhCCCcCH Confidence 112233333333332221111 0110 0 01111235566667777899999999999999999999 Q ss_pred HHHHHhcCCCCHH-HHHH-----HHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 459 TMAIEKTLNVTKE-QAQE-----IYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 459 et~l~~~~~~~de-ea~~-----el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) -++++.. |+..- ..++ .+..+...............+.+=.+| T Consensus 330 NE~R~~~-gl~p~~gGD~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 330 NQLLVKM-GEQPIEGGDVYIANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred HHHHHHh-CCCCCCCCCeeeecccccccccchhhcCCcCCCCCCCCCCCC Confidence 8876653 33221 0111 011111000000000011111111122 No 261 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=23.79 E-value=2.3 Score=18.64 Aligned_cols=395 Identities=16% Similarity=0.146 Sum_probs=139.6 Q ss_pred CC-------hhHHHHHHHHHHhhcccccchhhhhccccccC---CH---------HHHHHHHHHHHHhcCCCCccccccC Q lcl|NC_012753. 1 MG-------IIQTIKNFIKRSNYVITNQSLNSITDHPKIAI---SP---------EEYNRIMDNLRYFAGDFDSVTYRDS 61 (502) Q Consensus 1 m~-------~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~---~~---------~~~~~i~~~~~~Y~g~~~~~~~~~~ 61 (502) |+ ..+-.++..-. .+...+.. ++ .-+++++.+.++-+.+. T Consensus 1 ~~~~~~~~p~~~~~~~~~~~-------------~~~~~~~~g~~~~D~~lr~~gg~~~~~~~l~~~m~e~D~-------- 59 (446) T protein:vir:98 1 MNMEVRNAPTPAIRRRTIYA-------------MEHLGLATSYLSEDGGYKRAGKPTYQQLSAWDEAAQTEP-------- 59 (446) T ss_pred CcccccCCCchhhhhhhhhc-------------cccchhhcccCCcchHhhhcCCChHHHHHHHHHHHhcch-------- Confidence 21 11111111100 00001000 00 00011111111111100 Q ss_pred CCccccccceecchHHHHHHHHhhhhhcCcceEeeCCHHHHHHHHHHHhhccHHHHHHHHHHHHhhcCCEEEEEEEeC-- Q lcl|NC_012753. 62 NGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEVADAFINETLKNDKFSKNFERYLESCLALGGLAMRPYIDG-- 139 (502) Q Consensus 62 ~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~~~e~l~~~~~~~~f~~~~~~~~~~~~~~G~~~~~~~~d~-- 139 (502) -+ +-..++-..-+++-+.+|.-.+++..+++.+++++-.|...+.. +..|..+|-++.=+.|.. T Consensus 60 ------------~v-~s~l~~Rk~av~~~~w~V~p~~~~~a~~v~~~l~~~~~~~~~~~-~ldai~~G~s~~Eivw~~~~ 125 (446) T protein:vir:98 60 ------------II-AQGLDSIALSVLNKVGPYQHGDKRIKKFIDDQLRNRAKTWISHC-VKSIMTYGFSLSEQIYAHGA 125 (446) T ss_pred ------------HH-HHHHHHHHHHhhcCCceecCccHHHHHHHHHHHhhcCchhHHHH-HHHHHhhCceeeeEEEeecc Confidence 01 11222333334455666666778889999999988777666655 567888998887777752 Q ss_pred C-ceEEEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccc Q lcl|NC_012753. 140 D-QIRVSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLST 218 (502) Q Consensus 140 ~-~~~i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~ 218 (502) + ....++.+.-. ++.. ....|+ + .. .... + .+..-.+. .+. .+...|+.. T Consensus 126 g~~~p~~~~d~~~----~~~~---~~~r~~--~-~~---~~~~---~-----~~~~~~~~--~~~------~~~~~~~~~ 176 (446) T protein:vir:98 126 RDNMPATVLDDIV----NYHP---LQVMLI--A-ND---NGRI---V-----DGDTVTAS--QYK------SGYWVPLPP 176 (446) T ss_pred cccccchhhcccc----cccc---ccceee--e-cc---CCcc---c-----cccccchh--hcc------cccccCccc Confidence 2 11111111100 0000 000010 0 00 0000 0 00000000 000 000000000 Q ss_pred c-ccC--CCcceeecC--CCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHh-hccceeeechH Q lcl|NC_012753. 219 L-YED--LEETVTLNG--LTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVK-MGQRRVAVPTQ 292 (502) Q Consensus 219 ~-~~~--l~~~~~~~~--~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~-~~~~~i~v~~~ 292 (502) . +.+ +.....-.+ ++..-|+.++. ....++|+|.|.+..+--..---+...-.++.=++ .|....+ T Consensus 177 ~~~~~~~~~~~~~g~~~~iP~~kfi~~~~----~~~~~~p~G~gLlr~~~w~~~fK~~~~~~w~~f~E~yG~P~~v---- 248 (446) T protein:vir:98 177 YRIGDPPKKVDVVGSHVRLPSHKRLFINY----NTKGNNPWGTSCLTSVLDYSIFKRAFRDMMLIALDRYGTPLIY---- 248 (446) T ss_pred chhhhhhhhcccCcccccccccceEEEEe----cCCCCCccccchHHHHHHHHHHHHhhHHHHHHHHhHcCCceeE---- Confidence 0 000 000000001 11112333332 22456799999877665433322322223332222 2222111 Q ss_pred HhccCCCCCCcccCcccc---ccc--cchhhccccCCCCc-----------cccceeeeccccc-hHHHHHHHHHHHHHH Q lcl|NC_012753. 293 MIKTEYDTNGEKVTVKRE---FET--GHNVYEQFDSGDMD-----------KGIGITDLTTDIR-SDDYIKAINKGLSLF 355 (502) Q Consensus 293 ~l~~~~~~~g~~~~~~~~---~~~--~~~~~~~~~~~~~~-----------~~~~i~~~~~~ir-~e~~~~~l~~~l~~i 355 (502) . .++.+........+. ... ......++.....+ .+.-|+.++..-. ...|...++.+=++| T Consensus 249 -G-kyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~P~g~eie~~ea~~~~~~~~~~~i~~~d~~I 326 (446) T protein:vir:98 249 -V-IVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSKEQPVQVGALTTGNNFSDSFERAISLCDNNM 326 (446) T ss_pred -E-eecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccCCCCceEEeeccccCChhhHHHHHHHHHHHH Confidence 1 122111100000000 000 00122222111111 0112333322211 112444455554555 Q ss_pred HHhcCCChhhcccccc-ccccHH-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHhhcccCCCcccc-cceEEE Q lcl|NC_012753. 356 EMQLGVSTGMFSFDGK-SMKTAT-EVVSEQSDTYQMRNSIATLVEKSLK-ELVISILELAKVYNLYTGEIPTM-DEVSVD 431 (502) Q Consensus 356 ~~~~g~s~~~~~~~~~-~~~tAt-ei~~~~~~l~~~~~~~~~~~~~~l~-~l~~~il~~~~~~~~~~~~~~~~-~~i~v~ 431 (502) +..+.-..-+++...+ +.+.|. ++. ..-....++.-.+.+...|+ +|++-++.+ |..+...... ...-+. T Consensus 327 skaiLg~~Ltl~~~~~~~GS~ala~vh--~~V~~d~~~aDa~~i~~tln~~Li~~l~~l----Nf~~~~~~~~~~~~~~~ 400 (446) T protein:vir:98 327 LMGMGIPNLLVQNRETTFGTGRASEIQ--LELFDGKINSIFDTVIHAFTEQVIGNLIRL----NFDPALYPLASNTGYIT 400 (446) T ss_pred HHHHhcccccccccccccchhhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh----CCCccccccccccccce Confidence 5543111111221111 111121 121 11122334445556667775 677766644 2221111110 011123 Q ss_pred eCCCccCCHHHHHHHHHHHHhcCCC-C-HHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 432 LDDGVFTDRNAEFDYWSKMVAAGFA-P-KTMAIEKTLNVTKEQAQE 475 (502) Q Consensus 432 f~d~i~~d~~~~~~~~~~~~~~Gi~-S-~et~l~~~~~~~deea~~ 475 (502) |...-+.|..+.++.+.+++..|++ + .+.++.+.+|+.+.+-.- T Consensus 401 ~~~~e~eDl~~~a~~~~~L~~~G~~~p~~~~~ire~~giP~~~~~~ 446 (446) T protein:vir:98 401 RLPGRATDLAALVEAIKQMHDMGFLVDGDKDHIRSITGLPDAISST 446 (446) T ss_pred eccCChhhHHHHHHHHHHHHhCCccccccHHHHHHHhCcCCCCCCC Confidence 4444567888889999999999974 3 356677777775411000 No 262 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=20.81 E-value=2.7 Score=18.21 Aligned_cols=383 Identities=12% Similarity=0.065 Sum_probs=145.3 Q ss_pred hhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHHHHHhhhhhcCcceEeeCCHH-- Q lcl|NC_012753. 23 LNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTASKKVASLVFNEQATIRVDNEV-- 100 (502) Q Consensus 23 l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv~~~a~~l~~ep~~i~~~d~~-- 100 (502) +..+. .. . |....+......+-. .......+.....|+..|+-+-+=|..+.-.+.. T Consensus 1 ~~~~~------~~-------------~-g~~~~~~~~~~~~~~-~~~~~~~~~V~acV~~Ia~~iA~lpl~l~~~~~~~~ 59 (723) T protein:vir:94 1 MTTFP------SG-------------A-GGWNAWSADSVFGNG-AKGWSNSAVAYRCISMLANNAASVDLVVRGPDGELD 59 (723) T ss_pred Ccccc------cC-------------C-Ccccccccccccccc-HHHHhhhHHHHHHHHHHHHhhccceeEEEcCCCccc Confidence 00000 00 0 000011111111100 0111222333455666666665556554322211 Q ss_pred HHHHHHHHHhh--cc---HHHHHHHHHHHHhhcCCEEEEEEEeC----Cce-EEEEEcCCeEEEEEEcCCCeEEEEEEEE Q lcl|NC_012753. 101 ADAFINETLKN--DK---FSKNFERYLESCLALGGLAMRPYIDG----DQI-RVSFVQATVFFPLQANTQDVSSAAIVTK 170 (502) Q Consensus 101 ~~e~l~~~~~~--~~---f~~~~~~~~~~~~~~G~~~~~~~~d~----~~~-~i~~v~~~~~~Pi~~d~~~~~~~~~~~~ 170 (502) ...-+-.+|.. |. ...-....+...+..|.+|+.+..++ |.+ .+..+++....++..+.+...... T Consensus 60 ~~~~l~~lL~~~PN~~~t~~~f~~~~~~~lll~Gnay~~i~r~~r~~~g~p~~l~~l~~~~~~v~~~~~~~~~~~~---- 135 (723) T protein:vir:94 60 ELHPLSQLWNVMPNRAMPAQVLKALSMTRLQLDGQCHLWLNYNGRTPAGVPDEIWYVYDRVTTIVATRAADAVPQA---- 135 (723) T ss_pred hhhHHHHHHhhCCCCCCCHHHHHHHHHHHHhhcCCeEEEEEecCCccccceeEEEEecCcceEEeecCCCccceee---- Confidence 11123334431 22 22233334555667788888776543 222 344444443333222211110000 Q ss_pred EEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCCCcceeecCCCcceEEEecCCccccccc Q lcl|NC_012753. 171 STKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDLEETVTLNGLTRPLFTYLKPPGMNNKDI 250 (502) Q Consensus 171 ~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~~~f~~~~~~~~n~~~~ 250 (502) +.. .|.+. . .+ |..+++. +. -.++|+.+. .. T Consensus 136 -----------~~~---------~y~~~----~-~~----G~~~~~~------~~----------dIiHir~~~----~~ 166 (723) T protein:vir:94 136 -----------QII---------GYVIE----R-TD----GVRVPVL------AD----------EMLWLRFSD----PY 166 (723) T ss_pred -----------eee---------EEEEE----e-cC----ceeEEec------cc----------ceEEecCCC----CC Confidence 000 01110 0 01 2222110 00 123344221 12 Q ss_pred cCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCcccC-cccccccc----chhhccccCCC Q lcl|NC_012753. 251 NSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGEKVT-VKREFETG----HNVYEQFDSGD 325 (502) Q Consensus 251 ~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~~-~~~~~~~~----~~~~~~~~~~~ 325 (502) +...|+|.+.-+...|.....+-.--.+-|..|...=.| |.. +..+..... ....|... .+....+-... T Consensus 167 dg~~G~Spi~~a~~~i~~~~aa~~~~~~~f~NG~~p~gi----L~~-~~l~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g 241 (723) T protein:vir:94 167 DPLAVMAPWKAARAAVDADFYAATWQRQSFKNGARPGGV----VNL-GDMDEQTFTKTVAAFRSQVEGVQNAGRHLLIAG 241 (723) T ss_pred CCcccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcceE----EEc-CCCCHHHHHHHHHHHHHHhhchhhcCcceeecc Confidence 334689988877766664443322222334554332111 321 110000000 00000000 01111111100 Q ss_pred C-------ccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_012753. 326 M-------DKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQSDTYQMRNSIATLVE 398 (502) Q Consensus 326 ~-------~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~~~l~~~~~~~~~~~~ 398 (502) . +.+.-++.++.....-++.+..+....+|+...|++|..++..........+.++.+ ..++.-..+.++ T Consensus 242 ~~~~~~vl~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st~sN~e~~~~~f~---~~tL~P~~~~ie 318 (723) T protein:vir:94 242 QGSDGGAAGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGSTYENQAEAKAAVW---TETLIPQMEVMA 318 (723) T ss_pred cccccccccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCCcccHHHHHHHHH---HHHHHHHHHHHH Confidence 0 112234455555556678888888899999999999998865432221111111111 122222222222 Q ss_pred HHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCC--ccCCHHHHHHHHHHHHhcCCCCHHHHHHhc--CCCC--HHH Q lcl|NC_012753. 399 KSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDG--VFTDRNAEFDYWSKMVAAGFAPKTMAIEKT--LNVT--KEQ 472 (502) Q Consensus 399 ~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~--i~~d~~~~~~~~~~~~~~Gi~S~et~l~~~--~~~~--dee 472 (502) .+|.. .++.. ....+.++|+.. +-.|..+.++....++.+|+++.-++++.. +|+. +.+ T Consensus 319 ~~ln~------------~Ll~~---~g~~~~~~f~~~~lLr~D~~~r~~~~~~~v~~G~~T~NE~R~~lglpPi~gGd~~ 383 (723) T protein:vir:94 319 SITDL------------QLLPD---IGWTVEWDFNSVPALQEDLEAQAGRNQGYLVNDVLMVDEVRATIGLDPLPGGIGQ 383 (723) T ss_pred HHHhH------------hhccc---ccCceEEeecchhhhhcCHHHHHHHHHHHHhCCCcCHHHHHHHhCCCCCCCCccc Confidence 22222 11111 123467778753 457888999999999999999999876653 2332 100 Q ss_pred ------------------HHHH-HHHHHH--hh-hcccCCCC-CccccCCCCC Q lcl|NC_012753. 473 ------------------AQEI-YQKIND--ET-MVSTDSFR-TSEEVDIYGE 502 (502) Q Consensus 473 ------------------a~~e-l~ri~~--E~-~~~~~~~~-~~~~~~~~g~ 502 (502) +.+| -.|+.+ |. +...|... .....-++|+ T Consensus 384 ~~~~p~~~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~ 436 (723) T protein:vir:94 384 MTLTPYRAQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRATTVLHH 436 (723) T ss_pred ceeccccccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCCCCCCC Confidence 0001 011110 00 00001000 0111112222 No 263 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=20.53 E-value=2.8 Score=18.17 Aligned_cols=355 Identities=14% Similarity=0.080 Sum_probs=118.3 Q ss_pred CChhHHHHHHHHHHhhcccccchhhhhccccccCCHHHHHHHHHHHHHhcCCCCccccccCCCccccccceecchHHHHH Q lcl|NC_012753. 1 MGIIQTIKNFIKRSNYVITNQSLNSITDHPKIAISPEEYNRIMDNLRYFAGDFDSVTYRDSNGSQVKRDFNHLPIGRTAS 80 (502) Q Consensus 1 m~~~~~ik~~i~~~~~~~~~~~l~~i~~~~~~~~~~~~~~~i~~~~~~Y~g~~~~~~~~~~~~~~~~~~~~~~n~~k~iv 80 (502) |+||.+++++.+........... .|.|+. ..+ ....-..+| T Consensus 1 M~if~~~~~~~~~~~~~~~~~~~------------------------~~~~~~--~~~-------------~~~~v~~~v 41 (378) T protein:vir:94 1 MNLFGKVVSFSRGKLNNDTQRVT------------------------AWQNEA--VEY-------------TSAFVTNIH 41 (378) T ss_pred CchhHHhHhhhhcccccCcceee------------------------eeecch--hhh-------------hhHHHHHHH Confidence 99999999875432111110000 011110 000 000112344 Q ss_pred HHHhhhhhcCcceE-eeC--C-------HHHHHHHHHHHhh--c---cHHHHHHHHHHHHhhcCCEEEEEEEeC--CceE Q lcl|NC_012753. 81 KKVASLVFNEQATI-RVD--N-------EVADAFINETLKN--D---KFSKNFERYLESCLALGGLAMRPYIDG--DQIR 143 (502) Q Consensus 81 ~~~a~~l~~ep~~i-~~~--d-------~~~~e~l~~~~~~--~---~f~~~~~~~~~~~~~~G~~~~~~~~d~--~~~~ 143 (502) +..|+-+..=|+.+ .-. + .....-|..+|.. | ....-....+...+..|.+|+.+.+++ |.+. T Consensus 42 ~~Ia~~iA~lp~~~~~~~~~~~~~~~~~~~~~~~l~~lLn~~PN~~~t~~~f~~~~~~~lll~Gnayi~~i~~~~~g~~~ 121 (378) T protein:vir:94 42 NKIANEITKVEFNHVKYKKSDVGSDTLISMAGSDLDEVLNWSSKGERNSMEFWQKVIKKLLTTRYIDLYPIFDSETGELL 121 (378) T ss_pred HHHHHhHhhCceeeeeecccccccccccccccchHHHHHhhcCCCCCCHHHHHHHHHHHHhhcCCeEEEEEeeCCCCcEE Confidence 44444444444321 110 0 0011223333331 1 122233335666667788876654432 2221 Q ss_pred EEEEcCCeEEEEEEcCCCeEEEEEEEEEEEeeCCCceEEEEEEEEEEeCCeEEEEEEEEecCCccccCceeeccccccCC Q lcl|NC_012753. 144 VSFVQATVFFPLQANTQDVSSAAIVTKSTKTEGQKVKYYSLIEFHEWNKETYTISNELYESESKTIIGQRVPLSTLYEDL 223 (502) Q Consensus 144 i~~v~~~~~~Pi~~d~~~~~~~~~~~~~~~~~~~~~~~yt~~E~h~~~~~~~~I~~~l~~~~~~~~lG~~v~l~~~~~~l 223 (502) .. +| .+ + ++.|..-+ -..+.+..+.. .+.. ++......+ T Consensus 122 ~~-------~~--~~-~------------------~~~~~~~d-------vih~~~~~~~~----~~~~--~~~~~~~~~ 160 (378) T protein:vir:94 122 DL-------LF--AN-D------------------KKEYKPEE-------LVRLTSPFYIN----EDTS--ILDNALASI 160 (378) T ss_pred EE-------EE--ec-C------------------cEEechhc-------eeeecCcCCcc----cchh--HHHHHHHHH Confidence 00 00 00 0 00010000 00011000000 0000 000000000 Q ss_pred CcceeecCCCcceEEEecCCccccccccCcCCcchhhhHHHHHHHHHHHHHHHHHHHhhccceeeechHHhccCCCCCCc Q lcl|NC_012753. 224 EETVTLNGLTRPLFTYLKPPGMNNKDINSPLGLSIFDNAKTTMDFINTTYDEFMWEVKMGQRRVAVPTQMIKTEYDTNGE 303 (502) Q Consensus 224 ~~~~~~~~~~~~~f~~~~~~~~n~~~~~~p~G~S~~~~~~~lid~ld~~~S~~~~~~~~~~~~i~v~~~~l~~~~~~~g~ 303 (502) ... ...+ .+.+ +++.+. .++. +..+.+.+.+...|...... ....++.| | T Consensus 161 ~~~-~~~~--~~~g-~l~~~~----------~l~~-~~~~~~~e~~~~~~~~~~~~--~n~~~~~v----l--------- 210 (378) T protein:vir:94 161 QTK-LEQG--KLRG-LLKINA----------FLDI-DNTQEYREKALATIKNMQEG--SSYNGLTP----V--------- 210 (378) T ss_pred HHH-HhhC--Cccc-ceeeCC----------cCCH-HHHHHHHHHHHHHHHHhhcc--ccccccee----c--------- Confidence 000 0001 1111 111110 0110 11122222222222211100 00001221 1 Q ss_pred ccCccccccccchhhccccCCCCccccceeeeccccchHHHHHHHHHHHHHHHHhcCCChhhccccccccccHHHHHHHH Q lcl|NC_012753. 304 KVTVKREFETGHNVYEQFDSGDMDKGIGITDLTTDIRSDDYIKAINKGLSLFEMQLGVSTGMFSFDGKSMKTATEVVSEQ 383 (502) Q Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~~~~tAtei~~~~ 383 (502) +.+.-++.++......+ ...++.+..+|+...|+||..+.. |+.|-.+. T Consensus 211 -----------------------~~g~~~~~l~~~~~~~~-~~~~~~~~~~Ia~~fgvPp~~l~g------~~~e~~~~- 259 (378) T protein:vir:94 211 -----------------------DNKTEIVELKKDYSVLN-KDEIDLIKSELLTGYFMNENILLG------TATQEQQI- 259 (378) T ss_pred -----------------------cCCceEEEccCChHHhh-HHHHHHHHHHHHHHhCCCHHHhcC------CchHHHHH- Confidence 00111333322222222 244556677899999999988742 11121111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccceEEEeCCCccCCHHHHHHHHHHHHhcCCCCHHHHHH Q lcl|NC_012753. 384 SDTYQMRNSIATLVEKSLKELVISILELAKVYNLYTGEIPTMDEVSVDLDDGVFTDRNAEFDYWSKMVAAGFAPKTMAIE 463 (502) Q Consensus 384 ~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~~~~~~~~~Gi~S~et~l~ 463 (502) .-...++.-..+.++.+|..-+-.-..... +-......++.++++.-...|..+.++...+++.+|+++.-++++ T Consensus 260 ~f~~~tl~P~~~~ie~~l~~~Ll~~~e~~~-----g~~~~~~~~~~f~~~~l~~~d~~~~~e~~~~~~~~G~~t~NE~R~ 334 (378) T protein:vir:94 260 YFYNSTIIPLLIQLEKELTYKLISTNRRRV-----VKGNLYYERIIVDNQLFKFATLKELIDLYHENINGPIFTQNQLLV 334 (378) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCChhHhhh-----hhhhcccceeEeecchhhhcCHHHHHHHHHHHHhCCCcCHHHHHH Confidence 001122222222233333221000000000 000111234566667777889999999999999999999998765 Q ss_pred hcCCCCH-HHHH-----HHHHHHHHhhhcccCCCCCccccCCCCC Q lcl|NC_012753. 464 KTLNVTK-EQAQ-----EIYQKINDETMVSTDSFRTSEEVDIYGE 502 (502) Q Consensus 464 ~~~~~~d-eea~-----~el~ri~~E~~~~~~~~~~~~~~~~~g~ 502 (502) . .|+.. +.-+ ..+..+..-............+.+-.+| T Consensus 335 ~-~g~~p~~ggd~~~~~~n~~~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 335 K-MGEQPIEGGDVYIANLNAVAVKNLSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred H-hCCCCCCCCCeeeecccccchhcchhcccccCCCCCCCCCCCC Confidence 4 34322 0000 1111111111111111112223333333 Done!